中科院提出高效训练的视频基础模型

【推荐理由】本文提出了一个高效训练视觉大模型的方法，仅使用公共资源在 32 个 A100 GPU 上进行 6 天的预训练，从头构建的 ViT-L/16 在各种视频任务上实现了最先进的性能。

Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Kunchang Li, Yali Wang, Yizhuo Li, Yi Wang, Yinan He, Limin Wang, Yu Qiao

[Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences]

【论文链接】https://arxiv.org/pdf/2303.16058.pdf

【项目链接】https://github.com/OpenGVLab/unmasked_teacher

【摘要】由于计算成本高和数据稀缺，视频基础模型（VFMs）受到了有限的探索。以前的VFMs依赖于图像基础模型（IFMs），面临着转移到视频领域的挑战。虽然VideoMAE已经从有限的数据中训练出了一个强大的ViT，但其低级重建会导致收敛困难，并与高级跨模态对齐产生冲突。本文提出了一种训练高效的时间敏感VFMs方法，结合了现有方法的优势。为了增加数据效率，作者屏蔽了大部分低语义视频令牌，但是选择性地将未屏蔽的令牌与IFM对齐，IFM作为未屏蔽的教师（UMT）。通过提供语义指导，该方法使得收敛更快，同时也更加友好跨模态。通过渐进式预训练框架，此模型可以处理各种任务，包括与场景相关的、与时间相关的和复杂的视频语言理解。仅使用公共来源进行预训练，在32个A100 GPU上进行6天的零起点构建ViT-L/16，在各种视频任务上实现了最先进的性能。

中科院提出高效训练的视频基础模型

ufabet มีเกมให้เลือกเล่นมากมาย: เกมเดิมพันหลากหลาย ครบทุกค่ายดัง

tornado crypto mixer Discover the power of privacy with TornadoCash! Learn how this decentralized mixer ensures your transactions remain confidential.

ดูบอลสด Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

ดูบอลสด Pretty! This has been a really wonderful post. Many thanks for providing these details.

ดูบอลสด Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

Obrazy Sztuka Nowoczesna Thank you for this wonderful contribution to the topic. Your ability to explain complex ideas simply is admirable.

ufabet Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

ufabet You’re so awesome! I don’t believe I have read a single thing like that before. So great to find someone with some original thoughts on this topic. Really.. thank you for starting this up. This website is something that is needed on the internet, someone with a little originality!

ufabet Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

中科院提出高效训练的视频基础模型

n8n实战：Webhook、条件判断与API集成详解

谷歌太壕了！编程Agent大招至简：开源且免费，百万上下文、多模态、MCP全支持

百度文心快码AI IDE上线，首创设计稿一键转代码、支持MCP

老黄新鲜一刀，RTX 5050正式官宣

国产GPU历史性时刻！摩尔线程、沐曦同日获IPO受理

00后投身具身智能创业，剑指机器人界「Model 3」！已推出21个自由度灵巧手

监督学习也能从错误中学习反思？！清华英伟达联合提出隐式负向策略爆炸提升数学能力

AI也会闹情绪了！Gemini代码调试不成功直接摆烂，马斯克都来围观

百度文心快码AI IDE上线，首创设计稿一键转代码、支持MCP

曝苹果拟收购Perplexity AI，人才一并拿走