Meta AI等提出超级令牌视频transformer用于视频理解

【推荐理由】文章提出了一个 Supertoken Video Transformer大模型结构用于视频理解，提高了MAE预训练的ViT-B和ViT-L的准确性，同时需要更少的计算。

SVT: Supertoken Video Transformer for Efficient Video Understanding

Chenbin Pan, Rui Hou, Hanchao Yu, Qifan Wang, Senem Velipasalar, Madian Khabsa

[Syracuse University & Meta AI]

【论文链接】https://arxiv.org/pdf/2304.00325.pdf

【摘要】现有的视频转换器通过从开始到结束处理具有固定分辨率的视频或将池化和降采样策略纳入其中，通过整个网络处理整个视频内容而没有特别处理大量冗余信息。在本文中，作者提出了一个超级令牌视频转换器（SVT），它包括一个语义池化模块（SPM），以根据它们的语义沿着视觉变换器的深度聚合潜在表示，从而减少视频输入中固有的冗余。定性结果表明，该方法可以通过合并具有相似语义的潜在表示来有效减少冗余，并因此增加下游任务的显著信息比例。定量上，该方法在Kinectics和SomethingSomething-V2基准测试中显著减少计算量的同时，提高了ViT和MViT的性能。具体而言，通过SPM，在Kinectics-400基准测试中，作者将MAE预训练的ViT-B和ViT-L的准确性分别提高了1.5％和0.2％，同时GFLOPs减少了33％和55％，在Kinectics-400和Something-Something-V2上，作者将MViTv2-B的准确性分别提高了0.2％和0.3％，同时GFLOPs减少了22％。

Meta AI等提出超级令牌视频transformer用于视频理解

ufabet มีเกมให้เลือกเล่นมากมาย: เกมเดิมพันหลากหลาย ครบทุกค่ายดัง

tornado crypto mixer Discover the power of privacy with TornadoCash! Learn how this decentralized mixer ensures your transactions remain confidential.

ดูบอลสด Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

ดูบอลสด Pretty! This has been a really wonderful post. Many thanks for providing these details.

ดูบอลสด Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

Obrazy Sztuka Nowoczesna Thank you for this wonderful contribution to the topic. Your ability to explain complex ideas simply is admirable.

ufabet Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

ufabet You’re so awesome! I don’t believe I have read a single thing like that before. So great to find someone with some original thoughts on this topic. Really.. thank you for starting this up. This website is something that is needed on the internet, someone with a little originality!

ufabet Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

Meta AI等提出超级令牌视频transformer用于视频理解

n8n实战：Webhook、条件判断与API集成详解

谷歌太壕了！编程Agent大招至简：开源且免费，百万上下文、多模态、MCP全支持

老黄新鲜一刀，RTX 5050正式官宣

国产GPU历史性时刻！摩尔线程、沐曦同日获IPO受理

一张小卡片敢卖999？原来是智能体AI硬件

让AI主动干活，给你找服务，鸿蒙“6”啊

这个AI能救命！提前6个月发现胃癌病灶，突破医学影像认知，达摩院做成了

科大讯飞“AI+教育”再提速：学习机功能升级引领行业发展

7B小模型超越DeepSeek-R1：模仿人类教师，弱模型也能教出强推理LLM | Transformer作者团队

多模态AI黑马刷榜后再造神器：一个产品搞定图片视频播客生成，自带百种特效，大牛梅涛团队出品