韩国科学技术院 | 利用记忆高效的双向Transformer实现长视频的端到端生成建模

【推荐理由】自回归Transformer在视频生成方面取得了显著的成功。然而，由于自注意力的二次复杂度，Transformer被禁止直接学习视频中的长期依赖关系。本文所提出的Transformer实现了编码和解码的线性时间复杂度。

Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers

Jaehoon Yoo, Semin Kim, Doyup Lee, Chiheon Kim, Seunghoon Hong

[KAIST, Kakao Brain]

【论文链接】https://arxiv.org/pdf/2303.11251.pdf

【项目链接】https://sites.google.com/view/mebt-cvpr2023

【摘要】自回归Transformer在视频生成方面表现出了非凡的成功。然而，由于自注意力的二次复杂度，Transformer被禁止直接学习视频中的长期依赖关系，同时由于自回归过程，存在慢推断时间和错误传播的问题。在本文中，我们提出了一种记忆高效的双向Transformer(MeBT)，用于端到端地学习视频中的长期依赖关系并实现快速推断。基于双向Transformer的最新进展，我们的方法学习从部分观察到的补丁中并行解码整个时空体积的视频。通过将可观察上下文令牌投影到固定数量的潜在令牌中，并通过交叉注意力对其进行解码，我们的方法在编码和解码中均实现了线性时间复杂度。在双向建模和线性复杂度的支持下，我们的方法在生成中等长度视频的质量和速度方面都比自回归Transformer有了显著的改进。

韩国科学技术院 | 利用记忆高效的双向Transformer实现长视频的端到端生成建模

ufabet มีเกมให้เลือกเล่นมากมาย: เกมเดิมพันหลากหลาย ครบทุกค่ายดัง

tornado crypto mixer Discover the power of privacy with TornadoCash! Learn how this decentralized mixer ensures your transactions remain confidential.

ดูบอลสด Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

ดูบอลสด Pretty! This has been a really wonderful post. Many thanks for providing these details.

ดูบอลสด Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

Obrazy Sztuka Nowoczesna Thank you for this wonderful contribution to the topic. Your ability to explain complex ideas simply is admirable.

ufabet Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

ufabet You’re so awesome! I don’t believe I have read a single thing like that before. So great to find someone with some original thoughts on this topic. Really.. thank you for starting this up. This website is something that is needed on the internet, someone with a little originality!

ufabet Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

韩国科学技术院 | 利用记忆高效的双向Transformer实现长视频的端到端生成建模

潞晨尤洋：日常办公没必要上私有模型，这三类企业才需要 | MEET2026

世界模型和具身大脑最新突破：90%生成数据，VLA性能暴涨300%｜开源

DeepSeek-V3.2系列开源，性能直接对标Gemini-3.0-Pro

SpaceX估值8000亿美元超OpenAI，IPO就在明年

“豆包手机”在二手市场价格都翻倍了……

6小时告破30年数学难题，亚里士多德一夜成名

AI也会被DDL逼疯！正经研究发现：压力越大，AI越危险

完整议程｜12.10-11第二十届中国IDC产业年度大典北京·首钢园启动

免费国产Banana真香！我想把PS给卸载了

deepseek当选网易有道词典2025年度词汇，全年搜索量超867万次