浙江大学 | 使用现成的图像扩散模型进行零样本视频编辑

【推荐理由】本文首次将大规模的文本到图像扩散模型用于零样本视频编辑领域，不需要对任何视频进行训练。

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

Wen Wang, Kangyang Xie, Zide Liu, Hao Chen, Yue Cao, Xinlong Wang, Chunhua Shen

[Zhejiang University & Beijing Academy of Artificial Intelligence]

【论文链接】https://arxiv.org/pdf/2303.17599.pdf

【项目链接】https://github.com/baaivision/vid2vid-zero

【摘要】大规模的文本到图像扩散模型在图像生成和编辑方面取得了前所未有的成功。然而，如何将这种成功扩展到视频编辑领域尚不清楚。最近的视频编辑初步尝试需要大量的文本到视频数据和计算资源进行训练，这通常是不可访问的。在这项工作中，作者提出了 vid2vid-zero，一种简单而有效的零样本视频编辑方法。 vid2vid-zero 利用现成的图像扩散模型，并且不需要对任何视频进行训练。该方法的核心是一个空文本反演模块，用于文本到视频对齐，一个跨帧建模模块，用于时态一致性，以及一个空间正则化模块，用于保持对原始视频的真实性。在没有任何训练的情况下，作者利用注意力机制的动态特性，在测试时实现双向时态建模。实验和分析显示，该方法在编辑属性、主题、地点等方面在现实世界的视频中展现了有希望的结果。

浙江大学 | 使用现成的图像扩散模型进行零样本视频编辑

ufabet มีเกมให้เลือกเล่นมากมาย: เกมเดิมพันหลากหลาย ครบทุกค่ายดัง

tornado crypto mixer Discover the power of privacy with TornadoCash! Learn how this decentralized mixer ensures your transactions remain confidential.

ดูบอลสด Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

ดูบอลสด Pretty! This has been a really wonderful post. Many thanks for providing these details.

ดูบอลสด Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

Obrazy Sztuka Nowoczesna Thank you for this wonderful contribution to the topic. Your ability to explain complex ideas simply is admirable.

ufabet Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

ufabet You’re so awesome! I don’t believe I have read a single thing like that before. So great to find someone with some original thoughts on this topic. Really.. thank you for starting this up. This website is something that is needed on the internet, someone with a little originality!

ufabet Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

浙江大学 | 使用现成的图像扩散模型进行零样本视频编辑

潞晨尤洋：日常办公没必要上私有模型，这三类企业才需要 | MEET2026

世界模型和具身大脑最新突破：90%生成数据，VLA性能暴涨300%｜开源

DeepSeek-V3.2系列开源，性能直接对标Gemini-3.0-Pro

SpaceX估值8000亿美元超OpenAI，IPO就在明年

“豆包手机”在二手市场价格都翻倍了……

6小时告破30年数学难题，亚里士多德一夜成名

AI也会被DDL逼疯！正经研究发现：压力越大，AI越危险

完整议程｜12.10-11第二十届中国IDC产业年度大典北京·首钢园启动

免费国产Banana真香！我想把PS给卸载了

deepseek当选网易有道词典2025年度词汇，全年搜索量超867万次