NAVER AI实验室提出SeiT，将视觉大模型的训练存储较低到1%

632次阅读

为了得到通用和精确的视觉大模型，训练图像数据的规模已经发展到了十亿级别，促使研究研究人员必须在训练过程中配置大量的存储资源。然而，这一条件并不实际，因此一系列存储优化训练方法（storage-efficient training methods）被研究人员提出，但它们大多会造成模型性能的退化。在本文中，作者提出一个新的策略，可以将视觉大模型的训练存储压缩到每个实例仅需1024个Token，且相较于原始像素，仅需要1%的存储资源。作者进一步提出了Stem-adaptor模块和Token增强策略提升训练效率。实验证明，本文提出的方法在预训练、持续学习多个设定下都能以低存储损耗的条件训练得到更好的视觉大模型。

论文地址：https://arxiv.org/pdf/2303.11114.pdf

开源代码：https://arxiv.org/pdf/2303.11114.pdf

作者：Song Park Sanghyuk Chun Byeongho Heo Wonjae Kim Sangdoo Yun

NAVER AI实验室提出SeiT，将视觉大模型的训练存储较低到1%

正文完

可以使用微信扫码关注公众号（ID：xzluomor）

AI AR 开源

发表至：智源

2023-03-21

Nature 速递：热带森林正接近临界温度阈值

0.2美元微调就能让ChatGPT彻底破防！普林斯顿、斯坦福发布LLM风险预警：普通用户微调也影响LLM安全性

腾讯视频，你怎么回事儿？

ChatGPT文明模拟器再上线！一键穿越回火山爆发当天的庞贝古城

Vol. 84 数码荔枝: 正版软件生态、独立开发与远程办公

港科大与加州大学最新机器人研究：利用触觉信息不用看就可以旋转

评论（没有评论）

文章搜索

最新评论

ดูบอลสด Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

ดูบอลสด Pretty! This has been a really wonderful post. Many thanks for providing these details.

ดูบอลสด Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

Obrazy Sztuka Nowoczesna Thank you for this wonderful contribution to the topic. Your ability to explain complex ideas simply is admirable.

ufabet Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

ufabet You’re so awesome! I don’t believe I have read a single thing like that before. So great to find someone with some original thoughts on this topic. Really.. thank you for starting this up. This website is something that is needed on the internet, someone with a little originality!