Meta AI | The Effectiveness of MAE Pre-Pretraining for Billion-Scale Pretraining

Paper title: The effectiveness of MAE pre-pretraining for billion-scale pretraining

Paper link: https://arxiv.org/pdf/2303.13496.pdf

Authors: Mannat Singh, Quentin Duval, Kalyan Vasudev Alwala, et al.

Affiliation: Meta AI

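This paper revisits the standard pretrain-then-finetune paradigm used in computer vision for visual recognition tasks. Typically, state-of-the-art foundation models are pretrained on large-scale (weakly) supervised datasets with billions of images. We introduce an additional pre-pretraining stage that is simple and uses the self-supervised MAE technique to initialize the model. While MAE has only been shown to scale with model size, we find that it also scales with the size of the training dataset. Our MAE-based pre-pretraining therefore scales with both model and data size, which makes it applicable to training foundation models. Pre-pretraining consistently improves both model convergence and downstream transfer performance across a range of model scales (millions to billions of parameters) and dataset sizes (millions to billions of images). We measure the effectiveness of pre-pretraining on 10 different visual recognition tasks spanning image classification, video recognition, object detection, low-shot classification, and zero-shot recognition. Our largest model achieves new state-of-the-art results on iNaturalist-18 (91.3%), 1-shot ImageNet-1k (62.1%), and Food-101 (96.0%). Our study shows that model initialization plays a significant role, even for web-scale pretraining with billions of images.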
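As a rough illustration of the staging described above, the sketch below lists the three stages in order and shows MAE-style random patch masking. It is not the authors' code. The stage identifiers, the helper names `init_from` and `random_patch_mask`, the 224x224 image with 16x16 patches, and the 0.75 mask ratio (the default reported in the original MAE paper, not stated in this abstract) are all assumptions chosen for the example.

```python
import random

# Stage order as described in the abstract; the identifiers are illustrative.
STAGES = ["mae_pre_pretraining", "weakly_supervised_pretraining", "finetuning"]


def init_from(stage):
    """Return the stage whose weights initialize `stage`.

    The first stage has no predecessor (random initialization), so None
    is returned for it.
    """
    i = STAGES.index(stage)
    return STAGES[i - 1] if i > 0 else None


def random_patch_mask(num_patches, mask_ratio=0.75, seed=None):
    """Split patch indices into (visible, masked) lists.

    In an MAE-style setup the encoder would see only the visible patches
    and a decoder would reconstruct the masked ones. This function covers
    only the index split, not the model.
    """
    if not 0.0 <= mask_ratio < 1.0:
        raise ValueError("mask_ratio must be in [0, 1)")
    rng = random.Random(seed)
    indices = list(range(num_patches))
    rng.shuffle(indices)
    num_masked = int(num_patches * mask_ratio)
    masked = sorted(indices[:num_masked])
    visible = sorted(indices[num_masked:])
    return visible, masked


if __name__ == "__main__":
    # Assumed setup: 224x224 image, 16x16 patches -> 14 * 14 = 196 patches.
    visible, masked = random_patch_mask((224 // 16) ** 2, seed=0)
    print(len(visible), "visible patches;", len(masked), "masked patches")
    for stage in STAGES:
        print(stage, "is initialized from", init_from(stage))
```

Under these assumptions the encoder would process only about a quarter of the patches during the first stage, and each later stage would start from the weights produced by the stage before it.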
