腾讯优图实验室 | SoftCLIP: 更软的跨模态对齐使CLIP更强大

【推荐理由】本文使用软对齐方法改进多模态大模型CLIP，使其不需要image-text pair对训练。

SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger

Yuting Gao, Jinfeng Liu, Zihan Xu, Tong Wu, Wei Liu, Jie Yang, Ke Li, Xing Sun

【论文链接】https://arxiv.org/pdf/2303.17561.pdf

【摘要】在过去的两年中，视觉语言预训练在多个下游任务中取得了显著的成功。然而，获取高质量的图像文本对，其中对是完全独立的，仍然是一个具有挑战性的任务，常用的数据集中存在噪声。为了解决这个问题，本文提出了SoftCLIP，一种新颖的方法，通过引入一个软化的目标来实现软的跨模态对齐，该目标是从细粒度的内模态自相似性生成的。内模态指导有助于使两个对有一些局部相似性，并在两种模态之间建立多对多的关系。此外，由于正样本在软化的目标分布中仍然占主导地位，我们将负样本从分布中分离出来，以进一步提高跨模态学习中与负样本的关系对齐。大量实验证明了SoftCLIP的有效性。特别是，在ImageNet零样本分类任务中，使用CC3M/CC12M作为预训练数据集，SoftCLIP相对于CLIP基线带来了6.8%/7.2%的top-1准确率提升。

腾讯优图实验室 | SoftCLIP: 更软的跨模态对齐使CLIP更强大

ufabet มีเกมให้เลือกเล่นมากมาย: เกมเดิมพันหลากหลาย ครบทุกค่ายดัง

tornado crypto mixer Discover the power of privacy with TornadoCash! Learn how this decentralized mixer ensures your transactions remain confidential.

ดูบอลสด Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

ดูบอลสด Pretty! This has been a really wonderful post. Many thanks for providing these details.

ดูบอลสด Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

Obrazy Sztuka Nowoczesna Thank you for this wonderful contribution to the topic. Your ability to explain complex ideas simply is admirable.

ufabet Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

ufabet You’re so awesome! I don’t believe I have read a single thing like that before. So great to find someone with some original thoughts on this topic. Really.. thank you for starting this up. This website is something that is needed on the internet, someone with a little originality!

ufabet Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

腾讯优图实验室 | SoftCLIP: 更软的跨模态对齐使CLIP更强大

n8n实战：Webhook、条件判断与API集成详解

谷歌太壕了！编程Agent大招至简：开源且免费，百万上下文、多模态、MCP全支持

百度文心快码AI IDE上线，首创设计稿一键转代码、支持MCP

老黄新鲜一刀，RTX 5050正式官宣

国产GPU历史性时刻！摩尔线程、沐曦同日获IPO受理

百度文心快码AI IDE上线，首创设计稿一键转代码、支持MCP

曝苹果拟收购Perplexity AI，人才一并拿走

有道14B低成本轻量模型“子曰3”开源，数学推理性能超越大模型

马斯克Robotaxi今日上路：画饼十年终兑现！团队合影C位武汉理工校友引关注

蚂蚁开源轻量级推理模型Ring-lite，多项Benchmark达到SOTA