CLIPScore: A Reference-free Evaluation Metric for Image Captioning

Motivation

image captions 通常用reference-based的评价指标（即利用人写的caption来作为GT评估生成字幕的好坏），本文提出了无需reference caption 的一个评价标准。

Proposal

本文提出了一种reference-free的评价指标——CLIPScore，这种评价标准更具鲁棒性，不再是单纯用人标注的caption来评估。
本文的方法比reference-based的评价指标CIDEr和SPICE，和人类的判断一致性更强。与现在text-text的评价标准相比，image-text的评价更为完善。

Method

直接计算图文相似度，作为评价标准~

CLIPScore: A Reference-free Evaluation Metric for Image Captioning

找了特例来说明原来reference-based的方法的缺点

Evaluation With CLIP

固定住text的prompt为 A photo depicts (原始论文显示这样的效果也更好?)
其中�是一个scaling系数, 文中设置为25; �为caption的 CLIPembedding; �为CLIP image embdding
需要强调的是, 这个评价指标只用了reference的图片, 并没有reference的文本, 所以是reference-free的评价方法.

RefCLIPScore

把reference的文本也拿过来做evaluation, 最后结果取harmonic mean

Benchmark

Flickr8K-Expert和Flickr8K-CF都是由人进行二次判断的数据集，判断字幕是否和图像对应。（human ratings）。因而我们可以计算评价指标的结果和human rating结果的相关性，从而评估这个指标的好坏。
实验结果：

ufabet มีเกมให้เลือกเล่นมากมาย: เกมเดิมพันหลากหลาย ครบทุกค่ายดัง

tornado crypto mixer Discover the power of privacy with TornadoCash! Learn how this decentralized mixer ensures your transactions remain confidential.

ดูบอลสด Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

ดูบอลสด Pretty! This has been a really wonderful post. Many thanks for providing these details.

ดูบอลสด Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

Obrazy Sztuka Nowoczesna Thank you for this wonderful contribution to the topic. Your ability to explain complex ideas simply is admirable.

ufabet Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

ufabet You’re so awesome! I don’t believe I have read a single thing like that before. So great to find someone with some original thoughts on this topic. Really.. thank you for starting this up. This website is something that is needed on the internet, someone with a little originality!

ufabet Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

CLIPScore: A Reference-free Evaluation Metric for Image Captioning

Motivation

Proposal

Method

Evaluation With CLIP

RefCLIPScore

Benchmark

清华大学研究团队测评：夸克AI与志愿填报专家专业水平相当

百度文心快码AI IDE上线，首创设计稿一键转代码、支持MCP

谷歌太壕了！编程Agent大招至简：开源且免费，百万上下文、多模态、MCP全支持

n8n实战：Webhook、条件判断与API集成详解

大模型掌握人类空间思考能力！三阶段训练框架学会“边画边想”，5个基准平均提升18.4%

陶哲轩罕见长长长长长访谈：数学、AI和给年轻人的建议

华人学者助力”数学大一统理论”新突破！4位数学家近10年完成证明

清华大学研究团队测评：夸克AI与志愿填报专家专业水平相当

只改2行代码，RAG效率暴涨30%！可扩展至百亿级数据规模应用

大模型掌握人类空间思考能力！三阶段训练框架学会“边画边想”，5个基准平均提升18.4%