纽约大学｜通过自然语言反馈训练提高代码生成质量

Improving Code Generation by Training with Natural Language Feedback

提出一种基于自然语言反馈的仿真学习算法，通过训练时的自然语言反馈来提高代码生成模型的性能。

Angelica Chen, Jérémy Scheurer, Tomasz Korbak, Jon Ander Campos, Jun Shern Chan, Samuel R. Bowman, Kyunghyun Cho, Ethan Perez
[New York University]

通过自然语言反馈训练提高代码生成质量

动机：当前大型语言模型(LLM)在推理时使用自然语言反馈的潜力受到广泛关注，本文旨在探索使用自然语言反馈在训练期间提高代码生成模型性能的方法。
方法：提出一种名为“基于自然语言反馈的仿真学习(ILF)”算法来学习自然语言反馈，并将其用于代码生成任务。ILF只需要在训练期间使用少量的人工编写的反馈信息，并且在测试时不需要相同的反馈信息，这使其既用户友好又样本高效。
优势：该方法证明了使用自然语言反馈来提高代码生成模型的性能是有效的，具有样本高效的优点，并可以连续多轮地改进模型。该方法还可以纠正代码中特定类型的错误，并生成与现实场景更加接近的训练数据。使用ILF方法在基本Python问题上的表现比在MBPP数据集上进行微调或人工编写程序的微调效果更好，相对提高了38%（绝对提高了10%）。

https://arxiv.org/abs/2303.16749

纽约大学｜通过自然语言反馈训练提高代码生成质量

ufabet มีเกมให้เลือกเล่นมากมาย: เกมเดิมพันหลากหลาย ครบทุกค่ายดัง

tornado crypto mixer Discover the power of privacy with TornadoCash! Learn how this decentralized mixer ensures your transactions remain confidential.

ดูบอลสด Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

ดูบอลสด Pretty! This has been a really wonderful post. Many thanks for providing these details.

ดูบอลสด Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

Obrazy Sztuka Nowoczesna Thank you for this wonderful contribution to the topic. Your ability to explain complex ideas simply is admirable.

ufabet Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

ufabet You’re so awesome! I don’t believe I have read a single thing like that before. So great to find someone with some original thoughts on this topic. Really.. thank you for starting this up. This website is something that is needed on the internet, someone with a little originality!

ufabet Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

纽约大学｜通过自然语言反馈训练提高代码生成质量

Improving Code Generation by Training with Natural Language Feedback

n8n实战：Webhook、条件判断与API集成详解

谷歌太壕了！编程Agent大招至简：开源且免费，百万上下文、多模态、MCP全支持

百度文心快码AI IDE上线，首创设计稿一键转代码、支持MCP

老黄新鲜一刀，RTX 5050正式官宣

国产GPU历史性时刻！摩尔线程、沐曦同日获IPO受理

00后投身具身智能创业，剑指机器人界「Model 3」！已推出21个自由度灵巧手

监督学习也能从错误中学习反思？！清华英伟达联合提出隐式负向策略爆炸提升数学能力

AI也会闹情绪了！Gemini代码调试不成功直接摆烂，马斯克都来围观

百度文心快码AI IDE上线，首创设计稿一键转代码、支持MCP

曝苹果拟收购Perplexity AI，人才一并拿走