Event Registration | RoboGen: Unleashing Infinite Data for Automated Robot Learning via Generative Simulation

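Yufei Wang

Yufei Wang is a third-year PhD student at the Robotics Institute, Carnegie Mellon University, co-advised by Prof. Zackory Erickson and Prof. David Held. He received his M.S. in Computer Science from the Computer Science Department at Carnegie Mellon University in December 2020, advised by Prof. David Held.

Before coming to Carnegie Mellon University, he received his B.S. in Data Science from Yuanpei College, Peking University in July 2019, advised by Prof. Bin Dong. His main research interest is robot learning. His graduate study is supported by the Uber Presidential Fellowship.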

RoboGen: Unleashing Infinite Data for Automated Robot Learning via Generative Simulation

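This talk introduces RoboGen, a generative robotic agent that automatically learns diverse robotic skills at scale via "generative simulation".

RoboGen leverages the latest advances in foundation models and generative models. Rather than directly using or adapting these models to produce policies or low-level actions, we advocate a generative scheme that uses these models to automatically generate diverse tasks, scenes, and training supervision, thereby scaling up robotic skill learning with minimal human supervision.

Our approach equips a robotic agent with a self-guided propose-generate-learn cycle: the agent first proposes interesting tasks and skills to develop, and then generates the corresponding simulation environments by populating the relevant objects and assets with appropriate spatial configurations. The agent then decomposes the proposed high-level task into sub-tasks, selects the best learning approach (reinforcement learning, motion planning, or trajectory optimization), generates the required training supervision, and learns policies to acquire the proposed skill.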
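The propose-generate-learn loop described above can be sketched roughly as follows. This is only an illustrative toy, not RoboGen's actual code: every name here (`propose_task`, `generate_scene`, `decompose`, `choose_method`, `learn`, `run_loop`) is hypothetical, and the fixed task list, the evenly spaced object placement, and the keyword rule are assumed stand-ins for the foundation-model queries and learning algorithms the abstract mentions.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, List, Tuple


class Method(Enum):
    # The three learning approaches named in the abstract.
    REINFORCEMENT_LEARNING = "reinforcement_learning"
    MOTION_PLANNING = "motion_planning"
    TRAJECTORY_OPTIMIZATION = "trajectory_optimization"


@dataclass
class Skill:
    task: str
    scene: Dict[str, Tuple[float, float, float]]
    policies: List[str]


# Assumption: a fixed list stands in for tasks proposed by a foundation model.
TASKS = [
    ("open the microwave door", ["microwave"]),
    ("put the mug in the drawer", ["mug", "drawer"]),
]


def propose_task(round_index: int) -> Tuple[str, List[str]]:
    """Propose a task and the assets it needs (stub)."""
    return TASKS[round_index % len(TASKS)]


def generate_scene(assets: List[str]) -> Dict[str, Tuple[float, float, float]]:
    """Place each asset 0.5 m apart along the x axis (toy spatial configuration)."""
    return {name: (0.5 * i, 0.0, 0.0) for i, name in enumerate(assets)}


def decompose(task: str, assets: List[str]) -> List[str]:
    """Split the task into sub-tasks: reach each asset, then do the task itself."""
    return [f"reach the {name}" for name in assets] + [task]


def choose_method(subtask: str) -> Method:
    """Toy keyword rule; a real system would choose among all three methods."""
    if subtask.startswith("reach"):
        return Method.MOTION_PLANNING
    return Method.REINFORCEMENT_LEARNING


def learn(subtask: str, method: Method) -> str:
    """Return a label standing in for a trained policy (no training happens)."""
    return f"policy[{method.value}]: {subtask}"


def run_loop(num_rounds: int) -> List[Skill]:
    """Run the propose -> generate -> learn cycle num_rounds times."""
    skills = []
    for i in range(num_rounds):
        task, assets = propose_task(i)        # propose
        scene = generate_scene(assets)        # generate
        policies = [learn(sub, choose_method(sub))
                    for sub in decompose(task, assets)]  # learn
        skills.append(Skill(task, scene, policies))
    return skills


if __name__ == "__main__":
    for skill in run_loop(2):
        print(skill.task, skill.scene, skill.policies)
```

Because each round only depends on the proposal step, calling `run_loop` with a larger round count simply yields more task, scene, and policy triples, which mirrors the idea of a pipeline that can be queried repeatedly.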

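Our work attempts to extract the broad and general knowledge embedded in large models and transfer it to the field of robotics. Our fully generative pipeline can be queried repeatedly, producing an endless stream of skill demonstrations associated with diverse tasks and environments.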

Foundation models pretrained on internet vision and language data have acquired broad knowledge. As these models continue to evolve, they inevitably face the challenge of outperforming their data-driven behaviors, especially in low-data situations and when faced with intricate tasks demanding extended reasoning, search, or optimization. Such tasks have been at the core of sequential decision making, encompassing areas such as planning and reinforcement learning. Sequential decision making has traditionally faced the challenges of sample efficiency and generalization, partially due to the inability to incorporate broad knowledge from internet data.

In this talk, I will present three foundation-model-inspired approaches (representation learning, conditional generative modeling, and repurposing pretrained vision and language models) that leverage broad knowledge from foundation models to solve more complex tasks such as continuous control, navigation, robotic manipulation, and game play.

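Time: Thursday, November 16, 11:00-12:00

Format: online livestream; scan the QR code below to register

Online discussion: click "Read the original" to chat one-on-one in the BAAI Community (智源社区)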
