RoboGen:通过生成模拟释放无限数据来实现自动化机器人学习

578次阅读
没有评论

RoboGen:通过生成模拟释放无限数据来实现自动化机器人学习

简介:

在本次演讲中,我将介绍 RoboGen,这是一种生成式的机器人智能体,可以通过“生成模拟”自动大规模学习各种机器人技能。 RoboGen 利用基础模型和生成模型的最新进展。我们不直接使用或调整这些模型来产生策略或低级动作,而是提倡一种生成方案,该方案使用这些模型自动生成多样化的任务、场景和训练监督,从而在最少的人类监督下扩大机器人技能的学习。我们的方法为机器人智能体配备了一个自我引导的提议-生成-学习循环:智能体首先提出要开发的有趣任务和技能,然后通过使用适当的空间配置填充相关对象和资产来生成相应的模拟环境。然后,智能体将提议的高级任务分解为子任务,选择最佳学习方法(强化学习、运动规划或轨迹优化),生成所需的训练监督,然后学习策略以获得提议的技能。我们的工作试图提取大型模型中嵌入的广泛且通用的知识,并将其转移到机器人领域。我们的完全生成管道可以重复查询,产生与不同任务和环境相关的无穷无尽的技能演示。

In this talk, I will present RoboGen, a generative robotic agent that automatically learns diverse robotic skills at scale via generative simulation. RoboGen leverages the latest advancements in foundation and generative models. Instead of directly using or adapting these models to produce policies or low-level actions, we advocate for a generative scheme, which uses these models to automatically generate diversified tasks, scenes, and training supervisions, thereby scaling up robotic skill learning with minimal human supervision. Our approach equips a robotic agent with a self-guided propose-generate-learn cycle: the agent first proposes interesting tasks and skills to develop, and then generates corresponding simulation environments by populating pertinent objects and assets with proper spatial configurations. Afterwards, the agent decomposes the proposed high-level task into sub-tasks, selects the optimal learning approach (reinforcement learning, motion planning, or trajectory optimization), generates required training supervision, and then learns policies to acquire the proposed skill. Our work attempts to extract the extensive and versatile knowledge embedded in large-scale models and transfer them to the field of robotics. Our fully generative pipeline can be queried repeatedly, producing an endless stream of skill demonstrations associated with diverse tasks and environments.

个人简介:
王宇飞是卡内基梅隆大学机器人研究所三年级博士生。他由 Zackory Erickson 教授和 David Held 教授共同指导。他于 2020 年 12 月在 David Held 教授的指导下获得卡内基梅隆大学计算机科学系计算机科学硕士学位。在来到卡耐基梅隆大学之前,他于2019 年 7 月获得北京大学元培学院数据科学学士学位,导师为董彬教授。他的主要研究兴趣是机器人学习。他的研究生学习得到了优步总统奖学金的支持。

Yufei Wang is a third year Phd student at Robotics Institute, Carnegie Mellon University. He is co-advised by Prof. Zackory Erickson and Prof. David Held. He received M.S. in Computer Science from Computer Science Department, Carnegie Mellon University in Dec, 2020, advised by Prof. David Held. Before coming to CMU, he received B.S. in Data Science from Yuanpei College, Peking University in July 2019, advised by Prof. Bin Dong. His general research interest is robot learning. His graduate study is supported by the Uber Presidential Fellowship.

 

Read More 

正文完
可以使用微信扫码关注公众号(ID:xzluomor)
post-qrcode
 
评论(没有评论)
Generated by Feedzy