Virtual training uses generative AI to teach robots how to traverse real world terrain

Teaching robots to traverse real world terrain using generative AI-developed scenarios in a virtual world — Learning a real-world policy from generated images. Left: we generate diverse and on-policy visual data by combining structured image prompts with geometric and semantic control from an underlying physics simulator. Right: the policy is sufficiently robust to transfer to a variety of challenging terrains in the real world, despite never having seen real data during training. Credit: *arXiv* (2024). DOI: 10.48550/arxiv.2411.00083

A team of roboticists and engineers at MIT CSAIL, Institute for AI and Fundamental Interactions, has developed a generative AI approach to teaching robots how to traverse terrain and move around objects in the real world.

Tin Can Is a Dumb Phone for Kids. Can Someone Teach Them How to Use It?

February 28, 2026

Everything Larry and David Ellison Will Control If Paramount Buys Warner Bros.

February 28, 2026

The group has published a paper describing their work and possible uses for it on the arXiv preprint server. They also presented their ideas at the recent Conference on Robot Learning (CORL 2024), held in Munich Nov. 6–9.

Getting robots to navigate in the real world at some point involves teaching them to learn on the fly, or by training them with videos of similar robots in a real-world environment. While such training has proven to be effective in limited environments, it tends to fail when a robot encounters something novel. In this new effort, the team at MIT developed virtual training that better translates to the real world.

The work involved using generative AI and a physics simulator to allow a robot to navigate a virtual world as a means for learning to operate in the real world. They call the system LucidSim and have used it to train a robotic dog in parkour, a sport where players attempt to traverse obstacles in unknown territory as quickly as possible.

The approach involves first prompting ChatGPT with thousands of queries designed to get the LLM to create descriptions of a wide range of environments, including outdoor weather. Next, the descriptions given by ChatGPT are fed to a 3D mapping system that uses them (along with AI generated images and physics simulators) to generate a video that also gives a trajectory for the robot to follow.

The robot is then trained to make its way through the terrain in the virtual world and learn skills that it can use in a real environment. Robots trained using the system learned to clamber over boxes, climb stairs and deal with whatever they encountered. After virtual training, the robot was tested in the real world.

The researchers tested their system using a small, four-legged robot equipped with a webcam. They found it performed better than a similar system trained the traditional way. The team suggests that improvements to their system could lead to a new approach to training robots in general.

More information:
Alan Yu et al, Learning Visual Parkour from Generated Images, arXiv (2024). DOI: 10.48550/arxiv.2411.00083

LucidSim: lucidsim.github.io/

Journal information:
arXiv

Citation:
Virtual training uses generative AI to teach robots how to traverse real world terrain (2024, November 12)
retrieved 12 November 2024
from https://techxplore.com/news/2024-11-virtual-generative-ai-robots-traverse.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.

Source link