MIT this week showcased a new model for training robots. Rather than the standard set of focused data used to teach robots new tasks, the method goes big, mimicking the massive troves of information used to train large language models (LLMs).

The researchers note that imitation learning — in which the agent learns by following an individual performing a task — can fail when small variations are introduced, such as changes in lighting, a different setting, or new obstacles. In those scenarios, the robots simply don’t have enough data to draw on in order to adapt.

The team looked to models like GPT-4 for a kind of brute force data approach to problem solving.

“In the language domain, the data are all just sentences,” says Lirui Wang, the new paper’s lead author. “In robotics, given all the heterogeneity in the data, if you want to pretrain in a similar manner, we need a different architecture.”

The team introduced a new architecture called Heterogeneous Pretrained Transformers (HPT), which pulls together information from different sensors and different environments. A transformer then consolidates that data into training models; the larger the transformer, the better the output.
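To make the idea concrete, here is a minimal sketch of the pattern HPT follows: each robot "embodiment" gets its own small stem that projects its raw sensor data into a shared token space, and a single shared trunk processes the resulting token sequence regardless of where the tokens came from. The class names, dimensions, and the simplified trunk below are illustrative assumptions, not the paper's actual API.

```python
import numpy as np

rng = np.random.default_rng(0)
TOKEN_DIM = 32  # shared token width consumed by the trunk (assumed value)

class Stem:
    """Projects one modality's raw features into the shared token space."""
    def __init__(self, input_dim):
        self.w = rng.normal(0, 0.1, size=(input_dim, TOKEN_DIM))

    def __call__(self, x):
        return x @ self.w  # shape: (n_tokens, TOKEN_DIM)

class Trunk:
    """Stand-in for the shared transformer: pools tokens into one feature."""
    def __init__(self):
        self.w = rng.normal(0, 0.1, size=(TOKEN_DIM, TOKEN_DIM))

    def __call__(self, tokens):
        return tokens.mean(axis=0) @ self.w  # shape: (TOKEN_DIM,)

# Two very different input streams: camera features and joint angles.
stems = {"camera": Stem(input_dim=128), "proprio": Stem(input_dim=7)}
trunk = Trunk()

camera_tokens = stems["camera"](rng.normal(size=(16, 128)))
proprio_tokens = stems["proprio"](rng.normal(size=(1, 7)))

# Heterogeneous data becomes one token sequence for the shared trunk.
tokens = np.concatenate([camera_tokens, proprio_tokens], axis=0)
feature = trunk(tokens)
print(tokens.shape, feature.shape)  # (17, 32) (32,)
```

The design point is that only the stems are embodiment-specific; the trunk is shared across all robots and datasets, which is what lets the pretraining scale the way LLM pretraining does.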

Users then input the robot design, configuration, and the job they want done.

“Our dream is to have a universal robot brain that you could download and use for your robot without any training at all,” CMU associate professor David Held said of the research. “While we are just in the early stages, we are going to keep pushing hard and hope scaling leads to a breakthrough in robotic policies, like it did with large language models.”

The research was funded, in part, by the Toyota Research Institute. Last year at TechCrunch Disrupt, TRI debuted a method for training robots overnight. More recently, it struck a watershed partnership that will unite its robot learning research with Boston Dynamics hardware.
