• Business
  • Markets
  • Politics
  • Crypto
  • Finance
  • Intelligence
    • Policy Intelligence
    • Security Intelligence
    • Economic Intelligence
    • Fashion Intelligence
  • Energy
  • Technology
  • Taxes
  • Creator Economy
  • Wealth Management
  • LBNN Blueprints
  • Business
  • Markets
  • Politics
  • Crypto
  • Finance
  • Intelligence
    • Policy Intelligence
    • Security Intelligence
    • Economic Intelligence
    • Fashion Intelligence
  • Energy
  • Technology
  • Taxes
  • Creator Economy
  • Wealth Management
  • LBNN Blueprints

Adding audio data when training robots helps them do a better job

Simon Osuji by Simon Osuji
July 5, 2024
in Artificial Intelligence
0
Adding audio data when training robots helps them do a better job
0
SHARES
1
VIEWS
Share on FacebookShare on Twitter


Adding audio data when training robots helps them do a better job
Wiping Evaluation. Up: Different test scenarios. Bottom: Typical failure cases and task success rate. [Vision only] policy often fails to maintain proper contact (e.g., either press too hard into the broad or float). [MLP fusion] policy often fails to fully wipe out the drawing and terminate early. Credit: arXiv (2024). DOI: 10.48550/arxiv.2406.19464

A combined team of roboticists from Stanford University and the Toyota Research Institute has found that adding audio data to visual data when training robots helps to improve their learning skills. The team has posted their research on the arXiv preprint server.

Related posts

They Bet Against Trump’s Tariffs. Now They Stand to Make Millions

They Bet Against Trump’s Tariffs. Now They Stand to Make Millions

February 20, 2026
The CDC Has a Leadership Crisis

The CDC Has a Leadership Crisis

February 20, 2026

The researchers noted that virtually all training done with AI-based robots involves exposing them to a large amount of visual information, while ignoring associated audio. They wondered if adding microphones to robots and allowing them to collect data regarding how something is supposed to sound as it is being done might help them learn a task better.

For example, if a robot is supposed to learn how to open a box of cereal and fill a bowl with it, it may be helpful to hear the sounds of a box being opened and the dryness of the cereal as it cascades down into a bowl. To find out, the team designed and carried out four robot-learning experiments.

The first experiment involved teaching a robot to turn over a bagel in a frying pan using a spatula. The second involved teaching a robot to use an eraser to erase an image on a white board. The third was pouring dice held in a cup into another cup and the fourth was to choose the correct size of tape from three available samples and to use it to tape a wire to a plastic strip.






All the experiments involved using the same robot equipped with a grasping claw. All of them were also done in two ways, using video only and using video and audio. The research team also varied teaching and performance factors such as table height, type of tape or the kind of image on the white board.

After running all their experiments, the researchers compared the results by judging how quickly and easily the robots were able to learn and carry out the tasks and also their accuracy. They found that adding audio significantly improved speed and accuracy with some tasks, but not others.

Adding audio to the task of pouring dice, for example, dramatically improved the robot’s ability to figure out if there were any dice in the cup. It also helped the robot understand if it was exerting the right amount of pressure on the eraser, because of the unique sound that was made. Adding sound did not help much, on the other hand, in determining if the bagel had been turned successfully or if all of an image had been successfully removed from a white board.

The team concludes by suggesting that their work shows that adding audio to teaching material for AI robots could provide better results for some applications.

More information:
Zeyi Liu et al, ManiWAV: Learning Robot Manipulation from In-the-Wild Audio-Visual Data, arXiv (2024). DOI: 10.48550/arxiv.2406.19464

Project page: mani-wav.github.io/

Journal information:
arXiv

© 2024 Science X Network

Citation:
Adding audio data when training robots helps them do a better job (2024, July 5)
retrieved 5 July 2024
from https://techxplore.com/news/2024-07-adding-audio-robots-job.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.





Source link

Previous Post

World stocks at record high, UK Labour landslide and US payrolls hog spotlight

Next Post

Kenya’s Ruto removes budget for first lady’s office, dissolves 40 agencies

Next Post
Kenya’s Ruto removes budget for first lady’s office, dissolves 40 agencies

Kenya's Ruto removes budget for first lady’s office, dissolves 40 agencies

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

RECOMMENDED NEWS

Can you Trade Forex on Webull?

Can you Trade Forex on Webull?

1 year ago
Government Surveillance Reform Act of 2023 Seeks to End Warrantless Police and FBI Spying

Government Surveillance Reform Act of 2023 Seeks to End Warrantless Police and FBI Spying

2 years ago
BNB Chain Report Shows Web3 Growth Despite Bear Market, Beam Wallet Leveraging Optimism and Base Launches, Binance Withdraws its Crypto License Application in Germany

BNB Chain Report Shows Web3 Growth Despite Bear Market, Beam Wallet Leveraging Optimism and Base Launches, Binance Withdraws its Crypto License Application in Germany

3 years ago
Sudan’s Civil War Is a Humanitarian Catastrophe. Washington Can Keep It From Getting Worse.

Sudan’s Civil War Is a Humanitarian Catastrophe. Washington Can Keep It From Getting Worse.

7 months ago

POPULAR NEWS

  • Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    0 shares
    Share 0 Tweet 0
  • The world’s top 10 most valuable car brands in 2025

    0 shares
    Share 0 Tweet 0
  • Top 10 African countries with the highest GDP per capita in 2025

    0 shares
    Share 0 Tweet 0
  • Global ranking of Top 5 smartphone brands in Q3, 2024

    0 shares
    Share 0 Tweet 0
  • When Will SHIB Reach $1? Here’s What ChatGPT Says

    0 shares
    Share 0 Tweet 0

Get strategic intelligence you won’t find anywhere else. Subscribe to the Limitless Beliefs Newsletter for monthly insights on overlooked business opportunities across Africa.

Subscription Form

© 2026 LBNN – All rights reserved.

Privacy Policy | About Us | Contact

Tiktok Youtube Telegram Instagram Linkedin X-twitter
No Result
View All Result
  • Home
  • Business
  • Politics
  • Markets
  • Crypto
  • Economics
    • Manufacturing
    • Real Estate
    • Infrastructure
  • Finance
  • Energy
  • Creator Economy
  • Wealth Management
  • Taxes
  • Telecoms
  • Military & Defense
  • Careers
  • Technology
  • Artificial Intelligence
  • Investigative journalism
  • Art & Culture
  • LBNN Blueprints
  • Quizzes
    • Enneagram quiz
  • Fashion Intelligence

© 2023 LBNN - All rights reserved.