• Business
  • Markets
  • Politics
  • Crypto
  • Finance
  • Intelligence
    • Policy Intelligence
    • Security Intelligence
    • Economic Intelligence
    • Fashion Intelligence
  • Energy
  • Technology
  • Taxes
  • Creator Economy
  • Wealth Management
  • LBNN Blueprints
  • Business
  • Markets
  • Politics
  • Crypto
  • Finance
  • Intelligence
    • Policy Intelligence
    • Security Intelligence
    • Economic Intelligence
    • Fashion Intelligence
  • Energy
  • Technology
  • Taxes
  • Creator Economy
  • Wealth Management
  • LBNN Blueprints

Using AI to turn sound recordings into accurate street images

Simon Osuji by Simon Osuji
November 27, 2024
in Artificial Intelligence
0
Using AI to turn sound recordings into accurate street images
0
SHARES
0
VIEWS
Share on FacebookShare on Twitter


Researchers use AI to turn sound recordings into accurate street images
Credit: University of Texas at Austin

Using generative artificial intelligence, a team of researchers at The University of Texas at Austin has converted sounds from audio recordings into street-view images. The visual accuracy of these generated images demonstrates that machines can replicate human connection between audio and visual perception of environments.

Related posts

Here’s Every Country Directly Impacted by the War on Iran

Here’s Every Country Directly Impacted by the War on Iran

March 5, 2026
The Controversies Finally Caught Up to Kristi Noem

The Controversies Finally Caught Up to Kristi Noem

March 5, 2026

In a paper published in Computers, Environment and Urban Systems, the research team describes training a soundscape-to-image AI model using audio and visual data gathered from a variety of urban and rural streetscapes and then using that model to generate images from audio recordings.

“Our study found that acoustic environments contain enough visual cues to generate highly recognizable streetscape images that accurately depict different places,” said Yuhao Kang, assistant professor of geography and the environment at UT and co-author of the study. “This means we can convert the acoustic environments into vivid visual representations, effectively translating sounds into sights.”

Using YouTube video and audio from cities in North America, Asia and Europe, the team created pairs of 10-second audio clips and image stills from the various locations and used them to train an AI model that could produce high-resolution images from audio input. They then compared AI sound-to-image creations made from 100 audio clips to their respective real-world photos, using both human and computer evaluations.

Computer evaluations compared the relative proportions of greenery, building and sky between source and generated images, whereas human judges were asked to correctly match one of three generated images to an audio sample.

Researchers use AI to turn sound recordings into accurate street images
Credit: University of Texas at Austin

The results showed strong correlations in the proportions of sky and greenery between generated and real-world images and a slightly lesser correlation in building proportions. And human participants averaged 80% accuracy in selecting the generated images that corresponded to source audio samples.

“Traditionally, the ability to envision a scene from sounds is a uniquely human capability, reflecting our deep sensory connection with the environment. Our use of advanced AI techniques supported by large language models (LLMs) demonstrates that machines have the potential to approximate this human sensory experience,” Kang said.

“This suggests that AI can extend beyond mere recognition of physical surroundings to potentially enrich our understanding of human subjective experiences at different places.”

In addition to approximating the proportions of sky, greenery and buildings, the generated images often maintained the architectural styles and distances between objects of their real-world image counterparts, as well as accurately reflecting whether soundscapes were recorded during sunny, cloudy or nighttime lighting conditions.

The authors note that lighting information might come from variations in activity in the soundscapes. For example, traffic sounds or the chirping of nocturnal insects could reveal time of day. Such observations further the understanding of how multisensory factors contribute to our experience of a place.

“When you close your eyes and listen, the sounds around you paint pictures in your mind,” Kang said. “For instance, the distant hum of traffic becomes a bustling cityscape, while the gentle rustle of leaves ushers you into a serene forest. Each sound weaves a vivid tapestry of scenes, as if by magic, in the theater of your imagination.”

Kang’s work focuses on using geospatial AI to study the interaction of humans with their environments. In another recent paper published in Humanities and Social Sciences Communications, he and his co-authors examined the potential of AI to capture the characteristics that give cities their unique identities.

More information:
Yonggai Zhuang et al, From hearing to seeing: Linking auditory and visual place perceptions with soundscape-to-image generative artificial intelligence, Computers, Environment and Urban Systems (2024). DOI: 10.1016/j.compenvurbsys.2024.102122

Kee Moon Jang et al, Place identity: a generative AI’s perspective, Humanities and Social Sciences Communications (2024). DOI: 10.1057/s41599-024-03645-7

Provided by
University of Texas at Austin

Citation:
Using AI to turn sound recordings into accurate street images (2024, November 27)
retrieved 27 November 2024
from https://techxplore.com/news/2024-11-ai-accurate-street-images.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.





Source link

Previous Post

Kronos, Idorsia plan layoffs; PTC shelves ALS drug

Next Post

Army aims to move its pay system to the cloud by 2027

Next Post
Army aims to move its pay system to the cloud by 2027

Army aims to move its pay system to the cloud by 2027

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

RECOMMENDED NEWS

UAE Hits Record $340 Billion in Non-Oil Trade as It Joins

UAE Hits Record $340 Billion in Non-Oil Trade as It Joins

3 years ago
Bias in AI amplifies our own biases, finds study

Bias in AI amplifies our own biases, finds study

1 year ago
How to Sell Large Amounts of Bitcoin? (2023 Guide)

US House Speaker Uncertainty Threaten Crypto Legislation

2 years ago
The Secret to Living Past 120 Years Old? Nanobots

The Secret to Living Past 120 Years Old? Nanobots

2 years ago

POPULAR NEWS

  • Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    0 shares
    Share 0 Tweet 0
  • Mahama attends Liberia’s 178th independence anniversary

    0 shares
    Share 0 Tweet 0
  • The world’s top 10 most valuable car brands in 2025

    0 shares
    Share 0 Tweet 0
  • Top 10 African countries with the highest GDP per capita in 2025

    0 shares
    Share 0 Tweet 0
  • Global ranking of Top 5 smartphone brands in Q3, 2024

    0 shares
    Share 0 Tweet 0

Get strategic intelligence you won’t find anywhere else. Subscribe to the Limitless Beliefs Newsletter for monthly insights on overlooked business opportunities across Africa.

Subscription Form

© 2026 LBNN – All rights reserved.

Privacy Policy | About Us | Contact

Tiktok Youtube Telegram Instagram Linkedin X-twitter
No Result
View All Result
  • Home
  • Business
  • Politics
  • Markets
  • Crypto
  • Economics
    • Manufacturing
    • Real Estate
    • Infrastructure
  • Finance
  • Energy
  • Creator Economy
  • Wealth Management
  • Taxes
  • Telecoms
  • Military & Defense
  • Careers
  • Technology
  • Artificial Intelligence
  • Investigative journalism
  • Art & Culture
  • LBNN Blueprints
  • Quizzes
    • Enneagram quiz
  • Fashion Intelligence

© 2023 LBNN - All rights reserved.