    Using AI to turn sound recordings into accurate street images

    By Kavish, December 16, 2024


    Researchers use AI to turn sound recordings into accurate street images
    Credit: University of Texas at Austin

    Using generative artificial intelligence, a team of researchers at The University of Texas at Austin has converted sounds from audio recordings into street-view images. The visual accuracy of these generated images demonstrates that machines can replicate the human connection between the audio and visual perception of environments.

    In a paper published in Computers, Environment and Urban Systems, the research team describes training a soundscape-to-image AI model using audio and visual data gathered from a variety of urban and rural streetscapes and then using that model to generate images from audio recordings.

    “Our study found that acoustic environments contain enough visual cues to generate highly recognizable streetscape images that accurately depict different places,” said Yuhao Kang, assistant professor of geography and the environment at UT and co-author of the study. “This means we can convert the acoustic environments into vivid visual representations, effectively translating sounds into sights.”

    Using YouTube video and audio from cities in North America, Asia and Europe, the team created pairs of 10-second audio clips and image stills from the various locations and used them to train an AI model that could produce high-resolution images from audio input. They then compared AI sound-to-image creations made from 100 audio clips to their respective real-world photos, using both human and computer evaluations.
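As a rough illustration of the pairing step described above, the sketch below splits a video of known length into consecutive 10-second audio windows and assigns one still-frame timestamp to each. The article does not say where in each clip the still was taken; using the midpoint is an assumption, and `ClipPair` and `plan_pairs` are hypothetical names, not the researchers' code.

```python
from typing import List, NamedTuple


class ClipPair(NamedTuple):
    """One training pair: an audio window and the time of its image still."""
    audio_start: float  # seconds
    audio_end: float    # seconds
    still_time: float   # seconds; midpoint of the window (an assumption)


def plan_pairs(duration_s: float, clip_s: float = 10.0) -> List[ClipPair]:
    """Split a video into full, non-overlapping clips of clip_s seconds.

    A trailing remainder shorter than clip_s is dropped.
    """
    if clip_s <= 0:
        raise ValueError("clip_s must be positive")
    pairs = []
    n_full = int(duration_s // clip_s)
    for i in range(n_full):
        start = i * clip_s
        pairs.append(ClipPair(start, start + clip_s, start + clip_s / 2))
    return pairs


if __name__ == "__main__":
    for pair in plan_pairs(35.0):
        print(pair)
```

A 35-second video would yield three pairs here, with the final 5 seconds discarded.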

    Computer evaluations compared the relative proportions of greenery, building and sky between source and generated images, whereas human judges were asked to correctly match one of three generated images to an audio sample.
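One way to picture the computer evaluation is as a pixel count over a labeled image. The sketch below assumes each image has already been turned into a grid of per-pixel class labels by some segmentation step, which the article does not describe; the label names and the `class_proportions` helper are hypothetical.

```python
from collections import Counter
from typing import Dict, Sequence

# Classes named in the article; any other label only counts toward the total.
CLASSES = ("sky", "greenery", "building")


def class_proportions(mask: Sequence[Sequence[str]]) -> Dict[str, float]:
    """Return the fraction of pixels carrying each class label."""
    counts = Counter(label for row in mask for label in row)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("mask has no pixels")
    return {name: counts[name] / total for name in CLASSES}


if __name__ == "__main__":
    # Tiny 2x4 "image" standing in for a segmented street scene.
    toy_mask = [
        ["sky", "sky", "sky", "building"],
        ["greenery", "greenery", "building", "road"],
    ]
    print(class_proportions(toy_mask))
```

Running the same function on a real photo and on its generated counterpart gives two sets of proportions that can then be compared.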


    The results showed strong correlations in the proportions of sky and greenery between generated and real-world images, and a slightly weaker correlation in building proportions. Human participants averaged 80% accuracy in selecting the generated images that corresponded to source audio samples.
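To make these two results more concrete, here is a minimal sketch of how a correlation between real and generated proportions could be computed, and of the guessing baseline for a three-way matching task. The article does not name the correlation measure used; Pearson's coefficient is an assumption, and both function names are hypothetical.

```python
import math
from typing import Sequence


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation between two equal-length series."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two series of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_x * var_y)


def chance_accuracy(n_options: int) -> float:
    """Expected accuracy from random guessing among n_options choices,
    assuming exactly one option is correct."""
    if n_options < 1:
        raise ValueError("n_options must be at least 1")
    return 1.0 / n_options


if __name__ == "__main__":
    # Made-up sky proportions for illustration only, not data from the study.
    real = [0.10, 0.25, 0.40, 0.55]
    generated = [0.12, 0.22, 0.43, 0.50]
    print(pearson(real, generated))
    print(chance_accuracy(3))
```

If exactly one of the three images is correct, random guessing would score about 33%, which puts the reported 80% average in context.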

    “Traditionally, the ability to envision a scene from sounds is a uniquely human capability, reflecting our deep sensory connection with the environment. Our use of advanced AI techniques supported by large language models (LLMs) demonstrates that machines have the potential to approximate this human sensory experience,” Kang said.

    “This suggests that AI can extend beyond mere recognition of physical surroundings to potentially enrich our understanding of human subjective experiences at different places.”

    In addition to approximating the proportions of sky, greenery and buildings, the generated images often maintained the architectural styles and distances between objects of their real-world image counterparts, as well as accurately reflecting whether soundscapes were recorded during sunny, cloudy or nighttime lighting conditions.

    The authors note that lighting information might come from variations in activity in the soundscapes. For example, traffic sounds or the chirping of nocturnal insects could reveal time of day. Such observations further the understanding of how multisensory factors contribute to our experience of a place.

    “When you close your eyes and listen, the sounds around you paint pictures in your mind,” Kang said. “For instance, the distant hum of traffic becomes a bustling cityscape, while the gentle rustle of leaves ushers you into a serene forest. Each sound weaves a vivid tapestry of scenes, as if by magic, in the theater of your imagination.”

    Kang’s work focuses on using geospatial AI to study the interaction of humans with their environments. In another recent paper published in Humanities and Social Sciences Communications, he and his co-authors examined the potential of AI to capture the characteristics that give cities their unique identities.

    More information:
    Yonggai Zhuang et al, From hearing to seeing: Linking auditory and visual place perceptions with soundscape-to-image generative artificial intelligence, Computers, Environment and Urban Systems (2024). DOI: 10.1016/j.compenvurbsys.2024.102122

    Kee Moon Jang et al, Place identity: a generative AI’s perspective, Humanities and Social Sciences Communications (2024). DOI: 10.1057/s41599-024-03645-7

    Provided by
    University of Texas at Austin


    Citation:
    Using AI to turn sound recordings into accurate street images (2024, November 27)
    retrieved 16 December 2024
    from https://techxplore.com/news/2024-11-ai-accurate-street-images.html

    This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
    part may be reproduced without the written permission. The content is provided for information purposes only.




