Close Menu
Xarkas BlogXarkas Blog
    What's Hot

    Pokemon TCG Live is Giving Away More Free Packs and Full-Art Cards

    April 12, 2026

    CMF Phone 3 Pro Launch Timeline Tipped: Snapdragon 7s Gen 4 SoC, Larger Battery, FHD+ OLED Display Expected

    April 11, 2026

    8 Strongest Characters Madara Never Fought

    April 11, 2026
    Facebook X (Twitter) Instagram
    Xarkas BlogXarkas Blog
    • Tech News

      Hummer EV Price in India 2026: Complete Guide, Features, Specifications & Availability

      April 2, 2026

      Apple Vision Pro vs Meta Quest 3: The Ultimate VR Headset Showdown

      December 3, 2025

      ChatGPT told them they were special — their families say it led to tragedy

      November 24, 2025

      Beehiiv’s CEO isn’t worried about newsletter saturation

      November 24, 2025

      TechCrunch Mobility: Searching for the robotaxi tipping point

      November 24, 2025
    • Mobiles

      CMF Phone 3 Pro Launch Timeline Tipped: Snapdragon 7s Gen 4 SoC, Larger Battery, FHD+ OLED Display Expected

      April 11, 2026

      Infinix Note 60 Pro Full Specifications Confirmed Ahead of Launch in India on April 13

      April 11, 2026

      Samsung One UI 8.5 Beta Update Officially Rolling Out to More Galaxy Phones in India: How to Update

      April 11, 2026

      Samsung Galaxy A57, Galaxy A37 Now Available for Purchase in India: Check Price, Specifications, Offers

      April 11, 2026

      Motorola Edge 70 Pro+ Launch in India Tipped: New Model Surfaces In Certification Listing

      April 10, 2026
    • Gaming

      Pokemon TCG Live is Giving Away More Free Packs and Full-Art Cards

      April 12, 2026

      8 Strongest Characters Madara Never Fought

      April 11, 2026

      Crimson Desert Guide: Quests, Puzzles, & Tips

      April 11, 2026

      Top 10 Anime Masterpieces of the Last 30 Years, Ranked

      April 11, 2026

      “Horrendous” Crimson Desert Players Want a Fix for Indoor Lighting

      April 11, 2026
    • SEO Tips
    • PC/ Laptops

      Dell Pro 14 (AMD Ryzen AI 7 Pro 350) Review: The Sensible Choice for Everyday Office Work

      January 9, 2026

      CES 2026: MSI Unveils New Prestige, Raider, Stealth and Crosshair Laptops with Intel Core Ultra SoCs

      January 7, 2026

      CES 2026: Samsung Unveils New Galaxy Book6 Laptops

      January 6, 2026

      CES 2026: HP Shows a Keyboard-Based PC and New EliteBooks

      January 6, 2026

      CES 2026: Intel Unveils Core Ultra Series 3, Its First Platform Built on 18A

      January 6, 2026
    • EV

      Hummer EV Price in India 2026: Complete Guide, Features, Specifications & Availability

      April 2, 2026

      Here’s How Much It Costs

      November 15, 2025

      Sodium-Ion Batteries Have Landed In America. The Hard Part Starts Now

      November 15, 2025

      Mazda Begins Testing Its Long-Overdue U.S. EV

      November 14, 2025

      Volkswagen Adds Smartwatch Support For U.S. Vehicles

      November 14, 2025
    • Gadget
    • AI
    Facebook
    Xarkas BlogXarkas Blog
    Home - Editor's Choice - AI system can envision an entire world from a single picture
    Editor's Choice

    AI system can envision an entire world from a single picture

    KavishBy KavishDecember 19, 2024No Comments5 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    AI system can envision an entire world from a single picture
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email


    AI system can envision an entire world from a single picture
    Three panorama representations that can be transformed into one another. Credit: arXiv (2024). DOI: 10.48550/arxiv.2412.09624

    Johns Hopkins computer scientists have created an artificial intelligence system capable of “imagining” its surroundings without having to physically explore them, bringing AI closer to humanlike reasoning.

    The new system—called Generative World Explorer, or GenEx—needs only a single still image to conjure an entire world, giving it a significant advantage over previous systems that required a robot or agent to physically move through a scene to map the surrounding environment, which can be costly, unsafe, and time-consuming. The team’s results are posted to the arXiv preprint server.

    “Say you’re in an area you’ve never been before—as a human, you use environmental cues, past experiences, and your knowledge of the world to imagine what might be around the corner,” says senior author Alan Yuille, the Bloomberg Distinguished Professor of Computational Cognitive Science at Johns Hopkins.

    “GenEx ‘imagines’ and reasons about its environment the way humans do, making educated decisions about what steps it should take next without having to physically check its environment first.”

    GenEx uses sophisticated world knowledge to generate multiple possibilities of what might exist beyond the visible image, assigning different probabilities to each scenario rather than making a single definitive guess. This ability to mentally map surroundings from limited visual data is crucial for many real-world applications, including in scenarios such as disaster response. For instance, rescue teams could use a single surveillance image to help explore hazardous sites from afar without risk to humans or valuable equipment.

    “This technology can also improve navigation apps, assist in training autonomous robots, and power immersive gaming and VR experiences,” says lead author Jieneng Chen, a Ph.D. student in computer science.






    Credit: JHU Center for Language and Speech Processing

    From a single image, GenEx generates a realistic, synthetic virtual world where AI agents can navigate and make decisions through reasoning and planning. The agent needs only a view of its current scene, a direction of movement, and the distance to traverse. As demonstrated in the animation below, the agent can move forward, change direction, and explore its environment with unlimited flexibility.

    And unlike the dreamlike AI world exploration apps now gaining popularity—such as Oasis, an AI-generated Minecraft simulator—GenEx’s environments are consistent. This is because the model was trained on large-scale data with a technique called “spherical consistency learning,” which ensures that its predictions of new environments fit within a panoramic sphere.

    “We measure this by having GenEx navigate a randomly sampled closed path, returning to the origin in a fixed loop,” Chen says. “Our goal was to make the start and end views identical, thus ensuring consistency in GenEx’s world modeling.”

    While this consistency isn’t unique to GenEx, the research team says it is the first and only generative world explorer to empower AI agents to make logical decisions based on new observations about the world they’re exploring in a process the computer scientists call “imagination-augmented policy.”

    For example, say you are driving and the light ahead is green, but you notice that the taxi in front of you has come to an abrupt, unexpected stop. Getting out of your car to investigate would be unsafe, but by imagining the scene from the taxi driver’s perspective, you can come up with a possible reason for their sudden stop: maybe an emergency vehicle is approaching—and you should make way, too.

    “While humans can use other cues like sirens to identify this kind of situation, current AI models developed for autonomous driving and other similar tasks only have access to image and language inputs, making imaginative exploration necessary in the absence of other multimodal information,” Chen says.







    Rendering of an AI model making an observation-based decision. Credit: Whiting School of Engineering

    The Hopkins team evaluated the consistency and quality of GenEx’s output against standard video generation benchmarks. The researchers also conducted experiments with human users to determine if and how GenEx could augment their logic and planning abilities and found that users made more accurate and informed decisions when they had access to the model’s exploration capabilities.

    “Our experimental results demonstrate that GenEx can generate high-quality, consistent observations during an extended exploration of a large virtual physical world,” Chen says. “Additionally, beliefs updated with the generated observations can inform an existing decision-making model, such as a large language model agent, and even human users to make better plans.”

    Joined by Tianmin Shu and Daniel Khashabi—both assistant professors of computer science—and undergraduate student TaiMing Lu, Yuille and Chen will incorporate real-world sensor data and dynamic scenes for more realistic, immersive planning scenarios.

    Bloomberg Distinguished Professor of Computer Vision and Artificial Intelligence Rama Chellappa and Cheng Peng, an assistant research professor in the Mathematical Institute for Data Science, will help curate the real-world sensor data.

    The cross-disciplinary project, which involves computer vision, natural language processing, and cognitive science, marks a significant achievement toward achieving humanlike intelligence in embodied AI, Yuille says.

    More information:
    Taiming Lu et al, GenEx: Generating an Explorable World, arXiv (2024). DOI: 10.48550/arxiv.2412.09624

    Journal information:
    arXiv


    Provided by
    Johns Hopkins University


    Citation:
    AI system can envision an entire world from a single picture (2024, December 19)
    retrieved 19 December 2024
    from https://techxplore.com/news/2024-12-ai-envision-entire-world-picture.html

    This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
    part may be reproduced without the written permission. The content is provided for information purposes only.





    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Kavish
    • Website

    Related Posts

    Pokemon TCG Live is Giving Away More Free Packs and Full-Art Cards

    April 12, 2026

    CMF Phone 3 Pro Launch Timeline Tipped: Snapdragon 7s Gen 4 SoC, Larger Battery, FHD+ OLED Display Expected

    April 11, 2026

    8 Strongest Characters Madara Never Fought

    April 11, 2026

    Infinix Note 60 Pro Full Specifications Confirmed Ahead of Launch in India on April 13

    April 11, 2026

    Crimson Desert Guide: Quests, Puzzles, & Tips

    April 11, 2026

    Samsung One UI 8.5 Beta Update Officially Rolling Out to More Galaxy Phones in India: How to Update

    April 11, 2026
    Leave A Reply Cancel Reply

    Top Reviews
    Editors Picks

    Pokemon TCG Live is Giving Away More Free Packs and Full-Art Cards

    April 12, 2026

    CMF Phone 3 Pro Launch Timeline Tipped: Snapdragon 7s Gen 4 SoC, Larger Battery, FHD+ OLED Display Expected

    April 11, 2026

    8 Strongest Characters Madara Never Fought

    April 11, 2026

    Infinix Note 60 Pro Full Specifications Confirmed Ahead of Launch in India on April 13

    April 11, 2026
    About Us
    About Us

    Email Us: info@xarkas.com

    Facebook Pinterest
    © 2026 . Designed by Xarkas Technologies.
    • Home
    • Mobiles
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.

    Ad Blocker Enabled!
    Ad Blocker Enabled!
    Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.