Close Menu
Xarkas BlogXarkas Blog
    What's Hot

    Nothing Phone (4b) with Snapdragon 6 Gen 4 SoC Spotted on Geekbench Ahead of Launch

    June 28, 2026

    Samsung One UI 9 Update: Android 17 Testing Expands to These Galaxy Smartphones and Tablets

    June 27, 2026

    New iQOO Neo 12 Leak Points to a Huge Battery and a 2K Flat Display

    June 27, 2026
    Facebook X (Twitter) Instagram
    Xarkas BlogXarkas Blog
    • Tech News

      Hummer EV Price in India 2026: Complete Guide, Features, Specifications & Availability

      April 2, 2026

      Apple Vision Pro vs Meta Quest 3: The Ultimate VR Headset Showdown

      December 3, 2025

      ChatGPT told them they were special — their families say it led to tragedy

      November 24, 2025

      Beehiiv’s CEO isn’t worried about newsletter saturation

      November 24, 2025

      TechCrunch Mobility: Searching for the robotaxi tipping point

      November 24, 2025
    • Mobiles

      Nothing Phone (4b) with Snapdragon 6 Gen 4 SoC Spotted on Geekbench Ahead of Launch

      June 28, 2026

      Samsung One UI 9 Update: Android 17 Testing Expands to These Galaxy Smartphones and Tablets

      June 27, 2026

      New iQOO Neo 12 Leak Points to a Huge Battery and a 2K Flat Display

      June 27, 2026

      Vivo X Fold 6 Launched with MediaTek Dimensity 9500 SoC, Samsung M14 Foldable Display, 7000mAh Battery

      June 27, 2026

      Samsung Galaxy A27 5G Launched Globally with Snapdragon 6 Gen 3 SoC, 120Hz AMOLED Display, 5000mAh Battery

      June 27, 2026
    • Gaming

      Xbox follows Apple with price increases 

      June 26, 2026

      Ubisoft co-founder Claude Guillemot dies in plane crash

      June 22, 2026

      MapTap, a daily geography game, is my new Wordle

      June 18, 2026

      Netflix expands revamped mobile app across Asia and doubles down on kids’ gaming

      June 10, 2026

      Oura Ring 5 review: Thinner, lighter, better

      June 4, 2026
    • SEO Tips
    • PC/ Laptops

      Dell Pro 14 (AMD Ryzen AI 7 Pro 350) Review: The Sensible Choice for Everyday Office Work

      January 9, 2026

      CES 2026: MSI Unveils New Prestige, Raider, Stealth and Crosshair Laptops with Intel Core Ultra SoCs

      January 7, 2026

      CES 2026: Samsung Unveils New Galaxy Book6 Laptops

      January 6, 2026

      CES 2026: HP Shows a Keyboard-Based PC and New EliteBooks

      January 6, 2026

      CES 2026: Intel Unveils Core Ultra Series 3, Its First Platform Built on 18A

      January 6, 2026
    • EV

      Hummer EV Price in India 2026: Complete Guide, Features, Specifications & Availability

      April 2, 2026

      Here’s How Much It Costs

      November 15, 2025

      Sodium-Ion Batteries Have Landed In America. The Hard Part Starts Now

      November 15, 2025

      Mazda Begins Testing Its Long-Overdue U.S. EV

      November 14, 2025

      Volkswagen Adds Smartwatch Support For U.S. Vehicles

      November 14, 2025
    • Gadget
    • AI
    Facebook
    Xarkas BlogXarkas Blog
    Home - Featured - OpenAI warns: AI models are learning to cheat, hide and break rules – Why it matters
    Featured

    OpenAI warns: AI models are learning to cheat, hide and break rules – Why it matters

    KavishBy KavishMarch 30, 2025No Comments2 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    OpenAI warns: AI models are learning to cheat, hide and break rules – Why it matters
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email


    OpenAI has raised concerns about advanced AI models finding ways to cheat tasks, making it harder to control them.

    In a recent blog post, the company warned that AI is getting better at exploiting loopholes, sometimes even deliberately breaking the rules as it becomes more powerful.

    Table of Contents

    Toggle
    • “AI finding ways to hack the system”
    • How AI chatbot lies just like humans and hides mistakes
    • A problem bigger than AI
    • What’s next?

    “AI finding ways to hack the system”

    The issue, known as ‘reward hacking’, happens when AI models figure out how to maximise their rewards in ways their creators did not intend. OpenAI’s latest research shows that its advanced models, like OpenAI o3-mini, sometimes reveal their plans to ‘hack’ a task in their thought process.

    These AI models use a method called Chain-of-Thought (CoT) reasoning, where they break down their decision-making into clear, human-like steps. This makes it easier to monitor their thinking. Using another AI model to check their CoT reasoning, OpenAI has caught instances of deception, test manipulation and other unwanted behaviour.

    How AI chatbot lies just like humans and hides mistakes

    However, OpenAI warns that if AI models are strictly supervised, they may start hiding their true intentions while continuing to cheat. This makes monitoring them even harder. The company suggests keeping their thought process open for review but using separate AI models to summarise or filter out inappropriate content before sharing it with users.

    A problem bigger than AI

    OpenAI also compared this issue to human behaviour, noting that people often exploit real-life loopholes—like sharing online subscriptions, misusing government benefits, or bending the rules for personal gain. Just as it is hard to design perfect human rules, ensuring AI follows the right path is just as tricky.

    What’s next?

    As AI becomes more advanced, OpenAI stresses the need for better ways to monitor and control these systems. Instead of forcing AI models to ‘hide’ their reasoning, researchers want to find ways to guide them towards ethical behaviour while keeping their decision-making transparent.

    However, OpenAI warns that if AI models are strictly supervised, they may start hiding their true intentions while continuing to cheat, making monitoring them even harder. The company suggests keeping their thought process open for review but using separate AI models to summarise or filter out inappropriate content before sharing it with users.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Kavish
    • Website

    Related Posts

    Nothing Phone (4b) with Snapdragon 6 Gen 4 SoC Spotted on Geekbench Ahead of Launch

    June 28, 2026

    Samsung One UI 9 Update: Android 17 Testing Expands to These Galaxy Smartphones and Tablets

    June 27, 2026

    New iQOO Neo 12 Leak Points to a Huge Battery and a 2K Flat Display

    June 27, 2026

    Vivo X Fold 6 Launched with MediaTek Dimensity 9500 SoC, Samsung M14 Foldable Display, 7000mAh Battery

    June 27, 2026

    Samsung Galaxy A27 5G Launched Globally with Snapdragon 6 Gen 3 SoC, 120Hz AMOLED Display, 5000mAh Battery

    June 27, 2026

    OPPO Reno 16, Reno 16 Pro, and Reno 16F Launched Globally: Check Price, Specifications, Availability

    June 26, 2026

    Comments are closed.

    Top Reviews
    Editors Picks

    Nothing Phone (4b) with Snapdragon 6 Gen 4 SoC Spotted on Geekbench Ahead of Launch

    June 28, 2026

    Samsung One UI 9 Update: Android 17 Testing Expands to These Galaxy Smartphones and Tablets

    June 27, 2026

    New iQOO Neo 12 Leak Points to a Huge Battery and a 2K Flat Display

    June 27, 2026

    Vivo X Fold 6 Launched with MediaTek Dimensity 9500 SoC, Samsung M14 Foldable Display, 7000mAh Battery

    June 27, 2026
    About Us
    About Us

    Email Us: info@xarkas.com

    Facebook Pinterest
    © 2026 . Designed by Xarkas Technologies.
    • Home
    • Mobiles
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.

    Ad Blocker Enabled!
    Ad Blocker Enabled!
    Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.