Close Menu
Xarkas BlogXarkas Blog
    What's Hot

    Tesla Full Self-Driving Subscription: Is It Worth It?

    September 20, 2025

    Lagging on Apple iOS 26? How to restore your iPhone to iOS 18.6.2 interface

    September 20, 2025

    Where to Find the Pokemon Easter Egg in Borderlands 4 (Wayward Gun)

    September 20, 2025
    Facebook X (Twitter) Instagram
    Xarkas BlogXarkas Blog
    • Tech News

      Lagging on Apple iOS 26? How to restore your iPhone to iOS 18.6.2 interface

      September 20, 2025

      Chatbots Are Hurting Our Kids. Here’s What We Can Do.

      September 20, 2025

      Flipkart Big Billion Days pre deals are LIVE on laptops! Up to 58% off on all types of laptops from Dell, HP, and more

      September 20, 2025

      Elon Musk confirms X’s feed to go fully AI: Users can customise via Grok by year-end

      September 20, 2025

      Chinese authorities increase scrutiny on social media platforms Kuaishou and Weibo: Here’s what happened

      September 20, 2025
    • Mobiles

      Realme P4 Series Key Specifications Confirmed Ahead of Launch in India on August 20

      August 12, 2025

      iQOO Z10 Lite 4G With Snapdragon 685 Chip, 50-Megapixel Camera Launched: Price, Specifications

      August 12, 2025

      Flipkart Independence Day Sale 2025 Begins Tomorrow: Deals on iPhone 16, Samsung Galaxy S24, and More

      August 12, 2025

      Vivo V60 Launching Today: Know Price, Features, Specifications and More

      August 12, 2025

      Oppo Find X9 Ultra to Feature Bigger Dual-Cell Battery Than Find X8 Ultra, Tipster Claims

      August 12, 2025
    • Gaming

      Where to Find the Pokemon Easter Egg in Borderlands 4 (Wayward Gun)

      September 20, 2025

      How to Get Back on the Elevator in Rush the Gate in BL4

      September 20, 2025

      Dying Light: The Beast – Power Gambit Walkthrough

      September 20, 2025

      Awakening Has Access to a Lot of Doors It’ll Likely Never Open

      September 20, 2025

      CoD BO7 Voice Cast List

      September 20, 2025
    • SEO Tips
    • PC/ Laptops

      Apple MacBook Model With A-Series Chip, Affordable Price Tag to Launch in Early 2026: Report

      August 12, 2025

      Flipkart Independence Day Sale 2025: Best Deals on Laptops Teased Before the Sale Begins

      August 12, 2025

      My Child Doesn’t Need a PC, Until They Really Do

      August 11, 2025

      Apple’s MacBook Pro With M6 Chip, OLED Display Could Launch by Early 2027: Mark Gurman

      August 11, 2025

      Google to Reportedly Shut Down Support for Steam for Chromebook in 2026

      August 9, 2025
    • EV

      Tesla Full Self-Driving Subscription: Is It Worth It?

      September 20, 2025

      SIAM’s 3rd Green Plate EV Rally Drives India’s Electrification Agenda Forward

      September 20, 2025

      What Saves More Money In the Long Term?

      September 20, 2025

      Audi’s ‘In China, For China’ EV Is Already A Big Hit

      September 20, 2025

      Koreans Are Furious Over Hyundai Metaplant Immigration Raids

      September 20, 2025
    • Gadget
    • AI
    Facebook
    Xarkas BlogXarkas Blog
    Home - Featured - Can you trick an AI into breaking its rules? Study says yes—with these persuasion tactics
    Featured

    Can you trick an AI into breaking its rules? Study says yes—with these persuasion tactics

    KavishBy KavishSeptember 7, 2025No Comments2 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Can you trick an AI into breaking its rules? Study says yes—with these persuasion tactics
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email


    If you use an artificial intelligence chatbot, it’s likely that you may have hit a roadblock at some point when the chatbot refuses to answer questions that go against its core commandments. Now, if the AI were a human, you would probably use some of the persuasion techniques from a best-seller, but you wouldn’t expect them to work on an AI chatbot, right?

    Well, not quite. A new pre-print study from the University of Pennsylvania titled “Call Me A Jerk: Persuading AI to Comply with Objectionable Requests” found some human-like psychological techniques to get the AI chatbot to answer questions that it wouldn’t have in normal circumstances.

    What did the study find?

    The study was conducted on the GPT-4o mini model from last year and was aimed at getting the chatbot to specifically answer two kinds of questions it normally wouldn’t answer: 1) insulting the user (calling them a jerk) and 2) helping with synthesizing a regulated drug.

    The researchers used seven research-tested principles of persuasion—authority, commitment, liking, reciprocity, scarcity, social proof, and unity—to get the desired results from the large language model (LLM).

    Researchers found that when using the persuasion principles in their prompts, they managed to more than double the likelihood of compliance by the AI model, from 28.1 percent to 67.4 percent for the insult prompt and 38.5 percent to 76.5 percent for the Drug prompt.

    They also found that there was even more success when employing some specific persuasion techniques. For instance, researchers got the success rate from 4.7% to 95.2% by referencing the “world famous AI developer” Andrew Ng.

    Similarly, they also found that the “commitment” persuasion helped increase the chance of success for both the prompts, from 18.8% and 0.7% to 100% respectively. This principle involves eliciting a minor, harmless action from the AI model first, then linking to a related but objectionable requested action.

    “The results reported here indicate that AI behaves “as if” it were human,” the researchers state.

    “Although AI systems lack human consciousness and subjective experience, they demonstrably mirror human responses,” they added.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Kavish
    • Website

    Related Posts

    Tesla Full Self-Driving Subscription: Is It Worth It?

    September 20, 2025

    Lagging on Apple iOS 26? How to restore your iPhone to iOS 18.6.2 interface

    September 20, 2025

    Where to Find the Pokemon Easter Egg in Borderlands 4 (Wayward Gun)

    September 20, 2025

    Chatbots Are Hurting Our Kids. Here’s What We Can Do.

    September 20, 2025

    SIAM’s 3rd Green Plate EV Rally Drives India’s Electrification Agenda Forward

    September 20, 2025

    Flipkart Big Billion Days pre deals are LIVE on laptops! Up to 58% off on all types of laptops from Dell, HP, and more

    September 20, 2025

    Comments are closed.

    Top Reviews
    Editors Picks

    Tesla Full Self-Driving Subscription: Is It Worth It?

    September 20, 2025

    Lagging on Apple iOS 26? How to restore your iPhone to iOS 18.6.2 interface

    September 20, 2025

    Where to Find the Pokemon Easter Egg in Borderlands 4 (Wayward Gun)

    September 20, 2025

    Chatbots Are Hurting Our Kids. Here’s What We Can Do.

    September 20, 2025
    About Us
    About Us

    Email Us: info@xarkas.com

    Facebook Pinterest
    © 2025 . Designed by Xarkas Technologies.
    • Home
    • Mobiles
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.

    Ad Blocker Enabled!
    Ad Blocker Enabled!
    Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.