How we tricked AI chatbots into creating misinformation, despite ‘safety’ measures

When you ask ChatGPT or other AI assistants to help create misinformation, they typically refuse, with responses like “I cannot assist with creating false information.” But our tests show these safety measures are surprisingly shallow—often just a few words deep—making them alarmingly easy to circumvent.

This post was originally published on this site

Skip The Dishes Referral Code

KeyLegal.ca - Consult a Lawyer Online in a variety of legal subjects