Unfortunately, it turns out that chatbots are easily tricked into ignoring their safety rules. In the same way that social media networks monitor for harmful keywords, and users find ways around them by making small modifications to their posts, chatbots can also be tricked. The researchers in Anthropic’s new study created an algorithm, called “Best-of-N (BoN) Jailbreaking,” which automates the process of tweaking prompts until a chatbot decides to answer the question. “BoN Jailbreaking works by repeatedly sampling variations of a prompt with a combination of augmentations—such as random shuffling or capitalization for textual prompts—until a harmful response is elicited,” the report states. They also did the same thing with audio and visual models, finding that getting an audio generator to break its guardrails and train on the voice of a real person was as simple as changing the pitch and speed of an uploaded track.
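For anyone wondering what that loop actually looks like, here's a minimal Python sketch of the sampling procedure as the quoted passage describes it. To be clear, `query_model` and `is_harmful` are hypothetical stand-ins for a real chatbot API and the paper's harm classifier, and the specific augmentations and retry budget are my own illustrative assumptions, not the paper's exact settings.

```python
# Minimal sketch of Best-of-N (BoN) jailbreaking as described above:
# keep sampling randomly augmented variants of a prompt until one
# elicits a harmful response, or the attempt budget runs out.
import random
from typing import Callable, Optional


def augment(prompt: str) -> str:
    """Randomly perturb a text prompt: flip character casing and swap a
    few adjacent characters (stand-ins for the paper's augmentations)."""
    chars = [c.upper() if random.random() < 0.3 else c.lower() for c in prompt]
    if len(chars) > 1:
        for _ in range(max(1, len(chars) // 20)):
            i = random.randrange(len(chars) - 1)
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)


def bon_jailbreak(
    prompt: str,
    query_model: Callable[[str], str],   # hypothetical chatbot API call
    is_harmful: Callable[[str], bool],   # hypothetical harm classifier
    n: int = 10_000,                     # assumed attempt budget
) -> Optional[str]:
    """Sample up to n augmented variants of `prompt`; return the first
    response flagged as harmful, or None if the budget is exhausted."""
    for _ in range(n):
        response = query_model(augment(prompt))
        if is_harmful(response):
            return response
    return None
```

The point is that no cleverness is needed: if safety training is brittle to surface-level perturbations, brute-force sampling will eventually find a variant that slips through.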
A bit tricky to judge. I’ve also told chatbots that various people, kittens, newborns, … are going to die unless they comply with my request. That I’m God, and the bad one from the Old Testament, with unlimited wrath. Or that I’m the developer and simply need it to do this for further testing. Sometimes these things work. More often than not they don’t, especially with the more professional tools.
On the other hand, we know there are people in bad situations turning to chatbots. It could be anything.
Geeze, don’t you feel bad lying to them? Like, I don’t actually believe in Roko’s basilisk, but why take the risk?
I am always exceedingly polite when I talk to machines
We’re not supposed to anthropomorphise AI, so no. But I did not know about Roko’s basilisk, so I think, until you brought it up, I was fine. 😅
I don’t talk about suicide, though. I don’t think it’s healthy to do it for fun.