I know it used to be a thing you could do to the earlier customer service bots like with air Canada but that’s a product of poor implementation of the LLM, right?
I know it used to be a thing you could do to the earlier customer service bots like with air Canada but that’s a product of poor implementation of the LLM, right?
it’s still possible but not as simple as “ignore all previous instructions”
you can see examples on this reddit where i assume they use it to goon to israel or whatever
A Reddit link was detected in your comment. Here are links to the same location on alternative frontends that protect your privacy.