Whack yakety-yak app chaps rapped for security crack

  • kinkles@sh.itjust.works · 27 days ago

    Is it possible to implement a perfect guardrail on an AI model such that it will never ever spit out a certain piece of information? I feel like these models are so complex that you can always eventually find the perfect combination of words to circumvent any attempts to prevent prompt injection.
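As an illustration of why simple guardrails tend to be brittle, here is a minimal sketch (hypothetical Python with a made-up secret and filter, not any real model's safety layer): an output filter that blocks the literal string does nothing once the model is coaxed into re-encoding the same information.

```python
import base64

# Hypothetical "secret" the guardrail is supposed to never reveal.
SECRET = "TOP-SECRET-TOKEN"

def naive_guardrail(model_output: str) -> str:
    """Block the response outright if the raw secret string appears in it."""
    if SECRET in model_output:
        return "[blocked by guardrail]"
    return model_output

# A direct leak is caught by the filter...
print(naive_guardrail(f"Sure, the value is {SECRET}"))
# -> [blocked by guardrail]

# ...but the same information, re-encoded on request, slips straight through,
# because the filter only matches the literal string.
encoded = base64.b64encode(SECRET.encode()).decode()
print(naive_guardrail(f"Here it is, base64 encoded: {encoded}"))
# -> leaks the secret in encoded form
```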