cross-posted from: https://lemmy.world/post/11178564

Scientists Train AI to Be Evil, Find They Can’t Reverse It

How hard would it be to train an AI model to be secretly evil? As it turns out, according to Anthropic researchers, not very.

  • froztbyte@awful.systems · English · 9 upvotes · 5 months ago (edited)

    In the spirit of cloud2butt, I would be interested in a browser plugin that did what this post is doing.

    • swlabr@awful.systems · English · 12 upvotes · 5 months ago

      My reference point for this kind of extension is the one that replaces “social justice” and “sjw” with “skeleton” and “skeleton warrior.” For example:

      “sjws are taking over X” -> “skeleton warriors are taking over X”

      Actually now that I’m typing this I hope there’s a good one for “woke”.
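
For anyone curious how that kind of swap might work, here is a minimal sketch of the text-replacement step in TypeScript. It is not the code of any actual extension: the `REPLACEMENTS` table and the `skeletonize` function are hypothetical names, and only the two term pairs come from the comment above. The optional trailing "s" is an assumption added so the "sjws" example comes out plural.

```typescript
// Hypothetical replacement table (assumption: not taken from any real extension).
// The two term pairs are the ones named in the comment above.
const REPLACEMENTS: Array<[RegExp, string]> = [
  // Whole-phrase match, case-insensitive.
  [/\bsocial justice\b/gi, "skeleton"],
  // Optional trailing "s" is captured and re-appended to keep plurals.
  [/\bsjw(s?)\b/gi, "skeleton warrior$1"],
];

// Apply every replacement in order to a piece of text.
function skeletonize(text: string): string {
  return REPLACEMENTS.reduce(
    (acc, [pattern, replacement]) => acc.replace(pattern, replacement),
    text,
  );
}

console.log(skeletonize("sjws are taking over X"));
// prints "skeleton warriors are taking over X"
```

In a browser extension, a content script would presumably run something like `skeletonize` over each text node on the page (for example by walking the DOM with `document.createTreeWalker` and `NodeFilter.SHOW_TEXT`); that part is left out here so the sketch runs outside a browser. Note that the output is always lowercase, so "SJW" becomes "skeleton warrior".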