They allege that OpenAI’s ChatGPT and Meta’s LLaMA chatbots were trained on datasets that included their copyrighted books, without their permission. The datasets in question were allegedly obtained from “shadow library” websites like Bibliotik, Library Genesis, and Z-Library. These websites are known for distributing pirated content.

  • harry_h0udini911@lemmy.fmhy.ml
    link
    fedilink
    English
    arrow-up
    8
    ·
    1 year ago

    Ahhh. When I ask the GPT about piracy, there is silence. But GPT itself is running on the data that was scraped illegally. Hypocrisy = 1000

  • wowitsverycool@lemmy.fmhy.ml
    link
    fedilink
    English
    arrow-up
    5
    ·
    1 year ago

    does it bother anyone else that those most fervently working against ai are also always fervently against piracy? not to be an fmhy user or anything but i think when people cheer for these cases against ai, especially pirates, they’re ignoring the way that it’s actually not very good for them in the long run. i’m up for any defense against companies using ai to replace writers in need of work, but going about it in a way that’s so pro intellectual property annoys me.