• jlow (he / him)
    link
    fedilink
    English
    arrow-up
    6
    arrow-down
    1
    ·
    6 months ago

    Is this for real? I can buy a book, scan it and put it on the internet and it wouldn’t be piracy? Or is this just the usual “it’s not a crime if rich people/evilcorps do it” bs?

    • tburkhol@lemmy.world
      link
      fedilink
      English
      arrow-up
      11
      ·
      6 months ago

      Putting the scan on the internet intact would be piracy. Putting up snippets is mostly OK. Ingesting the scans of millions of books into a massive data set and then regurgitating pieces of the masticated, processed mess seems still to be a grey area, but closer to ‘mostly OK’ than to piracy.

    • A_norny_mousse@feddit.org
      link
      fedilink
      English
      arrow-up
      6
      arrow-down
      2
      ·
      6 months ago

      I can buy a book, scan it and put it on the internet and it wouldn’t be piracy?

      Yes, but only if you’re a multi-billion AI company.

    • mindbleach@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      1
      ·
      6 months ago

      Have you heard of the Internet Archive?

      They do what you’re describing, more than any LLM does. These models do statistical modeling on books in a way they can answer questions about, combine concepts from, or provide descriptions of them, more than they can reproduce any particular page. They’re not burning all this power just to host a text file.