I know we can’t do this with any copyrighted materials. But a lot of books, music, art, knowledge is in the creative commons. Is it possible to create one massive torrent that includes all that can be legally included and then have people only download what they actually want to enjoy?

  • rufus
    link
    fedilink
    arrow-up
    2
    ·
    1 year ago

    “All” is impossible. You’re going to miss something. And it’s a lot of work. Maybe have a look at the datasets people/researchers use to train Artificial Intelligence. I think some people put in the effort to compile large datasets with just freely licensed data.

    • AnarchistsForDemocracy@lemmy.worldOP
      link
      fedilink
      arrow-up
      2
      ·
      1 year ago

      it’s a lot of work

      so per your suggestion using for example the zlibrary book/paper repo and training sets of openai as starting point one could maybe get around the brunt of the work.