• hydroptic@sopuli.xyz
    English
    arrow-up
    26
    ·
    3 days ago

    Can’t really have an open federated protocol if you want to be able to prevent scraping 🤷

      • hydroptic@sopuli.xyz
        English
        arrow-up
        1
        ·
        edit-2
        2 days ago

        Weelll, depends on what you mean by “joining your own server.” The model is definitely different from ActivityPub’s, but lots of people on the network already run their own PDSs, or “personal data servers”. They do still go through Bluesky’s relay, but nothing is really stopping people from running their own relays. It’s just fairly costly at the moment (as in some hundreds of dollars per month), since a relay needs to hold the full state and history of the network (but apparently that’s being worked on).

        But it’s definitely a federated protocol, even though it’s different from AP.

  • Lvxferre@mander.xyz
    English
    arrow-up
    5
    ·
    edit-2
    3 days ago

    This isn’t exactly surprising, is it? Odds are that Lemmy is also being scraped the shit out of.

    And it would be fine if it was a bunch of nobodies training their homebrew small language models, for the sake of whatever.

    Except that it isn’t: it’s a bunch of big-arse companies with a “NEED MOAR DATAS!” approach and more than enough money to bake the already too-warm planet, and they struggle with the fact that those “things” called “humans” care about consent. “This thing didn’t opt out, so training on its data is fair game!” All of that just to shove the tech back down the thing’s throat, in the hopes that this eventually makes the tech profitable.

    …I guess that my point is that this should be handled legally, not through closing down the protocol. The issue is not people scraping it, but who does it, and why.

    • buddascrayon@lemmy.world
      English
      arrow-up
      2
      ·
      2 days ago

      Anyone who doesn’t believe that literally everything and anything they post on the internet is being scraped for LLMs is an idiot.