• t3rmit3@beehaw.org
    64
    ·
    5 months ago

    There is leaked Windows source code online… Is that also freeware for me to train an OS-building model on?

    • RBG
      55
      ·
      5 months ago

      Sorry, you need high quality data for training.

    • smeg@feddit.uk
      English
      31
      ·
      5 months ago

      Seems like we’re all in agreement that all information should be free. I look forward to them open-sourcing every proprietary bit of code they have.

  • 4dpuzzle@beehaw.org
    English
    13
    ·
    5 months ago

    The sort of mental gymnastics and cognitive dissonance that these subhuman C-suite types employ to justify stealing everyone else’s data, while demonizing the sharing of their own data, is just infuriating. If these scumbags were incarcerated for a day every time they showed this hypocrisy, they would all rot in jail for their entire lifetimes, perhaps more.

    • renard_roux@beehaw.org
      1
      ·
      5 months ago

      Lifetime jail + lifetime jail for X generations of offspring, depending on severity?

      Or instead of jailing children at birth, maybe just confiscating X yachts, depending on severity.

  • AutoTL;DR@lemmings.world
    English
    6
    ·
    5 months ago

    🤖 I’m a bot that provides automatic summaries for articles:

    Mustafa Suleyman, the CEO of Microsoft AI, said this week that machine-learning companies can scrape most content published online and use it to train neural networks because it’s essentially “freeware.”

    Shortly afterwards the Center for Investigative Reporting sued OpenAI and its largest investor Microsoft “for using the nonprofit news organization’s content without permission or offering compensation.”

    Also, in 2022, several unidentified developers sued OpenAI and GitHub based on claims that the organizations used publicly posted programming code to train generative models in violation of software licensing terms.

    Most people posting content online as individuals will have compromised their rights in some way by accepting the Terms of Service agreements offered by major social media platforms.

    The fact that OpenAI and others making AI models are striking content deals with major publishers shows that a strong brand, deep pockets, and a legal team can bring large technology operations to the negotiating table.

    People will stop making work available online, they predict, if it just gets used to power AI models that reduce the marginal cost of content creation to zero and deprive creators of the possibility of any reward.


    Saved 74% of original text.

  • jarfil@beehaw.org
    2
    ·
    5 months ago

    Using the term “freeware” is silly, but consider this:

    Is the act of reading or watching something equivalent to making a copy? Freedom of thought is an agreement much older than the 1990s; it has nothing to do with copyright and everything to do with secrecy. If something is made public, then it isn’t secret, so obviously anyone can read or watch it, be it with a wetware neural network or an AI neural network. Making an exact copy is either plagiarism or copyright infringement… but abstracting a style, then applying it to some other data, is “inspiration”.

    Imagine a website with a licensing disclaimer like “you are allowed to read the content, but not to comprehend or express any thoughts based on it”. Nonsense, right?