• Tony Bark@pawb.social
    English
    arrow-up
    14
    arrow-down
    11
    ·
    9 months ago

    While I appreciate them going a greener route, if these chat AIs are still this inefficient to simply train, maybe it is best to return them to the research phase.

    • fleabs@lemmy.world
      English
      arrow-up
      20
      ·
      edit-2
      9 months ago

      You say “simply train,” but really, the training of these models is the most intensive part. Once they are trained, they require (relatively) less power to actually run for inference.

      • Corkyskog@sh.itjust.works
        English
        arrow-up
        1
        arrow-down
        1
        ·
        9 months ago

        So it sounds like they need a shitload of GPU power. You know what else costs a shitload of GPU power? Crypto mining. Could they not outsource the work to all those GPUs that stopped mining crypto once it plummeted?

        I am surprised this hasn’t become a community project already. I assume there is some limitation that I am unaware of.

        • pivot_root@lemmy.world
          English
          arrow-up
          3
          ·
          9 months ago

          The limitation is intellectual property. You need the model to train it, and no for-profit company is going to just give that away.

          • Corkyskog@sh.itjust.works
            English
            arrow-up
            1
            ·
            9 months ago

            But they (MS) are planning on doing it either way, so why not crowdsource it and even pay a pittance for the GPU power? I think it would be popular… there are a lot of sad people with extra GPUs sitting around not being used for much.

    • Fidelity9373@artemis.camp
      arrow-up
      5
      arrow-down
      1
      ·
      9 months ago

      There are tradeoffs. If training LLMs (and similar systems that feed on pure physics data) can improve nuclear processes, then overall it could be a net benefit. Fusion energy research takes a huge amount of power to trigger every test ignition, and we do them all the time, learning little by little.

      The real question is whether the LLMs are even capable of revealing those kinds of insights to us. If they are, nuclear is hardly the worst path to go down.