• TheFinn
    link
    fedilink
    arrow-up
    4
    arrow-down
    1
    ·
    1 day ago

    I’m having difficulty with getting off the ground with these. Primarily I don’t trust the companies or individuals involved. I’m hoping for open source, local, with a GUI for desktop use and an API for automation.

    What model do you use? And in what kind of framework?

    • wonderingwanderer@sopuli.xyz
      link
      fedilink
      arrow-up
      5
      ·
      1 day ago

      Huggingface lists thousands of open source models. Each one has a page telling you what base model it’s based on, what other models are merged into it, what data its fine-tuned on, etc.

      You can search by number of parameters, you can find quantized versions, you can find datasets to fine-tune your own model on.

      I don’t know about GUI, but I’m sure there are some out there. Definitely options for API too

        • wonderingwanderer@sopuli.xyz
          link
          fedilink
          arrow-up
          2
          ·
          9 hours ago

          Yeah, more people should know about it. There’s really no reason to pay for an API for these giant 200 billion parameter commercial models sucking up intense resources in data centers.

          A quantized 24-32 billion parameter model works just fine, can be self-hosted, and can be fine-tuned on ethically-sourced datasets to suit your specific purposes. Bonus points for running your home lab on solar power.

          Not only are the commercial models trained on stolen data, but they’re so generalized that they’re basically worthless for any specialized purpose. A 12 billion parameter model with Retrieval-Augmented Generation is far less likely to hallucinate.

    • Alloi@lemmy.world
      link
      fedilink
      arrow-up
      6
      ·
      1 day ago

      R1 last i checked seems to be decent enough for a local model. customizable. but that was a while ago. its release temporarily crashed Nvidia stock because they showed how smart software design trumps mass spending on cutting edge hardware.

      at the end of the day its all of our data. we should own the means, especially if we built it by simply existing on the internet. without consent.

      if we wish to do this, its crucial that we do everything in our power to dismantle the “profit” structure and investment hype. sooner or later someone will leak the data, and we will have access to locally run versions we can train ourselves. as long as we dont allow them to monopolize hardware, we can have the brain, and the body of it run local.

      thats the only time it will be remotely ethical to use, unless its the persuit of attaining these goals.