Looking for a low-end setup

Rez@sh.itjust.works · 1 year ago

Looking for a low-end setup

olicvb@lemmy.ca · edit-2 1 year ago

Take a look at GPT4All, very user friendly

rufus · edit-2 1 year ago

I like KoboldCpp. It is easy to set up and runs well with little resources.

With something like that, you should be able to fit a much larger and better model into your RAM. If you use the quantized versions. Look for models in GGUF format on Huggingface. Q4_K_M is a good compromise between size and quality.

Which model depends on your exact use-case. I like Mythomax-L2-13b or Llama2-13B-Tiefighter for roleplay, Mistral 7B (Dolphin 2.1 Mistral 7B) or Toppy-M for more factual things. All of those are uncensored.

rufus · 1 year ago

Hope you had some success. Don’t hesitate to ask if you have further questions.

Sims@lemmy.ml · 1 year ago

As an alternative you could look at distributed/shared inferencing. There’s https://horde.koboldai.net/ (which you probably know), and petals.dev

I haven’t tested tho…