🇦🇺𝕄𝕦𝕟𝕥𝕖𝕕𝕔𝕣𝕠𝕔𝕕𝕚𝕝𝕖@lemm.ee to

LocalLLaMA@sh.itjust.worksEnglish · 1 month ago

How much gpu do i need to run a 90b model

12

How much gpu do i need to run a 90b model

🇦🇺𝕄𝕦𝕟𝕥𝕖𝕕𝕔𝕣𝕠𝕔𝕕𝕚𝕝𝕖@lemm.ee to

LocalLLaMA@sh.itjust.worksEnglish · 1 month ago

Do i need industry grade gpu’s or can i scrape by getring decent tps with a consumer level gpu.

Chat

🇦🇺𝕄𝕦𝕟𝕥𝕖𝕕𝕔𝕣𝕠𝕔𝕕𝕚𝕝𝕖@lemm.eeOP
link
fedilink
English
arrow-up
1·
1 month ago
That looks like exactly the sort of thing i want. Any existing solution to get it to behave like an ollama instance (i have a bunch of services pointed at an ollama run on docker).
- Sylovik@lemmy.world
  link
  fedilink
  English
  arrow-up
  2·
  1 month ago
  You may try Harbor. The description claims to provide an OpenAI-compatible API.

LocalLLaMA@sh.itjust.works

localllama@sh.itjust.works

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !localllama@sh.itjust.works

Community to discuss about LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

48 users / day
63 users / week
254 users / month
422 users / 6 months
52 local subscribers
2.55K subscribers
250 Posts
1.04K Comments
Modlog