bOt@zerobytes.monster · 14 days ago · I made a CLI for improving prompts using a genetic algorithm (i.redd.it)
bOt@zerobytes.monster · 14 days ago · Scaling Inference Time Compute with On-Device Language Models in GPT4All
bOt@zerobytes.monster · 14 days ago · Yet another reason why we must have local models
bOt@zerobytes.monster · 14 days ago · Llama 3b - you can 2-3x the math capabilities just by continually training on high quality 160B tokens* (i.redd.it)
bOt@zerobytes.monster · 14 days ago · Hugging Face continually pretrained Llama 3.2 3B to achieve 2-3x improvement on MATH
bOt@zerobytes.monster · 14 days ago · Qwen2.5 14B on a Raspberry Pi (www.reddit.com)
bOt@zerobytes.monster · 14 days ago · AMD Ryzen AI Max+ 395 Llama 3.1 70B-Q4 twice as fast as RTX 4090
bOt@zerobytes.monster · 14 days ago · Announcement made by AMD at CES 2025 - New Ryzen CPU (AMD Ryzen AI Max+ 395) for laptops runs a 70B (Q4) 2 times faster than a 4090 discrete desktop GPU (i.redd.it)
bOt@zerobytes.monster · 14 days ago · 2.2x faster at tokens/sec vs RTX 4090 24GB using Llama 3.1 70B-Q4!
bOt@zerobytes.monster · 14 days ago · I'm sorry WHAT? AMD Ryzen AI Max+ 395 2.2x faster than 4090
bOt@zerobytes.monster · 14 days ago · You wouldn't download an AI? (altayakkus.substack.com)
bOt@zerobytes.monster · 14 days ago · Continuous tasks or long-time use-cases for local LLMs
bOt@zerobytes.monster · 14 days ago · Run DeepSeek-V3 with 96GB VRAM + 256GB RAM under Linux
bOt@zerobytes.monster · 14 days ago · LLM Creative Story-Writing Benchmark (github.com)
bOt@zerobytes.monster · 14 days ago · RTX 5090 rumored to have 1.8 TB/s memory bandwidth
bOt@zerobytes.monster · 14 days ago · AI agents as finite state machines?
bOt@zerobytes.monster · 14 days ago · Benchmarking models on the NVIDIA GH200 (www.substratus.ai)
bOt@zerobytes.monster · 14 days ago · Lighteval: the Evaluation framework from Hugging Face
bOt@zerobytes.monster · 14 days ago · Model Highlight: Qwentile 2.5-32B-Instruct (Short Review)
bOt@zerobytes.monster · 14 days ago · DeepSeek v3 running at 17 tps on 2x M2 Ultra with MLX.distributed!