AA comparison of the latest local models
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| I picked models I consider local (usable on 3×3090), so there are no 300B models, and you should probably skip 200B models too (but MiniMax and Step are pretty fast in Q3) Gemma-4 12B is still missing [link] [comments] |
More from r/LocalLLaMA
-
DeepSeek V4 Flash is amazing! (WIP llama.cpp PR #24162)
Jun 6
-
A quick Gemma4 31B comparison (Q4_k_M, QAT, heretic)
Jun 6
-
Github Copilot finally supporting custom endpoints
Jun 6
-
OpenLumara - A different kind of AI agent, written from scratch, not vibecoded. Extremely token-efficient, super small system prompt, made for local models. Everything is modular.
Jun 5
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.