LocalLLaMA post tier list
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Since there is much (justified) whining about post quality, I thought it would be helpful to get a sense of what people actually DO like. Here's my take:
S-tier:
-GGUFs/MLX or benchmark data for new best-in-class local model released
- New Optimizations that are actually a big deal for most people (e.g. MTP)
- Hardware capability posts that include both prefill and decode t/s and specify engine, quant, and context size.
- weird stuff like that robot in the suitcase
A Tier:
-New optimizations that are real but only help a minority of people or aren't yet ready for primetime (e.g. turbo quant)
-Memes making fun of closed-source AI
-New harnesses or agents or major updates, e.g. opencode can now do ________ new thing and this is why it is helpful/how to take advantage of it
-Research that affects the industry overall and is supplied with actual reasonable analysis;
- In-depth model capability comparisons across a broad range of tasks or benchmarks, that haven't already been done 1000x (i.e. not qwen or gemma)
B tier:
-Non-ai generated reports of specific use cases where certain models did well.
-Posts sharing new builds that include price and model fitting capability, but are sparse on actual performance
-Memes making fun of local ai (feel free to also post in a sub I am trying to get going r/localaicirclejerk)
C Tier:
- memes whining about Sam Altman or Dario or Elon
- Stories about Cloud AI models that don't have anything to do with local AI
- "what's the best model I can run on a 3060?"
- Posts that make macs look like perfect at home data centers
- Posts that make macs look like garbage that don't work for "AgEnTic CodiNg" which apparently always requires a fresh prefill of 50k+ tokens every single call.
D tier:
-random "strawberry" or "car wash" type benchmark that we've all seen 500 fucking times; "look Qwen thinks it's Claude." "Look, Qwen thinks it's still 2024! I knew local AI was garbage!"
-"Is local AI good? How does it compare to Claude Opus 4.8 for me asking random questions about nothing or generating power ranger erotic fanfiction?"
-AI generated post alleging some improvements in workflows or optimizations, but where it's difficult to tell if there is any actual information or it's just pure slop
F tier:
-AI generated shitpost asking stupid questions to gain karma, usually full of "it's not x, it's y" often disguised, poorly, by instructing model not to capitalize letters at beginnings of sentences
-thinly veiled ads for AI startup that is a claude wrapper
[link] [comments]
More from r/LocalLLaMA
-
Was BitNet a dead end? What happened to ternary LLMs?
Jun 8
-
When every other post is an AI generated benchmark report, a question about the best model, or a slop-coded application or engine that pretends to be groundbreaking
Jun 8
-
Friends from the localllama community, if you love local llm, don't participate in the IPO (spaceX, OpenAI, Anthropic)
Jun 8
-
An Implementation of NanoQuant: A flexible binary quantization method
Jun 8
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.