Are there good closed vs open LLM rankings? Also, are 70B–350B models actually worth it?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
hey,
I’m currently getting enough VRAM to run something in the GLM-5.2 range, but I’m wondering: do we actually have a solid ranking that compares closed-source and open-weight LLMs side by side?
I’ve been trying to find a clear “closed vs open” leaderboard, but most benchmarks feel fragmented or don’t really answer the practical question of what’s actually best to run locally versus what’s only competitive through API models.
Also, are there any open models that feel as impressive for their size as something like GLM-5.2 or Qwen3.6 27B? I might be missing something, but a lot of the 70B–350B range feels kind of… empty? Like the size goes up massively, but the real-world quality jump doesn’t always feel worth the VRAM/complexity.
Maybe I’ve just missed the right models or benchmarks, so I’d love to hear what people are using and what actually feels worth running locally.
[link] [comments]
More from r/LocalLLaMA
-
Been running Qwen3.6-27B through a 3-critic harness. The harness matters more than I thought
Jun 30
-
I Hate Dario Amodei, and everything he stands for.
Jun 29
-
Introducing LongCat-2.0 - , a large-scale MoE language model with 1.6 trillion total parameters and ~48 billion activated per token. This was the stealth model that was on Openrouter under the name 'owl-alpha'.
Jun 29
-
Krea-2-Turbo Image Model - Easy to be fully uncensored, but it can also EDIT Images!
Jun 29
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.