Why there is a lack of new 100B-120B models?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
GPT-OSS-120B was the first model of that family, which was followed by GLM-4.5-Air, Nemotron-3-Super, Qwen3.5-122B, Mistral-Small-4-119B. However, all models are at least 3 months old (10 months for GPT-OSS-120B) and all latest releases are either 25B-35B (Gemma4, Qwen3.6) or 200B+ (Step 3.5/3.7 Flash, DeepSeek-V4-Flash, MiniMax-M3, Nemotron-3-Ultra). Did the ~120B MoE family "die" like the 70B/80B one or there will likely be new releases for H2 2026?
[link] [comments]
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.