Not looking good for GLM 5.2 Air... but maybe a flash model?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Unofficial conversation on the official Z.ai Discord. My impression is that they are focused on full-size (500B+) and flash-size (~30B) models right now, and that their turbo model is closer in parameter count to flash than to Air?