GLM-5.2 can now run locally in llama.cpp and Unsloth Studio.
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| The 2-bit model retains ~82% accuracy after we shrunk it from 1.51TB to 238GB (-84% size). Run on a 256GB Mac or RAM/VRAM setups. GLM-5.2 is the strongest open model to date. Check the graph for the accuracy of each GLM-5.2-GGUF quantization. Full guide: https://unsloth.ai/docs/models/glm-5.2 [link] [comments] |
More from r/LocalLLaMA
-
Researchers trained a Deep Research agent with 32 H100s and open-sourced everything
Jun 19
-
[NEW MODEL] SupraLabs just released SupraVL-Nano-900k, a Vision-Language Model built entirely from scratch!
Jun 19
-
SETI @ Home aka distributed LLM inference engine. Does this exist and if not, should we make one?
Jun 19
-
GLM-5.2 is above GPT-5.5 in AA-Briefcase, Artificial Analysis' new agentic knowledge work eval
Jun 19
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.