r/LocalLLaMA · · 1 min read

GLM-5.2 can now run locally in llama.cpp and Unsloth Studio.

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

GLM-5.2 can now run locally in llama.cpp and Unsloth Studio.

The 2-bit model retains ~82% accuracy after we shrunk it from 1.51TB to 238GB (-84% size).

Run on a 256GB Mac or RAM/VRAM setups.

GLM-5.2 is the strongest open model to date.

Check the graph for the accuracy of each GLM-5.2-GGUF quantization.

Full guide: https://unsloth.ai/docs/models/glm-5.2

GGUF: https://huggingface.co/unsloth/GLM-5.2-GGUF

submitted by /u/beasthunterr69
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA