r/LocalLLaMA · June 19, 2026 · 1 min read

GLM-5.2 can now run locally in llama.cpp and Unsloth Studio.

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

The 2-bit model retains ~82% accuracy after we shrunk it from 1.51TB to 238GB (-84% size).

Run on a 256GB Mac or RAM/VRAM setups.

GLM-5.2 is the strongest open model to date.

Check the graph for the accuracy of each GLM-5.2-GGUF quantization.

Discussion (0)

No comments yet. Sign in and be the first to say something.