GLM-5.1 has 40B active parameters per token versus 37B in DeepSeek V3.2, so that can't be it.
Regarding quantization, sure, but quantization general technique that could also be applied similarly to DeepSeek V3.2.
GLM-5.1 Active Parameters Comparison with DeepSeek V3.2
By
–
Leave a Reply