AI Dynamics

Global AI News Aggregator

GLM-5.1 Active Parameters Comparison with DeepSeek V3.2

GLM-5.1 has 40B active parameters per token versus 37B in DeepSeek V3.2, so that can't be it.
Regarding quantization, sure, but quantization general technique that could also be applied similarly to DeepSeek V3.2.

→ View original post on X — @rasbt,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *