2. K2-Think: A 32B-parameter system built on Qwen2.5 that rivals or beats far larger models on hard math by combining long chain-of-thought (CoT) supervised fine-tuning (SFT), reinforcement learning (RL) with verifiable rewards, lightweight test-time scaffolding, and inference optimization.
K2-Think: 32B Model Rivals Larger Models on Math
