For edge applications (it's a 1B model!), this is not enough. We need more concise traces. With a 4k token budget, the GRPO (LFM-1.3B-Math) model achieves best-in-class performance. Note that it only cost us a few points on average at the 32k token setting.
GRPO 1.3B Model Achieves Best Performance Edge Applications
By
–
