AI Dynamics

Global AI News Aggregator

About

32B model with 817 examples beats o1-preview on math reasoning

Less is More for Reasoning (LIMO): a 32B model fine-tuned with 817 examples can beat o1-preview on math reasoning! Do we really need o1's huge RL procedure to see reasoning emerge? It seems not.
Researchers from Shanghai Jiaotong University just demonstrated that carefully

→ View original post on X — @aymericroucher