AI Dynamics

Global AI News Aggregator

RL Scaling Progress: Engineering Breakthrough in Model Training

i am GIDDILY following the open progress in RL scaling. notable that it takes such a herculean engineering effort to train a model for 3000 steps, but these results are undeniable, esp with shortened context. also it's only been two months since v1… hard to keep up…

→ View original post on X — @jxmnop