Global AI News Aggregator
You're right, data mixes certainly aren't solved, but the issues with the current RL paradigm go far beyond the data mixes.
→ View original post on X — @jxmnop