AI Dynamics

Global AI News Aggregator

About

Dataset Diversity Requirements for Language Model Scaling

Isn't it also related to diversity in your dataset? 2B = not diverse enough to scale, 10B = maybe okay?

→ View original post on X — @maximelabonne,