AI Dynamics

Global AI News Aggregator

About

UL2 Model Training Dataset C4 Quality Assessment

ul2 uses c4 only. c4 alone is not the best due to lack of diversity but it's pretty strong. the c4 dataset is quite good imo.

→ View original post on X — @yitayml