ul2 uses c4 only. c4 alone is not the best due to lack of diversity but it's pretty strong. the c4 dataset is quite good imo.
UL2 Model Training Dataset C4 Quality Assessment
By
–
Global AI News Aggregator
By
–
ul2 uses c4 only. c4 alone is not the best due to lack of diversity but it's pretty strong. the c4 dataset is quite good imo.
Leave a Reply