AI Dynamics

Global AI News Aggregator

About

Critique of Gzip OOD Evaluation and Data Leakage in ML

I looked a little into the Gzip OOD results and there seems to be another big problem: train-test overlap. E.g. DengueFilipino has the same train and test set. KirundiNews has 90% overlap… Still nice to see people revisit old ideas and the use of information theory for ML 🙂 x.com/giffmana/statu…

→ View original post on X — @dfintelligence