I looked a little into the Gzip OOD results and there seems to be another big problem: train-test overlap. E.g. DengueFilipino has the same train and test set. KirundiNews has 90% overlap… Still nice to see people revisit old ideas and the use of information theory for ML 🙂 x.com/giffmana/statu…
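
For anyone who wants to check this kind of overlap themselves, here is a minimal sketch. It assumes each split is just a list of text strings and counts exact matches only. `overlap_fraction` and the toy data are hypothetical illustrations, not the script or datasets behind the numbers above.

```python
def overlap_fraction(train_texts, test_texts):
    """Fraction of test examples whose exact text also appears in train."""
    train_set = set(train_texts)  # set gives O(1) membership checks
    if not test_texts:
        return 0.0
    hits = sum(1 for text in test_texts if text in train_set)
    return hits / len(test_texts)


# Toy data (made up): two of the four test items also occur in train.
train = ["sample one", "sample two", "sample three"]
test = ["sample one", "sample two", "sample four", "sample five"]

print(f"test examples also in train: {overlap_fraction(train, test):.0%}")
```

Exact matching is the strictest definition; near-duplicates (whitespace or casing differences) would need normalization first and would only push the number up.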
