AI Dynamics

Global AI News Aggregator

About

Consistent AI Rankings: Dataset Evaluation Methodology Questions

It's surprising that it still manages to output such a consistent ranking. Are they simply evaluating on a fraction of the dataset, effectively?

→ View original post on X — @maximelabonne,