Leaderboards are *a thing* in *every industry.* I can tell you as a journalist we all have an obsession with leaderboards. But we'll probably hear a lot more going forward about how Model A beats Model B by 0.1 points in this eval model, so A is obviously obsolete.
Leaderboard obsession: Why 0.1 point differences mislead AI evaluation
By
–
Leave a Reply