AI Dynamics

Global AI News Aggregator

Critique of the LMArena leaderboard: the illusion of model performance

There's a new paper circulating that looks in detail at the LMArena leaderboard: "The Leaderboard Illusion" https://arxiv.org/abs/2504.20879. I first became a bit suspicious when, at one point a while back, a Gemini model scored #1, way above the second best, but when I tried to switch for a few

→ View original post on X (@karpathy)
