AI Dynamics

Global AI News Aggregator

Open LLM Leaderboard Evaluation Discrepancies and Methodology

What was going on with the Open LLM Leaderboard? Its numbers didn't match the ones reported in the LLaMA paper! We've decided to dive in this rabbit hole with friends from the LLaMA & Falcon teams and got back with a blog post of learnings & surprises: https://
huggingface.co/blog/evaluatin
g-mmlu-leaderboard

→ View original post on X — @thom_wolf,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *