What was going on with the Open LLM Leaderboard? Its numbers didn't match the ones reported in the LLaMA paper! We've decided to dive in this rabbit hole with friends from the LLaMA & Falcon teams and got back with a blog post of learnings & surprises: https://
huggingface.co/blog/evaluatin
g-mmlu-leaderboard
…
Open LLM Leaderboard Evaluation Discrepancies and Methodology
By
–
Leave a Reply