Pumped to announce the brand new open LLM leaderboard. We burned 300 H100s to re-run new evaluations like MMLU-Pro for all major open LLMs! Some learnings:
– Qwen 72B is the king and Chinese open models are dominating overall
– Previous evaluations have become too easy for recent
New Open LLM Leaderboard: Qwen 72B Leads as Chinese Open Models Dominate