AI Dynamics

Global AI News Aggregator

MMLU Evaluation Benchmarks Updated with HELM Prompts

Great, thanks for sharing Yao. As mentioned on the leaderboard we are using the MMLU of the Eleuther harness for now but we're in the process of adding all the evals of the MMLU of HELM which is closer to the original prompts soon (actually working on it as we speak this weekd).

→ View original post on X — @thom_wolf,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *