AI Dynamics

Global AI News Aggregator

Llama 3.1 405B Achieves Top Performance on SEAL Leaderboard

We evaluated Llama3.1 405B Instruct on SEAL Leaderboard. As a reminder, our evals are:
PRIVATE (no overfitting)
EXPERT EVALUATED (trustworthy)
EVOLVE (no saturation) Our results show Llama3.1 is top notch:
Instruction Following
Math
#4 Coding

→ View original post on X — @alexandr_wang,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *