AI Dynamics

Global AI News Aggregator

New Coding Benchmark Released, But Uses Older Sonnet Version

Cool new coding benchmark! I always love to see new evals out in the world. Note though that the testing here is on the June version of Sonnet, not the latest version, so technically not "current frontier models."

→ View original post on X — @alexalbert__,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *