AI Dynamics

Global AI News Aggregator

Claude 3.5 Sonnet Achieves 49% on SWE-bench Verified

Lots of folks have asked how we achieved 49% on SWE-bench Verified with the new Claude 3.5 Sonnet, beating the previous SOTA of 45%. Here's how we did it:

→ View original post on X — @alexalbert__,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *