AI Dynamics

Global AI News Aggregator

Codex-1 Achieves State-of-the-Art Performance on SWE-Bench Verified

The codex-1 model is state of the art on SWE-Bench Verified; we published the numbers on the blog. More importantly, we optimized it to generate code that people actually want to merge, not just code that scores well on benchmarks!

→ View original post on X: @romainhuet
