AI Dynamics

Global AI News Aggregator

SWE-Lancer: New AI Coding Performance Benchmark Launched

Today we’re launching SWE-Lancer, a new, more realistic benchmark for evaluating the coding performance of AI models. SWE-Lancer includes over 1,400 freelance software engineering tasks from Upwork, together worth $1 million USD in real-world payouts.

→ View original post on X (@openai)
