AI Dynamics

Global AI News Aggregator

About

Fine-tuning drives record AI performance on SWE-bench and BIRD-SQL

Success stories are already rolling in. Cosine used fine-tuning to set a new record on the SWE-bench benchmark with its AI agent Genie. Distyl topped the BIRD-SQL benchmark with impressive accuracy in SQL tasks.

→ View original post on X — @godofprompt