
Success stories are already rolling in. Cosine used fine-tuning to set a new record on the SWE-bench benchmark with its AI agent Genie. Distyl topped the BIRD-SQL benchmark with impressive accuracy in SQL tasks.

