We integrated the latest Sonnet 3.5 model into our codebase and already saw improvements — some of hard evals started passing. Going to start experimenting with the computer use APIs too!
Sonnet 3.5 Integration Improves Hard Evals Performance
By
–
By
–
We integrated the latest Sonnet 3.5 model into our codebase and already saw improvements — some of hard evals started passing. Going to start experimenting with the computer use APIs too!