All of the above makes us excited to see what developers build with the new Claude 3.5 Sonnet. There's still lots of room for improvement on this scaffolding and no model has crossed 50% yet on this benchmark.
Claude 3.5 Sonnet: Developer Potential and Benchmark Improvements
By
–
Leave a Reply