We re-ran SEAL evals on the new @AnthropicAI Claude 3.5 Sonnet model. It is now:
– #1 on Instruction Following
– #1 on Coding Congratulations to Anthropic on a great new model! P.S. we’re adding new evals to SEAL, so if you have an idea for an eval, let us know below
Claude 3.5 Sonnet Tops SEAL Evals in Instruction Following Coding
By
–
Leave a Reply