AI Dynamics

Global AI News Aggregator

Claude 2.5 Pro Achieves New SOTA Across AI Benchmarks

2.5 Pro sets new SOTA capabilities across benchmarks, including: — 18.8% on Humanity's Last Exam — 63.8% on SWE-Bench Verified (agentic coding)
— GPQA Diamond and AIME 2025 (STEM)
— Long-context and visual reasoning

→ View original post on X — @rowancheung,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *