Our new Agentic leaderboard is now live! I've long wanted a way to quickly know which LLM is best for powering agents. So we've just built a leaderboard with Albert Villanova! This ranks LLMs powering a smolagents CodeAgent on subsets of various benchmarks. GPT-4.5
New Agentic Leaderboard Ranks LLMs for Agents
By
–
