AI Dynamics

Global AI News Aggregator

About

New Agentic Leaderboard Ranks LLMs for Agents

Our new Agentic leaderboard is now live! I've long wanted a way to quickly know which LLM is best for powering agents. So we've just built a leaderboard with Albert Villanova! This ranks LLMs powering a smolagents CodeAgent on subsets of various benchmarks. GPT-4.5

→ View original post on X — @aymericroucher