Our team tested single- vs multi-agent setups using GPT-5, GPT-5-mini, and GPT-5-nano, using 10K+ tools across 30 domains. In the single-agent setup, GPT-5-mini and GPT-5-nano performance degrades with longer context and more reasoning, while GPT-5 remains remarkably consistent.
GPT-5 Outperforms Mini and Nano in Multi-Agent Tool Tasks
By
–
