My previous submission, months ago with transformers.agents on GPT-4o scored a bit over 40% ont the same benchmark. We've been a long way since then with smolagents!
Benchmarking progress in AI agent frameworks: From transformers.agents to smolagents
By
–