Also, about the line "Agents are not solved yet, they'll come when AGI comes" in your blog post, 100% agree:
When any system reaches 90% on the GAIA benchmark (hard general tasks up to 1 hour long, cc @clefourrier @Thom_Wolf @ThomasScialom), that means that we have really
Discussion on AI Agents and AGI Progress via GAIA Benchmark