New benchmark for deep research agents! An agent that is creative and persistent should be able to find any piece of information on the open web, even if it requires browsing hundreds of webpages. Models that exercise this ability are like a frictionless interface to the
New Benchmark for Deep Research Agents and Web Browsing
By
–
Leave a Reply