AI Dynamics

Global AI News Aggregator

About

BrowseComp: New Benchmark for Testing AI Browsing Agents

We’re open-sourcing BrowseComp (“Browsing Competition”), a new, challenging benchmark designed to test how well AI agents can browse the internet to find hard-to-locate information. It’s like an online scavenger hunt…but for browsing agents.

→ View original post on X — @openai