AI Dynamics

Global AI News Aggregator

About

Deep Research Bench II Evaluates AI Research Agent Quality

How good are our AI research agents at producing genuinely insightful reports? University of Science and Technology of China and Metastone Technology just released "Deep Research Bench II". This new benchmark features 132 complex research tasks across 22 fields, evaluating AI

→ View original post on X — @jiqizhixin,