How good are our AI research agents at producing genuinely insightful reports? University of Science and Technology of China and Metastone Technology just released "Deep Research Bench II". This new benchmark features 132 complex research tasks across 22 fields, evaluating AI
Deep Research Bench II Evaluates AI Research Agent Quality
By
–
