AI Dynamics

Global AI News Aggregator

About

Soohak: A Benchmark for Evaluating Research-level Math in LLMs

Soohak A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

→ View original post on X — @_akhaliq