AI Dynamics

Global AI News Aggregator

About

DeepMind solves 48% of a complex math benchmark

Google DeepMind's AI co-mathematician just scored 48% on FrontierMath Tier 4, a new high on a benchmark of 50 research-level math problems some professors expected AI wouldn't touch for decades. The system generated a proof so flawed its own reviewer flagged it as wrong. But

→ View original post on X — @therundownai