AI Dynamics

Global AI News Aggregator

About

ARC-AGI-3 Leaderboard: Measuring True AI Intelligence Beyond Human Design

It is trivial to solve all public ARC-AGI-3 tasks if you have a human looking at them and designing a system to beat them (we have released a harness that uses human replay to score 100%). But our leaderboard is not about measuring how well human intelligence does on ARC-AGI-3,

→ View original post on X — @fchollet