We’ve also begun work on ARC-AGI-3, a new frontier benchmark intended to last for a while, launching in 2026. It completely departs from the earlier format — it tests new capabilities like exploration, goal-setting, and extremely data-efficient skill acquisition. Today, we're
Leave a Reply