AI Dynamics

Global AI News Aggregator

About

Evaluating Gemini 2.5 Flash on ARC-AGI benchmarks

I've tested if nano-banana / Gemini-2.5-flash-image beat ARC-AGI – it's quite far. Btw bravo to the ARC_AGI team, the delta between easiness of problems for humans vs difficulty for LLMs is just

→ View original post on X — @aymericroucher