I've tested if nano-banana / Gemini-2.5-flash-image beat ARC-AGI – it's quite far. Btw bravo to the ARC_AGI team, the delta between easiness of problems for humans vs difficulty for LLMs is just
By
–

I've tested if nano-banana / Gemini-2.5-flash-image beat ARC-AGI – it's quite far. Btw bravo to the ARC_AGI team, the delta between easiness of problems for humans vs difficulty for LLMs is just