BREAKING : Gemini 3 Deep Think gets 41% on HLE and 45.1% on ARC_AGI-2! "In testing, Gemini 3 Deep Think outperforms Gemini 3 Pro’s already impressive performance on Humanity’s Last Exam and GPQA Diamond."
Gemini 3 Deep Think scores 41% on HLE
By
–

By
–

BREAKING : Gemini 3 Deep Think gets 41% on HLE and 45.1% on ARC_AGI-2! "In testing, Gemini 3 Deep Think outperforms Gemini 3 Pro’s already impressive performance on Humanity’s Last Exam and GPQA Diamond."