Our latest update to our Gemini 2.0 Flash Thinking model (available here: https://goo.gle/4jsCqZC) scores 73.3% on AIME (math) & 74.2% on GPQA Diamond (science) benchmarks. Thanks for all your feedback, this represents super fast progress from our first release just this past
Gemini 2.0 Flash Thinking Achieves 73.3% AIME Math Score