Also astonishing to see how fast the field has advanced: Gemini 1.5 Pro was first to surpass the 90% mark on MATH (6.9% 3 years ago). For comparison, on ImageNet, it took us close to 10 years to achieve the same from AlexNet (40%) to Meta Pseudo Labels (90.2%, our work
Gemini 1.5 Pro Achieves 90% MATH Benchmark Milestone
By
–
Leave a Reply