GPT-4.5 Preview eval results are out on SEAL 👀
— Alexandr Wang (@alexandr_wang) February 28, 2025
⚡ #2 in Tool Use – Chat
🏢 #3 in Tool Use – Enterprise
🥉 #3 in EnigmaEval (behind Claude 3.7 Sonnet)
📚 #4 in MultiChallenge
🎓 #5 in Humanity’s Last Exam
🔍 #6 in VISTA (multimodal)
See rankings here: https://scale.com/leaderboard