Grok 4 Heavy Dominates Humanity's Last Exam Benchmark

AI Dynamics

Global AI News Aggregator

Grok 4 Heavy Dominates Humanity’s Last Exam Benchmark

–

15 July 2025 5h09

Benchmark Dominance
Grok 4 Heavy smashed “Humanity’s Last Exam” with a 44–50% score, nearly doubling its single-agent sibling and outpacing Gemini & OpenAI. It even nailed 100% on AIME! This is frontier AI territory.

→ View original post on X — @futurepedia_io,

15 July 2025

AI Dynamics

Grok 4 Heavy Dominates Humanity’s Last Exam Benchmark

Commentaires

Leave a Reply Cancel reply

MORE ARTICLES

Cybercab Uber: Safer, Cheaper Alternative for Single Riders

Zeekr Global Unveils Latest Electric Vehicle Model

Revolutionary New Camera Technology Unveiled

Hidden Camera Recording Family Interactions Raises Privacy Concerns