AI Dynamics

Global AI News Aggregator

o4-mini Outperforms Grok 4 on SciCode Benchmark

o4-mini beats grok 4 on SciCode, despite being way faster and cheaper. It feels like maybe grok 4 was fine tuned specifically for some specific popular high status tasks? (Still fairly impressive, but more of a party trick than real capability AFAICT.)

→ View original post on X — @jeremyphoward,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *