I feel like OpenAI is the only lab that really nailed reasoning. DeepSeek was probably the closest, but I'd need to see a frontier model from them to be sure. Gemini's reasoning was quite weird and all over the place. Claude's reasoning never used to matter (non-thinking were
OpenAI leads reasoning capabilities, DeepSeek close behind