OpenAI's new open models are the new reference! Fewer parameters (both total and activated), yet much better performance than previous SOTA by Qwen EDIT: put the thinking scores for Qwen3, to be more fair!
By
–

OpenAI's new open models are the new reference! Fewer parameters (both total and activated), yet much better performance than previous SOTA by Qwen EDIT: put the thinking scores for Qwen3, to be more fair!