MIT just quietly humbled every major AI lab — and almost nobody’s talking about it. They built a new benchmark called WorldTest to see if AI actually understands the world. The results are brutal.
Even the top models — Claude, Gemini 2.5 Pro, OpenAI o3 — got crushed by
MIT’s WorldTest Benchmark Challenges Top AI Models’ World Understanding
By
–
Leave a Reply