AI Dynamics

Global AI News Aggregator

About

Multimodal LLMs Fail 3D Puzzles: MARBLE Benchmark

Multimodal LLMs can write essays.
They can chat, caption, and even summarize papers. But give them a Portal map or a 3D puzzle, they break instantly. Zero percent accuracy. Welcome to the edge of AI reasoning: The MARBLE Benchmark

→ View original post on X — @aibreakfast