AI Dynamics

Global AI News Aggregator

Multimodal LLMs Fail at Car Detection Task Compared to YOLO

I tested Opus 4.7 on a simple car detection task.
→ Took 5 minutes
→ Missed multiple cars
→ Pointed at blank spaces
→ Bounding boxes were even worse
Codex did better (24 cars, accurate points) but still took 3 minutes.
YOLO does the same thing in 30ms.
Multimodal LLMs are

→ View original post on X — @learnopencv,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *