AI Dynamics

Global AI News Aggregator

About

DeepSeek v4 Underperforms on BullshitBench Reasoning Tasks

BullshitBench: sorry to say but DeepSeek v4 did really badly, towards the bottom of the table, whether it is high or low reasoning.

→ View original post on X — @petergostev,