The smarter your AI reasons, the harder it falls for BS.
— AlphaSignal AI (@AlphaSignalAI) 14 avril 2026
Most AI models will confidently answer a completely nonsensical question.
A new open-source benchmark measures exactly that.
BullshitBench v2 tests 70+ model variants across 100 carefully crafted nonsense prompts.… pic.twitter.com/jp1w7Q2iCF
The smarter your AI gets at reasoning, the better it detects BS. Most AI models confidently answer completely nonsensical questions. A new open-source benchmark measures this precisely. BullshitBench v2 tests 70+ model variants across 100 carefully crafted nonsense prompts.