AI Dynamics

Global AI News Aggregator

About

Gemma 4 Scores Low on BullshitBench Evaluation

BullshitBench update: Gemma 4 is scoring pretty low – 58/67th for 31b and 62/87th for 26 A4B. Not super surprising, we didn't have any small models ranking particularly highly.

→ View original post on X — @petergostev,