AI Dynamics

Global AI News Aggregator

New Models Added to BullshitBench: Qwen Performance Analysis

I did a big cleanup of some new models to add to BullshitBench. None of them are particularly interesting, tbh. Qwen scored relatively well, but below Qwen 3.5.

→ View original post on X — @petergostev