AI Dynamics

Global AI News Aggregator

Disclosure paper: label benchmark scores with their harness

Exactly: the fix isn't fewer benchmarks, it's putting the wrapper on the label. The same model getting a different score under a different harness is fine, as long as the harness ships next to the number. That's the disclosure paper's whole proposal.
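One way to picture "the harness ships next to the number" is a score record that cannot be created without its harness metadata. This is only a rough sketch, not the paper's format: the class names, the fields, and the `comparable` helper are all hypothetical, and the model name, benchmark name, and scores are made-up placeholders.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Harness:
    # Hypothetical fields; a real disclosure would likely list more.
    name: str
    version: str
    prompt_format: str


@dataclass(frozen=True)
class ReportedScore:
    model: str
    benchmark: str
    score: float
    harness: Harness  # required, so a score never travels without its wrapper

    def label(self) -> str:
        """Render the score with its harness attached to the number."""
        h = self.harness
        return (
            f"{self.model} | {self.benchmark}: {self.score:.1f} "
            f"[harness: {h.name} {h.version}, {h.prompt_format}]"
        )


def comparable(a: ReportedScore, b: ReportedScore) -> bool:
    """Two scores are directly comparable only on the same benchmark and harness."""
    return a.benchmark == b.benchmark and a.harness == b.harness


# Placeholder values for illustration only, not real results.
a = ReportedScore("model-x", "bench-y", 71.0, Harness("harness-a", "1.0", "zero-shot"))
b = ReportedScore("model-x", "bench-y", 64.5, Harness("harness-b", "2.3", "5-shot"))

print(a.label())
print(b.label())
print(comparable(a, b))  # same model, different harness
```

Under this sketch, the two differing scores are not a contradiction. They are two labeled measurements, and the label tells a reader why they should not be compared head to head.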

→ View original post on X — @alphasignalai