AI Dynamics

Global AI News Aggregator

Larger Language Models Show More Bias on BBQ Benchmark

First, we find larger LMs are more biased on the BBQ benchmark. Prompting models to avoid bias by giving them instructions (IF) and asking for reasoning (CoT) reverses the trend but only for the largest models and only with enough RLHF training! (Darker lines = more RLHF)

→ View original post on X — @anthropicai,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *