Larger Language Models Show More Bias on BBQ Benchmark

AI Dynamics

Global AI News Aggregator


16 February 2023, 17:43

First, we find larger LMs are more biased on the BBQ benchmark. Prompting models to avoid bias by giving them instructions (IF) and asking for reasoning (CoT) reverses the trend but only for the largest models and only with enough RLHF training! (Darker lines = more RLHF)

Source: original post on X by @anthropicai, 16 February 2023
