Read the paper here: https://
anthropic.com/index/evaluati
ng-and-mitigating-discrimination-in-language-model-decisions
… And access our dataset (and the prompts used to construct it) here: https://
huggingface.co/datasets/Anthr
opic/discrim-eval
…
Anthropic evaluates discrimination mitigation in language models
By
–