Testing Language Models at scale for social bias is challenging. We build automated testing: test sentences are automatically generated given the bias dimensions. This allows getting statistically meaningful measures for bias as opposed to a small number of hand-written templates
Automated Testing for Social Bias in Large Language Models
By
–
Leave a Reply