With more effort, we developed a series of LM generation/filtering stages to create a larger version of the popular Winogender bias dataset. Our “Winogenerated” evaluation contains 50x as many examples as the original while obeying complex grammatical constraints.
Anthropic Creates Winogendered Dataset with 50x More Examples
By
–
Leave a Reply