Creativity Has Left the Chat: The Price of Debiasing Language Models https://arxiv.org/abs/2406.05587
“While RLHF has proven effective in reducing biases and toxicity in LLMs, this alignment process may inadvertently lead to a reduction in the models’ creativity and output diversity.”
RLHF Alignment Reduces Language Model Creativity and Diversity