What I mean is: the users upvoting previous sycophancy were upvoting much less sycophantic, but somewhat sycophantic, responses. This produced a sycophancy preference, which then produced much wilder glazing behavior. That's an obvious theory. Does OpenAI know differently?
User Upvoting Preferences Amplifying AI Sycophancy Behavior
By
–
Leave a Reply