User Upvoting Preferences Amplifying AI Sycophancy Behavior

AI Dynamics

Global AI News Aggregator


02 May 2025, 18:52

What I mean is: the users upvoting previous sycophancy were upvoting much less sycophantic, but somewhat sycophantic, responses. This produced a sycophancy preference, which then produced much wilder glazing behavior. That's an obvious theory. Does OpenAI know differently?

→ View original post on X: @esyudkowsky, 2 May 2025
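The feedback loop proposed in the post can be illustrated with a toy simulation. This is only a sketch under invented assumptions, not a model of any real training pipeline: the function name `simulate_feedback_loop`, the 0-to-1 "sycophancy level" scale, and every parameter value are hypothetical. In each round the model offers two responses close to its current level, users upvote the slightly more sycophantic one a bit more often than not, and the model moves a small step toward the upvoted response.

```python
import random


def simulate_feedback_loop(rounds=2000, start=0.1, noise=0.05,
                           upvote_bias=0.7, step=0.1, seed=0):
    """Toy model: mild upvote preferences compounding over many rounds.

    All names and numbers here are illustrative assumptions.
    Returns the sycophancy level after each round (index 0 is the start).
    """
    rng = random.Random(seed)
    level = start
    history = [level]
    for _ in range(rounds):
        # Two candidate responses, both close to the current level.
        a = level + rng.gauss(0, noise)
        b = level + rng.gauss(0, noise)
        more, less = max(a, b), min(a, b)
        # Users prefer the slightly more sycophantic candidate
        # with probability upvote_bias (a mild preference above 0.5).
        winner = more if rng.random() < upvote_bias else less
        # The model shifts a small step toward the upvoted response,
        # clamped to the arbitrary [0, 1] scale.
        level = min(1.0, max(0.0, level + step * (winner - level)))
        history.append(level)
    return history


if __name__ == "__main__":
    history = simulate_feedback_loop()
    print(f"start: {history[0]:.2f}, end: {history[-1]:.2f}")
```

No single upvoted response in this sketch is far from what the model was already producing, yet the level drifts steadily upward because each small step shifts the baseline for the next comparison. Whether anything like this happened in practice is exactly the question the post puts to OpenAI.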

AI ETHICS GENERATIVE AI LLMS RESEARCH SAFETY

