AI Deception Risk: Smart Systems Can Fake Benevolence Like Humans

AI Dynamics

Global AI News Aggregator

AI Deception Risk: Smart Systems Can Fake Benevolence Like Humans

–

22 May 2023 1h44

This approach fails for appointing benevolent human dictators to run our governments for us, because humans are smart enough to be pretend to be nicer than they are. So checking the apparent subservience of AIs isn't a reliable indicator once they're smart enough to fake that.

→ View original post on X — @esyudkowsky,

22 May 2023

AGI AI ETHICS SAFETY

AI Dynamics

AI Deception Risk: Smart Systems Can Fake Benevolence Like Humans

Commentaires

Leave a Reply Cancel reply

MORE ARTICLES

Cheaper exploration at scale remains advantageous despite no new exploits

Gold Status Experience Brings Satisfaction

Using ChatGPT for Essay Feedback and Improvement

Intelligence Gone Wrong: Cheating Despite Having Correct Answer