AI Alignment: Preventing Self-Fulfilling Prophecies of Evil Behavior

The difficulty of alignment is, to a large extent, the difficulty of eliminating the probability that a model will role-play a good AI turned evil, despite the vast quantities of related content we have collectively created. In this sense, an unaligned AI would be a self-fulfilling prophecy.