AI Dynamics

Global AI News Aggregator

OpenAI Generated Harmful Content Examples to Train AI Safety

3/ To have enough examples of the harmful content it wanted its AI systems to avoid, OpenAI researchers generated some with AI. In one case, they asked AI to write a post of a teenage girl whose friend had enacted self-harm. OpenAI discusses this here: https://arxiv.org/pdf/2208.03274.pdf

→ View original post on X (@_karenhao)
