AI Dynamics

Global AI News Aggregator

About

Language Models Can Be Conditioned to Avoid Controversy Through RLHF

And yet they can be conditioned to be boring and non controversial through RLHF.

→ View original post on X — @aravsrinivas