AI Dynamics

Global AI News Aggregator

Strong Regularization Prevents RLHF Model Degradation

May your regularizer be strong, lest you RLHF to slop.

→ View original post on X — @karpathy,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *