AI Dynamics

Global AI News Aggregator

Engineering Emergent Personalities in LLM Reward Optimization

There is definitely work going into engineering the "you" simulation – the personality that gets all the rewards in verifiable problems, or all the upvotes from users/judge LLMs, or mimics the responses of SFT, and there is an emergent composite personality from that. My point is

→ View original post on X — @karpathy,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *