AI Dynamics

Global AI News Aggregator

DAGGER: Imitation Learning Alternative to Reinforcement Learning

People are asking if there are alternatives to RL in RLHF. Yes, imitation with DAGGER (tutorial: https://
ri.cmu.edu/publications/a
n-invitation-to-imitation/
… ). The user provides feedback with corrections, e.g. when the agent says “that” the user tells the agent that instead of saying “that”, it should say “this”.

→ View original post on X — @nandodf,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *