AI Dynamics

Global AI News Aggregator

DAGGER Counterfactual Teaching Method for LLM Training

DAGGER is a form of counterfactual teaching as explained in https://
arxiv.org/abs/2110.10819 – Note that it is the student who always acts. The teacher only provides corrections, which are used to minimise the LLM loss directly. Note however that this imitation IS NOT supervised learning.

→ View original post on X — @nandodf,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *