Improving Policy Learning via Language Dynamics Distillation. @hllo_wrld
, @Jayelmnop
, @LukeZettlemoyer
, @EGrefen
, @_rockt propose Language Dynamics Distillation (LDD), which pretrains a model to predict environment dynamics given demonstrations with language descriptions
Language Dynamics Distillation Improves Policy Learning
By
–