AlphaGo’s Imitation Learning and Reinforcement Learning Recipe
Global AI News Aggregator
Curiously, AlphaGo was using imitation learning (cake, yummy) + RL refinement (cherry, yes plz). A recipe that should sound familiar to many today.
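To make the two-stage recipe concrete, here is a minimal toy sketch, not AlphaGo's actual training code. It assumes a made-up 3-action problem with a softmax policy; the names `EXPERT_ACTIONS`, `REWARDS`, `imitation_step`, and `rl_step` are hypothetical and exist only for illustration. Stage 1 fits the policy to expert choices (imitation), stage 2 nudges it toward higher reward using the exact policy gradient (RL refinement).

```python
import math

# Hypothetical toy data: the "expert" mostly picks action 1,
# but action 2 actually earns the highest reward.
EXPERT_ACTIONS = [1, 1, 1, 2, 1, 0, 1, 1]
REWARDS = [0.0, 0.5, 1.0]


def softmax(logits):
    """Turn logits into action probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def expected_reward(logits, rewards):
    """Expected reward of the softmax policy defined by the logits."""
    return sum(p * r for p, r in zip(softmax(logits), rewards))


def imitation_step(logits, expert_freq, lr):
    """One gradient-descent step on cross-entropy against the expert."""
    pi = softmax(logits)
    return [l - lr * (p - q) for l, p, q in zip(logits, pi, expert_freq)]


def rl_step(logits, rewards, lr):
    """One gradient-ascent step on expected reward (exact policy gradient)."""
    pi = softmax(logits)
    baseline = sum(p * r for p, r in zip(pi, rewards))
    return [l + lr * p * (r - baseline) for l, p, r in zip(logits, pi, rewards)]


def train(expert_actions, rewards, bc_steps=500, rl_steps=500):
    n = len(rewards)
    expert_freq = [expert_actions.count(a) / len(expert_actions) for a in range(n)]

    # Stage 1 (the cake): imitate the expert.
    logits = [0.0] * n
    for _ in range(bc_steps):
        logits = imitation_step(logits, expert_freq, lr=0.5)
    bc_logits = list(logits)

    # Stage 2 (the cherry): refine the imitation policy with reward.
    for _ in range(rl_steps):
        logits = rl_step(logits, rewards, lr=0.2)
    return bc_logits, logits


bc_logits, rl_logits = train(EXPERT_ACTIONS, REWARDS)
print("after imitation:", [round(p, 3) for p in softmax(bc_logits)])
print("after RL       :", [round(p, 3) for p in softmax(rl_logits)])
```

In this toy setup the imitation stage ends up copying the expert's preference for action 1, and the RL stage then shifts the policy toward whatever the reward favors. The RL step uses the exact expectation instead of sampled rollouts purely to keep the sketch deterministic; a real system would estimate that gradient from sampled games.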