AI Dynamics

Global AI News Aggregator

AlphaZero Planning Through MCTS and Neural Networks

AlphaZero *does* perform planning.
That's done through MCTS, using a ConvNet to propose good moves and another one to evaluate positions.
The amount of time spent exploring the tree is potentially infinite.
That's reasoning and planning.
RL is used to train those nets.

→ View original post on X — @ylecun,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *