AI Dynamics

Global AI News Aggregator

About

AlphaZero Planning Through MCTS and Neural Networks

AlphaZero *does* perform planning.
That's done through MCTS, using a ConvNet to propose good moves and another one to evaluate positions.
The amount of time spent exploring the tree is potentially infinite.
That's reasoning and planning.
RL is used to train those nets.

→ View original post on X — @ylecun