MONTE CARLO TREE SEARCH
Overview
MCTS is a probabilistic algorithm that uses random sampling to make decisions in games and optimization problems. It builds a search tree asymmetrically, focusing computational resources on promising paths.
Four Phases
- 1.Selection - Navigate to leaf node
- 2.Expansion - Add new child nodes
- 3.Simulation - Random playout
- 4.Backpropagation - Update statistics
UCB1 Formula
UCB1 = w_i/n_i + c√(ln(N)/n_i) where: w_i = wins for node i n_i = visits for node i N = total parent visits c = exploration constant
Advantages
Works without domain knowledge
Anytime algorithm
Handles large search spaces
Asymmetric tree growth