Markov Decision Process - Tree search

Online search (approximate solution)

Monte Carlo Tree Search

Reach a leaf node: you have a certain result of winning or losing.
If you have not reach the depth, you can add more nodes to the tree(expansion).

Upper confidence tree (UCT)

Trial in MCTs: selection + …
Trial in UCT: selection
select a to max Q and min N(n, a) : adaptivity

POMDP

Partial Observed