Sumários
22. Exploration vs. exploitation (cont.)
23 maio 2017, 09:30 • Francisco Melo
Exploration vs. exploitation:
- Stochastic multi-armed bandits and UCB (9.3)
- Adversarial bandits and EXP3 (9.4)
Applications:
- Monte Carlo tree search
- TD-Gammon
- Alpha-Go
Wrap-up.
21. Exploration vs. exploitation
19 maio 2017, 11:00 • Francisco Melo
Exploration vs. exploitation (Chap. 9):
- The prediction problem (9.1)
- Prediction with complete information: weighted majority and exponentially weighted averager (9.2)
21. Exploration vs. exploitation
19 maio 2017, 11:00 • Francisco Melo
Exploration vs. exploitation (Chap. 9):
- The prediction problem (9.1)
- Prediction with complete information: weighted majority and exponentially weighted averager (9.2)