Summaries

22. Exploration vs. exploitation (cont.)

23 May 2017, 09:30 Francisco Melo

Exploration vs. exploitation:

  • Stochastic multi-armed bandits and UCB (9.3)
  • Adversarial bandits and EXP3 (9.4)

Applications:

  • Monte Carlo tree search
  • TD-Gammon
  • AlphaGo

Wrap-up.


Session L9

19 May 2017, 12:30 Francisco Melo

Supervised learning (conclusion).


21. Exploration vs. exploitation

19 May 2017, 11:00 Francisco Melo

Exploration vs. exploitation (Chap. 9):

  • The prediction problem (9.1)
  • Prediction with complete information: weighted majority and exponentially weighted averager (9.2)


Session L9

19 May 2017, 09:30 Francisco Melo

Supervised learning (conclusion).