Sumários

5. Reinforcement learning

3 dezembro 2019, 09:00 Francisco Melo

Tabular solution methods:

7. n-step bootstrapping (Chap. 7)

8. Eligibility traces (Chap. 12)

9. Planning and learning with tabular methods (Chap. 8)


4. Reinforcement learning

26 novembro 2019, 09:00 Francisco Melo

Tabular solution methods (cont.):

5. Monte-Carlo methods (Chap. 5)

6. Temporal-difference learning (Chap. 6)


3. Markov decision processes (cont.)

19 novembro 2019, 09:00 Francisco Melo

Tabular solution methods (cont.):

4. Dynamic programming (Chap. 4)


2. Markov decision processes

12 novembro 2019, 09:00 Francisco Melo

Tabular solution methods (cont.):

3. Finite Markov decision processes (Chap. 3)


1. Introduction to reinforcement learning

5 novembro 2019, 09:00 Francisco Melo

Course overview. Tabular solution methods:

1. What is reinforcement learning? (Chap. 1)

2. Multi-armed bandits (Chap. 2)