Solving MDPs - Q-learning. Cooperative Systems.

4 December 2012, 14:00 Pedro Urbano Lima

Reinforcement learning as an online, iterative method for finding optimal solutions to MDPs.
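
As an illustration of the online, iterative character of Q-learning, here is a minimal tabular sketch. It is not taken from the lecture: the four-state chain environment, the `step` function, and the values of `alpha`, `gamma` and `epsilon` are all assumptions chosen only to keep the example small.

```python
import random

# Hypothetical toy MDP (an assumption for illustration): a chain of states
# 0..3 where state 3 is terminal and reaching it yields reward 1.
N_STATES = 4
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Deterministic transition; returns (next_state, reward, done)."""
    nxt = max(0, state - 1) if action == 0 else state + 1
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        s = 0
        for _ in range(100):  # cap on episode length
            if rng.random() < epsilon:
                a = rng.choice(ACTIONS)  # explore
            else:
                best = max(q[s])  # exploit, breaking ties at random
                a = rng.choice([b for b in ACTIONS if q[s][b] == best])
            s2, r, done = step(s, a)
            # Online update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            target = r if done else r + gamma * max(q[s2])
            q[s][a] += alpha * (target - q[s][a])
            s = s2
            if done:
                break
    return q


q = q_learning()
# Greedy policy extracted from the learned Q-table for the non-terminal states.
policy = [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]
print(policy)
```

Each update uses only the transition just observed, so no model of the MDP is needed; the greedy policy is read off the Q-table once learning has finished.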

Cooperative Systems: Motivation.