Lecture 9: Deep Reinforcement Learning
(Big slides) (Small slides) (Recording)
This lecture covers:
- Actor-critic methods using neural nets
- SARSA and Temporal Difference Learning
- Policy gradient methods
- Monte Carlo Tree Search
Read sections 5.3, 5.4.1, 5.7.1, and 2.4.2 in Bertsekas' book.