Lecture 9: Deep Reinforcement Learning

(Big slides Download Big slides) (Small slides Download Small slides) (Recording)

This lecture is about

  • Actor-critic methods using neural nets
  • SARSA and Temporal Difference Learning
  • Policy gradient methods
  • Monte Carlo Tree Search

Read sections 5.3, 5.4.1, 5.7.1, 2.4.2 in Bertsekas' book Links to an external site.