Lecture 9: Deep Reinforcement Learning

(Big slides) (Small slides) (Recording)

This lecture is about:

Actor-critic methods using neural nets
SARSA and Temporal Difference Learning
Policy gradient methods
Monte Carlo Tree Search

Read sections 5.3, 5.4.1, 5.7.1, and 2.4.2 in Bertsekas' book.