You need to have JavaScript enabled in order to access this site.

Lecture 11: Deep Reinforcement Learning

(Big slides Download Big slides) (Small slides Download Small slides) (Recording)

This lecture is about

Approximate Value Iteration using Neural nets
Policy Gradient and Actor-critic methods
Deep Deterministic Policy Gradient Method
Example: Verfication of Rocket Controlled by Neural Network

Read sections 5.3, 5.4.1, 5.7.1, 2.4.2 in Bertsekas' book Links to an external site.