Lecture 11: Deep Reinforcement Learning
(Big slides) (Small slides) (Recording)
This lecture covers:
- Approximate Value Iteration Using Neural Networks
- Policy Gradient and Actor-critic methods
- Deep Deterministic Policy Gradient Method
- Example: Verification of a Rocket Controlled by a Neural Network
Read sections 5.3, 5.4.1, 5.7.1, and 2.4.2 in Bertsekas' book.