Lecture 9: Policy iteration and policy gradient methods
(Big slides) (Small slides) (Recordings)
This lecture covers:
- Policy iteration
- Optimistic policy iteration
- Rollout and approximate policy improvement
- Approximate policy iteration
Read sections 4.6 and 5.1 in Bertsekas' book.