Lecture 9: Policy iteration and policy gradient methods

(Big slides Download Big slides) (Small slides Download Small slides) (Recordings)

This lecture is about

  • Policy iteration
  • Optimistic policy iteration
  • Rollout and approximate policy improvement
  • Approximate policy iteration

Read sections 4.6 and 5.1 in Bertsekas' book Links to an external site..