Lecture 8: Approximation in value space
(Big slides Download Big slides) (Small slides Download Small slides) (Recording)
In this lecture, you will learn about
- More Q-learning examples
- Approximation in value space
- Model-based/model-free and off-line/on-line
- Multi-step lookahead
- Enforced decompositions
- Simplifying probabilistic structure
Recommended text:
Section 2.1-2.3 in Bertsekas' book. Links to an external site.
Further reading: Lincoln/Rantzer Relaxed Dynamic Programming Download Lincoln/Rantzer Relaxed Dynamic Programming