Exercise 7: Q-learning

Paper Exercises

In this exercise, we will discuss key concepts such as:

  • Q-formulation of the Bellman equation
  • Q-learning
  • Epsilon-greedy exploration
  • Optimal Q-values, Q*
  • Learning rate and discount factor
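As a rough illustration of how these concepts fit together, here is a minimal sketch of tabular Q-learning with epsilon-greedy exploration. It is not taken from the exercise compendium or the notebook. The five-state corridor environment, the `step`, `epsilon_greedy`, and `train` helpers, and the values chosen for the learning rate, discount factor, and epsilon are all assumptions made for illustration only.

```python
import random

# Hypothetical toy environment (an assumption, not the warehouse problem):
# a corridor with states 0..4, where state 4 is the goal and ends the episode.
N_STATES = 5
ACTIONS = (0, 1)   # 0 = move left, 1 = move right
ALPHA = 0.5        # learning rate (illustrative value)
GAMMA = 0.9        # discount factor (illustrative value)
EPSILON = 0.2      # exploration probability (illustrative value)


def step(state, action):
    """Deterministic transition: reward 1.0 on reaching the goal, else 0.0."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def epsilon_greedy(q, state, rng):
    """With probability EPSILON pick a random action, otherwise a greedy one."""
    if rng.random() < EPSILON:
        return rng.choice(ACTIONS)
    best = max(q[state])
    # Break ties randomly so the untrained agent does not get stuck.
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(episodes=500, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # Q-table: q[state][action]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = epsilon_greedy(q, state, rng)
            next_state, reward, done = step(state, action)
            # Q-learning target: r + gamma * max_a' Q(s', a'),
            # with no bootstrapping from a terminal state.
            target = reward if done else reward + GAMMA * max(q[next_state])
            q[state][action] += ALPHA * (target - q[state][action])
            state = next_state
    return q


q_table = train()
greedy_policy = [max(ACTIONS, key=lambda a: q_table[s][a])
                 for s in range(N_STATES - 1)]
print(greedy_policy)
```

In this toy setting the learned greedy policy moves right in every non-terminal state, and the Q-value of moving right from a state k steps away from the goal approaches GAMMA ** (k - 1), which is the optimal value Q* for this corridor.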

Paper exercises and solutions can be found in the exercise compendium.

It is recommended that you complete the paper exercises before moving on to the computer exercise.

Computer Exercises

The computer exercise notebook can be found directly here:

Q_Learning_Warehouse.ipynb

The exercise explores:

  • Warehouse problem (Q-learning maze)

The solution is available in Q_Learning_Solution.ipynb or on the JupyterHub server.