Q-Learning

  • Intro
  • Video Overview

Information

Primary software used Jupyter Notebook
Course Q-Learning
Primary subject AI & ML
Secondary subject Machine Learning
Level Intermediate
Last updated November 11, 2024
Keywords

Q-Learning 0/1

Q-Learning link copied

Q-learning is a model-free reinforcement learning algorithm that seeks to learn the optimal action-selection policy by iteratively updating the estimated value, or “Q-value,” of taking a specific action in a given state based on the received reward and the estimated value of future states. It does this by balancing exploration of new actions and exploitation of known rewards, eventually converging to the optimal policy that maximizes cumulative rewards over time.

For this tutorial you need to have installed Python, Jupyter notebooks, and some common libraries including Scikit Learn. Please see the following tutorial for more information.

Download the Jupyter notebook here to follow along with the tutorial.

Download MDPQLearning_PYscript_01
application/zip (ZIP, 63 KB)