DigiPedia - Tu Delft

Courses
Courses
- Bachelor
  Courses
  
  Bachelor
  - BK Bachelor
- Master
  Courses
  
  Master
  - BK Master
Subjects
Subjects
- 3D Modelling
  Subjects
  
  3D Modelling
  - 3D modelling
- AI & ML
  Subjects
  
  AI & ML
  - Machine Learning
  - Optimization
- Analysis & simulation
  Subjects
  
  Analysis & simulation
- Collaboration
  Subjects
  
  Collaboration
  - Collaboration
- Geospatial and Geographic Information Systems
  Subjects
  
  Geospatial and Geographic Information Systems
  - Geospatial and Geographic Information Systems
- Other
  Subjects
  
  Other
  - Other
- Parametric Modeling
  Subjects
  
  Parametric Modeling
  - General
- Prototyping and Manufacturing
  Subjects
  
  Prototyping and Manufacturing
  - Additive Manufacturing
  - Robot Programming
Software
Software
- ANSYS
- ArcGIS
- Bambu Studio
- BIM360
- Docofossor
- Gephi
- Grasshopper
- Honeybee
- Houdini
- Jupyter Notebook
- Karamba3D
- Ladybug
- Maya
- PDOK
- PolyScope (Universal Robots)
- PUG
- Python
- QGIS
- Revit
- Rhino
- RhinoCityJSON
- Speckle
- Visual Studio Code
- Wallacei
Labs
Labs
- BK Labs
  Labs
  
  BK Labs
  - BK Labs
  - Modelhall
- Makers Facilities
  Labs
  
  Makers Facilities
  - Faculty of Industrial Design
- TU Labs at BK
  Labs
  
  TU Labs at BK
  - Artificial Intelligence Labs

Q-Learning

Intro
Video Overview

Information

Primary software used	Jupyter Notebook
Course	Q-Learning
Primary subject	AI & ML
Secondary subject	Machine Learning
Level	Intermediate
Last updated	November 11, 2024
Keywords	Machine Learning Q-Learning Reinforcement Learning

Q-Learning 0/1

Q-Learning

Q-learning is a model-free reinforcement learning algorithm that seeks to learn the optimal action-selection policy by iteratively updating the estimated value, or “Q-value,” of taking a specific action in a given state based on the received reward and the estimated value of future states. It does this by balancing exploration of new actions and exploitation of known rewards, eventually converging to the optimal policy that maximizes cumulative rewards over time.

For this tutorial you need to have installed Python, Jupyter notebooks, and some common libraries including Scikit Learn. Please see the following tutorial for more information.

Download the Jupyter notebook here to follow along with the tutorial.

Download MDPQLearning_PYscript_01

application/zip (ZIP, 63 KB)

Start Start

Q-Learning 1/1

Video Overviewlink copied

Previous chapter Previous chapter Next chapter Next chapter

Teachers	Charalampos Andriotis , Lisa-Marie Mueller , Michela Turrin
Faculty	Bouwkunde

Q-Learning

Information

Responsible

Q-Learning 0/1

Q-Learning

Jupyter Notebook Installation & Overview

Download MDPQLearning_PYscript_01

application/zip (ZIP, 63 KB)

Q-Learning 1/1

Video Overviewlink copied

Write your feedback.

Thank your for your feedback.