DigiPedia - TU Delft


Policy Iteration

Information

Primary software used	Jupyter Notebook
Course	Computational Intelligence for Integrated Design
Primary subject	AI & ML
Secondary subject	Machine Learning
Level	Intermediate
Last updated	December 18, 2024
Keywords	Machine Learning, Markov Decision Processes, Policy Iteration, Reinforcement Learning

Responsible

Teachers	Charalampos Andriotis, Michela Turrin
Faculty	Bouwkunde

Policy Iteration

Policy iteration is an algorithm that finds an optimal policy for a Markov Decision Process (MDP) by repeatedly improving an initial policy. It alternates between two steps: policy evaluation, which computes the value of each state under the current policy, and policy improvement, which updates the policy by selecting in each state the action that maximizes the expected value. The two steps repeat until the policy no longer changes, at which point it is optimal.
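The evaluate-then-improve loop described above can be sketched in a few lines of plain Python. This is a minimal illustration, not the code from the tutorial notebook: the two-state MDP, the function names, the discount factor `gamma`, and the stopping tolerance `tol` are all assumptions chosen for the example.

```python
def evaluate_policy(policy, P, R, gamma, tol=1e-10):
    """Policy evaluation: iterate until the state values under `policy` settle."""
    V = [0.0] * len(policy)
    while True:
        delta = 0.0
        for s, a in enumerate(policy):
            # Expected reward now plus discounted value of the next state.
            v_new = R[s][a] + gamma * sum(p * V[t] for t, p in enumerate(P[s][a]))
            delta = max(delta, abs(v_new - V[s]))
            V[s] = v_new
        if delta < tol:
            return V


def improve_policy(V, P, R, gamma):
    """Policy improvement: in each state pick the action with the highest value."""
    policy = []
    for s in range(len(V)):
        q = [R[s][a] + gamma * sum(p * V[t] for t, p in enumerate(P[s][a]))
             for a in range(len(P[s]))]
        policy.append(max(range(len(q)), key=q.__getitem__))
    return policy


def policy_iteration(P, R, gamma=0.9, max_iter=100):
    """Alternate evaluation and improvement until the policy stops changing."""
    policy = [0] * len(P)  # arbitrary starting policy: action 0 everywhere
    for _ in range(max_iter):
        V = evaluate_policy(policy, P, R, gamma)
        new_policy = improve_policy(V, P, R, gamma)
        if new_policy == policy:  # stable policy: stop
            break
        policy = new_policy
    return policy, V


# Hypothetical toy MDP: 2 states, 2 actions (0 = stay, 1 = move).
# P[s][a] is the list of next-state probabilities; R[s][a] is the reward.
P = [[[1.0, 0.0], [0.0, 1.0]],
     [[0.0, 1.0], [1.0, 0.0]]]
R = [[0.0, 0.0],
     [1.0, 0.0]]  # only staying in state 1 pays off

policy, V = policy_iteration(P, R)
print(policy, [round(v, 3) for v in V])
```

In this toy problem the loop settles on moving out of state 0 and staying in state 1. The evaluation step here uses repeated sweeps; solving the corresponding linear system directly is an equally valid choice for small state spaces.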

For this tutorial you need Python, Jupyter Notebook, and some common libraries, including scikit-learn, installed. See the Jupyter Notebook Installation & Overview tutorial for more information.
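A quick way to confirm the setup before opening the notebook is to check whether the packages can be found by Python. The sketch below is only an illustration: the helper name `missing_packages` is hypothetical, and the list of import names is an assumption (the tutorial names scikit-learn, which imports as `sklearn`; `numpy` and `notebook` are guesses at the other common libraries).

```python
import importlib.util


def missing_packages(names):
    """Return the top-level module names that Python cannot locate."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Assumed import names; adjust to match the libraries the notebook uses.
REQUIRED = ["sklearn", "numpy", "notebook"]

missing = missing_packages(REQUIRED)
print("Missing packages:", ", ".join(missing) if missing else "none")
```

Any name reported as missing would need to be installed in the same environment that runs the notebook.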

Download the Jupyter notebook below to follow along with the tutorial.

Download MDPPolicyIteration_PYscript_01

application/zip (ZIP, 155 KB)

Video Overview