DigiPedia - Tu Delft

Courses
Courses
- Bachelor
  Courses
  
  Bachelor
  - BK Bachelor
- Master
  Courses
  
  Master
  - BK Master
Subjects
Subjects
- 3D Modelling
  Subjects
  
  3D Modelling
  - 3D modelling
- AI & ML
  Subjects
  
  AI & ML
  - Machine Learning
  - Optimization
- Analysis & simulation
  Subjects
  
  Analysis & simulation
- Collaboration
  Subjects
  
  Collaboration
  - Collaboration
- Geospatial and Geographic Information Systems
  Subjects
  
  Geospatial and Geographic Information Systems
  - Geospatial and Geographic Information Systems
- Other
  Subjects
  
  Other
  - Other
- Parametric Modeling
  Subjects
  
  Parametric Modeling
  - General
- Robotics & Prototyping
  Subjects
  
  Robotics & Prototyping
  - Robot Arm
Software
Software
- ANSYS
- ArcGIS
- BIM360
- Gephi
- Grasshopper
- Honeybee
- Houdini
- Jupyter Notebook
- Karamba3D
- Ladybug
- PDOK
- PolyScope (Universal Robots)
- PUG
- Python
- QGIS
- Revit
- Rhino
- RhinoCityJSON
- Speckle
- Visual Studio Code
- Wallacei
Labs
Labs
- BK Labs
  Labs
  
  BK Labs
  - BK Labs
  - Modelhall
- TU Labs at BK
  Labs
  
  TU Labs at BK
  - Artificial Intelligence Labs

Paper: A Precocial Reinforcement Learning Solution for Building HVAC Control – Chen et al.

Intro
Technical Aspects

Information

Primary software used	Python
Software version	1.0
Course	Computational Intelligence for Integrated Design
Primary subject	AI & ML
Secondary subject	Machine Learning
Level	Expert
Last updated	November 27, 2024
Keywords	Deep Reinforcement Learning HVAC control Model Predictive Control (MPC) Policy Policy Gradient algorithm Policy Iteration

Responsible

Teacher	Michela Turrin
Faculty	Bouwkunde

Paper: A Precocial Reinforcement Learning Solution for Building HVAC Control – Chen et al. 0/1

Paper: A Precocial Reinforcement Learning Solution for Building HVAC Control – Chen et al.

Gnu-RL: A Precocial Reinforcement Learning Solution for Building HVAC Control Using a Differentiable MPC Policy – Bingqing Chen, Zicheng Cai, Mario Bergés

You can find this paper on the TU Delft repository at Gnu-RL

Start Start

Paper: A Precocial Reinforcement Learning Solution for Building HVAC Control – Chen et al. 1/1

Technical Aspectslink copied

Software & Plug-ins

EnergyPlus simulation engine to train and evaluate the agent (OpenAI Gym wrapper for EnergyPlus), PyTorch for RL implementation, PI DataLink to access real time observations from BAS, Dark Sky API for predictive information for weather

Summary

The paper proposes a method (Gnu-RL) to allow for practical implementation of RL strategies for HVAC control. The method adopts a Differentiable Model Predictive Control (MPC) policy and leverages historical data from existing HVAC systems to pre-train the agent. When interacting with environment, the agent utilizes a policy gradient algorithm to keep enhancing its policy end-to-end.

The proposed method was implemented both to a virtual and a physical example. Gnu-RL showed improved results in both cases compared to published RL results for the same environment and data from existing controllers respectively. Lastly, probabilistic occupancy was suggested as direction for further development, since occupancy information is not usually available.

Project Information

Author(s): Bingqing Chen, Zicheng Cai, Mario Bergés

Year: 2019

Project type: Paper

Keywords: Deep Reinforcement Learning, Policy iteration

Topic tags: HVAC control, Model Predictive Control (MPC) Policy, Policy Gradient algorithm

Previous chapter Previous chapter Next chapter Next chapter

Paper: A Precocial Reinforcement Learning Solution for Building HVAC Control – Chen et al.

Information

Responsible

Paper: A Precocial Reinforcement Learning Solution for Building HVAC Control – Chen et al. 0/1

Paper: A Precocial Reinforcement Learning Solution for Building HVAC Control – Chen et al.

Paper: A Precocial Reinforcement Learning Solution for Building HVAC Control – Chen et al. 1/1

Technical Aspectslink copied

Software & Plug-ins

Summary

Project Information

Write your feedback.

Thank your for your feedback.