Paper: A Precocial Reinforcement Learning Solution for Building HVAC Control – Chen et al.
-
Intro
-
Technical Aspects
Information
Primary software used | Python |
Software version | 1.0 |
Course | Computational Intelligence for Integrated Design |
Primary subject | AI & ML |
Secondary subject | Machine Learning |
Level | Expert |
Last updated | November 27, 2024 |
Keywords |
Responsible
Teacher | |
Faculty |
Paper: A Precocial Reinforcement Learning Solution for Building HVAC Control – Chen et al. 0/1
Paper: A Precocial Reinforcement Learning Solution for Building HVAC Control – Chen et al. link copied
Gnu-RL: A Precocial Reinforcement Learning Solution for Building HVAC Control Using a Differentiable MPC Policy – Bingqing Chen, Zicheng Cai, Mario Bergés
You can find this paper on the TU Delft repository at Gnu-RL
Paper: A Precocial Reinforcement Learning Solution for Building HVAC Control – Chen et al. 1/1
Technical Aspects link copied
Software & Plug-ins
EnergyPlus simulation engine to train and evaluate the agent (OpenAI Gym wrapper for EnergyPlus), PyTorch for RL implementation, PI DataLink to access real time observations from BAS, Dark Sky API for predictive information for weather
Summary
The paper proposes a method (Gnu-RL) to allow for practical implementation of RL strategies for HVAC control. The method adopts a Differentiable Model Predictive Control (MPC) policy and leverages historical data from existing HVAC systems to pre-train the agent. When interacting with environment, the agent utilizes a policy gradient algorithm to keep enhancing its policy end-to-end.
The proposed method was implemented both to a virtual and a physical example. Gnu-RL showed improved results in both cases compared to published RL results for the same environment and data from existing controllers respectively. Lastly, probabilistic occupancy was suggested as direction for further development, since occupancy information is not usually available.
Project Information
Author(s): Bingqing Chen, Zicheng Cai, Mario Bergés
Year: 2019
Project type: Paper
Keywords: Deep Reinforcement Learning, Policy iteration
Topic tags: HVAC control, Model Predictive Control (MPC) Policy, Policy Gradient algorithm