Enhancing Air Conditioning System Efficiency Through Load Prediction and Deep Reinforcement Learning: A Case Study of Ground Source Heat Pumps
Abstract
:1. Introduction
- (1)
- A model-free optimization system for chilled units is developed. It adjusts operations based on predictions of future cooling loads, and effectively balances the cooling efficiency with energy-saving needs while maintaining very high control performance within environmental fluctuations.
- (2)
- Eight different machine learning algorithms are incorporated into deep reinforcement learning to accurately adjust the chilled water supply temperature setpoint. This allows for the optimization of energy consumption while ensuring indoor comfort.
- (3)
- The proposed method is compared with different control strategies, including rule-based control, empirical model control, and non-predictive deep reinforcement learning control.
2. Methods
2.1. Overview of Load Forecasting Models
- (1)
- Extreme gradient boosting
- (2)
- Random forest
- (3)
- Extreme learning machine
- (4)
- Light gradient boosting machine
- (5)
- Radial basis function network
- (6)
- Support vector machine
- (7)
- Long short-term memory neural network
- (8)
- Convolutional neural network
2.2. Reinforcement Learning
- (1)
- The policy Q-network and target Q′-network, having the same architecture and initial parameters, are used to stabilize the update of the Q-function.
- (2)
- In order to balance the exploration and exploitation, a ε-greedy strategy is adopted for action selection. The agent randomly selects actions with a probability of ε, which gradually decreases to εmin over time at a rate of Δε.
- (3)
- After executing an action, the agent stores the ((s, a, r, s′)) experience tuple in a replay memory.
- (4)
- When a state ends, a mini-batch of experiences is randomly sampled from the replay memory to update the parameters of the network, while prioritized experience replay is used to increase the learning efficiency.
3. Case Study
3.1. Load Prediction Using Different Machine Learning Algorithms
3.1.1. Data Collection
3.1.2. Data Preprocessing
3.1.3. Feature Selection
3.1.4. Hyperparameter Settings for Predictive Models
3.1.5. Evaluation Metrics
3.2. Deep Reinforcement Learning Optimization Control
3.2.1. Model Establishment Process
3.2.2. Hyperparameter Configuration for the Deep Q-Network
3.2.3. Comparative Strategy Configuration
4. Results
4.1. Performance Analysis of the Prediction Models
4.2. Performance Analysis of Reinforcement Learning
4.3. Analysis of the Source-Side Water Supply Temperature Setpoint
4.4. Energy Consumption Analysis of Ground Source Heat Pump Units
5. Discussion
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Nomenclature
CL | cooling load, kW |
SR | solar radiation, W/m2 |
Tout | outdoor temperature, °C |
v | outdoor wind speed, m/s |
α | wind direction, ° |
φ | relative humidity, % |
Abbreviation | |
A | action |
BPNN | back propagation neural network |
CNN | convolutional neural network |
CV-RMSE | coefficient of variation in the root mean squared error |
DRL | deep reinforcement learning |
ELM | extreme learning machine |
GA-BP | genetic algorithm-based back propagation neural network |
HVAC | heating, ventilation, and air conditioning |
LightGBM | light gradient boosting machine |
LSTM | long short-term memory |
MAE | mean absolute error |
MAPE | mean absolute percentage error |
MDP | markov decision process |
MEA-BP | mind evolutionary algorithm-based back propagation neural network |
P | transition probability |
R | reward |
R2 | coefficient of determination |
RBFN | radial basis function network |
RF | random forest |
RL | reinforcement Learning |
RMSE | root mean squared error |
S | state |
SVM | support vector machine |
XGBoost | extreme gradient boosting |
γ | discount factor |
References
- Gorzałczany, M.B.; Rudziński, F. Energy Consumption Prediction in Residential Buildings—An Accurate and Interpretable Machine Learning Approach Combining Fuzzy Systems with Evolutionary Optimization. Energies 2024, 17, 3242. [Google Scholar] [CrossRef]
- Shi, Z.; Zheng, R.; Shen, R.; Yang, D.; Wang, G.; Liu, Y.; Li, Y.; Zhao, J. Building heating load forecasting based on the theory of transient heat transfer and deep learning. Energy Build. 2024, 313, 114290. [Google Scholar] [CrossRef]
- Cui, M.; Liu, J.; Kim, M.K.; Wu, X. Application potential analysis of different control strategies for radiant floor cooling systems in office buildings in different climate zones of China. Energy Build. 2023, 282, 112772. [Google Scholar] [CrossRef]
- Homod, R.Z.; Mohammed, H.I.; Ben Hamida, M.B.; Albahri, A.S.; Alhasnawi, B.N.; Albahri, O.S.; Alamoodi, A.H.; Mahdi, J.M.; Albadr, M.A.A.; Yaseen, Z.M. Optimal shifting of peak load in smart buildings using multiagent deep clustering reinforcement learning in multi-tank chilled water systems. J. Energy Storage 2024, 92, 112140. [Google Scholar] [CrossRef]
- Kim, Y.-S.; Kim, M.K.; Fu, N.; Liu, J.; Wang, J.; Srebric, J. Investigating the Impact of Data Normalization Methods on Predicting Electricity Consumption in a Building Using different Artificial Neural Network Models. Sustain. Cities Soc. 2025, 118, 105570. [Google Scholar] [CrossRef]
- Sun, L.; Hu, Z.; Mae, M.; Imaizumi, T. Individual room air-conditioning control in high-insulation residential building during winter: A deep reinforcement learning-based control model for reducing energy consumption. Energy Build. 2024, 323, 114799. [Google Scholar] [CrossRef]
- He, K.; Fu, Q.; Lu, Y.; Ma, J.; Zheng, Y.; Wang, Y.; Chen, J. Efficient model-free control of chiller plants via cluster-based deep reinforcement learning. J. Build. Eng. 2024, 82, 108345. [Google Scholar] [CrossRef]
- Su, Y.; Zou, X.; Tan, M.; Peng, H.; Chen, J. Integrating few-shot personalized thermal comfort model and reinforcement learning for HVAC demand response optimization. J. Build. Eng. 2024, 91, 109509. [Google Scholar] [CrossRef]
- Fu, Q.; Chen, X.; Ma, S.; Fang, N.; Xing, B.; Chen, J. Optimal control method of HVAC based on multi-agent deep reinforcement learning. Energy Build. 2022, 270, 112284. [Google Scholar] [CrossRef]
- Gupta, A.; Badr, Y.; Negahban, A.; Qiu, R.G. Energy-efficient heating control for smart buildings with deep reinforcement learning. J. Build. Eng. 2021, 34, 101739. [Google Scholar] [CrossRef]
- Du, Y.; Li, F.; Munk, J.; Kurte, K.; Kotevska, O.; Amasyali, K.; Zandi, H. Multi-task deep reinforcement learning for intelligent multi-zone residential HVAC control. Electr. Power Syst. Res. 2021, 192, 106959. [Google Scholar] [CrossRef]
- Deng, Z.; Chen, Q. Reinforcement learning of occupant behavior model for cross-building transfer learning to various HVAC control systems. Energy Build. 2021, 238, 110860. [Google Scholar] [CrossRef]
- Jiang, Z.; Risbeck, M.J.; Ramamurti, V.; Murugesan, S.; Amores, J.; Zhang, C.; Lee, Y.M.; Drees, K.H. Building HVAC control with reinforcement learning for reduction of energy cost and demand charge. Energy Build. 2021, 239, 110833. [Google Scholar] [CrossRef]
- Zou, Z.; Yu, X.; Ergan, S. Towards optimal control of air handling units using deep reinforcement learning and recurrent neural network. Build. Environ. 2020, 168, 106535. [Google Scholar] [CrossRef]
- Qiu, S.; Li, Z.; Li, Z.; Li, J.; Long, S.; Li, X. Model-free control method based on reinforcement learning for building cooling water systems: Validation by measured data-based simulation. Energy Build. 2020, 218, 110055. [Google Scholar] [CrossRef]
- Moayedi, H.; Mukhtar, A.; Khedher, N.B.; Elbadawi, I.; Amara, M.B.; Tt, Q.; Khalilpoor, N. Forecasting of energy-related carbon dioxide emission using ANN combined with hybrid metaheuristic optimization algorithms. Eng. Appl. Comput. Fluid Mech. 2024, 18, 2322509. [Google Scholar] [CrossRef]
- Khedher, N.B.; Mukhtar, A.; Md Yasir, A.S.H.; Khalilpoor, N.; Foong, L.K.; Nguyen Le, B.; Yildizhan, H. Approximating heat loss in smart buildings through large scale experimental and computational intelligence solutions. Eng. Appl. Comput. Fluid Mech. 2023, 17, 2226725. [Google Scholar] [CrossRef]
- Khan, M.I.; Ghodhbani, R.; Taha, T.; Al-Yarimi, F.A.M.; Zeeshan, A.; Ijaz, N.; Khedher, N.B. Advanced intelligent computing ANN for momentum, thermal, and concentration boundary layers in plasma electro hydrodynamics burgers fluid. Int. Commun. Heat Mass Transf. 2024, 159, 108195. [Google Scholar] [CrossRef]
- Lu, S.; Zhou, S.; Ding, Y.; Kim, M.K.; Yang, B.; Tian, Z.; Liu, J. Exploring the comprehensive integration of artificial intelligence in optimizing HVAC system operations: A review and future outlook. Results Eng. 2025, 25, 103765. [Google Scholar] [CrossRef]
- Liu, X.; Gou, Z. Occupant-centric HVAC and window control: A reinforcement learning model for enhancing indoor thermal comfort and energy efficiency. Build. Environ. 2024, 250, 111197. [Google Scholar] [CrossRef]
- Corriou, J.-P. Dynamic Optimization. In Numerical Methods and Optimization: Theory and Practice for Engineers; Springer International Publishing: Cham, Switzerland, 2021; pp. 653–708. [Google Scholar]
- Homod, R.Z.; Yaseen, Z.M.; Hussein, A.K.; Almusaed, A.; Alawi, O.A.; Falah, M.W.; Abdelrazek, A.H.; Ahmed, W.; Eltaweel, M. Deep clustering of cooperative multi-agent reinforcement learning to optimize multi chiller HVAC systems for smart buildings energy management. J. Build. Eng. 2023, 65, 105689. [Google Scholar] [CrossRef]
- Chen, B.; Zeng, W.; Nie, H.; Deng, Z.; Yang, W.; Yan, B. Optimal load distribution control for airport terminal chiller units based on deep reinforcement learning. J. Build. Eng. 2024, 97, 110787. [Google Scholar] [CrossRef]
- Borja-Conde, J.A.; Nadales, J.M.; Ordonez, J.G.; Fele, F.; Limon, D. Efficient management of HVAC systems through coordinated operation of parallel chiller units: An economic predictive control approach. Energy Build. 2024, 304, 113879. [Google Scholar] [CrossRef]
- Sun, M.; Yang, J.; Yang, C.; Wang, W.; Wang, X.; Li, H. Research on prediction of PPV in open-pit mine used RUN-XGBoost model. Heliyon 2024, 10, e28246. [Google Scholar] [CrossRef] [PubMed]
- Wang, H.; Chen, Y.; Kang, J.; Ding, Z.; Zhu, H. An XGBoost-Based predictive control strategy for HVAC systems in providing day-ahead demand response. Build. Environ. 2023, 238, 110350. [Google Scholar] [CrossRef]
- Seyyedattar, M.; Zendehboudi, S.; Ghamartale, A.; Afshar, M. Advancing hydrogen storage predictions in metal-organic frameworks: A comparative study of LightGBM and random forest models with data enhancement. Int. J. Hydrogen Energy 2024, 69, 158–172. [Google Scholar] [CrossRef]
- Aparicio-Ruiz, P.; Barbadilla-Martín, E.; Guadix, J.; Nevado, J. Analysis of Variables Affecting Indoor Thermal Comfort in Mediterranean Climates Using Machine Learning. Buildings 2023, 13, 2215. [Google Scholar] [CrossRef]
- Hou, X.; Guo, X.; Yuan, Y.; Zhao, K.; Tong, L.; Yuan, C.; Teng, L. The state of health prediction of Li-ion batteries based on an improved extreme learning machine. J. Energy Storage 2023, 70, 108044. [Google Scholar] [CrossRef]
- Lei, L.; Shao, S. Prediction model of the large commercial building cooling loads based on rough set and deep extreme learning machine. J. Build. Eng. 2023, 80, 107958. [Google Scholar] [CrossRef]
- Wang, Q.; Qi, J.; Hosseini, S.; Rasekh, H.; Huang, J. ICA-LightGBM Algorithm for Predicting Compressive Strength of Geo-Polymer Concrete. Buildings 2023, 13, 2278. [Google Scholar] [CrossRef]
- Ni, C.; Huang, H.; Cui, P.; Ke, Q.; Tan, S.; Ooi, K.T.; Liu, Z. Light Gradient Boosting Machine (LightGBM) to forecasting data and assisting the defrosting strategy design of refrigerators. Int. J. Refrig. 2024, 160, 182–196. [Google Scholar] [CrossRef]
- Zheng, G.; Feng, Z.; Jiang, M.; Tan, L.; Wang, Z. Predicting the Energy Consumption of Commercial Buildings Based on Deep Forest Model and Its Interpretability. Buildings 2023, 13, 2162. [Google Scholar] [CrossRef]
- An, W.; Zhu, X.; Yang, K.; Kim, M.K.; Liu, J. Hourly Heat Load Prediction for Residential Buildings Based on Multiple Combination Models: A Comparative Study. Buildings 2023, 13, 2340. [Google Scholar] [CrossRef]
- Pastre, G.G.; Balbinot, A.; Pedroni, R. Virtual temperature sensor using Support Vector Machines for autonomous uninterrupted automotive HVAC systems control. Int. J. Refrig. 2022, 144, 128–135. [Google Scholar] [CrossRef]
- He, K.; Fu, Q.; Lu, Y.; Wang, Y.; Luo, J.; Wu, H.; Chen, J. Predictive control optimization of chiller plants based on deep reinforcement learning. J. Build. Eng. 2023, 76, 107158. [Google Scholar] [CrossRef]
- An, W.; Gao, B.; Liu, J.; Ni, J.; Liu, J. Predicting hourly heating load in residential buildings using a hybrid SSA–CNN–SVM approach. Case Stud. Therm. Eng. 2024, 59, 104516. [Google Scholar] [CrossRef]
- Ajifowowe, I.; Chang, H.; Lee, C.S.; Chang, S. Prospects and challenges of reinforcement learning-based HVAC control. J. Build. Eng. 2024, 98, 111080. [Google Scholar] [CrossRef]
- Lu, S.; Cui, M.; Gao, B.; Liu, J.; Ni, J.; Liu, J.; Zhou, S. A Comparative Analysis of Machine Learning Algorithms in Predicting the Performance of a Combined Radiant Floor and Fan Coil Cooling System. Buildings 2024, 14, 1659. [Google Scholar] [CrossRef]
- Wen, H.; Liu, B.; Di, M.; Li, J.; Zhou, X. A SHAP-enhanced XGBoost model for interpretable prediction of coseismic landslides. Adv. Space Res. 2024, 74, 3826–3854. [Google Scholar] [CrossRef]
- Sun, Z.; Wang, X.; Huang, H.; Yang, Y.; Wu, Z. Predicting compressive strength of fiber-reinforced coral aggregate concrete: Interpretable optimized XGBoost model and experimental validation. Structures 2024, 64, 106516. [Google Scholar] [CrossRef]
- Hu, J. Prediction of the internal corrosion rate for oil and gas pipelines and influence factor analysis with interpretable ensemble learning. Int. J. Press. Vessel. Pip. 2024, 212, 105329. [Google Scholar] [CrossRef]
- Keun Kim, M.; Cremers, B.; Fu, N.; Liu, J. Predictive and correlational analysis of heating energy consumption in four residential apartments with sensitivity analysis using long Short-Term memory and Generalized regression neural network models. Sustain. Energy Technol. Assess. 2024, 71, 103976. [Google Scholar] [CrossRef]
- Zhang, C.; Ma, L.; Han, X.; Zhao, T. Reconstituted data-driven air conditioning energy consumption prediction system employing occupant-orientated probability model as input and swarm intelligence optimization algorithms. Energy 2024, 288, 129799. [Google Scholar] [CrossRef]
- Zhang, C.; Luo, Z.; Rezgui, Y.; Zhao, T. Enhancing multi-scenario data-driven energy consumption prediction in campus buildings by selecting appropriate inputs and improving algorithms with attention mechanisms. Energy Build. 2024, 311, 114133. [Google Scholar] [CrossRef]
- Xu, M.; Liu, W.; Wang, S.; Tian, J.; Wu, P.; Xie, C. A 24-Step Short-Term Power Load Forecasting Model Utilizing KOA-BiTCN-BiGRU-Attentions. Energies 2024, 17, 4742. [Google Scholar] [CrossRef]
- Guenoukpati, A.; Agbessi, A.P.; Salami, A.A.; Bakpo, Y.A. Hybrid Long Short-Term Memory Wavelet Transform Models for Short-Term Electricity Load Forecasting. Energies 2024, 17, 4914. [Google Scholar] [CrossRef]
Equipment | Performance Parameters | Quantity (units) | Remarks | |
---|---|---|---|---|
Ground source heat pump | Summer rated cooling capacity: 959 kW/unit, COP: 6.836 Winter rated heating capacity: 921 kW/unit, COP: 4.736 | 2 | - | |
User-side pump | Flow rate: 46 L/s, Head: 31 m, Power: 22 kW | 3 | 2 operational, 1 standby | |
Ground-side pump | Flow rate: 53 L/s, Head: 24 m, Power: 18.5 kW | 3 | 2 operational, 1 standby |
Serial Number | Feature Parameter | Symbol |
---|---|---|
x1 | Outdoor temperature (°C) | Tout |
x2 | Relative humidity (%) | φ |
x3 | Solar radiation (W/m2) | SR |
x4 | Outdoor wind speed (m/s) | v |
x5 | Wind direction (°) | α |
x6 | Building cooling Load (kW) | CLt |
Model | Hyperparameter Settings |
---|---|
XGBoost | The learning rate, maximum depth of the decision tree, minimum child weight, gamma value, and L2 regularization parameter (lambda) are set to 0.01, 3, 1, and 2, respectively. |
RF | Training is conducted using the regRF_train function, with a minimum leaf node size of 5, 100 decision trees, and 5-fold cross-validation. |
ELM | The sigmoid activation function is used, with 50 hidden layer nodes, and L2 regularization parameter of 0.01. |
RBF | The expansion rate for the radial basis function is set to 50, the number of hidden layer nodes is set to 10, the learning algorithm employs gradient descent, and the L2 regularization parameter is set to 0.001. |
LightGBM | The number of leaf nodes, learning rate, number of estimators, number of trees, feature sampling rate, and L1 regularization parameter are set to 11, 0.006, 2000, 100, 0.7, and 0.1, respectively. |
SVM | The RBF kernel is used with a penalty factor of 4, a loss function value of 0.01, and a radial basis function parameter of 0.8, as well as 3-fold cross-validation. |
LSTM | The Adam optimizer is used with a maximum number of iterations of 1200 and an initial learning rate of 0.01. After 800 iterations, the learning rate is decreased to 0.005 and the dropout rate is set to 0.2. |
CNN | The Stochastic Gradient Descent with Momentum (SGDM) optimizer is applied with a maximum number of iterations of 1200 and an initial learning rate of 0.01. After 800 iterations, the learning rate is decreased to 0.001 and the L2 regularization parameter is set to 0.001. |
Hyperparameter | Value | Hyperparameter | Value |
---|---|---|---|
Number of neurons in hidden layers | 24 | Capacity of experience replay buffer | 1 × 106 |
Learning rate | 1 × 10−4 | Batch size | 128 |
Gradient threshold | 1 | Epsilon (ε) | 5 × 10−4 |
L2 regularization factor | 1 × 10−4 | Delta epsilon (Δε) | 1 × 10−2 |
Target smoothing factor | 1 × 10−3 | Maximum number of episodes | 300 |
Discount factor | 0.99 | Length of score averaging window | 200 |
Model | Evaluation Metrics | |||
---|---|---|---|---|
R2 | MAE | MAPE (%) | CV-RMSE (%) | |
XGBoost | 0.9816 | 29.5769 | 6.6207 | 10.6118 |
RF | 0.9720 | 63.7865 | 14.2630 | 12.8096 |
ELM | 0.9416 | 97.8478 | 24.4457 | 18.3890 |
RBF | 0.9095 | 119.4694 | 32.3066 | 22.7541 |
LightGBM | 0.9728 | 63.7283 | 13.4827 | 12.8199 |
SVM | 0.9235 | 98.3865 | 20.4315 | 21.3756 |
LSTM | 0.9371 | 101.5192 | 31.9522 | 18.8020 |
CNN | 0.9468 | 95.3509 | 34.4444 | 17.1945 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, Z.; Qiu, Y.; Zhou, S.; Tian, Y.; Zhu, X.; Liu, J.; Lu, S. Enhancing Air Conditioning System Efficiency Through Load Prediction and Deep Reinforcement Learning: A Case Study of Ground Source Heat Pumps. Energies 2025, 18, 199. https://doi.org/10.3390/en18010199
Wang Z, Qiu Y, Zhou S, Tian Y, Zhu X, Liu J, Lu S. Enhancing Air Conditioning System Efficiency Through Load Prediction and Deep Reinforcement Learning: A Case Study of Ground Source Heat Pumps. Energies. 2025; 18(1):199. https://doi.org/10.3390/en18010199
Chicago/Turabian StyleWang, Zhitao, Yubin Qiu, Shiyu Zhou, Yanfa Tian, Xiangyuan Zhu, Jiying Liu, and Shengze Lu. 2025. "Enhancing Air Conditioning System Efficiency Through Load Prediction and Deep Reinforcement Learning: A Case Study of Ground Source Heat Pumps" Energies 18, no. 1: 199. https://doi.org/10.3390/en18010199
APA StyleWang, Z., Qiu, Y., Zhou, S., Tian, Y., Zhu, X., Liu, J., & Lu, S. (2025). Enhancing Air Conditioning System Efficiency Through Load Prediction and Deep Reinforcement Learning: A Case Study of Ground Source Heat Pumps. Energies, 18(1), 199. https://doi.org/10.3390/en18010199