A Cluster- and Temperature-Aware Auto-Ensemble Model for Airport Cooling Load Forecasting
Abstract
1. Introduction
1.1. Research Background
1.2. Related Work
1.3. Major Contributions and Novelties
- A comprehensive feature engineering pipeline is developed for cooling load forecasting, including correlation-based feature screening, temporal dependency representation, and temperature-derived indicator construction.
- Representative deep learning and machine learning baselines are implemented and evaluated under a unified experimental protocol, ensuring consistent data preprocessing and leakage-free training–testing settings for both short-term (day-ahead) and long-term (season-wide) forecasting.
- An ON/OFF gating mechanism is introduced to explicitly identify operating states, effectively reducing structural mispredictions and ensuring physical consistency during intermittent zero-load periods.
- A joint regime partitioning strategy is developed by integrating operational clustering and quantile-based temperature thresholds, systematically characterizing heterogeneous operating modes and thermal dynamics.
- CATS-Ens is proposed to support regime-specific expert selection and adaptive blending, improving forecasting robustness under complex operational states and peak-risk thermal conditions.
2. Methods
2.1. Dataset
2.2. Baseline Models and Evaluation Metrics
2.3. Algorithm Framework and Workflow
2.4. Problem Formulation and Notations
3. Experimental Analysis
3.1. Feature Engineering
3.1.1. Temporal Dependency Analysis and Feature Construction
3.1.2. Environmental Feature Selection via Spearman Analysis
3.1.3. Operational Regime Identification via Clustering
- Cluster 0: Medium-load Dynamic Operation (Medium dynamic);
- Cluster 1: OFF / Near-zero Load (OFF-like);
- Cluster 2: Peak-load / Stress Operation (Peak / stress);
- Cluster 3: Low-load Steady Operation (Low steady).
3.2. Development and Evaluation of the CATS-Ens Approach
3.2.1. Cooling Load ON/OFF Gating and Temperature-Based Regime Partitioning
- Data-Adaptive Robustness: It provides a dynamic partition that is robust to climate distribution shifts across different seasons and geographical sites, avoiding the bias typically associated with fixed absolute temperature thresholds.
- Sample Balance: It ensures sufficient and relatively balanced training samples within each thermal regime, which is crucial for the stable estimation of regime-wise expert blending weights.
- Nonlinear Adaptation: A three-regime design aligns well with the physical reality of nonlinear temperature–cooling load responses, specifically enhancing the model’s sensitivity and robustness under high-temperature peak-demand conditions.
3.2.2. Cluster- and Temperature-Aware Auto-Ensemble Mechanism
- Step 1: Regime Matching. For each timestamp in the forecasting horizon, the system evaluates the real-time weather inputs and operational cluster tags to determine the active joint regime index.
- Step 2: Top-2 Expert Selection. Instead of deploying all available models, which increases the computational overhead and the risk of overfitting, CATS-Ens retrieves the Top-2 best-performing experts for the currently active regime based on their historical validation RMSE. The expert pool comprises the baseline models discussed in Section 2.
- Step 3: Adaptive Blending. The predictions from the selected Top-2 experts are fused using the optimized blending weights specific to that regime. This step ensures that models excelling in capturing peak loads dominate during high-stress regimes, while models better at capturing steady-state trends dominate during baseline operations.
- Step 4: Final Constraint Integration. In the final step, the integrated ensemble prediction is filtered through the ON/OFF gate. This guarantees that the final output remains physically bounded and strictly non-negative.
3.2.3. Short-Term Cooling Load Forecasting Performance
3.2.4. Long-Term Cooling Load Forecasting Performance
3.3. Results and Discussion
Economic Implications Based on Forecasting Accuracy
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| RNN | Recurrent neural networks |
| LSTM | Long short-term memory networks |
| GRU | Gated recurrent units |
| MAE | Mean absolute error |
| RMSE | Root mean square error |
| MAPE | Mean absolute percentage error |
| sMAPE | Symmetric mean absolute percentage error |
| ACF | Autocorrelation function |
| EWMA | Exponentially weighted moving average |
| CDH | Cooling degree-hour |
| COP | Coefficient of performance |
| HPO | Hyperparameter optimization |
| CATS-Ens | Cluster- and temperature-aware auto-ensemble model |
| Nomenclature | |
| Observed cooling load at time t (kW) | |
| Normalized cooling load at time t (-) | |
| Predicted cooling load at time t (kW) | |
| Ensemble prediction before ON/OFF gating (kW) | |
| Final gated prediction (kW) | |
| Prediction of the m-th expert model at time t (kW) | |
| Forecasting error at time t (kW) | |
| Electrical power consumption at time t (kW) | |
| Electrical power deviation induced by forecasting error (kW) | |
| Cumulative incremental operational cost deviation ($) | |
| Electricity price at time t ($/kWh) | |
| Time resolution (h) | |
| Coefficient of performance (-) | |
| xt | Input feature vector at time t (-) |
| L | Look-back window length (h) |
| Forecasting mapping function (-) | |
| Forecasting function parameterized by (-) | |
| Model parameters (-) | |
| N | Number of samples (-) |
| Mean observed cooling load (kW) | |
| max(y) | Maximum observed cooling load during the selected cooling seasons (kW) |
| Coefficient of determination (-) | |
| ACF(k) | Autocorrelation coefficient at lag k (-) |
| k | Time lag (h) |
| Spearman rank correlation coefficient (-) | |
| Rank difference of the i-th sample (-) | |
| K | Number of clusters (-) |
| The j-th cluster (-) | |
| Centroid of cluster (-) | |
| Predicted ON probability at time t (-) | |
| ON/OFF gating variable at time t (-) | |
| Probability threshold for ON/OFF gating (-) | |
| Outdoor air temperature at time t (°C) | |
| Outdoor air temperature series of the validation set (°C) | |
| Quantile-based temperature thresholds (°C) | |
| Temperature regime index (-) | |
| Cluster label (-) | |
| Joint regime index (-) | |
| M | Number of expert models (-) |
| Validation index set of regime s (-) | |
| Optimal regime-specific blending weight (-) | |
| Indicator function (-) | |
| Quantile function at probability p (-) | |
| Quantile probabilities for temperature partitioning (-) | |
| Validation dataset (-) | |
| The m-th expert forecasting model (-) | |
| Root mean square error of the m-th expert in regime s (kW) | |
| Indices of the Top-2 best-performing expert models for regime s (-) | |
| h | Prediction horizon (h) |
| Prediction horizon set (h) | |
| zt | Exogenous input variables at time t (-) |
References
- Fan, C.; Xiao, F.; Zhao, Y. A short-term building cooling load prediction method using deep learning algorithms. Appl. Energy 2017, 195, 222–233. [Google Scholar] [CrossRef]
- Chen, B.; Yang, W.; Yan, B.; Zhang, K. An advanced airport terminal cooling load forecasting model integrating SSA and CNN-Transformer. Energy Build. 2024, 309, 114000. [Google Scholar] [CrossRef]
- Ciulla, G.; D’Amico, A. Building energy performance forecasting: A multiple linear regression approach. Appl. Energy 2019, 253, 113500. [Google Scholar] [CrossRef]
- Wei, Y.; Zhang, X.; Shi, Y.; Xia, L.; Pan, S.; Wu, J.; Han, M.; Zhao, X. A review of data-driven approaches for prediction and classification of building energy consumption. Renew. Sustain. Energy Rev. 2018, 82, 1027–1047. [Google Scholar] [CrossRef]
- Zhou, Y.; Zheng, S.; Hensen, J.L. Machine learning-based digital district heating/cooling with renewable integrations and advanced low-carbon transition. Renew. Sustain. Energy Rev. 2024, 199, 114466. [Google Scholar] [CrossRef]
- Pan, Y.; Zhu, M.; Lv, Y.; Yang, Y.; Liang, Y.; Yin, R.; Yang, Y.; Jia, X.; Wang, X.; Zeng, F.; et al. Building energy simulation and its application for building performance optimization: A review of methods, tools, and case studies. Adv. Appl. Energy 2023, 10, 100135. [Google Scholar] [CrossRef]
- Hasan, M.; Mifta, Z.; Papiya, S.J.; Roy, P.; Dey, P.; Salsabil, N.A.; Chowdhury, N.U.R.; Farrok, O. A state-of-the-art comparative review of load forecasting methods: Characteristics, perspectives, and applications. Energy Convers. Manag. X 2025, 26, 100922. [Google Scholar] [CrossRef]
- Quanwei, T.; Guijun, X.; Wenju, X. MGMI: A novel deep learning model based on short-term thermal load prediction. Appl. Energy 2024, 376, 124209. [Google Scholar] [CrossRef]
- Lim, B.; Zohren, S. Time-series forecasting with deep learning: A survey. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2021, 379, 20200209. [Google Scholar] [CrossRef] [PubMed]
- Li, K.; Duan, P.; Cao, X.; Cheng, Y.; Zhao, B.; Xue, Q.; Feng, M. A multi-energy load forecasting method based on complementary ensemble empirical model decomposition and composite evaluation factor reconstruction. Appl. Energy 2024, 365, 123283. [Google Scholar] [CrossRef]
- Dai, S.; Meng, F.; Dai, H.; Wang, Q.; Chen, X.; Bai, W.; Shi, P.; Allmendinger, R.; Zhang, Y.; Liu, J. Machine learning in peak demand forecasting: Foundations, trends, and insights. Renew. Sustain. Energy Rev. 2026, 227, 116500. [Google Scholar] [CrossRef]
- Fida, K.; Abbasi, U.; Adnan, M.; Iqbal, S.; Mohamed, S.E.G. A comprehensive survey on load forecasting hybrid models: Navigating the Futuristic demand response patterns through experts and intelligent systems. Results Eng. 2024, 23, 102773. [Google Scholar] [CrossRef]
- Duan, P.; Zhao, X.; Hu, J.; Li, K.; Xue, Q.; Cao, X.; Wang, Y.; Zhao, B.; Zhang, C.; Yuan, X. Multi-energy load forecasting incorporating AI algorithms: Research status and trends in integrated energy systems. Renew. Sustain. Energy Rev. 2026, 229, 116611. [Google Scholar] [CrossRef]
- Foucquier, A.; Robert, S.; Suard, F.; Stéphan, L.; Jay, A. State of the art in building modelling and energy performances prediction: A review. Renew. Sustain. Energy Rev. 2013, 23, 272–288. [Google Scholar] [CrossRef]
- Frayssinet, L.; Merlier, L.; Kuznik, F.; Hubert, J.L.; Milliez, M.; Roux, J.J. Modeling the heating and cooling energy demand of urban buildings at city scale. Renew. Sustain. Energy Rev. 2018, 81, 2318–2327. [Google Scholar] [CrossRef]
- Balali, Y.; Chong, A.; Busch, A.; O’Keefe, S. Energy modelling and control of building heating and cooling systems with data-driven and hybrid models—A review. Renew. Sustain. Energy Rev. 2023, 183, 113496. [Google Scholar] [CrossRef]
- Chen, Z.; Xiao, F.; Guo, F.; Yan, J. Interpretable machine learning for building energy management: A state-of-the-art review. Adv. Appl. Energy 2023, 9, 100123. [Google Scholar] [CrossRef]
- Wang, Z.; Wang, Y.; Zeng, R.; Srinivasan, R.S.; Ahrentzen, S. Random Forest based hourly building energy prediction. Energy Build. 2018, 171, 11–25. [Google Scholar] [CrossRef]
- Runge, J.; Zmeureanu, R. Forecasting energy use in buildings using artificial neural networks: A review. Energies 2019, 12, 3254. [Google Scholar] [CrossRef]
- Eren, Y.; Küçükdemiral, İ. A comprehensive review on deep learning approaches for short-term load forecasting. Renew. Sustain. Energy Rev. 2024, 189, 114031. [Google Scholar] [CrossRef]
- Dong, F.; Yu, J.; Quan, W.; Xiang, Y.; Li, X.; Sun, F. Short-term building cooling load prediction model based on DwdAdam-ILSTM algorithm: A case study of a commercial building. Energy Build. 2022, 272, 112337. [Google Scholar] [CrossRef]
- Fan, C.; Wang, J.; Gang, W.; Li, S. Assessment of deep recurrent neural network-based strategies for short-term building energy predictions. Appl. Energy 2019, 236, 700–710. [Google Scholar] [CrossRef]
- Zhao, Y.; Gong, M.; Sun, J.; Han, C.; Jing, L.; Li, B.; Zhao, Z. A new hybrid optimization prediction strategy based on SH-Informer for district heating system. Energy 2023, 282, 129010. [Google Scholar] [CrossRef]
- Zhang, Y.; Wang, D.; Wang, G.; Xu, P.; Zhu, Y. Data-driven building load prediction and large language models: Comprehensive overview. Energy Build. 2025, 326, 115001. [Google Scholar] [CrossRef]
- Li, L.; Su, X.; Bi, X.; Lu, Y.; Sun, X. A novel Transformer-based network forecasting method for building cooling loads. Energy Build. 2023, 296, 113409. [Google Scholar] [CrossRef]
- Chen, Y.; Wang, H.; Chen, Z. Sensitivity analysis of physical regularization in physics-informed neural networks (PINNs) of building thermal modeling. Build. Environ. 2025, 273, 112693. [Google Scholar] [CrossRef]
- Xu, L.; Wang, S.; Tang, R. Probabilistic load forecasting for buildings considering weather forecasting uncertainty and uncertain peak load. Appl. Energy 2019, 237, 180–195. [Google Scholar] [CrossRef]
- Xu, L.; Hu, M.; Fan, C. Probabilistic electrical load forecasting for buildings using Bayesian deep neural networks. J. Build. Eng. 2022, 46, 103853. [Google Scholar] [CrossRef]
- Lin, L.; Liu, X.; Zhang, T.; Liu, X.; Rong, X. Cooling load characteristic and uncertainty analysis of a hub airport terminal. Energy Build. 2021, 231, 110619. [Google Scholar] [CrossRef]
- Hu, M.; Ge, D.; Telford, R.; Stephen, B.; Wallom, D.C. Classification and characterization of intra-day load curves of PV and non-PV households using interpretable feature extraction and feature-based clustering. Sustain. Cities Soc. 2021, 75, 103380. [Google Scholar] [CrossRef]
- Amin, A.; Mourshed, M. Weather and climate data for energy applications. Renew. Sustain. Energy Rev. 2024, 192, 114247. [Google Scholar] [CrossRef]
- Xianliang, G.; Jingchao, X.; Zhiwen, L.; Jiaping, L. Analysis to energy consumption characteristics and influencing factors of terminal building based on airport operating data. Sustain. Energy Technol. Assessments 2021, 44, 101034. [Google Scholar] [CrossRef]
- Zhang, Z.; Sun, H.; Huang, T. Spatiotemporal Analysis of Building Energy Performance and Urban Morphology Interactions Across Chinese Climate Zones: A Machine Learning Approach to Predicting Cooling Demand in High-Density Districts. Case Stud. Therm. Eng. 2025, 79, 107592. [Google Scholar] [CrossRef]
- Liu, J.; Zhang, Y.; Zhai, Z.; Wang, Y.; Ye, L.; Ding, Y. Comparison and analysis of different machine learning models for predicting air-conditioning cooling loads in comprehensive large public buildings across various time scales. Appl. Therm. Eng. 2025, 278, 127328. [Google Scholar] [CrossRef]
- Wu, S.; Wang, Y.; Zhang, H.; Hu, R.; Zhang, Y.; Du, J.; Liu, L. Dual-modal cross-attention integrated model for airport terminal cooling load prediction using variational mode decomposition. J. Build. Eng. 2025, 104, 112344. [Google Scholar] [CrossRef]
- Chalapathy, R.; Khoa, N.L.D.; Sethuvenkatraman, S. Comparing multi-step ahead building cooling load prediction using shallow machine learning and deep learning models. Sustain. Energy Grids Netw. 2021, 28, 100543. [Google Scholar] [CrossRef]
- Atalla, T.; Gualdi, S.; Lanza, A. A global degree days database for energy-related applications. Energy 2018, 143, 1048–1055. [Google Scholar] [CrossRef]
- Zhang, L.; Cao, M.; Li, N.; Luo, L.; Chen, Y.; Li, Z. Machine learning prediction of heating and cooling loads based on Athenian residential buildings’ simulation dataset. Energy Build. 2025, 342, 115808. [Google Scholar] [CrossRef]
- Erbey, M.M.; Ozdil, N.F.T. Forecasting the European Union’s Space Cooling Potential Using the Cooling Degree-Day Factor (1970–2024). Energy 2026, 344, 140113. [Google Scholar] [CrossRef]
- Yu, M.; Niu, D.; Zhao, J.; Li, M.; Sun, L.; Yu, X. Building cooling load forecasting of IES considering spatiotemporal coupling based on hybrid deep learning model. Appl. Energy 2023, 349, 121547. [Google Scholar] [CrossRef]
- Yu, D.; Liu, T.; Wang, K.; Li, K.; Mercangöz, M.; Zhao, J.; Lei, Y.; Zhao, R. Transformer based day-ahead cooling load forecasting of hub airport air-conditioning systems with thermal energy storage. Energy Build. 2024, 308, 114008. [Google Scholar] [CrossRef]
- Huang, S.; Ali, N.A.M.; Shaari, N.; Noor, M.S.M. Multi-scene design analysis of integrated energy system based on feature extraction algorithm. Energy Rep. 2022, 8, 466–476. [Google Scholar] [CrossRef]
- Zhu, C.; Yang, Y.; Chen, H.; Zeng, M. An Enhanced, Lightweight Large Language Model-Driven Time Series Forecasting Approach for Air Conditioning System Cooling Load Forecasting. Mathematics 2025, 13, 3887. [Google Scholar] [CrossRef]








| Ref. | Main Idea/Method | Pros | Cons/Limitations | Relevance to This Work |
|---|---|---|---|---|
| [14] | Review of building modeling and energy performance prediction | Comprehensive taxonomy of building modeling methods and prediction tasks | Not focused on modern deep learning or condition-adaptive ensembles | Provides foundational background and motivates data-driven forecasting |
| [16] | Review of data-driven and hybrid models for building heating/cooling systems | Summarizes ML/hybrid modeling and control-oriented implications | Limited emphasis on regime-wise adaptive ensemble mechanisms | Supports the motivation of robustness and hybrid perspectives |
| [18] | Random forest for hourly building energy prediction | Strong generalization and robustness under noise; interpretable feature importance | Typically learns a global mapping; limited regime specialization | Representative tree-ensemble baseline family; contrasts with regime-wise adaptation |
| [22] | Systematic assessment of deep recurrent multi-step forecasting strategies | Clarifies inference strategies (recursive/direct/MIMO) and error accumulation effects | Not designed for regime-aware expert selection or intermittency gating | Supports the need for stability under long-horizon forecasting |
| [25] | Transformer-based network for building cooling load forecasting | Models long-range temporal dependencies; strong nonlinear representation | May still behave as a global predictor without explicit regime adaptation | Motivates including Transformer-style baselines and long-range modeling discussion |
| [27] | Probabilistic building load forecasting considering weather and peak uncertainty | Improves reliability under weather/peak uncertainties; quantifies predictive distributions | Increased modeling complexity; not focused on regime-wise expert blending | Complements reliability discussion; orthogonal to our deterministic regime-wise ensemble |
| [28] | Bayesian deep neural networks for probabilistic building load forecasting | Quantifies epistemic/aleatoric uncertainty with Bayesian deep learning | Focuses on uncertainty rather than condition-dependent expert selection | Provides uncertainty-aware perspective and contextualizes deterministic setting |
| [29] | Cooling load characteristics and uncertainty analysis of a hub airport terminal | Reveals heterogeneity and uncertainty sources in airport cooling demand | Does not propose an adaptive ensemble forecasting mechanism | Domain evidence motivating regime heterogeneity modeling in airports |
| [30] | Interpretable feature extraction and clustering for load curve classification | Effective mode discovery and interpretable clustering features | Not a forecasting framework; clustering not integrated with adaptive ensemble | Inspires the use of clustering to represent operational regimes |
| Model | Bias (Mean) | MAE | P95 | P99 |
|---|---|---|---|---|
| CATS-Ens | 0.19% | 0.96% | 2.15% | 2.63% |
| RNN | −0.52% | 1.06% | 2.68% | 3.11% |
| GRU | 0.85% | 1.21% | 2.91% | 3.25% |
| LSTM | 0.58% | 1.47% | 4.54% | 5.51% |
| Transformer | 0.61% | 1.54% | 2.89% | 3.70% |
| XGBoost | 0.41% | 2.04% | 4.16% | 5.62% |
| Model | Bias (Mean) | MAE | P95 | P99 |
|---|---|---|---|---|
| CATS-Ens | 0.21% | 2.79% | 8.18% | 11.60% |
| RNN | −0.47% | 3.04% | 8.96% | 13.00% |
| GRU | 0.89% | 3.15% | 8.55% | 13.10% |
| LSTM | 0.60% | 3.29% | 9.60% | 14.60% |
| XGBoost | 0.46% | 3.51% | 12.10% | 16.10% |
| Transformer | 0.53% | 3.66% | 11.10% | 16.90% |
| Model | MAE | RMSE | MAPE (%) | sMAPE (%) | |
|---|---|---|---|---|---|
| RNN | 222.34 | 307.19 | 41.80 | 34.10 | 0.960 |
| LSTM | 240.01 | 342.49 | 41.14 | 34.22 | 0.951 |
| GRU | 229.78 | 314.57 | 45.31 | 34.13 | 0.958 |
| Transformer | 267.18 | 385.11 | 34.31 | 42.49 | 0.938 |
| XGBoost | 256.06 | 388.74 | 45.25 | 34.66 | 0.937 |
| CATS-Ens | 203.92 | 281.51 | 36.99 | 31.68 | 0.967 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Xie, X.-Y.; Fan, Y.-W.; Wang, Y.-Z.; Li, J.-R.; Zhang, X.-R. A Cluster- and Temperature-Aware Auto-Ensemble Model for Airport Cooling Load Forecasting. Energies 2026, 19, 1375. https://doi.org/10.3390/en19051375
Xie X-Y, Fan Y-W, Wang Y-Z, Li J-R, Zhang X-R. A Cluster- and Temperature-Aware Auto-Ensemble Model for Airport Cooling Load Forecasting. Energies. 2026; 19(5):1375. https://doi.org/10.3390/en19051375
Chicago/Turabian StyleXie, Xiao-Yu, Yu-Wei Fan, Yi-Zhou Wang, Jie-Ru Li, and Xin-Rong Zhang. 2026. "A Cluster- and Temperature-Aware Auto-Ensemble Model for Airport Cooling Load Forecasting" Energies 19, no. 5: 1375. https://doi.org/10.3390/en19051375
APA StyleXie, X.-Y., Fan, Y.-W., Wang, Y.-Z., Li, J.-R., & Zhang, X.-R. (2026). A Cluster- and Temperature-Aware Auto-Ensemble Model for Airport Cooling Load Forecasting. Energies, 19(5), 1375. https://doi.org/10.3390/en19051375
