An Enhanced, Lightweight Large Language Model-Driven Time Series Forecasting Approach for Air Conditioning System Cooling Load Forecasting
Abstract
1. Introduction
1.1. Research Background
1.2. Related Works
- This work is the first to investigate the application of LLM-assisted time series forecasting for cooling load forecasting in commercial buildings. By deploying a frozen LLM at the edge or terminal level and combining it with historical building data, the model enables accurate forecasting of future cooling loads, providing essential support for proactive HVAC parameter tuning. The results demonstrate the promising potential of LLMs in this domain.
- A lightweight BERT-base model is deployed as the backbone LLM strikes an effective balance between model adaptability and computational efficiency. It achieves competitive forecasting accuracy while significantly reducing training time, hardware overhead, and reliance on costly external data sources.
- The improved temporal feature attention mechanism is embedded prompt as prefix. Experimental results demonstrate that, compared to the original version, this mechanism effectively alleviates the prediction response lag during trend transitions in long-term time series forecasting.
2. Enhance-Time-LLM Forecasting Framework
2.1. Input Transformation and Preprocessing
2.2. Lightweight Backbone Model Inference and Prediction
2.3. Flattened Output Projection
3. Experimental Results
3.1. Engineering Scene Description
3.2. Data Processing and Analysis
3.3. Experimental Design
3.4. Long-Term Forecasting
3.5. Short-Term Forecasting
3.6. Few-Shot Forecasting
3.7. Result Discussion
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| ACO | Ant Colony Optimization |
| ANNs | artificial neural networks |
| COP | Coefficient of Performance |
| EERa | Energy Efficiency Ratio |
| FFT | Fast Fourier Transform |
| GCN | Graph Convolutional Networks |
| GRNs | Gated Residual Networks |
| HVAC | Heating, Ventilation, and Air Conditioning |
| IQR | interquartile range |
| LLM | large language model |
| MAE | mean absolute error |
| NLP | natural language processing |
| PaP | Prompt-as-Prefix |
| RMSE | root mean square error |
| RNNs | recurrent neural networks |
| TFT | Temporal Fusion Transformer |
| VSNs | Variable Selection Networks |
| WNN | Wavelet Neural Network |
References
- The State Council Information Office of the People’s Republic of China. China’s Energy Transition. 2024. Available online: http://www.scio.gov.cn/zfbps/zfbps_2279/202408/t20240829_860523.html (accessed on 30 June 2025).
- National Development and Reform Commission of the People’s Republic of China. Action Plan for Green and Efficient Refrigeration. 2019. Available online: https://www.ndrc.gov.cn/xxgk/zcfb/tz/201906/W020190905514433438027.pdf (accessed on 30 June 2025).
- Liu, X.; Lin, L.; Liu, X.; Zhang, T.; Rong, X.; Yang, L.; Xiong, D. Evaluation of air infiltration in a hub airport terminal: On-site measurement and numerical simulation. Build. Environ. 2018, 143, 163–177. [Google Scholar] [CrossRef]
- Sun, L.; Hu, Z.; Mae, M.; Imaizumi, T. Deep transfer learning strategy based on TimesBlock-CDAN for predicting thermal environment and air conditioner energy consumption in residential buildings. Appl. Energy 2025, 381, 125188. [Google Scholar] [CrossRef]
- Zou, M.; Huang, W.; Jin, J.; Hu, B.; Liu, Z. Deep spatio-temporal feature fusion learning for multi-step building cooling load forecasting. Energy Build. 2024, 1, 9. [Google Scholar] [CrossRef]
- Huang, X.; Zhou, X.; Yan, J.; Huang, X. Cooling load forecasting method for central air conditioning systems in manufacturing plants based on iTransformer-BiLSTM. Appl. Sci. 2025, 15, 5214. [Google Scholar] [CrossRef]
- Zhou, M.; Wang, L.; Hu, F.; Zhu, Z.; Zhang, Q.; Kong, W.; Zhou, G.; Wu, C.; Cui, E. ISSA-LSTM: A new data-driven method of heat load forecasting for building air conditioning. Energy Build. 2024, 321, 114698. [Google Scholar] [CrossRef]
- Hu, M.; Xiao, F.; Jørgensen, J.B.; Wang, S. Frequency control of air conditioners in response to real-time dynamic electricity prices in smart grids. Appl. Energy 2019, 242, 92–106. [Google Scholar] [CrossRef]
- Miller, J.A.; Aldosari, M.; Saeed, F.; Barna, N.H.; Rana, S.; Arpinar, I.B.; Liu, N. A survey of deep learning and foundation models for time series forecasting. arXiv 2024, arXiv:2401.13912. [Google Scholar] [CrossRef]
- Guo, H.; Xiao, G.; Su, L.; Zhou, J.; Wang, D.-H. Local neighbor propagation on graphs for mismatch removal. Inf. Sci. 2024, 653, 119749. [Google Scholar] [CrossRef]
- Huang, Y.; Li, C. Accurate heating, ventilation and air conditioning system load prediction for residential buildings using improved ant colony optimization and wavelet neural network. J. Build. Eng. 2021, 35, 101972. [Google Scholar] [CrossRef]
- Dong, F.; Yu, J.; Quan, W.; Xiang, Y.; Li, X.; Sun, F. Short-term building cooling load prediction model based on DwdAdam-ILSTM algorithm: A case study of a commercial building. Energy Build. 2022, 272, 112337. [Google Scholar] [CrossRef]
- Qu, Z.; Meng, Y.; Hou, X.; Chi, R.; Ai, Y.; Wu, Z. Integrated energy short-term multivariate load forecasting based on PatchTST secondary decoupling reconstruction for progressive layered extraction multi-task learning network. Expert Syst. Appl. 2025, 269, 126446. [Google Scholar] [CrossRef]
- Luo, Q.; Chen, Y.; Gong, C.; Lu, Y.; Cai, Y.; Ying, Y.; Liu, G. Research on Short-Term Air Conditioning Cooling Load Forecasting Based on Bidirectional LSTM. In Proceedings of the 2022 4th International Conference on Intelligent Control, Measurement and Signal Processing (ICMSP), Hangzhou, China, 8–10 July 2022; pp. 507–511. [Google Scholar] [CrossRef]
- Li, L.; Su, X.; Bi, X.; Lu, Y.; Sun, X. A novel Transformer-based network forecasting method for building cooling loads. Energy Build. 2023, 296, 113409. [Google Scholar] [CrossRef]
- Lim, B.; Arık, S.Ö.; Loeff, N.; Pfister, T. Temporal fusion transformers for interpretable multi-horizon time series forecasting. Int. J. Forecast. 2021, 37, 1748–1764. [Google Scholar] [CrossRef]
- Yu, D.; Liu, T.; Wang, K.; Li, K.; Mercangöz, M.; Zhao, J.; Lei, Y.; Zhao, R. Transformer based day-ahead cooling load forecasting of hub airport air-conditioning systems with thermal energy storage. Energy Build. 2024, 308, 114008. [Google Scholar] [CrossRef]
- Vanting, N.B.; Ma, Z.; Jørgensen, B.N. A scoping review of deep neural networks for electric load forecasting. Energy Inform. 2021, 4, 49. [Google Scholar] [CrossRef]
- Chen, W.; Liu, W.; Zheng, J.; Zhang, X. Leveraging large language model as news sentiment predictor in stock markets: A knowledge-enhanced strategy. Discov. Comput. 2025, 28, 74. [Google Scholar] [CrossRef]
- Chen, W.; Hussain, W.; Cauteruccio, F.; Zhang, X. Deep learning for financial time series prediction: A state-of-the-art review of standalone and hybrid models. Comput. Model. Eng. Sci. 2024, 139, 187–224. [Google Scholar] [CrossRef]
- Su, J.; Jiang, C.; Jin, X.; Qiao, Y.; Xiao, T.; Ma, H.; Wei, R.; Jing, Z.; Xu, J.; Lin, J. Large language models for forecasting and anomaly detection: A systematic literature review. arXiv 2024, arXiv:2402.10350. [Google Scholar] [CrossRef]
- Paroha, A.D.; Chotrani, A. A comparative analysis of TimeGPT and Time-LLM in predicting ESP maintenance needs in the oil and gas sector. Int. J. Comput. Appl. 2024, 186, 975–8887. [Google Scholar]
- Lin, H.; Yu, M. A novel distributed PV power forecasting approach based on Time-LLM. arXiv 2025, arXiv:2503.06216. [Google Scholar] [CrossRef]
- Ma, Y.; Lou, H.; Yan, M.; Sun, F.; Li, G. Spatio-temporal fusion graph convolutional network for traffic flow forecasting. Inf. Fusion 2024, 104, 102196. [Google Scholar] [CrossRef]
- Cen, S.; Lim, C.G. Multi-Task Learning of the PatchTCN-TST Model for Short-Term Multi-Load Energy Forecasting Considering Indoor Environments in a Smart Building. IEEE Access 2024, 12, 19553–19568. [Google Scholar] [CrossRef]
- Jin, M.; Wang, S.; Ma, L.; Chu, Z.; Zhang, J.Y.; Shi, X.; Chen, P.Y.; Liang, Y.; Li, Y.F.; Pan, S.; et al. Time-LLM: Time series forecasting by reprogramming large language models. In Proceedings of the International Conference on Learning Representations (ICLR), Vienna, Austria, 7–11 May 2024. [Google Scholar]
- Devlin, J.; Chang, M.; Lee, K.; Toutanova, K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv 2018, arXiv:1810.04805. [Google Scholar]
- Jiang, Y.; Sun, W. Day-Ahead Electricity Price Prediction and Error Correction Method Based on Feature Construction–Singular Spectrum Analysis–Long Short-Term Memory. Energies 2025, 18, 919. [Google Scholar] [CrossRef]
- Xu, J.; Guo, Z.; He, J.; Hu, H.; He, T.; Bai, S.; Chen, K.; Wang, J.; Fan, Y.; Dang, K.; et al. Qwen2.5-Omni Technical Report. arXiv 2025, arXiv:2503.20215. [Google Scholar] [CrossRef]
- Touvron, H.; Lavril, T.; Izacard, G.; Martinet, X.; Lachaux, M.A.; Lacroix, T.; Rozière, B.; Goyal, N.; Hambro, E.; Azhar, F. Llama: Open and efficient foundation language models. arXiv 2023, arXiv:2302.13971. [Google Scholar] [CrossRef]
- Radford, A.; Wu, J.; Child, R.; Luan, D.; Amodei, D.; Sutskever, I. Language models are unsupervised multitask learners. OpenAI Blog 2019, 1, 9. [Google Scholar]
- Zhou, J.; Yao, Y.; Chen, X.; Guo, H.; Li, Q.; Deng, Z. Triplet relationship guided density clustering for feature matching with a large number of outliers. Clust. Comput. 2025, 28, 1–15. [Google Scholar] [CrossRef]
- Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
- Zhou, H.; Zhang, S.; Peng, J.; Zhang, S.; Li, J.; Xiong, H.; Zhang, W. Informer: Beyond efficient transformer for long sequence time-series forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtual, 19–21 May 2021; Volume 35, pp. 11106–11115. [Google Scholar]
- Wu, H.; Hu, T.; Liu, Y.; Zhou, H.; Wang, J.; Long, M. Timesnet: Temporal 2d-variation modeling for general time series analysis. arXiv 2022, arXiv:2210.02186. [Google Scholar]
- Nie, Y.; Nguyen, N.H.; Sinthong, P.; Kalagnanam, J. A time series is worth 64 words: Long-term forecasting with transformers. arXiv 2022, arXiv:2211.14730. [Google Scholar]
- Zeng, A.; Chen, M.; Zhang, L.; Xu, Q. Are transformers effective for time series forecasting? In Proceedings of the AAAI Conference on Artificial Intelligence, Washington, DC, USA, 7–14 February 2023; Volume 37, pp. 11121–11128. [Google Scholar]
- Zhang, T.; Zhang, Y.; Cao, W.; Bian, J.; Yi, X.; Zheng, S.; Li, J. Less is more: Fast multivariate time series forecasting with light sampling-oriented mlp structures. arXiv 2022, arXiv:2207.01186. [Google Scholar] [CrossRef]








| Scholars | Network Type | Application Scenarios |
|---|---|---|
| Ma et al. [24] | GCN | Traffic flow forecasting. |
| Cen et al. [25] | PatchTCN-TST | Multi-load energy forecasting in a smart building. |
| Huang et al. [11] | WNN | Cooling load prediction for residential buildings. |
| Luo et al. [14] | LSTM | Air Conditioning Cooling Load Forecasting. |
| Yu et al. [17] | TFT | Airport cooling load forecasting. |
| Paroha et al. [22] | Time-LLM | ESP maintenance needs in the oil and gas sector. |
| Lin et al. [23] | Time-LLM | Distributed PV power forecasting. |
| This study | ETime-LLM | Commercial building cooling load forecasting. |
| Backbone | Bert-Base | Qwen2.5-3B | LLaMA-7B |
|---|---|---|---|
| Parameter | 110 M | 3 B | 7 B |
| LLM-dimension | 768 | 2048 | 4096 |
| Training speed (s/iter) | 0.19 | 60.474 | 201.69 |
| LLM-size | 420 MB | 5.75 GB | 12.5 GB |
| No. | Equipment | Operating Parameters | Quantity | Power (kW) | Total Power (kW) |
|---|---|---|---|---|---|
| 1 | Dual-mode Centrifugal Chiller | Cooling: 850 RT (Refrigeration Tons) Ice storage: 550 RT | 1 | 581 453 | 581 |
| 2 | Base Load Chiller Unit A | Cooling Capacity: 950 RT | 1 | 581 | 581 |
| 3 | Base Load Chiller Unit B | Cooling Capacity: 376 RT | 1 | 265 | 265 |
| 4 | Ice Storage Device | Ice Storage Capacity: 707 RTH | 7 | / | / |
| 5 | Cooling Tower | 150 m/h × 3 | 4 | 16.5 | 66 |
| 6 | Cooling Plate Heat Exchanger | Capacity: 2660 kW | 2 | / | / |
| 7 | Ethylene Glycol Pump | 674 m/h, 42 m head | 1 | 110 | 110 |
| 8 | Plate Heat Exchange Circulating Pump | 458 m/h, 38 m head | 2 Primary/1 backup | 75 | 150 |
| 9 | Base Load Chiller Circulation Pump A | 575 m/h, 38 m head | 1 Primary/1 backup | 90 | 90 |
| 10 | Base Load Chiller Circulation Pump B | 228 m/h, 35 m head | 1 Primary/1 backup | 45 | 45 |
| 11 | Cooling Water Pump A | 340 m/h, 29 m head | 3 Primary/1 backup | 45 | 135 |
| 12 | Cooling Water Pump B | 760 m/h, 24 m head | 1 Primary/1 backup | 75 | 150 |
| 13 | Ethylene Glycol Tank | / | 1 | / | / |
| 14 | Water Distributor | / | 1 | / | / |
| 15 | Water Collector | / | 1 | / | / |
| Total Power | 2173 |
| Model | MAE (±std) | RMSE (±std) | (±std) |
|---|---|---|---|
| ETime-LLM | 0.0459 ± 0.0007 | 0.0608 ± 0.0012 | 0.9508 ± 0.0184 |
| Time-LLM | 0.0555 ± 0.0008 | 0.0753 ± 0.0015 | 0.9245 ± 0.0201 |
| Informer | 0.1964 ± 0.0041 | 0.2395 ± 0.0050 | 0.3177 ± 0.0376 |
| DLinear | 0.0589 ± 0.0020 | 0.0761 ± 0.0018 | 0.9229 ± 0.0228 |
| PatchTST | 0.0685 ± 0.0032 | 0.0857 ± 0.0021 | 0.9033 ± 0.0241 |
| TimesNet | 0.0863 ± 0.0072 | 0.1078 ± 0.0025 | 0.8469 ± 0.0279 |
| LightTS | 0.0876 ± 0.0069 | 0.1114 ± 0.0028 | 0.8523 ± 0.0240 |
| Model | MAE (±std) | RMSE (±std) | (±std) |
|---|---|---|---|
| ETime-LLM | 0.0416 ± 0.0006 | 0.0518 ± 0.0013 | 0.9627 ± 0.0082 |
| Time-LLM | 0.0431 ± 0.0006 | 0.0526 ± 0.0017 | 0.9616 ± 0.0094 |
| Informer | 0.1002 ± 0.0077 | 0.1294 ± 0.0108 | 0.7945 ± 0.0252 |
| DLinear | 0.0825 ± 0.0027 | 0.1036 ± 0.0035 | 0.8683 ± 0.0124 |
| PatchTST | 0.0355 ± 0.0010 | 0.0441 ± 0.0015 | 0.9730 ± 0.0077 |
| TimesNet | 0.0542 ± 0.0046 | 0.0662 ± 0.0055 | 0.9463 ± 0.0211 |
| LightTS | 0.1322 ± 0.0106 | 0.1745 ± 0.0134 | 0.6265 ± 0.0297 |
| Model Limitation | MAE | RMSE 50% | MAE | RMSE 30% | ||
|---|---|---|---|---|---|---|
| ETime-LLM | 0.0492 | 0.0619 | 0.9487 | 0.0502 | 0.0651 | 0.9435 |
| Time-LLM | 0.0460 | 0.0591 | 0.9533 | 0.0658 | 0.0790 | 0.9161 |
| Informer | 0.2592 | 0.3005 | 0.0000 | 0.2750 | 0.3249 | 0.0000 |
| DLinear | 0.0698 | 0.0879 | 0.9081 | 0.0714 | 0.0929 | 0.8973 |
| PatchTST | 0.0734 | 0.0923 | 0.8987 | 0.0750 | 0.0945 | 0.8938 |
| TimesNet | 0.1259 | 0.1632 | 0.6830 | 0.1531 | 0.1888 | 0.5758 |
| LightTS | 0.1045 | 0.1347 | 0.7843 | 0.1599 | 0.2092 | 0.4795 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhu, C.; Yang, Y.; Chen, H.; Zeng, M. An Enhanced, Lightweight Large Language Model-Driven Time Series Forecasting Approach for Air Conditioning System Cooling Load Forecasting. Mathematics 2025, 13, 3887. https://doi.org/10.3390/math13233887
Zhu C, Yang Y, Chen H, Zeng M. An Enhanced, Lightweight Large Language Model-Driven Time Series Forecasting Approach for Air Conditioning System Cooling Load Forecasting. Mathematics. 2025; 13(23):3887. https://doi.org/10.3390/math13233887
Chicago/Turabian StyleZhu, Cong, Yongkuan Yang, Haiping Chen, and Miao Zeng. 2025. "An Enhanced, Lightweight Large Language Model-Driven Time Series Forecasting Approach for Air Conditioning System Cooling Load Forecasting" Mathematics 13, no. 23: 3887. https://doi.org/10.3390/math13233887
APA StyleZhu, C., Yang, Y., Chen, H., & Zeng, M. (2025). An Enhanced, Lightweight Large Language Model-Driven Time Series Forecasting Approach for Air Conditioning System Cooling Load Forecasting. Mathematics, 13(23), 3887. https://doi.org/10.3390/math13233887

