Next Article in Journal
Bridging Territoriality and Sense of Community in University Lounges: A Mixed Reality Simulation Study
Previous Article in Journal
Reference Static Pressure Effect on Fluctuating Wind Pressure on Roofs of Low-Rise Buildings in Open-Circuit Wind Tunnels
Previous Article in Special Issue
Valorization of Alkali–Thermal Activated Red Mud for High-Performance Geopolymer: Performance Evaluation and Environmental Effects
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Zero-Carbon Parks’ Electric Load Forecasting Considering Feature Extraction of Multi-Type Electric Load and Dual-Layer Optimization Modal Decomposition

1
School of Civil Engineering, Hunan University, Changsha 410083, China
2
Inner Mongolia Electric Power Survey & Design Institute Co., Ltd., Hohhot 010011, China
3
School of Civil Engineering, Inner Mongolia University of Technology, Hohhot 010051, China
*
Author to whom correspondence should be addressed.
Buildings 2025, 15(23), 4209; https://doi.org/10.3390/buildings15234209
Submission received: 13 October 2025 / Revised: 13 November 2025 / Accepted: 19 November 2025 / Published: 21 November 2025
(This article belongs to the Special Issue Research on Energy Efficiency and Low-Carbon Pathways in Buildings)

Abstract

The construction of zero-carbon parks has become an urgent priority. Electric load forecasting plays a decisive role in enabling the efficient operation of industrial parks; however, the complexity of electric load features within the parks has limited the accuracy of electric load forecasting. A novel electric load forecasting framework with feature extraction (TPE-AVMD-BiLSTM with feature extraction) is proposed to improve the forecasting accuracy. This framework combines feature extraction, decomposition with TPE optimization, and BiLSTM prediction. Together, these components work to remove the influence of irrelevant or redundant features. To verify the superiority of the proposed model, ablation experiments were carried out. The annual hourly electric load (8760 h) of typical industries was predicted within the park, including a data center, chemical manufacturing company, residence, shopping mall, cement manufacturing plant, and hospital. The results showed that the proposed model achieved high accuracy for all typical industries (R2 > 0.9891, EMAE < 0.3714, ERMSE < 0.4694), indicating that the forecasting has excellent coverage performance. The performance of the proposed model over the feature-free baseline confirms that incorporating more correlated features enhances prediction stability. The framework presents a viable solution for achieving accurate electric load forecasting within zero-carbon parks.

1. Introduction

Amid the ongoing global shift toward sustainable energy systems, zero-carbon parks play a critical role in diminishing dependence on conventional fossil fuels through renewable energy technologies, including solar photovoltaic systems, wind turbines, and hydrogen-based energy solutions, among other clean energy alternatives [1]. Consequently, these parks represent a crucial strategy for advancing carbon neutrality goals, underscoring their substantial relevance in the broader context of climate change mitigation.
The electric load in a zero-carbon park is shaped by its energy sources and operational dynamics, displaying the following key features:
(1)
The electric load exhibits pronounced volatility due to the variable and unpredictable nature of renewable energy generation [2].
(2)
The industrial electric load exhibits a significant bidirectional temporal correlation feature. An interdependent relationship exists between thermal and electric loads within the park. Furthermore, a strong correlation links current and past values of industrial load.
(3)
The electric load is influenced by the diverse energy requirements of various industries, endowing it with a degree of adaptability. The data center, chemical manufacturing company, residence, shopping mall, cement manufacturing plant, and hospital are typical industries. The electric load variations in these typical industries have distinct impacts on the operation management of the park. Some scholars [3,4] proposed that the optimization of electric load not only enhanced the utilization efficiency of renewable energy but also reduced the overall energy costs in zero-carbon parks. Accurate forecasting of electric load can prevent disruptions to production activities that may arise from inadequate or excessive electricity supply. However, conventional load forecasting models fail to capture bidirectional, time-sensitive, and nonlinear effects. This limitation constrains their accuracy in predicting industrial power demand, which remains a critical challenge in the field.
Various types of electric loads within the park are affected by diverse external factors. The impact may greatly differ as time goes by [5,6,7]. For example, the residential electric load can be influenced by various external factors such as holidays, special events, and weather conditions. The industrial electric load can be affected by many factors such as production plan, equipment operation efficiency, and market demand. These influences can vary significantly over time. Therefore, more and more researchers [8,9,10] began to focus on considering the forecasting of electric load with different features to improve the forecasting accuracy. The electric load features are an important topic in load forecasting and the demand response of power systems. They are extracted from time domain [11], frequency domain [12], statistical features [13], shape features [14], and deep learning-based feature extraction methods [15]. Although these methods have proven effective to some extent, the existing approaches still face certain limitations due to the nonlinear characteristics of the different electric loads and the complexity of external disturbances. Therefore, inputting valid electric load features has become a crucial direction for further improving the forecasting accuracy of electric load in zero-carbon parks.
Conventional Convolutional Neural Networks (CNNs) [16] and Long Short-Term Memory networks (LSTMs) [17] have found extensive utility across domains such as power systems, load forecasting, and energy management. CNNs are capable of extracting local temporal patterns, such as short-term load forecasting from historical load data. LSTMs are a specialized type of Recurrent Neural Network (RNN), which have been designed specifically for handling time-series data. Single load models face several challenges, including the failure to capture diversity [18], a high dependency on high-quality data [19], and an inability to leverage multiple sources of data [8]. Alhussein et al. [20] found that the CNN-LSTM hybrid model outperformed traditional methods in terms of forecasting accuracy and robustness. Subbiah et al. [21] introduced an enhanced hybrid LSTM model with advanced feature extraction to improve forecasting accuracy, which was designed to effectively capture the intricate temporal dynamics and non-linear characteristics present in the data. Future research should focus on integrating multiple deep learning models to further improve model performance.
Parameter optimization, modal decomposition, and prediction techniques are important methods for accurately predicting the changing patterns of electric load. The methods currently used for prediction are presented in Table 1. Comparing Grid Search, Tree-structured Parzen Estimator (TPE), and Random Search, TPE was found to be a comprehensive and efficient method for parameter optimization. Adaptive Variational Mode Decomposition (AVMD) is the most intelligent decomposition. The adaptive parameters (number of modes (K), bandwidth parameter alpha (α), tolerance (ε), and convergence speed tau (τ)) affect the decomposition accuracy of AVMD. The features that can be captured by Bidirectional Long Short-Term Memory (BiLSTM) are the closest to those of industrial electric loads in Table 1. As forecasting methodologies advance, researchers [4] have increasingly utilized Mean Impact Value (MIV) to quantify the contribution of individual features to a single prediction model’s output. The selection of key features based on MIV serves to enhance forecasting accuracy. Therefore, we speculate the following: (1) Combining TPE with AVMD will achieve global optimization of the adaptive parameters (K, α, ε, and τ), which can enhance the accuracy of the electrical load mode decomposition. This approach can effectively mitigate the impact of sequence non-stationarity on the prediction model. (2) If a relatively stable sequence of electric load is input for training, BiLSTM can simultaneously capture the complex relationships of power load changes from both past and future bidirectional temporal features. (3) Theoretically, TPE-AVMD-BiLSTM with feature extraction can achieve higher performance in electric load forecasting.
Based on the above scientific speculation, the purpose of this study is to systematically construct and verify a hybrid electric load forecasting framework (TPE-AVMD-BiLSTM with feature extraction), consisting of multi-dimensional feature extraction, AVMD with TPE optimization, and BiLSTM prediction. Their complementary nature can be better exploited when dealing with the diverse characteristics of electric load data in industrial parks. The proposed model is utilized to forecast the annual hourly electric load (8760 h). The predicted load profiles are subsequently integrated in a combined manner to facilitate accurate and holistic park-level load forecasting. The proposed model is significant in its stability and robustness, offering an effective solution for electric load forecasting and designing zero-carbon parks.

2. Methods

2.1. Framework for the Electric Load Forecasting

A schematic illustration of the framework is presented in Figure 1.
(1) This study incorporates an 8760 h electric load of six typical industries, including a data center, chemical manufacturing company, residence, shopping mall, cement manufacturing plant, and hospital. Box plots were employed to identify outliers in the data to ensure data integrity. Subsequently, the k-Nearest Neighbors (k-NN) algorithm was used to impute missing values, mitigating outlier-induced degradation in subsequent predictive modeling.
(2) The MIV effectively captures external factors closely related to changes in electric load by quantifying the interdependence between various features and the electric load [31]. The working time and meteorological conditions were selected as influencing features for each type of electric load. The MIV for each feature was calculated by varying the input characteristics. Subsequently, features with minimal influence were filtered out.
(3) A dual-layer optimization modal decomposition model was constructed by integrating AVMD with TPE. This approach leveraged the adaptive parameter optimization capabilities of AVMD to enhance the decomposition performance while TPE was utilized to efficiently optimize the adaptive parameters.
(4) The individual components of electric load and the filtered input feature groups were concatenated to reconstruct the datasets. The BiLSTM based on temporal information was employed for prediction, which was capable of capturing long-term dependencies in time-series data more effectively.
(5) The types of power-consuming enterprises within the park were quite flexible. Therefore, an 8760 h electric load was selected as the training set. It was necessary to evaluate the results of the predicted 8760 h electric load before output. The coefficient of determination (R2), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE) were evaluated, respectively.

2.2. Outlier Correction

It was necessary to correct outliers in the historical data before forecasting. The box-plot detection and k-NN [32] were employed for outlier analysis, which was characterized by its simplicity in implementation, lack of a training phase, high flexibility, and suitability for dynamic data. The k-NN algorithm estimates missing values based on the similarity of other samples. Firstly, the Euclidean distance between the sample with missing values and other non-missing values was calculated (Equation (1)), then the nearest k neighbors were selected according to the Euclidean distance. The mean value of the k nearest neighbors was usually selected to fill in the missing values (Equation (2)). By comparing the error variations in the filling results under different k values, the model’s sensitivity to its changes can be assessed.
d = ( x 1 y 1 ) 2 + ( x 2 y 2 ) 2 + + ( x n y n ) 2
X ^ = 1 k i = 1 k x i
where d is the Euclidean distance; x1, x2,……, xn represent the features of the current missing value sample; y1, y2,……, yn represent the characteristics of other samples; xi is the nearest sample value; X ^ is the filled-in value.

2.3. The Dual-Layer Optimization Modal Decomposition Model

The dual-layer optimization modal decomposition model that integrates AVMD with TPE is shown in Figure 2. It decomposed electric load signals into intrinsic mode functions (IMFs). This process reduced noise impact on predictions, thereby enhancing both signal stability and forecasting performance.
The electric load data of the six typical industries are strongly nonlinear and non-stationary. AVMD is a signal processing method based on variational mode decomposition (VMD) that aims to decompose a complex signal into several IMFs. Each IMF represents a specific frequency component in the signal. AVMD has strong adaptability, which can automatically adjust the frequency range of modes and decompose the electric load signal into IMFs. AVMD can match signal characteristics without the need to pre-determine the number of patterns. The adaptive parameters include the number of K, alpha (α), ε, and tau (τ), which will affect the decomposition accuracy and computational efficiency.
The AVMD was used to process the electric load signal to minimize the modal component and maximize the spectral concentration. The detailed calculation method is shown in Equations (3) and (4).
f ( t ) = k = 1 K u k ( t )
u k ( t ) , ω k = arg min u k , ω k k = 1 K α t δ ( t ) + j ω k α u k
where u k ( t ) is the Kth IMF; K is the number of modes, and set to integer values ranging from 2 to 10 [33]; uk is the component of each mode; and ω k is the center frequency of the Kth IMF.
A common method in Bayesian optimization is the TPE, which is a probabilistic model-based optimization method that is particularly suitable for optimization tasks in high-dimensional complex spaces. Model initialization of TPE built a Gaussian Mixture Density Model; this was continuously calculated and optimized. Therefore, the TPE was used to guide the exploration of the search space. To ensure the complete reproducibility of all stochastic processes, a fixed random seed was used (seed = 42). The TPE algorithm was used for sampling, with the parameter evaluation count set to a maximum of 100 times [34,35]. The algorithm was configured with a fixed maximum evaluation count as the stopping criterion and without any restart strategy. All experiments were conducted on a 4-core Intel platform, where each individual run lasted approximately 30 min.
With the aim of minimizing the spectral entropy H (alpha, K, and tau) of each IMF, the TPE was employed to obtain the optimal parameters of AVMD. The detailed calculation method is shown in Equation (5). The integration of TPE-AVMD effectively extracts multi-scale features from load signals. The TPE-AVMD enhanced decomposition precision and robustness against noise while simultaneously reducing model complexity and bolstering prediction stability.
H ( a l p h a , K , t a u ) = i = 1 K P i ( f ) log ( P i ( f ) + ε )
where Pi (f) was the power spectral density of the jth modal component; ε was the smoothing factor (value was 10−10) used to avoid numerical calculation errors; H (alpha, K, tau) represents the spectral entropy. The search range for the bandwidth parameter alpha (α) was set from 500 to 10,000 [33]. The convergence speed tau (τ) of the mode was set between 0 and 1, which was considered a broad and safe range based on experience [33].

2.4. Electric Load Forecasting Based on Multi-Categorical Load Feature Extraction

The prediction framework of electric load forecasting based on multi-dimensional feature extraction is shown in Figure 3. The process mainly included outlier correction, feature extraction, training and prediction, evaluation, and interaction for outputting results. To assess the generalizability of the models, a rigorous rolling-origin (walk-forward) procedure [36] was used simultaneously for three models. The approach of fixed training/testing sets was to divide the data into an 80% training set and 20% testing set. Taking into account the long-term seasonal characteristics of the 8760 h dataset, the core settings for the rolling-origin (walk-forward) were as follows: the starting point was from the beginning of the data, the training set ratio was 50%, the rolling step ratio was 10%, the test set ratio was 10%, and the number of folds was set to 5 times.

2.4.1. Extract Features Based on BiLSTM and MIV

The realization of zero-carbon parks primarily relies on the use of wind and solar energy, either individually or in combination, as the main sources of power generation. Considering the coupling relationships between different energy sources, the extraction of relevant load-influencing features was critical for enabling quantitative analysis and supporting the development of predictive models. The MIV was computed by systematically varying each input feature’s magnitude and averaging the corresponding prediction results [37]. Effective feature extraction can be achieved by eliminating features with lower absolute MIV. Therefore, the important influencing factors of different industries, identified through MIV screening, were combined with the electric load signals processed by the dual-layer optimization modal decomposition model. The resulting concatenated features, denoted as X(t), were used as the input features Z(t) for the predictive model. Then the BiLSTM was used as the prediction. The specific steps for feature extraction are shown in Figure 4.

2.4.2. Evaluation of the Model

The standard parameters for evaluation were taken into account—the precision and validity of the assessment outcomes, including the coefficient of determination (R2) [38], Root Mean Square Error (RMSE) [39], and Mean Absolute Error (MAE) [40]. Please refer to Equations (6)–(8) for the relevant parameter calculation formulas.
R M S E = 1 n i = 1 n ( y i y ^ i ) 2
M A E = 1 n i = 1 n y i y ^ i
R 2 = 1 i = 1 n ( y i y ^ i ) 2 i = 1 n ( y i y ¯ ) 2
where y i represents the actual value; y ^ i represents the predicted value; n represents the sample size.
The closer R2 is to 1, the higher the forecasting accuracy is. The lower the values of RMSE and MAE are, the higher the forecasting accuracy is. The MAE can better reveal the average accuracy of the model on stable and predictable loads. RMSE is a more crucial indicator for measuring whether a model can effectively handle extreme situations and avoid serious prediction errors. The electric load forecasting model was mainly evaluated by R2 and MAE, with RMSE serving as a reference value.

3. Results and Discussion

3.1. Raw Data and Processing

The various categories of electric load from the parks were sourced from the Inner Mongolia Electric Power Survey and Design Institute Co., Ltd. (Hohhot, China). The data were collected at a 1 h interval, encompassing a total of 8760 entries, which were collected for each industry by selecting representative companies in 2021. The year 2021 was selected as a representative typical year for analysis due to the resumption of stable post-pandemic economic activities and the absence of major industrial disruption. The working time included the hour, day of the week, month, and season. The meteorological data were utilized with the typical year [41], encompassing dry-bulb temperature, dew point temperature, relative humidity, atmospheric pressure, total radiation, direct radiation, diffuse radiation, and wind speed.
As described in Section 2.2 of this article, the box-plot detection and k-NN algorithm were applied to correct outliers in the raw data of electric loads (Figure 5). The red data points represent the algorithm’s corrections of outliers in the raw data. Such corrections can improve the convergence speed and accuracy of subsequent models [42].

3.2. The Dual-Layer Optimization Modal Decomposition Model (TPE-AVMD)

Figure 6 illustrates the decomposition outcomes for the electric load of a chemical manufacturing company by using the dual-layer optimization modal decomposition model. The TPE-AVMD decomposed the electric load sequence into IMFs. The decomposition results for all categories of electric loads showed no signs of mode mixing, indicating that the TPE-AVMD method yielded satisfactory outcomes. The IMFs for the electric load of five typical industries were eight steps, except for the cement manufacturing plant, proving that the TPE-AVMD can avoid the excessive number of IMFs. The modal decomposition results for the remaining five types of electrical loads are provided in Appendix A as Figure A1, Figure A2, Figure A3, Figure A4 and Figure A5.

3.3. Feature Extraction

The values of hour, day of the week, month, season, dry bulb temperature, dew point temperature, relative humidity, atmospheric pressure, total radiation, direct radiation, diffuse radiation, wind speed, and each type of electric load were input into the BiLSTM neural network. The BiLSTM neural network’s inputs were perturbed to calculate the MIV for each type of electric load. The absolute values of MIV for each feature are illustrated in Figure 7, which were used to express the importance of input features for different types of electric loads. The absolute values of MIV for dry bulb temperature were 1.0 for all electric loads, proving that dry-bulb temperature was usually the primary influencing factor. The absolute value of MIV for radiation parameters was less in the meteorological characteristics. There were more factors exceeding the MIV of 0.6, which included the shopping mall and chemical manufacturing company, while the residence, data center, cement manufacturing plant, and hospital were less. This was more in line with the practical application scenario to choose the features of large absolute MIV for modeling. Therefore, the top factors with an absolute MIV of 0.6 were selected as the partial input features for each type of electric load. The feature extraction results are shown in Table 2.
In order to reveal the impact of a feature on the electric load of different facilities, SHAP [43] was applied to assess the average contribution of these features. Figure 8 presents the SHAP results. The key features affecting electric load agreed with the MIV findings (Table 2). However, the ranking for the impact features of electric load obtained by SHAP was slightly different from that of MIV. This is mainly due to the different effects of the two feature extraction mechanisms. The SHAP plot clearly showed that the same factor contributed differently to the six types of electric loads, and the contribution of different factors to the same electric load is also different.

3.4. Ablation Experiment

The ablation experiments were conducted in order to verify the influence of feature extraction on the accuracy of electric load prediction. Comparison experiments were conducted on the three models, including the TPE-AVMD-BiLSTM with feature extraction (Model 1), the TPE-AVMD-BiLSTM with the most features (Model 2), and the TPE-AVMD-BiLSTM (Model 3). The evaluation indices of each model prediction result are presented in Table 3. Except for the chemical manufacturing company and cement manufacturing plant of Model 2, the evaluation accuracy of the model with rolling-origin (walk-forward) was always lower than that of the fixed division. Under the more stable rolling-origin validation method, all three models exhibited minimal reduction in R2 values, with decreases ranging from 0.02% to 0.47%. However, the performance declined significantly in terms of EMAE and ERMSE. The increased range of EMAE was 2.0–264.4%, while the ERMSE was 3.15% to 218.2%. It can be seen that the fixed training/testing sets overestimated the actual performance of the model. In contrast, when dealing with constantly real conditions, the rolling-origin (walk-forward) method provided a more realistic and rigorous assessment of the model’s generalizability and robustness. The experimental results with rolling-origin (walk-forward) are shown in Figure 9, Figure 10 and Figure 11, respectively. The R2 of the prediction results with rolling-origin (walk-forward) for all models was greater than 0.9891, indicating the approach of TPE-AVMD-BiLSTM is generally applicable to electric load forecasting within parks. The comprehensive analysis of the evaluation indicators revealed that the residence in Model 1 demonstrated the most outstanding performance (R2 = 0.9987, EMAE = 0.0091, ERMSE = 0.0141). The ERMSE of the residence and hospital was relatively smaller, which was due to the smaller base load and stable load pattern of these two types. On the contrary, the relatively higher ERMSE for the industries such as a chemical manufacturing company and cement manufacturing plant was due to the large base load they had.
Figure 12 shows the statistical distribution of metrics across folds for the three models. All three models exhibited relatively narrow dispersion ranges in R2 across folds for each electric load type. This indicates that the three models have excellent fitting performance when applying the rolling-origin (walk-forward) verification mode. EMAE generally exhibited relatively stable performance for the three models. The mean value of MAE was highly representative for the evaluation of each component. However, the models showed unstable performance (highly dispersed RMSE) in the IMF1-IMF2 of the data center, shopping mall, residence, and hospital.
In order to conduct a more in-depth analysis of feature extractions on the performance of electric load forecasting, the ratio of MAE/RMSE was examined (Table 2). This ratio quantifies the dispersion of predicted values around actual measurements [44], thereby revealing the stability of the model. By comparing the three models, the ratios of Model 2 were generally closer to 1, and higher than those of Model 1 and Model 3. Apart from the chemical manufacturing company, Model 2 has eleven input features and the month feature has been excluded. The month feature caused instability in Model 2 with rolling-origin (walk-forward), resulting in R2 values above 1 in some folds due to potential overfitting. The ratios of Model 1 were also higher than those of Model 2. The TPE-AVMD-BiLSTM model showed more stability when highly correlated features were added. This stabilizing effect became more pronounced as more relevant features were incorporated. This finding with the selected feature set further corroborates that the electric load profile of the industrial park is a direct reflection of its inherent production and living demands, rather than the product of random fluctuation.

4. Conclusions

The electric load forecasting framework (TPE-AVMD-BiLSTM with feature extraction) proposed in this article showed significant advantages in systematic experiments. We conducted the ablation experiments under the same parameter conditions. The proposed model achieved optimal performance on various test sets, verifying that the combined prediction architecture with feature extraction can optimize the accuracy and stability of electric load prediction. TPE and AVMD were coupled with BiLSTM to create a dual-level optimization across decomposition and forecasting, yielding higher efficiency and forecasting accuracy. Feature extraction enabled a comprehensive analysis of the factors influencing various electrical loads within industrial parks so that the same feature contributed differently to the prediction of different electric loads. The TPE-AVMD-BiLSTM model showed more stability when highly correlated features were added. This stabilizing effect became pronounced as more relevant features were incorporated. Nevertheless, the number of iterations of TPE and the parameters of AVMD were selected according to the literature. The most significant limitation lies in the lack of a comprehensive sensitivity analysis regarding the key hyperparameters. Specifically, the performance and robustness of the model were likely influenced by the pre-defined search ranges for the AVMD parameters and the computational budget allocated for the TPE iterations. We anticipated the model performance to be highly sensitive to K, while extreme values of α would adversely affect the bandwidth of the extracted modes, and τ would impact the convergence criteria and the final fidelity of the decomposition. Future, advanced global sensitivity analysis methods will be utilized, such as Sobol’ indices, which can efficiently optimize the parameters of AVMD and TPE in an effective manner. Future research will prioritize the integration of renewable energy generation forecasting into the model framework, alongside the development of a real-time adaptive control strategy. These technological advancements will jointly contribute to the creation of a fully dynamic low-carbon energy management system for industrial parks.

Author Contributions

Conceptualization, R.S. and Q.Z.; Methodology, L.G. and Q.Z.; Software, S.W.; Formal Analysis, S.W.; Data Curation, S.H.; Writing—original draft, R.S.; Writing—review and editing, R.S., J.K., L.G. and Q.Z.; Supervision, J.K.; Project Administration, J.K. All authors have read and agreed to the published version of the manuscript.

Funding

This current work was supported by the Postdoctoral Workstation Project of Inner Mongolia Electric Power Survey & Design Institute Co., Ltd. (KJGCSJY024001).

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Conflicts of Interest

Authors Rui Shi, Jianyu Kou, Lei Guo and Shuai Hu are employed by the Inner Mongolia Electric Power Survey & Design Institute Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The authors declare that this study received funding from Inner Mongolia Electric Power Survey & Design Institute Co., Ltd. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.

Appendix A

Figure A1. Modal decomposition results of data center.
Figure A1. Modal decomposition results of data center.
Buildings 15 04209 g0a1
Figure A2. Modal decomposition results of cement manufacturing plant.
Figure A2. Modal decomposition results of cement manufacturing plant.
Buildings 15 04209 g0a2
Figure A3. Modal decomposition results of hospital.
Figure A3. Modal decomposition results of hospital.
Buildings 15 04209 g0a3
Figure A4. Modal decomposition results of shopping mall.
Figure A4. Modal decomposition results of shopping mall.
Buildings 15 04209 g0a4
Figure A5. Modal decomposition results of residence.
Figure A5. Modal decomposition results of residence.
Buildings 15 04209 g0a5

References

  1. Yin, J.; Jia, H.; Chen, L.; Liu, D.; Mei, S.; Wang, S. Optimal scheduling of zero-carbon park considering variational characteristics of hydrogen energy storage systems. Glob. Energy Interconnect. 2024, 7, 603–615. [Google Scholar] [CrossRef]
  2. Wang, J.; Gao, J.; Wei, D. Electric load prediction based on a novel combined interval forecasting system. Appl. Energy 2022, 322, 119420. [Google Scholar] [CrossRef]
  3. Jiao, M.; Qi, X.; Mneg, F.; Wang, T.; Hou, Q.; Zheng, X.; Zhou, H. LSTM-RBF Short-Term Load Forecasting Method Based on Gaussian Similar Days. In Proceedings of the 2023 5th Asia Energy and Electrical Engineering Symposium (AEEES), Chengdu, China, 23–26 March 2023; pp. 1189–1194. [Google Scholar] [CrossRef]
  4. Zhu, L.; Wang, S.; Wu, L.; Kang, L.; Liu, Y. Construction of a Coal Chemical Industry Park with Zero Carbon Emission by Integrating Renewable Energy Based on Life Cycle Analysis. Process Integr. Optim. Sustain. 2024, 8, 1569–1580. [Google Scholar] [CrossRef]
  5. Pang, X.; Sun, W.; Li, H.; Wang, Y.; Luan, C. Short-term power load forecasting based on gray relational analysis and support vector machine optimized by artificial bee colony algorithm. PeerJ Comput. Sci. 2022, 8, e1108. [Google Scholar] [CrossRef]
  6. Son, J.; Cha, J.; Kim, H.; Wi, Y.M. Day-Ahead Short-Term Load Forecasting for Holidays Based on Modification of Similar Days’ Load Profiles. IEEE Access 2022, 10, 17864–17880. [Google Scholar] [CrossRef]
  7. Zhang, D.; Tong, H.; Li, F.; Xiang, L.; Ding, X. An Ultra-Short-Term Electrical Load Forecasting Method Based on Temperature-Factor-Weight and LSTM Model. Energies 2020, 13, 4875. [Google Scholar] [CrossRef]
  8. Lucas Segarra, E.; Ramos Ruiz, G.; Fernández Bandera, C. Probabilistic Load Forecasting Optimization for Building Energy Models via Day Characterization. Sensors 2021, 21, 3299. [Google Scholar] [CrossRef] [PubMed]
  9. Zhang, R.; Zhang, C.; Yu, M. A Similar Day Based Short Term Load Forecasting Method Using Wavelet Transform and LSTM. IEEJ Trans. Electr. Electron. Eng. 2022, 17, 506–513. [Google Scholar] [CrossRef]
  10. Hasan, M.; Mifta, Z.; Papiya, S.J.; Roy, P.; Dey, P.; Salsabil, N.A.; Chowdhury, N.-U.-R.; Farrok, O. A state-of-the-art comparative review of load forecasting methods: Characteristics, perspectives, and applications. Energy Convers. Manag. X 2025, 26, 100922. [Google Scholar] [CrossRef]
  11. Mamun, A.A.; Sohel, M.; Mohammad, N.; Sunny, M.S.H.; Dipta, D.R.; Hossain, E. A Comprehensive Review of the Load Forecasting Techniques Using Single and Hybrid Predictive Models. IEEE Access 2020, 8, 134911–134939. [Google Scholar] [CrossRef]
  12. Jin, Y.; Shen, W.; Wu, C.Q. Power Prediction Based on Signal Decomposition and Differentiated Processing with Multi-Level Features. Electronics 2025, 14, 2036. [Google Scholar] [CrossRef]
  13. Xiao, L.; An, R.; Zhang, X. A Deep Learning Approach Based on Novel Multi-Feature Fusion for Power Load Prediction. Processes 2024, 12, 793. [Google Scholar] [CrossRef]
  14. Si, C.; Xu, S.; Wan, C.; Chen, D.; Cui, W.; Zhao, J. Electric Load Clustering in Smart Grid: Methodologies, Applications, and Future Trends. J. Mod. Power Syst. Clean Energy 2021, 9, 237–252. [Google Scholar] [CrossRef]
  15. Syed, D.; Abu-Rub, H.; Ghrayeb, A.; Refaat, S.S.; Houchati, M.; Bouhali, O.; Bañales, S. Deep Learning-Based Short-Term Load Forecasting Approach in Smart Grid With Clustering and Consumption Pattern Recognition. IEEE Access 2021, 9, 54992–55008. [Google Scholar] [CrossRef]
  16. Agga, F.A.; Abbou, S.A.; Houm, Y.E.; Labbadi, M. Short-Term Load Forecasting Based on CNN and LSTM Deep Neural Networks. IFAC-Pap. 2022, 55, 777–781. [Google Scholar] [CrossRef]
  17. Lin, J.; Ma, J.; Zhu, J.; Cui, Y. Short-term load forecasting based on LSTM networks considering attention mechanism. Int. J. Electr. Power Energy Syst. 2022, 137, 107818. [Google Scholar] [CrossRef]
  18. Guo, X.; Liu, R.; Wang, Y.; Ning, Y.; Qu, Q.; Wang, Z.; Cong, W. Short-term power load forecasting for estate-level buildings considering multilevel feature extraction and adaptive fusion. Energy 2025, 337, 138657. [Google Scholar] [CrossRef]
  19. Maltais, L.-G.; Gosselin, L. Forecasting of short-term lighting and plug load electricity consumption in single residential units: Development and assessment of data-driven models for different horizons. Appl. Energy 2022, 307, 118229. [Google Scholar] [CrossRef]
  20. Alhussein, M.; Aurangzeb, K.; Haider, S.I. Hybrid CNN-LSTM Model for Short-Term Individual Household Load Forecasting. IEEE Access 2020, 8, 180544–180557. [Google Scholar] [CrossRef]
  21. Subbiah, S.S.; Chinnappan, J. Deep learning based short term load forecasting with hybrid feature selection. Electr. Power Syst. Res. 2022, 210, 108065. [Google Scholar] [CrossRef]
  22. Alibrahim, H.; Ludwig, S.A. Hyperparameter Optimization: Comparing Genetic Algorithm against Grid Search and Bayesian Optimization. In Proceedings of the 2021 IEEE Congress on Evolutionary Computation (CEC), Kraków, Poland, 28 June–1 July 2021; pp. 1551–1559. [Google Scholar] [CrossRef]
  23. Rong, G.; Li, K.; Su, Y.; Tong, Z.; Liu, X.; Zhang, J.; Zhang, Y.; Li, T. Comparison of Tree-Structured Parzen Estimator Optimization in Three Typical Neural Network Models for Landslide Susceptibility Assessment. Remote Sens. 2021, 13, 4694. [Google Scholar] [CrossRef]
  24. Andradóttir, S. A Review of Random Search Methods. In Handbook of Simulation Optimization; Springer: Berlin/Heidelberg, Germany, 2015; pp. 277–292. [Google Scholar] [CrossRef]
  25. Xiao, G.; Xu, F.; Tong, L.; Xu, H.; Zhu, P. A hybrid energy storage system based on self-adaptive variational mode decomposition to smooth photovoltaic power fluctuation. J. Energy Storage 2022, 55, 105509. [Google Scholar] [CrossRef]
  26. Liu, Y.; Zhang, Y.; Yu, S.; Guo, Y.; Li, Y.; Han, X.-J.; Zhao, Y.-D.; Chen, S.; Wang, G. An efficient new adaptive variational mode decomposition algorithm for extracting adventitious lung sounds. Biomed. Signal Process. Control 2024, 89, 105700. [Google Scholar] [CrossRef]
  27. Zeiler, A.; Faltermeier, R.; Keck, I.R.; Tomé, A.M.; Puntonet, C.G.; Lang, E.W. Empirical Mode Decomposition-an introduction. In Proceedings of the The 2010 International Joint Conference on Neural Networks (IJCNN), Barcelona, Spain, 18–23 July 2010; pp. 1–8. [Google Scholar] [CrossRef]
  28. Shen, G.; Tan, Q.; Zhang, H.; Zeng, P.; Xu, J. Deep Learning with Gated Recurrent Unit Networks for Financial Sequence Predictions. Procedia Comput. Sci. 2018, 131, 895–903. [Google Scholar] [CrossRef]
  29. Wan, R.; Mei, S.; Wang, J.; Liu, M.; Yang, F. Multivariate Temporal Convolutional Network: A Deep Neural Networks Approach for Multivariate Time Series Forecasting. Electronics 2019, 8, 876. [Google Scholar] [CrossRef]
  30. Jamshidzadeh, Z.; Ehteram, M.; Shabanian, H. Bidirectional Long Short-Term Memory (BILSTM)-Support Vector Machine: A new machine learning model for predicting water quality parameters. Ain Shams Eng. J. 2024, 15, 102510. [Google Scholar] [CrossRef]
  31. Bennasar, M.; Hicks, Y.; Setchi, R. Feature selection using Joint Mutual Information Maximisation. Expert Syst. Appl. 2015, 42, 8520–8532. [Google Scholar] [CrossRef]
  32. Liu, S.-Z.; Ma, S.; Chen, H.-Q.; Cui, L.-Z.; Ding, J. Combining KNN with AutoEncoder for Outlier Detection. J. Comput. Sci. Technol. 2024, 39, 1153–1166. [Google Scholar] [CrossRef]
  33. Liu, P.; Gu, H.; Gu, C.; Wang, Y. A Deformation Prediction Model for Concrete Dams Based on RSA-VMD-AttLSTM. Buildings 2025, 15, 357. [Google Scholar] [CrossRef]
  34. Shen, K.; Qin, H.; Zhou, J.; Liu, G. Runoff Probability Prediction Model Based on Natural Gradient Boosting with Tree-Structured Parzen Estimator Optimization. Water 2022, 14, 545. [Google Scholar] [CrossRef]
  35. Abe, K.; Wang, Y.; Watanabe, S. Tree-Structured Parzen Estimator Can Solve Black-Box Combinatorial Optimization More Efficiently. arXiv 2025, arXiv:2507.08053. [Google Scholar]
  36. Netzell, P.; Kazmi, H.; Kyprianidis, K. Deriving Input Variables through Applied Machine Learning for Short-Term Electric Load Forecasting in Eskilstuna, Sweden. Energies 2024, 17, 2246. [Google Scholar] [CrossRef]
  37. Zhang, H.; Ping, X.; Wan, H.; Luan, X.; Liu, F. Algae content prediction based on transfer learning and mean impact value. Chemom. Intell. Lab. Syst. 2024, 254, 105244. [Google Scholar] [CrossRef]
  38. Kim, S.; Kim, H. A new metric of absolute percentage error for intermittent demand forecasts. Int. J. Forecast. 2016, 32, 669–679. [Google Scholar] [CrossRef]
  39. Chen, M.; Beuchée, A.; Levine, E.; Storme, L.; Gascoin, G.; Hernández, A.I. Model-based characterization of total serum bilirubin dynamics in preterm infants. Pediatr. Res. 2025, 97, 1873–1881. [Google Scholar] [CrossRef] [PubMed]
  40. Du, S.; Zou, J.; Zheng, X.; Zhong, P. Solar Radiation Prediction Based on the Sparrow Search Algorithm, Convolutional Neural Networks, and Long Short-Term Memory Networks. Processes 2025, 13, 1308. [Google Scholar] [CrossRef]
  41. Meteorological Data. 2021. Available online: https://energyplus.net/weather (accessed on 10 November 2025).
  42. Halder, R.K.; Uddin, M.N.; Uddin, M.A.; Aryal, S.; Khraisat, A. Enhancing K-nearest neighbor algorithm: A comprehensive review and performance analysis of modifications. J. Big Data 2024, 11, 113. [Google Scholar] [CrossRef]
  43. Wang, H.; Liang, Q.; Hancock, J.T.; Khoshgoftaar, T.M. Feature selection strategies: A comparative analysis of SHAP-value and importance-based methods. J. Big Data 2024, 11, 44. [Google Scholar] [CrossRef]
  44. Willmott, C.J.; Matsuura, K. Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance. Clim. Res. 2005, 30, 79–82. [Google Scholar] [CrossRef]
Figure 1. Framework of the model.
Figure 1. Framework of the model.
Buildings 15 04209 g001
Figure 2. Framework of the dual-layer optimization modal decomposition model.
Figure 2. Framework of the dual-layer optimization modal decomposition model.
Buildings 15 04209 g002
Figure 3. The prediction framework.
Figure 3. The prediction framework.
Buildings 15 04209 g003
Figure 4. The prediction process of MIV.
Figure 4. The prediction process of MIV.
Buildings 15 04209 g004
Figure 5. The results of electric load corrections: (a) chemical manufacturing company; (b) data center; (c) cement manufacturing plant; (d) hospital; (e) shopping mall; (f) residence.
Figure 5. The results of electric load corrections: (a) chemical manufacturing company; (b) data center; (c) cement manufacturing plant; (d) hospital; (e) shopping mall; (f) residence.
Buildings 15 04209 g005
Figure 6. Modal decomposition results of chemical manufacturing company.
Figure 6. Modal decomposition results of chemical manufacturing company.
Buildings 15 04209 g006
Figure 7. The MIV of each input feature.
Figure 7. The MIV of each input feature.
Buildings 15 04209 g007
Figure 8. The results of SHAP: (a) data center; (b) chemical manufacturing company; (c) residence; (d) shopping mall; (e) cement manufacturing plant; (f) hospital.
Figure 8. The results of SHAP: (a) data center; (b) chemical manufacturing company; (c) residence; (d) shopping mall; (e) cement manufacturing plant; (f) hospital.
Buildings 15 04209 g008
Figure 9. The prediction results of TPE-AVMD-BiLSTM with MIV feature extraction (rolling-origin): (a) chemical manufacturing company; (b) data center; (c) cement manufacturing plant; (d) hospital; (e) shopping mall; (f) residence.
Figure 9. The prediction results of TPE-AVMD-BiLSTM with MIV feature extraction (rolling-origin): (a) chemical manufacturing company; (b) data center; (c) cement manufacturing plant; (d) hospital; (e) shopping mall; (f) residence.
Buildings 15 04209 g009
Figure 10. The prediction results of TPE-AVMD-BiLSTM with most feature (rolling-origin): (a) chemical manufacturing company; (b) data center; (c) cement manufacturing plant; (d) hospital; (e) shopping mall; (f) residence.
Figure 10. The prediction results of TPE-AVMD-BiLSTM with most feature (rolling-origin): (a) chemical manufacturing company; (b) data center; (c) cement manufacturing plant; (d) hospital; (e) shopping mall; (f) residence.
Buildings 15 04209 g010
Figure 11. The prediction results of TPE-AVMD-BiLSTM model (rolling-origin): (a) chemical manufacturing company; (b) data center; (c) cement manufacturing plant; (d) hospital; (e) shopping mall; (f) residence.
Figure 11. The prediction results of TPE-AVMD-BiLSTM model (rolling-origin): (a) chemical manufacturing company; (b) data center; (c) cement manufacturing plant; (d) hospital; (e) shopping mall; (f) residence.
Buildings 15 04209 g011
Figure 12. Significance analysis results of absolute error for each model: (a) mean value of R2, (b) mean value of MAE, (c) mean value of RMSE for chemical manufacturing company; (d) mean value of R2, (e) mean value of MAE, (f) mean value of RMSE for data center; (g) mean value of R2, (h) mean value of MAE, (i) mean value of RMSE for cement manufacturing plant; (j) mean value of R2, (k) mean value of MAE, (l) mean value of RMSE for hospital; (m) mean value of R2, (n) mean value of MAE, (o) mean value of RMSE for shopping mall; (p) mean value of R2, (q) mean value of MAE, (r) mean value of RMSE for residence.
Figure 12. Significance analysis results of absolute error for each model: (a) mean value of R2, (b) mean value of MAE, (c) mean value of RMSE for chemical manufacturing company; (d) mean value of R2, (e) mean value of MAE, (f) mean value of RMSE for data center; (g) mean value of R2, (h) mean value of MAE, (i) mean value of RMSE for cement manufacturing plant; (j) mean value of R2, (k) mean value of MAE, (l) mean value of RMSE for hospital; (m) mean value of R2, (n) mean value of MAE, (o) mean value of RMSE for shopping mall; (p) mean value of R2, (q) mean value of MAE, (r) mean value of RMSE for residence.
Buildings 15 04209 g012
Table 1. The commonly used methods for prediction.
Table 1. The commonly used methods for prediction.
MethodFeatureReference
Grid SearchEnumerate all parameter combinations, the cost of calculation increases exponentially with the dimension, and it is completely unintelligent. This approach requires pre-defining a candidate list for each hyperparameter, followed by exhaustive evaluation of all possible combinations.[22]
Tree-structured Parzen EstimatorImproves the predictive accuracy of the model, and automatically finds the globally optimal parameters. The method flow is as follows: based on historical observations, hyperparameters are categorized as “good” or “bad” and modeled using distinct density functions. Subsequent sampling is guided by the improvement probability—the ratio of “good” to “bad” densities—favoring regions most likely to enhance performance.[23]
Random SearchSampling randomly in the parameter space lacks directional guidance and makes it difficult to find the precise optimal solution. This method entails randomly sampling a set of points within the parameter space, evaluating their performance, and selecting the top-performing candidate.[24]
Variational Mode DecompositionThe method sets bandwidth limits to separate components of different frequencies. Its objective is to minimize the total spectral bandwidth of all principal component functions, thereby effectively suppressing modal confusion. The optimization process requires preset modal parameters. Improper parameter selection may cause over-decomposition or under-decomposition.[25]
Adaptive Variational Mode DecompositionIt is the result of further development of the VMD model. The key advantage is the ability to adaptively determine parameters without the need for manual pre-setting. Core improvement: by using certain criteria (such as envelope entropy, center frequency observation), the optimal modal number K and α are automatically determined, avoiding manual trial and error and making the decomposition results more accurate and objective.[26]
Empirical Mode DecompositionSensitive to intermittent signals and noise, and the frequency components of different modalities will be mixed together. The optimization process has no parameters. Each IMF must meet two conditions: the number of extreme points and zero-crossing points must be equal or differ by at most one; at any point, the envelope mean defined by the local maxima and minima is zero.[27]
Gated Recurrent UnitIt has a simpler structure and faster training speed, and can achieve comparable accuracy to LSTM in many scenarios. Core structure consists of two gates: “Update Gate” and “Reset Gate”.[28]
Temporal Convolutional NetworkUses causal convolution to capture the long-term historical dependencies of the sequence and achieves high parallel computing efficiency. The core mechanism involves the use of causal convolution and dilated convolution.[29]
Bidirectional Long Short-Term MemoryIt can capture the long-term correlations of time series in addressing specific problems. This method is composed of a forward LSTM and a backward LSTM. The forward LSTM learns information from the past to the future, while the backward LSTM learns information from the future to the past.[30]
Table 2. The result of feature extraction (MIV > 0.6).
Table 2. The result of feature extraction (MIV > 0.6).
Building TypeSelected Features
Data CenterDry bulb temperature, atmospheric pressure
Chemical Manufacturing CompanyDry bulb temperature, month,
atmospheric pressure, date
ResidenceDry bulb temperature, dew point temperature
Shopping MallDry bulb temperature, dew point temperature,
relative humidity
Cement Manufacturing PlantDry bulb temperature, dew point temperature
HospitalDry bulb temperature, dew point temperature
Table 3. The evaluation indices of each model prediction result.
Table 3. The evaluation indices of each model prediction result.
Prediction ModelModel 1
Training Set/Testing Set—80%/20%
Model 1
Rolling-Origin (Walk-Forward)
Test Sample/
Evaluation Index
R2EMAE
(MW)
ERMSE
(MW)
R2EMAE
(MW)
ERMSE
(MW)
EMAE/ERMSE
Data Center0.99890.04740.07260.99450.09490.16300.5822
Chemical Manufacturing Company0.99550.22690.30020.99440.23650.33650.7028
Residence0.99890.00960.01280.99870.00910.01410.6453
Shopping Mall0.99930.07840.10930.99880.08800.14610.6023
Cement Manufacturing Plant0.99450.28140.43190.99470.29160.42370.6882
Hospital0.99820.01810.02580.99740.01970.03130.6293
Prediction ModelModel 2
Training Set/Testing Set—80%/20%
Model 2
Rolling-Origin (Walk-Forward)
Test Sample/
Evaluation Index
R2EMAE
(MW)
ERMSE
(MW)
R2EMAE
(MW)
ERMSE
(MW)
EMAE/ERMSE
Data Center0.99690.08850.12180.99220.10280.19420.5293
Chemical Manufacturing Company0.99470.24600.32670.99650.18170.26670.6813
Residence0.99840.01200.01560.99640.01820.02320.7845
Shopping Mall0.99900.10190.13320.98960.37140.42390.8762
Cement Manufacturing Plant0.99420.30540.44390.99820.16760.24520.6835
Hospital0.99760.02260.03000.99590.02900.03900.7435
Prediction ModelModel 3
Training Set/Testing Set—80%/20%
Model 3
Rolling-Origin (Walk-Forward)
Test Sample/
Evaluation Index
R2EMAE
(MW)
ERMSE
(MW)
R2EMAE
(MW)
ERMSE
(MW)
EMAE/ERMS
Data Center0.99510.05910.15400.99470.06700.15950.4200
Chemical Manufacturing Company0.99280.26790.38000.98910.27490.46940.5856
Residence0.99820.00990.01630.99790.01010.01760.5738
Shopping Mall0.99720.10940.22170.99810.10810.18060.5986
Cement Manufacturing Plant0.99460.27950.42870.99380.28710.45880.6257
Hospital0.99570.02760.03990.99690.02120.03400.6235
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Shi, R.; Kou, J.; Guo, L.; Wei, S.; Hu, S.; Zhang, Q. Zero-Carbon Parks’ Electric Load Forecasting Considering Feature Extraction of Multi-Type Electric Load and Dual-Layer Optimization Modal Decomposition. Buildings 2025, 15, 4209. https://doi.org/10.3390/buildings15234209

AMA Style

Shi R, Kou J, Guo L, Wei S, Hu S, Zhang Q. Zero-Carbon Parks’ Electric Load Forecasting Considering Feature Extraction of Multi-Type Electric Load and Dual-Layer Optimization Modal Decomposition. Buildings. 2025; 15(23):4209. https://doi.org/10.3390/buildings15234209

Chicago/Turabian Style

Shi, Rui, Jianyu Kou, Lei Guo, Shen Wei, Shuai Hu, and Quan Zhang. 2025. "Zero-Carbon Parks’ Electric Load Forecasting Considering Feature Extraction of Multi-Type Electric Load and Dual-Layer Optimization Modal Decomposition" Buildings 15, no. 23: 4209. https://doi.org/10.3390/buildings15234209

APA Style

Shi, R., Kou, J., Guo, L., Wei, S., Hu, S., & Zhang, Q. (2025). Zero-Carbon Parks’ Electric Load Forecasting Considering Feature Extraction of Multi-Type Electric Load and Dual-Layer Optimization Modal Decomposition. Buildings, 15(23), 4209. https://doi.org/10.3390/buildings15234209

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop