1. Introduction
The world is gradually shifting toward clean, renewable energy sources for power generation and systematically restructuring its energy mix. Increasing the share of renewable energy in the energy mix is crucial to building an ecologically sustainable power supply system. Photovoltaic (PV) power generation has become an important technical approach for constructing low-carbon power systems and optimizing energy structures due to its advantages of low operating costs, renewability, and modular deployment [
1,
2,
3]. However, PV power generation systems exhibit significant randomness, volatility, and intermittency due to the influence of various meteorological parameters such as ambient temperature, component temperature, solar irradiance, humidity, and wind speed, significantly increasing the complexity of power prediction. In the context of large-scale grid connection, power prediction errors can easily lead to imbalances between power supply and demand, threatening system safety and stability [
4]. Therefore, obtaining reliable PV power generation information is of great significance for optimizing energy dispatch decisions and ensuring the reliable operation of the power grid.
Currently, the main technologies in the field of PV power prediction can be divided into three categories: physical methods, statistical methods, and artificial intelligence models [
5,
6,
7]. Physical methods construct physical models and atmospheric radiation transfer equations, combining weather forecast data to simulate the PV conversion process and achieve accurate prediction of PV power. However, the prediction accuracy of this physical method exhibits significant dependency: its accuracy is strictly constrained by the precision of PV system design parameters and the reliability of weather forecast data [
8,
9,
10]. Compared with the limitations of physical methods, statistical methods construct prediction models by exploring statistical correlations between historical power data and meteorological parameters, thereby avoiding the need to construct complex physical equations [
11]. While statistical methods have made significant progress in PV power forecasting, their performance remains limited under irradiance fluctuation scenarios due to the inherent intermittency and instability of PV generation, particularly when relying on simple irradiance-power mapping relationships. In recent years, artificial intelligence models have significantly improved the unstable prediction performance of traditional statistical methods caused by intense fluctuations in irradiance due to their excellent nonlinear modeling capabilities and multiscale feature extraction mechanisms. They have been widely applied in PV power prediction tasks [
12,
13,
14]. Artificial intelligence models, particularly Long Short-Term Memory (LSTM), Convolutional Neural Network (CNN) and their variants, have demonstrated superior capability in capturing complex spatiotemporal dependencies between meteorological features and power generation, significantly improving the reliability and stability of multi-timescale forecasting [
15,
16,
17].
While numerous PV power forecasting methods and strategies have been explored in existing studies [
18,
19,
20], AI-driven models have emerged as a research focus owing to their superior prediction accuracy and robust generalization capabilities across diverse operational conditions. For example, Min Kyeong Park et al. [
21] used an LSTM model to predict PV power, avoiding the problems of gradient vanishing or gradient explosion caused by excessive sequence length in traditional Recurrent Neural Network (RNN) models. Ze Wu et al. [
22] compared the performance of the Informer model and the LSTM model in short-term PV power forecasting tasks. The results showed that the Informer model effectively addressed the limitation of the LSTM model in focusing on key information when processing long sequence data by introducing multi-head self-attention (MHA). Jiahui Wang et al. [
23] introduced a CNN model to predict PV power under three typical weather conditions, and validated the generalization and robustness of the CNN model through comparative analysis with other models. Although artificial intelligence models had made breakthrough progress in the field of PV power prediction, single model architectures still faced core challenges such as insufficient generalization capabilities and unstable performance in extreme weather prediction.
With the development of computer technology, combination models constructed using computer technology have been widely applied in PV power prediction research, especially combination models of machine learning and deep learning [
24]. For example, Keyong Hu et al. [
25] proposed a CNN–Transformer hybrid model that leverages CNN’s local feature extraction, parameter sharing, and hierarchical compression capabilities to compensate for Transformer’s limitations in capturing fine-grained details and computational efficiency, demonstrating superior PV power forecasting performance compared to standalone LSTM or Transformer models. Jibo Wang et al. [
26] developed a dedicated short-term forecasting method for PV clusters, where the Adaptive Multi-Objective Differential Evolution (AMODE) algorithm optimizes Bidirectional Long Short-Term Memory (BiLSTM) parameters, significantly enhancing its capability to handle long-range dependencies and complex multi-feature time series tasks. Jie Meng et al. [
27] proposed a hybrid BiLSTM-Attention-ISOA forecasting model, where the attention mechanism addresses BiLSTM’s limitation in capturing long-sequence dependencies, while the Improved Sea-horse Optimization Algorithm (ISOA) enhances model performance through hyperparameter optimization. Following a comparative analysis between Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) and Variational Mode Decomposition (VMD), Wenbo Zhao et al. [
28] developed a hybrid VMD-IDBO-KELM forecasting model, where the VMD reduces noise interference in historical data while the Improved Dung Beetle Optimizer (IDBO) optimizes kernel and regularization parameters in the Kernel Extreme Learning Machine (KELM), collectively enhancing prediction accuracy. However, although CEEMDAN and VMD effectively reduce the interference of noise in the data on optimal prediction results, certain issues remain in their practical applications [
29,
30,
31]. CEEMDAN may produce some spurious modes in the early stages of decomposition, which can adversely affect the predictive performance of the model. In contrast, although VMD does not suffer from the aforementioned issue, the quality of its decomposition is susceptible to noise—strong noise can interfere with the convergence of the algorithm, potentially exacerbating mode mixing. In comparison, Improved Complementary Ensemble Empirical Mode Decomposition with Adaptive Noise (ICEEMDAN) effectively avoids the problem of generating spurious modes in the early stages by improving the noise introduction strategy; meanwhile, its strong adaptability enhances its robustness to noise compared to VMD, enabling cleaner modal separation in high-noise environments and thereby providing higher-fidelity components for predictive models. Existing research primarily adopts hybrid approaches that combine signal decomposition with deep learning models, or integrate deep learning architectures with attention mechanisms as shown in
Table 1. However, these methods often focus solely on mitigating the impact of data noise on prediction accuracy, or fail to adequately capture fine-grained features, thereby limiting model performance under extreme weather conditions [
32,
33,
34]. To address these limitations, the proposed ICEEMDAN-TCN-BiLSTM-MHA hybrid model employs ICEEMDAN to effectively suppress non-stationarity in input data, thereby supplying higher-quality components for subsequent forecasting. Furthermore, the integrated TCN-BiLSTM architecture enhanced with multi-head attention mechanism simultaneously improves local feature extraction, long-range dependency modeling, and key information highlighting, leading to enhanced prediction accuracy and stability in challenging scenarios such as extreme weather.
To address the aforementioned challenges in PV power forecasting, this study proposes a novel ICEEMDAN-TCN-BiLSTM-MHA hybrid model that systematically integrates signal decomposition, temporal feature extraction, and adaptive weighting mechanisms. The framework first employs ICEEMDAN to decompose historical PV power and meteorological data into intrinsic mode functions, enhancing data stationarity and noise robustness. Building upon this preprocessing, the model architecture combines three key components: (1) a Temporal Convolutional Network (TCN) with dilated causal convolutions to capture long-range temporal patterns while preserving local details, (2) a Bidirectional LSTM (BiLSTM) layer that processes both forward and backward temporal dependencies through its gating mechanisms for comprehensive feature learning, and (3) a Multi-head Attention (MHA) module that dynamically weights meteorological features to suppress noise interference and focus on critical weather determinants. This integrated design not only addresses the limitations of individual components (e.g., BiLSTM’s difficulty in processing long sequences and the TCN’s potential information loss) but also creates synergistic effects that significantly improve prediction accuracy, particularly under challenging weather transitions. Finally, to validate the accuracy and stability of the proposed model, a dataset from a PV power plant in Qinghai Province, China, was selected for comparative experiments using different algorithms to evaluate the predictive performance of the ICEEMDAN-TCN-BiLSTM-MHA combined model.
The structure of this paper is organized as follows: The theoretical methods employed in this study are described in
Section 2. The proposed PV power forecasting methodology is presented in
Section 3. The experimental results and corresponding analysis are discussed in
Section 4. Finally, the main conclusions are summarized in
Section 5.
3. Method
3.1. The Short-Term Prediction Process of Photovoltaic Based on ICEEMDAN-TCN-BiLSTM-MHA
The combination of TCN and BiLSTM addresses the limitation of BiLSTM networks in capturing detailed features of input data while reducing the computational complexity of BiLSTM networks. The integration of BiLSTM with MHA enables the model to focus on key information in the data, enhancing its ability to establish long-term effective temporal dependencies. Using MAE as the objective function and introducing ICEEMDAN to smooth the original power data effectively suppresses the modal aliasing issues of traditional EMD are effectively suppressed, thereby improving the model’s prediction accuracy and reducing asymmetric randomness in PV power forecasting. The TCN model comprises one TCN layer and two fully connected layers. The TCN layer contains 30 filters and uses the ReLU activation function. The first fully connected layer also consists of 30 filters with ReLU activation, while the second fully connected layer is configured with one filter and employs a linear activation function. The BiLSTM model is structured with four hidden layers and one dropout layer. The numbers of neurons in the hidden layers are 8, 32, 64, and 128, respectively, and the dropout rate is set to 0.174. The architectural layers and parameter configurations for both the TCN and BiLSTM models are implemented according to [
40,
41]. The parameters of the TCN-BiLSTM model component are summarized in
Table 2. Furthermore, other key parameters in the model—including the number of heads in the MHA mechanism, the Gaussian noise level in ICEEMDAN, the learning rate, the batch size, the number of iterations, and the dropout rate in the BiLSTM layer—were optimized using the Particle Swarm Optimization (PSO) algorithm within predefined upper and lower bounds. The resulting optimal parameter values are presented in
Table 3. In summary, this paper constructs a PV power prediction model based on ICEEMDAN-TCN-BiLSTM-MHA, with the prediction process shown in
Figure 4.
This prediction model consists of the following six steps:
Step 1. Remove data with PV output power of 0 and detect outliers in the data. Use the mean interpolation method of adjacent data points to fill in the outliers.
Step 2. Based on the weather annotation information in the dataset, classify the historical PV output power into sunny, cloudy, and rainy days according to weather type characteristics.
Step 3. Perform Pearson correlation analysis on the processed data, and combine point-line diagrams of different features and PV output power to screen the primary meteorological features influencing PV output power as model inputs.
Step 4. Perform ICEEMDAN modal decomposition on the screened data, reducing noise and complexity in the input data by decomposing the original input data into multiple independent sub-sequences.
Step 5. Construct the ICEEMDAN-TCN-BiLSTM-MHA prediction model, using mean squared error (MSE) as the error loss function.
Step 6. The parameters of the hybrid ICEEMDAN-TCN-BiLSTM-MHA model are optimized using the PSO algorithm.
Step 7. Evaluate the prediction error of the model for PV power output using MAE, RMSE, MAPE, U1, and R2.
3.2. Dataset
The experimental data were obtained from a PV power station in Haidong City, Qinghai Province, China, with a longitude of 101°49′3.81″ and a latitude of 36°39′3.016″. The dataset covers PV power generation data for the entire year of 2024, comprising a total of 29,713 data samples, with a sampling frequency of 15 min for PV output power. The dataset includes six meteorological features: Component temperature, ambient temperature, humidity, total solar radiation, direct radiation, and diffuse radiation, as well as historical actual values of PV output power, all of which were collected by on-site sensors. Furthermore, the dataset includes three weather type labels: sunny, cloudy, and rainy. The corresponding meteorological data were derived from a national reference climatological station operated by the China Meteorological Administration. As shown in
Table 4 the determination thresholds for each weather type are provided.
3.3. Data Anomaly Handling
There is a strong linear correlation between PV output power and total solar radiation, direct radiation, and diffuse radiation [
42]. However, during data collection at PV power plants, issues such as signal interruptions caused by extreme weather conditions affecting wireless communication and instrument errors in the data collection system may result in missing or anomalous data in the raw data. To enhance the validity of the data and improve the predictive accuracy of the model, it is necessary to eliminate the interference of unfavorable data on model training. First, 15,671 datasets with constant zero power at night were excluded; then, the remaining 14,042 datasets were subjected to Random Sample Consensus (RANSAC) outlier detection in batches of 5000. The RANSAC regression algorithm identifies outliers by calculating the regression residuals for each data point. If the absolute value of the residual exceeds three times the standard deviation (exceeding 3
), the data point is deemed an outlier. As shown in
Figure 5, the detection results indicate the presence of some outliers in the data. For the 97 missing values and 32 outliers in the dataset, the mean interpolation method using neighboring data points was employed for filling. Ultimately, 13,913 valid experimental datasets were obtained. The characteristics of the dataset are presented in
Table 5.
3.4. Pearson Correlation Analysis
To reduce the complexity of model learning and enhance computational efficiency, while avoiding the negative impact of redundant and irrelevant features on model symmetry, Pearson correlation analysis was used to analyze the correlation between six types of meteorological features affecting PV output power and different seasons, including spring (March to May), summer (June to August), autumn (September to November), and winter (December to February), as shown in
Figure 6. Based on the correlation analysis results, the Pearson correlation coefficients for total solar radiation, direct radiation, and scattered radiation were significantly higher than those for other features, indicating a strong linear correlation between these features and PV output power. In contrast, humidity and ambient temperature exhibit lower correlations with actual PV system output power, indicating their limited influence on output power. As shown in the short-term photovoltaic power forecasting curve in
Figure 7 and the error evaluation metrics in
Table 6 he results indicate that when features—including component temperature, ambient temperature, humidity, total solar radiation, direct radiation, and scattered radiation—are used as inputs, compared to using only 4 features (module temperature, global irradiance, direct irradiance, and diffuse irradiance), the model’s prediction performance does not improve significantly, whereas the computation time increases by 28.57%. Therefore, this study selects total solar radiation, direct radiation, scattered radiation, and PV module temperature—which significantly impact PV output power—as model input features. The calculation process for the Pearson correlation coefficient is shown in Equation (29), with a value range of [−1, 1]. The closer the absolute value is to 1, the stronger the linear correlation between variables, while the closer the absolute value is to 0, the weaker the linear correlation [
43].
where
and
represent the rank data of feature
and target variable
, respectively,
represents the covariance of the rank data of the two,
and
represent the standard deviations of
and
, respectively.
3.5. ICEEMDAN Decomposition
In the study of PV power output prediction, historical data often exhibit multi-peak and complex fluctuation characteristics due to uncertainty and non-stationarity, making data preprocessing a key step in improving model performance. For non-stationary input data, ICEEMDAN is used as the data decomposition method. This method decomposes the original input data into multiple independent sub-sequences, significantly reducing data non-stationarity and enhancing the model’s ability to learn the correlation between meteorological characteristics and PV power output.
To further validate the effectiveness of the data preprocessing method, six different machine learning and deep learning prediction models (Back Propagation Neural Network (BPNN), Kernel-based Extreme Learning Machine (KELM), TCN, LSTM, eXtreme Gradient Boosting (XGBoost), and Transformer) were selected for a comparative experiment, as shown in
Figure 8 The results indicate that the data processed by the ICEEMDAN algorithm exhibits higher accuracy in PV power output prediction.
4. Results and Discussion
To validate the accuracy and generalization capability of the proposed ICEEMDAN-TCN-BiLSTM-MHA hybrid model, a cross-regional comparative analysis was conducted using operational data from two photovoltaic power stations located in different climatic zones: Xining City, Qinghai Province and Haian City, Jiangsu Province.
The dataset from Xining, which features a plateau continental climate, was collected in 2024. It includes four types of feature variables—ambient temperature, total solar radiation, direct radiation, and scattered radiation—as well as the actual photovoltaic power output. All parameters were measured and collected via on-site sensors. After preprocessing, a total of 13,913 valid data samples were obtained. The dataset from Haian, characterized by a subtropical monsoon climate, was collected in 2022. It also contains the same four feature variables and the actual power output, similarly acquired through field sensors. This dataset yielded 13,716 valid samples after preprocessing. In both cases, 80% of each dataset was used for model training. Furthermore, based on weather records from the China Meteorological Administration, the data from both stations were categorized into three typical weather conditions for comparative analysis: sunny, cloudy, and rainy.
The experimental configuration utilized a Windows 64-bit operating system, an Intel Core i9-13900HX processor, an NVIDIA GeForce RTX 4060 graphics card, and Python version 3.11 as the programming language, with GPU acceleration utilized to expedite the computational process of the model. To mitigate the impact of random factors on experimental results, each model was run 10 times, and the average of these results was used to evaluate the model’s predictive performance.
4.1. A Comparative Analysis of Model Prediction Results on the Qinghai Dataset
Based on meteorological scenario data from PV power plants in Qinghai Province, China, an ICEEMDAN-TCN-BiLSTM-MHA combined model framework was constructed to achieve intraday prediction of PV output power. Four different comparison models—BiLSTM, TCN-BiLSTM, BiLSTM-MHA, and TCN-BiLSTM-MHA—were compared with the proposed ICEEMDAN-TCN-BiLSTM-MHA combined model through experimental comparisons, as shown in
Figure 9 and
Table 7,
Table 8 and
Table 9 In sunny weather scenarios, all models demonstrated high prediction accuracy (R
2 0.98). Among them, the ICEEMDAN-TCN-BiLSTM-MHA model demonstrated a significant advantage with a MAPE of 2.891%, significantly outperforming other comparison models and single models (BiLSTM model MAPE: 13.429%, TCN-BiLSTM model MAPE: 8.850%, BiLSTM-MHA model MAPE: 7.757%, TCN-BiLSTM-MHA model MAPE: 8.324%). In cloudy scenes, the predictive performance of different models showed significant differentiation. The combined models generally performed better, with the BiLSTM-MHA and TCN-BiLSTM-MHA models achieving MAPE values of 13.49% and 7.00%, respectively, and RMSE values of 2741.33 and 1367.42, respectively. In contrast, the ICEEMDAN-TCN-BiLSTM-MHA model achieved more accurate PV power output prediction results. In rainy weather scenarios, model performance showed significant differences due to fluctuations in irradiance. Among them, the ICEEMDAN-TCN-BiLSTM-MHA combination model outperformed other comparison models in terms of prediction accuracy, with MAPE reduced by 36.27% and 26.34% compared to the BiLSTM-MHA and TCN-BiLSTM-MHA models, respectively. The BiLSTM and TCN-BiLSTM models exhibit more pronounced errors in this scenario, with MAPE values reaching 2.41 times and 2.11 times that of ICEEMDAN-TCN-BiLSTM-MHA, respectively.
4.2. A Comparative Analysis of Model Generalization Performance on the Jangsu Dataset
The experimental results from the photovoltaic power station in Jiangsu Province, China, are presented in
Figure 10 and
Table 10,
Table 11 and
Table 12. Based on the error evaluation metrics listed in
Table 10,
Table 11 and
Table 12, the proposed ICEEMDAN-TCN-BiLSTM-MHA hybrid model demonstrates accurate predictive performance under three typical weather conditions—sunny, cloudy, and rainy all models demonstrated high prediction accuracy (R
2 0.97). A comprehensive comparison of the error metrics between Qinghai Province and Jiangsu Province, as summarized in
Table 7,
Table 8,
Table 9,
Table 10,
Table 11 and
Table 12 shows that the differences in the relative error metrics MAPE and U
1 between the Jiangsu and Qinghai datasets are relatively small across all three weather scenarios, ranging from 4.65% to 11.28% and from 7.05% to 10.91%, respectively. These results indicate that the proposed model maintains strong stability and generalization capability under different regional conditions.
The experimental results demonstrate that the ICEEMDAN-TCN-BiLSTM-MHA hybrid model consistently exhibits high predictive accuracy and stability across datasets from diverse climatic regions. The cross-regional comparative experiments further validate its strong generalization capability.
5. Discussion
In practical applications, the proposed ICEEMDAN-TCN-BiLSTM-MHA hybrid model can provide high-precision short-term forecasting for PV power plant operations, supporting power generation planning, energy storage regulation, and market trading strategy optimization. Particularly under abrupt weather changes, it helps mitigate operational risks caused by prediction deviations and can be further integrated into smart grid monitoring systems to facilitate the efficient grid integration of renewable energy. Owing to the intermittent, fluctuating, and non-stationary nature of PV power generation, the ICEEMDAN algorithm effectively reduces noise interference in PV power prediction through modal decomposition of the data. Meanwhile, the multi-scale temporal feature extraction capability of the TCN-BiLSTM model, combined with the dynamic weighting mechanism of MHA for key meteorological features, further enhances the prediction performance of the model. Therefore, the proposed hybrid model demonstrates stronger specialized applicability in PV power forecasting. However, the model currently faces challenges related to high computational complexity, with major bottlenecks including the iterative computational overhead of ICEEMDAN decomposition, the parameter growth caused by deep BiLSTM structures, and the additional computational burden introduced by the global dynamic weighting calculations in MHA. Therefore, future work will focus on further optimizing model parameters and comparing various algorithmic models, such as Kolmogorov–Arnold Networks (KANs), XGBoost, and KELM, to reduce the computational burden.
6. Conclusions
PV power generation is characterized by intermittency, fluctuation, and a symmetrical distribution of daily power output. Therefore, high-accuracy forecasting of its output power is of great significance for ensuring the security and stability of large-scale grid integration, as well as improving energy utilization efficiency. This study constructs a hybrid prediction model named ICEEMDAN-TCN-BiLSTM-MHA based on a BiLSTM backbone. By incorporating a TCN module to enhance local feature extraction and a multi-head attention mechanism to explore feature correlations across multiple subspaces, the model significantly improves its capability for feature representation and dependency capture. Meanwhile, the ICEEMDAN algorithm is employed to decompose meteorological inputs, constructing high signal-to-noise ratio input data, thereby comprehensively improving the prediction accuracy and generalization performance of the model. Experiments under three typical weather conditions in the Qinghai dataset show that the proposed hybrid model significantly outperforms single benchmark models. Particularly under rainy conditions, compared to the single BiLSTM model, MAE, MAPE, RMSE, and U1 decrease by 65.75%, 58.44%, 68.96%, and 67.82%, respectively, while R2 increases by 13.95%. Furthermore, ICEEMDAN decomposition effectively suppresses data non-stationarity and noise interference, accelerating convergence and improving prediction performance across six different machine learning and deep learning models. Finally, cross-regional tests based on data from the Qinghai and Jiangsu provinces demonstrate that the model maintains stable and reliable performance under different climatic and regional conditions, verifying its superior accuracy and generalization capability. Future research will explore further improvements, such as optimizing the model’s internal hyperparameters through a comparative analysis of various deep learning methods, to enhance prediction accuracy.