Next Article in Journal
Analyzing the Impact of a Structured LLM Workshop in Different Education Levels
Previous Article in Journal
Potential of Portuguese Viticulture By-Products as Natural Resources of Bioactive Compounds—Antioxidant and Antimicrobial Activities
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Short-Term Photovoltaic Power Generation Based on MVMD Feature Extraction and Informer Model

1
Suzhou Research Institute of Southeast University, Suzhou 215123, China
2
School of Electrical Engineering, Southeast University, Nanjing 210096, China
3
College of Energy and Electrical Engineering, Hohai University, Nanjing 211100, China
4
School of Cyber Science and Engineering, Southeast University, Nanjing 211189, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2024, 14(14), 6279; https://doi.org/10.3390/app14146279
Submission received: 15 May 2024 / Revised: 10 July 2024 / Accepted: 16 July 2024 / Published: 18 July 2024
(This article belongs to the Section Energy Science and Technology)

Abstract

:
Photovoltaic (PV) power fluctuates with weather changes, and traditional forecasting methods typically decompose the power itself to study its characteristics, ignoring the impact of multidimensional weather conditions on the power decomposition. Therefore, this paper proposes a short-term PV power generation method based on MVMD (multivariate variational mode decomposition) feature extraction and the Informer model. First, MIC correlation analysis is used to extract weather features most related to PV power. Next, to more comprehensively describe the relationship between PV power and environmental conditions, MVMD is used for time–frequency synchronous analysis of the PV power time series combined with the highest MIC correlation weather data, obtaining frequency-aligned multivariate intrinsic modes. These modes incorporate multidimensional weather factors into the data-decomposition-based forecasting method. Finally, to enhance the model’s learning capability, the Informer neural network model is employed in the prediction phase. Based on the input PV IMF time series and associated weather mode components, the Informer prediction model is constructed for training and forecasting. The predicted results of different PV IMF modes are then superimposed to obtain the total PV power generation. Experiments show that this method improves PV power generation accuracy, with an MAPE value of 4.31%, demonstrating good robustness. In terms of computational efficiency, the Informer model’s ability to handle long sequences with sparse attention mechanisms reduces training and prediction times by approximately 15%, making it faster than conventional deep learning models.

1. Introduction

With the rapid development of the world economy, the demand for energy consumption is increasing. The scarcity of traditional resources and the need for environmental protection have led to a sharp increase in the demand for renewable energy. Among all renewable energy sources, solar energy, as the most typical one, has attracted widespread attention due to its abundance, almost ubiquitous availability, and clean, renewable characteristics, gradually evolving from standalone power generation to grid-connected power generation [1]. However, PV power generation is easily affected by various meteorological factors within the region, such as solar irradiance, temperature, and relative humidity, leading to characteristics of randomness, intermittency, and uncertainty in PV power generation [2]. These uncertainties can degrade real-time control performance, reduce system economy, and jeopardize the stable operation of the power system, posing significant challenges to the management and operation of power and energy systems. To overcome these shortcomings, accurate PV power forecasting is needed.
Regarding the aforementioned issue, scholars have proposed various prediction methods to address the challenges of PV power generation and enhance prediction accuracy. In terms of prediction methods, PV power prediction can be categorized into four main types: (1) statistical and probabilistic methods such as autoregressive modeling (AR) [3]. Statistical methods are more suitable for ultra-short-term (from seconds to under 30 min) and short-term (30 min to under 6 h) forecasting [4]. Reference [5] proposes a short-term probabilistic photovoltaic power forecasting model based on the beta distribution, which improves the forecasting process and results using a semi-parametric method. (2) Machine learning-based methods such as k-nearest neighbor (KNN), support vector machine (SVM), and gradient boosting decision trees [6,7,8]. These methods are also suitable for short-term forecasts. (3) Deep learning methods, which can be divided into three categories: convolutional neural network (CNN) methods, recurrent neural network (RNN) methods, and hybrid models combining both [9]. RNN usually includes long short-term memory (LSTM) and gated recurrent unit (GRU) [10]. They can automatically learn complex mappings from input to output and support multiple inputs and outputs [11]. (4) Physical modeling method is based on NWP (such as solar irradiance, temperature, wind speed, etc.) and the physical characteristics of a photovoltaic system. Such methods can provide a detailed understanding of the behavior of photovoltaic systems and are often suitable for in-depth analysis and simulation of the physical properties of the system. Reference [12] proposes a probabilistic photovoltaic power forecasting method that combines ensemble numerical weather prediction (NWP) and a chain of physical models. By incorporating quantile regression, this method improves the accuracy and reliability of the forecasts. In addition to the above methods, combinatorial forecasting methods combine the advantages of multiple forecasting methods, such as combining physical models with statistical models, machine learning, or deep learning models, taking advantage of the complementary properties of the different methods to improve prediction performance. These methods are the most widely used in current research.
Photovoltaic (PV) power output forecasting is primarily divided into short-term and medium-to-long-term predictions. For short-term forecasting (less than 6 h), reference [13] proposes a combined ultra-short-term PV power prediction method considering ground-based cloud images and chaotic characteristics, significantly improving prediction accuracy under sudden irradiance changes caused by moving clouds. For medium-to-long-term forecasting (more than 6 h), reference [14] presents a method based on copula functions and LSTM networks. This method effectively extracts key meteorological factors and deeply explores long-term dependencies within limited data samples, significantly enhancing prediction accuracy. Reference [15] introduces a new adaptive discrete gray model, which improves the accuracy and reliability of long-term PV power prediction by incorporating time power terms and periodic terms and optimizing using genetic algorithms.
In recent years, data-driven PV forecasting methods have been extensively studied and applied. These methods integrate machine learning, deep learning, time-series analysis, and feature engineering for PV output prediction. Reference [16] proposes a deep learning model based on an intelligent fusion gate mechanism for a bimodal feature fusion network, aimed at short-term point-interval prediction of PV power. This model effectively extracts and fuses global and local features of PV power sequences, significantly improving prediction accuracy and stability. In terms of similar day data processing, reference [17] employs the K-means clustering method to select similar days, reduces multidimensional feature data through factor analysis, and finally models and predicts using the extreme gradient boosting (XGBoost) algorithm, significantly improving prediction accuracy. Combining time–frequency analysis, weather characteristics, and algorithm optimization, reference [18] uses variational mode decomposition (VMD) technology and K-means clustering for time–frequency analysis of weather-conditioned grouped datasets. The resulting sub-signal sequences are then predicted using an LSTM neural network model optimized by the grey wolf optimizer (GWO). Reference [19] employs multiscale permutation entropy (MPE) technology to quantitatively evaluate different weather patterns, using these patterns as input features to construct a prediction model with the least squares support vector machine (LSSVM). The study verifies LSSVM’s performance advantage in handling small-scale dataset prediction tasks. However, detailed parameter tuning of the LSSVM model was not conducted in this study, which might affect the model’s generalization performance. Reference [20] proposes a CNN-LSTM-Transformer hybrid model, where CNN extracts spatial features, LSTM extracts temporal features, and the Transformer’s encoder–decoder strategy improves prediction accuracy by learning the combination of spatiotemporal features. For model capability enhancement, reference [21] introduces a hybrid model based on guided LSTM, using an improved chicken swarm optimization (ICSO) and predator–prey mechanism to further enhance particle search capabilities, optimizing neural network (NN) weights and biases to improve prediction accuracy. Addressing issues such as data redundancy in VMD, insufficient VMD accuracy, and limited generalization capability of LSSVM in PV power prediction models, reference [22] proposes an improved VMD (IVMD)-grey wolf optimizer least squares support vector machine (GLSSVM) method, combining grey relational analysis (GRA) and Kendall correlation coefficient (KCC). This integrated approach effectively reduces the complexity of raw data, improves VMD decomposition accuracy, and enhances the model’s generalization capability by optimizing LSSVM parameters, thereby improving PV power prediction accuracy and reliability. In comparative optimization of various models, reference [23] proposes a meta-learning method that dynamically mixes the base predictions of multiple deep learning models to learn the conditions under which each model performs best, improving the deterministic prediction of PV systems one hour in advance. In terms of utilizing satellite cloud images, reference [24] presents a time-segmented PV power prediction model based on cloud images. This model improves short-term prediction methods and processes through real-time adjustment of clear-sky PV power estimates and cloud transmittance, achieving high prediction accuracy under various weather conditions and having potential application value in practical grid dispatching.
The methods discussed provide an overview of various existing techniques for PV power prediction, including statistical, probabilistic, machine learning, and deep learning approaches. While these methods have significantly improved prediction accuracy, they still face notable limitations. Traditional methods typically focus on decomposing the PV power data itself without accounting for environmental conditions, leading to incomplete decomposition results. EMD and VMD are single-sequence decomposition methods, and traditional multivariate PV power and meteorological sequence decompositions require multiple applications of these algorithms. The frequencies of the decomposed subsequences cannot be aligned, preventing time–frequency synchronous analysis of multivariate sequences and reducing prediction accuracy. Additionally, as neural network models become more complex with increasing layers, they face issues like low feature learning efficiency, slow training speeds, and overfitting. These challenges hinder the accurate prediction of PV power generation, particularly when dealing with long sequence data and multivariate inputs.
To address these issues, this paper proposes a short-term photovoltaic (PV) power generation forecasting method based on multivariate variational mode decomposition (MVMD) and the Informer model. First, the most relevant weather features to PV power are extracted through maximal information coefficient (MIC) correlation analysis and incorporated into the PV power prediction, enhancing the model’s adaptability to environmental conditions. Next, to comprehensively describe the relationship between PV power and environmental conditions, MVMD is employed for multivariate variational mode decomposition [25,26,27,28,29]. This method performs time–frequency synchronous analysis on the PV power sequence combined with the weather data having the highest MIC correlation, obtaining intrinsic mode functions (IMFs) modes characterizing PV power features and corresponding weather feature IMFs modes. By obtaining these frequency-aligned multivariate intrinsic modes, multiple weather factors are introduced into the data decomposition-based prediction method, improving the completeness of data decomposition and the accuracy of predictions. Finally, the Informer neural network model is adopted in the prediction phase [30,31]. Based on the respective power IMF sequences, combined with the relevant weather modes of each power IMF, separate Informer prediction models are constructed for training and prediction. The predicted results of different IMF modes are then aggregated to obtain the overall PV power generation forecast. The proposed method’s effectiveness and accuracy are validated using historical data from a PV power station in Shengzhou, Zhejiang Province.
In summary, this paper systematically extracts weather features related to PV power through MIC correlation analysis, employs MVMD for time–frequency synchronous analysis, and achieves the integration of PV power with multidimensional weather factors, thereby improving the completeness of data decomposition and the accuracy of predictions. Additionally, the combination with the Informer neural network model enhances the model’s learning ability and prediction accuracy while significantly reducing computational complexity and memory requirements.

2. Power Feature Extraction

2.1. MIC Correlation Analysis

This paper utilizes MIC correlation analysis to extract weather features related to PV power. Through this method, a better understanding of the relationship between PV power and weather factors can be obtained, which helps in better comprehending the data and improving prediction accuracy.
In statistics, MIC stands for maximum information coefficient, which is used to measure the strength of correlation between two variables. Its correlation analysis formula involves the calculation of entropy, and the specific formula is as follows:
M I C ( X , Y ) = m a x L x , L y I ( X L , Y M ) log ( min ( L x , L y ) )
where X and Y are two variables, Lx and Ly represent the number of bins for X and Y respectively, and I(XL, YM) represents the mutual information between the L-th sample of X and the M-th sample of Y.

2.2. MVMD

After obtaining the MIC correlation weather features, a thorough analysis of the PV power time series combined with weather factors is conducted. In traditional prediction methods based on VMD, VMD is often only used to decompose historical PV power sequences, while environmental factors such as meteorological sequences are often neglected. Even when considering meteorological sequences, they are usually decomposed separately and cannot achieve time–frequency synchronous analysis of multidimensional sequences. Therefore, this paper adopts MVMD to decompose PV power along with its weather features with the highest correlation, thereby achieving time–frequency synchronous analysis.
The core idea of MVMD is to decompose the input multivariate time-series data into its intrinsic multivariate modulated oscillations. The model is as follows:
min u k , w k k c t u + k , c e - j w k t 2 2 k u k , c ( t ) = x c ( t ) , c = 1 , 2 , 3 , , c ,
In the formula, k is the number of separated modes set in the algorithm; c is the number of original signals input into the algorithm; wk is the center frequency of each mode after MVMD decomposition; xc(t) is the set of input multivariate time-series feature signals; and uk,c(t) is the set of decomposed time-series feature signals.
When performing calculations, unconstrained variational problems are easier to handle and optimize. Common optimization algorithms and mathematical tools are more mature and efficient for unconstrained problems. In contrast, solving constrained problems is relatively complex, requiring consideration of the degree to which constraints are met. Thus, converting a constrained problem into an unconstrained one can simplify the computation process. To convert a constrained variational problem into an unconstrained one, the Lagrange multiplier method is used. The form of its Lagrangian function is as follows:
L ( u k , c , W k , λ c ) = ε k c t u + k , c e - j w k t 2 2 + c x c ( t ) k u k , c ( t ) 2 2 + c λ c , x c ( t ) k u k , c ( t )
In the formula, ε is the penalty factor, which is a constant; λ(t) is the time-dependent Lagrange multiplier.
In the process of conversion, the introduction of penalty factors and Lagrange multipliers will affect the optimization process of the objective function, which may lead to deviations in the final variational mode decomposition results. Therefore, in the decomposition operation, it is essential to adjust the penalty factor or Lagrange multiplier appropriately in real time.
By solving the above equation, the solutions for each variable are obtained:
u ^ k , c n + 1 ( w ) = x ^ c i k u ^ i , c ( w ) + λ ^ c ( w ) 2 1 + 2 ε ( w w k ) 2
w k n + 1 = c 0 w u ^ k , c ( w ) 2 d w c 0 u ^ k , c ( w ) 2 d w
λ ^ c n + 1 ( w ) = λ ^ c n ( w ) + τ x ^ c ( w ) k u ^ k , c n + 1 ( w )
In the formula, denotes the Fourier transform; τ is the update coefficient. Through continuous iterative updates of the algorithm, the desired decomposition modes are obtained.

3. Informer Forecasting Model

PV power generation, compared to other new energy sources like wind power, has higher electricity generation and a more stable power pattern. Time-series forecasting methods can predict future power variations based on historical power patterns, giving them a natural advantage in forecasting tasks with strong periodicity.
In recent years, the Transformer model in deep learning has shown far better capabilities in handling time-series problems than traditional recurrent neural network (RNN) models. The Informer model, designed to address time-series data processing and forecasting issues, improves upon the Transformer by addressing three challenges in long time-series forecasting: quadratic self-attention computation, the memory bottleneck of long input stacking layers, and reduced speed in predicting long power generation. This section will introduce the Informer model and propose an improved version for PV power generation.

3.1. Self-Attention Mechanism

In the Transformer, the traditional attention mechanism proposed by the authors calculates the scaled dot product after receiving the input.
A ( Q , K , V ) = s o f t max Q K T d V
In Equation (7), Q R L Q × d , K R L K × d , V R L V × d , d is the input dimension. The authors of Informer found through experimentation and observation that the self-attention scores of the traditional attention computation method follow a long-tailed distribution. This means that only a small portion of the query matrix Q significantly contributes to the attention. Therefore, in the Informer model, the attention computation method has been improved to:
A ( Q , K , V ) = s o f t max Q ¯ K T d V
In Equation (8), Q ¯ is a sparse matrix that has been filtered through the attention mechanism and is the same size as Q. It only contains queries that significantly contribute to the attention. This process enables the Informer model to maintain high accuracy in long time-series data processing while using less memory and improving computation speed.

3.2. Encoding and Decoding Structure

Informer continues with the encoder and decoder structure of the Transformer, and its structure can be represented in Figure 1.
The encoder receives long-sequence input and uses a multi-head ProbSparse self-attention module and a self-attention distillation module to obtain feature representations. The multi-head ProbSparse self-attention mechanism replaces the original attention matrix with a sparse matrix, significantly reducing computation requirements while maintaining good performance.
The decoder receives long-sequence input, interacts with the encoded features through multi-head attention, and finally directly predicts the target value. The one-shot generative prediction method used by Informer is faster in inference speed than the step-by-step method while achieving comparable results.

3.3. Loss Function

In this paper, we use the Huber loss function to train the PV power generation model. Its calculation formula is as follows:
η Huber = 1 2 ( y i y i ^ ) 2 , y i y i ^ δ δ y i y i ^ 1 2 δ 2 , y i y i ^ > δ
Here, yi is the actual power value, ŷi is the predicted value from the model, and δ is the hyperparameter of the Huber loss function, set to 0.5 in this model’s training.
When predicting the total PV power of a large city, two key factors need to be considered when handling large-scale training data. First, the training accuracy must be ensured, and second, the training time required is also an important factor that cannot be ignored. Therefore, to improve the training accuracy without sacrificing training speed, this paper proposes using the Huber loss function in an improved Informer model for PV power generation [32]. The Huber loss function has a hyperparameter, δ, which can be adjusted to accommodate different types of power prediction model training.
When the prediction deviation is less than δ, a loss calculation method similar to MSE is used to improve the model’s convergence speed when the gradient of the loss is small. When the prediction deviation is greater than δ, a linear error similar to MAE is used to reduce the impact of large losses caused by outliers on model training, thereby improving the model’s robustness to outliers.

4. PV Power Generation

Based on the methods mentioned above, this paper proposes a short-term PV power generation method using MVMD feature extraction and the Informer neural network model. The structure of the prediction model is shown in Figure 2.
Firstly, to more accurately extract the weather features most correlated with PV power stations, this paper uses the MIC correlation analysis method to optimize the selection of multidimensional weather factors. MIC can quantify the nonlinear relationship between various weather factors and PV power, which is crucial for capturing the impact of complex weather variables on PV power generation.
Considering that PV power stations have almost no power generation at night, this paper proposes a strategy of performing correlation analysis in different time segments, namely, full-time correlation analysis and power-generation–time correlation analysis. Full-time correlation analysis covers 24 h data, helping reveal common, overall weather-influencing factors. However, since PV power generation mainly occurs during the day, especially during periods of high solar irradiance, power-generation–time correlation analysis focuses on these critical periods, directly identifying the most crucial weather factors affecting PV power. This dual analysis method provides a comprehensive understanding of the impact of weather factors on PV power and allows for a thorough evaluation of the most critical weather features.
Additionally, this analysis method covers not only the daylight hours but also the nighttime. Although photovoltaic stations generate almost no power at night, nighttime meteorological conditions, such as temperatures and humidity, can affect the temperature of the modules during the night and the following morning, thereby impacting the efficiency of power production the next day. For example, if the nighttime temperatures are lower, it may slow down the heat loss from the photovoltaic panels, ensuring that the panels start the day at a relatively lower temperature. As solar irradiance increases, the efficiency of the panels could be higher. Moreover, nighttime weather conditions, such as frost or dew formation, could affect the cleanliness of the photovoltaic panels’ surface, further impacting the next day’s power production efficiency.
Next, to more comprehensively describe the relationship between PV power and environmental conditions, this paper employs MVMD to perform time–frequency synchronized analysis on PV power time series in combination with the meteorological data most correlated to it via MIC analysis.
MVMD, an advanced signal decomposition tool, can simultaneously decompose multiple time series and extract their respective intrinsic mode functions. Using MIC-based correlation analysis, the weather factors most strongly correlated with PV power are selected as input variables. These input variables, along with the PV power time series, are input into the MVMD model and, through decomposition, produce frequency-aligned multivariate intrinsic modes. These frequency-aligned modes not only retain the essential characteristics of each data set but also reveal the coupling relationships between PV power and weather factors across different frequency components, providing multidimensional feature input for subsequent PV power generation. Decomposing PV power and multivariate meteorological sequences into simpler subsequences also reduces the effects of nonlinearity and volatility in PV power time series.
To further enhance the model’s learning ability, an Informer neural network model with a Huber loss function is used during the prediction phase. Based on the corresponding PV power IMFs (intrinsic mode functions), combined with relevant weather modal components, multiple Informer prediction models are built for training and forecasting. Each model predicts different IMFs independently, and finally, the predictions of each IMF mode are combined to yield the forecast value of total PV power generation.

5. Case Study

To verify the effectiveness of the short-term PV power generation method based on MVMD feature extraction and the Informer neural network model, this paper uses historical PV power generation and weather data from the PV station in Shengzhou, Zhejiang Province, China, as experimental data. The historical PV power and meteorological data cover the entire year from 1 January 2023, 00:00 to 31 December 2023, 23:55, with a sampling frequency of 15 min per point and power data measured in MW. The historical PV output data is provided by the State Grid Zhejiang Shengzhou Power Supply Company, and the weather data comes from the European Centre for Medium-Range Weather Forecasts (ECMWF).

5.1. Prediction Evaluation Metrics

This paper uses the root mean squared error (RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE) as prediction evaluation metrics. Their respective calculation formulas are:
e M A P E = 1 n i = 1 n y i y i y i
e R M S E = 1 n i = 1 n y i y i 2
e M A E = 1 n i = 1 n y i y i
In power prediction, yi is the actual value at time i, and yi is the predicted value at time i.

5.2. MIC Correlation Analyss

First, perform the segmented MIC correlation analysis proposed in this paper, which includes full-time correlation analysis and power-generation–time correlation analysis. The MIC correlation coefficients are shown in Table 1. In this table, total cloud cover refers to the proportion of the sky covered by clouds, usually expressed as a percentage. It significantly impacts solar radiation, particularly direct and diffuse radiation reaching the surface. The irradiance involved in the EC meteorological sources in this paper is global horizontal irradiance (GHI), which is the total irradiance on a horizontal plane, including both direct and diffuse radiation, ranging from 0 to 2000 W/m2.
From Table 1, it can be seen that cloud cover shows a moderate correlation with PV power in both full-time and power-generation–time periods. The correlation is slightly higher during the power-generation–time period, indicating that cloud cover has a more significant impact during PV power generation. Solar irradiance is highly correlated with PV power in both time periods, especially over the full-time period, with correlation coefficients exceeding 0.9. This shows that solar irradiance is a decisive factor in PV power. Meanwhile, the correlation between various seasonal meteorological variables and PV power varies significantly. For instance, relative humidity is much more strongly correlated with PV power in summer (July–September) than in winter. This may be because the high humidity in summer reduces the efficiency of PV panels.
After comprehensive analysis, this paper selects total cloud cover, irradiance, relative humidity, and surface temperature as relevant weather features for PV power to proceed with the next MVMD and prediction process.

5.3. MVMD Decomposition

Next, the relevant weather features and PV power time series are combined for MVMD to obtain frequency-aligned multivariate intrinsic modes. Considering that when the number of decompositions exceeds seven, the IMF central frequencies show too many similarities, and having too many decompositions increases the computational burden on the model, this paper sets the number of MVMD K to seven. The multivariate mode diagram and FFT frequency diagram are shown in Figure 3 and Figure 4.
According to Figure 3, PV power, total cloud cover, irradiance, relative humidity, and surface temperature are input as five channels into MVMD for decomposition. Each channel consists of seven IMF features. Unlike VMD, each channel’s IMF is a feature decomposition chart of multidimensional data in both time and spatial dimensions. IMF1 is the primary decomposition component of each channel, retaining the main feature information. IMF2 to IMF7 in each channel represent different frequency components, with their frequencies progressively decreasing. They represent lower-frequency components or long-term variations over different time scales.
As seen in Figure 3 and Figure 4, the same IMF index across different feature channels has a consistent frequency distribution. This indicates that because MVMD can perform multivariate decomposition, the central frequencies of IMF1 to IMF7 obtained from the decomposition of PV power and meteorological factor channels are the same, achieving time–frequency synchronized analysis. This ensures that the prediction model is built at the same central frequency. Therefore, the same IMF index in each channel represents a distinct time scale, providing the model with rich information across different time scales and improving the prediction accuracy for various PV power IMFs.

5.4. Informer PV Prediction

The experimental environment for the Informer model includes Python 3.9, Pytorch 2.0.0 framework, Windows 11 system, with hardware configuration of 16 GB RAM, 12th Gen Intel i7-12700H processor, and NVIDIA GeForce RTX 3060 workstation GPU.
In the prediction phase, for each power IMF, an experimental dataset is built using weather multivariate intrinsic modes of the same frequency. These are then input into the prediction model to train and forecast the predicted results of each mode, which are combined to obtain the overall PV power generation.
In this study, a single day from the experimental dataset is randomly selected as the target prediction day. The data from the preceding 50 days are partitioned into 80% for training, 10% for validation, and 10% for testing to initialize the model parameters. Given that the sampling frequency of the training data is every 15 min, the prediction step length is set to 96. The Adam optimizer is employed, with the learning rate adjusted within the [0, 1] range. Model parameters such as batch size, dropout rate, number of encoders, number of decoders, and the number of training epochs are fine-tuned based on the training dataset. By iteratively selecting different target prediction days and repeating the aforementioned process, a highly generalized model was ultimately developed for subsequent experiments. Table 2 presents the key parameter settings for the prediction model on the target prediction day.
In the comparison experiment, the models introduced include the non-decomposed Informer prediction model, CNN-LSTM prediction model, WaveNet prediction model, MVMD-CNN-LSTM prediction model, MVMD-WaveNet prediction model, and the proposed MVMD-Informer prediction model. These models are compared to explore the differences between the proposed method and other methods. The actual and predicted power values for the target day are shown in Figure 5, and the error boxplot is shown in Figure 6.
Based on Figure 5 and Figure 6, the prediction curve of the proposed MVMD-Informer prediction model is the closest to the actual values, yielding the best predictive performance. Comparing the prediction performance across different neural networks, the inclusion of MVMD improves prediction accuracy in all cases, showing that MVMD achieves time–frequency synchronized analysis of PV power and meteorological sequences and performs multivariate decomposition. Therefore, it aligns more closely with the real operating conditions of distributed PV power plants, improving prediction accuracy.
At the same time, the Informer model demonstrates advantages over other models in prediction accuracy and model size. The evaluation metrics for the prediction results are shown in Table 3.
According to the evaluation metrics data in Table 3, the proposed MVMD-Informer model performed best in predicting PV power, with the lowest RMSE (0.939 MW), MAE (0.758 MW), and MAPE (4.31%). Notably, compared to the Informer model, the MVMD-Informer reduced RMSE by 62.48%, MAE by 52.18%, and MAPE by 20.92%. Compared to the CNN-LSTM model, the performance improvement was even more significant, with RMSE reduced by 81.67%, MAE by 79.92%, and MAPE by 43.40%. This demonstrates that the MVMD-Informer model combines the temporal decomposition capabilities of MVMD with the self-attention mechanism of the Informer, effectively capturing the dynamic characteristics and long-term dependencies of PV power.
Due to the addition of the MVMD process, the training time of the MVMD-Informer model is longer than that of the Informer model alone. However, this additional complexity is reasonably compensated by a significant improvement in prediction accuracy. Compared to other models incorporating MVMD, the training time of the MVMD-Informer model is also shorter, reduced by approximately 13%. Furthermore, the training efficiency of the Informer model, compared to other models without MVMD, has increased by at least 15%. This indicates that the self-attention mechanism of the Informer model not only enhances prediction accuracy but also improves computational efficiency.
Overall, the MVMD-Informer model strikes a balance between accuracy and computational complexity, making it a robust and efficient method for short-term photovoltaic power generation forecasting.
Moreover, to investigate the impact of different seasons on PV power and the superiority of MVMD over VMD, the model was trained and predicted quarterly. Example plots of days to be predicted for different seasons are shown in Figure 7. The MAPE error results are shown in Figure 8.
According to Figure 7 and Figure 8, the MVMD-Informer model consistently exhibited the lowest MAPE values across all quarters: 4.46% (January–March), 4.27% (April–June), 4.22% (July–September), and 4.39% (October–December), demonstrating its superiority in handling multivariate information. In contrast, the VMD-Informer model had higher MAPE values, at 5.21%, 4.91%, 4.89%, and 5.11%, respectively. Although the Informer model had its highest MAPE value in the first quarter (6.01%), it gradually decreased over subsequent quarters to 5.66%, 5.51%, and 5.76%.
While VMD can reduce the volatility and nonlinearity of PV power and meteorological sequences, MVMD achieves synchronized time–frequency analysis of these sequences and completes multivariate decomposition. This aligns better with the actual operation of distributed PV power stations and highlights the algorithm’s advantages.
Based on its robustness and accuracy, the proposed short-term PV power forecasting method using MVMD and the Informer model shows extensive application prospects. It can be employed in PV power station monitoring systems for accurate short-term forecasts and optimizing operation and maintenance. Grid dispatch centers can use it for better power generation forecasts, enabling rational load allocation, reducing power curtailment, and optimizing grid operations. Additionally, integrating this method into distributed energy management systems can enhance energy utilization efficiency, reduce costs, and promote renewable energy adoption. In smart grids, accurate forecasts aid in demand response and load balancing, improving grid resilience and reliability. Therefore, this method has significant potential in monitoring systems, grid dispatch, energy management, and smart grids.

6. Conclusions

Traditional prediction methods often decompose the power itself to study its characteristics, neglecting the impact of the multidimensional weather environment on the decomposition. This paper proposes a short-term PV power forecasting method based on MVMD feature extraction and the Informer model, which has shown significant results through experimental validation. The main conclusions are as follows:
  • The proposed segmented MIC correlation analysis strategy, which includes full-time and power-generation–time correlation analysis, effectively captures the nonlinear relationships between PV power and multidimensional weather factors. The analysis revealed that solar irradiance, total cloud cover, relative humidity, and surface temperature are the most influential factors in the experimental region.
  • A multivariate decomposition method based on MVMD is proposed. Compared to the VMD method, this method can perform synchronized time–frequency analysis of PV power generation and various meteorological factors, such as temperature, humidity, wind speed, and solar radiation intensity, decomposing them into frequency-aligned multivariate intrinsic modes. This leads to a more comprehensive and accurate decomposition of the PV power series (see Figure 3 and Figure 4).
  • The Informer model enhanced with the Huber loss function excels at capturing long-term dependencies and handling large-scale time-series data, improving prediction accuracy and robustness. Experimental results show that the RMSE, MAE, and MAPE values of the MVMD-Informer model are 0.939 MW, 0.758 MW, and 4.31%, respectively. Compared to the CNN-LSTM model, RMSE is reduced by 81.67%, MAE by 79.92%, and MAPE by 43.40%. Compared to the Informer model, RMSE is reduced by 62.48%, MAE by 52.18%, and MAPE by 20.92%.
  • The MVMD-Informer model demonstrates notable advantages in computational efficiency. While the inclusion of the MVMD process slightly increases training time, it significantly enhances prediction accuracy. Compared to the MVMD-CNN-LSTM model, the training time is reduced by about 13%. Compared to models without MVMD, such as WaveNet, the Informer model’s training efficiency improves by at least 15%. These results indicate that the self-attention mechanism of the Informer model improves both prediction accuracy and computational efficiency.
  • In summary, the proposed short-term PV power forecasting method based on MVMD feature extraction and the Informer model has demonstrated excellent performance, achieving the expected research goals. This method effectively extracts key weather features, performs multivariate decomposition, and improves prediction accuracy and computational efficiency, showing extensive practical application potential in PV station monitoring systems, grid dispatch, energy management systems, and smart grids. Despite its significant advantages, there are potential limitations and challenges in its practical application. The quality and acquisition cost of high-quality meteorological and PV power data are crucial for model performance but can be difficult or costly to obtain in some regions. Furthermore, large-scale applications still require substantial computational resources, posing challenges for resource-constrained scenarios. As the PV industry develops, more data and richer meteorological information will become available, providing new opportunities to further optimize and enhance short-term PV power forecasting methods.

Author Contributions

R.X. and J.Z. analyzed the data and wrote the paper; R.X. and F.M. performed the simulation and modeling; X.Y. and Y.W. analyzed the results and reviewed the modeling and text; H.Z. and Y.W prepared the data and contributed analysis tools. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Jiangsu Province International Cooperation Project, grant number BZ2021012.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Acknowledgments

We extend our deepest gratitude to all authors who have participated in this study and contributed to the manuscript. Each author’s expertise and tireless work have been pivotal to the completion of our research findings. Special thanks go to R.X. and J.Z., who undertook a thorough analysis of the data and co-authored the paper. Their collaborative efforts have ensured the accuracy of our research outcomes and the high quality of the manuscript’s content. We are also grateful to R.X. and F.M. for their work in performing the simulations and modeling. Their contributions have provided a solid theoretical foundation and innovative perspectives for our study. X.Y.’s work in analyzing the results and reviewing the modeling and text is equally commendable. His meticulous review has deepened the insights of our research and clarified the presentation in the paper. H.Z. has made significant contributions in preparing the data and providing essential analysis tools. His efforts have been instrumental in facilitating the smooth progress of our research process. Additionally, we wish to express our heartfelt thanks to Southeast University for the invaluable support provided in conducting the experiments of this study. The resources and facilities offered by the university have been crucial for the completion of our experimental and analytical work.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AbbreviationFull Term
PVPhotovoltaic
MVMDMultivariate variational mode decomposition
IMFIntrinsic mode function
MICMaximum information coefficient
CNNConvolutional neural network
LSTMLong short-term memory
GRUGated recurrent unit
VMDVariational mode decomposition
ECMWFEuropean Centre for Medium-Range Weather Forecasts
RMSERoot mean squared error
MAEMean absolute error
MAPEMean absolute percentage error
NWPNumerical weather prediction
SVMSupport vector machine
LSSVMLeast squares support vector machine
XGBoostExtreme gradient boosting
GHIGlobal horizontal irradiance
MPEMultiscale permutation entropy
ARAutoregressive
RNNRecurrent neural network
GWOGrey wolf optimizer
ICSOImproved chicken swarm optimization
GLSSVMGrey wolf optimizer least squares support vector machine
KNNk-nearest neighbor
FFTFast Fourier transform
AdamAdaptive moment estimation

References

  1. Fan, T.; Sun, T.; Liu, H.; Xie, X.; Na, Z. Spatial-Temporal Genetic-Based Attention Networks for Short-Term Photovoltaic Power Forecasting. IEEE Access 2021, 9, 138762–138774. [Google Scholar] [CrossRef]
  2. Dimd, B.D.; Völler, S.; Cali, U.; Midtgård, O.M. A Review of Machine Learning-Based Photovoltaic Output Power Forecasting: Nordic Context. IEEE Access 2022, 10, 26404–26425. [Google Scholar] [CrossRef]
  3. Alhmoud, L.; Nawafleh, Q. Short-Term Load Forecasting for Jordan Power System Based on NARX-ELMAN Neural Network and ARMA Model. IEEE Can. J. Electr. Comput. Eng. 2021, 44, 356–363. [Google Scholar] [CrossRef]
  4. Nguyen, T.N.; Müsgens, F. What drives the accuracy of PV output forecasts? Appl. Energy 2022, 323, 119603. [Google Scholar] [CrossRef]
  5. Fernandez-Jimenez, L.A.; Monteiro, C.; Ramirez-Rosado, I.J. Short-term probabilistic forecasting models using Beta distributions for photovoltaic plants. Energy Rep. 2023, 9, 495–502. [Google Scholar] [CrossRef]
  6. Shi, J.; Chen, Y.; Cheng, X.; Yang, M.; Wang, M. Four-Stage Space-Time Hybrid Model for Distributed Photovoltaic Power Forecasting. IEEE Trans. Ind. Appl. 2022, 59, 1129–1138. [Google Scholar] [CrossRef]
  7. Ding, R.; Xu, X. Non-Classical K-Nearest Neighbor Clustering Algorithm. In Proceedings of the 2022 2nd International Conference on Networking, Communications and Information Technology (NetCIT), Manchester, UK, 26–27 December 2022. [Google Scholar]
  8. Kazemi, A.; Boostani, R.; Odeh, M.; AL-Mousa, M.R. Two-Layer SVM, Towards Deep Statistical Learning. In Proceedings of the 2022 International Engineering Conference on Electrical, Energy, and Artificial Intelligence (EICEEAI), Zarqa, Jordan, 29 November–1 December 2022. [Google Scholar]
  9. Fan, Y.; Lei, W. Wind Speed Prediction Based on Gradient Boosting Decision Tree. In Proceedings of the 2022 International Conference on Big Data, Information and Computer Network (BDICN), Sanya, China, 20–22 January 2022. [Google Scholar]
  10. Gaboitaolelwe, J.; Zungeru, A.M.; Yahya, A.; Lebekwe, C.K.; Vinod, D.N.; Salau, A.O. Machine Learning Based Solar Photovoltaic Power Forecasting: A Review and Comparison. IEEE Access 2023, 11, 40820–40845. [Google Scholar] [CrossRef]
  11. Yao, T.; Wang, J.; Wu, H.; Zhang, P.; Li, S.; Xu, K.; Liu, X.; Chi, X. Intra-Hour Photovoltaic Generation Forecasting Based on Multi-Source Data and Deep Learning Methods. IEEE Trans. Sustain. Energy 2021, 13, 607–618. [Google Scholar] [CrossRef]
  12. Mayer, M.J.; Yang, D. Pairing ensemble numerical weather prediction with ensemble physical model chain for probabilistic photovoltaic power forecasting. Renew. Sustain. Energy Rev. 2023, 175, 113171. [Google Scholar] [CrossRef]
  13. Wang, Y.; Wang, X.; Hao, D.; Sang, Y.; Xue, H.; Mi, Y. Combined ultra-short-term prediction method of PV power considering ground-based cloud images and chaotic characteristics. Sol. Energy 2024, 274, 112597. [Google Scholar] [CrossRef]
  14. Han, S.; Qiao, Y.H.; Yan, J.; Liu, Y.Q.; Li, L.; Wang, Z. Mid-to-long term wind and photovoltaic power generation prediction based on copula function and long short term memory network. Appl. Energy 2019, 239, 181–191. [Google Scholar] [CrossRef]
  15. Ding, S.; Li, R.; Tao, Z. A novel adaptive discrete grey model with time-varying parameters for long-term photovoltaic power generation forecasting. Energy Convers. Manag. 2021, 227, 113644. [Google Scholar] [CrossRef]
  16. Liu, Z.F.; Chen, X.R.; Huang, Y.H.; Luo, X.F.; Zhang, S.R.; You, G.D.; Qiang, X.Y.; Kang, Q. A novel bimodal feature fusion network-based deep learning model with intelligent fusion gate mechanism for short-term photovoltaic power point-interval forecasting. Energy 2024, 303, 131947. [Google Scholar] [CrossRef]
  17. Dong, D.; Wen, F.; Zhang, Y.; Qiu, W. Application of XGboost in Electricity Consumption Prediction. In Proceedings of the 2023 IEEE 3rd International Conference on Electronic Technology, Communication and Information (ICETCI), Changchun, China, 26–28 May 2023. [Google Scholar]
  18. Jiang, Z.; Wang, H.; Xu, R.; Yu, X.; Song, S.; Yu, X. Wind Power Point Prediction Based on VMD-GWO-LSTM. In Proceedings of the 2023 3rd International Conference on Energy, Power and Electrical Engineering (EPEE), Wuhan, China, 15–17 September 2023. [Google Scholar]
  19. Zhu, R.; Gong, X.; Zhang, J. Forecast of Photovoltaic Power Generation Based on EEMD-MPE-LSSVM. China Meas. Test 2021, 47, 158–162. [Google Scholar]
  20. Al-Ali, E.M.; Hajji, Y.; Said, Y.; Hleili, M.; Alanzi, A.M.; Laatar, A.H.; Atri, M. Solar Energy Production Forecasting Based on a Hybrid CNN-LSTM-Transformer Model. Mathematics 2023, 11, 676. [Google Scholar] [CrossRef]
  21. Bazionis, I.K.; Kousounadis-Knousen, M.A.; Katsigiannis, V.E.; Catthoor, F.; Georgilakis, P.S. An Advanced Hybrid Boot-LSTM-ICSO-PP Approach for Day-Ahead Probabilistic PV Power Yield Forecasting and Intra-Hour Power Fluctuation Estimation. IEEE Access 2024, 12, 43704–43720. [Google Scholar] [CrossRef]
  22. Yuan, J.; Li, H.; Xie, B.; He, B.; Jiang, W.; Xu, J. Short Term Forecasting of Photovoltaic Output Based on Feature Factor Selection and IVMD-GLSSVM. Electron. Meas. Technol. 2024, 46, 77–83. [Google Scholar]
  23. Sarmas, E.; Spiliotis, E.; Stamatopoulos, E.; Marinakis, V.; Doukas, H. Short-term photovoltaic power forecasting using meta-learning and numerical weather prediction independent Long Short-Term Memory models. Renew. Energy 2023, 216, 118997. [Google Scholar] [CrossRef]
  24. Paulescu, M.; Blaga, R.; Dughir, C.; Stefu, N.; Sabadus, A.; Calinoiu, D.; Badescu, V. Intra-hour PV power forecasting based on sky imagery. Energy 2023, 279, 128135. [Google Scholar] [CrossRef]
  25. Ali, M.; Prasad, R.; Jamei, M.; Malik, A.; Xiang, Y.; Abdulla, S.; Deo, R.C.; Farooque, A.A.; Labban, A.H. Short-Term Wave Power Forecasting with Hybrid Multivariate Variational Mode Decomposition Model Integrated with Cascaded Feedforward Neural Networks. Renew. Energy 2024, 221, 119773. [Google Scholar] [CrossRef]
  26. Cao, R.; Tian, H.; Li, D.; Feng, M.; Fan, H. Short-Term Photovoltaic Power Generation Prediction Model Based on Improved Data Decomposition and Time Convolution Network. Energies 2023, 17, 33. [Google Scholar] [CrossRef]
  27. Zhuang, W.; Li, Z.; Wang, Y.; Xi, Q.; Xia, M. GCN–Informer: A Novel Framework for Mid-Term Photovoltaic Power Forecasting. Appl. Sci. 2024, 14, 2181. [Google Scholar] [CrossRef]
  28. Peng, T.; Fu, Y.; Wang, Y.; Xiong, J.; Suo, L.; Nazir, M.S.; Zhang, C. An Intelligent Hybrid Approach for Photovoltaic Power Forecasting Using Enhanced Chaos Game Optimization Algorithm and Locality Sensitive Hashing Based Informer Model. J. Build. Eng. 2023, 78, 107635. [Google Scholar] [CrossRef]
  29. Wu, Z.; Pan, F.; Li, D.; He, H.; Zhang, T.; Yang, S. Prediction of Photovoltaic Power by the Informer Model Based on Convolutional Neural Network. Sustainability 2022, 14, 13022. [Google Scholar] [CrossRef]
  30. Zhou, Y.; Liu, L.; Li, H. A High-Performance Photovoltaic Module-Integrated Converter (MIC) Based on Cascaded Quasi-Z-Source Inverters (qZSI) Using eGaN FETs. IEEE Trans. Power Electron. 2012, 28, 2727–2738. [Google Scholar] [CrossRef]
  31. Wei, J.; Wu, X.; Yang, T.; Jiao, R. Ultra-Short-Term Forecasting of Wind Power Based on Multi-Task Learning and LSTM. Int. J. Electr. Power Energy Syst. 2023, 149, 109073. [Google Scholar] [CrossRef]
  32. Xu, Y.; Zhao, J.; Wan, B.; Cai, J.; Wan, J. Flood Forecasting Method and Application Based on Informer Model. Water 2024, 16, 765. [Google Scholar] [CrossRef]
Figure 1. The overall framework of informer.
Figure 1. The overall framework of informer.
Applsci 14 06279 g001
Figure 2. Flowchart of short-term PV power generation based on MVMD and informer.
Figure 2. Flowchart of short-term PV power generation based on MVMD and informer.
Applsci 14 06279 g002
Figure 3. IMF chart after MVMD.
Figure 3. IMF chart after MVMD.
Applsci 14 06279 g003
Figure 4. IMF frequency distribution chart after MVMD.
Figure 4. IMF frequency distribution chart after MVMD.
Applsci 14 06279 g004
Figure 5. Actual and predicted power values for the target day.
Figure 5. Actual and predicted power values for the target day.
Applsci 14 06279 g005
Figure 6. Boxplot of errors for the target day.
Figure 6. Boxplot of errors for the target day.
Applsci 14 06279 g006
Figure 7. Example plots of PV power prediction for different seasons.
Figure 7. Example plots of PV power prediction for different seasons.
Applsci 14 06279 g007
Figure 8. MAPE error plot for different quarters.
Figure 8. MAPE error plot for different quarters.
Applsci 14 06279 g008
Table 1. PV power MIC correlation coefficient.
Table 1. PV power MIC correlation coefficient.
Full-Time PeriodCorrelation Coefficient
Time
Period
Total Cloud CoverIrradiancePressurePrecipitationRelative Humidity2 m TemperatureSurface Temperature70 m Wind Speed70 m Wind Direction
January to March0.3390.9190.0780.0630.1880.1130.3120.1190.124
April to June0.3090.9350.0920.0750.2410.1680.3970.1420.110
July to September0.2830.9590.0780.0710.4500.3450.6250.1290.071
October to December0.3290.9430.0840.0610.2970.2150.4670.1310.094
Year 20230.2700.9020.0610.0530.1620.1610.3600.0550.065
Power-Generation–Time Period Correlation Coefficient
Time
Period
Total Cloud CoverIrradiancePressurePrecipitationRelative Humidity2 m TemperatureSurface Temperature70 m Wind Speed70 m Wind Direction
January to March0.4120.7220.1110.0880.2560.1570.3880.1650.147
April to June0.3540.7290.1050.1000.2530.1600.3870.1300.129
July to September0.3070.7820.1010.0750.3980.3540.6190.1240.089
October to December0.3430.7810.1100.1010.3140.1590.3130.1430.140
Year 20230.3020.7260.1090.0520.2260.1370.2740.0830.080
Table 2. Key parameter settings for the prediction model on the target day.
Table 2. Key parameter settings for the prediction model on the target day.
Parameter NameSetting
batch_size32
dropout0.05
learning_rate0.0001
enc_in1
dec_in1
n_heads16
e_layers1
s_layers[3, 2, 1]
c_out1
d_model1024
seq_len288
label_len96
pred_len96
Table 3. Evaluation metrics for prediction results.
Table 3. Evaluation metrics for prediction results.
RMSE (MW)MAE (MW)MAPE (%)Training Duration (s)
MVMD-Informer0.9390.7584.311369
Informer2.501.595.45623
MVMD-CNN-LSTM3.241.916.311820
CNN-LSTM5.883.717.57826
MVMD-WaveNet1.881.115.011582
WaveNet3.241.855.92733
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xu, R.; Zheng, J.; Mei, F.; Yang, X.; Wu, Y.; Zhang, H. Short-Term Photovoltaic Power Generation Based on MVMD Feature Extraction and Informer Model. Appl. Sci. 2024, 14, 6279. https://doi.org/10.3390/app14146279

AMA Style

Xu R, Zheng J, Mei F, Yang X, Wu Y, Zhang H. Short-Term Photovoltaic Power Generation Based on MVMD Feature Extraction and Informer Model. Applied Sciences. 2024; 14(14):6279. https://doi.org/10.3390/app14146279

Chicago/Turabian Style

Xu, Ruilin, Jianyong Zheng, Fei Mei, Xie Yang, Yue Wu, and Heng Zhang. 2024. "Short-Term Photovoltaic Power Generation Based on MVMD Feature Extraction and Informer Model" Applied Sciences 14, no. 14: 6279. https://doi.org/10.3390/app14146279

APA Style

Xu, R., Zheng, J., Mei, F., Yang, X., Wu, Y., & Zhang, H. (2024). Short-Term Photovoltaic Power Generation Based on MVMD Feature Extraction and Informer Model. Applied Sciences, 14(14), 6279. https://doi.org/10.3390/app14146279

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop