7.1. Results of Decomposition Step
7.1.1. Decomposition by VMD
VMD has proven effective in decomposing complex energy signals, significantly improving the forecasting of global solar radiation and wind power (SD). The analysis of the results highlighted that the optimal selection of decomposition parameters directly affects the predictive model’s performance, strongly impacting the quality of mode separation and the ability to capture the fundamental characteristics of the signal. One of the most relevant findings was the bandwidth constraint’s role in determining the decomposition quality. Experimental data showed that setting this parameter to 200 allowed for a clear separation of modal components without frequency overlaps. Comparative analysis with other methods revealed that a value too low would result in overly broad modes, potentially merging multiple frequencies, while a higher value would increase computational complexity without significant gains in predictive accuracy.
Another key element was the Lagrangian stop multiplier, which was set as zero divided by the parameter r. This configuration avoided additional penalties on mode deviation during iterations, allowing the model to adapt more naturally to the signal structure. The convergence of the iterative process was determined solely by the predefined tolerance level, ensuring a balance between accuracy and stability in the decomposition. The method also excluded the zero-frequency component, removing any constant mean value from the signal. This made it possible to focus exclusively on dynamic variations, enhancing the ability to identify meaningful patterns for forecasting. Central frequencies of the modes were uniformly initialized, ensuring a balanced distribution of components across the frequency spectrum.
The decomposition using VMD was applied to time series solar radiation and wind power data collected from the Tamanrasset, Brasilia, and Colorado sites. As shown in
Figure 5,
Figure 6 and
Figure 7, the signals were divided into ten main modes plus one residual. This decomposition enabled the precise isolation of the different frequency components, simplifying the modeling process for forecasting tasks.
The analysis of sample entropy (SE) applied to the extracted IMFs revealed that the highest values were observed in modes five to ten for the Tamanrasset and Brasilia sites, whereas in the Colorado site, the highest entropy values were concentrated in the first three modes. Specifically, the maximum entropy value for Tamanrasset reached 1.650 and 1.514 for Brasilia. In contrast, the values for Colorado were lower, with a maximum of 1.284 in the first mode. This suggests that in the first two locations, the higher modes contain more informational complexity, while in Colorado, the most relevant information for prediction is concentrated in the initial components.
The method’s effectiveness was further confirmed by the results obtained in solar radiation and wind speed forecasting. Data from the Tamanrasset site showed that the VMD-based model, combined with sample entropy, artificial neural networks (ANNs), and least-squares regression, achieved a root mean square error (RMSE) of 9.9046 and a mean absolute error (MAE) of 0.0137. The coefficient of determination reached 0.9990, the highest among all models analyzed. For the Brasilia site, slightly lower values were recorded, with an RMSE of 10.4583 and a coefficient of determination of 0.9954. The Colorado site showed slightly lower performance than the other two, with an RMSE of 12.7856 and a coefficient of determination of 0.9921, though still outperforming the competing models tested in the study.
Compared to other approaches, the hybrid VMD-based model significantly improved predictive accuracy over individual ML techniques such as neural networks or support vector regression, demonstrating greater adaptability to the non-stationary nature of energy data. This result was especially evident at the Tamanrasset site, where solar radiation variability was high, and the decomposition strategy effectively isolated the most impactful fluctuations.
Remark 11. It is worth noting that τ is not the same as in Equation (2). In fact, is used to enforce the reconstruction constraint of the original signal from the extracted modes in the VMD optimization problem. This parameter is updated iteratively within the Lagrange multipliers method to ensure that the sum of the modes reconstructs the original signal. On the other hand, τ is the stopping criterion that determines when the VMD iterative process should terminate. It defines the convergence threshold, i.e., the level of variation between two successive iterations below which the decomposition is considered stable and complete. 7.1.2. Decomposition by ICEEMDAN
The analysis of the results obtained using the ICEEMDAN method highlights how this technique enabled a detailed and accurate decomposition of RES signals from the three study sites, as illustrated in
Figure 8,
Figure 9 and
Figure 10. Compared to the VMD method, which decomposed the signal into a fixed number of modes with predefined frequency constraints (
Figure 5,
Figure 6 and
Figure 7), ICEEMDAN showed greater flexibility in separating components, better adapting to the specific characteristics of each dataset.
One of ICEEMDAN’s main advantages is its ability to reduce mode mixing, ensuring a clearer separation of the intrinsic mode functions. The decomposition generated a series of modes, each associated with a specific frequency range, allowing for the more effective extraction of features useful for forecasting global solar radiation and wind speed.
For the Tamanrasset site, ICEEMDAN decomposition showed that the most informative modes were mainly concentrated between the fifth and tenth modes. The sample entropy (SE) calculated for each mode reached a maximum value of 1.650, indicating significant complexity in the higher-order informative components.
As shown in
Figure 8, the initial modes were dominated by high-frequency fluctuations, mainly associated with noise and short-term variations, while the subsequent modes contained more structured signals with relevant predictive information. The final residual captured the overall trend of the signal, helping to identify long-term patterns. Comparing these results to those obtained with VMD (see
Figure 5), a clear difference emerges in how information is distributed across extracted modes. While VMD divides the signal into a predefined number of modes with fixed frequency bands, ICEEMDAN produces modes that dynamically adapt to the signal’s structure. This leads to more effective noise reduction and clearer separation of meaningful components.
ICEEMDAN produced a different information distribution for the Brasilia site compared to Tamanrasset. SE recorded a maximum value of 1.514, slightly lower than in the first case, indicating lower complexity in the higher-order modes. The modal analysis in
Figure 9 revealed that intermediate modes, particularly between IMF4 and IMF7, contained most of the relevant predictive information. In contrast, the first modes had significant noise, while the residual showed a regular pattern with evident seasonal cycles. A comparison with VMD decomposition (
Figure 6) confirmed that ICEEMDAN ensured more effective separation between informative and noisy components. With VMD, some modes overlapped more regarding information content, while ICEEMDAN provided a clearer distribution across modes, improving the distinction between relevant and irrelevant signals.
ICEEMDAN revealed a different information distribution for the Colorado site from the previous two cases. SE reached a maximum value of 1.284, the lowest among the three sites analyzed, suggesting lower complexity in the informative components. As shown in
Figure 10, the first three modes contained most of the significant predictive information, while the higher-order modes were less relevant than those at the other sites. The residual exhibited less pronounced fluctuations, indicating more stable meteorological conditions in Colorado. Compared to the VMD decomposition (see
Figure 7), ICEEMDAN extracted modes with a more concentrated distribution of information in the initial components, while VMD provided a more uniform spread across the frequency spectrum. This implies that, for the Colorado site, ICEEMDAN was more effective in capturing essential information with fewer components, simplifying the predictive modeling process.
The results show that the two decomposition methods differ significantly in information distribution and signal component separation. ICEEMDAN demonstrated greater flexibility in mode allocation, adapting better to the specific characteristics of each site. With a fixed number of modes and predefined frequency bands, VMD produced a more uniform decomposition but showed lower adaptability to local signal variations. At the Tamanrasset and Brasilia sites, ICEEMDAN concentrated information in the higher-order modes, while in Colorado, it focused on the initial components, demonstrating its adaptability to local conditions. ICEEMDAN more effectively reduced high-frequency noise and separated informative components with greater clarity compared to VMD.
These differences between decomposition methods directly impacted the performance of the forecasting models. The prediction results indicated that the ICEEMDAN-based model, combined with SE and ANN, achieved slightly better performance than the VMD-based model. Specifically, for the Tamanrasset site, ICEEMDAN reached a root mean square error (RMSE) of 9.7498, slightly lower than the 9.9046 obtained with VMD. For the Brasilia site, the RMSE with ICEEMDAN was 10.2157, compared to 10.4583 with VMD, confirming a slight advantage for the former. Finally, for the Colorado site, ICEEMDAN achieved an RMSE of 12.5132, outperforming the 12.7856 recorded with VMD.
These results demonstrate that combining ICEEMDAN decomposition, component selection via sample entropy, and deep learning models provides more accurate forecasts than those based on VMD. ICEEMDAN’s enhanced adaptability to the specific characteristics of the signals improved the quality of information separation, reduced overall error, and increased the reliability of the predictions.
7.1.3. Application of SE
The analysis of SE represents a crucial step in selecting the most relevant modal components after time series decomposition using VMD and ICEEMDAN. This method makes it possible to identify the components with the highest degree of complexity and informational variability, thereby improving the quality of forecasts on renewable energy data.
As shown in
Table 6, for the Tamanrasset site, the IMF1–IMF5 components derived from the ICEEMDAN decomposition exhibit significantly higher SE values compared to the remaining components, suggesting that these modes contain the most relevant information for forecasting. In particular, the SE value for IMF1 is 1.006, while for IMF5, it decreases to 0.279, indicating a progressive reduction in complexity in the lower-frequency components. Similarly, for the Brasilia site, ICEEMDAN decomposition reveals that IMF1–IMF5 holds the highest sample entropy values, with IMF1 at 0.240 and IMF5 at 0.350. This trend suggests that the data structure is characterized by greater variability in the high-frequency components. SE values show a similar distribution for the Colorado site: IMF1 records the highest value at 1.284, while IMF5 drops to 0.441. This indicates that the complexity of the information is concentrated in the higher-order components, making them essential for forecasting.
Compared with the VMD decomposition (see
Table 7), it is observed that for the Tamanrasset site, the IMF5–IMF10 components show the highest SE values, with IMF5 reaching 1.514 and IMF10 at 1.135. This suggests that VMD distributes informational complexity into lower-frequency modes compared to ICEEMDAN, potentially improving forecasting stability. For the Brasilia site, the IMF5–IMF10 components obtained from VMD report SE values ranging from 0.475 to 0.484, confirming a more homogeneous information distribution than ICEEMDAN. Finally, the maximum SE value for the Colorado site is recorded in IMF10 at 1.553, suggesting that VMD tends to retain more information in lower-frequency modes.
The results indicate that the choice of decomposition technique influences the selection of the most informative modal components. ICEEMDAN tends to concentrate on high-frequency components, making it particularly effective in capturing rapid fluctuations in the data. On the other hand, VMD better distributes informational complexity across intermediate and low-frequency modes, improving the stability of long-term forecasts.
Integrating SE-based component selection with ML models enhances forecasting accuracy. The results in
Table 6 and
Table 7 demonstrate that selecting modal components with higher SE values leads to a significant reduction in forecasting error, suggesting that an optimal combination of decomposition and mode selection can improve the reliability of solar and wind energy prediction models.
7.2. Performance Comparison of Forecasting Models
Without data decomposition, the predicted values from all models follow the actual values in the forecasting results but with a noticeable time lag. After decomposition using VMD and ICEEMDAN, this delay is reduced. Specifically,
Figure 11 illustrates the predictive performance of the proposed model in terms of statistical accuracy, comparing the forecasts obtained through the hybrid framework with the actual observed values. The results highlight how combining decomposition techniques, sample entropy-based selection, and the hybrid of predictive models enables a highly accurate estimation of solar radiation and wind speed. In particular, a close alignment between the predicted and real data is observed, with reduced forecasting error and strong generalization capability across different geographic contexts. Moreover,
Figure 12 displays the solar radiation forecasting using ELM, SVM, and Bi-LSTM (along with their combination using LSR) before decomposition for the Tamanrasset site. At the top, the original trend (
) is shown overlaid with the forecast curve generated by ELM, while immediately below, the overlays between (
) and the forecasted curves (
) using SVM, Bi-LSTM, and LSR, respectively, are illustrated.
Similarly,
Figure 12 and
Figure 13 show the same predictive trends for the Tamanrasset site, after decomposing the signals using VMD and ICEEMDAN.
Table 8,
Table 9 and
Table 10 provide a detailed overview of the weights associated with the different models for each site, namely Tamanrasset, Brasilia, and Colorado sites, respectively, and the preprocessing method.
For the Tamanrasset site, Bi-LSTM emerges as the dominant model with a weight of 1.10 without preprocessing, while SVM and ELM have very low and negative weights. When VMD is applied, the effectiveness of SVM improves significantly, with a weight reaching 49.32%, almost equivalent to Bi-LSTM’s weight of 55.99%. While still providing a weak contribution, ELM shows a slight improvement with a positive weight of 0.53. With the application of ICEEMDAN, the impact on SVM is significant, with a weight of 37%, while Bi-LSTM continues to dominate with a weight of 50.48%. ELM achieves a positive weight of 12.39%, indicating that this decomposition technique allows for a more balanced contribution of all models to the prediction. Therefore, for the Tamanrasset site, ICEEMDAN seems to improve the performance of SVM and ELM, balancing the contributions of the various models.
Without any treatment, Bi-LSTM has a dominant contribution for the Brasilia site with a weight of 1.13, while SVM and ELM show negative weights of −0.03 and −0.11, respectively. The application of VMD improves the performance of all models, with BiLSTM remaining dominant with a weight of 75.98%, followed by SVM with a contribution of 21.12%, and ELM showing an improvement with a positive weight of 1.88%. With the application of ICEEMDAN, Bi-LSTM continues to dominate with a weight of 1.21, but SVM and ELM remain with inferior performances, with weights of −0.0439 and −0.1951, respectively. These results suggest that, even with preprocessing techniques such as VMD and ICEEMDAN, Bi-LSTM remains the most effective model for this site, with SVM benefiting more from VMD but never reaching the performance level of Bi-LSTM.
For the Colorado site, Bi-LSTM is again the dominant model without preprocessing, with a contribution of 70%. The SVM model shows a moderate contribution (23.6%), while ELM provides the lowest contribution (5.9%). When VMD is applied, Bi-LSTM’s effectiveness increases dramatically, with a weight of 98%. ELM shows a negative weight of −6.4%, indicating that this model does not benefit from VMD preprocessing. The SVM model maintains a moderate performance with a weight of 8.5%. With the application of ICEEMDAN, the effect on Bi-LSTM is reduced, but the model remains dominant with a weight of 65%. SVM benefits from the application of ICEEMDAN, with the weight increasing to 49.38%, while ELM continues producing poor results with a negative weight of −13%. Overall, ICEEMDAN helps balance the contribution of the models, but Bi-LSTM remains the most effective model for the Colorado site.
Overall, the application of VMD proves to be highly beneficial for Bi-LSTM, significantly increasing its weight, especially for the Colorado site, where BiLSTM becomes the dominant model. However, VMD does not seem helpful for ELM, which sometimes shows negative weights, suggesting that ELM is unsuitable for these datasets, regardless of the preprocessing treatment. ICEEMDAN, on the other hand, proves to be particularly useful for improving the performance of SVM and, in some cases, helps balance the contribution of the various models. However, Bi-LSTM remains the dominant model in nearly all cases, with SVM showing improvements at the Brasilia and Tamanrasset sites but never surpassing Bi-LSTM’s performance.
Table 11 summarizes the evaluation metric results for the Tamanrasset site using the various approaches tested. From the analysis, it emerges that individual models without decomposition deliver similar performance, with RMSE values around 70 and
near 0.954, suggesting that these models can not accurately capture the characteristics of the time series. Adopting Bi-LSTM significantly improves, with an RMSE of 38.62 and an
of 0.9865, demonstrating a stronger ability to model data variability. The integration of the LSR method further improves performance, reducing RMSE to 37.24 and increasing
to 0.9869. The application of the ICEEMDAN decomposition shows moderate improvements in the ELM and SVM models, with a slight reduction in RMSE. The inclusion of the same technique in Bi-LSTM and LSR models allows for greater predictive accuracy, with RMSE values of 40.69 and 39.75 and a coefficient of determination reaching 0.9844 in the case of the LSR-based model.
The use of VMD decomposition yields a more significant improvement than ICEEMDAN and the models without any decomposition. Applying this technique to the SVM and LSR models produces much more accurate results, with the former achieving an RMSE of 14.29 and an MAE of 0.0181 and the latter recording an RMSE of 23.21, an MAE of 0.0137, and a coefficient of determination of 0.9990. This indicates that combining VMD with LSR represents the most effective approach among those tested, ensuring extremely high predictive accuracy. The differentiated weighting of the most relevant factors through least-squares regression thus significantly improves forecast performance, making this model the best-performing among those analyzed (2.13% error and 99.97% ).
Regarding the Brasilia site, the error evaluation results for the four models applied to the three datasets are presented in
Table 12. The results show that models without decomposition, such as ELM and SVM, perform poorly, with RMSE values of 127.97 and 122.90, respectively, and relatively low
values around 0.88. Using Bi-LSTM significantly improves the forecast, reducing the RMSE to 63.12 and increasing
to 0.97. The LSR-based model provides a further enhancement, achieving an RMSE of 60.40 and an
of 0.9745, suggesting that this technique can optimize the forecasting process.
Applying ICEEMDAN decomposition to the ELM and SVM models does not significantly improve, with the RMSE values remaining high. However, when the same technique is applied to the Bi-LSTM and LSR models, forecasting accuracy improves considerably, with RMSE values dropping to 56.65 and 52.54, respectively, and increasing to 0.9790 in the case of LSR. These results indicate that decomposition can offer a competitive advantage, but the benefits depend on the model it is combined with.
The integration of VMD decomposition leads to a significant performance improvement compared to ICEEMDAN and models without decomposition. The results show that LSR-VMD is the best-performing model, with an RMSE of 9.90 and an of 0.9956, confirming this combination’s ability to enhance forecast quality drastically. This is followed in order of accuracy by VMD-Bi-LSTM, VMD-SVM, and VMD-ELM, with the first achieving an RMSE of 14.63 and an of 0.9979. These findings suggest that VMD decomposition enables a more effective separation of the signal’s informative components, significantly improving prediction accuracy, especially when combined with advanced methods such as Bi-LSTM and LSR.
An analysis of the results obtained for the Colorado site (see
Table 13) reveals that the use of VMD led to a significant improvement in performance compared to models that do not include a decomposition phase. In particular, the SVM-VMD and LSR-VMD models outperformed all others, with RMSE values of 2.2942 and 2.1823, respectively, and a coefficient of determination of 0.9997 in both cases. This outcome highlights how combining VMD with advanced regression strategies can capture time series variations more effectively, reduce error, and enhance predictive capability.
The ICEEMDAN-based approach showed improvements over models without decomposition but did not reach the performance level of VMD. The LSR-ICEEMDAN model achieved an RMSE of 10.6804 and an of 0.9941, confirming good predictive ability, though still lower than the VMD-based models. Similarly, the Bi-LSTM-ICEEMDAN model showed decent accuracy, with an RMSE of 11.5208 and an MAE of 0.0414, although inferior to the VMD alternatives.
Among the models without decomposition, Bi-LSTM performed better than ELM and SVM, with an RMSE of 12.8850 and an of 0.9920. However, the integration of VMD led to a substantial improvement in accuracy, significantly reducing prediction error. This demonstrates that time series decomposition is crucial in optimizing forecast performance.
Overall, the results indicate that the most effective method for the Colorado site is the combination of VMD with LSR, which achieved the lowest error rate and the highest value. The effectiveness of VMD decomposition is attributed to its ability to isolate the most informative modal components, eliminate noise, and facilitate the extraction of relevant patterns for prediction. Although ICEEMDAN proved helpful in improving the separation of frequency components compared to baseline models, its contribution was less substantial than that of VMD.
These findings confirm the importance of integrating advanced decomposition methods into predictive models to enhance forecast quality and improve uncertainty handling in energy time series.
Figure 20,
Figure 21 and
Figure 22 show the RMSE for each model at each site, clearly highlighting that the SVM-ELM-Bi-LSTM-LSR model outperforms the best individual model (Bi-LSTM). This is due to the aggregation strategy, which sums the products of the weights and the forecasts generated by the three models, assigning greater weight to the higher-performing ones. Additionally, Bi-LSTM enhances the model’s ability to retain long-term information, while the attention mechanism highlights the most critical features through the dynamic weighting of details after the Bi-LSTM output. In summary, the prediction accuracy of the combined model surpasses that of the other models analyzed, achieving better results than individual approaches.
Remark 12. To evaluate the effectiveness of the proposed forecasting framework, a systematic comparison was conducted with several baseline models commonly used in the literature. Specifically, the following approaches were implemented and tested: (i) the ARIMA model, representative of traditional statistical time series methods; (ii) a Bi-LSTM model applied directly to the raw data, without any decomposition or entropy-based selection; and (iii) a standard SVM model applied without any pre-processing phase. The results clearly demonstrate that the proposed hybrid framework, combining decomposition (VMD or ICEEMDAN), SE-based selection, and LSR, significantly outperforms all baseline models. In particular, the proposed method achieves up to a 30% reduction in forecasting error, with a coefficient of determination approaching 0.999.