1. Introduction
Photovoltaic (PV) energy is regarded as one of the most promising opportunities in the renewable energy industry due to its clean and abundant nature [
1]. In recent years, with the improvement of technology and capital investment, PV electric generation has received more attention worldwide [
2]. The dramatic drop in solar PV costs has fueled a rapid expansion in global capacity, with solar expected to account for 80% of renewable energy growth by 2030, surpassing wind and hydropower as the largest renewable source [
3]. However, due to the influence of factors such as environmental changes, PV power output has large fluctuations [
4,
5]. Therefore, accurately predicting PV power generation based on environmental variables is an important task, as it can optimize power dispatching, ensure grid stability, and improve energy utilization efficiency and economic benefits.
PV power forecasting is an important topic, but in actual systems, it needs to take into account many issues, such as the time stamp, weather data collection, input correlation analysis, data pre- and post-processing, etc. [
6,
7,
8]. In past studies, physical and statistical methods were mainly used to address this type of problem [
9]. Based on the fundamental physical interactions between solar panel cells and the environment, physical models predict the output of PV systems by the design of the mathematical formula model, and their accuracy is highly sensitive to input data quality via dynamic weather conditions [
10,
11]. Statistical methods analyze the relationship between meteorological factors and PV power generation, with accuracy dependent on the forecast horizon and input data quality [
12]. Widely used techniques include regression and autoregression [
13,
14]; furthermore, autoregressive moving average (ARMA) [
15], autoregressive-integrated moving average (ARIMA) [
16], and exponential smoothing [
17] have also been applied to PV power forecasting. Their prediction performance is good for linear data, but they face challenges with the nonlinear relationships between complex variables.
With the development of computer technology, machine learning (ML) has emerged as an important approach for PV power forecasting; techniques such as support vector regression (SVR) [
18,
19,
20], decision tree (DT) [
21], and extreme learning machine (ELM) [
22] have been applied. These conventional ML methods are promising approaches, but they struggle with large datasets and are overly dependent on the complexity of the data and feature extraction. To enhance the ability to process big data, neural network (NN) algorithms were introduced to deal with the problem of PV power forecasting, increasing the accuracy of prediction [
23,
24,
25,
26]. Recurrent neural networks (RNNs) and long short-term memory (LSTM) methods in deep learning (DL) are also applicable to PV prediction problems, as they can fully extract hidden information in time series and capture the correlation between historical data and future forecasts [
27,
28,
29].
With the widespread availability of high-performance computing servers, particularly GPU-based computing devices and algorithms, the application of advanced models in PV power forecasting is no longer limited to ML approaches of a simple structure. An improved sparrow search algorithm (ISSA)-optimized LSTM (ISSA-LSTM) model [
30] and temporal convolutional networks (TCNs) have been employed to achieve high-accuracy, hour-level PV power forecasting under various weather conditions and seasons [
31]. A hybrid model combining one-dimensional convolutional neural networks (1D-CNNs) and Transformer architecture achieves multi-step forecasting from 5 min to hour-level resolutions through multi-timescale fusion [
32]. Artificial neural networks (ANNs), combined with gray wolf optimization (GWO) and genetic algorithms (GAs), have been utilized to optimize model structures and improve short-term forecasting accuracy [
33]. The Fourier graph neural network (FourierGNN) captures spatiotemporal dependencies in PV data by constructing a hypervariable graph and applying Fourier transforms [
34]. Most of these studies focus on short-term forecasting at minute- and hour-level resolutions. This is because, for short-interval data, e.g., 5 min intervals, the length of the daily time series becomes too long, posing challenges for memory-based DL models. Additionally, training neural network models on such long sequences often results in GPU memory overload, making high-performance hardware a necessary requirement for these tasks.
The time-series dense encoder (TiDE) is a multi-layer perceptron (MLP)-based encoder–decoder architecture, which combines the efficiency of linear models with the expressive power of nonlinear models [
35]. Through feature projection and dense residual blocks, TiDE effectively integrates dynamic covariates and static covariates. In the architecture of TiDE, global linear residual connections preserve linear trend information, enhancing the model’s robustness in scenarios with complex dependencies. With its linear computational complexity, TiDE enables faster training and inference compared to Transformer-based models, making it particularly suitable for medium- to long-term time series forecasting. For PV power forecasting, TiDE can demonstrate strong capabilities in modeling nonlinear relationships between generation output and dynamic environmental variables such as solar irradiance, temperature, and seasonal effects. Its residual structure separates linear trends, like seasonal cycles, from nonlinear fluctuations, like abrupt weather changes, thereby improving prediction accuracy. Dimensionality reduction on large-scale covariates also reduces noise introduced by high-dimensional meteorological data, ensuring model stability under complex environmental conditions. For short-term forecasting, TiDE’s computational efficiency supports high-frequency data updates, meeting the real-time scheduling demands of PV power systems. These features make TiDE a highly efficient and reliable solution for PV power forecasting.
In this paper, we study the application of TiDE in forecasting short-term (5 min and 1 h ahead) and medium-term (1-day-ahead) PV power outputs using environmental variables. A dataset spanning three years is used for training, with the following one-year data serving as the test set to evaluate TiDE’s forecasting accuracy. Additionally, we analyze forecasting performance under various weather conditions, i.e., sunny, cloudy, and rainy, to assess TiDE’s power to capture both weather trends and abrupt changes in one-day-ahead forecasts. We also evaluate its performance across different seasons of climatic conditions to test the model’s adaptability and accuracy in diverse environments. Furthermore, we examine TiDE’s responsiveness to environmental input variables, its flexibility in handling different conditions, and the impact of forecasting errors on the integration of PV power into the grid. The results emphasize TiDE’s exceptional forecasting accuracy and strong adaptability to varying weather patterns, while also identifying its challenges in dealing with sudden changes. The remaining content of this paper is structured in the following manner. In
Section 2, we provide the details of data preprocessing and the specific architecture of the TiDE algorithm used in the study.
Section 3 shows the results of 5 min, 1 h, and 1-day-ahead PV power forecasting of TiDE. And in this part, we also analyze the one-day-ahead forecasting values of different weather types and seasons.
Section 4 presents a discussion of TiDE’s sensitivity to input environmental variables, algorithmic versatility, and the implications of forecasting errors on PV grid integration.
Section 5 presents the summary of this work.
4. Discussion
4.1. Sensitivity of TiDE to Environmental Variables in PV Generation Forecasting
In
Section 3.3 and
Section 3.4, we presented the one-day-ahead forecasting accuracy values under varying weather and climate conditions. These results reflect the influences of different environmental variables on model performance, particularly under extreme climatic scenarios, which can lead to significant prediction deviations. In this section, we analyze the sensitivity of the TiDE model to specific environmental variables.
We begin with the analysis of rainfall. Due to data limitations, only total daily rainfall is available, which constrains our discussion to a macro level. Alice Springs is characterized by a tropical desert climate, with less than 10% of days in a year experiencing rainfall and an average annual precipitation of 285 mm. Based on the dataset, the maximum recorded daily rainfall is 24.6 mm. We categorize rainy days into light rain (daily rainfall below 10 mm) and moderate rain (daily rainfall above 10 mm). Under light rain conditions, TiDE yields an R
2 of 0.0182, a mean absolute error (MAE) of 0.9254, and a root mean square error (RMSE) of 1.3093. In contrast, under moderate rain, the model’s R
2 increases to 0.0874, while MAE and RMSE increase to 1.1464 and 1.5514, respectively. These low R
2 values and high error metrics indicate poor performance in predicting PV output on rainy days, which is primarily due to the limited number of rainy-day samples and the exclusion of rainfall as an input variable, as discussed in
Section 2.1.
We next examine the impact of temperature on prediction accuracy. This analysis focuses on extreme temperatures, specifically high temperatures exceeding 35 °C and low temperatures below 0 °C, which can negatively affect PV system performance through overheating or condensation. Under high-temperature conditions, TiDE achieves an R2 of 0.628, an MAE of 0.556, and an RMSE of 0.866, indicating acceptable predictive performance. However, under low-temperature conditions, the model’s R2 drops drastically to 0.008, while MAE and RMSE are 0.172 and 0.592, respectively. This suggests prediction failure, possibly caused by abnormal PV outputs. Moreover, since our analysis focuses on daytime hours (07:00–19:00), most nighttime hours with sub-zero temperatures are excluded, resulting in a limited number of relevant samples and contributing to poor performance in cold conditions.
Humidity is another key factor influencing prediction accuracy. In the dataset, humidity shows a strong negative correlation with PV output, which can be attributed to the fact that low humidity often corresponds to sunny, cloudless conditions with high photovoltaic efficiency in tropical desert climates. When humidity is below 30%, TiDE achieves an R2 of 0.757, an MAE of 0.441, and an RMSE of 0.753. As humidity increases to the medium range (30–70%), R2 decreases to 0.620, with corresponding MAE and RMSE values of 0.593 and 0.950. When humidity exceeds 70%, the model’s R2 drops sharply to −1.2697, indicating prediction failure. These results suggest that prediction accuracy declines with increasing humidity, especially under conditions conducive to condensation.
Cloud cover is estimated indirectly due to limited meteorological data. Although direct cloud cover measurements are unavailable, the dataset includes global horizontal radiation (GHI), which allows us to calculate the clear sky index (CSI) using the pvlib Python package. By comparing GHI with theoretical clear sky radiation, we compute CSI as , and estimate cloud cover as . Under clear or mostly clear conditions (cloud cover below 40%), TiDE yields an R2 of 0.692, an MAE of 0.521, and an RMSE of 0.865. As cloud cover increases to 40–70%, corresponding to partly cloudy skies, the R2 declines to 0.167, while MAE and RMSE reach 0.545 and 0.928. When cloud cover exceeds 80%, the R2 value drops significantly to −14.8226, indicating a complete loss of predictive accuracy under overcast conditions.
Extreme weather events such as sandstorms and condensation are also considered. Due to the absence of real-time dust concentration data, we simulate sandstorm conditions by selecting samples with sunny skies, low humidity (below 30%), and high CSI (above 60%). Similarly, condensation-prone conditions are approximated by selecting data with temperatures below 15 °C and humidity above 70%. In both simulated scenarios, TiDE produces negative R2 values, indicating prediction failure. These results suggest that TiDE struggles significantly under extreme environmental conditions, which consequently reduce the overall prediction performance.
4.2. Evaluation of the General Applicability of TiDE in PV Forecasting Scenarios
In this section, we examine the general applicability of TiDE for photovoltaic (PV) power generation forecasting. To this end, we use another dataset from DKASC, specifically the Kaneka system with a 6.0 kW installation, as the experimental subject. Unlike the Calyxo system, the Kaneka system utilizes amorphous silicon, representing a different type of photovoltaic material. Moreover, we select a different set of environmental input variables for TiDE based on their relevance to this system, including relative humidity, global horizontal radiation, diffuse horizontal radiation, total solar radiation received by inclined surfaces, and total scattering on inclined surfaces.
The forecasting results of this dataset for the 5 min ahead, 1 h ahead, and 1-day-ahead horizons are presented in
Table 8. The comparative performance of TiDE and other algorithms is generally consistent with the findings from the Calyxo dataset. In the 5 min ahead short-term forecast, TiDE achieves an MAE of 0.145, which is comparable to that of BiLSTM (0.143) and GRU (0.145), indicating strong average error control. However, its RMSE (0.340) is slightly higher than that of LSTM (0.329) and other models, suggesting limited robustness to abrupt irradiance changes (e.g., instantaneous cloud cover). In the 1 h ahead medium-term forecast, TiDE’s performance declines considerably, with
and MAE = 0.669, significantly lower than LSTM’s performance (
, MAE = 0.267). This degradation is mainly attributed to the limited capacity of the MLP architecture in modeling complex temporal dependencies over medium time horizons. For the one-day-ahead long-term forecast, TiDE achieves the lowest MAE (0.514) among all models, demonstrating a strong ability to capture medium- and long-term patterns (e.g., seasonal cycles). Nevertheless, its RMSE (0.836) remains higher than that of GRU (0.781), revealing substantial deviations during extreme weather events such as rainstorms or sandstorms. These results suggest that TiDE maintains stable performance across datasets generated by different photovoltaic systems and offers certain advantages in medium- and long-term forecasting tasks.
To further investigate the effect of input features, we employ mutual information analysis to re-evaluate the correlation between PV output and environmental variables in the Kaneka system. Environmental factors with mutual information greater than 0.5 are retained, including global horizontal radiation, diffuse horizontal radiation, total solar radiation on inclined surfaces, and total scattering on inclined surfaces; relative humidity is excluded due to its weaker correlation. Using these four inputs, TiDE is re-trained and evaluated. The resulting metrics are as follows: for the 5 min ahead forecast, R
2 = 0.955, MAE = 0.146, RMSE = 0.340; for the 1 h ahead forecast, R
2 = 0.711, MAE = 0.649, RMSE = 0.861; and for the 1-day-ahead forecast, R
2 = 0.721, MAE = 0.515, RMSE = 0.843. Compared to the original results in
Table 8, the performance remains largely unchanged for the 5 min and 1-day forecasts, while the 1 h prediction shows a slight improvement. Nonetheless, TiDE’s performance in the 1 h ahead task continues to lag behind other advanced models, indicating that reducing input dimensionality does not fully resolve the model’s limitations in short- to medium-term forecasting.
4.3. Impact of Forecasting Errors on PV Grid-Connected Operation
Forecasting errors in PV power generation, particularly in 1 h and 1-day horizons, have multifaceted impacts on grid operation and energy management. Short-term prediction errors (e.g., a 1 h MAE of 0.647) pose direct threats to grid stability. When the forecasted output exceeds the actual generation, the grid must urgently dispatch backup power sources, such as gas turbines or energy storage systems, to compensate for the shortfall. The cost of such an emergency response can reach tens of thousands of yuan per hour. Conversely, output underestimation may result in solar energy curtailment or reduced efficiency in load scheduling.
Medium- to long-term forecast errors (e.g., a 1-day RMSE of 0.856) influence power generation planning. Persistent overestimation of PV output can lead to uneconomical load reductions in conventional power plants, whereas underestimation increases reliance on high-cost electricity purchases in the spot market. In extreme weather conditions (e.g., rainy days with an MAE of 0.962), forecast inaccuracies may cause electricity prices to spike by 30–50%, exacerbating system and market volatility.
In energy markets, forecasting errors heighten trading risks and induce price instability. If the deviation in day-ahead market declarations exceeds predefined thresholds (e.g., ±5%), power generators must procure real-time electricity at elevated prices and may incur penalties, significantly eroding profit margins. Additionally, supply–demand imbalances caused by forecast errors can disrupt market pricing mechanisms. For instance, a failure to predict sharp output declines during rainy conditions may lead to short-term regional electricity price surges due to sudden supply shortages.
Forecast uncertainties also constrain financial planning and investment decisions. High short-term errors (e.g., a 1 h RMSE of 0.857) necessitate increased reserve capacity. Empirically, a 0.1 increase in MAE correlates with a 2–3% rise in reserve costs. In the long term, the model’s limitations under extreme climatic conditions (e.g., a summer RMSE of 1.035) may result in the misestimation of PV penetration potential, delay investments in energy storage or grid infrastructure, and ultimately hinder progress toward energy transition objectives.
From a technical perspective, output fluctuations induced by forecast errors can lead to frequency deviations (exceeding ±0.2 Hz) and voltage violations (exceeding ±5%), necessitating the deployment of advanced regulation equipment such as static var generators (SVGs). In extreme cases, abrupt drops in output may trigger protection system malfunctions, increasing the risk of widespread outages.
To mitigate these risks, a comprehensive strategy is required. This includes enhancing the TiDE model’s ability to capture trends and abrupt changes, integrating real-time meteorological data to improve forecast sensitivity, refining market penalty mechanisms to equitably allocate backup costs, and designing cooperative scheduling strategies with energy storage systems. Together, these measures aim to ensure the resilience and stability of power systems under high levels of PV penetration.
5. Conclusions
This study proposes a time-series dense encoder (TiDE)-based deep learning framework for photovoltaic (PV) power forecasting, addressing the challenges of nonlinear dependencies and computational inefficiencies in traditional methods. By integrating a multi-layer perceptron (MLP) encoder–decoder architecture with residual connections, TiDE can achieve high-precision predictions of PV power generation based on environmental variables. Using three years of operational data from a 5.4 kW PV system at DKASC for training and one year of data for testing, the TiDE model demonstrates good performance across varying time horizons and weather conditions.
Key findings indicate that TiDE excels in short-term (5 min ahead) forecasts with R2, MAE, and RMSE values of 0.952, 0.150, and 0.349, comparable to LSTM, GRU, and other models. However, its accuracy decreases for longer horizons, such as 1 h ahead forecasting, highlighting limitations in capturing abrupt environmental changes. Notably, TiDE shows superior adaptability in medium-term (1-day-ahead) forecasting, achieving the lowest MAE (0.507) among the compared models, particularly under complex meteorological conditions such as cloudy and rainy weather. Seasonal analysis further reveals its strengths in stable climates, such as autumn and winter, but also identifies challenges during extreme summer heat and spring dust storms, where RMSE values increase due to unmodeled nonlinear effects.
In this part of the discussion, our experiments show that the performance of the TiDE model is highly sensitive to extreme environmental conditions such as heavy rainfall, high humidity, overcast skies, and condensation scenarios, under which prediction accuracy significantly deteriorates. These limitations are mainly due to the scarcity of representative data samples and the exclusion of some relevant environmental variables. Nevertheless, TiDE shows promising generalizability across different PV systems and material types, as verified using an independent dataset from the Kaneka amorphous silicon system. While TiDE maintains competitive accuracy in short- and medium-term forecasts, its performance remains constrained by the underlying MLP structure. And from a business perspective, PV forecast errors threaten grid stability, increase market risks and costs, and require better models, real-time data, and coordinated energy management.
The innovation of TiDE lies in the introduction of time-intensive encoders, based on multi-layer perceptrons (MLPs), into PV power forecasting. By utilizing residual connections and staged encoding-decoding mechanisms, it effectively integrates both linear trends and nonlinear fluctuations, overcoming the trade-off between computational efficiency and complex dependency modeling found in traditional models. In comparison to recurrent models like LSTM and GRU, TiDE avoids the gradient vanishing/explosion problem and significantly reduces computational complexity. Compared to the Transformer, TiDE’s linear complexity enables real-time processing of high-frequency data and prevents GPU memory overload. Moreover, TiDE efficiently mitigates meteorological noise and retains critical trend information through dynamic covariate dimension reduction and global residual connections. This results in significantly lower MAE (0.791 and 0.962) under challenging weather conditions, such as cloudy and rainy days, and the best MAE (0.507) for 1-day forecasts. This design breaks free from the traditional reliance on attention mechanisms or convolutional structures for medium- and long-term predictions, offering a new paradigm that balances efficiency and robustness for the precise scheduling of PV systems.
The model’s accuracy in trend prediction makes it a viable tool for grid management and energy optimization. Future work could incorporate more environmental data, such as information on dust cover and extreme weather events like thunderstorms, to enhance the robustness and accuracy of predictions in specific geographic regions, such as tropical desert climates. These improvements could further solidify TiDE’s role in advancing renewable energy systems through reliable and scalable PV power forecasting. However, TiDE’s performance declines in the 1 h ahead forecast, suggesting that there is still room for structural improvement in the algorithm. Additional experiments based on the current framework could help refine the model and improve its predictive capabilities.