1. Introduction
As one of the most important renewable energy sources, photovoltaic (PV) power generation has experienced a sharp increase in its proportion within modern power systems nationwide. By the end of April 2025, the national installed power generation capacity in China had reached 3.49 billion kilowatts (equivalent to 349 gigawatts), with a year-on-year increase of 15.9% [
1]. Accurate and reliable PV power forecasting is of great significance for the safe and economical operation of power systems with high PV penetration [
2].
Short-term PV forecasting techniques can be divided into two categories: deterministic forecasting and probabilistic forecasting [
3]. Deterministic forecasting can provide a point prediction of PV power capacity at a specific time but does not reflect the uncertainty of PV output [
4]. This limits its application in power system scheduling and PV generation risk assessments. Probabilistic forecasting methods, based on cumulative distribution functions and probability density functions [
5], can reflect the future distribution of PV power generation and characterize its variability [
6].
Combining non-parametric methods and artificial intelligence for probabilistic forecasting techniques enables efficient quantitative assessment of the uncertainty in photovoltaic power output [
7,
8,
9]. In [
10], it is noted that training convolutional neural networks (CNNs) using the loss functions generated by quantile regression (QR) is challenging due to their non-differentiable nature. This algorithm only compares shallow neural networks and does not incorporate the temporal characteristics of photovoltaic data. Reference [
11] introduces a temporal convolutional network (TCN) for time-series modeling, leveraging the strengths of both CNNs and recurrent neural networks (RNNs) to effectively analyze time series data. In [
12], QR and a TCN are combined to construct a quantile regression temporal convolutional network (QRTCN) for short-term power demand forecasting in power systems, demonstrating that this method outperforms the TCN model in accuracy. However, this approach only focuses on features derived from the mechanisms and neglects the temporal relationships between historical factors.
Facing the increasingly complex variations in photovoltaic power output, existing single-modeling methods are struggling to meet application demands [
13]. Therefore, the emergence of multi-model fusion techniques integrates the advantages of multiple models and gradually becomes a new research direction [
14]. A probability forecasting method combining Kernel Density Estimation (KDE) is proposed in [
15], with Copula functions to obtain appropriate prediction interval ranges. In [
16], a QR probability prediction model is presented based on the minimum absolute shrinkage and selection operator, incorporating shrinkage and selection operators in the estimation process to ensure the model’s good performance. A photovoltaic power interval prediction model is introduced combining extreme learning machine with QR, featuring a learning mechanism with good generalization performance in [
17]. Ref. [
18] combines quantile regression to construct a distributed photovoltaic prediction method based on long short-term memory networks (LSTM) and gated recurrent networks, using multiple loss functions to enhance their ability to identify distributed power sources. In [
19], a new photovoltaic output prediction algorithm is established based on CNN and LSTM by comprehensively considering various meteorological factors, predicting photovoltaic output under different weather conditions. Experimental results demonstrate that this combined model has good interval and probability prediction capabilities. Ref. [
20] shows that at 95%, the Prediction Interval Normalized Average Width (PINAW) reaches 0.066, while at 90%, PINAW narrows to 0.045 with a still satisfactory coverage of 0.920. This balances reliability and resolution, aligning with power dispatch needs for “narrow intervals + high coverage”. However, CNN has limitations in long-term time series modeling: fixed-size convolution kernels and pooling operations mainly capture local patterns and struggle to model long-term dependencies. Consequently, using CNN alone exhibits limited effectiveness in predicting the long-term trends of photovoltaic output. To address this limitation, this study proposes using Temporal Convolutional Network (TCN) for short-term local feature extraction and combining it with Bidirectional Long Short-Term Memory (BiLSTM) to capture long-term time series dependencies, thereby balancing local fluctuations and long-term trends while enhancing the accuracy and reliability of photovoltaic power probabilistic forecasting.
The CNN-LSTM model proposed in Reference [
21] (Li, Z. et al., 2022) can only output deterministic point predictions, failing to characterize PV output uncertainty, and the error exceeds 15% under complex weather; the CNN-LSTM-QR model in Reference [
22] (Wang, H. et al., 2023) is prone to local optima during training due to the non-differentiable PINBALL loss function, and it cannot capture short-term power surge features, with PINAW consistently higher than 0.18; the CNN-LSTM-KDE model in Reference [
23] (Zhang, C. et al., 2024) uses a fixed 12 h input window, failing to fully utilize long-cycle data, and lacks weather-specific verification, with PICP lower than 90% on cloudy days. Compared with existing CNN-LSTM fusion studies (References [
21,
22,
23]), the differences in these studies are mainly reflected in four aspects: First, the traditional CNNs with a TCN, which expands the temporal receptive field, are replaced via dilated convolutions without increasing parameters. Second, the Huber norm is introduced to replace the non-differentiable PINBALL loss, reducing the number of model convergence iterations by 30% and making the interval boundaries output by QR smoother. Third, the dataset is divided into three scenarios (sunny, cloudy, and rainy), optimizing the input window and KDE bandwidth. Fourth, simultaneously outputting “future 4-h power values + 90% confidence interval + continuous probability density distribution” forms a “point-interval-distribution” trinity uncertainty characterization system; the lowest BIC index score proves the optimal fitting degree of the probability distribution, providing a more comprehensive basis for power system risk assessments.
Based on the above analysis, this paper proposes a photovoltaic power probability prediction method based on quantile regression models, integrating a TCN and bidirectional long short-term memory (BiLSTM) to form the TCN-QRBiLSTM probability prediction model. Here, the TCN can replace CNNs to address the issue of elongated convolutional kernels that may arise due to data preprocessing. BiLSTM, compared to LSTM, can better exploit the internal information inherent in the photovoltaic power generation process.
2. Construction of the TCN-QR-BiLSTM Probabilistic Forecasting Model
The probabilistic forecasting model proposed in this paper, TCN-QRBiLSTM, is shown in
Figure 1. First, data preprocessing is performed on historical photovoltaic power and meteorological data, including missing value imputation, outlier removal, and normalization. Then, the processed sequences are used to construct input samples via a sliding window, which are input into the TCN for local temporal feature extraction. Subsequently, the high-dimensional feature vectors output by the TCN are input into the BiLSTM to capture long-term dependency information. Finally, combined with Quantile Regression (QR), prediction intervals are generated, and the probability density forecasting of photovoltaic power is obtained via Kernel Density Estimation (KDE).
To clarify the temporal correlation logic of short-term PV power forecasting, the model adopts a “sliding window” mechanism to construct input sets and output sets, with specific definitions as follows: The length of the input window is set to 24 h (matching the 15 min time resolution of the data, corresponding to 96 time steps). Each input sample contains two types of core features, and all features undergo preprocessing—normalization to the [0, 1] interval via Min-Max normalization, filling of missing values through linear interpolation, and removal of outliers using Isolation Forest. Among them, the historical PV power data are the measured power values (unit: kW) every 15 min within the window, totaling 96 time-series observation points; the synchronized meteorological data include wind direction, wind speed, humidity (dimensionless, with a value range of 0–1), and rainfall (unit: mm), which are aligned in time with the power data. Each of these meteorological variables has 1 observation value every 15 min, forming 96 time-series observation points, respectively. These two types of features are finally integrated into an input vector with dimensions [5, 96].
In the “short-term prediction” scenario, the model outputs PV power forecasts for the next 4 h (this duration is determined by data characteristics, short-term power grid dispatching needs and TCN-QRBiLSTM model capability compatibility), along with the corresponding 90% confidence level prediction interval and probability density distribution. The lower and upper bounds of the interval correspond to the quantiles at α = 0.05 and α = 0.95, respectively.
2.1. TCN
Based on Convolutional Neural Networks (CNN), the Temporal Convolutional Network (TCN) uses causal and dilated convolution layers to replace traditional convolutions. Causal convolutions allow for effective sequence learning, while dilated convolutions provide a sufficient receptive field to capture more information. TCN is more suitable for handling time-series problems compared to fully convolutional networks [
24].
Figure 2 shows the causal dilated convolution structure of the TCN. The input sequence of the TCN-QRBiLSTM model—constructed by sliding window processing of historical photovoltaic (PV) power data and key meteorological data—passes through the causal dilated convolution structure of the TCN Here,
are the input sequences at each time step,
is the dilation factor (which increases exponentially with the model depth), and
is the causal dilated convolution feature at time
.
The color circles distinguish nodes of different layers/functional roles: green circles in the Output Layer represent output nodes; blue circles in the Hidden Layers represent hidden nodes; light green circles in the Input Layer represent input nodes.
The input sequence
passes through the causal dilated convolution structure of the TCN to obtain the causal dilated convolution feature
as follows:
In the formula: is the total number of convolutional filters; is the size of the filter; and are the input sequence values being convolved only with the prior time steps.
The TCN model consists of residual blocks and BN (Batch Normalization) layers. The model structure is shown in
Figure 3. In this structure,
represents the abstract feature obtained after optimizing
through the BN layer at time
.
As shown in
Figure 3, the residual block contains two convolutional units and a nonlinear mapping. The convolutional units can fully extract the deep information of the abstract features, while the nonlinear mapping ensures that the input and output have the same dimensions. The BN (Batch Normalization) layer can improve training speed and standardize the data to maintain a consistent distribution. After optimizing the causal dilated convolution feature at time
through BN, the TCN model produces the abstract feature expressed as:
In the formula, and are the mean and variance of , respectively; and are learnable parameters; and is a small positive constant.
Figure 3 shows the architecture of the model, where the red dashed box depicts the internal structure of a convolutional unit, containing two sequential sub-modules (each composed of causal expansion, normalisation, activation, and regularization operations).
2.2. BiLSTM
LSTM is a variant of recurrent neural networks that can effectively address the vanishing gradient problem encountered during the learning process [
25]. The feature dimensions of the temporal feature vectors (derived from the TCN’s processing of the original input sequence) are fed into the BiLSTM network. Conventional LSTM learning methods typically employ a single forward propagation approach for training, which may not fully exploit the internal information inherent in the photovoltaic power generation process. This paper utilizes BiLSTM networks as shown in
Figure 4,
represents the input time series at each time step, which, after passing through BiLSTM, yields the output vector
combining forward and backward passes to effectively address the limitations of unidirectional LSTM neural networks in information exploration, thereby delving deeper into the intrinsic connections between current photovoltaic power generation data and historical data.
2.3. QR
Quantiles can achieve a discrete representation of the predictive probability distribution, providing the quantiles of the prediction target at specific probability levels [
26]. Using quantiles, prediction intervals can be constructed, making the application more convenient. The value of the prediction target at time
is denoted by
, and the input vector of the prediction model is denoted by
. The quantile
of the prediction target
at the quantile level
can be defined by the following equation:
Prediction intervals can provide the boundaries within which the forecasted variable is expected to vary. The value of the prediction target is covered within this boundary with a certain probability. The prediction interval
for the prediction target at time
with a nominal coverage probability of
is given by:
where
and
are the lower and upper bounds of the prediction interval, respectively. Typically, the lower and upper bounds of the prediction interval correspond to quantile levels that have symmetrical probabilities, which can be expressed as:
Based on the definition of quantiles in Equation (4), the quantiles representing prediction uncertainty can be uniquely approximated by minimizing an appropriate loss function.
In the equation,
is the number of training samples;
is the symmetric loss function, defined as:
Deep learning models are typically trained using gradient descent algorithms, but the pinball loss function is non-differentiable at zero. Therefore, an everywhere differentiable Huber pinball (H-pinball) loss function
is introduced to replace the pinball loss
, defined as:
where
is the threshold, set to a small positive value, and
is the quantile deviation operator.
2.4. KDE
The fusion model incorporating QR methods can obtain conditional quantile results for the predicted variables, but probability density functions cannot be obtained through QR. Predicting the probability of photovoltaic generation requires finding a method to solve the probability density function of the random variable. The problem of solving the probability density function of a random variable is one of the basic problems in probability statistics, and its methods include parameter estimation and non-parameter estimation. Parameter estimation methods require assumptions about the distribution of the data beforehand, but these fundamental assumptions do not lead to satisfactory estimation results. To overcome the shortcomings of parameter estimation methods, Rosenblatt and Emanuel Parzen proposed a non-parameter estimation method, namely the kernel density estimation method.
KDE is a typical non-parametric estimation method used in probability theory to estimate unknown density functions [
27]. The KDE method examines the distribution characteristics of data without relying on any prior knowledge about the data distribution or imposing any assumptions on it. It has been highly valued in both theoretical and applied statistics fields. In specific point photovoltaic forecasting, a set of
conditional quantiles can be obtained for
sample sets, denoted as
. The calculation of this vector KDE is determined by Equation (10).
where
is the total number of samples;
is the bandwidth, and
> 0.
is the kernel function. The kernel function used in this paper is the Gaussian kernel, which can be expressed by the following formula:
The final result is the probability density function of the photovoltaic output.
The innovativeness of the TCN-QRBiLSTM model proposed in this study is mainly reflected in three aspects:
(1) Fusion of Short-Term and Long-Term Features: The TCN focuses on extracting short-term local fluctuation features, while the BiLSTM captures long-term time series dependencies. This design enables unified modeling of short-term fluctuations and long-term trends.
(2) Improvement of QR: Aiming at the non-differentiable issue of traditional quantile loss, Huber norm approximation is introduced. This ensures the differentiability of model training and further improves the accuracy of prediction intervals.
(3) Optimization of Probabilistic Output: The model integrates KDE for probability density estimation, allowing it to not only output confidence intervals but also provide continuous probability distribution information. This enhancement strengthens the ability to characterize the uncertainty of photovoltaic power.
3. Evaluation Metrics
This study selects photovoltaic power data and meteorological data from a distributed photovoltaic (PV) power plant in a region of East China during July, August, and September 2022 as the experimental data. Then, based on cloud cover, rainfall amount, and rainfall duration from numerical weather prediction (NWP), the weather is classified into three types: sunny, cloudy, and rainy, as shown in
Table 1. The time resolution of this dataset is 15 min. A total of 4128 data points (from 1 July to 24 September) within the daily time window of 7:00–19:00 are retained for case analysis. Among these, the first 3840 data points are used as the training set, whereas the remaining 288 (from 25 September to 30 September) serve as the test set. The model parameters of TCN-QRBiLSTM are shown in
Table 1.
3.1. Interval Prediction Evaluation Metrics
3.1.1. Prediction Interval Coverage Probability
The ratio of the total number of points participating in interval coverage to the number of points in the prediction interval. Interval coverage reflects prediction reliability and can be expressed as:
where
denotes the coverage of the area.
is taken as 0 or 1. If the actual data are within the coverage of the area,
is taken as 1, and vice versa.
3.1.2. Percentage Interval Normalized Average Width
The average distance between the upper and lower bounds of the reaction prediction interval. Under the same interval coverage conditions, the prediction is better if the PINAW is smaller.
denotes the mean value of the interval, and denote the upper and lower bounds of the interval prediction at the sampling point, and is the absolute value of the upper and lower bounds of the predicted interval normalised to .
3.1.3. The Comprehensive Evaluation Index for Interval Prediction
To evaluate the overall performance of probability distribution predictions, a lower value indicates a better prediction, and it is expressed as:
3.2. Probabilistic Forecasting Evaluation Metrics
The Bayesian Information Criterion (BIC) is utilized as the probabilistic prediction index. BIC is employed to assess the quality of the model. A smaller BIC value indicates better suitability of the model and simplicity. The calculation formula for BIC is:
In the formula: represents the number of unknown parameters in the model; represents the maximum likelihood function; represents the parameter to be estimated.
4. Experimental Analysis
The TCN-QRBiLSTM model was developed using Python 3.9, with deep learning frameworks TensorFlow 2.10 (for TCN/BiLSTM construction) and Scikit-learn 1.2 (for data preprocessing). Kernel density estimation was implemented via SciPy 1.10.
This study selects photovoltaic power data and meteorological data from a distributed PV power plant in a region of East China during July, August, and September 2022 as the experimental data. Different weather conditions’ data are obtained via the joint measurement of NWP and on-site observations: PV power is measured in real time by inverters at the power plant, while meteorological factors including cloud cover and rainfall are provided by NWP systems and cross-validated with data from on-site sensors to ensure measurement accuracy. Then, based on cloud cover, rainfall amount, and rainfall duration from NWP, the weather is classified into three types: sunny, cloudy, and rainy, as shown in
Table 1. The time resolution of this dataset is 15 min. A total of 4128 data points (from 1 July to 24 September) within the daily time window of 7:00–19:00 are retained for case analysis. Among these, the first 3840 data points are used as the training set, whereas the remaining 288 (from 25 September to 30 September) serve as the test set.
In this paper, one day in each of the similar weather results is selected as a prediction day, and the TCN-QRLSTM model and QRBiLSTM model are used as the reference group for comparison and analysis. In order to better portray the prediction effect of the model, the confidence interval under 90% confidence level is selected.
4.1. Sunny Day Forecast Results
The prediction results of TCN-QRBiLSTM, TCN-QRLSTM and QRBiLSTM for 90% confidence range under sunny conditions are shown in
Figure 5a–c.
The prediction evaluation metrics under the three confidence levels are listed in
Table 2. The results show that all three prediction methods have almost 100% area coverage under clear weather conditions; TCN-QRBiLSTM has the largest area coverage, its prediction range is better than TCN-QRLSTM and QRBiLSTM, and its interval composite evaluation index is the best among the three, with better clarity and higher confidence.
4.2. Cloudy Day Forecast Results
Under cloudy conditions, cloud formation and subsequent movement lead to fluctuations in solar radiation intensity, which introduce challenges for accurate forecasting. These variations can significantly affect the reliability of prediction models.
Figure 6 presents the prediction intervals at the 90% confidence level for three models: TCN-QRBiLSTM, TCN-QRLSTM, and QRBiLSTM.
As shown in the figure, the prediction intervals of TCN-QRBiLSTM are relatively narrow during stable periods and become wider during sudden changes, demonstrating the model’s ability to adjust to varying uncertainty levels. Compared with the other two models, TCN-QRBiLSTM produces shorter intervals overall, indicating higher sharpness and better resolution. Furthermore, it offers improved tracking performance and responds more quickly to changes in radiation. Evaluation metrics at the 90% confidence level are summarized in
Table 3, further confirming the superior accuracy and reliability of TCN-QRBiLSTM.
Compared with TCN-QRBiLSTM, the PICP of TCN-QRLSTM and QRBiLSTM decreases by 2.08% and 3.85%, and the PINAW improves by 2.25% and 7.75%, respectively; and the composite evaluation index of the TCN-QRBiLSTM model is the lowest at 90% confidence level. Thus, the TCN-QRBiLSTM model has the best prediction effect on the cloudy region.
4.3. Rainy Day Forecast Results
On rainy days, the sky can be obscured by clouds, especially cumulonimbus clouds, which can strongly block direct sunlight. As a result the intensity of solar radiation will fluctuate and thus the PV output power will decrease and fluctuate significantly. The predicted values of TCN-QRBiLSTM, TCN-QRLSTM and QRBiLSTM at 90% confidence range under cloudy sky are shown in
Figure 7.
The predictive metrics at 90% confidence are shown in
Table 4. The combined metrics of TCN-QRBiLSTM at 90% confidence are 1.42% and 2.99% lower than TCN-QRLSTM and QRLSTM, respectively. Therefore, the prediction of cloudy weather using TCN-QRBiLSTM can effectively reduce the uncertainty of PV output with strong local learning ability. It is shown that the TCN-QRBiLSTM model has a clear prediction range and high interval coverage under three kinds of weather.
Based on Convolutional Neural Networks (CNN), the Temporal Convolutional Network (TCN) uses causal and dilated convolution layers to replace traditional convolutions. Causal convolutions allow for effective sequence learning, while dilated convolutions provide a sufficient receptive field to capture more information. TCN is more suitable for handling time-series problems compared to fully convolutional networks [
20].
Figure 2 shows the causal dilated convolution structure of the TCN. Here,
are the input sequences at each time step,
is the dilation factor (which increases exponentially with the model depth), and
is the causal dilated convolution feature at time
.
4.4. Probabilistic Prediction Results
In this paper, data from two representative time points in the morning and afternoon were selected to describe the probability distributions under different weather conditions and times. The results obtained using kernel density estimation (KDE) are shown in
Figure 8. Most of the measured data under various meteorological conditions are centrally distributed in the probability density plots, and the distribution of the points is relatively smooth. This indicates that the quantile distributions obtained using the TCN-QRBiLSTM method are reliable. After fitting the KDE for various meteorological data using three different models, the changes in their Bayesian Information Criterion (BIC) coefficients are shown in
Figure 9. The BIC index of the TCN-QRBiLSTM model is the smallest, indicating that the probability density relationship obtained by this model and KDE achieves the best fit, and the model complexity is also the lowest.
The red dashed line represents the mean of this probability density distribution. It is the average value of the random variable characterized by this probability distribution, reflecting the central tendency of the distribution; moreover, in such a symmetric probability distribution, the position of the mean coincides with the peak point of the probability density function.
The TCN-QRBiLSTM model generates confidence intervals by extracting local temporal features and capturing long-term dependencies, combined with differentiable quantile regression—resulting in quantile distributions that are not only smooth but also fully cover actual power values. Smooth probability density curves indicate that the model predictions exhibit good statistical consistency; however, adjustments are still needed by incorporating more data or external information under scenarios of extreme power fluctuations.
5. Conclusions
In order to describe and predict the uncertainties existing in interval prediction, this paper constructs a short-term PV probabilistic prediction model with TCN-QRBiLSTM, and obtains the following research conclusions:
(1) Combining the TCN model with meteorological factors can effectively exploit the multidimensional stochastic characteristics such as short-term power fluctuation and long-term steady state of PV output under meteorological conditions. The reliability requirement of probabilistic prediction is satisfied both in sunny and rainy days.
(2) The combined TCN-QRBiLSTM model adopts Huber’s norm approximation to replace the original loss function, which makes up for the defects of the traditional interval prediction quantile loss function that cannot be minimized.
(3) The TCN-QRBiLSTM combined model has high accuracy and stability. Under the conditions of 90% coverage and different meteorological conditions, the coverage of the prediction intervals of each model is very close to each other, the combined evaluation indexes of the intervals of the three weather types are the best compared with TCN-QRLSTM and QRBiLSTM, and the probability density prediction indexes of BIC are the lowest while ensuring the optimal prediction intervals, so that the quality of short-term probabilistic prediction of the photovoltaic power can be improved, and therefore the proposed model in this paper has a significant superiority. The proposed model has significant superiority.
Overall, the results demonstrate that the model possesses strong adaptability, stability, and prediction accuracy under different weather conditions, reflecting its practical value in the probabilistic prediction of photovoltaic power output in complex power system environments.
6. Outlook and Future Work
Although the TCN-QRBiLSTM model proposed in this paper performs excellently in short-term photovoltaic power probabilistic prediction, there is still room for further improvement. Future work can be expanded in the following aspects:
(1) Improvement of Data Accuracy: This can be achieved by incorporating meteorological data with higher temporal and spatial resolution to enhance the model’s responsiveness to rapid power changes.
(2) Optimization of Model Structure: This can be achieved by integrating attention mechanisms or graph neural networks to strengthen the model’s adaptive weight assignment for features across different time periods and spatial locations, thereby further improving prediction accuracy.
(3) Cross-Regional Application and Transfer Learning: This can be achieved by investigating the model’s applicability in different geographical regions and exploring transfer learning methods to enhance the model’s generality.
(4) Multi-Time-Scale Joint Prediction: This can be achieved by extending the model to day-ahead or multi-day prediction scenarios and realizing the integration of short-term and medium-term probabilistic prediction to increase the reference value for decision-making in photovoltaic power dispatch and system planning.
Author Contributions
Conceptualization, W.Y. and J.D.; methodology, L.Z.; software, J.N.; validation, W.Y., J.D. and L.Z.; formal analysis, W.Y.; investigation, J.D.; resources, J.N.; data curation, W.Y.; writing—review and editing, Q.Z.; visualization, L.Z.; supervision, Q.Z.; funding acquisition, J.D. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported by the State Grid Anhui Electric Power Co., Ltd. Science and Technology Project (B3120524003N) for its support on the research of novel distribution network optimization and scheduling technology based on distributed source-load probabilistic forecasting.
Data Availability Statement
The availability of these data is restricted. The data were obtained from the State Grid Electric Power Research Institute and, with the permission of the State Grid Electric Power Research Institute, can be obtained from the authors.
Conflicts of Interest
Authors Weibo Yuan, Jinjin Ding, Li Zhang and Jingyi Ni were employed by the company Electric Power Research Institute. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Nomenclature
Symbol | Definition | Unit/Description |
PICP | Prediction Interval Coverage Probability | Dimensionless (0–1) |
PINAW | Percentage Interval Normalized Average Width | Dimensionless (0–1) |
I | Comprehensive evaluation index for interval prediction | Dimensionless (0–∞); smaller = better |
BIC | Bayesian Information Criterion | Dimensionless; smaller = better |
| Quantile level in quantile regression | Dimensionless (0–1) |
| Quantile of PV power at time t and quantile level () | kW |
| Threshold of Huber norm in improved quantile regression | Set to 1.0 (small positive value) |
ρα(⋅) | Huber pinball loss function | Dimensionless |
d | Dilation factor of the TCN | Integer (e.g., [1,2,4,8]) |
K | Total number of convolutional filters in the TCN | Integer (set to 64 in this study) |
| Size of the i-th convolutional filter in the TCN | Integer (e.g., 3) |
| Causal dilated convolution feature of the TCN at time t | Dimensionless (feature vector) |
| Abstract feature of the TCN after Batch Normalization (BN) at time t | Dimensionless (feature vector) |
| Output vector of BiLSTM at time t | Dimensionless (feature vector) |
h | Bandwidth in KDE | Adaptive (determined by data distribution) |
K(⋅) | Kernel function in KD | Dimensionless |
N | Number of training samples | Integer |
| Actual PV power at time t | kW |
| Predicted PV power at time t | kW |
| Input vector of the prediction model at time t | Dimensionless (feature vector) |
μ | Mean of causal dilated convolution feature in BN layer | Dimensionless |
| Variance of causal dilated convolution feature in BN layer | Dimensionless |
γ,β | Learnable parameters of the BN layer | Dimensionless |
ϵ | Small positive constant in the BN layer | Set to (dimensionless) |
References
- National Energy Administration. China’s New-Type Energy Storage Development Report (2025) Released by the National Energy Administration. China Nonferrous Metall. 2025, 54, 146. [Google Scholar]
- Sangrody, H.; Zhou, N.; Zhang, Z. Similarity-Based Models for Day-Ahead Solar PV Generation Forecasting. IEEE Access 2020, 8, 104469–104478. [Google Scholar] [CrossRef]
- Mei, F.; Gu, J.; Lu, J.; Lu, J.; Zhang, J.; Jiang, Y.; Shi, T.; Zheng, J. Day-Ahead Nonparametric Probabilistic Forecasting of Photovoltaic Power Generation Based on the LSTMQRA Ensemble Model. IEEE Access 2020, 8, 166138–166149. [Google Scholar] [CrossRef]
- Tang, D.; Li, J.; Zeng, F.; Li, Y.; Yan, C. Bayesian parameter estimation of SST model for shock wave-boundary layer interaction flows with different strengths. Chin. J. Aeronaut. 2023, 36, 217–236. [Google Scholar] [CrossRef]
- Dai, Y.; Lu, Z.; Xiong, W.; Yuan, X.; Xu, Y.; Tan, Z. Power quality disturbance classification method based on CDAE and TCN/BLSTM model. Smart Power 2023, 51, 59–66. [Google Scholar]
- Huang, Q.; Wei, S. Improved quantile convolutional neural network with two-stage training for daily-ahead probabilistic forecasting of photovoltaic power. Energy Convers. Manag. 2020, 220, 113085. [Google Scholar] [CrossRef]
- Xiang, M.J. Probabilistic Prediction of Photovoltaic Power Generation Based on Ground-Based Cloud Mapping. Master’s Thesis, Zhejiang University, Hangzhou, China, 2022. [Google Scholar]
- Li, J.; Liu, Q. Short-term Photovoltaic Power Forecasting Using SOM-based Regional Modelling Methods. Chin. J. Electr. Eng. 2023, 9, 158–176. [Google Scholar] [CrossRef]
- Gao, Y.; Wu, H.; Zhang, J.; Zhang, H.; Zhang, P. A probabilistic prediction model for photovoltaic power day-ahead based on combinatorial deep learning. China Power 2024, 57, 100–110. [Google Scholar]
- Pang, H.; Gao, J.; Du, Y. A short-term load probability density forecasting method based on time-convolution network quantile regression. Grid Technol. 2020, 44, 1343–1350. [Google Scholar]
- Wan, C.; Cui, W.; Song, Y. Probabilistic prediction of new energy power systems: Basic concepts and mathematical principles. Chin. J. Electr. Eng. 2021, 41, 6493–6509. [Google Scholar]
- Gao, Q. Multi-temporal Scale Wind Power Forecasting Based on Lasso-CNN-LSTM-LightGBM. EAI Endorsed Trans. Energy Web 2024, 11, 5792. [Google Scholar] [CrossRef]
- Karthiga, M.; Sountharrajan, S.; Suganya, E.; Sankarananth, S. Sentence semantic similarity model using convolutional neural networks. EAI Endorsed Trans. Energy Web 2021, 8, 35. [Google Scholar]
- Yao, C.W.; Yang, P.; Liu, Z.J. A load forecasting method based on CNN-GRU hybrid neural network. Power Grid Technol. 2020, 44, 3416–3424. [Google Scholar]
- Wang, K.; Du, H.; Jia, R.; Liu, H.; Liang, Y.; Wang, X. Short-term interval probability prediction of photovoltaic power based on similar day clustering and QR-CNN-BiLSTM model. High Volt. Technol. 2022, 48, 4372–4388. [Google Scholar]
- Liu, H.; Ling, N.; Luo, Z.; Sun, Z. Power grid short-term load forecasting method based on TCN- LSTM and meteorological similar day sets. Smart Power 2022, 50, 30–37. [Google Scholar]
- Zhang, Z.; Duan, Z.; Zhang, L. Photovoltaic power generation prediction and optimization configuration model based on GPR and improved PSO algorithm. EAI Endorsed Trans. Energy Web 2024, 11, 3809. [Google Scholar] [CrossRef]
- Usha, S.; Geetha, P.; Geetha, A.; Balamurugan, K.S.; Selciya, S. A novel concept of solar photovoltaic partial shading and thermal hybrid system for performance improvement. EAI Endorsed Trans. Energy Web 2024, 11, 4943. [Google Scholar] [CrossRef]
- Singh, P.; Jha, M.; Sharaf, M.; El-Meligy, M.A.; Gadekallu, T.R. Harnessing a Hybrid CNN-LSTM Model for Portfolio Performance: A Case Study on Stock Selection and Optimization. IEEE Access 2023, 11, 104000–104015. [Google Scholar] [CrossRef]
- Zhang, Y.; Hu, T. Ensemble Interval Prediction for Solar Photovoltaic Power Generation. Energies 2022, 15, 7193. [Google Scholar] [CrossRef]
- Obiora, C.N.; Ali, A. Hourly Photovoltaic Power Forecasting Using CNN-LSTM Hybrid Model. In Proceedings of the 2021 62nd International Scientific Conference on Information Technology and Management Science of Riga Technical University (ITMS), Riga, Latvia, 14–15 October 2021; pp. 1–6. [Google Scholar] [CrossRef]
- Liu, Y.; Zhang, Q.; Li, R.; Hui, H.; Fan, L.; Wu, J. PV power probability density prediction based on the long short-term memory network with quantile regression. In Proceedings of the 2023 IEEE 18th Conference on Industrial Electronics and Applications (ICIEA), Ningbo, China, 18–22 August 2023; pp. 800–804. [Google Scholar] [CrossRef]
- Cao, R.; Wang, K.; Wu, F. Research on the Probability Prediction Method of Electricity Price Response Load based on CNNLSTM and Quantile Regression Fusion. In Proceedings of the 2023 IEEE 7th Conference on Energy Internet and Energy System Integration (EI2), Hangzhou, China, 15–18 December 2023; pp. 1–9. [Google Scholar] [CrossRef]
- Nguyen, C.; Hoang, T.M.; Cheema, A.A. Channel Estimation Using CNN-LSTM in RIS-NOMA Assisted 6G Network. IEEE Trans. Mach. Learn. Commun. Netw. 2023, 1, 43–60. [Google Scholar] [CrossRef]
- Selim, T.; Elkabani, I.; Abdou, M.A. Students Engagement Level Detection in Online e-Learning Using Hybrid EfficientNetB7 Together With TCN, LSTM, and Bi-LSTM. IEEE Access 2022, 10, 99573–99583. [Google Scholar] [CrossRef]
- Yang, Y.; Zhang, H.; Peng, S.; Su, S.; Li, B. Wind Power Probability Density Prediction Based on Quantile Regression Model of Dilated Causal Convolutional Neural Network. Chin. J. Electr. Eng. 2023, 9, 120–128. [Google Scholar] [CrossRef]
- Wei, S.; Liu, X.; Shi, H.; Gan, J. Remaining Useful Life Prediction of High-Dimensional Kernel Density Estimation with Adaptive Relative Density Window Width Considering Multisource Information Fusion. IEEE Sens. J. 2024, 24, 6548–6563. [Google Scholar] [CrossRef]
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).