A Multi-Scale CNN-Transformer Network with Residual Correction for Ultra-Short-Term Photovoltaic Power Forecasting
Abstract
1. Introduction
- Multi-scale feature extraction and global dependency modeling: A hybrid architecture integrating multi-scale CNN and Transformer is proposed to jointly capture local temporal dynamics and global dependencies, enabling high-precision forecasting across multiple time scales.
- Residual Correction Module: The RCM is designed to mitigate error accumulation in multi-step forecasting. By incorporating historical error data to refine baseline predictions, it enhances forecasting accuracy, stability, and robustness under complex meteorological conditions.
- Two-Stage Training Strategy: A two-stage training strategy is introduced, where the CNN-Transformer baseline model is first pre-trained independently for robust feature extraction, then jointly optimized with the RCM. This strategy stabilizes convergence and improves multi-step forecasting performance.
2. Proposed Scheme
- Data Preparation and Preprocessing: Raw PV power and meteorological data undergo cleaning, outlier handling, and normalization to form a continuous, complete, and high-quality input feature matrix, laying a reliable data foundation for subsequent modeling.
- Feature Engineering: This step includes feature enhancement using EMD to decompose numerical variables into multi-scale components, and feature selection based on Gradient Boosting Regression (GBR) to rank and select the top 10 features. This process improves the multi-scale representation of inputs and reduces redundancy, as detailed in Section 2.1.
- Multi-Scale CNN-Transformer Model Construction: The model architecture integrates multi-scale CNN branches and a Transformer encoder. The CNN branches extract local temporal features at different resolutions using parallel convolutional kernels, capturing short-term fluctuations, medium-term trends, and long-term patterns. The Transformer backbone then models global dependencies to generate baseline predictions, as described in Section 2.2.
- Residual Correction Module (RCM): Dynamic correction is performed using historical residuals. The interaction between baseline predictions and historical residuals is captured by a lightweight Transformer encoder, and the prediction results are iteratively optimized
- Final Prediction Output: After fusing the baseline prediction with the residual correction output, high-precision ultra-short-term photovoltaic power prediction results are generated.
2.1. Data Preprocessing and Feature Enhancement
- (1)
- Data Cleaning and Outlier Handling
- (2)
- Feature Enhancement and Selection
- (3)
- Normalization and Dataset Partitioning
2.2. Proposed MSCT-RCM Model Architecture
- (1)
- Multi-scale convolution branch
- (2)
- Transformer Backbone Network
- (3)
- Residual Correction Module (RCM)
- Using the pre-trained CNN-Transformer baseline model from Phase I, multi-step predictions are performed on all training samples. The prediction residual for each sample is calculated as (where is the true value and is the baseline prediction).
- For each training sample, extract its input feature vector and corresponding residual sequence (where H is the prediction step: H = 1 for 15 min prediction, H = 2 for 30 min prediction, H = 4 for 1 h prediction) to form a “feature vector–residual sequence” pair.
- Min–Max normalization is applied to the feature vectors of all training samples (reusing extremum parameters from the training set preprocessing stage) to build a standardized feature library; residual sequences are directly stored in the residual library, whose scale matches the number of training samples.
- Residual encoding
- 2.
- Sequence concatenation
- 3.
- Lightweight residual Transformer encoder
- 4.
- Residual generation
- 5.
- Final predication output
3. Model Architecture and Its Two-Stage Training Strategy
3.1. Phase I: Baseline Pre-Training
3.2. Phase II: Joint Training
| Algorithm 1: MSCT-RCM Two-Phase Training |
| Input: historical sequence X, ground truth y Output: trained full model parameters θfull 1: Initialize CNN-Transformer backbone θbase 2: # Phase I: Baseline Pre-training 3: for epoch in range(E1): 4: for (xbatch, ybatch) in training_data: 5: ypred = BaselineModel(xbatch; θbase) 6: loss = MSE (ypred, ybatch) 7: Update θbase via AdamW optimizer 8: Save best θbase 9: Initialize full model θfull with θbase 10: # Phase II: Joint Training 11: for epoch in range(E2): 12: for (xbatch, Ehist,batch, ybatch) in training_data: 13: ypred,batch = FullModel(xbatch, Ehist,batch; θfull) 14: Δybatch = ypred,batch − BaselineModel(xbatch;θbase) 15: loss = MSE (ypred,batch, ybatch) +λ⋅MSE(Δybatch,0) 16: Update θfull via AdamW optimizer 17: Save best θfull 18: return θfull |
4. Case Study and Comparative Results
4.1. Dataset and Sample Construction
4.2. Experimental Environment
4.3. Evaluation Metrics
- Mean Absolute Error (MAE)
- 2.
- Root Mean Square Error (RMSE)
- 3.
- Coefficient of Determination (R2)
4.4. Baseline Models and Hyperparameters
5. Experimental Results and Analysis
- (1)
- Impact of Feature Quantity
- (2)
- Sensitivity Analysis of Different Input Sequence Lengths
- (3)
- Convergence Stability Analysis
- (4)
- Comparative Analysis with Baseline Models
- (5)
- Cross-Dataset Generalization Analysis
- (6)
- Prediction Results at Different Time Periods
- (7)
- Ablation Study
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| PV | Photovoltaic |
| CNN | Convolutional Neural Network |
| RCM | Residual Correction Module |
| EMD | Empirical Mode Decomposition |
| GBR | Gradient Boosting Regression |
| LSTM | Long Short-Term Memory |
| Transformer | Transformer Architecture |
| KNN | K-Nearest Neighbors |
References
- Tian, J.; Ooka, R.; Lee, D. Multi-scale solar radiation and photovoltaic power forecasting with machine learning algorithms in urban environment: A state-of-the-art review. J. Clean. Prod. 2023, 426, 139040. [Google Scholar] [CrossRef]
- Santana, E.J.; Silva, R.P.; Zarpelao, B.B.; Barbon Junior, S. Detecting and Mitigating Adversarial Examples in Regression Tasks: A Photovoltaic Power Generation Forecasting Case Study. Information 2021, 12, 394. [Google Scholar] [CrossRef]
- Lara-Benítez, P.; Carranza-Garcia, M.; Luna-Romera, J.M.; Riquelme, J.C. Temporal Convolutional Networks Applied to Energy-Related Time Series Forecasting. Appl. Sci. 2020, 10, 2322. [Google Scholar] [CrossRef]
- Salinas, D.; Flunkert, V.; Gasthaus, J.; Januschowski, T. DeepAR: Probabilistic forecasting with autoregressive recurrent networks. Int. J. Forecast. 2020, 36, 1181–1191. [Google Scholar] [CrossRef]
- Sarmas, E.; Spiliotis, E.; Stamatopoulos, E.; Marinakis, V.; Doukas, H. Short-term photovoltaic power forecasting using meta-learning and numerical weather prediction independent Long Short-Term Memory models. Renew. Energy 2023, 216, 118997. [Google Scholar] [CrossRef]
- Luo, P.; Li, C.; Kang, D.; Zhang, F.; Lv, Q. PMWC: A hybrid framework based causal inference and multi-scale feature fusion for day-ahead PV power forecasting. Renew. Energy 2026, 257, 124753. [Google Scholar] [CrossRef]
- Gupta, M.; Arya, A.; Varshney, U.; Mittal, J.; Tomar, A. A review of PV power forecasting using machine learning techniques. Prog. Eng. Sci. 2025, 2, 100058. [Google Scholar] [CrossRef]
- Dong, Z.; Tian, Z.; Lv, S. A short-term power load forecasting system based on data decomposition, deep learning and weighted linear error correction with feedback mechanism. Appl. Soft Comput. 2024, 162, 111863. [Google Scholar] [CrossRef]
- Libra, M.; Kozelka, M.; Šafránková, J.; Belza, R.; Poulek, V.; Beránek, V.; Sedlacek, J.; Zholobov, M.; Subrt, T.; Severová, L. Agrivoltaics: Dual usage of agricultural land for sustainable development. Int. Agrophys. 2024, 38, 121–126. [Google Scholar] [CrossRef]
- Luan, J.; Li, Q.; Qiu, Y.; Liu, W. Ensemble learning unlocking point load forecasting accuracy: A novel framework based on two-stage data preprocessing and improved multi-objective optimisation strategy. Comput. Electr. Eng. 2025, 124, 110282. [Google Scholar] [CrossRef]
- Akhter, M.N.; Mekhilef, S.; Mokhlis, H.; Almohaimeed, Z.M.; Muhammad, M.A.; Khairuddin, A.S.M. An Hour-Ahead PV Power Forecasting Method Based on an RNN-LSTM Model for Three Different PV Plants. Energies 2022, 15, 2243. [Google Scholar] [CrossRef]
- Bae, D.-J.; Kwon, B.-S.; Song, K.-B. XGBoost-Based Day-Ahead Load Forecasting Algorithm Considering Behind-the-Meter Solar PV Generation. Energies 2021, 15, 128. [Google Scholar] [CrossRef]
- Scott, C.; Ahsan, M.; Albarbar, A. Machine learning for forecasting a photovoltaic (PV) generation system. Energy 2023, 278, 127807. [Google Scholar] [CrossRef]
- Wang, B.; Wang, J. Energy futures and spots prices forecasting by hybrid SW-GRU with EMD and error evaluation. Energy Econ. 2020, 90, 104827. [Google Scholar] [CrossRef]
- Li, G.; Wei, X.; Yang, H. Decomposition integration and error correction method for photovoltaic power forecasting. Measurement 2023, 208, 112462. [Google Scholar] [CrossRef]
- Yu, M.; Niu, D.; Wang, K.; Du, R.; Yu, X.; Sun, L.; Wang, F. Short-term photovoltaic power point-interval forecasting based on double-layer decomposition and WOA-BiLSTM-Attention and considering weather classification. Energy 2023, 275, 127348. [Google Scholar] [CrossRef]
- Wang, L.; Mao, M.; Xie, J.; Liao, Z.; Zhang, H.; Li, H. Accurate solar PV power prediction interval method based on frequency-domain decomposition and LSTM model. Energy 2023, 262, 125592. [Google Scholar] [CrossRef]
- Wang, H.; Lei, Z.; Zhang, X.; Zhou, B.; Peng, J. A review of deep learning for renewable energy forecasting. Energy Convers. Manag. 2019, 198, 111799. [Google Scholar] [CrossRef]
- Sun, Y.; Venugopal, V.; Brandt, A.R. Short-term solar power forecast with deep learning: Exploring optimal input and output configuration. Sol. Energy 2019, 188, 730–741. [Google Scholar] [CrossRef]
- Nguyen-Duc, T.; Do-Dinh, H.; Fujita, G.; Tran-Thanh, S. Multi 2D-CNN-based model for short-term PV power forecast embedded with Laplacian Attention. Energy Rep. 2024, 12, 2086–2096. [Google Scholar] [CrossRef]
- Abou Houran, M.; Bukhari, S.M.S.; Zafar, M.H.; Mansoor, M.; Chen, W. COA-CNN-LSTM: Coati optimization algorithm-based hybrid deep learning model for PV/wind power forecasting in smart grid applications. Appl. Energy 2023, 349, 121638. [Google Scholar] [CrossRef]
- Agga, A.; Abbou, A.; Labbadi, M.; El Houm, Y. Short-term self consumption PV plant power production forecasts based on hybrid CNN-LSTM, ConvLSTM models. Renew. Energy 2021, 177, 101–112. [Google Scholar] [CrossRef]
- Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning representations by back-propagating errors. In Cognitive Modeling; MIT Press: Cambridge, MA, USA, 1986. [Google Scholar]
- Mellit, A.; Pavan, A.M.; Lughi, V. Deep learning neural networks for short-term photovoltaic power forecasting. Renew. Energy 2021, 172, 276–288. [Google Scholar] [CrossRef]
- Dai, Y.; Wang, Y.; Leng, M.; Yang, X.; Zhou, Q. LOWESS smoothing and Random Forest based GRU model: A short-term photovoltaic power generation forecasting method. Energy 2022, 256, 124661. [Google Scholar] [CrossRef]
- Negash, T.; Weldemikael, N.; Ghebregziabiher, M.; Tedla, Y.; István, S.; István, F. Addressing photovoltaic (PV) forecasting challenges: Satellite-driven data models for predicting actual PV generation using hybrid (LSTM-GRU) model. Energy Rep. 2025, 14, 2141–2156. [Google Scholar] [CrossRef]
- Ait Mansour, A.; Tilioua, A.; Touzani, M. Bi-LSTM, GRU and 1D-CNN models for short-term photovoltaic panel efficiency forecasting case amorphous silicon grid-connected PV system. Results Eng. 2024, 21, 101886. [Google Scholar] [CrossRef]
- Wu, Q.; Han, C. A novel framework for ultra-short-term photovoltaic power forecasting based on improved transformer and weather pattern recognition. Sol. Energy 2026, 304, 114183. [Google Scholar] [CrossRef]
- Ma, Y.; Li, F.; Zhang, H.; Fu, G.; Yi, M. Two-stage photovoltaic power forecasting method with an optimized transformer. Glob. Energy Interconnect. 2024, 7, 812–824. [Google Scholar] [CrossRef]
- Zhang, Z.; Huang, X.; Li, C.; Cheng, F.; Tai, Y. CRAformer: A cross-residual attention transformer for solar irradiation multistep forecasting. Energy 2025, 320, 135214. [Google Scholar] [CrossRef]
- Cheikh, G.; Ammar, B.; Benhadj, N.; Bentounes, K.A.; Bentounes, H.A.; Ksibi, A. Transformer-based deep neural networks for short-term solar power prediction in the Middle East and North Africa regions. Eng. Appl. Artif. Intell. 2025, 160, 111848. [Google Scholar] [CrossRef]
- Liu, M.; Rao, S.; Huang, M.; Deng, S. Short-term photovoltaic power forecasting based on improved transformer with feature enhancement. Sustain. Energy Grids Netw. 2025, 43, 101759. [Google Scholar] [CrossRef]
- Yuan, L.; Wang, X.; Sun, Y.; Liu, X.; Dong, Z.Y. Multistep photovoltaic power forecasting based on multi-timescale fluctuation aggregation attention mechanism and contrastive learning. Int. J. Electr. Power Energy Syst. 2025, 164, 110389. [Google Scholar] [CrossRef]
- Cui, S.; Lyu, S.; Ma, Y.; Wang, K. Improved informer PV power short-term prediction model based on weather typing and AHA-VMD-MPE. Energy 2024, 307, 132766. [Google Scholar] [CrossRef]
- Bai, M.; Zhou, G.; Yao, P.; Dong, F.; Chen, Y.; Zhou, Z.; Yang, X.; Liu, J.; Yu, D. Deep multi-attribute spatial–temporal graph convolutional recurrent neural network-based multivariable spatial–temporal information fusion for short-term probabilistic forecast of multi-site photovoltaic power. Expert Syst. Appl. 2025, 279, 127458. [Google Scholar] [CrossRef]
- Chen, S.; Wan, H.; Peng, B.; Quan, R.; Chang, Y.; Derigent, W. Accurate multi-step wind and solar power forecasting based on multi-scale convolutional Kolmogorov-Arnold network and improved Lemming-optimized attention fusion. Eng. Appl. Artif. Intell. 2026, 163, 112832. [Google Scholar] [CrossRef]
- Wu, X.; Wu, R.; Wu, S.; Li, W.; Chen, H.; Tong, N. Short-term PV prediction using multiperiod similar days and TimeGAN-inception. Int. J. Electr. Power Energy Syst. 2025, 172, 111287. [Google Scholar] [CrossRef]
- Li, Z.; Ye, L.; Song, X.; Luo, Y.; Pei, M.; Wang, K.; Yu, Y.; Tang, Y. Heterogeneous Spatiotemporal Graph Convolution Network for Multi-Modal Wind-PV Power Collaborative Prediction. IEEE Trans. Power Syst. 2024, 39, 5591–5608. [Google Scholar] [CrossRef]
- Zhan, Y.; Wang, X.; Xu, Y.; Li, W. A hybrid TCN-LSTM-attention framework for multi-scenario short-term photovoltaic power forecasting incorporating physics-informed neural network strategy. Energy 2026, 344, 139968. [Google Scholar] [CrossRef]
- Wang, T.; Xu, Y.; Qin, Y.; Wang, X.; Zheng, F.; Li, W. Short-term PV forecasting of multiple scenarios based on multi-dimensional clustering and hybrid transformer-BiLSTM with ECPO. Energy 2025, 334, 137654. [Google Scholar] [CrossRef]
- Ismail, I.; Azeem, A.; Shamim, S.; Shamim, S. comparative analysis of concept drift detection algorithms for electrical streams across different generation modalities. In Sustainable and Eco-Friendly Process Management; Apple Academic Press: Boca Raton, FL, USA, 2025; pp. 307–316. [Google Scholar]






| Module | Configuration |
|---|---|
| Multi-Scale CNN Branch | Kernel sizes: {3, 5, 7}; Output channels per branch: 32; Activation: ReLU |
| Transformer Backbone | Hidden size dmodel: 96; Number of heads: 4; Layers: 3; Dropout: 0.1 |
| Residual Correction Module | Lightweight Transformer with 1 layer; Hidden size: 96; Dropout: 0.1 |
| Optimizer | AdamW; Initial learning rate: 0.001; Weight decay: 1 × 10−5 |
| Learning Rate Schedule | Cosine Annealing; Warm-up: 6 epochs |
| Training Epochs | Phase I: 40 (main pre-training); Phase II: 80 (joint training) |
| Batch Size | 64 |
| Model | Key Architecture | Hyperparameters |
|---|---|---|
| LSTM | Two-layer LSTM; Hidden size: 100; | Learning rate: 0.001; Epochs: 200; Batch: 64 |
| BiLSTM | Three-layer bidirectional LSTM; Hidden: 32; Dropout: 0.1188 | Learning rate: 0.001; Epochs: 200; Batch: 64 |
| GRU | Single GRU layer; Hidden size: 100; Output: Dense with Sigmoid | Learning rate: 0.001; Epochs:200; Batch: 64 |
| Transformer | Four-layer encoder; Hidden size: 256; Heads: 8; Dropout: 0.1 | Learning rate: 0.001; Epochs:200; Batch: 64 |
| TCN-LSTM | 8-layer TCN; 2-layer LSTM; Multi-head self-attention (4 heads); TCN for global/local feature extraction, LSTM for temporal dependency modeling | Learning rate: 0.001; Epochs: 200; Batch: 64; Activation: GELU; LSTM hidden size: 64; Attention head dimension: 32 |
| Transformer-BiLSTM | 4-layer Transformer encoder; 2-layer bidirectional LSTM; Linear transition layer; Dense output layer | Learning rate: 0.001; Epochs: 200; Batch: 64; Transformer hidden size: 128; Attention heads: 4; FFN dimension: 256; BiLSTM hidden size: 64; GELU |
| Feature Name | Abbreviation | GBR Importance Score |
|---|---|---|
| Global Tilted Irradiance | GTI | 0.186 |
| Direct Normal Irradiance | DNI | 0.162 |
| Historical PV Power | — | 0.145 |
| Air Temperature | — | 0.113 |
| Cloud Opacity | — | 0.098 |
| Relative Humidity | — | 0.087 |
| Global Horizontal Irradiance | GHI | 0.074 |
| Dew Point Temperature | — | 0.062 |
| Wind Speed | — | 0.051 |
| seq_len | 15 Min (MAE/RMSE/R2) | 30 Min (MAE/RMSE/R2) | 1 h (MAE/RMSE/R2) |
|---|---|---|---|
| 4 | 2.75/4.46/0.9766 | 3.02/5.23/0.9660 | 4.14/6.93/0.9545 |
| 8 | 2.24/3.21/0.9895 | 2.69/4.30/0.9770 | 3.93/6.14/0.9653 |
| 12 | 1.91/2.73/0.9944 | 2.56/3.94/0.9885 | 3.22/5.20/0.9799 |
| 16 | 1.89/2.73/0.9946 | 2.56/3.91/0.9785 | 3.20/5.16/0.9881 |
| 24 | 1.88/2.70/0.9946 | 2.53/3.91/0.9788 | 3.17/5.16/0.9881 |
| Training Strategy | MAE (Mean ± Std, kW) | RMSE (Mean ± Std, kW) | R2 (Mean ± Std) |
|---|---|---|---|
| Two-stage Training | 1.91 ± 0.032 | 2.73 ± 0.045 | 0.9944 ± 0.0008 |
| Single-stage Training | 2.68 ± 0.087 | 3.85 ± 0.102 | 0.9867 ± 0.0021 |
| Model | 15 Min (MAE/RMSE/R2) | 30 Min (MAE/RMSE/R2) | 1 h (MAE/RMSE/R2) |
|---|---|---|---|
| LSTM | 7.16/9.34/0.9352 | 8.29/10.82/0.9131 | 9.2703/12.07/0.8919 |
| BiLSTM | 5.17/6.85/0.9652 | 6.44/8.54/0.9459 | 7.35/9.80/0.9288 |
| GRU | 6.82/8.57/0.9455 | 7.97/10.02/0.9255 | 8.82/11.22/0.9065 |
| Transformer | 4.71/6.26/0.9709 | 5.60/7.72/0.9557 | 7.66/10.02/0.9255 |
| TCN-LSTM | 3.89/5.37/0.9789 | 4.72/6.68/0.9675 | 6.54/8.83/0.9398 |
| Transformer-BiLSTM | 3.56/4.98/0.9815 | 4.35/6.21/0.9712 | 6.12/8.35/0.9452 |
| MSCT-RCM | 1.91/2.73/0.9944 | 2.56/3.94/0.9885 | 3.22/5.20/0.9799 |
| Model | SSMC (MAE/RMSE/R2) | NASAC (MAE/RMSE/R2) | SPMC (MAE/RMSE/R2) |
|---|---|---|---|
| LSTM | 7.52/9.68/0.9285 | 9.68/12.25/0.8975 | 7.95/10.32/0.9172 |
| BiLSTM | 5.45/6.98/0.9630 | 6.98/9.05/0.9338 | 5.88/7.58/0.9485 |
| GRU | 6.95/8.70/0.9420 | 8.70/10.80/0.9145 | 7.35/9.25/0.9260 |
| Transformer | 4.85/6.40/0.9680 | 6.40/8.35/0.9440 | 5.25/6.95/0.9565 |
| TCN-LSTM | 4.02/5.50/0.9765 | 5.50/7.35/0.9588 | 4.38/6.02/0.9668 |
| Transformer-BiLSTM | 3.68/5.10/0.9790 | 5.10/6.85/0.9650 | 3.98/5.55/0.9718 |
| MSCT-RCM | 1.96/2.81/0.9938 | 2.27/3.31/0.9895 | 2.08/3.02/0.9912 |
| Configuration | MAE | RMSE | R2 |
|---|---|---|---|
| w/o CNN + RCM (pure Transformer) | 4.37 | 5.96 | 0.9708 |
| w/o Transformer + RCM (pure CNN) | 4.31 | 5.88 | 0.9715 |
| w/o CNN (Transformer + RCM) | 3.07 | 4.22 | 0.9853 |
| w/o Transformer (CNN + RCM) | 2.89 | 4.08 | 0.9867 |
| w/o RCM (CNN + Transformer only) | 2.41 | 3.49 | 0.9902 |
| Full MSCT-RCM | 1.91 | 2.73 | 0.9944 |
| Design Scheme | 15 Min (MAE/RMSE/R2) | 30 Min (MAE/RMSE/R2) | 1 h (MAE/RMSE/R2) |
|---|---|---|---|
| Baseline Scheme (Concatenation + Learnable PE + Last Time Step) | 1.91/2.73/0.9944 | 2.56/3.94/0.9885 | 3.22/5.20/0.9799 |
| No Concatenation (CNN Features Only) | 2.00/2.89/0.9931 | 2.68/4.11/0.9867 | 3.36/5.42/0.9772 |
| Sinusoidal PE (Replacing Learnable PE) | 1.98/2.84/0.9935 | 2.66/4.05/0.9876 | 3.33/5.38/0.9778 |
| Average Pooling (Replacing Last Time Step) | 2.02/2.91/0.9929 | 2.71/4.15/0.9862 | 3.39/5.45/0.9769 |
| Retrieval Strategy | MAE | RMSE | R2 |
|---|---|---|---|
| Random Residual Selection (No Retrieval) | 2.43 | 3.57 | 0.9872 |
| K = 3 + Cosine Similarity | 1.98 | 2.89 | 0.9931 |
| K = 5 + Cosine Similarity (Proposed Method) | 1.91 | 2.73 | 0.9944 |
| K = 7 + Cosine Similarity | 1.93 | 2.78 | 0.9940 |
| K = 9 + Cosine Similarity | 3.24 | 4.44 | 0.9935 |
| K = 5 + Euclidean Distance | 2.05 | 2.97 | 0.9925 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Ye, X.; Yin, J.; Zhang, J.; Li, A.; Liu, Z.; Chen, B.; Yang, J.; Li, S.; Li, H. A Multi-Scale CNN-Transformer Network with Residual Correction for Ultra-Short-Term Photovoltaic Power Forecasting. Processes 2026, 14, 759. https://doi.org/10.3390/pr14050759
Ye X, Yin J, Zhang J, Li A, Liu Z, Chen B, Yang J, Li S, Li H. A Multi-Scale CNN-Transformer Network with Residual Correction for Ultra-Short-Term Photovoltaic Power Forecasting. Processes. 2026; 14(5):759. https://doi.org/10.3390/pr14050759
Chicago/Turabian StyleYe, Xiao, Jun Yin, Jiajia Zhang, Anping Li, Zhibo Liu, Bin Chen, Jingyao Yang, Shilei Li, and Hongmei Li. 2026. "A Multi-Scale CNN-Transformer Network with Residual Correction for Ultra-Short-Term Photovoltaic Power Forecasting" Processes 14, no. 5: 759. https://doi.org/10.3390/pr14050759
APA StyleYe, X., Yin, J., Zhang, J., Li, A., Liu, Z., Chen, B., Yang, J., Li, S., & Li, H. (2026). A Multi-Scale CNN-Transformer Network with Residual Correction for Ultra-Short-Term Photovoltaic Power Forecasting. Processes, 14(5), 759. https://doi.org/10.3390/pr14050759

