This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
A Hybrid Ensemble Learning Framework for Accurate Photovoltaic Power Prediction
by
Wajid Ali
Wajid Ali
My Name is Wajid Ali, I am Ph.D student in Electronic Engineering Department at Jeju National we are [...]
My Name is Wajid Ali, I am Ph.D student in Electronic Engineering Department at Jeju National University. Currently we are working on energy harvestign devices such as triboelectric nanogenerators and moisture-driven electricity generation. we are also implementign artificial intelligence to predict the pattern to acknowldge the results generated from TENG devices.
1,†,
Farhan Akhtar
Farhan Akhtar 1,†,
Asad Ullah
Asad Ullah 2 and
Woo Young Kim
Woo Young Kim 1,*
1
Department of Electronic Engineering, Jeju National University, Jeju 63243, Republic of Korea
2
Department of Mechanical Engineering, University of Engineering and Technology Mardan, Mardan 23200, Pakistan
*
Author to whom correspondence should be addressed.
†
These authors contributed equally to this work.
Energies 2026, 19(2), 453; https://doi.org/10.3390/en19020453 (registering DOI)
Submission received: 12 November 2025
/
Revised: 24 December 2025
/
Accepted: 13 January 2026
/
Published: 16 January 2026
Abstract
Accurate short-term forecasting of solar photovoltaic (PV) power output is essential for efficient grid integration and energy management, especially given the widespread global adoption of PV systems. To address this research gap, the present study introduces a scalable, interpretable ensemble learning model of PV power prediction with respect to a large PVOD v1.0 dataset, which encompasses more than 270,000 points representing ten PV stations. The proposed methodology involves data preprocessing, feature engineering, and a hybrid ensemble model consisting of Random Forest, XGBoost, and CatBoost. Temporal features, which included hour, day, and month, were created to reflect the diurnal and seasonal characteristics, whereas feature importance analysis identified global irradiance, temperature, and temporal indices as key indicators. The hybrid ensemble model presented has a high predictive power, with an R2 = 0.993, a Mean Absolute Error (MAE) = 0.227 kW, and a Root Mean Squared Error (RMSE) = 0.628 kW when applied to the PVOD v1.0 dataset to predict short-term PV power. These findings were achieved on standardized, multi-station, open access data and thus are not in an entirely rigorous sense comparable to previous studies that may have used other datasets, forecasting horizons, or feature sets. Rather than asserting numerical dominance over other approaches, this paper focuses on the real utility of integrating well-known tree-based ensemble techniques with time-related feature engineering to derive real, interpretable, and computationally efficient PV power prediction models that can be used in smart grid applications. This paper shows that a mixture of conventional ensemble methods and extensive temporal feature engineering is effective in producing consistent accuracy in PV forecasting. The framework can be reproduced and run efficiently, which makes it applicable in the integration of smart grid applications.
Share and Cite
MDPI and ACS Style
Ali, W.; Akhtar, F.; Ullah, A.; Kim, W.Y.
A Hybrid Ensemble Learning Framework for Accurate Photovoltaic Power Prediction. Energies 2026, 19, 453.
https://doi.org/10.3390/en19020453
AMA Style
Ali W, Akhtar F, Ullah A, Kim WY.
A Hybrid Ensemble Learning Framework for Accurate Photovoltaic Power Prediction. Energies. 2026; 19(2):453.
https://doi.org/10.3390/en19020453
Chicago/Turabian Style
Ali, Wajid, Farhan Akhtar, Asad Ullah, and Woo Young Kim.
2026. "A Hybrid Ensemble Learning Framework for Accurate Photovoltaic Power Prediction" Energies 19, no. 2: 453.
https://doi.org/10.3390/en19020453
APA Style
Ali, W., Akhtar, F., Ullah, A., & Kim, W. Y.
(2026). A Hybrid Ensemble Learning Framework for Accurate Photovoltaic Power Prediction. Energies, 19(2), 453.
https://doi.org/10.3390/en19020453
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article metric data becomes available approximately 24 hours after publication online.