Rasterized Data Image Processing (RDIP) Techniques for Photovoltaic (PV) Data Cleaning and Application in Power Prediction
Abstract
:1. Introduction
2. Related Work
3. Materials and Methods
3.1. Photovoltaic Anomalous Data Analysis
3.2. Design and Implementation of RDIP Technology
- a.
- Data Rasterization and Visualization
- b.
- Image Morphological Operations and Contour Extraction
- c.
- Maximum Contour Extraction and Resolution Selection
- For each numerical value, extract all its coordinates in the matrix to form a coordinate set.
- Then, for all coordinates of that numerical value, consider them as a sample set. When computing the weighted dispersion, use the number of samples as weights. In other words, the weight of each sample equals the number of times that numerical value appears in the matrix.
- Finally, according to the formula for calculating weighted dispersion, use these weights to compute the dispersion, ensuring that the number of samples is correctly applied as weights.
3.3. Experimental Design and Evaluation Metrics
4. Experimental Results and Analysis
4.1. Data Acquisition and Processing
- The region’s intricate and varied terrain, encompassing mountainous, hilly, and plain areas, which heightens uncertainty in solar radiation owing to terrain-induced effects;
- The humid climate, susceptible to weather elements like clouds and rainfall, necessitating prediction models to incorporate additional meteorological variables such as precipitation.
4.2. Experimental Results Analysis
4.3. Discussion of Results
5. Conclusions and Future Work
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Sun, L.; Zhou, K.; Zhang, X.; Yang, S. Outlier data treatment methods toward smart grid applications. IEEE Access 2018, 6, 39849–39859. [Google Scholar] [CrossRef]
- Liu, H.; Chen, C. Data processing strategies in wind energy forecasting models and applications: A comprehensive review. Appl. Energy 2019, 249, 392–408. [Google Scholar] [CrossRef]
- Eroshenko, S.A.; Khalyasmaa, A.I.; Snegirev, D.A.; Dubailova, V.V.; Romanov, A.M.; Butusov, D.N. The impact of data filtration on the accuracy of multiple time-domain forecasting for photovoltaic power plants generation. Appl. Sci. 2020, 10, 8265. [Google Scholar] [CrossRef]
- Li, B.; Delpha, C.; Migan-Dubois, A.; Diallo, D. Fault diagnosis of photovoltaic panels using full I–V characteristics and machine learning techniques. Energy Convers. Manag. 2021, 248, 114785. [Google Scholar] [CrossRef]
- Xu, P.; Zhang, M.; Chen, Z.; Wang, B.; Cheng, C.; Liu, R. A deep learning framework for day ahead wind power short-term prediction. Appl. Sci. 2023, 13, 4042. [Google Scholar] [CrossRef]
- Celikel, R.; Yilmaz, M.; Gundogdu, A. A voltage scanning-based MPPT method for PV power systems under complex partial shading conditions. Renew. Energy 2022, 184, 361–373. [Google Scholar] [CrossRef]
- Akinci, T.C.; Akgun, O.; Yilmaz, M.; Martinez-Morales, A.A. High order spectral analysis of ferroresonance phenomena in electric power systems. IEEE Access 2023, 11, 61289–61297. [Google Scholar] [CrossRef]
- Wang, B.; Deng, X.; Chen, T.; Li, Y. Photovoltaic data cleaning method based on DBSCAN clustering, quartile algorithm and Pearson correlation coefficient interpolation method. In Proceedings of the 2023 6th International Conference on Energy, Electrical and Power Engineering (CEEPE), Guangzhou, China, 12–14 May 2023; IEEE: Piscataway, NJ, USA; pp. 1539–1544. [Google Scholar]
- Ilyas, I.F.; Rekatsinas, T. Machine learning and data cleaning: Which serves the other? ACM J. Data Inf. Qual. (JDIQ) 2022, 14, 1–11. [Google Scholar] [CrossRef]
- Ray, P.K.; Mohanty, A.; Panigrahi, T. Power quality analysis in solar PV integrated microgrid using independent component analysis and support vector machine. Optik 2019, 180, 691–698. [Google Scholar] [CrossRef]
- Ibrahim, M.; Alsheikh, A.; Awaysheh, F.M.; Alshehri, M.D. Machine learning schemes for anomaly detection in solar power plants. Energies 2022, 15, 1082. [Google Scholar] [CrossRef]
- Zhang, Y.; Chen, B.; Pan, G.; Zhao, Y. A novel hybrid model based on VMD-WT and PCA-BP-RBF neural network for short-term wind speed forecasting. Energy Convers. Manag. 2019, 195, 180–197. [Google Scholar] [CrossRef]
- de Oliveira AK, V.; Aghaei, M.; Rüther, R. Automatic inspection of photovoltaic power plants using aerial infrared thermography: A review. Energies 2022, 15, 2055. [Google Scholar] [CrossRef]
- Bassous, G.F.; Calili, R.F.; Barbosa, C.H. Development of a low-cost data acquisition system for very short-term photovoltaic power forecasting. Energies 2021, 14, 6075. [Google Scholar] [CrossRef]
- Micheli, L.; Fernández, E.F.; Almonacid, F. Photovoltaic cleaning optimization through the analysis of historical time series of environmental parameters. Sol. Energy 2021, 227, 645–654. [Google Scholar] [CrossRef]
- Ranjan, K.G.; Prusty, B.R.; Jena, D. Comparison of two data cleaning methods as applied to volatile time-series. In Proceedings of the 2019 International Conference on Power Electronics Applications and Technology in Present Energy Scenario (PETPES), Mangalore, India, 29–31 August 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–6. [Google Scholar]
- Hopwood, M.; Gunda, T.; Seigneur, H.; Walters, J. An assessment of the value of principal component analysis for photovoltaic IV trace classification of physically-induced failures. In Proceedings of the 2020 47th IEEE Photovoltaic Specialists Conference (PVSC), Calgary, AB, Canada, 15 June–21 August 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 798–802. [Google Scholar]
- Balachandran, G.B.; Devisridhivyadharshini, M.; Ramachandran, M.E.; Santhiya, R. Comparative investigation of imaging techniques, pre-processing and visual fault diagnosis using artificial intelligence models for solar photovoltaic system–A comprehensive review. Measurement 2024, 232, 114683. [Google Scholar] [CrossRef]
- Saquib, D.; Nasser, M.N.; Ramaswamy, S. Image Processing Based Dust Detection and prediction of Power using ANN in PV systems. In Proceedings of the 2020 Third International Conference on Smart Systems and Inventive Technology (ICSSIT), Tirunelveli, India, 20–22 August 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 1286–1292. [Google Scholar]
- Yap, W.K.; Galet, R.; Yeo, K.C. Quantitative analysis of dust and soiling on solar pv panels in the tropics utilizing image-processing methods. In Proceedings of the Asia-Pacific Solar Research Conference 2015, Canberra, Australia, 29 November–1 December 2016. [Google Scholar]
- Natarajan, K.; Bala, P.K.; Sampath, V. Fault detection of solar PV system using SVM and thermal image processing. Int. J. Renew. Energy Res. 2020, 10, 967–977. [Google Scholar]
- Dantas, G.M.; Mendes, O.L.; Maia, S.M.; de Alexandria, A.R. Dust detection in solar panel using image processing techniques: A review. Res. Soc. Dev. 2020, 9, e321985107. [Google Scholar] [CrossRef]
- Gönenç, A.; Acar, E.; Demir, İ.; Yılmaz, M. Artificial Intelligence Based Regression Models for Prediction of Smart Grid Stability. In Proceedings of the 2022 Global Energy Conference (GEC), Batman, Turkey, 26–29 October 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 374–378. [Google Scholar]
- Tang, L.; Deng, Y.; Ma, Y.; Huang, J.; Ma, J. SuperFusion: A versatile image registration and fusion network with semantic awareness. IEEE/CAA J. Autom. Sin. 2022, 9, 2121–2137. [Google Scholar] [CrossRef]
- Rezvani, A.; Bigverdi, M.; Rohban, M.H. Image-based cell profiling enhancement via data cleaning methods. PLoS ONE 2022, 17, e0267280. [Google Scholar] [CrossRef]
- Lanini, F. Division of Global Radiation into Direct Radiation and Diffuse Radiation. Master’s Thesis, University of Bern, Bern, Switzerland, 2010. [Google Scholar]
- Despotovic, M.; Nedic, V.; Despotovic, D.; Cvetanovic, S. Review and statistical analysis of different global solar radiation sunshine models. Renew. Sustain. Energy Rev. 2015, 52, 1869–1880. [Google Scholar] [CrossRef]
- Zhao, Y.; Liu, Q.; Li, D.; Kang, D.; Lv, Q.; Shang, L. Hierarchical anomaly detection and multimodal classification in large-scale photovoltaic systems. IEEE Trans. Sustain. Energy 2018, 10, 1351–1361. [Google Scholar] [CrossRef]
- Yao, S.; Kang, Q.; Zhou, M.; Abusorrah, A.; Al-Turki, Y. Intelligent and data-driven fault detection of photovoltaic plants. Processes 2021, 9, 1711. [Google Scholar] [CrossRef]
- Hussin, M.Z.; Sin, N.D.; Zainuddin, H.; Omar, A.M.; Shaari, S. Anomaly Detection of Grid Connected Photovoltaic System Based on Degradation Rate: A Case Study in Malaysia. Pertanika J. Trop. Agric. Sci. 2021, 29, 3143–3159. [Google Scholar] [CrossRef]
- Ibrahim, I.A.; Hossain, M.J.; Duck, B.C. An optimized offline random forests-based model for ultra-short-term prediction of PV characteristics. IEEE Trans. Ind. Inform. 2019, 16, 202–214. [Google Scholar] [CrossRef]
- Cruz, J.; Mamani, W.; Romero, C.; Pineda, F. Selection of Characteristics by Hybrid Method: RFE, Ridge, Lasso, and Bayesian for the Power Forecast for a Photovoltaic System. SN Comput. Sci. 2021, 2, 202. [Google Scholar] [CrossRef]
- Agga, A.; Abbou, A.; Labbadi, M.; El Houm, Y.; Ali, I.H. CNN-LSTM: An efficient hybrid deep learning architecture for predicting short-term photovoltaic power production. Electr. Power Syst. Res. 2022, 208, 107908. [Google Scholar] [CrossRef]
- Li, Q.; Zhang, X.; Ma, T.; Liu, D.; Wang, H.; Hu, W. A Multi-step ahead photovoltaic power forecasting model based on TimeGAN, Soft DTW-based K-medoids clustering, and a CNN-GRU hybrid neural network. Energy Rep. 2022, 8, 10346–10362. [Google Scholar] [CrossRef]
- Lindig, S.; Louwen, A.; Moser, D.; Topic, M. Outdoor PV system monitoring—Input data quality, data imputation and filtering approaches. Energies 2020, 13, 5099. [Google Scholar] [CrossRef]
Data Cleaning Method | Forecast Model | MSE | Accuracy |
---|---|---|---|
RDIP(Ours) | Random Forest | 14.24 | 90.59% |
RDIP(Ours) | Lasso | 14.13 | 90.58% |
RDIP(Ours) | CNN-LSTM | 13.77 | 90.82% |
RDIP(Ours) | DWT-EN-GRU | 13.71 | 90.86% |
Fusion of Pearson Correlation Interpolation | Random Forest | 15.30 | 89.85% |
Fusion of Pearson Correlation Interpolation | Lasso | 15.57 | 89.62% |
Fusion of Pearson Correlation Interpolation | CNN-LSTM | 14.99 | 90.01% |
Fusion of Pearson Correlation Interpolation | DWT-EN-GRU | 15.29 | 89.80% |
Cubic Spline Interpolation | Random Forest | 18.66 | 87.56% |
Cubic Spline Interpolation | Lasso | 21.19 | 85.87% |
Cubic Spline Interpolation | CNN-LSTM | 17.83 | 88.11% |
Cubic Spline Interpolation | DWT-EN-GRU | 17.55 | 88.30% |
Month | Forecast Model | Accuracy (Uncleaned) | Accuracy (Cleaned) | Efficiency |
---|---|---|---|---|
April | Random Forest | 87.11% | 89.13% | 2.02% |
April | Lasso | 88.35% | 90.34% | 1.99% |
April | CNN-LSTM | 86.58% | 88.68% | 2.10% |
April | DWT-EN-GRU | 87.19% | 89.47% | 2.28% |
June | Random Forest | 81.53% | 84.33% | 2.80% |
June | Lasso | 83.09% | 85.95% | 2.86% |
June | CNN-LSTM | 82.21% | 85.11% | 2.90% |
June | DWT-EN-GRU | 82.99% | 84.89% | 1.90% |
September | Random Forest | 89.94% | 91.13% | 1.19% |
September | Lasso | 88.22% | 90.03% | 1.81% |
September | CNN-LSTM | 87.52% | 89.42% | 1.90% |
September | DWT-EN-GRU | 89.53% | 91.05% | 1.52% |
December | Random Forest | 82.86% | 85.63% | 2.77% |
December | Lasso | 84.76% | 87.17% | 2.41% |
December | CNN-LSTM | 83.87% | 86.86% | 2.99% |
December | DWT-EN-GRU | 82.88% | 85.95% | 3.07% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zang, N.; Tao, Y.; Yuan, Z.; Yuan, C.; Jing, B.; Liu, R. Rasterized Data Image Processing (RDIP) Techniques for Photovoltaic (PV) Data Cleaning and Application in Power Prediction. Energies 2024, 17, 3000. https://doi.org/10.3390/en17123000
Zang N, Tao Y, Yuan Z, Yuan C, Jing B, Liu R. Rasterized Data Image Processing (RDIP) Techniques for Photovoltaic (PV) Data Cleaning and Application in Power Prediction. Energies. 2024; 17(12):3000. https://doi.org/10.3390/en17123000
Chicago/Turabian StyleZang, Ning, Yong Tao, Zuoteng Yuan, Chen Yuan, Bailin Jing, and Renfeng Liu. 2024. "Rasterized Data Image Processing (RDIP) Techniques for Photovoltaic (PV) Data Cleaning and Application in Power Prediction" Energies 17, no. 12: 3000. https://doi.org/10.3390/en17123000
APA StyleZang, N., Tao, Y., Yuan, Z., Yuan, C., Jing, B., & Liu, R. (2024). Rasterized Data Image Processing (RDIP) Techniques for Photovoltaic (PV) Data Cleaning and Application in Power Prediction. Energies, 17(12), 3000. https://doi.org/10.3390/en17123000