Statistical Analysis and Development of an Ensemble-Based Machine Learning Model for Photovoltaic Fault Detection
Abstract
:1. Introduction
- Approaches that are not dependent on environmental data such as solar irradiance, temperature, and wind speed. For example, time-domain reflectometry (TDR) is proposed in [3] for the detection of PV string disconnection.
- Methodologies based on the analysis of electrical parameters; primarily, current and voltage characteristics. Silvestre et al. [3] calculated the series resistance (Rs), fill factor (FF), and shunt resistance (Rsh) on the basis of I–V characteristics.
- Approaches based on maximum power point tracking (MPPT). Li et al. [4] presented an automated monitoring and fault detection model utilizing a power loss analysis, leading to the identification of problems such as faulty modules, strings, partial shading, MPPT failure, and aging.
- Artificial intelligence (AI)-based methodologies. The authors in [5] explored the effectiveness of BP neural networks with the aim of diagnosing faults occurring in PV systems, and compared the results with those of fuzzy logic approaches. In their conclusion, the authors presented BP neural networks as a solution to most of the limitations faced whilst implementing fuzzy logic for PV fault detection.
1.1. Literature Review
1.2. Paper Contribution and Organization
2. Methodology
2.1. Dataset
2.2. Statistical Approach
2.3. Average of the Daily Measured Ratios and Hierarchical Clustering
2.4. K-Means on the Average of the Daily Measured Ratios
2.5. Machine Learning-Based Network
3. Results
3.1. ML-Based Detection Accuracy
3.2. ML Model Accuracy Using the Past 15 Years of PV Installation Data: A Case Study for a Noisy Dataset
- (1)
- Days 1 and 6: normal operation, “NO”.
- (2)
- Days 2 and 5: four PV modules were disconnected, which corresponded with an LF faulty condition.
- (3)
- Days 3 and 7: twelve PV modules were disconnected, which corresponded with an HF faulty condition.
- (4)
- Day 4: five PV strings were disconnected, which corresponded with an SF faulty condition.
3.3. ML Model Accuracy Using the Past 15 Years of PV Installation Data: A Case Study for a Missing Dataset
4. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Hu, Y.; Cao, W.; Ma, J.; Finney, S.J.; Li, D. Identifying PV Module Mismatch Faults by a Thermography-Based Temperature Distribution Analysis. IEEE Trans. Device Mater. Reliab. 2014, 14, 951–960. [Google Scholar] [CrossRef] [Green Version]
- Schirone, L.; Califano, F.P.; Pastena, M. Fault Detection in a photovoltaic plant by time domain reflectometry. Prog. Photovolt. Res. Appl. 1994, 2, 35–44. [Google Scholar] [CrossRef]
- Silvestre, S.; da Silva, M.A.; Chouder, A.; Guasch, D.; Karatepe, E. New procedure for fault detection in grid connected PV systems based on the evaluation of current and voltage indicators. Energy Convers. Manag. 2014, 86, 241–249. [Google Scholar] [CrossRef]
- Li, X.; Wen, H.; Hu, Y.; Jiang, L. Drift-free current sensorless MPPT algorithm in photovoltaic systems. Sol. Energy 2019, 177, 118–126. [Google Scholar] [CrossRef]
- Yuchuan, W.; Qinli, L.; Yaqin, S. Application of BP neural network fault diagnosis in solar Photovoltaic System. In Proceedings of the IEEE International Conference on Mechatronics and Automation, Changchun, China, 9–12 August 2009; pp. 9–12. [Google Scholar]
- Mellit, A.; Sağlam, Ş.; Kalogirou, S.A. Artificial neural network-based model for estimating the produced power of a photovoltaic module. Renew. Energy 2013, 60, 71–78. [Google Scholar] [CrossRef]
- Dhimish, M. Defining the best-fit machine learning classifier to early diagnose photovoltaic solar cells hot-spots. Case Stud. Therm. Eng. 2021, 25, 100980. [Google Scholar] [CrossRef]
- Moreno, G.; Martin, P.; Santos, C.; Rodriguez, F.J.; Santiso, E. A Day-Ahead Irradiance Forecasting Strategy for the Integration of Photovoltaic Systems in Virtual Power Plants. IEEE Access 2020, 8, 204226–204240. [Google Scholar] [CrossRef]
- Chine, W.; Mellit, A.; Lughi, V.; Malek, A.; Sulligoi, G.; Pavan, A.M. A novel fault diagnosis technique for photovoltaic systems based on artificial neural networks. Renew. Energy 2016, 90, 501–512. [Google Scholar] [CrossRef]
- Hussain, M.; Dhimish, M.; Holmes, V.; Mather, P. Deployment of AI-based RBF network for photovoltaics fault detection procedure. AIMS Electron. Electr. Eng. 2020, 4, 1–18. [Google Scholar] [CrossRef]
- Hussain, M.; Chen, T.; Titrenko, S.; Su, P.; Mahmud, M. A Gradient Guided Architecture Coupled with Filter Fused Representations for Micro-Crack Detection in Photovoltaic Cell Surfaces. IEEE Access 2022, 10, 58950–58964. [Google Scholar] [CrossRef]
- Garoudja, E.; Harrou, F.; Sun, Y.; Kara, K.; Chouder, A.; Silvestre, S. Statistical fault detection in photovoltaic systems. Sol. Energy 2017, 150, 485–499. [Google Scholar] [CrossRef]
- Mhamdi, S.; Girard, P.; Virazel, A.; Bosio, A.; Faehn, E.; Ladhar, A. Cell-Aware Defect Diagnosis of Customer Returns Based on Supervised Learning. IEEE Trans. Device Mater. Reliab. 2020, 20, 329–340. [Google Scholar] [CrossRef]
- Dhimish, M.; Chen, Z. Novel Open-Circuit Photovoltaic Bypass Diode Fault Detection Algorithm. IEEE J. Photovolt. 2019, 9, 1819–1827. [Google Scholar] [CrossRef] [Green Version]
- Yao, H.; Fu, D.; Zhang, P.; Li, M.; Liu, Y. MSML: A Novel Multilevel Semi-Supervised Machine Learning Framework for Intrusion Detection System. IEEE Internet Things J. 2018, 6, 1949–1959. [Google Scholar] [CrossRef] [Green Version]
- Zhao, Y.; Ball, R.; Mosesian, J.; de Palma, J.-F.; Lehman, B. Graph-Based Semi-supervised Learning for Fault Detection and Classification in Solar Photovoltaic Arrays. IEEE Trans. Power Electron. 2014, 30, 2848–2858. [Google Scholar] [CrossRef]
- Gaggero, G.B.; Rossi, M.; Girdinio, P.; Marchese, M. Detecting System Fault/Cyberattack within a Photovoltaic System Connected to the Grid: A Neural Network-Based Solution. J. Sens. Actuator Netw. 2020, 9, 20. [Google Scholar] [CrossRef] [Green Version]
- Park, S.; Park, S.; Kim, M.; Hwang, E. Clustering-Based Self-Imputation of Unlabeled Fault Data in a Fleet of Photovoltaic Generation Systems. Energies 2020, 13, 737. [Google Scholar] [CrossRef] [Green Version]
- Parvez, I.; Aghili, M.; Sarwat, A.I.; Rahman, S.; ALAM, F. Online power quality disturbance detection by support vector machine in smart meter. J. Mod. Power Syst. Clean Energy 2018, 7, 1328–1339. [Google Scholar] [CrossRef] [Green Version]
- Rezvani, S.; Wang, X.; Pourpanah, F. Intuitionistic Fuzzy Twin Support Vector Machines. IEEE Trans. Fuzzy Syst. 2019, 27, 2140–2151. [Google Scholar] [CrossRef]
- Dhimish, M.; Holmes, V. Fault detection algorithm for grid-connected photovoltaic plants. Sol. Energy 2016, 137, 236–245. [Google Scholar] [CrossRef]
- Li, M.J.; Ng, M.K.; Cheung, Y.-M.; Huang, J.Z. Agglomerative Fuzzy K-Means Clustering Algorithm with Selection of Number of Clusters. IEEE Trans. Knowl. Data Eng. 2008, 20, 1519–1534. [Google Scholar] [CrossRef] [Green Version]
- Jaworski, M.; Duda, P.; Rutkowski, L. New Splitting Criteria for Decision Trees in Stationary Data Streams. IEEE Trans. Neural Netw. Learn. Syst. 2017, 29, 2516–2529. [Google Scholar] [CrossRef]
- Dhimish, M.; Badran, G. Photovoltaic Hot-Spots Fault Detection Algorithm Using Fuzzy Systems. IEEE Trans. Device Mater. Reliab. 2019, 19, 671–679. [Google Scholar] [CrossRef] [Green Version]
- Vieira, R.G.; Dhimish, M.; De Araújo, F.M.U.; Guerra, M.I.S. PV Module Fault Detection Using Combined Artificial Neural Network and Sugeno Fuzzy Logic. Electronics 2020, 9, 2150. [Google Scholar] [CrossRef]
- Hussain, M.; Chen, T.; Hill, R. Moving toward Smart Manufacturing with an Autonomous Pallet Racking Inspection System Based on MobileNetV2. J. Manuf. Mater. Process. 2022, 6, 75. [Google Scholar] [CrossRef]
Day | Type | Missing Period |
---|---|---|
Week 1 | NO | Normal operation: no faulty PV module(s) |
Week 2–4 | LF | Low percentage of PV faults |
Week 5–9 | HF | High percentage of PV faults |
Week 10 | SF | Faulty PV string |
Classification Type | Index | Number of Classes | ||||
---|---|---|---|---|---|---|
8 | 9 | 10 | 11 | 12 | ||
Hierarchical | Dunn | 0.72 | 1.14 | 4.44 | 0.57 | 0.72 |
K-Means | Silhouette | 0.86 | 0.93 | 0.99 | 0.97 | 0.94 |
Model | Accuracy (%) |
---|---|
Linear Discriminant Analysis (LDR) | 97 |
K-Nearest Neighbor (KNN) | 100 |
Decision Tree (CART) | 100 |
Random Forest (RF) | 100 |
Gaussian Naïve Bayes (NB) | 100 |
Support Vector Machine (SVM) | 100 |
Multi-Layer Perceptron (MLP) | 100 |
Model | Accuracy (%) |
---|---|
Linear Discriminant Analysis (LDR) | 86 |
K-Nearest Neighbor (KNN) | 58 |
Decision Tree (CART) | 100 |
Random Forest (RF) | 100 |
Gaussian Naïve Bayes (NB) | 100 |
Support Vector Machine (SVM) | 81 |
Multi-Layer Perceptron (MLP) | 44 |
Day No. | Period of Missing Data Samples | Count of Missing Hours in the Filtered Time Window (from 9:00 a.m.–3:00 p.m.) |
---|---|---|
Day 1 | 11:00–14:00 | 3 |
Day 2 | 7:00–11:00 | 2 |
Day 3 | 16:00–19:00 | 0 |
Day 4 | 12:00–17:00 | 3 |
Day 5 | 10:00–15:00 | 5 |
Day 6 | 12:00–19:00 | 3 |
Day 7 | 6:00–12:00 | 3 |
Model | Accuracy (%) |
---|---|
Linear Discriminant Analysis (LDR) | 77 |
K-Nearest Neighbor (KNN) | 51 |
Decision Tree (CART) | 83 |
Random Forest (RF) | 81 |
Gaussian Naïve Bayes (NB) | 94 |
Support Vector Machine (SVM) | 73 |
Multi-Layer Perceptron (MLP) | 42 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Hussain, M.; Al-Aqrabi, H.; Hill, R. Statistical Analysis and Development of an Ensemble-Based Machine Learning Model for Photovoltaic Fault Detection. Energies 2022, 15, 5492. https://doi.org/10.3390/en15155492
Hussain M, Al-Aqrabi H, Hill R. Statistical Analysis and Development of an Ensemble-Based Machine Learning Model for Photovoltaic Fault Detection. Energies. 2022; 15(15):5492. https://doi.org/10.3390/en15155492
Chicago/Turabian StyleHussain, Muhammad, Hussain Al-Aqrabi, and Richard Hill. 2022. "Statistical Analysis and Development of an Ensemble-Based Machine Learning Model for Photovoltaic Fault Detection" Energies 15, no. 15: 5492. https://doi.org/10.3390/en15155492
APA StyleHussain, M., Al-Aqrabi, H., & Hill, R. (2022). Statistical Analysis and Development of an Ensemble-Based Machine Learning Model for Photovoltaic Fault Detection. Energies, 15(15), 5492. https://doi.org/10.3390/en15155492