Defect Prediction for Capacitive Equipment in Power System
Abstract
:1. Introduction
- Unlike traditional methods relying on online monitoring and diagnostic techniques, we introduce a proactive approach. By utilizing machine learning algorithms, we predict whether defects will occur in capacitive equipment and their severity level before they manifest. This proactive prediction enables preemptive maintenance and intervention, ultimately enhancing the reliability and safety of the equipment.
- Successful application of the weight of evidence (WOE) feature encoding, based on the scorecard model, for preprocessing capacitive equipment data. This approach enhances the data preparation stage and improves the effectiveness of subsequent analysis.
2. Related Work
3. Method
3.1. WOE
3.2. Random Forest
- RF employs classification and regression tree (CART) algorithms as its constituent weak learners;
- RF randomly selects features every time;
- The number of samples selected by RF is identical to that of the training set. Due to its randomness, it can reduce the variance of the model. Therefore, RF exhibits superior generalization and antioverfitting capabilities compared to Bagging [31].
3.3. Comparison Models
- Liner Classifier: Linear classifiers categorize targets by linearly combining features. The model facilitates decision making by summing the product of each feature and its corresponding weight [32].
- MLP: Multilayer perceptron (MLP) is a forward-structured artificial neural network characterized by its layered structure. It can be conceptualized as a composition of multiple single-layer perceptron. The output layer of one perceptron serves as the input layer for the subsequent perceptron, with the final output layer representing the overall output of the MLP [33].
- SVM: Support vector machines (SVMs) are algorithms rooted in statistical theory, proficient in solving classification and regression problems with small-scale data. SVM addresses inner product operations in high-dimensional spaces by employing a kernel function, facilitating the effective implementation of nonlinear classification [34].
- XGBoost: Extreme gradient boosting (XGBoost) algorithm employs ensemble thinking to combine multiple weak learners into a strong learner through specific methodologies. XGBoost comprises multiple classification and regression trees (CARTs) and can handle diverse problems, including classification and regression [35].
4. Experiments
4.1. Data Collection and Preprocessing
- Equipment Name: Isolating switch, C phase current transformer, circuit breaker, etc.;
- Power Supply Bureau: The power supply bureau to which the equipment belongs, such as Kunming Power Supply Bureau (501) and Qujing Power Supply Bureau (502);
- Equipment Type: Optical current transformer, oil-filled transformer, DC current transformer, etc.;
- Full Name: The comprehensive name of the equipment along with its corresponding category, for instance, ‘substation equipment/primary equipment/combined electrical appliance/COMPASS/current transformer’;
- Equipment Type Remarks: The designation of equipment types, such as main transformer bushing (B A GT10 GT11 KH00) and current transformer (B A GG00 GG20 GT70);
- Equipment Model: The specific model information of the equipment, such as LZZBJ-35W, SZ11-4000/35, etc.;
- Manufacturer: The name of the equipment manufacturer;
- Topography: The geographic environment of the equipment’s location, classified into six types: high mountain, hill, plain, river network, paddy field, and mountain, represented by numbers 1–6, respectively;
- Equipment longitude, latitude, and altitude;
- Pollution Level: The pollution level in the area where the equipment is situated, categorized into five levels;
- Substation: The substation to which the equipment belongs, e.g., 110 kV Lunan substation;
- Running State: Refers to the operational status of the substation, represented by the numbers 1–9, indicating operation, under construction, standby, etc.;
- Voltage Level: Indicates the rated voltage of the equipment, represented by the numbers 1–18, corresponding to voltage levels of 10,000 V, 110,000 V, 220,000 V, etc.;
- Voltage Type: Specifies whether the voltage is DC or AC, ‘1’ indicates DC, ‘2’ denotes AC, and ‘3’ signifies that the voltage type is not distinguished. For example, 500,000 V voltage encompasses both AC and DC;
- Production Date and Commissioning Date: The date of the equipment leaving the factory and the date of the equipment being put into operation, respectively;
- Defect Occurrence Time: The timestamp when equipment defects occur;
- Years of Operation: For normal equipment, it represents the duration between the commissioning year and the current year. For faulty equipment, it signifies the duration between the commissioning year and the year the fault occurred;
- Defect Occurrence: A binary classification variable serving as the output for the defect occurrence prediction model, with two possible values: ‘defect’ and ‘normal’;
- Defect Level: A four-class variable used as the output for the defect level prediction model, comprising the following categories: ‘urgent’, ‘critical’, ‘general’, and ‘others’.
4.1.1. Data Cleaning
4.1.2. Feature Encoding
4.1.3. Data Balancing
4.2. Defect Prediction Model Based on RF
4.3. Performance Metrics
5. Experiment Results and Discussion
5.1. Results of Defect Occurrence Prediction
5.2. Results of Defect Level Prediction
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Conflicts of Interest
References
- Xie, C.; Peng, Q.; Zheng, Z.; Li, Z.; Wang, Z.; Li, M.; Jiang, L.; Liu, Q.; Li, X. Relationship between defects of capacitive equipment and geomorphology. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, 11–16 July 2021; pp. 4508–4511. [Google Scholar]
- Xu, Y.; Wang, J.; Liu, W.; Jiang, Y.; Xu, T.; Zhang, J. Online monitoring device for partial discharge in high-voltage switchgear based on capacitance coupling method. In Proceedings of the 2019 IEEE 3rd International Conference on Green Energy and Applications (ICGEA), Taiyuan, China, 16–18 March 2019; pp. 56–60. [Google Scholar]
- Yang, Y. Study on measurement error test method of on-line insulation monitoring device for capacitive equipment. Appl. Mech. Mater. 2013, 373–375, 844–847. [Google Scholar] [CrossRef]
- Lachman, M.F.; Walter, W.; von Guggenberg, P.A. Online diagnostics of high-voltage bushings and current transformers using the sum current method. IEEE Trans. Power Deliv. 2000, 15, 155–162. [Google Scholar] [CrossRef]
- Zhang, H.; Tan, K.; Dong, F.; Wang, J. The analysis of on-line monitored results for capacitive type of equipment. In Proceedings of the 2001 International Symposium on Electrical Insulating Materials (ISEIM 2001). 2001 Asian Conference on Electrical Insulating Diagnosis (ACEID 2001). 33rd Symposium on Electrical and Ele, Himeji, Japan, 22–22 November 2001; pp. 805–808. [Google Scholar]
- Chen, T.; Zhang, B.; Wang, J.; Liu, J. New on-line high precision tanδ monitoring system for capacitive equipment. Automat. Electr. Power Syst. 2004, 15, 67–70. [Google Scholar]
- Gao, Q.; Ding, P.; Han, Y.; Geng, B. Development of distributed on-line monitoring system for dielectric loss tangent of high voltage capacitive apparatus. In Proceedings of the 2008 International Conference on Condition Monitoring and Diagnosis, Beijing, China, 21–24 April 2008; pp. 1179–1182. [Google Scholar]
- Damião, L.; Guimarães, J.; Ferraz, G.; Bortoni, E.; Rossi, R.; Capelini, R.; Salustiano, R.; Tavares, E. Online monitoring of partial discharges in power transformers using capacitive coupling in the tap of condenser bushings. Energies 2020, 13, 4351. [Google Scholar] [CrossRef]
- Aljohani, O.; Abu-Siada, A. Application of digital image processing to detect transformer bushing faults and oil degradation using FRA polar plot signature. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 428–436. [Google Scholar] [CrossRef]
- Lee, J.G.; Koo, J.-H.; Han, K.-S.; Choi, W. Development of transformer bushing diagnosis system based on high frequency PD measurement. In Proceedings of the 2022 9th International Conference on Condition Monitoring and Diagnosis (CMD), Kitakyushu, Japan, 13–18 November 2022; pp. 372–375. [Google Scholar]
- Yang, Z.; Zhang, Z.; Xue, W.; Chen, Y.; Mou, X.; Yang, Q. Study on power equipment condition based maintenance (CBM) technology in smart grid. In Proceedings of the 2021 3rd International Conference on Smart Power & Internet Energy Systems (SPIES), Shanghai, China, 25–28 September 2021; pp. 291–295. [Google Scholar]
- Mokhtar, K.Z.; Saleh, J.M.; Talib, H.; Ali, N.O. Flow regime classification using artificial neural network trained on electrical capacitance tomography sensor data. Comput. Inf. Sci. 2008, 1, 25–32. [Google Scholar]
- Hashemi, M.; Ghaisari, J.; Salighehdar, A. Identification and compensation of a capacitive differential pressure sensor based on support vector regression using particle swarm optimization. Intell. Autom. Soft Comput. 2012, 18, 263–277. [Google Scholar] [CrossRef]
- Cheng, J.; Li, A.; Duan, Z. Transformer fault diagnosis based on improved evidence theory and neural network integrated method. Power Syst. Prot. Control 2013, 41, 92–96. [Google Scholar]
- Ghoneim, S.S.M.; Taha, I.B. Artificial neural networks for power transformers fault diagnosis based on IEC code using dissolved gas analysis. Int. J. Control Automat. Syst. 2015, 4, 18–21. [Google Scholar]
- Yang, X.; Chen, W.; Li, A.; Yang, C.; Xie, Z.; Dong, H. BA-PNN-based methods for power transformer fault diagnosis. Adv. Eng. Inform. 2019, 39, 178–185. [Google Scholar] [CrossRef]
- Jin, Y.; Wu, H.; Zheng, J.; Zhang, J.; Liu, Z. Power transformer fault diagnosis based on improved BP neural network. Electronics 2023, 12, 3526. [Google Scholar] [CrossRef]
- Yang, P.; Wang, T.; Yang, H.; Meng, C.; Zhang, H.; Cheng, L. The performance of electronic current transformer fault diagnosis model: Using an improved whale optimization algorithm and RBF neural network. Electronics 2023, 12, 1066. [Google Scholar] [CrossRef]
- Fang, H.; Deng, J.; Chen, D.; Jiang, W.; Shao, S.; Tang, M.; Liu, J. You can get smaller: A lightweight self-activation convolution unit modified by transformer for fault diagnosis. Adv. Eng. Inform. 2023, 55, 101890. [Google Scholar] [CrossRef]
- Shi, H.; Bai, C.; Xie, Y.; Li, W.; Zhang, H.; Liu, Y.; Zheng, Y.; Zhang, S.; Zhang, Y.; Lu, H.; et al. Capacitive–inductive magnetic plug sensor with high adaptability for online debris monitoring. IEEE Trans. Instrum. Meas. 2022, 71, 1–8. [Google Scholar] [CrossRef]
- Hao, X.; Zhang, G.; Zhang, W.; Dong, M.; Liu, G. Online monitoring technology for the insulation condition of capacitive-type substation equipment. In Proceedings of the 2008 International Conference on Condition Monitoring and Diagnosis, Beijing, China, 21–24 April 2008; pp. 1220–1223. [Google Scholar]
- Li, J.; Jiao, S.; Wen, Y.; Wang, H. Online insulation monitoring system of high-voltage capacitive substation equipment based on WSN. In Proceedings of the CICED 2010 Proceedings, Nanjing, China, 13–16 September 2010; pp. 1–6. [Google Scholar]
- Xu, M.; Guo, Y.; Zhang, D.; Cao, R.; Shen, Y.; Han, S. Research on high voltage online monitoring system for dielectric loss of capacitive equipment in substation. IOP Conf. Ser. Earth Environ. Sci. 2021, 769, 042025. [Google Scholar] [CrossRef]
- Sun, C.; Bi, W.; Zhou, Q.; Liao, R.; Chen, W. New gray prediction parameter model and its application in electrical insulation fault prediction. Control Theory Appl. 2003, 20, 798–801. [Google Scholar]
- Marino, P.; Sigiienza, C.; Poza, F.; Vazquez, F.; Machado, F. Supporting information system for power transformer fault forecasting applications. In Proceedings of the IECON’03. 29th Annual Conference of the IEEE Industrial Electronics Society (IEEE Cat. No. 03CH37468), Roanoke, VA, USA, 2–6 November 2003; Volume 2, pp. 1899–1904. [Google Scholar]
- Xu, Z.; Peng, D.; Xu, P. Fault prediction of power electronics module based on RELM-AdaBoost. In Proceedings of the 2022 4th International Conference on Communications, Information System and Computer Engineering (CISCE), Shenzhen, China, 27–29 May 2022; pp. 11–14. [Google Scholar]
- Di, Y.; Jin, C.; Bagheri, B.; Shi, Z.; Ardakani, H.D.; Tang, Z.; Lee, J. Fault prediction of power electronics modules and systems under complex working conditions. Comput. Ind. 2018, 97, 1–9. [Google Scholar] [CrossRef]
- Peng, J.; Zhou, F.; Xiang, H.; Ma, Y.; Zheng, Z.; Jiang, S.; Li, J. Prediction of defects occurrence time for capacitive device. In Proceedings of the 2019 16th International Computer Conference on Wavelet Active Media Technology and Information Processing, Chengdu, China, 14–15 December 2019; pp. 226–229. [Google Scholar]
- Wu, Y.; Pan, Y. Application analysis of credit scoring of financial institutions based on machine learning model. Complexity 2021, 2021, 1–12. [Google Scholar] [CrossRef]
- Surhone, L.; Tennoe, M.; Henssonow, S. Random forest. Mach. Learn. 2010, 45, 5–32. [Google Scholar]
- Yu, X.; Hyyppa, J.; Vastaranta, M.; Holopainen, M.; Viitala, R. Predicting individual tree attributes from airborne laser point clouds based on the random forests technique. ISPRS J. Photogramm. Remote Sens. 2011, 66, 28–37. [Google Scholar] [CrossRef]
- Yuan, G.X.; Ho, C.H.; Lin, C.J. Recent advances of large-scale linear classification. Proc. IEEE 2012, 100, 2584–2603. [Google Scholar] [CrossRef]
- Pal, S.; Mitra, S. Multilayer perceptron, fuzzy sets, and classification. IEEE Trans. Neural Netw. 1992, 3, 683–697. [Google Scholar] [CrossRef] [PubMed]
- Zhang, S.; Wang, Y.; Liu, M.; Bao, Z. Data-based line trip fault prediction in power systems using LSTM networks and SVM. IEEE Access 2017, 6, 7675–7686. [Google Scholar] [CrossRef]
- Chen, M.; Liu, Q.; Chen, S.; Liu, Y.; Zhang, C.; Liu, R. XGBoost-based algorithm interpretation and application on post-fault transient stability status prediction of power system. IEEE Access 2019, 7, 13149–13158. [Google Scholar] [CrossRef]
- Bilogur, A. Missingno: A missing data visualization suite. J. Open Source Softw. 2018, 3, 547. [Google Scholar] [CrossRef]
- Rodríguez, P.; Bautista, M.A.; Gonzàlez, J.; Escalera, S. Beyond one-hot encoding: Lower dimensional target embedding. Image Vis. Comput. 2018, 75, 21–31. [Google Scholar] [CrossRef]
- Zhang, Y.; Schneider, J. Multi-Label output codes using canonical correlation analysis. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, Fort Lauderdale, FL, USA, 11–13 April 2011; Volume 15, pp. 873–882. [Google Scholar]
- He, H.; Garcia, E. Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 2009, 21, 1263–1284. [Google Scholar]
- Blagus, R.; Lusa, L. Evaluation of SMOTE for high-dimensional class-imbalanced microarray data. In Proceedings of the 2012 11th International Conference on Machine Learning and Applications, Boca Raton, FL, USA, 12–15 December 2012; Volume 2, pp. 89–94. [Google Scholar]
Level | Urgent | Critical | General | Others |
---|---|---|---|---|
Size | 3275 | 1496 | 5894 | 1050 |
Power Supply Bureau | Voltage Level | Defect Level | Defect Type | Equipment Type | Manufacturer |
---|---|---|---|---|---|
502 | 10,000 | General | Bird nest | Overhead conductor | Jinbei Electric Co., Ltd. |
502 | 400 | Critical | Insufficient safe distance | Low-voltage overhead conductor | Kunming Cable Group Co., Ltd. |
502 | 35,000 | Urgent | Low insulation | Isolating switch | Yunnan Yunkai Electric Co., Ltd. |
502 | 110,000 | General | Visible gas in the Buchholz relay | Oil-filled transformer | Jiangsu Huapeng Transformer Co., Ltd. |
Bureau | One-Hot Encoding | WOE Encoding |
---|---|---|
501 | (0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1) | −0.012523 |
502 | (0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0) | −0.313688 |
503 | (0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0) | 0.571820 |
504 | (0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0) | 0.091393 |
505 | (0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0) | −0.229169 |
506 | (0, 0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0) | −0.112641 |
507 | (0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0) | 0.034807 |
508 | (0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0) | 0.851002 |
509 | (0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0) | 0.136587 |
510 | (0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0) | −0.052263 |
511 | (0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0) | 0.450399 |
512 | (0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0) | −0.483160 |
513 | (0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0) | −0.954793 |
514 | (0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0) | 0.526281 |
515 | (0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0) | 1.029099 |
516 | (0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0) | −0.939009 |
522 | (0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0) | −0.850388 |
581 | (1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0) | −0.533197 |
Bureau | PSB_woe1 | PSB_woe2 | PSB_woe3 | PSB_woe4 |
---|---|---|---|---|
501 | −0.320753 | −0.434713 | 0.276863 | 0.359008 |
502 | −0.320477 | −0.174722 | 0.381339 | −0.212646 |
503 | 0.605673 | −1.446667 | −0.698807 | 1.056935 |
504 | −0.631471 | 0.821530 | −0.337750 | 0.645300 |
505 | 0.204939 | −0.212179 | 0.104558 | −0.808023 |
506 | −0.210769 | −0.422391 | 0.293096 | 0.106945 |
507 | 0.228539 | −0.118984 | 0.070539 | −0.974570 |
508 | 0.620354 | −0.900355 | −0.329595 | 0.182347 |
509 | 0.041635 | 0.283675 | 0.085593 | −1.448681 |
510 | 0.281236 | 0.786925 | −0.430517 | −1.793746 |
511 | −1.677792 | 0.140298 | 0.551238 | 0.59351 |
512 | 0.064487 | −0.088544 | 0.456459 | −0.631664 |
513 | 0.119139 | 0.105754 | 0.193433 | −3.619674 |
514 | 1.516963 | 0.225044 | −1.452342 | −2.123319 |
515 | −2.454321 | 0.236879 | 0.14353 | 0.911634 |
516 | 0.030586 | −0.136746 | 0.166161 | −0.50136 |
522 | −0.107284 | −1.470198 | 0.723976 | −1.016985 |
581 | −0.367959 | 0.903592 | −0.463598 | 0.428509 |
Models | Accuracy | Precision | Recall | F1-Score |
---|---|---|---|---|
Linear | 0.72 | 0.95 | 0.64 | 0.82 |
XGBoost | 0.85 | 0.98 | 0.84 | 0.90 |
SVM | 0.74 | 0.98 | 0.70 | 0.83 |
MLP | 0.86 | 0.98 | 0.83 | 0.91 |
RF | 0.92 | 0.98 | 0.92 | 0.94 |
Models | Accuracy | Precision | Recall | F1-Score |
---|---|---|---|---|
Linear | 0.79 | 0.98 | 0.69 | 0.86 |
XGBoost | 0.93 | 0.98 | 0.89 | 0.93 |
SVM | 0.83 | 0.98 | 0.73 | 0.89 |
MLP | 0.88 | 0.98 | 0.87 | 0.92 |
RF | 0.96 | 0.98 | 0.97 | 0.97 |
Models | Accuracy | Precision | Recall | F1-Score |
---|---|---|---|---|
Linear | 0.44 | 0.43 | 0.44 | 0.42 |
XGBoost | 0.61 | 0.60 | 0.61 | 0.60 |
SVM | 0.55 | 0.54 | 0.55 | 0.54 |
MLP | 0.61 | 0.61 | 0.60 | 0.61 |
RF | 0.71 | 0.71 | 0.70 | 0.71 |
Models | Accuracy | Precision | Recall | F1-Score |
---|---|---|---|---|
Linear | 0.46 | 0.46 | 0.45 | 0.45 |
XGBoost | 0.62 | 0.62 | 0.62 | 0.62 |
SVM | 0.73 | 0.72 | 0.73 | 0.72 |
MLP | 0.66 | 0.66 | 0.66 | 0.66 |
RF | 0.78 | 0.79 | 0.78 | 0.78 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Peng, Q.; Zheng, Z.; Hu, H. Defect Prediction for Capacitive Equipment in Power System. Appl. Sci. 2024, 14, 1968. https://doi.org/10.3390/app14051968
Peng Q, Zheng Z, Hu H. Defect Prediction for Capacitive Equipment in Power System. Applied Sciences. 2024; 14(5):1968. https://doi.org/10.3390/app14051968
Chicago/Turabian StylePeng, Qingjun, Zezhong Zheng, and Hao Hu. 2024. "Defect Prediction for Capacitive Equipment in Power System" Applied Sciences 14, no. 5: 1968. https://doi.org/10.3390/app14051968
APA StylePeng, Q., Zheng, Z., & Hu, H. (2024). Defect Prediction for Capacitive Equipment in Power System. Applied Sciences, 14(5), 1968. https://doi.org/10.3390/app14051968