Abnormal Data Cleaning Method for Wind Turbines Based on Constrained Curve Fitting
Abstract
:1. Introduction
- (1)
- Image method. The basic idea of the image method is to convert scattered data into digital images and transform the data cleaning problem into an image segmentation problem. Huan Long et al. [11] proposed an abnormal data cleaning algorithm based on 3D images. Su Y et al. [12] and Liang G et al. [13] used an image thresholding algorithm to identify anomalies. Wang Z et al. [14] propose an efficient acceleration algorithm that can convert data into images for cleaning. The disadvantage of the image method is that the required computing resources are too large.
- (2)
- Power curve modeling method. The power curve modeling method is to establish a wind speed-power curve model through a series of methods, compare the real data with the power curve model, and then clean out abnormal data. The methods include quantile power curve [15], interval extreme probability density [16], maximum likelihood estimation [17], Artificial Neural Network (ANN) algorithm [18], etc. Based on the ideal curve, Joon-Young Park et al. [19] used a monitoring power curve to automatically calculate the limit of the power curve value method. Yongning Zhao et al. [20] proposed an algorithm combining the quartile algorithm and the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) to optimize the power curve, in which the quartile algorithm was used to eliminate sparse abnormal data, and DBSCAN was used to eliminate accumulated abnormal data. According to different wind characteristics, Yang Mao et al. [21] used the Copula function to obtain the probability power curve and combined the time series characteristics of the abnormal data to summarize three types of abnormal data, and established the abnormal data identification model, which improved the modeling accuracy. The disadvantage of the power curve modeling method is that when there are a large number of abnormal data, the abnormal data will greatly affect the accuracy of the established power curve.
- (3)
- Statistical methods. Statistical methods compare the statistical value of data or data in each interval with a preset threshold to achieve the purpose of data cleaning. Statistical methods include sample entropy [22], cloud segmentation optimal entropy [23], bin algorithm [24,25], quartile method [26], etc. Lou Jianlou et al. [27] used the optimal intra-group variance method for data cleaning, which is good at dealing with abnormal data with low power in the wind speed range. Wang S et al. [28] adopted the combination of 3σ-median criterion to effectively identify abnormal data points in the data. Tao L et al. [29] use the gray relational algorithm and the support vector regression algorithm to effectively solve the problem of dimensional explosion. There are some algorithms that divide abnormal data into multiple types and use different algorithms according to the characteristics of the types [30,31]. The statistical data generated by statistical methods will be affected by abnormal data, and there is a problem of difficulty in threshold selection.
2. Wind Speed-Power Characteristics
3. Abnormal Data Cleaning Method for Wind Turbines Based on Constrained Curve Fitting
3.1. Quartile Outlier Data Detection Algorithm
3.2. Constrained Least Square Curve Fitting Algorithm
3.3. Exterior Penalty Function Method for Solving the Constrained Optimization Problem
Algorithm 1 Exterior Penalty Function Method | |
Input | : initial parameters; : initial exterior penalty factor; c: amplification factor; , : precision; R: penalty factor control factor |
Output | : optimal parameters |
1: | . |
2: | Solve the unconstrained optimization problem , and get . |
3: | If , go to step 7, otherwise go to step 4. |
4: | If , go to step 7, otherwise go to step 5. |
5: | , . |
6: | , go to step 2. |
7: | output . |
3.4. Improved 3-σ Data Cleaning Method
Algorithm 2 Abnormal Data Cleaning Method for Wind Turbines Based on Constrained Curve Fitting | ||
Input: | original dataset: | |
Output: | normal dataset: | |
1: | ||
2: | Get the cut-in wind speed , cut-out wind speed , and rated power from dataset | |
3: | ||
//Select runtime data | ||
4: | ||
5: | //Quartile method to remove abnormal data | |
6: | //Exterior penalty function method solves constrained fitting | |
7: | , calculate its power upper limit and power lower limit . | |
8: | //Improved 3-σ method data cleaning | |
9: | ||
10: | Bring the maximum wind speed of the dataset D3 into the fitting function to get the actual power maximum value . | |
11: | ||
//Handling the wind speed cut-off part | ||
12: | ||
13: | output |
4. Experimental Validation and Analysis
4.1. Dataset Description
4.2. Algorithm Experiment Process
4.2.1. Quartile Preprocessing
4.2.2. Constrained Wind Speed-Power Curve Fitting
4.2.3. Improved 3-σ Division of Abnormal Data
4.3. Algorithm Comparison
5. Conclusions
- (1)
- Compared with the traditional data cleaning method, the wind turbine abnormal data cleaning method based on constrained curve fitting has the advantages of insensitivity to parameters and less computation. Experiments show that the method can still perform data cleaning well in the presence of a large number of abnormal data. Compared with the traditional data cleaning method, the cleaned data are more in line with the wind speed-power characteristics and the ideal wind speed-power curve.
- (2)
- We use entropy and hyper entropy to evaluate the stability of the cleaned data. The rationality of the index is verified by comparing the index with the data scatterplots of three wind turbines in a wind farm in eastern China after cleaning with different algorithms.
- (3)
- Experiments show that the abnormal data cleaning method for wind turbines based on constrained curve fitting can effectively clean the data and improve the data quality, which is of great significance to the follow-up research. The next step should focus on further improving the running speed of the algorithm, improving the fitting degree of the cleaned data and the ideal wind speed-power curve, and further improving the operating efficiency.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Kumar, J.; Agarwal, A.; Singh, N. Design, operation and control of a vast DC microgrid for integration of renewable energy sources. Renew. Energy Focus 2020, 34, 17–36. [Google Scholar] [CrossRef]
- Neshat, M.; Nezhad, M.M.; Abbasnejad, E.; Mirjalili, S.; Groppi, D.; Heydari, A.; Tjernberg, L.B.; Garcia, D.A.; Alexander, B.; Shi, Q.; et al. Wind turbine power output prediction using a new hybrid neuro-evolutionary method. Energy 2021, 229, 120617. [Google Scholar] [CrossRef]
- Lin, Z.; Liu, X. Wind power forecasting of an offshore wind turbine based on high-frequency SCADA data and deep learning neural network. Energy 2020, 201, 117693. [Google Scholar] [CrossRef]
- Tautz-Weinert, J.; Watson, S.J. Using SCADA data for wind turbine condition monitoring–a review. IET Renew. Power Gener. 2017, 11, 382–394. [Google Scholar] [CrossRef]
- Black, I.M.; Richmond, M.; Kolios, A. Condition monitoring systems: A systematic literature review on machine-learning methods improving offshore-wind turbine operational management. Int. J. Sustain. Energy 2021, 40, 923–946. [Google Scholar] [CrossRef]
- Zhang, J.; Jiang, N.; Li, H.; Li, N. Online health assessment of wind turbine based on operational condition recognition. Trans. Inst. Meas. Control 2019, 41, 2970–2981. [Google Scholar] [CrossRef]
- McKinnon, C.; Turnbull, A.; Koukoura, S.; Carroll, J.; McDonald, A. Effect of time history on normal behaviour modelling using SCADA data to predict wind turbine failures. Energies 2020, 13, 4745. [Google Scholar] [CrossRef]
- Udo, W.; Muhammad, Y. Data-driven predictive maintenance of wind turbine based on SCADA data. IEEE Access 2021, 9, 162370–162388. [Google Scholar] [CrossRef]
- Leahy, K.; Gallagher, C.; O’Donovan, P.; O’Sullivan, D.T. Issues with data quality for wind turbine condition monitoring and reliability analyses. Energies 2019, 12, 201. [Google Scholar] [CrossRef]
- Astolfi, D.; Castellani, F.; Natili, F. Wind turbine multivariate power modeling techniques for control and monitoring purposes. J. Dyn. Syst. Meas. Control 2021, 143, 034501. [Google Scholar] [CrossRef]
- Long, H.; Xu, S.; Gu, W. An abnormal wind turbine data cleaning algorithm based on color space conversion and image feature detection. Appl. Energy 2022, 311, 118594. [Google Scholar] [CrossRef]
- Su, Y.; Chen, F.; Liang, G.; Wu, X.; Gan, Y. Wind Power Curve Data Cleaning Algorithm via Image Thresholding. In Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics (ROBIO), Dali, China, 6–8 December 2019; pp. 1198–1203. [Google Scholar]
- Liang, G.; Su, Y.; Chen, F.; Long, H.; Song, Z.; Gan, Y. Wind power curve data cleaning by image thresholding based on class uncertainty and shape dissimilarity. IEEE Trans. Sustain. Energy 2020, 12, 1383–1393. [Google Scholar] [CrossRef]
- Wang, Z.; Wang, L.; Huang, C. A fast abnormal data cleaning algorithm for performance evaluation of wind turbine. IEEE Trans. Instrum. Meas. 2020, 70, 1–12. [Google Scholar] [CrossRef]
- Xu, K.; Yan, J.; Zhang, H.; Zhang, H.; Han, S.; Liu, Y. Quantile based probabilistic wind turbine power curve model. Appl. Energy 2021, 296, 116913. [Google Scholar] [CrossRef]
- Han, S.; Qiao, Y.; Yan, P.; Yan, J.; Liu, Y.; Li, L. Wind turbine power curve modeling based on interval extreme probability density for the integration of renewable energies and electric vehicles. Renew. Energy 2020, 157, 190–203. [Google Scholar] [CrossRef]
- Seo, S.; Oh, S.I.; Kwak, H.-Y. Wind turbine power curve modeling using maximum likelihood estimation method. Renew. Energy 2019, 136, 1164–1169. [Google Scholar] [CrossRef]
- Li, T.; Liu, X.; Lin, Z.; Morrison, R. Ensemble offshore Wind Turbine Power Curve modelling—An integration of Isolation Forest, fast Radial Basis Function Neural Network, and metaheuristic algorithm. Energy 2022, 239, 122340. [Google Scholar] [CrossRef]
- Park, J.; Lee, J.; Oh, K.; Lee, J. Development of a Novel Power Curve Monitoring Method for Wind Turbines and Its Field Tests. IEEE Trans. Energy Convers. 2014, 29, 119–128. [Google Scholar] [CrossRef]
- Zhao, Y.; Ye, L.; Wang, W.; Sun, H.; Ju, Y.; Tang, Y. Data-Driven Correction Approach to Refine Power Curve of Wind Farm Under Wind Curtailment. IEEE Trans. Sustain. Energy 2018, 9, 95–105. [Google Scholar] [CrossRef]
- Yang, M.; Zhai, G.; Su, X. An Algorithm for Abnormal Data Identification of Wind Turbine Based on Wind Characteristic Analysis. In Proceedings of the 2nd World Congress on Civil, Structural, and Environmental Engineering, Barcelona, Spain, 2–4 April 2017; Volume 37, pp. 144–151. [Google Scholar] [CrossRef]
- Xiang, L.; Deng, Z.; Zhao, Y. Anomaly Recognition Method for Wind Turbines Based on SCADA Data. Acta Energy Sol. Sin. 2020, 41, 278–284. [Google Scholar]
- Yang, M.; Yang, Q. The Identification Research of the Wind Turbine Abnormal Data Based on the Cloud Segment Optimal Entropy Algorithm. In Proceedings of the 3rd World Congress on Civil, Structural, and Environmental Engineering, Budapest, Hungary, 8–10 April 2018; Volume 38, pp. 2294–2301+2539. [Google Scholar] [CrossRef]
- Wang, X.; Wang, Z. Wind speed-power data cleaning of wind turbine based on improved bin algorithm. Chin. J. Intell. Sci. Technol. 2020, 2, 62–71. [Google Scholar]
- Han, B.; Xie, H.; Shan, Y.; Liu, R.; Cao, S. Characteristic Curve Fitting Method of Wind Speed and Wind Turbine Output Based on Abnormal Data Cleaning. In Proceedings of the 2021 International Conference on Advanced Technologies and Applications of Modern Industry (ATAMI 2021), Wuhan, China, 19–21 November 2021; Volume 2185, p. 012085. [Google Scholar]
- Zou, T.; Gao, Y.; Yi, H.; Xu, C.; Xia, R.; Wu, C. Processing of Wind Power Abnormal Data Based on Thompson tau-quartile and Multi-point Interpolation. Autom. Electr. Power Syst. 2020, 44, 156–162. [Google Scholar]
- Lou, J.; Xu, J.; Lu, H.; Qu, Z.; Li, S.; Liu, R. Wind Turbine Data-cleaning Algorithm Based on Power Curve. Autom. Electr. Power Syst. 2016, 40, 116–121. [Google Scholar]
- Wang, S.; Zhang, Z.; Wang, P.; Tian, Y. Failure warning of gearbox for wind turbine based on 3σ-median criterion and NSET. Energy Rep. 2021, 7, 1182–1197. [Google Scholar] [CrossRef]
- Tao, L.; Siqi, Q.; Zhang, Y.; Shi, H. Abnormal detection of wind turbine based on SCADA data mining. Math. Probl. Eng. 2019, 2019, 5976843. [Google Scholar] [CrossRef]
- Luo, Z.; Fang, C.; Liu, C.; Liu, S. Method for Cleaning Abnormal Data of Wind Turbine Power Curve Based on Density Clustering and Boundary Extraction. IEEE Trans. Sustain. Energy 2021, 13, 1147–1159. [Google Scholar] [CrossRef]
- Wang, Y.; Liu, H.; Song, P.; Hu, Z.; Deng, X.; Wu, L. An approach for the cleaning of abnormal wind turbine operation data based on multi-phase progressive recognition. Renew. Energy Resour. 2020, 38, 1470–1476. [Google Scholar] [CrossRef]
- Trivellato, F.; Battisti, L.; Miori, G. The ideal power curve of small wind turbines from field data. J. Wind. Eng. Ind. Aerodyn. 2012, 107–108, 263–273. [Google Scholar] [CrossRef]
- Si, C.; Lan, T.; Hu, J.; Wang, L.; Wu, Q. Penalty parameter of the penalty function method. Control Decis. 2014, 29, 1707–1710. [Google Scholar] [CrossRef]
Name | Unit |
---|---|
Wind Number | |
Time Stump | |
Wind Speed | |
Power | |
Rotor Speed | |
Wheel Diameter | |
Wind Cut-in | |
Wind Cut-out | |
Rated Power | |
Rotor Speed Range |
Wind Number | Optimal Parameters | Constraint Function | Constraint Function |
---|---|---|---|
1 | |||
2 | |||
3 |
Algorithm Name | Wind Number | Time (s) | The Average Entropy | The Average Hyper Entropy |
---|---|---|---|---|
CCF | #1 | 1.29 | 46.73 | 6.27 |
#2 | 1.73 | 50.99 | 12.24 | |
#3 | 1.53 | 63.71 | 20.18 | |
OIV | #1 | 3.15 | 111.93 | 38.03 |
#2 | 3.14 | 375.16 | 45.21 | |
#3 | 3.35 | 171.34 | 130.37 | |
CSOE | #1 | 35.52 | 322.64 | 92.32 |
#2 | 34.62 | 210.87 | 50.16 | |
#3 | 40.21 | 259.60 | 125.48 | |
DBSCAN | #1 | 0.27 | 99.34 | 31.49 |
#2 | 0.25 | 381.05 | 23.92 | |
#3 | 0.27 | 171.20 | 124.40 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yin, X.; Liu, Y.; Yang, L.; Gao, W. Abnormal Data Cleaning Method for Wind Turbines Based on Constrained Curve Fitting. Energies 2022, 15, 6373. https://doi.org/10.3390/en15176373
Yin X, Liu Y, Yang L, Gao W. Abnormal Data Cleaning Method for Wind Turbines Based on Constrained Curve Fitting. Energies. 2022; 15(17):6373. https://doi.org/10.3390/en15176373
Chicago/Turabian StyleYin, Xiangqing, Yi Liu, Li Yang, and Wenchao Gao. 2022. "Abnormal Data Cleaning Method for Wind Turbines Based on Constrained Curve Fitting" Energies 15, no. 17: 6373. https://doi.org/10.3390/en15176373
APA StyleYin, X., Liu, Y., Yang, L., & Gao, W. (2022). Abnormal Data Cleaning Method for Wind Turbines Based on Constrained Curve Fitting. Energies, 15(17), 6373. https://doi.org/10.3390/en15176373