Gearbox Fault Prediction of Wind Turbines Based on a Stacking Model and Change-Point Detection

Yuan, Tongke; Sun, Zhifeng; Ma, Shihao

doi:10.3390/en12224224

Open AccessArticle

Gearbox Fault Prediction of Wind Turbines Based on a Stacking Model and Change-Point Detection

by

Tongke Yuan

^1,*

,

Zhifeng Sun

¹ and

Shihao Ma

²

¹

College of Electrical Engineering, Zhejiang University, Hangzhou 310027, China

²

Wuzhong Baita Wind Power Corporation Limited, Wuzhong 751100, China

^*

Author to whom correspondence should be addressed.

Energies 2019, 12(22), 4224; https://doi.org/10.3390/en12224224

Submission received: 12 October 2019 / Revised: 3 November 2019 / Accepted: 4 November 2019 / Published: 6 November 2019

(This article belongs to the Special Issue Maintenance Management of Wind Turbines)

Download

Browse Figures

Versions Notes

Abstract

:

The fault diagnosis and prediction technology of wind turbines are of great significance for increasing the power generation and reducing the downtime of wind turbines. However, most of the current fault detection approaches are realized by setting a single alarm threshold. Considering the complicated working conditions of wind farms, such methods are prone to ignore the fault, send out a false alarm, or leave insufficient troubleshooting time. In this work, we propose a gearbox fault prediction approach of wind turbines based on the supervisory control and data acquisition (SCADA) data. A stacking model composed of Random Forest (RF), Gradient Boosting Decision Tree (GBDT), and Extreme Gradient Boosting (XGBOOST) was constructed as the normal behavior model to describe the normal conditions of the wind turbines. We used the Mahalanobis distance (MD) instead of the residual to measure the deviation of the current state from the normal conditions of the turbines. By inputting the MD series into the proposed change-point detection algorithm, we can obtain the change point at which the fault symptom begins to appear, and thus achieving the fault prediction of the gearbox. The proposed approach is validated on the historical data of 5 wind turbines in a wind farm, which proves its effectiveness to detect the fault in advance.

Keywords:

wind turbines; fault prediction; stacking model; normal behavior model; change-point detection; SCADA

Graphical Abstract

1. Introduction

In recent years, due to the increasingly serious energy crisis and environmental pollution, many countries of the world have vigorously developed new energy sources. As green renewable energy, wind energy has received great attention from countries all over the world. According to the 2018 annual report of the Global Wind Energy Council (GWEC) [1], the global offshore and onshore installed capacity has reached 23,140 MW and 568,409 MW, respectively. In 2018, the new installed capacity of onshore wind turbines reached 46,820 MW. GWEC expects the onshore market to install upwards of 50 GW per year until 2023. With the increase of the installed capacity of wind power, more urgent requirements are put forward for the daily fault detection and operation and maintenance (O&M) of wind turbines. Due to the remote location of the wind farm, the WTs are exposed to highly variable and harsh weather conditions [2], which makes the O&M cost of wind turbines higher than other traditional power generation methods. Especially when the wind farm is built at sea, desert, etc., its O&M costs will be much higher. Therefore, it is necessary to vigorously develop wind turbine fault diagnosis technology and to provide early warning of faults and achieve certain accuracy and practicability. Currently, wind turbine manufacturers and research institutes are trying to make breakthroughs in the fault diagnosis and O&M technology of wind turbines.

There are many sensors installed on the turbines to record the working conditions of each component. According to the utility of the data recorded by sensors, the parameter system of the wind turbine can be divided into the supervisory control and data acquisition (SCADA) system, which mainly consists of the performance parameters and the condition-monitoring system (CMS) based on the vibration parameters [3]. For different fault modes, the parameters such as temperature, power, speed, and blade angle of the turbine will not be the same. Therefore, the relationship between these data and the fault can be explored through machine learning algorithms. SCADA and CMS systems form a big data system for wind turbines. The advantage of big data is that as long as the amount of data is large enough, by selecting appropriate data mining methods and optimizing the data mining process, it is possible to predict unknown parameters and its regularity.

For the fault diagnosis of wind turbines, Professor Andrew Kusiak of the Intelligent Systems Laboratory of the University of Iowa has made great achievements. Kusiak et al. [4] developed virtual models of a wind turbine which were extracted by six machine learning algorithms. The power output and rotor speed were selected as performance parameters to test the proposed methodology. In [5], a three-level fault prediction methodology based on the SCADA data and status data of 4 wind turbines was proposed. In order to predict the blade angle asymmetry and implausibility faults, an association rule mining algorithm was applied and five classifiers were used to identify the best model [6]. Also, the bearing faults were analyzed by building a normal behavior model based on the neural network algorithms in [7]. The test results on five over-temperature events validated its effectiveness on bearing faults prediction.

Generally speaking, the approaches for wind turbine fault detection/prediction through data mining algorithms are divided into two major ways. One is through normal behavior modeling (NBM) [8]. By selecting the historical SCADA data under normal conditions as the training set, the normal behavior model of the key components of the turbines can be constructed. Support Vector Machine (SVM) [9,10], neural network [11], and deep belief networks [12] are commonly used algorithms in NBM. Then, according to the statistical characteristics of the normal conditions of the wind turbine, an empirical threshold is set as an alarm line so that any fault deviates from the normal conditions can be detected. Bangalore et al. [13] constructed an artificial neural network (ANN) normal behavior model to predict the gearbox fault. During the anomaly detection stage, a Mahalanobis Distance (MHD) method was presented to describe the deviation between the ANN model and the operating condition. Finally, the case studies of four wind turbines illustrated the effectiveness of the proposed approach. For other subcomponents of the turbines, such as the generator bearings and slip rings, etc., the evolution of their faults develops faster than that of gearbox and thus it is difficult to make timely intervention before the fault occurs. Therefore, it is particularly important to achieve the fault prediction of these subcomponents through advanced analysis techniques. In [14], a nonlinear state estimate technique (NSET) is proposed to construct the normal behavior model of the generator temperature. In [15], the selected model is principal component regression and can identify the incoming generator slip ring fault approximately one day in advance. In [16], the authors proposed a model-based approach to predict generator bearing temperature and thus achieving the early detection of bearing and gearbox faults.

The other way is a classification-based approach. The classification-based approach is to divide the operating state of the wind turbine into different categories (label), such as normal operation, shutdown, or a specific type of fault. Each sample in the training set consisting of SCADA historical data has a label. Then the classifier after the training process can be used to determine the type of fault of the turbines. Association rule is a commonly used algorithm. The authors in [6] used the Apriori algorithm to identify the correlation from the database composed of fault categories, various operating parameters, and history records and then identified the specific fault mode. Similarly, in [17], the Apriori algorithm was applied to find the key event codes of wind turbines.

For the NBM approach, no matter which algorithm is used to establish the prediction model, most of the approaches during the fault prediction stage are achieved by analyzing the residual and setting a single alarm threshold. The authors in [18] used the Mahalanobis distance instead of the residual and applied the Weibull distribution to obtain the threshold of the fault alarm. Wu et al. [19] developed a normal behavior model for wind turbines gearboxes using echo state networks. The potential faults were detected through residual evaluation and dynamic threshold setting. However, using a single alarm threshold or an adaptive threshold method often has a time uncertainty problem, that is, an accurate time point at which the fault symptom begins to appear cannot be given. Moreover, an inappropriate threshold setting may cause a false alarm or miss detection, which makes the fault prediction based on SCADA data has the problems of low prediction accuracy and low computational efficiency. In order to improve the reliability and accuracy of fault prediction, we propose a gearbox fault prediction approach based on a stacking model and CUSUM (Cumulative Sum) control chart. The framework of the proposed method can be divided into four parts, which are the data preprocessing, feature selection, the normal behavior modeling process, and the fault prediction process, where several algorithms or their combinations were applied. The main contributions of our work and the functionality of these algorithms are summarized as follows:

During the data preprocessing process, we select the SCADA historical data under the normal working conditions and apply the Density Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm and the quartile method to recognize and filter out the three types of outliers and abnormal data. The DBSCAN algorithm plays an important part in filtering out most of the outliers and then the quartile method is applied for further preprocessing to improve the data quality.
During the feature selection process, three data mining algorithms are used to select the features most relevant to the gearbox oil temperature as the input variables of the model.
Based on the Stacking idea, a normal behavior model including Random Forest (RF), GBDT, and XGBOOST is established to describe the temperature characteristics of the gearbox under normal operation state. By comparing the performance of seven single models on three metrics, we finally select three models, RF, GBDT, XGBOOST, as the basic models of our proposed stacking model. The proposed stacking model trains three basic models in parallel which can avoid the defect of a single machine learning model like overfitting and low prediction accuracy. The case study illustrates the superiority and effectiveness of the proposed model.
The change-point detection algorithm based on CUSUM control chart is applied to detect the change points in the MD series. Compared with the fault detection algorithm by setting a single threshold, the proposed approach can accurately locate the time at which the fault symptom begins to appear. A case study on five faulty turbines shows the proposed approach can predict the faults 13 to 22 days in advance.

The rest of this paper is organized as follows: Section 2 briefly introduces three types of abnormal data and illustrates the necessity of data preprocessing. Section 3 clarifies the structure of the proposed approach. Section 4 introduces the data set used in this work and the entire process of normal behavior modeling. In Section 5, we use the data of five turbines to test the performance of our proposed approach and list the test results. Section 6 summarizes the full paper and points out the future research direction.

2. Background Review

The complete wind turbine consists of the gearbox system, generator system, pitch system, yaw system and other key components [20]. The gearbox is an important component of the wind turbine transmission system. Its main function is to transmit the power generated by the blade under the action of the wind to the generator and achieve a higher speed. At present, the gearbox manufacturing technology is relatively mature and has high reliability, however, due to its frequent high-speed, heavy-duty, and high-impact working environment, it is prone to suffer various faults within an operational period of 5 years, and has to be replaced [21]. Moreover, the system shutdown caused by gearbox failure takes the longest time and a gearbox replacement can cost up to 14.5 percent of the wind turbine maintenance cost [20]. Therefore, gearbox fault prediction is necessary.

Generally, gearbox faults can be reflected by the change of gearbox oil temperature. In [22], the authors illustrated that the rise of oil temperature could indicate the development of the gearbox faults. And an artificial neural network model was constructed to predict the oil temperature. In this work, the gearbox oil temperature is modeled to describe the working status of the wind turbine.

The normal behavior modeling approach has high requirements for the quality of training data: fault-free SCADA data must be accurately selected [23]. However, due to sensors’ accuracy degradation, communication failure, and curtailment, the turbine can operate in different modes even under normal conditions [24]. Figure 1 shows three common types of anomaly data in the power curve:

The lower stacked abnormal data is represented as a horizontally dense data cluster in the scatter plot, as shown in Type 1 in Figure 1. Type 1 indicates the points where the wind power output from the wind turbine is 0 regardless of the wind speed. Such abnormal data appears due to damage to the power measuring instrument or abnormal communication equipment, fault of the wind turbine, etc. [25].
Wind curtailment data. The “Wind curtailment” data appears as a horizontally dense cluster of data in the scatter plot, typically in the middle of the curve, as shown in Type 2 in Figure 1. Such anomaly data shows that the output power of the wind turbine does not change with the change of wind speed. Wind curtailment is the reduction in electricity generation below what a system of well-functioning wind turbines are capable of producing [26], which is usually enforced by wind farms. The following are the main causes of “Wind curtailment”. First, China’s energy structure is still dominated by thermal power generation, supplemented by new energy, so wind power generation is only used as regulating power. Second, because of the excess capacity of wind power, it is difficult to store large capacity wind power resources, so it is necessary to curtail wind power. Finally, due to the instability of wind speed, the electric energy generated by the wind turbine is also unstable. When the wind speed has a huge fluctuation, the grid-connected cannot be performed forcibly, so the wind curtailment operation has to be carried out at this time.
The dispersive anomaly data is randomly distributed around the curve, as shown in Type 3 in Figure 1. The distribution of such anomaly data is irregular and is usually caused by sensor accuracy degradation, instrument fault, or noise during the signal propagation.

If these raw SCADA data are directly used for analysis and modeling, it will inevitably lead to a decrease in prediction accuracy and difficulty in achieving research goals. Therefore, it is necessary to preprocess the raw data collected by the SCADA system before establishing the normal behavior model.

3. Framework for Proposed Fault Prediction Approach

The main idea of our proposed fault prediction approach is to fit the normal operating behavior of wind turbines by establishing a normal behavior model of the gearbox oil temperature, and then use the Mahalanobis distance, instead of the residual, to describe the deviation between the actual measured value and the model output value. Finally, the Change-point detection based on CUSUM control charts is applied to implement the gearbox fault prediction. The overall flowchart of the fault prediction model proposed in this study can be divided into the training process and application process, as shown in Figure 2. The training process deals with the necessary data preprocessing and feature selection, which are performed based on the historical data collected by the SCADA system to establish the normal behavior model of the gearbox. During the application process, the real-time SCADA data is input into the normal behavior model after the same data preprocessing and feature selection. There is a certain deviation between the predicted values of the model and the actual values recorded by the SCADA system. The larger the deviation is, the higher the possibility of a fault happens. In this work, we apply the Mahalanobis distance to describe this deviation. By inputting the Mahalanobis distance series into the proposed Change-point detection algorithm, the time at which the fault occurs and the time at which the fault symptom begins to appear can be obtained, and thus the fault prediction can be realized.

3.1. Data Preprocessing

From Figure 1, we can see intuitively that the three types of anomaly data points show obvious outlier characteristics in the power curve. In this work, we apply the combination of DBSCAN and the quartile method to identify the anomaly points and outliers. The case study of wind turbines using SCADA data has validated this data cleaning method, which demonstrates its effectiveness and generality.

DBSCAN is a typical density-based clustering algorithm, which is designed to discover the clusters and the noise in a spatial database [27]. According to two input parameters, Eps and MinPts, the algorithm divides the data to be clustered into three categories: core points, border points, and noise points. A core point refers to the point in a cluster that there are at least a minimum number of points in an Eps-neighborhood of that point. The Eps-neighborhood of border points contains fewer points than core points. And the points not belong to core points or border points are defined as outliers (noise), which are the anomaly data to be filtered in this work. Compared with the K-Means clustering algorithm, the DBSCAN does not need to determine the number of cluster centers in advance and can identify clusters of arbitrary shapes and has strong anti-noise ability.

The quartiles are a type of quantile which can be applied to check for outliers [28]. In this work, we used the first quartile (Q₁) and the third quartile (Q₃) to achieve the further preprocessing of the data after the DBSCAN method. The first quartile refers to the middle number between the smallest number and the median of the data set. The third quartile is defined as the middle value between the median and the highest value of the data set. By calculating the Q₁ and Q₃, we can get the interquartile range (

I Q R

) which defined as follows:

I Q R = Q_{3} - Q_{1}

(1)

According to the IQR, we can get the normal data bounds expressed as follows:

[F_{l}, F_{u}] = [Q_{1} - 1.5 I Q R, Q_{3} + 1.5 I Q R]

(2)

where F_l and F_u are the lower limit and upper limit of normal data, respectively. Any data lying outside the defined bound can be considered an outlier. In this work, the quartile method is adopted by dividing the wind speed data into different intervals and then to filter the active power data in each interval correspondingly. The interquartile range is the same as the variance and standard deviation, which can represent the dispersion of statistical data of a variable. But the interquartile range is relatively robust statistics since the value of IQR does not change significantly with individual abnormal data. However, this method has its limitations, that is, the method can correctly and effectively identify these abnormal data only when the proportion of abnormal data to the total amount of data is small.

Hence, in this study, the combination of the DBSCAN and quartile method is applied for data preprocessing. It can be seen from Figure 1 that the proportion of three kinds of abnormal data is relatively large. If we user the quartile method directly, the abnormal data is bound to affect the value of the IQR. Moreover, the density of these abnormal data is significantly less than normal data. Therefore, in the data preprocessing stage, the DBSCAN is firstly used to identify and remove most of the abnormal data, the SCADA data is further processed by the quartile method to improve the quality of data cleaning.

3.2. Feature Selection

Feature selection is a significant part of the data mining process. As the input of the model, features determine the quality of the model. Generally, feature selection is executed by selecting a subset from many features or applying a suitable method to reduce the dimension. The purpose of feature selection is to remove redundancy, reduce the number of features, improve model accuracy and reduce run time. In this work, to establish the normal behavior model of the wind turbine gearbox, the gearbox oil temperature is selected as the output variable. Three feature selection algorithms, namely, filter model [29], wrapper model with recursive feature elimination (RFE) algorithm [30,31], and RF model [32,33], are applied to select the features most relevant to the gearbox oil temperature as the input variables of the model. These three algorithms are typical feature selection methods based on how to generate feature subsets [34]. They will calculate the most relevant features according to different rules, which can avoid the defect of a single method. By inputting the SCADA data with whole 39 parameters as input variables, we obtained the top 10 most relevant parameters ranked by each algorithm and finally choose 15 parameters as the input variables.

3.3. The Stacking Model

The oil temperature of the gearbox is affected by many factors including the working condition and the inherent properties of wind turbines, thus making the status information complicated. To avoid the defects of the single model, we propose a model based on the Stacking strategy by combining three regressors for temperature predictions. At present, the machine learning model based on the stacking strategy has been widely applied in Kaggle competition, sentiment classification [35], speech recognition [36], and many other application scenarios. As far as we know, there are still no researchers who have applied stacking strategy to wind turbine fault prediction.

In this work, a two-layer stacking model is proposed to construct the normal behavior model of gearbox oil temperature as shown in Figure 3.

The first layer consists of three basic models, including RF [32], GBDT [37], and XGBOOST [38]. The idea of stacking is to train these basic models in parallel and combine them by training a meta-model to output predictions based on the multiple predictions returned by these basic models [39]. For each basic model, we use five-fold cross-training, which is similar to k-fold cross-validation, to generate training data and test data for the meta-model in the second layer. Figure 4 describes the details of this process.

The training data is divided into five folds. In each training process, four folds are used for training and the remaining one fold is used for making predictions for observations. In other words, the five-fold cross-training consists in training on four folds in order to make predictions on the remaining fold and that iteratively so that to obtain predictions for observations in any folds. By combining these predictions of three basic models as new features, we can obtain the training data of the meta-model in the second layer. Other than this, each basic model can produce five predictions of the original test data. We take the average of these predictions as the test data for the meta-model. In the second layer, with the training data and the test data produced by the cross-training process, the XGBOOST model is trained as the meta-model and output the final predictions based on the test data.

The proposed stacking model acts as a normal behavior model to generate a prediction of the gearbox oil temperature based on the given input features. In this work, by inputting the 15 features we selected, the stacking model will output the prediction of gearbox oil temperature at this time. Its superiority lies in the part that it can train three single machine learning models in parallel, which greatly accelerates the training process. The performance of our stacking model on the test data has validated its effectiveness.

3.4. Fault Prediction Approach

3.4.1. Mahalanobis Distance Series

The Mahalanobis distance is an effective method for calculating the similarity of two unknown sample sets. Compared to the Euclidean distance, the MD is scale-invariant and takes into account the correlations of the data set. At present, the MD has been introduced into the field of wind turbine fault detection to determine the fault alarm threshold [12,13]. In this work, we use the MD to quantify the deviation between the test data (real-time SCADA data) and the normal behavior model. The MD for the normal behavior model can be expressed as follows:

{MD}_{n o r_{i}} = \sqrt{(X_{n o r_{i}} - μ_{n o r}) C_{n o r}^{- 1} {(X_{n o r_{i}} - μ_{n o r})}^{T}}

(3)

where

X_{n o r_{i}} = [e r r_{i}, M V_{i}]

. MV_i represents the i-th measured value recorded by the SCADA system during the five-fold cross-validation process and err_i is the validation error; C_nor and μ_nor are the covariance matrix and the mean value vector of X_nor;

{MD}_{n o r_{i}}

is the Mahalanobis distance of the vector

X_{n o r_{i}}

.

During the fault prediction stage, we input the real-time SCADA data into the normal behavior model and calculate the corresponding MD series as the input of the subsequent change- point detection algorithm. The MD in this stage is defined as follows:

{MD}_{t e s t_{i}} = \sqrt{(X_{t e s t_{i}} - μ_{n o r}) C_{n o r}^{- 1} {(X_{t e s t_{i}} - μ_{n o r})}^{T}}

(4)

where

X_{t e s t_{i}} = [e r r_{i}, M V_{i}]

is consists of the test error err_i and the measured value MV_i during the fault prediction stage; C_nor and μ_nor are the covariance matrix and the mean value vector obtained from the normal behavior model. l;

{MD}_{t e s t} = [{MD}_{t e s t_{1}}, {MD}_{t e s t_{2}}, \dots, {MD}_{t e s t_{N}}]

is the final MD series as the input of the change-point detection algorithm.

3.4.2. Change-Point Detection

At present, most of the fault detection approaches of wind turbines analyze the residual series by setting the single threshold or adaptive alarm threshold, which has the problem of time uncertainty. In this work, we applied the change-point detection algorithm based on the CUSUM control chart to analyze the MD series.

The CUSUM control chart is drawn by the cumulative sum of the difference between the observed value and the target value. It is widely used in the field of economics because of its simple and intuitive characteristics. If the statistical value is higher than the average value over a period of time, this amount of data will continue to accumulate, and the CUSUM control chart will show a steady growth trend and vice versa. Therefore, the CUSUM control chart can detect whether there is a change in the series, but the detailed information of the change point, such as confidence level and confidence interval, cannot be obtained only through the control chart [40]. Therefore, in this work, we propose a fault prediction algorithm combining the CUSUM control chart and change-point detection, which can accurately capture the small abnormal changes in the MD series. Algorithm 1 below describes the calculation process of the algorithm.

Algorithm 1: Change-Point Detection.
Data: Mahalanobis Distance Series MD and its length M; Confidence Level C; Number of bootstrap samples performed N
Result: index of the change point
1	Calculate the average $\bar{MD} = \frac{{MD}_{1} + {MD}_{2} + \dots + {MD}_{M}}{M}$
2	Calculate the cumulative sums by adding the difference between the current value and the average to the previous sum. $S_{i} = S_{i - 1} + (X_{i} - \bar{X})$ for i = 1, 2, …, M and S₀ = 0
3	Calculate the original S_diff which defined as: S_diff = S_max − S_min—where $S_{\max} = \max_{i = 1, 2, \dots, M} S_{i}$ , $S_{\min} = \min_{i = 1, 2, \dots, M} S_{i}$
4	for j in range (1, N) do
5	Generate a bootstrap sample of the original MD series, denoted ${MD}_{1}^{j}, {MD}_{2}^{j}, \dots, {MD}_{M}^{j}$
6	Based on the bootstrap sample, calculate the bootstrap CUSUM, denoted $S_{0}^{j}, S_{1}^{j}, \dots, S_{M}^{j}$
7	Calculate the difference of the bootstrap CUSUM, denoted $S_{d i f f}^{j}$
8	Compare each $S_{d i f f}^{j}$ with the original S_diff, let X be the number of bootstraps for which $S_{d i f f}^{j} < S_{d i f f}$ for j = 1, 2, …, N
9	Calculate the confidence level P that a change occurred as a percentage: $P = 100 \frac{X}{N} %$ and compare with the input confidence level C
10	If P > C, a change has occurred with a confidence level of C
11	Output the index of the change point m, let m be such that: $\| S_{m} \| = \max_{i - 0, 1, \dots, M} \| S_{i} \|$

The input parameters of the algorithms are the MD series, its length and set a confidence level C and the number of bootstrap samples performed N. The functionality of the proposed approach is to find the possible change point in the MD series obtained from our normal behavior model and then output the index of the change point with the given confidence level. In other words, the change point output by our algorithm is calculated by a large number of repeated sampling and obtained based on the probability. Our fault detection approach will perform bootstrap sampling N times on the original MD series and will generate a new random sorted MD series each time. If the majority of the new series are recognized as having change points, we are confident about the conclusion that there exist change points in the original MD series. For example, if the algorithm finds that there are 900 new MD series containing change points in the total 1000 times of bootstrap sampling, we can draw a conclusion that the original MD series have change points with a confidence level of 90%. The final number of change points depends on the confidence level we set. If we set a relatively low confidence level, there is a high possibility that we will get many change points, which including the points at which the real fault happened as well as the points turn out to be unreliable. To get more reliable results, we can set a higher confidence level so that we will much more confident about the change points obtained from our algorithms and they will be closer to the real conditions.

The principle of using the CUSUM control chart to judge whether a change has taken place in the series is to generate new MD series by multiple sampling without replacement. If the maximum difference of the cumulative sum of the new series is mostly smaller than that of the original series, we have reason to believe that a change must have occurred in the original MD series. The more this happens in N times sampling, the higher the confidence level of a change happens. Once a change has been detected, it is also necessary to determine when the change occurred. In this work, we use the binary search algorithm to specify the index of the change point. The original MD series is divided into two parts based on the change point found by the change-point detection algorithm, similar to the binary search algorithm, we iteratively execute the Algorithm 1 on each subseries until all the change points that satisfy the confidence level are accurately found. Figure 5 is the flowchart of the entire change-point detection process.

4. Modeling Normal Gearbox Behavior

4.1. Data Description

The SCADA data used in this work comes from 173 wind turbines in a wind farm located in Ningxia, China. In this work, the SCADA data contains 37 parameters for each wind turbine with a one-minute interval, which can be divided into four categories, including the condition parameters, the health parameters, the performance parameters, and the controlling parameters [41]. We use two types of wind turbines with rated powers of 1500 kW and 2000 kW as the research object, and correspondingly establish two normal behavior models to fit the gearbox oil temperature under normal working conditions of turbines. Table 1 shows the specifications of two types of wind turbines. The SCADA data are re-sampled using the 5-min interval.

Table 2 lists the detailed descriptions of the modeling datasets. A totally of 16 turbines were used for modeling analysis. To depict the normal behavior of the turbines as completely as possible, we used the SCADA data of five turbines to train the normal behavior model, and there were no gearbox faults during the corresponding time interval. In addition, the historical SCADA data of turbines #36 and #149 in a month period is selected to test the effectiveness of the normal behavior model we established. Finally, we use five turbines (#38, #58 for type A and #159, #173, #162 for type B) to test our proposed fault prediction algorithm. According to the wind farm fault alarm record, these five turbines experienced gearbox oil temperature exceeding limit faults during the selected time interval.

4.2. Data Preprocessing

After removing the duplicate and missing values in the original SCADA data, in order to accurately describe the normal conditions of the wind turbines, a power curve is built from SCADA data where the outliers are filtered out through the method we proposed above. In this section, we use the historical data of turbine #159 during the half-year period to validate the performance of the proposed data cleaning method. The result is provided in Figure 6.

The DBSCAN method is firstly applied to identify and filter out most of the outliers as shown in Figure 6a, and we additionally use the quartile method to further filter the SCADA data. Through the combination of the DBSCAN and quartile method, we obtain the power curve seen in Figure 7. It is not difficult to see from the figure that the data preprocessing method can filter out the outliers in the power curve, that is, corresponding to the abnormal working conditions of the wind turbine.

4.3. Feature Selection

After deletion of the abnormal data of the turbines, three data-mining algorithms (filter model, wrapper model with recursive feature elimination algorithm, and Random Forest) are used to select the most relevant parameters for predicting the gearbox oil temperature. Considering that the rise of oil temperature is a gradual process that cannot change in a sudden, the input parameters measured at past intervals may influence the future state of the turbines [4]. In this work, we introduce the gearbox oil temperature parameter at two past intervals, (t-1) and (t-2), into input parameters to predict the oil temperature at time t. The time interval is 5 min. Table 3 presents the 10 most relevant parameters of all 39 parameters. Based on the result of three feature selection algorithms, we finally choose the following 15 parameters as the input variables for gearbox oil temperature prediction, namely gearbox oil temperature(t-1), gearbox oil temperature(t-2), active power, gearbox bearing A temperature, gearbox bearing B temperature, wind speed, generator speed, rotor speed, generator stator L3 temperature, voltage phase C, generator torque, active consumption, generator bearing A temperature, pitch angle, and rotor temperature.

4.4. Model Construction

In this section, the normal behavior model of gearbox oil temperature is constructed based on the stacking strategy. We use the SCADA data of ten turbines to establish the stacking models of two types of wind turbines, five for type A and the remaining five for type B, as shown in Table 2. To evaluate the performance of the proposed model, three metrics, mean absolute error (MAE), root mean square error (RMSE), and coefficient of determination (R²), are applied, which are defined as follows:

MAE = \frac{1}{N} \sum_{i = 1}^{N} | {\hat{y}}_{i} - y_{i} |

(5)

RMSE = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {({\hat{y}}_{i} - y_{i})}^{2}}

(6)

R^{2} = 1 - \frac{\sum_{i = 1}^{N} ({\hat{y}}_{i} - y_{i})^{2}}{\sum_{t = 1}^{N} (y_{i} - \bar{y})^{2}}

(7)

where y_i is the i-th measured value of oil temperature,

{\hat{y}}_{i}

represents the i-th prediction of the model and

\bar{y}

refers to the mean value of the measured values.

Furthermore, seven single machine learning models, including k-Nearest Neighbor (KNN), Support Vector Machine (SVM), Decision Tree (DT), Adaboost, RF, GBDT, and XGBOOST, are applied for comparison with the proposed stacking model. In particular, the latter three models are the basic models of our proposed stacking model and the XGBOOST model is also selected as the meta-model in the second layer. We set aside 15% of the training data as the test set. Table 4 lists the performance of the models. We can observe that the three models, RF, GBDT, and XGBOOST, achieve better performance than other single models, and the stacking model based on these three models achieves the best performance with the highest R² scores and the lowest MAE and RMSE.

4.5. Testing Normal Behavior Model

In this section, in order to evaluate whether the proposed model can accurately describe the normal behavior of wind turbines, we select the historical SCADA data of turbine #36 (type A) and #149 (type B) for a period of a month to test the two models constructed above. Table 5 shows the metrics of the two normal behavior models on the test set. Figure 8 and Figure 9 show the performance of the two models on type A and B respectively, including the comparison of the measured value and the model predictions, the Mahalanobis distance and the CUSUM control chart.

As can be seen from Figure 8 and Figure 9, the predicted value of the model for the gearbox oil temperature can accurately fit the measured value, and the gearbox oil temperature values fall within the normal working range of (20, 75). The Mahalanobis distance values of the two turbines are relatively stable, and correspondingly, there is no sharp change occurred in the CUSUM control chart. This proves that the proposed model can accurately describe the temperature change of the gearbox during normal operating conditions. Hence, it can be concluded that the normal behavior model we built can be used for the modeling analysis of any turbines of the same type in the wind farm, which means there is no need to establish a normal behavior model for each turbine.

5. Testing Fault Prediction Approach

In this section, the proposed approach has been applied in five wind turbines (two turbines of type A and three turbines of type B) to predict the gearbox fault. According to the fault alarm record of the wind farm, the five turbines have suffered gearbox oil temperature over-limit fault during the selected time period. First, the test data is input into the normal behavior model we established to obtain the MD series on the test data set. Then, the change-point detection algorithm is applied to analysis the MD series, and the time point at which the fault occurs and the time point at which the fault symptom begins to appear can be obtained to achieve the fault prediction. In this work, the time interval of the MD series is 5 min, the number of sampling is set to 1000, and the confidence level is 0.99 and 0.90 respectively. The test results of the fault prediction algorithm are as follows.

5.1. Test Results for Turbine #38

Figure 10 displays the fault prediction results of turbine #38, which consists of the gearbox oil temperature curve, MD series, and CUSUM control chart in turn. There are two change points detected by the change-point detection algorithm which were marked in the figure. The change point 1 indicates that the fault symptom begins to appear at 11:42 on 15 March 2018, which is the 2148th sample point. The change point 2 indicates the actual fault that happened at 11:24 on 29 March 2018, which is the 4015th sample point calculated by our algorithm. It can be seen from the temperature curve the measured value has exceeded the upper limit at the corresponding time of change point 2, at which the MD value also has a drastic increase.

From Table 6, it is worth noting that the confidence level of change point 3 is 0.90, which means we only get two change points under the confidence level of 0.99. The confidence level indicates how confident the result is that the fault actually occurs or the fault symptom begins to appear. The change point 1 occurred with 99% confidence. While the change point 3 occurred with 90% confidence. That means in the 1000 times of bootstrap sampling, the change point 3 was detected at least 900 times while the change point 1 and 2 were detected at least 990 times. Therefore, we are much more confident about the change point 1 at which the fault symptoms begin to appear.

According to the fault alarm record of the wind farm, the gearbox oil temperature over-limit fault did happen at 15:10 on 29 March 2018, which is almost consistent with the time point 2 calculated by the change point detection algorithm. Although the gearbox oil temperature at change point 3 is close to the upper alarm limit, no gearbox fault has occurred. Thus, change point 3 is only a potential change point with a low confidence level. Therefore, we can achieve the fault prediction by sending out an alarm signal at the time of change point 1 to arrange the operation and maintenance work. The case study preliminarily verifies the effectiveness of our fault prediction algorithm to recognize the fault symptoms, and thus send out the alarm signal 14 days in advance. Other than this, the proposed change point algorithm can generate different results based on the confidence level we set. Generally speaking, the lower the confidence level we set, the more change points been recognized. But quite low confidence levels will lead to an unreliable result like a false alarm.

5.2. Test results for Turbine #173

Similarly, Figure 11 shows the fault prediction results of turbine #173. The fault symptom begins to appear at 03:24 on 6 April 2018, with corresponding to the change point 1. The change point 2 indicates the fault finally happened at 08:32 on 19 April 2018. Table 7 lists the information about the four change points. When the confidence level is reduced from 0.99 to 0.90, in addition to the original change point 1 and 2, the algorithm also detects the change point 3 and 4. However, there is no fault happened at the timestamp corresponding to change point 3 and 4.

According to the fault alarm record, turbine #173 had a temperature over-limit fault at 06:52 on 19 April 2018. We are much more confident about the change point 1 at which the fault symptoms beginning to appear. Therefore, the fault prediction algorithm can detect the fault 13 days in advance. Through the analysis of the above two turbines, we can find that the proposed algorithm can not only identify the time of the fault happens but also accurately detect the time point at which the fault symptom begins to appear with a certain confidence level so as to realize the fault prediction.

5.3. Test Results for Other Turbines

The proposed algorithm was applied to the other three wind turbines to prediction the gearbox fault. All these three turbines have suffered gearbox oil temperature faults during the selected time period. Table 8 lists the test results. It can be seen from the figure that the faults can be detected 16 to 22 days in advance. The difference in prediction results may be related to the type of wind turbines and the severity of the fault.

6. Conclusions

In this work, a gearbox fault prediction approach has been proposed by building a normal behavior model and analyzing the MD series through the change-point detection algorithm. During the modeling process, a data cleaning method composed of the DBSCAN and quartile was applied to identify and drop out the anomaly data. Then, two stacking models were constructed for each type of wind turbines by inputting 15 features to describe the gearbox oil temperature under normal conditions. Compared with other single models, the stacking model can achieve better performance on three metrics and can also overcome the defects of the single model, such as the over-fitting problem.

The effectiveness of the proposed approach was verified using the SCADA data of a wind farm in Ningxia, China. Considering that there are a large number of wind turbines in the wind farm, we use the SCADA data of five turbines for half a year period to establish the normal behavior model of the turbines, so that the model can be used for fault prediction of all other turbines of the same type without separately building models for each turbine. Finally, we verified the fault prediction algorithm on two normal turbines and five fault turbines. The case study shows that the algorithm can detect the fault symptoms 13 to 22 days in advance.

The proposed approach in this paper mainly has two aspects. The Mahalanobis distance, on the one hand, is applied to quantify the discrepancy between the model output value and the actual value of oil temperature. It can be seen from the MD figures that the series of turbines under normal conditions is relatively stable and there is no large fluctuation. Contrary to this, the MD value of the faulty turbines will have a drastic increase when the fault happens, which proves its rationality of using MD to measure the deviation between the actual and normal conditions of turbines. On the other hand, the change-point detection algorithm can not only identify the time point at which the fault occurs, but more importantly, it can detect the time point at which the fault symptom begins to appear, thereby scheduling the operation and maintenance work in advance, and realizing the fault prediction.

In conclusion, we have proposed a fault prediction approach based on a stacking model and change-point detection. The stacking model, as the normal behavior model to describe the normal conditions of wind turbines, its prediction accuracy is obviously higher than that of other single models and is especially suitable for dealing with the occasions with a large amount of data and complex parameter relationships in wind farm. The stacking model can also be applied to power forecasting and the power optimization of wind farms. In addition, compared to the traditional fault detection approach by setting an alarm threshold, our approach is capable of providing more information on the incoming fault. It can not only determine the time at which the fault symptoms beginning to appear, for each change point, it also provides further information including a confidence level representing the possibility that a fault symptom is about to appear. The higher the confidence level is, the more we are confident about the fault prediction result. During the actual application stage, by setting a proper confidence level in advance, the personnel of the wind farm can arrange the operation and maintenance work at the corresponding time of the change point detected by the algorithm, repairing the gearbox in advance, and achieving the goal of fault prediction.

It is worth noting that, unlike setting a single threshold for fault detection, we use the method of change-point detection for the MD series to predict the fault. However, due to the inherent limitations of the change-point detection algorithm itself and the complicated state of the turbines during actual operation, it is also possible to detect the existence of change points in the MD series of a normal turbine, which may lead to a false alarm. In this case, it is of vital importance to select the proper test interval during the actual application stage. For example, in this paper, we use the SCADA data for a period of a month for fault prediction. That is to say, the length of the MD series can significantly affect the accuracy of the final detection result. Take Figure 8 and Figure 10 as examples. They display the test results on normal turbines and fault turbines respectively. There is an obvious difference between their CUSUM control chart. The CUSUM chart of normal turbines (Figure 8) may fluctuate slightly in the short term, but the trend in the one-month period is relatively stable, with its absolute value fluctuating from 0 to 500. While for the fault turbines in Figure 10, the CUSUM chart has a steeper trend and especially when the fault happens, there is a sudden change in direction of CUSUM with their absolute values changing from 0 to 1600. In future work, the appropriate prediction interval should be selected in the application process to reduce the false alarm rate. In addition, the proposed approach will be applied for fault prediction of other components of the wind turbine.

Author Contributions

T.Y. proposed the main idea for the paper and prepared the manuscript. Z.S. supervised the paper and reviewed the manuscript. S.M. contributed the data set and supervised the research.

Funding

This work was supported by Commonweal Technology Research Project of Zhejiang Province, China (grant No. LGG18F030005).

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

CUSUM	Cumulative Sum
NBM	Normal Behavior Modeling
DNSCAN	Density-Based Spatial Clustering of Applications with Noise
IQR	Interquartile Range
RF	Random Forest
GBDT	Gradient Boosting Decision Tree
XGBOOST	Extreme Gradient Boosting
MD	Mahalanobis Distance
RFE	Recursive Feature Elimination
KNN	k-Nearest Neighbor
SVM	Support Vector Machine
DT	Decision Tree
Adaboost	Adaptive Boosting

References

Brussels, B. GWEC Global Wind Report 2018. 2019. Available online: https://gwec.net/global-wind-report-2018/ (accessed on 21 June 2019).
Tchakoua, P.; Wamkeue, R.; Ouhrouche, M.; Slaoui-Hasnaoui, F.; Tameghe, T.; Ekemb, G. Wind Turbine Condition Monitoring: State-of-the-Art Review, New Trends, and Future Challenges. Energies 2014, 7, 2595–2630. [Google Scholar] [CrossRef] [Green Version]
Leahy, K.; Gallagher, C.; O’Donovan, P.; Bruton, K.; O’Sullivan, D. A Robust Prescriptive Framework and Performance Metric for Diagnosing and Predicting Wind Turbine Faults Based on SCADA and Alarms Data with Case Study. Energies 2018, 11, 1738. [Google Scholar] [CrossRef]
Kusiak, A.; Li, W. Virtual Models for Prediction of Wind Turbine Parameters. IEEE Trans. Energy Convers. 2010, 25, 245–252. [Google Scholar] [CrossRef]
Kusiak, A.; Li, W. The prediction and diagnosis of wind turbine faults. Renew. Energy 2011, 36, 16–23. [Google Scholar] [CrossRef]
Kusiak, A.; Verma, A. A Data-Driven Approach for Monitoring Blade Pitch Faults in Wind Turbines. IEEE Trans. Energy Convers. 2010, 2, 87–96. [Google Scholar] [CrossRef]
Kusiak, A.; Verma, A. Analyzing bearing faults in wind turbines: A data-mining approach. Renew. Energy 2012, 48, 110–116. [Google Scholar] [CrossRef]
Tautz-Weinert, J.; Watson, S.J. Using SCADA data for wind turbine condition monitoring—A review. IET Renew. Power Gener. 2017, 11, 382–394. [Google Scholar] [CrossRef]
Santos, P.; Villa, L.F.; Renones, A.; Bustillo, A.; Maudes, J. An SVM-based solution for fault detection in wind turbines. Sensors 2015, 15, 5627–5648. [Google Scholar] [CrossRef]
Wenyi, L.; Zhenfeng, W.; Jiguang, H.; Guangfeng, W. Wind turbine fault diagnosis method based on diagonal spectrum and clustering binary tree SVM. Renew. Energy 2013, 50, 1–6. [Google Scholar] [CrossRef]
Bangalore, P.; Tjernberg, L.B. An Approach for Self Evolving Neural Network Based Algorithm for Fault Prognosis in Wind Turbine. In Proceedings of the 2013 IEEE Grenoble Conference, Grenoble, France, 16–20 June 2013; pp. 1–6. [Google Scholar]
Wang, H.; Wang, H.; Jiang, G.; Li, J.; Wang, Y. Early Fault Detection of Wind Turbines Based on Operational Condition Clustering and Optimized Deep Belief Network Modeling. Energies 2019, 12, 984. [Google Scholar] [CrossRef]
Bangalore, P.; Letzgus, S.; Karlsson, D.; Patriksson, M. An artificial neural network-based condition monitoring method for wind turbines, with application to the monitoring of the gearbox. Wind Energy 2017, 20, 1421–1438. [Google Scholar] [CrossRef]
Guo, P.; Infield, D.; Yang, X. Wind Turbine Generator Condition-Monitoring Using Temperature Trend Analysis. IEEE Trans. Sustain. Energy 2012, 3, 124–133. [Google Scholar] [CrossRef]
Astolfi, D.; Castellani, F.; Natili, F. Wind turbine generator slip ring damage detection through temperature data analysis. Diagnostyka 2019, 20, 3–9. [Google Scholar] [CrossRef]
Garlick, W.G.; Dixon, R.; Watson, S.J. A model-based approach to wind turbine condition monitoring using SCADA data. ICSE 2009, 38, 536–548. [Google Scholar]
Yeh, C.H.; Lin, M.H.; Lin, C.H.; Yu, C.E.; Chen, M.J. Machine Learning for Long Cycle Maintenance Prediction of Wind Turbine. Sensors 2019, 19, 1671. [Google Scholar] [CrossRef]
Bangalore, P.; Tjernberg, L.B. An Artificial Neural Network Approach for Early Fault Detection of Gearbox Bearings. IEEE Trans. Smart Grid 2015, 6, 980–987. [Google Scholar] [CrossRef]
Wu, X.; Wang, H.; Jiang, G.; Xie, P.; Li, X. Monitoring Wind Turbine Gearbox with Echo State Network Modeling and Dynamic Threshold Using SCADA Vibration Data. Energies 2019, 12, 982. [Google Scholar] [CrossRef]
Pinar Pérez, J.M.; García Márquez, F.P.; Tobias, A.; Papaelias, M. Wind turbine reliability analysis. Renew. Sustain. Energy Rev. 2013, 23, 463–472. [Google Scholar] [CrossRef]
Ragheb, A.; Ragheb, M. Wind turbine gearbox technologies. In Proceedings of the 1st International Nuclear & Renewable Energy Conference (INREC), Amman, Jordan, 21–24 March 2010; pp. 1–8. [Google Scholar]
Zaher, A.; Mcarthur, S.D.J.; Infield, D.G.; Patel, Y. Online Wind Turbine Fault Detection through Automated SCADA Data Analysis. Wind Energy 2010, 12, 574–593. [Google Scholar] [CrossRef]
Leahy, K.; Gallagher, C.; O’Donovan, P.; O’Sullivan, D.T.J. Issues with Data Quality for Wind Turbine Condition Monitoring and Reliability Analyses. Energies 2019, 12, 201. [Google Scholar] [CrossRef]
Butler, S.; Ringwood, J.; O’Connor, F. Exploiting SCADA system data for wind turbine performance monitoring. In Proceedings of the Conference on Control & Fault-tolerant Systems, Nice, France, 9–11 October 2013. [Google Scholar]
Xi, Y.; Lu, Z.; Ying, Q.; Yong, M.; O’Malley, M. Identification and Correction of Outliers in Wind Farm Time Series Power Data. IEEE Trans. Power Syst. 2016, 31, 4197–4205. [Google Scholar]
Bird, L.; Lew, D.; Milligan, M.; Carlini, E.M.; Estanqueiro, A.; Flynn, D.; Gomez-Lazaro, E.; Holttinen, H.; Menemenlis, N.; Orths, A.; et al. Wind and solar energy curtailment: A review of international experience. Renew. Sustain. Energy Rev. 2016, 65, 577–586. [Google Scholar] [CrossRef] [Green Version]
Ester, M.; Kriegel, H.P.; Sander, J.; Xu, X. A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, Portland, OR, USA, 2–4 August 1996; AAAI Press: Portland, OR, USA, 1996; pp. 226–231. [Google Scholar]
Hyndman, R.J.; Fan, Y. Sample Quantiles in Statistical Packages. Am. Stat. 1996, 50, 361–365. [Google Scholar]
Brown, G. A New Perspective for Information Theoretic Feature Selection. In Proceedings of the 12th International Conference on Artificial Intelligence and Statistics (AISTATS) 2009, Clearwater Beach, FL, USA, 16–19 April 2009; Volume 5, pp. 49–56. [Google Scholar]
Guyon, I.; Weston, J.; Barnhill, S.; Vapnik, V. Gene Selection for Cancer Classification using Support Vector Machines. Mach. Learn. 2002, 46, 389–422. [Google Scholar] [CrossRef]
Dy, J.G.; Brodley, C.E. Feature Selection for Unsupervised Learning. J. Mach. Learn. Res. 2004, 5, 845–889. [Google Scholar]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
Zhu, R.; Zeng, D.; Kosorok, M.R. Reinforcement Learning Trees. J. Am. Stat. Assoc. 2015, 110, 1770–1784. [Google Scholar] [CrossRef] [Green Version]
Deng, X.; Li, Y.; Weng, J.; Zhang, J. Feature selection for text classification: A review. Multimed. Tools Appl. 2018, 78, 3797–3816. [Google Scholar] [CrossRef]
Vishwanath, D.; Gupta, S. Adding CNNs to the Mix: Stacking models for sentiment classification. In Proceedings of the IEEE Annual India Conference (INDICON), Bangalore, India, 16–18 December 2016; pp. 1–4. [Google Scholar]
Wang, Z.; Wang, D. Recurrent deep stacking networks for supervised speech separation. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 5–9 March 2017; pp. 71–75. [Google Scholar]
Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; ACM: San Francisco, CA, USA, 2016; pp. 785–794. [Google Scholar]
Li, Y.; Yang, Z.; Chen, X.; Yuan, H.; Liu, W. A stacking model using URL and HTML features for phishing webpage detection. Future Gener. Comput. Syst. 2019, 94, 27–39. [Google Scholar] [CrossRef]
Taylor, W.A. Change-Point Analysis: A Powerful New Tool for Detecting Changes; Baxter Healthcare Corporation: Deerfield, IL, USA, 2000; Available online: https://variation.com/wp-content/uploads/change-point-analyzer/change-point-analysis-a-powerful-new-tool-for-detecting-changes.pdf (accessed on 21 June 2019).
Zhao, Y.; Li, D.; Dong, A.; Kang, D.; Lv, Q.; Shang, L. Fault Prediction and Diagnosis of Wind Turbine Generators Using SCADA Data. Energies 2017, 10, 1210. [Google Scholar] [CrossRef]

Figure 1. Three common types of anomaly data in the power curve.

Figure 2. Overview of the overall framework.

Figure 3. Structure of the two-layer stacking model.

Figure 4. Structure of the five-fold cross-validation process.

Figure 5. The flowchart of the change-point detection process.

Figure 6. Performance of the proposed data preprocessing method: (a) result of the DBSCAN and (b) result of the quartile method after filtered out by the DBSCAN.

Figure 7. The power curve after data preprocessing.

Figure 8. Testing normal behavior model results for Turbine #36.

Figure 9. Testing normal behavior model results for Turbine #149.

Figure 10. Testing the fault prediction approach results for Turbine #38.

Figure 11. Testing the fault prediction approach results for Turbine #173.

Table 1. Specification of two types of wind turbines.

Wind Turbine Parameters	Type
Wind Turbine Parameters	A	B
Rated power	1500 kW	2000 kW
Cut-in, rated wind speed	3 m/s, 11 m/s	2.5 m/s, 10 m/s
Rotor diameter	88 m	115 m
Blade length	43 m	56.5 m
Hub height	80 m	90 m

Table 2. Description of modeling datasets.

Type	Dataset	Turbines Considered	Timestamps
A	Training	#34, #35, #42, #44, #52	1 July 2018–1 January 2019
A	Testing normal behavior	#36	1 July 2018–31 July 2018
	Testing fault prediction	#38	1 March 2018–4 April 2018
	Testing fault prediction	#58	1 April 2018–1 May 2018
	Training	#150, #152, #153, #156, #169	1 July 2018–1 January 2019
B	Testing normal behavior	#149	1 December 2018–30 December 2018
	Testing fault prediction	#159	30 March 2018–30 April 2018
		#173	1 April 2018–30 April 2018
		#162	10 February 2018–18 March 2018

Table 3. Result of the three feature selection algorithms.

No.	Filter	Wrapper	RF
1	Gearbox oil Temperature(t-1)	Active power	Gearbox oil Temperature (t-1)
2	Gearbox oil Temperature(t-2)	Active consumption	Gearbox bearing A temperature
3	Gearbox bearing A temperature	Gearbox bearing A temperature	Gearbox bearing B temperature
4	Gearbox bearing B temperature	Gearbox bearing B temperature	Gearbox oil Temperature (t-2)
5	Wind speed	Generator Stator L1 temperature	Wind speed
6	Active power	Generator Stator L3 temperature	Active power
7	Generator speed	Wind speed	Rotor temperature
8	Rotor speed	Gearbox oil Temperature (t-1)	Voltage phase b, c
9	Generator torque	Gearbox oil Temperature (t-2)	Rotor speed
10	Generator Stator L3 temperature	Generator bearing A temperature	Pitch angle

Table 4. Performance of the models.

Model	A			B
Model	MAE	RMSE	R²	MAE	RMSE	R²
KNN	1.586	2.064	0.935	1.383	1.765	0.965
SVM	0.512	0.714	0.959	0.657	1.033	0.988
DT	0.644	0.948	0.928	0.416	0.662	0.985
Adaboost	0.689	0.845	0.943	0.675	0.893	0.991
RF	0.289	0.389	0.996	0.262	0.407	0.995
GBDT	0.266	0.381	0.997	0.279	0.408	0.995
XGBOOST	0.265	0.380	0.997	0.282	0.409	0.995
Stacking	0.219	0.352	0.998	0.254	0.376	0.997

Table 5. Performance of the two normal behavior models.

Type	Turbine	MAE	RMSE	R²
A	#36	0.440	0.589	0.972
B	#149	0.343	0.435	0.998

Table 6. The information of the change points for Turbine #38.

Change Point	Confidence Level	Timestamp
1	0.99	15 March 2018 11:42
2	0.99	29 March 2018 11:24
3	0.90	25 March 2018 11:35

Table 7. The information of the change points for Turbine #173.

Change Point	Confidence Level	Timestamp
1	0.99	6 April 2018 03:24
2	0.99	19 April 2018 08:32
3	0.90	9 April 2018 16:23
4	0.90	13 April 2018 21:05

Table 8. Prediction results for the other three wind turbines.

Wind Turbine	Fault Prediction Date	Fault Record Date	Length of Prediction
#58	3 April 2018	19 April 2018	16 days
#159	30 March 2018	18 April 2018	19 days
#162	21 February 2018	14 March 2018	22 days

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yuan, T.; Sun, Z.; Ma, S. Gearbox Fault Prediction of Wind Turbines Based on a Stacking Model and Change-Point Detection. Energies 2019, 12, 4224. https://doi.org/10.3390/en12224224

AMA Style

Yuan T, Sun Z, Ma S. Gearbox Fault Prediction of Wind Turbines Based on a Stacking Model and Change-Point Detection. Energies. 2019; 12(22):4224. https://doi.org/10.3390/en12224224

Chicago/Turabian Style

Yuan, Tongke, Zhifeng Sun, and Shihao Ma. 2019. "Gearbox Fault Prediction of Wind Turbines Based on a Stacking Model and Change-Point Detection" Energies 12, no. 22: 4224. https://doi.org/10.3390/en12224224

APA Style

Yuan, T., Sun, Z., & Ma, S. (2019). Gearbox Fault Prediction of Wind Turbines Based on a Stacking Model and Change-Point Detection. Energies, 12(22), 4224. https://doi.org/10.3390/en12224224

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Gearbox Fault Prediction of Wind Turbines Based on a Stacking Model and Change-Point Detection

Abstract

1. Introduction

2. Background Review

3. Framework for Proposed Fault Prediction Approach

3.1. Data Preprocessing

3.2. Feature Selection

3.3. The Stacking Model

3.4. Fault Prediction Approach

3.4.1. Mahalanobis Distance Series

3.4.2. Change-Point Detection

4. Modeling Normal Gearbox Behavior

4.1. Data Description

4.2. Data Preprocessing

4.3. Feature Selection

4.4. Model Construction

4.5. Testing Normal Behavior Model

5. Testing Fault Prediction Approach

5.1. Test Results for Turbine #38

5.2. Test results for Turbine #173

5.3. Test Results for Other Turbines

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI