Next Article in Journal
Calorific Value Forecasting of Coal Gangue with Hybrid Kernel Function–Support Vector Regression and Genetic Algorithm
Previous Article in Journal
Ensemble Learning Techniques-Based Monitoring Charts for Fault Detection in Photovoltaic Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel Condition Monitoring Method of Wind Turbines Based on GMDH Neural Network

1
School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, China
2
Ocean College, Zhejiang University, Hangzhou 310058, China
3
Southwest Technology and Engineering Research Institute, Chongqing 400039, China
4
Zhejiang Windey Co., Ltd., Hangzhou 310012, China
5
Huaneng Clean Energy Research Institute, Beijing 102209, China
*
Authors to whom correspondence should be addressed.
Energies 2022, 15(18), 6717; https://doi.org/10.3390/en15186717
Submission received: 15 August 2022 / Revised: 4 September 2022 / Accepted: 5 September 2022 / Published: 14 September 2022

Abstract

:
The safety of power transmission systems in wind turbines is crucial to the wind turbine’s stable operation and has attracted a great deal of attention in condition monitoring of wind farms. Many different intelligent condition monitoring schemes have been developed to detect the occurrence of defects via supervisory control and data acquisition (SCADA) data, which is the most commonly applied condition monitoring system in wind turbines. Normally, artificial neural networks are applied to establish prediction models of the wind turbine condition monitoring. In this paper, an alternative and cost-effective methodology has been proposed, based on the group method of data handling (GMDH) neural network. GMDH is a kind of computer-based mathematical modelling and structural identification algorithm. GMDH neural networks can automatically organize neural network architecture by heuristic self-organization methods and determine structural parameters, such as the number of layers, the number of neurons in hidden layers, and useful input variables. Furthermore, GMDH neural network can avoid over-fitting problems, which is a ubiquitous problem in artificial neural networks. The effectiveness and performance of the proposed method are validated in the case studies.

1. Introduction

As a source of clean, efficient, and widely distributed renewable energy, wind energy has maintained a good development trend in recent years. According to the Global wind power capacity announcement 2022 issued by the Global Wind Energy Council, the global cumulative wind power capacity has reached 837 GW [1]. Wind turbines usually work in harsh environments and, therefore, are prone to failures and unplanned shutdowns after certain years of operation, which are the main obstacles hindering the development of wind energy. The operation and maintenance costs of wind turbines account for quite high percentage of the overall energy generation cost, maximum up to 30% according to reference [2]. Therefore, the early detection of wind turbine failures, which can be achieved via sophisticated and powerful real-time condition monitoring, is critical to the operation and maintenance of wind turbines. Thus, the incipient defects can be corrected before they turn into severe defects by preventive maintenance, and therefore, the wind turbine can be maintained in a satisfactory operational state.
SCADA system and CMS (condition monitoring system) are two main types of condition monitoring systems used for wind turbines. Most of the wind farms install SCADA systems to monitor wind turbines and log the information as time-series data. The SCADA system can provide a large number of measurements, such as temperatures, environment parameters (wind speed, wind direction, and environment temperature), and control system parameters (output power, pitch angle, rotor speed), which are widely used by wind farm operators to monitor the health condition of wind turbines. It demands a powerful information extraction process scheme for data analysis and trend prediction. In CMS systems, a large number of sensors and a high sampling frequency are required, resulting in a huge amount of data to be acquired. A CMS system needs to install many additional sensors [3,4,5], such as vibration accelerometers acoustic emission sensors, which result in the extra expenditure of the condition monitoring. Traditional methods mainly use signal processing (such as discrete wavelet transform, empirical mode decomposition, etc.) and vibration analysis (such as vibration signal, amplitude, etc.) to identify faults of wind turbines [6,7,8].
X. Jin [9] proposed an ensemble approach based on Mahalanobis Distance and Johnson Transform detection and fault diagnosis of wind turbine generator via analyzing the time series SCADA data. P. B. Dao [10] presented a method based on cointegration analysis using process parameters of the SCADA data. This method can effectively analyse nonlinear data trends, continuously monitor the wind turbine, and reliably detect abnormal problems. It has advantages of simplicity and fast computation, which enables it to be implemented online for real-time condition monitoring applications. Traditional machine learning classification models greatly rely on expert knowledge and handcrafted features and do not consider the long-term dependencies hidden in time-domain signals. To solve the problem, J. Lei [11] developed a novel end-to-end fault diagnosis framework based on the Long Short-term Memory model, which can extract features from multivariate time-series data directly and model the long-term dependencies hidden in the raw data. Y. Zhao [12] developed an integrated prediction and diagnosis solution for wind turbine generator RUL prediction and fault diagnosis based on machine learning and statistical techniques. An unsupervised learning method was applied to operational state clustering for generator RUL prediction and a supervised classification method was utilized for mapping features space to state space. A. Santolamazza et al. [13] proposed a comprehensive methodology using an artificial neural network and statistical process control for the fault detection of wind turbines based on SCADA data, integrating tools from machine learning techniques for the development of the models, and statistical process control field for the identification and analysis of operating anomalies. Y. Cui [14] developed an anomaly detection approach using nonlinear autoregressive neural networks for modelling with SCADA data and Mahalanobis distance as evaluated indicators for the condition monitoring of wind turbines. The main contribution of the approach is to consider the SCADA alarms with the condition monitoring during modelling, thereafter it can compensate the weakness of SCADA alarm logs to remind of the possible anomalies in wind turbines. Qian et al. [15] proposed an extreme learning machine model for condition monitoring of wind turbines, which helped to identify faults based on deviations from ideal SCADA signals. The extreme learning machine model takes considerably less time to train and make predictions, given that it can randomly updated the weights and biases, unlike artificial neural works. Recently, T. Xia et al. [16,17,18] proposed a few new frameworks based on multi-feedback neural network, opportunistic maintenance, and adversarial regressive domain adaptation approach for condition monitoring.
Most of the reviewed works are based on artificial neural networks or various deep learning networks. There are two main disadvantages: a large amount of data for the model training and the models are non-interpretable. To address this problem, GMDH neural network is applied in this research for modelling. The GMDH neural network is a self-organizing deep learning method for the time series forecasting problem and has been applied in many research areas. H. Azimi et al. [19] estimated the ice gouge deformations in clay seabeds using GMDH neural network. T. N. Nguyen et al. [20] proposed a novel analysis-prediction approach for geometrically nonlinear problems of solid mechanics based on GMDH method. M. Witczak et al. [21] developed a robust fault detection scheme with GMDH neural networks for generating an adaptive threshold to estimate the modelling uncertainty. M. R. Youcefi [22] applied GMDH to develop a trustworthy model that can predict the standpipe pressure in real-time during the drilling operation. GMDH neural networks have a good performance in time-series predicting. Therefore, in this paper GMDH neural networks are applied to predict the monitoring parameter, such as bearing temperature or gearbox temperature, and then a residual signal calculates the difference between the predicted temperature and the real measurement. Then, the fault detection can be carried out with the residual signal.

2. Methodology

2.1. Multilayer Perceptron

Multilayer perceptron (MLP) is a type of classical artificial neural network architecture. As shown in Figure 1, MLP neural networks often contain one or more hidden layers between the input and output layers. The feedforward neural network was the first devised and the simplest type of neural network architecture, which contains multiple neurons defined in hidden layers. There are connections or edges between neurons from adjacent layers. All these connections have weights associated with them. Feedforward means that the data flows from the input layer to the output layer in just one direction (forward). The MLP trained with the back-propagation learning algorithm is named as back-propagation (BP) neural network, where the feedback data from the output layer is applied for weights optimization.
In a BP neural network, a neuron is the basic computation unit. It receives inputs from external sources and/or other neurons in previous layers and generates one output. Each input is assigned one weight w , which is calculated based on its relative importance to other inputs. An activation function is applied to the weighted sum of its inputs in each neuron. The output of the hidden neuron j in a hidden layer can be expressed as:
y j = Φ j ( w j T x + b j ) = Φ j ( i = 1 N w j i x i + b j )
where, x = [ x 1 , x 2 , , x N ] T is the neural network input vector, w j = [ w j 1 , w j 2 , , w j N ] T denotes the weight vector between the hidden neuron j and the inputs x , b j is the bias, and Φ j is the activation function, which is usually defined as a nonlinear function (e.g., a sigmoid function). Then, the output of the neural network can be expressed as [23,24]:
y ^ = Φ o ( j = 1 M w o j y j + b o )
where, Φ o is the activation function of the output neuron, which is normally taken as a linear function, y ^ is the final output of the MLP network, b o is the bias value of the output neuron o , and w o j is the synaptic weight value from the hidden neuron j to the output neuron o .
The back-propagation algorithm is the most widely applied algorithm for training MLP neural networks. It works with the training data, so it is a kind of supervised training approach. During initialization, all the weights are initialized randomly. For each input from the training dataset, the ANN is triggered and provide one output. This calculated neural network output is compared with the objective output provided within the training data, yielding the residual error, and is called the cost function. The partial derivatives of the cost function for corresponding parameters are propagated back through the back-propagation process. Then, any gradient-based optimization algorithm can be applied to update the network weights. Repeat this process until the neural network satisfies the given stopping criterion [24].
The Levenberg-Marquardt algorithm applied in this study is a back-propagation algorithm, also known as the damped-least-square method. It is the fastest computationally efficient method for medium-sized feedforward neural networks. This algorithm is specifically developed to handle cost functions in the form of the sum of squared errors. It is not necessary to compute an accurate Hessian matrix.
The neural network weights updated by the Levenberg-Marquardt algorithm can be expressed as [25]:
w k + 1 = w k ( J k T J k + μ I ) 1 J k T e k
where, k is the index of iteration, w k is the weight vector of the k th training iteration as defined in Equation (4), Ι is the identity matrix, μ is the step size, and e k is the error value of the k th training iteration between the outputs and targets.
w k = [ w k , 1 , w k , 2 , , w k , Z ] T
where, Z is the number of weights
e k = [ e k , 1 , e k , 2 , , e k , P ] T
and P is the number of input samples.
J k is a Jacobian matrix of the k th training iteration as defined in Equation (6):
J k = [ e k , 1 w k , 1 e k , 1 w k , 2 e k , 1 w k , Z e k , 2 w k , 1 e k , 2 w k , 2 e k , 2 w k , Z e k , P w k , 1 e k , P w k , 2 e k , P w k , Z ]

2.2. Group Method of Data Handling Network

GMDH [26] is an approach that involves growing a neural network, which has a self-organized structure. GMDH networks gradually increase the number of neurons (or partial models) and obtain a model structure with optimal complexity. Thus, this process avoids the large amount of work in network architecture selection. Figure 2 shows the structure of an example GMDH network with three inputs and one output. GMDH is a smart type of regression, which utilizes three steps to reach the final output (representation, selection, and stopping). The procedure can produce a high degree of polynomials in effective predictor. Figure 3 illustrates a typical computational node of the GMDH network with three inputs as an example [27].
General connection between node inputs and output variables can be expressed by a complicated discrete form of the Volterra functional series (also known as the Kolmogorov–Gabor polynomial) in the form of [28]:
y = a 0 + i = 1 m a i x i + i = 1 m j = 1 m a i j x i x j + i = 1 m j = 1 m k = 1 m a i j k x i x j x k
where, m is the number of input variables, x i , x j , and x k are inputs, y is the corresponding node output value, and a i , a i j , and a i j k are polynomials coefficients.
By means of the GMDH algorithm, a model can be represented as a set of neurons in which different pairs of them in each layer are connected through a quadratic polynomial and thus produce new neurons in the next layer. Such representation can be used in modeling to map inputs to outputs. The formal definition of the identification problem is to find a function, f ^ , so that it can be approximately used instead of actual one, f , in order to predict output, y ^ , for a given input vector, X = [ x 1 , x 2 , , x n ] , as close as possible to its actual output, y . Therefore, the output of the GMDH network for signal output can be written as [28]:
y ^ = f ^ ( x 1 , x 2 , , x n )
In this way, such a partial quadratic description is recursively used in a network of connected neurons to build the general mathematical relation of inputs and output variables given in Equation (8). The coefficient is calculated using regression techniques in a least-squares sense so that the difference between actual output, y , and the calculated, y ^ , for each pair of x i , x j , and x k as input variables are minimized.
The critical step is to determine a GMDH neural network by the root mean square error (RMSE), so that the square of squared differences between the actual output and the predicted one is minimized, that is:
RMSE = 1 N i = 1 N ( y ^ i y i ) 2 min
where, N indicates the number of data samples.
The main procedure for GMDH implementation can be summarized in the following steps:
Step 1.
Divide data sample into training and validation sets. The training data was used for estimating the weights of neurons and the validation data was used for organizing the network architectures.
Step 2.
Determine all neurons (polynomial descriptions) that consisted of all the possible combinations with three inputs among all the input variables. Then, estimate the coefficients of polynomials with the training data set.
Step 3.
After training the first layer, select the best-fitting neurons via the validation data using the RMSE selection criterion.
Step 4.
Check whether the termination condition was fulfilled (the current layer remained and only one neuron or the introduction of new neurons did not improve the overall performance of the network); if YES then STOP, otherwise the output of Step 2 was applied as the inputs for the next layer, and proceed to Step 1.
Step 5.
Finally, the terminating condition was satisfied, the final structure of the network was obtained. To obtain the final model, the path of the neurons that corresponded to the lowest criterion was tracked back in each layer.
The GMDH network can be viewed as a polynomial neural network, where the node function was a polynomial rather than a sigmoid function. The parameter optimization was based on the least-square-fitting algorithm rather than iterative algorithm of BP neural network. Therefore, training in the GMDH network did not need a large amount of time. Moreover, the GMDH network did not have the problem of over-fitting, which was a difficulty in artificial neural networks.
This paper provided a novel framework based on the GMDH network for condition monitoring within the wind turbine paradigm, compared the GMDH condition monitoring model accuracy considering different input parameters combination for input parameter selection, and made a comparison with BP NN based condition monitoring to evaluate its performance.

3. Case Study

In this paper, the supervisory control and data acquisition SCADA data was collected from a wind farm located in China, which consisted of 121 parameters [15]. Condition monitoring based on temperature signal was one of the most common methods in wind turbines using the SCADA data. According to previous research [15], ambient temperature, wind speed, and power output were the most related parameters to gearbox oil temperature that could indicate the gearbox health condition. In terms of the data sampling rate, it was usually recorded at 5 min intervals, which aimed to reduce operational data collected from wind turbines. A 2.5 MW wind turbine equipped with the doubly fed induction generator was selected in this paper.

3.1. Parameter Selection for GMDH

Figure 4 shows the power curve of a 2.5 MW wind turbine. The power output of the wind turbine varied with the cube when the wind speed was less than the rated wind speed of 10 m/s. When the wind speed was below 3 m/s, the torque from the wind turbine rotor was insufficient to allow generator operation. Meanwhile, the wind turbine was shut down to prevent the wind turbine from being damaged when the wind speed was larger than 20 m/s. When the wind speed was between the rated speed and the cut-out speed, the power output of the wind turbine was limited to the rated power (10 m/s to 20 m/s in this paper). The condition monitoring model accuracy was not only determined by the model parameters, but also by the input variables selection. For example, a key factor affecting heat dissipation in a transmission was air temperature. The relationship between the air temperature and the gearbox oil temperature is illustrated in Figure 5. The heat dissipation of the gearbox was worse when the temperature was high than when the temperature was low. Hence, the gearbox temperature was able to change over a wide range in winter and over a small range during summer. It could be seen that both the power curve and the relationship between the air temperature and gearbox oil temperature were non-linear. Hence, the GMDH algorithm was suitable for wind turbine condition monitoring application.
In order to illustrate the impact of the parameter selection, a comparative analysis was performed, which contained wind speed with ambient temperature, wind speed with power output, power output with ambient temperature, and a combination of the above three input variables.
During the model training, the critical factors were selected manually by many trials. The main critical factors for the GMDH network applied in this research were as follows: 3 degree polynomials were applied in neurons in this research, maximal number of neurons in a layer was 6, and the criterion value was 0.005.
The gearbox oil temperature was selected as the wind turbine condition monitoring signal. Analysis results of the condition monitoring model considering different types of input variables are illustrated in Table 1. The RMSE of the condition monitoring models considering three-input variables was less than two-input condition monitoring model, which reached 0.077 for the three-input condition monitoring model. The above results showed that the condition monitoring model with three-input variables was more accurate than condition monitoring model with two-input variables. Consequently, the condition monitoring models developed were based on the three-input variables in this paper.

3.2. CASE 1: Condition Monitoring Based on the Gearbox Oil Temperature

In this case study, the actual wind turbine data obtained from a wind turbine SCADA system had been compared with the corresponding condition monitoring prediction model output to calculate the residual signal. The GMDH neural network was applied to predict gearbox oil temperature to achieve the fault detection of wind turbine gearbox. The actual gearbox oil temperature data gained from the wind turbine SCADA data was compared to the GMDH condition monitoring model output, which aimed to detect the fault of gearbox fault. The SCADA data of air temperature, power output of the wind turbine, and wind speed from a healthy wind turbine was selected as the training dataset to calculate the parameters of GMDH condition monitoring prediction model. In order to validate the performance of GMDH based condition monitoring model, back propagation (BP) neural network was adopted to make comparisons with the proposed method.
Figure 6 presents the wind speed of the wind farm obtained from the wind turbine SCADA system. The corresponding power generation of the wind turbine and actual ambient temperature are shown in Figure 7 and Figure 8, respectively. Figure 9 shows the actual temperature of wind turbine gearbox oil obtained from the SCADA system of a wind turbine with a gearbox fault.
Figure 10 illustrates the condition monitoring model output of wind turbine gearbox oil temperature from a BP model and a GMDH model. Figure 11 shows the residual gearbox oil temperature signals from the BP model and GMDH model. The residual signals were obtained from the difference between a condition monitoring model output and the actual wind turbine gearbox oil temperature from the SCADA data. It showed that the residual signal between the actual gearbox oil temperature of the wind turbine and condition monitoring model output stayed at a low level with fluctuations since the beginning and exhibited an increasing trend at the sampling point 18,600 for BP model and at sampling point 18,590 for GMDH model; thereafter, successive residual errors held at a high level, which indicated a potential fault occurred in the wind turbine gearbox. In order to verify the correctness of the proposed condition monitoring model, the alarm logs of the wind turbine SCADA system had been investigated to check this abnormal behavior which occurred in the wind turbine gearbox. The results showed that both the BP model and GMDH condition monitoring model outputs were consistent with the alarm logs of the wind turbine SCADA system. The fault detection results based on the GMDH condition monitoring model provided the abnormal indication slightly earlier than that of BP based model.

3.3. CASE 2: Condition Monitoring Based on the Generator Bearing Temperature

Case 1 validated the effectiveness of the GMDH network using the SCADA data with a gearbox fault. To further verify the proposed GMDH condition monitoring method, a wind turbine gearbox bearing fault was selected as the research object. The data applied for case study 2 were different than that for case study 1. They were two separate faults that occurred at different time periods. Figure 12 presented the wind speed of the wind farm obtained from the wind turbine SCADA system. The corresponding power generation of the wind turbine and actual ambient temperature were shown in Figure 13 and Figure 14, respectively. Figure 15 showed the actual temperature of the wind turbine gearbox bearing obtained from the SCADA system of a wind turbine with a bearing fault. Figure 16 illustrated the condition monitoring model outputs of wind turbine gearbox bearing temperature from a BP model and a GMDH model. Figure 17 showed the gearbox bearing temperature residual signals, which were obtained from the difference between condition monitoring model output and the actual wind turbine gearbox bearing temperature for both models. It showed that the residual signal stayed at a low level with fluctuations since the beginning and exhibited an increasing trend at sampling point 6037 for the BP model and at a sampling point 5085 for the GMDH model; thereafter, successive residual errors held at a high level, which indicated a potential fault occurring in the wind turbine gearbox bearing. To verify the correctness of the proposed condition monitoring model, the alarm logs of the wind turbine SCADA system had been investigated to check this abnormal behavior occurred at the wind turbine gearbox bearing. The results showed that both BP and GMDH condition monitoring model outputs were consistent with the alarm logs of the wind turbine SCADA system. The fault detection result based on GMDH condition monitoring model provided the abnormal indication at 52 sampling points earlier than that of the BP based model. Furthermore, the increasing trend of residual temperature from the GMDH model was more significant than that from the BP model, which meant that the GMDH based model could provide an effective fault detection indication at the early stage of the gearbox bearing fault case.
Finally, a comparison of the calculation time between these methods is listed in Table 2. It was obvious that the calculation time of the GMDH model was less than that of the BP model for both case studies, especially for case 2, which was 1/3 of the BP model. It showed that the GMDH model had greatly shortened the calculation time.

4. Conclusions

A novel wind turbine condition monitoring the approach based on GMDH neural network using SCADA data was, for the first time, proposed in this paper. In order to validate the effectiveness of the proposed method, the SCADA data collected from a commercial wind farm had been adopted. Two case studies had been carried out to illustrate the wind turbine condition monitoring by using gearbox oil temperature and gearbox bearing temperature. The results showed that the GMDH provided earlier and more significant fault indications than the traditional back-propagation neural network algorithm, while greatly shortening the calculation time. Moreover, GMDH was a kind of self-organized network, and the trained model was simple, easy to understand, and could avoid the over-fitting problems. Hence, the proposed method was suitable for real time implementation in condition monitoring systems, which could reduce condition monitoring cost, and in the meantime, increase the monitoring efficiency.

Author Contributions

Conceptualization, X.T., P.Q. and D.Z.; methodology, X.T. and Y.J.; software, X.T. and C.L. (Chen Liang); validation, H.W., C.L. (Chen Liang) and C.L. (Cong Liu); formal analysis, Y.Y.; investigation, H.W.; writing—original draft preparation, X.T.; writing—review and editing, P.Q. and D.Z.; funding acquisition, X.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Jiangsu University Senior Talents Research Start-up Fund, grant number [4111140012] and project HDHDW5901010102.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Li, Y.; Wu, Z. A condition monitoring approach of multi-turbine based on VAR model at farm level. Renew. Energy 2020, 166, 66–80. [Google Scholar] [CrossRef]
  2. Milborrow, D. Operation and maintenance costs compared and revealed. Wind Stats 2006, 19, 3. [Google Scholar]
  3. Yang, L.; Zhang, Z. A Conditional Convolutional Autoencoder-Based Method for Monitoring Wind Turbine Blade Breakages. IEEE Trans. Ind. Inform. 2021, 17, 6390–6398. [Google Scholar] [CrossRef]
  4. He, Y.; Kusiak, A. Performance Assessment of Wind Turbines: Data-Derived Quantitative Metrics. IEEE Trans. Sustain. Energy 2018, 9, 65–73. [Google Scholar] [CrossRef]
  5. Maldonado-Correa, J.; Martín-Martínez, S.; Artigao, E.; Gómez-Lázaro, E. Using SCADA Data for Wind Turbine Condition Monitoring: A Systematic Literature Review. Energies 2020, 13, 3132. [Google Scholar] [CrossRef]
  6. Yang, W.; Court, R.; Tavner, P.J.; Crabtree, C.J. Bivariate empirical mode decomposition and its contribution to wind turbine condition monitoring. J. Sound Vib. 2011, 330, 3766–3782. [Google Scholar] [CrossRef]
  7. Feng, Z.; Liang, M.; Yi, Z.; Hou, S. Fault diagnosis for wind turbine planetary gearboxes via demodulation analysis based on ensemble empirical mode decomposition and energy separation. Renew. Energy 2012, 47, 112–126. [Google Scholar] [CrossRef]
  8. Yang, Q.; An, D. EMD and Wavelet Transform Based Fault Diagnosis for Wind Turbine Gear Box. Adv. Mech. Eng. 2013, 5, 323–335. [Google Scholar] [CrossRef]
  9. Qian, P.; Ma, X.; Zhang, D.; Wang, J. Data-Driven Condition Monitoring Approaches to Improving Power Output of Wind Turbines. IEEE Trans. Ind. Electron. 2019, 66, 6012–6020. [Google Scholar] [CrossRef]
  10. Dao, P.B.; Staszewski, W.J.; Barszcz, T.; Uhl, T. Condition monitoring and fault detection in wind turbines based on cointegration analysis of SCADA data. Renew. Energy 2018, 116, 107–122. [Google Scholar] [CrossRef]
  11. Lei, J.; Liu, C.; Jiang, D. Fault diagnosis of wind turbine based on Long Short-term memory networks. Renew. Energy 2019, 133, 422–432. [Google Scholar] [CrossRef]
  12. Zhao, Y.; Li, D.; Dong, A.; Kang, D.; Lv, Q.; Shang, L. Fault Prediction and Diagnosis of Wind Turbine Generators Using SCADA Data. Energies 2017, 10, 1210. [Google Scholar] [CrossRef]
  13. Santolamazza, A.; Dadi, D.; Introna, V. A Data-Mining Approach for Wind Turbine Fault Detection Based on SCADA Data Analysis Using Artificial Neural Networks. Energies 2021, 14, 1845. [Google Scholar] [CrossRef]
  14. Cui, Y.; Bangalore, P.; Tjernberg, L.B. An Anomaly Detection Approach Based on Machine Learning and SCADA Data for Condition Monitoring of Wind Turbines. In Proceedings of the 2018 IEEE International Conference on Probabilistic Methods Applied to Power Systems (PMAPS), Boise, ID, USA, 23–28 June 2018; pp. 1–6. [Google Scholar]
  15. Qian, P.; Zhang, D.; Tian, X.; Si, Y.; Li, L. A novel wind turbine condition monitoring method based on cloud computing. Renew. Energy 2019, 135, 390–398. [Google Scholar] [CrossRef]
  16. Xia, T.; Dong, Y.; Pan, E.; Zheng, M.; Wang, H.; Xi, L. Fleet-level opportunistic maintenance for large-scale wind farms integrating real-time prognostic updating. Renew. Energy 2021, 163, 1444–1454. [Google Scholar]
  17. Xia, T.; Jiang, Y.; Zhuo, P.; Xi, L.; Wang, D. Dual-Ensemble Multi-Feedback Neural Network for Gearbox Fault Diagnosis. IEEE Trans. Instrum. Meas. 2021, 70, 1–10. [Google Scholar] [CrossRef]
  18. Jiang, Y.; Xia, T.; Wang, D.; Fang, X.; Xi, L. Adversarial Regressive Domain Adaptation Approach for Infrared Thermography-Based Unsupervised Remaining Useful Life Prediction. IEEE Trans. Ind. Inform. 2022, 18, 7219–7229. [Google Scholar] [CrossRef]
  19. Azimi, H.; Shiri, H.; Zendehboudi, S. Ice-seabed interaction modeling in clay by using evolutionary design of generalized group method of data handling. Cold Reg. Sci. Technol. 2022, 193, 103426. [Google Scholar] [CrossRef]
  20. Nguyen, T.N.; Lee, S.; Nguyen-Xuan, H.; Lee, J. A novel analysis-prediction approach for geometrically nonlinear problems using group method of data handling. Comput. Methods Appl. Mech. Eng. 2019, 354, 506–526. [Google Scholar] [CrossRef]
  21. Witczak, M.; Korbicz, J.; Mrugalski, M.; Patton, R.J. A GMDH neural network-based approach to robust fault diagnosis: Application to the DAMADICS benchmark problem. Control. Eng. Pract. 2006, 14, 671–683. [Google Scholar] [CrossRef]
  22. Youcefi, M.R.; Hadjadj, A.; Boukredera, F.S. New model for standpipe pressure prediction while drilling using Group Method of Data Handling. Petroleum 2022, 8, 210–218. [Google Scholar] [CrossRef]
  23. Fausett, L.V. Fundamentals of Neural Networks: Architectures, Algorithms and Applications; Prentice-Hall: Upper Saddle River, NJ, USA, 1994; ISBN 0-13-334186-0. [Google Scholar]
  24. Hagan, M.; Menhaj, M.B. Training feedforward networks with the Marquardt algorithm. IEEE Trans. Neural Netw. 1994, 5, 989–993. [Google Scholar] [CrossRef] [PubMed]
  25. Lu, B.; Upadhyaya, B.R. Monitoring and fault diagnosis of the steam generator system of a nuclear power plant using data-driven modeling and residual space analysis. Ann. Nucl. Energy 2005, 32, 897–912. [Google Scholar] [CrossRef]
  26. Nikolaev, N.Y.; Iba, H. Polynomial harmonic GMDH learning networks for time series modeling. Neural Netw. 2003, 16, 1527–1540. [Google Scholar] [CrossRef]
  27. Madandoust, R.; Ghavidel, R.; Nariman-Zadeh, N. Evolutionary design of generalized GMDH-type neural network for prediction of concrete compressive strength using UPV. Comput. Mater. Sci. 2010, 49, 556–567. [Google Scholar] [CrossRef]
  28. Onwubolu, G. GMDH-Methodology and Implementation in Matlab; Imperial College Press: London, UK, 2015. [Google Scholar]
Figure 1. One hidden layer MLP with one output.
Figure 1. One hidden layer MLP with one output.
Energies 15 06717 g001
Figure 2. Structure of a GMDH network.
Figure 2. Structure of a GMDH network.
Energies 15 06717 g002
Figure 3. A node of the GMDH network.
Figure 3. A node of the GMDH network.
Energies 15 06717 g003
Figure 4. Power curve of the wind turbine.
Figure 4. Power curve of the wind turbine.
Energies 15 06717 g004
Figure 5. Relationship between the gearbox oil temperature and air temperature.
Figure 5. Relationship between the gearbox oil temperature and air temperature.
Energies 15 06717 g005
Figure 6. Actual wind speed of the wind turbine (Case 1).
Figure 6. Actual wind speed of the wind turbine (Case 1).
Energies 15 06717 g006
Figure 7. Actual power generation of the wind turbine (Case 1).
Figure 7. Actual power generation of the wind turbine (Case 1).
Energies 15 06717 g007
Figure 8. Actual ambient temperature in the wind farm (Case 1).
Figure 8. Actual ambient temperature in the wind farm (Case 1).
Energies 15 06717 g008
Figure 9. Actual temperature of the gearbox oil obtained from the SCADA data (Case 1).
Figure 9. Actual temperature of the gearbox oil obtained from the SCADA data (Case 1).
Energies 15 06717 g009
Figure 10. Predicted temperature of the gearbox oil gained from both models. (a) BP model output, (b) GMDH model output.
Figure 10. Predicted temperature of the gearbox oil gained from both models. (a) BP model output, (b) GMDH model output.
Energies 15 06717 g010
Figure 11. Residual temperature of the gearbox oil. (a) Residual temperature from a BP model, (b) Residual temperature from a GMDH model.
Figure 11. Residual temperature of the gearbox oil. (a) Residual temperature from a BP model, (b) Residual temperature from a GMDH model.
Energies 15 06717 g011
Figure 12. Actual wind speed of the wind turbine (Case 2).
Figure 12. Actual wind speed of the wind turbine (Case 2).
Energies 15 06717 g012
Figure 13. Actual power generation of the wind turbine (Case 2).
Figure 13. Actual power generation of the wind turbine (Case 2).
Energies 15 06717 g013
Figure 14. Actual ambient temperature in the wind farm (Case 2).
Figure 14. Actual ambient temperature in the wind farm (Case 2).
Energies 15 06717 g014
Figure 15. Actual temperature of the gearbox bearing gained from the SCADA data (Case 2).
Figure 15. Actual temperature of the gearbox bearing gained from the SCADA data (Case 2).
Energies 15 06717 g015
Figure 16. Predicted temperature of the gearbox bearing gained from the GMDH model. (a) BP model output, (b) GMDH model output.
Figure 16. Predicted temperature of the gearbox bearing gained from the GMDH model. (a) BP model output, (b) GMDH model output.
Energies 15 06717 g016
Figure 17. Residual temperature of the gearbox bearing. (a) Residual temperature from a BP model, (b) Residual temperature from a GMDH model.
Figure 17. Residual temperature of the gearbox bearing. (a) Residual temperature from a BP model, (b) Residual temperature from a GMDH model.
Energies 15 06717 g017
Table 1. The accuracy of the prediction models considering different input parameters.
Table 1. The accuracy of the prediction models considering different input parameters.
Input VariablesRMSE
Wind speed, power output (2 inputs)0.087
Wind speed, ambient temperature (2 inputs)0.084
Power output, ambient temperature (2 inputs)0.086
Wind speed, power output, environment temperature (3 inputs)0.077
Table 2. The comparison of calculation time.
Table 2. The comparison of calculation time.
MethodCalculation Time (s)
Case 1Case 2
BP model1.751.02
GMDH model1.190.35
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Tian, X.; Jiang, Y.; Liang, C.; Liu, C.; Ying, Y.; Wang, H.; Zhang, D.; Qian, P. A Novel Condition Monitoring Method of Wind Turbines Based on GMDH Neural Network. Energies 2022, 15, 6717. https://doi.org/10.3390/en15186717

AMA Style

Tian X, Jiang Y, Liang C, Liu C, Ying Y, Wang H, Zhang D, Qian P. A Novel Condition Monitoring Method of Wind Turbines Based on GMDH Neural Network. Energies. 2022; 15(18):6717. https://doi.org/10.3390/en15186717

Chicago/Turabian Style

Tian, Xiange, Yongjian Jiang, Chen Liang, Cong Liu, You Ying, Hua Wang, Dahai Zhang, and Peng Qian. 2022. "A Novel Condition Monitoring Method of Wind Turbines Based on GMDH Neural Network" Energies 15, no. 18: 6717. https://doi.org/10.3390/en15186717

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop