1. Introduction
The urgent need to address climate change has driven nations worldwide to transition decisively from fossil fuel-based energy systems to renewable energy sources, aiming to reduce CO
2 emissions and achieve global sustainability goals. In this context, wind energy has emerged as one of the most significant renewable energy sources, propelled by rapid advancements in wind turbine technology and a growing demand for clean energy solutions [
1]. According to the Global Wind Energy Council’s 2024 Report (GWEC), the wind industry achieved a record-breaking installation of 117 GW of new capacity in 2023, marking the most successful year in its history [
2]. The International Energy Agency (IEA) forecasts that renewables, with wind playing a key role, will account for nearly 30% of global electricity production by 2030, driven by the expansion of offshore wind farms [
3]. Offshore wind farms harness stable, high-speed winds far from shore, minimizing land-use conflicts and environmental impacts such as noise pollution while preserving natural landscapes [
4]. This growth not only underscores wind energy’s pivotal role in the global energy transition but also poses an urgent need to enhance the reliability and efficiency of wind turbine systems to meet ambitious climate targets, such as the 45% greenhouse gas emission reduction by 2030 outlined in the Intergovernmental Panel on Climate Change (IPCC) report [
5,
6].
Despite significant progress in wind energy, the reliability of wind turbines remains a major challenge, particularly concerning gearboxes, a critical yet vulnerable component. Gearboxes are responsible for converting the slow rotation of turbine blades into the high-speed input required by generators, but they operate under harsh conditions such as variable loads, high humidity, and corrosive marine environments, especially in offshore wind farms [
7,
8]. Operation and Maintenance (O&M) costs account for 20–25% of the total cost per kWh for new turbines, rising to 20–35% toward the end of their lifecycle, with gearbox-related failures contributing up to 40% of these costs due to high repair expenses and prolonged downtime [
7,
9].
Traditional condition monitoring methods, such as Condition Monitoring Systems (CMSs) based on vibration analysis, struggle with noise, nonlinearity, and instability in vibration data, resulting in low diagnostic accuracy (often below 85%), even with optimized Support Vector Machine (SVM) models enhanced by the Artificial Bee Colony algorithm [
10,
11]. Similarly, Supervisory Control and Data Acquisition (SCADA) systems, while offering real-time monitoring with diverse data, generate high-frequency datasets that demand substantial computational resources and are prone to false positives without effective preprocessing [
12,
13]. Recent studies, such as Verma et al. (2022), indicate that current SCADA-based fault prognosis models often detect issues only in advanced failure stages, missing early warning signs essential for preventive maintenance [
14]. Condition monitoring and early fault diagnosis techniques for wind turbines were reviewed by Md Liton Hossain et al. via a focus on component-specific faults, signal analysis techniques, and signal processing tools [
15]. It highlights the need for trustworthy, accessible online monitoring systems while indicating disadvantages such as high costs, sensor reliability in difficult circumstances, difficulties identifying early faults, and data dependency for AI. Guo, Wei, and colleagues suggest a hybrid fault diagnosis approach for wind turbine gearboxes that combines convolutional neural networks (CNNs) and symmetric dot pattern (SDP) visualization [
16]. To detect faults with high accuracy, the method transforms vibration signals into snowflake-shaped SDP images, which are subsequently classified by a CNN. Illumination is an important need in the early and exact diagnosis of faults in wind energy systems due to the drawbacks of conventional condition monitoring techniques, including CMS and SCADA systems. These challenges indicate the creation of an advanced diagnostic model that can process complex and noisy data, correctly predict gearbox failures early on, and optimize scheduled maintenance. In addition to enhancing wind turbine dependability, such a model would solve the urgent cost and efficiency challenges in the wind energy industry.
This study aims to address the identified gap in gearbox diagnostics by developing a current condition diagnostic (CCD) model for wind turbine gearboxes to significantly improve fault detection accuracy and enable proactive maintenance. Specifically, the objectives are as follows:
- (1)
To design a model that leverages SCADA data to detect gearbox anomalies with superior precision, targeting an accuracy exceeding 95%.
- (2)
To overcome the limitations of noise and false positives in traditional CMS and SCADA methods through advanced data processing techniques.
- (3)
To integrate the model with the health monitoring system for real-time monitoring and predictive maintenance, thereby extending gearbox lifespan and optimizing the economic efficiency of wind farms.
With state-of-the-art accuracy, this study advances the scientific utilization of deep learning for gearbox diagnosis using multidimensional SCADA data. This approach shows how deep learning can be optimized for high-dimensional, noisy datasets by using techniques such as Principal Component Analysis (PCA) for dimensionality reduction and DBSCAN for noise filtering. In practice, this study provides wind farms with a scalable, effective predictive maintenance solution that could lower expenses and downtime. The suggested model promotes proactive maintenance processes by enabling early fault detection and diagnosis, thereby improving wind energy systems’ efficiency and sustainability. These objectives are pursued to enhance wind turbine reliability, supporting the scalable adoption of wind energy as a sustainable resource. The proposed approach integrates the latest advancements in artificial intelligence and data science to overcome the shortcomings of traditional diagnostic methods. Utilizing SCADA data from wind turbine in the Republic of Korea as the foundation, the model follows a multi-step process:
- (1)
Principal Component Analysis (PCA) is applied to reduce data dimensionality and extract key features, improving computational efficiency.
- (2)
Density-Based Spatial Clustering of Applications with Noise (DBSCAN) removes noise, enhancing data clarity for both normal and anomalous states.
- (3)
A Deep Neural Network (DNN) with an optimized architecture—employing ReLU and Sigmoid activation functions—classifies gearbox conditions, achieving a confirmed accuracy of 98.8%.
The model is validated against and designed for compatibility with digital twin ecosystems, offering a cutting-edge solution for predictive maintenance that surpasses the reactive nature of traditional CMS and SCADA methods. The diagnostic model developed in this study represents a significant advancement in wind turbine technology, addressing a critical barrier to the widespread adoption of wind energy. Moreover, the model’s high accuracy and scalability position is a benchmark for future research, with the potential to influence global wind farm operations and contribute to the United Nations’ Sustainable Development Goal 7 (Affordable and Clean Energy) by 2030 [
17,
18]. This research not only enhances technical reliability but also boosts the economic viability of renewable energy, delivering an innovative and practical model for industry adoption.
The structure of this article is carefully crafted to guide readers systematically through the research process and its key findings.
Section 2, “Materials and Data Processing Methods”, provides a comprehensive overview of this study’s foundation, introducing the SCADA dataset collected from wind turbine, detailing the principles of data collection, and describing the preprocessing techniques, including data processing and analysis, as well as the design of the CCD model.
Section 3, “Design of the CCD Models”, elaborates on the deep neural network (DNN) workflow, outlining the model’s architecture, the application of ReLU and Sigmoid activation functions, and the development of two specialized diagnostic approaches—temperature prediction and condition labeling optimized to enhance fault detection accuracy.
Section 4, “Results of the CCD Model Accuracy Verification”, presents a detailed evaluation of the model’s performance, highlighting key accuracy metrics, such as the achieved 98.8%, and validating its effectiveness against historical fault data to underscore its practical applicability in real-world wind turbine operations. Finally,
Section 5, “Discussions and Conclusions”, synthesizes the research outcomes, explores their implications for improving wind turbine reliability, and proposes future research directions, including the potential for scaling the model across multiple turbines and integrating it with cutting-edge AI technologies.
3. Design of the CCD Models
3.1. Preprocessing of the Activation Functions
The integration of SCADA systems with deep learning and AI revolutionizes wind turbine gearbox diagnostics by enabling early fault detection, predictive accuracy, and proactive maintenance. Deep learning uncovers intricate patterns in SCADA datasets that traditional methods fail to detect [
30]. The DNN algorithm description in Algorithm 1 analyzes time-series data to identify early-stage degradation, preventing catastrophic failures. Building on preprocessing from
Section 2.2, the model architecture, training, validation, and inference steps learn hierarchical data representations for tasks like classification and regression. Two training approaches are applied: temperature prediction learning (forecasting thermal anomalies) and condition label learning (classifying operational states as normal/abnormal), forming a dual framework to optimize fault detection accuracy.
Algorithm 1 DNN algorithm for training CCD model |
- 1:
Initialize W and b - 2:
load SCADA dataset T - 3:
define DNN architecture with layers - 4:
set activation functions: ReLU for hidden layers, Sigmoid for the output layer - 5:
while not convergence criterion: - 6:
for each training example i in T: - 7:
z = W.x + b - 8:
a = σ(z) - 9:
calculate loss L using cross-entropy - 10:
compute gradients ∇L - 11:
update weights and biases using optimization algorithm (e.g., Adam): - 12:
θ_(t + 1) = θ_t − η∇L - 13:
end while - 14:
validate model performance on the validation set - 15:
test model on unseen data for inference
|
The DNN architecture is designed as a series of layers, each performing linear transformations followed by activation functions. A typical layer operation is as follows:
where
W is the weight matrix,
b is the bias vector,
x is the input vector, and
σ is an activation function (e.g., ReLU, sigmoid, or softmax).
A loss function quantifies the difference between the predicted outputs of the DNN and the actual target values, serving as a guide for optimizing model parameters. In DNN architecture, minimizing the loss functions such as cross-entropy for classification tasks, drives the learning process by adjusting weights to improve prediction accuracy. During training, the model optimizes its parameters by minimizing a loss function, such as cross-entropy for classification:
where
is the true label,
is the predicted probability,
N is the number of samples, and
C is the number of classes.
Optimization algorithms like stochastic gradient descent or Adam update parameters iteratively:
where
represents the model parameters,
is the learning rate, and
∇L is the gradient of the loss function.
The gearbox fault diagnosis model was trained using the architecture previously described, ensuring the model’s robustness and efficiency in predicting gearbox conditions. The model’s training process relied heavily on the effective use of activation functions, specifically ReLU for hidden layers and Sigmoid for the output layer, to enhance learning and ensure accurate binary classification. These activation functions played a critical role in determining how the model processed and transformed data throughout its layers.
The ReLU activation function
introduces non-linearity by zeroing negative inputs, enabling the network to capture complex data patterns and mitigating the vanishing gradient problem in deep networks [
31]. Its efficiency in accelerating convergence makes it ideal for hidden layers in fault diagnosis tasks. For output layers, temperature prediction uses baseline thermal data to detect anomalies, while condition label learning employs labeled operational states (normal/abnormal). The Sigmoid function
, maps output to a 0 to 1 probability range, enabling the precise binary classification of gearbox conditions (0 = normal, 1 = abnormal) [
32].
During the training process, the effectiveness of these activation functions directly influenced the model’s performance. ReLU’s ability to accelerate learning in the hidden layers allowed the model to quickly identify key features and patterns related to gearbox health. Meanwhile, Sigmoid’s probabilistic interpretation ensured that the model provided clear and reliable predictions for classification tasks. These two functions complemented each other, creating a balanced and efficient training pipeline. Iterative training on SCADA data enhances the model’s ability to differentiate normal and abnormal gearbox conditions. Combining ReLU (for feature learning) and Sigmoid (for precise classification) ensures reliable fault diagnosis. The model’s efficiency and interpretability make it ideal for predictive maintenance in wind turbine gearboxes.
The need for hyperparameter optimization was addressed by carefully adjusting the DNN architecture parameter values through grid search to maximize model performance. The following hyperparameters were evaluated:
Learning rate: {0.001, 0.01}.
Epoches: Early-stop training via TensorFlow/Keras’ EarlyStopping callback if no improvement occurs after a predefined number of epochs.
Batch size: {32, 64, 128}.
Optimizer: {Adam, SGD, RMSprop}.
Dropout rate: {0.2, 0.3, 0.4}.
A total of 54 hyperparameter combinations were tested to find a configuration that improved model performance while reducing overfitting. This study used a two-step optimization approach to deal with the overfitting problem. First, the model was trained for a fixed number of 1000 epochs to ensure that the loss was reduced to an optimal level. After that, training was dynamically stopped when no improvement was seen by applying early stopping with a patience value of 10 epochs and monitoring the validation loss. Finding the optimal number of epochs for the DNN model architecture is made possible by TensorFlow/Keras’ EarlyStopping callback. By halting the process if the validation metric does not improve after 10 epochs, this technique not only avoids overfitting but also guarantees effective training.
3.2. The Temperature Prediction Learning-Based CCD Model
The design of the CCD model based on temperature prediction learning follows a structured implementation process, as depicted in
Figure 7, which outlines the workflow from training to real-time diagnosis. The process begins with the preprocessing of raw historical SCADA data, as detailed in
Section 2.2, where data normalization, feature selection, and filtering are applied to ensure high-quality input. These preprocessed data, comprising temperature-related parameters from the wind turbine, are then used in the training phase to develop the DNN architecture. The model training involves feeding the input features into the DNN, with the target output representing predicted temperature values.
The architectural configuration of the DNN, as specified in
Table 4, includes eight input nodes, one output node, three hidden layers with 128 neurons each, ReLU activation functions in hidden layers, and a Sigmoid function in the output layer. The training employs a Mean Squared Error (MSE) loss function, a learning rate of 0.001, 276 epochs, and a batch size of 64, ensuring robust learning and convergence. During the real-time operation phase, the trained model processes measured inputs to predict output, which is compared with measured output to assess temperature deviations. In the real-time diagnosis phase, these predictions are analyzed against a predefined warning level to generate diagnosis results, facilitating early fault detection.
This temperature prediction-based approach offers several advantages, including its ability to detect thermal anomalies early, which is critical for preventive maintenance, and its compatibility with real-time monitoring systems like digital twins. However, it also presents notable disadvantages: the method is more complex, requiring two distinct steps (prediction and threshold-based classification), which increases computational complexity; it heavily depends on prediction accuracy, where inaccurate temperature forecasts by the DNN can lead to unreliable diagnosis results; and it necessitates defining a temperature deviation threshold to classify abnormalities, a process that can be challenging and inflexible due to varying operational conditions and the subjectivity in setting appropriate thresholds. Furthermore, the reliance on static thresholds may not adequately account for dynamic environmental factors, potentially leading to missed detections or false alarms in fluctuating conditions.
3.3. The Condition Label Learning-Based CCD Model
The design of the CCD model based on condition label learning, as depicted in
Figure 8, adopts a distinct implementation process tailored to classify gearbox states directly, diverging from the temperature prediction approach outlined in
Section 3.2, “The temperature prediction learning-based CCD model”. While both methods share similarities in leveraging preprocessed SCADA data from the wind turbine and utilizing a DNN framework with training, real-time operation, and diagnosis phases, the condition label learning focuses on binary classification (normal or abnormal) rather than predicting temperature values. The process begins with the trained DNN processing measured inputs derived from multiple operational parameters, to produce predicted labels for real-time diagnosis.
The DNN architecture, detailed in
Table 5, features nine input nodes, one output node, three hidden layers with 512, 256, and 128 neurons, ReLU and Sigmoid activation functions, and is trained using an MSE loss function, a learning rate of 0.001, 248 epochs, and a batch size of 64. Unlike the temperature prediction method, which requires an additional step to define temperature thresholds, this approach delivers a direct classification output, streamlining the diagnostic workflow.
This condition label learning method offers unique advantages: it is simple and efficient by avoiding intermediate steps like temperature prediction, thus reducing complexity; it achieves fast speed by eliminating the need for prior temperature forecasting; and it provides good generalization by utilizing diverse feature data (beyond temperature) to detect abnormalities from multiple perspectives. However, it faces challenges such as potential bias from imbalanced datasets and reliance on high-quality labeled data, which can be hard to ensure consistently in operational settings, setting it apart from the temperature-based method’s dependence on accurate thermal predictions and threshold setting.
5. Discussions and Conclusions
This study introduces a novel approach for wind turbine gearbox condition diagnosis, leveraging a DNN architecture integrated with SCADA data and condition labeling. The incorporation of advanced data processing techniques, including PCA for feature reduction and DBSCAN for data filtering, significantly enhances diagnostic accuracy while minimizing false positives and negatives. Wind turbine gearbox predictive maintenance has been enhanced by deep learning, providing a useful tool for the renewable energy industry. This approach uses deep learning to automatically extract complex patterns from raw SCADA data, in contrast with standard techniques like SVMs, which rely on handcrafted features and frequently struggle with high-dimensional data. Our architecture involves techniques for preprocessing that elevate accuracy and efficiency compared to other deep learning models. These improvements are critical for optimizing operation and maintenance processes in wind energy systems. Validation through four test cases with historical fault data demonstrates the model’s capability to accurately diagnose gearbox failures and predict fault conditions 1–2 days in advance, underscoring the importance of context-specific data collection and processing strategies. The proposed methodology bridges theoretical advancements and practical applications, illustrating how AI-driven condition-based maintenance models can transform SCADA data into actionable insights. By improving diagnostic precision and reducing downtime, this approach enhances the reliability and sustainability of wind turbines, reinforcing their role in global decarbonization efforts. While deep learning models offer significant advantages in terms of predictive accuracy, their inherent lack of interpretability can limit trust and adoption in industrial applications. To mitigate this issue, future work could incorporate explainable AI techniques, such as SHAP or LIME, to provide insights into the model’s decision-making process. These methods can help identify which input features—such as wind speed, rotor speed, or oil temperature—most significantly influence the predicted outcomes, thereby improving transparency and facilitating stakeholder confidence. Additionally, visualizing feature importance and decision pathways through XAI tools can assist operators in interpreting results and making informed maintenance decisions. The limited data coverage for the wind turbine and variations across geographical contexts may impact the generalizability of the model despite present limitations. However, by retraining or calibrating the AI model with new turbine SCADA data, the method described in this paper can be widely applied to other turbines. Future research will address class imbalances in fault types to better capture rare failure modes and focus on collecting samples from underrepresented regions, such as extreme climate zones and distinct turbine operating practices, to improve data diversity, as well as to create adaptive preprocessing pipelines for real-time SCADA integration to manage dynamic model updates, noise elimination, and streaming data synchronization. This is necessary to guarantee seamless deployment in operational facilities for preventative maintenance options. Integration with real-time SCADA data streams is also planned to enable continuous and timely diagnostics. Furthermore, fostering cross-industry collaboration and establishing standardized protocols will be essential to accelerate the adoption of AI-enhanced diagnostic solutions in the renewable energy sector. These advancements hold the potential to position wind energy as a cornerstone of sustainable energy systems, contributing to global efforts toward a low-carbon future.