1. Introduction
Bearing failures in rotating machinery lead to unplanned downtime, safety risks, and millions in lost revenue annually. Traditional diagnostic methods often struggle to detect early-stage faults in harsh operating environments, limiting the effectiveness of predictive maintenance. The significance of bearings is underscored by the severe implications of their failure, highlighting the need for effective fault diagnosis. Traditional methods, which typically rely on diagnostic approaches, primarily based on manual inspections and elementary statistical methods, are increasingly being replaced by more advanced techniques, especially in the realm of machine learning. A significant stride in this field is the emergence of deep learning methodologies, as highlighted by Zhao et al. [
1]. The role of bearing fault detection and diagnosis as key aspects of predictive maintenance in rotating machinery is well established, with vibration signals serving as the main diagnostic medium due to their direct correlation with bearing health, as reported by Ma et al. [
2].
In addition to the development of fault diagnosis methods, the establishment and collection of benchmark datasets is essential. Notable fault diagnosis datasets include those from Case Western Reserve University [
3], the IEEE PHM 2012 Data Challenge [
4], the University of Cincinnati [
5], the University of Ottawa [
6], and Xi’an Jiao Tong University [
7]. These datasets, which are publicly available and considered state-of-the-art, contain a wide range of rolling bearing operation data and are described in detail in their corresponding references. For fault detection problems, a representative signal segment is often subjectively selected from the data collected for analysis, which can affect the detection performance of the methods applied. Moreover, these datasets might require preprocessing to test the effectiveness of proposed fault-type recognition methods, such as dividing the data into training and testing sets for machine learning-based methods. While these datasets are instrumental, the necessity remains for sharing and developing diverse datasets for rolling bearing fault diagnosis tasks, echoing the significance of datasets like ImageNet [
8] in the computer vision community. Deep learning-based methods for rolling bearing fault type recognition can automate feature extraction, reduction, and classification, but they often require significant manual effort in designing network architectures and adjusting parameters, which can be time-consuming and resource-intensive. Additionally, these methods typically lack interpretability and require a large number of samples for training, a challenging requirement in practical engineering applications where fault samples are scarce. Therefore, there is a compelling need for new fault-type recognition methods that are less reliant on extensive manual effort and can effectively handle limited training data while offering interpretable models for fault identification [
9]. The existing methods, despite their advancements, still necessitate considerable prior knowledge and expert experience, highlighting the need for more intelligent and flexible approaches that can adapt to a broad range of applications without heavy reliance on domain-specific expertise.
The studies by Singh and Moore (2021) [
10] and Singh and Moore (2020) [
11] on nonlinear system identification characteristic nonlinear system identification (CNSI) for local attachments provided valuable information for bearing fault diagnosis. The 2021 research demonstrated the applicability of the CNSI method for multiple attachments and interactions with higher modes, which can be crucial for accurately diagnosing bearing faults that manifest through complex dynamic behaviors. Similarly, the 2020 study’s focus on clearance nonlinearities extended the utility of CNSI for identifying the dynamics of bearing systems with nonlinear characteristics, such as those found in faulty bearings. These methodologies could improve bearing fault diagnosis by offering a more nuanced understanding of the dynamic responses of bearings under various fault conditions [
12]. In bearing fault diagnostics, developing more accurate and predictive maintenance strategies is crucial.
The performance of fault-type recognition methods often presupposes that training and test samples are independently and identically distributed. However, variations in working conditions, such as changes in characteristic frequencies and amplitudes through different rotational speeds, can introduce significant differences between training and testing data, leading to domain adaptation challenges. Transfer learning (TL) has emerged as a promising approach to address these challenges by leveraging knowledge from related domains to improve learning performance in target scenarios [
4,
12,
13]. Despite the success of TL-based methods, further research is needed to enhance their domain adaptability and recognition accuracy, especially under variable operating conditions. Furthermore, a reliance on large volumes of labeled data poses significant challenges, particularly in obtaining fault samples, which are rare and labour-intensive to label. Few-shot learning (FSL) offers a viable solution for accurate failure attribution under conditions of limited data availability, with data augmentation, data/model transfer, and metalearning being key strategies in FSL methods [
9].
Recent advancements have identified the use of multidirectional vibration data as more effective than traditional single-directional vibration signals for bearing fault diagnostics. This method enhances the accuracy and reliability of bearing failure detections, as noted by Zhang & Gu [
14]. However, a primary challenge in this area is the inherent heterogeneity of datasets, which can differ in aspects such as experimental conditions, fault severity, fault types, and setup configurations, and setup configurations, a point highlighted by Chen et al. [
15]. Conventional diagnostic models often fail due to their assumption of uniform feature distribution in training and testing data, an assumption rarely met in practical scenarios. This discrepancy often diminishes the effectiveness of traditional machine learning models in various applications, as Bhuiyan and Uddin [
16] have observed.
The shift towards multidirectional vibration data in bearing failure diagnostics marks a substantial improvement over the analysis of single-direction signals. This approach, as reiterated by Zhang and Gu [
14], significantly refines precision and reliability in the identification of bearing malfunctions. The study by Han, Xie, and Pei innovatively addresses the challenges of fault diagnosis in wind turbines through a semi-supervised adversarial learning approach, leveraging both annotated and unannotated data to enhance diagnostic accuracy amidst the scarcity of labeled data (Han, Xie, & Pei, 2023 [
17]. By integrating adversarial learning with metric learning techniques, the research not only significantly contributes to the wind energy sector by improving the reliability of fault diagnosis methods, but also presents a methodology adaptable across various domains, highlighting its broader implications for the renewable energy sector and beyond. This advancement supports the operational stability of wind turbines, furthering sustainable energy transition efforts.
Recent research by Kaya, Kuncan, and Ertunç [
18] demonstrated the effectiveness of using time–frequency images obtained from Continuous Wavelet Transform (CWT) combined with deep transfer learning DTL methods for the automatic diagnosis of bearing fault sizes. Their study utilized various pre-trained networks, such as AlexNet, GoogleNet, ResNet, VGG16, and VGG19, achieving high classification accuracy rates between 96.67% and 100%, and showcasing the potential of DTL in bearing diagnostics [
18].
The importance of deep transfer learning (DTL) in fault diagnosis is further supported by a systematic review conducted by Singh and Singh [
19], which underscored the efficiency of DTL in processing and analyzing vibration signal data for identifying anomalies in roller bearings. This review highlighted the transition towards more sophisticated AI-based diagnostic techniques, emphasizing the advantages of leveraging pre-trained models through DTL [
19].
In addition, the work of Zhang and Zhou [
20] on the integration of machine learning techniques into condition monitoring systems echoed the importance of adopting advanced algorithms for fault diagnosis. Their review encapsulated the growing trend of using deep learning models, including those enabled by deep transfer learning (DTL), to effectively predict equipment failures [
20].
The effectiveness of DTL in bearing fault diagnosis, especially in scenarios characterized by limited data availability, is also evident in research by Li and Zhang [
21],and Wang and Zhao [
22]. These studies highlighted how DTL models can be adapted from large datasets to specific diagnostic tasks, thus improving the performance of the model in detecting bearing failures [
23,
24].
Lastly, research by [
25] into the application of deep learning for fault diagnosis indicated the method’s potential to significantly improve diagnostic processes. This study emphasized the utility of pre-trained models in DTL approaches to circumvent the limitations inherent in traditional diagnostic methods [
25].
Incorporating vibration data from multiple directions in fault diagnosis provides several advantages:
By analyzing vibrations from different angles, it is possible to detect subtle fault characteristics that might be missed in a single-direction signa.
Multidirectional vibration data provide additional information on the location of the fault, allowing for more accurate identification.
Noise can significantly affect the accuracy of fault diagnosis. Using vibration data from multiple directions can help suppress noise, thus isolating fault-related signals more effectively.
Challenges persist due to the intrinsic heterogeneity of datasets, varying in experimental settings, fault severity, and types.
Traditional models often struggle in these scenarios, primarily because they are built on the assumption of consistent data distribution across training and testing phases—a rarity in real-world settings [
16,
26]. In order to address these issues, the current focus on deep transfer learning in industrial machinery fault diagnosis, particularly for bearings, represents a significant advancement. This approach uses knowledge from one domain to enhance learning in another and has shown notable effectiveness in scenarios characterized by limited labeled data, diverse operational conditions, and feature distribution discrepancies [
27,
28]. Various deep transfer learning techniques, including fine-tune-based, statistic-based, adversarial-based, and few-shot-based approaches, have demonstrated promising results in developing robust diagnostic models [
28,
29,
30]. Recent advances in the field of bearing fault diagnosis have heavily leaned on the capabilities of deep learning (DL) technologies, particularly convolutional neural networks (CNNs). Studies have utilized architectures like VGG16, VGG19, and ResNet to significantly improve fault classification accuracy by exploiting their deep feature extraction capabilities from vibration signal data. For instance, the use of CNNs for feature extraction has been pivotal in capturing subtle patterns in vibration signals that are often missed by traditional methods [
31,
32]. However, despite these advancements, the application of transfer learning to adapt these deep networks, pre-trained on generic datasets to specific diagnostic tasks in bearing fault, remains underexplored. This points towards a significant research gap where the potential of transfer learning could be harnessed to reduce dependency on large, labeled datasets specific to bearing faults. Another critical gap lies in the real-time application of these advanced models. The computational demands of architectures like ResNet and VGG19 often limit their deployment in real-time scenarios where quick fault detection is crucial to prevent machinery downtime [
29,
30]. Furthermore, most current research focuses on offline analysis, with less attention given to the challenges of integrating these models within live operational monitoring systems. There is a pressing need for developing lightweight models or optimizing existing architectures to operate efficiently in real-time conditions without compromising diagnostic accuracy.
The necessity for new work in this area stems from the evolving complexity of machinery and operational conditions under which these bearing systems operate. As industrial machinery becomes more sophisticated, the vibration patterns associated with faults become more complex, requiring more nuanced detection capabilities [
29]. Additionally, the integration of physical domain knowledge with DL models presents an underexplored area of research that could potentially lead to more robust diagnostic systems that are adaptable to varying operational contexts and can handle noisy data more effectively [
33].
Our research builds on these findings, employing the TL_ResNet-101 model in transfer learning to enhance the detection of early-stage bearing faults, extending the work of Zhao et al. [
34]. Furthermore, we address the challenge of data heterogeneity using multidirectional vibration data, following the recommendation of Bhuiyan and Uddin [
16] for adaptable models under varying operational conditions.
The availability of open-source bearing failure datasets, which include both experimentally induced faults and run-to-failure data, has been instrumental in advancing and validating these models in controlled environments. However, the scarcity of accurately labeled target fault data continues to be a significant barrier, making traditional training methods impractical for many industrial [
35,
36,
37]. Transfer learning offers a solution to this by applying knowledge from one domain to improve learning in another, thus addressing the issue of insufficient labeled data and enhancing the flexibility and applicability of diagnostic models [
38].
Despite these advances, the application of advanced, complex deep learning structures like ResNet-101 in the diagnosis of spherical bearings remains underexplored. The ResNet-101 model, known for its exceptional image recognition capabilities, can significantly improve mechanical fault diagnosis when adapted through transfer [
39,
40,
41]. This study aims to merge a multidirectional vibration data analysis, which offers a comprehensive insight into bearing states, with advanced machine learning algorithms [
42].
The primary contributions of our study are as follows:
Introducing the innovative application of ResNet-101 architecture, renowned for its efficacy in image recognition, spherical bearing fault diagnosis, and employing signal-to-image conversion techniques.
Utilizing multidirectional vibration data to enhance the detection of nuanced fault characteristics and provide additional information on fault location, thereby improving the isolation of the fault and the suppression of noise, thus isolating fault-related signals more effectively.
Enhancing accuracy in the diagnosis of early-stage faults, even under limited data availability, and enhancing adaptability to diverse operating conditions and bearing configurations.
Merging transfer learning with the ResNet-101 framework to tackle the complexities of signal processing and data uniqueness in different operational scenarios, promising a substantial improvement in the precision and efficiency of defect identification in spherical bearings, especially double row spherical roller bearings.