A Deep Transfer Learning Model for the Fault Diagnosis of Double Roller Bearing Using Scattergram Filter Bank 1

: In this study, a deep transfer learning model was developed using ResNet-101 architecture to diagnose double roller bearing defects. Vibration data were collected for three different load scenarios, including conditions without load, and for five different rotational speeds, ranging from 500 to 2500 RPM. Significantly, the speed condition of 2500 RPM has not previously been investigated, therefore offering a potential avenue for future investigations. This study offers a thorough examination of bearing conditions using multidirectional vibration data collected from accelerometers positioned in both vertical and horizontal orientations. In addition to transfer learning using ResNet-101, four additional models (VGG-16, VGG19, ResNet-18, and ResNet-50) were trained. Transfer learning using ResNet-101 consistently achieved the highest accuracy in all scenarios, with accuracy rates ranging from 90.78% to 99%. Scattergram Filter Bank 1 was used as the image input for training as a preprocessing method to enhance feature extraction. Research has effectively applied transfer learning to improve fault diagnosis accuracy, especially in limited data scenarios. This shows the capability of the method to differentiate between normal and faulty bearing conditions using signal-to-image transformation, emphasizing the potential of transfer learning to augment diagnostic performance in scenarios with limited training data.


Introduction
Bearing failures in rotating machinery lead to unplanned downtime, safety risks, and millions in lost revenue annually.Traditional diagnostic methods often struggle to detect early-stage faults in harsh operating environments, limiting the effectiveness of predictive maintenance.The significance of bearings is underscored by the severe implications of their failure, highlighting the need for effective fault diagnosis.Traditional methods, which typically rely on diagnostic approaches, primarily based on manual inspections and elementary statistical methods, are increasingly being replaced by more advanced techniques, especially in the realm of machine learning.A significant stride in this field is the emergence of deep learning methodologies, as highlighted by Zhao et al. [1].The role of bearing fault detection and diagnosis as key aspects of predictive maintenance in rotating machinery is well established, with vibration signals serving as the main diagnostic medium due to their direct correlation with bearing health, as reported by Ma et al. [2].
In addition to the development of fault diagnosis methods, the establishment and collection of benchmark datasets is essential.Notable fault diagnosis datasets include those from Case Western Reserve University [3], the IEEE PHM 2012 Data Challenge [4], the University of Cincinnati [5], the University of Ottawa [6], and Xi'an Jiao Tong University [7].These datasets, which are publicly available and considered state-of-the-art, contain a wide range of rolling bearing operation data and are described in detail in their corresponding references.For fault detection problems, a representative signal segment is often subjectively selected from the data collected for analysis, which can affect the detection performance of the methods applied.Moreover, these datasets might require preprocessing to test the effectiveness of proposed fault-type recognition methods, such as dividing the data into training and testing sets for machine learning-based methods.While these datasets are instrumental, the necessity remains for sharing and developing diverse datasets for rolling bearing fault diagnosis tasks, echoing the significance of datasets like ImageNet [8] in the computer vision community.Deep learning-based methods for rolling bearing fault type recognition can automate feature extraction, reduction, and classification, but they often require significant manual effort in designing network architectures and adjusting parameters, which can be time-consuming and resource-intensive.Additionally, these methods typically lack interpretability and require a large number of samples for training, a challenging requirement in practical engineering applications where fault samples are scarce.Therefore, there is a compelling need for new fault-type recognition methods that are less reliant on extensive manual effort and can effectively handle limited training data while offering interpretable models for fault identification [9].The existing methods, despite their advancements, still necessitate considerable prior knowledge and expert experience, highlighting the need for more intelligent and flexible approaches that can adapt to a broad range of applications without heavy reliance on domain-specific expertise.
The studies by Singh and Moore (2021) [10] and Singh and Moore (2020) [11] on nonlinear system identification characteristic nonlinear system identification (CNSI) for local attachments provided valuable information for bearing fault diagnosis.The 2021 research demonstrated the applicability of the CNSI method for multiple attachments and interactions with higher modes, which can be crucial for accurately diagnosing bearing faults that manifest through complex dynamic behaviors.Similarly, the 2020 study's focus on clearance nonlinearities extended the utility of CNSI for identifying the dynamics of bearing systems with nonlinear characteristics, such as those found in faulty bearings.These methodologies could improve bearing fault diagnosis by offering a more nuanced understanding of the dynamic responses of bearings under various fault conditions [12].In bearing fault diagnostics, developing more accurate and predictive maintenance strategies is crucial.
The performance of fault-type recognition methods often presupposes that training and test samples are independently and identically distributed.However, variations in working conditions, such as changes in characteristic frequencies and amplitudes through different rotational speeds, can introduce significant differences between training and testing data, leading to domain adaptation challenges.Transfer learning (TL) has emerged as a promising approach to address these challenges by leveraging knowledge from related domains to improve learning performance in target scenarios [4,12,13].Despite the success of TL-based methods, further research is needed to enhance their domain adaptability and recognition accuracy, especially under variable operating conditions.Furthermore, a reliance on large volumes of labeled data poses significant challenges, particularly in obtaining fault samples, which are rare and labour-intensive to label.Few-shot learning (FSL) offers a viable solution for accurate failure attribution under conditions of limited data availability, with data augmentation, data/model transfer, and metalearning being key strategies in FSL methods [9].
Recent advancements have identified the use of multidirectional vibration data as more effective than traditional single-directional vibration signals for bearing fault diagnostics.This method enhances the accuracy and reliability of bearing failure detections, as noted by Zhang & Gu [14].However, a primary challenge in this area is the inherent heterogeneity of datasets, which can differ in aspects such as experimental conditions, fault severity, fault types, and setup configurations, and setup configurations, a point highlighted by Chen et al. [15].Conventional diagnostic models often fail due to their assumption of uniform feature distribution in training and testing data, an assumption rarely met in practical scenarios.This discrepancy often diminishes the effectiveness of traditional machine learning models in various applications, as Bhuiyan and Uddin [16] have observed.
The shift towards multidirectional vibration data in bearing failure diagnostics marks a substantial improvement over the analysis of single-direction signals.This approach, as reiterated by Zhang and Gu [14], significantly refines precision and reliability in the identification of bearing malfunctions.The study by Han, Xie, and Pei innovatively addresses the challenges of fault diagnosis in wind turbines through a semi-supervised adversarial learning approach, leveraging both annotated and unannotated data to enhance diagnostic accuracy amidst the scarcity of labeled data (Han, Xie, & Pei, 2023 [17].By integrating adversarial learning with metric learning techniques, the research not only significantly contributes to the wind energy sector by improving the reliability of fault diagnosis methods, but also presents a methodology adaptable across various domains, highlighting its broader implications for the renewable energy sector and beyond.This advancement supports the operational stability of wind turbines, furthering sustainable energy transition efforts. Recent research by Kaya, Kuncan, and Ertunç [18] demonstrated the effectiveness of using time-frequency images obtained from Continuous Wavelet Transform (CWT) combined with deep transfer learning DTL methods for the automatic diagnosis of bearing fault sizes.Their study utilized various pre-trained networks, such as AlexNet, GoogleNet, ResNet, VGG16, and VGG19, achieving high classification accuracy rates between 96.67% and 100%, and showcasing the potential of DTL in bearing diagnostics [18].
The importance of deep transfer learning (DTL) in fault diagnosis is further supported by a systematic review conducted by Singh and Singh [19], which underscored the efficiency of DTL in processing and analyzing vibration signal data for identifying anomalies in roller bearings.This review highlighted the transition towards more sophisticated AI-based diagnostic techniques, emphasizing the advantages of leveraging pre-trained models through DTL [19].
In addition, the work of Zhang and Zhou [20] on the integration of machine learning techniques into condition monitoring systems echoed the importance of adopting advanced algorithms for fault diagnosis.Their review encapsulated the growing trend of using deep learning models, including those enabled by deep transfer learning (DTL), to effectively predict equipment failures [20].
The effectiveness of DTL in bearing fault diagnosis, especially in scenarios characterized by limited data availability, is also evident in research by Li and Zhang [21],and Wang and Zhao [22].These studies highlighted how DTL models can be adapted from large datasets to specific diagnostic tasks, thus improving the performance of the model in detecting bearing failures [23,24].
Lastly, research by [25] into the application of deep learning for fault diagnosis indicated the method's potential to significantly improve diagnostic processes.This study emphasized the utility of pre-trained models in DTL approaches to circumvent the limitations inherent in traditional diagnostic methods [25].
Incorporating vibration data from multiple directions in fault diagnosis provides several advantages:

1.
By analyzing vibrations from different angles, it is possible to detect subtle fault characteristics that might be missed in a single-direction signa.

2.
Multidirectional vibration data provide additional information on the location of the fault, allowing for more accurate identification.

3.
Noise can significantly affect the accuracy of fault diagnosis.Using vibration data from multiple directions can help suppress noise, thus isolating fault-related signals more effectively.
Challenges persist due to the intrinsic heterogeneity of datasets, varying in experimental settings, fault severity, and types.
Traditional models often struggle in these scenarios, primarily because they are built on the assumption of consistent data distribution across training and testing phases-a rarity in real-world settings [16,26].In order to address these issues, the current focus on deep transfer learning in industrial machinery fault diagnosis, particularly for bearings, represents a significant advancement.This approach uses knowledge from one domain to enhance learning in another and has shown notable effectiveness in scenarios characterized by limited labeled data, diverse operational conditions, and feature distribution discrepancies [27,28].Various deep transfer learning techniques, including fine-tune-based, statistic-based, adversarial-based, and few-shot-based approaches, have demonstrated promising results in developing robust diagnostic models [28][29][30].Recent advances in the field of bearing fault diagnosis have heavily leaned on the capabilities of deep learning (DL) technologies, particularly convolutional neural networks (CNNs).Studies have utilized architectures like VGG16, VGG19, and ResNet to significantly improve fault classification accuracy by exploiting their deep feature extraction capabilities from vibration signal data.For instance, the use of CNNs for feature extraction has been pivotal in capturing subtle patterns in vibration signals that are often missed by traditional methods [31,32].However, despite these advancements, the application of transfer learning to adapt these deep networks, pre-trained on generic datasets to specific diagnostic tasks in bearing fault, remains underexplored.This points towards a significant research gap where the potential of transfer learning could be harnessed to reduce dependency on large, labeled datasets specific to bearing faults.Another critical gap lies in the real-time application of these advanced models.The computational demands of architectures like ResNet and VGG19 often limit their deployment in real-time scenarios where quick fault detection is crucial to prevent machinery downtime [29,30].Furthermore, most current research focuses on offline analysis, with less attention given to the challenges of integrating these models within live operational monitoring systems.There is a pressing need for developing lightweight models or optimizing existing architectures to operate efficiently in real-time conditions without compromising diagnostic accuracy.
The necessity for new work in this area stems from the evolving complexity of machinery and operational conditions under which these bearing systems operate.As industrial machinery becomes more sophisticated, the vibration patterns associated with faults become more complex, requiring more nuanced detection capabilities [29].Additionally, the integration of physical domain knowledge with DL models presents an underexplored area of research that could potentially lead to more robust diagnostic systems that are adaptable to varying operational contexts and can handle noisy data more effectively [33].
Our research builds on these findings, employing the TL_ResNet-101 model in transfer learning to enhance the detection of early-stage bearing faults, extending the work of Zhao et al. [34].Furthermore, we address the challenge of data heterogeneity using multidirectional vibration data, following the recommendation of Bhuiyan and Uddin [16] for adaptable models under varying operational conditions.
The availability of open-source bearing failure datasets, which include both experimentally induced faults and run-to-failure data, has been instrumental in advancing and validating these models in controlled environments.However, the scarcity of accurately labeled target fault data continues to be a significant barrier, making traditional training methods impractical for many industrial [35][36][37].Transfer learning offers a solution to this by applying knowledge from one domain to improve learning in another, thus addressing the issue of insufficient labeled data and enhancing the flexibility and applicability of diagnostic models [38].
Despite these advances, the application of advanced, complex deep learning structures like ResNet-101 in the diagnosis of spherical bearings remains underexplored.The ResNet-101 model, known for its exceptional image recognition capabilities, can significantly improve mechanical fault diagnosis when adapted through transfer [39][40][41].This study aims to merge a multidirectional vibration data analysis, which offers a comprehensive insight into bearing states, with advanced machine learning algorithms [42].
The primary contributions of our study are as follows: 1.
Introducing the innovative application of ResNet-101 architecture, renowned for its efficacy in image recognition, spherical bearing fault diagnosis, and employing signal-to-image conversion techniques.

2.
Utilizing multidirectional vibration data to enhance the detection of nuanced fault characteristics and provide additional information on fault location, thereby improving the isolation of the fault and the suppression of noise, thus isolating fault-related signals more effectively.

3.
Enhancing accuracy in the diagnosis of early-stage faults, even under limited data availability, and enhancing adaptability to diverse operating conditions and bearing configurations.

4.
Merging transfer learning with the ResNet-101 framework to tackle the complexities of signal processing and data uniqueness in different operational scenarios, promising a substantial improvement in the precision and efficiency of defect identification in spherical bearings, especially double row spherical roller bearings.

Experimental Work
The experimental setup is illustrated in Figure 1.The shaft was driven by a variablespeed motor controlled by an AC driver.Two bearing housings were used to support the shaft.Bearing housing 2 on the right side was healthy.On the contrary, bearing housing 1 on the left side was replaced with different defective bearings.The loading system consisted of a hydraulic pump attached to the frame, which applied pressure to the load sensor on the bearing frame.These bearings included a healthy bearing (HB), a bearing with a defect on a spherical roller (RF), a bearing with an outer race defect (ORF), and a bearing with an inner race defect (IRF).Two accelerometers were installed on the bearing housing in two vertical and horizontal directions to collect vibration signals in the vertical and horizontal directions.A tachometer was used to measure the shaft rotation speed to validate the proposed method.The signals were collected with an SKF Microlog CMX80 instrument (Made by SKF, SKF is a Swedish company headquartered in Gothenburg, Sweden.) and sampled using SKF Report and analysis software.
ResNet-101 model, known for its exceptional image recognition capabilities, can significantly improve mechanical fault diagnosis when adapted through transfer [39][40][41].This study aims to merge a multidirectional vibration data analysis, which offers a comprehensive insight into bearing states, with advanced machine learning algorithms [42].
The primary contributions of our study are as follows: 1. Introducing the innovative application of ResNet-101 architecture, renowned for its efficacy in image recognition, spherical bearing fault diagnosis, and employing signal-to-image conversion techniques.2. Utilizing multidirectional vibration data to enhance the detection of nuanced fault characteristics and provide additional information on fault location, thereby improving the isolation of the fault and the suppression of noise, thus isolating faultrelated signals more effectively.3. Enhancing accuracy in the diagnosis of early-stage faults, even under limited data availability, and enhancing adaptability to diverse operating conditions and bearing configurations.4. Merging transfer learning with the ResNet-101 framework to tackle the complexities of signal processing and data uniqueness in different operational scenarios, promising a substantial improvement in the precision and efficiency of defect identification in spherical bearings, especially double row spherical roller bearings.

Experimental Work
The experimental setup is illustrated in Figure 1.The shaft was driven by a variable-speed motor controlled by an AC driver.Two bearing housings were used to support the shaft.Bearing housing 2 on the right side was healthy.On the contrary, bearing housing 1 on the left side was replaced with different defective bearings.The loading system consisted of a hydraulic pump attached to the frame, which applied pressure to the load sensor on the bearing frame.These bearings included a healthy bearing (HB), a bearing with a defect on a spherical roller (RF), a bearing with an outer race defect (ORF), and a bearing with an inner race defect (IRF).Two accelerometers were installed on the bearing housing in two vertical and horizontal directions to collect vibration signals in the vertical and horizontal directions.A tachometer was used to measure the shaft rotation speed to validate the proposed method.The signals were collected with an SKF Microlog CMX80 instrument (Made by SKF, SKF is a Swedish company headquartered in Gothenburg, Sweden.) and sampled using SKF Report and analysis software.

Spherical Rolling Bearing SKF 22209 EK
SKF 22209EK is a high-performance spherical roller bearing from the Explorer series, designed for high-speed applications with high radial loads, axial loads, and tilting moments.It is suitable for the mining, construction, agriculture, and transportation in-

Spherical Rolling Bearing SKF 22209 EK
SKF 22209EK is a high-performance spherical roller bearing from the Explorer series, designed for high-speed applications with high radial loads, axial loads, and tilting moments.It is suitable for the mining, construction, agriculture, and transportation industries.The bearing had a tapered bore with a 1:12 taper for easy installation and removal.It had two rows of symmetrical rollers with a common spherical raceway in the outer ring, compensating for misalignments and shaft deflections.It can operate at a maximum speed of 8500 RPM and a maximum operating temperature of 200 • C. Its high radial internal clearance reduced the risk of thermal expansion damage.The inner raceway was mounted on the shaft, whereas the outer raceway was in the housing.The bearing had 34 rollers arranged in two rows, with a common spherical raceway in the outer ring.This model has not previously been used for bearing fault diagnosis, providing an opportunity to explore its characteristics and suitability for fault detection, and enriching current research in this area.The dimensions of the SKF 22209EK spherical roller bearing are illustrated in Figure 2b and in Table 1   The SKF22209 EK bearing was chosen specifically for its design flexibility, allowing for the disassembly and reassembly of parts without compromising its operational integrity.This design facilitates the controlled introduction of defects such as wear, pits, or cracks into the roller, outer race, and inner race for experimental purposes.
Concerns are addressed as follows: 1. Disassembly and reassembly process: The bearing design allows for the removal and replacement of rollers, especially by extracting two rollers along the same line.This method also applies to other defective parts.The process was conducted with precision tools and under strict guidelines to ensure accurate positioning and orientation of components upon reassembly.2. Quality Assurance of Reassembly: After introducing the defects and reassembling the bearing, several quality checks were performed:  The SKF22209 EK bearing was chosen specifically for its design flexibility, allowing for the disassembly and reassembly of parts without compromising its operational integrity.This design facilitates the controlled introduction of defects such as wear, pits, or cracks into the roller, outer race, and inner race for experimental purposes.
Concerns are addressed as follows: 1.
Disassembly and reassembly process: The bearing design allows for the removal and replacement of rollers, especially by extracting two rollers along the same line.This method also applies to other defective parts.The process was conducted with precision tools and under strict guidelines to ensure accurate positioning and orientation of components upon reassembly.

2.
Quality Assurance of Reassembly: After introducing the defects and reassembling the bearing, several quality checks were performed: • Dimensional Inspection: We Ensured all parts were fitted correctly and there were no unintended gaps or misalignments.

•
Rotational Testing: The bearing was subjected to low-speed rotation tests to detect any anomalies in smoothness or noise that could indicate improper assembly.
Vibration analysis: Before and after introducing defects, a vibration analysis was conducted.The comparison helped in confirming that any new vibration signatures were solely due to the introduced defects and not from assembly issues.

Artificial Defects Using Electrical Discharge Machining (EDM)
Artificial defects were engineered into the bearings using electrical discharge machining (EDM) to simulate typical faults in bearing component.This process ensured the controlled introduction of damage in various locations, such as the inner and outer races and the rollers, which are common sites for the origin of the defect.These artificially induced defects represented the initial signs of wear and tear, which can lead to more severe forms of degradation, such as spalling, cage collapse, and, ultimately, complete bearing failure.The precise nature of these defects is illustrated in Figure 3.This methodical induction of faults allowed for the creation of a realistic test environment to effectively evaluate diagnostic techniques and predictive maintenance strategies.


Dimensional Inspection: We Ensured all parts were fitted correctly and there were no unintended gaps or misalignments. Rotational Testing: The bearing was subjected to low-speed rotation tests to detect any anomalies in smoothness or noise that could indicate improper assembly.
Vibration analysis: Before and after introducing defects, a vibration analysis was conducted.The comparison helped in confirming that any new vibration signatures were solely due to the introduced defects and not from assembly issues.

Artificial Defects Using Electrical Discharge Machining (EDM)
Artificial defects were engineered into the bearings using electrical discharge machining (EDM) to simulate typical faults in bearing component.This process ensured the controlled introduction of damage in various locations, such as the inner and outer races and the rollers, which are common sites for the origin of the defect.These artificially induced defects represented the initial signs of wear and tear, which can lead to more severe forms of degradation, such as spalling, cage collapse, and, ultimately, complete bearing failure.The precise nature of these defects is illustrated in Figure 3.This methodical induction of faults allowed for the creation of a realistic test environment to effectively evaluate diagnostic techniques and predictive maintenance strategies.Electrical discharge machines (EDMs) can create artificial bearing defects, allowing for controlled testing and analysis of their performance under different conditions.EDMs can simulate common types of damage in real-world bearing applications, such as cracks or wear patterns, enabling a more accurate assessment of bearing durability and reliability.An artificial defect was created in the inner, outer, and rolling elements of the spherical bearing using an EDM, as mentioned in Table 2, for use in separate experiments and as shown in Figure 4.  Electrical discharge machines (EDMs) can create artificial bearing defects, allowing for controlled testing and analysis of their performance under different conditions.EDMs can simulate common types of damage in real-world bearing applications, such as cracks or wear patterns, enabling a more accurate assessment of bearing durability and reliability.An artificial defect was created in the inner, outer, and rolling elements of the spherical bearing using an EDM, as mentioned in Table 2, for use in separate experiments and as shown in Figure 4.

The Alignment Procedure Using the FixturLaser XA System
The FixturLaser XA system is commonly used to assess and correct misalignment.This method allows for the precise alignment of motor shafts within designated tolerances by adjusting the vertical and horizontal positions of the front and back pairs of the motor foot.The alignment process was performed using a tolerance table in the device manual.The Fixturlaser XA system uses two measurement units placed on each shaft, using the fittings provided by the system as illustrated in Figure 8.The system determined the relative distance between the two shafts in the two planes by rotating the shafts to various measuring points using bearings.The system received the input of the measurements taken from the bearing to the connection and motor feet.The Fixturlaser XA display provided precise data on the current alignment status and the motor position.

The Alignment Procedure Using the FixturLaser XA System
The FixturLaser XA system is commonly used to assess and correct misalignment.This method allows for the precise alignment of motor shafts within designated tolerances by adjusting the vertical and horizontal positions of the front and back pairs of the motor foot.The alignment process was performed using a tolerance table in the device manual.The Fixturlaser XA system uses two measurement units placed on each shaft, using the fittings provided by the system as illustrated in Figure 8.The system determined the relative distance between the two shafts in the two planes by rotating the shafts to various measuring points using bearings.The system received the input of the measurements taken from the bearing to the connection and motor feet.The Fixturlaser XA display provided precise data on the current alignment status and the motor position.

Hydraulic Load System
The loading system consisted of a hydraulic pump attached to a load frame structure and then connected to a hydraulic actuator.The hydraulic cylinder, which was the final component of this system, was used to apply pressure to the HBM C2 load sensor.This load sensor was designed to measure the force exerted by the hydraulic cylinder

Hydraulic Load System
The loading system consisted of a hydraulic pump attached to a load frame structure and then connected to a hydraulic actuator.The hydraulic cylinder, which was the final component of this system, was used to apply pressure to the HBM C2 load sensor.This load sensor was designed to measure the force exerted by the hydraulic cylinder and provide feedback to the display, allowing the user to keep track of the amount of force applied, as illustrated in Figures 9 and 10.
Figure 8.The alignment procedure using the Fixturlaser XA system.

Hydraulic Load System
The loading system consisted of a hydraulic pump attached to a load frame structure and then connected to a hydraulic actuator.The hydraulic cylinder, which was the final component of this system, was used to apply pressure to the HBM C2 load sensor.This load sensor was designed to measure the force exerted by the hydraulic cylinder and provide feedback to the display, allowing the user to keep track of the amount of force applied, as illustrated in Figures 9 and 10.

Vibration Measurement Device
The test rig was driven over a DC motor through flexible coupling.This setup allowed for efficient and reliable monitoring and control of motor performance.The power and speed controller, shown in Figure 10, effectively regulated the motor's operation.Additionally, using the non-contact SKF optical tachometer TMOT6, shown in Figure 11, provided accurate measurements of the motor's rotational speed.The CMSS 2200 accelerometer, which has a sensitivity of 100 mV/g, was securely attached to the bearing housing using a magnetic component, making it suitable for various applications.

Vibration Measurement Device
The test rig was driven over a DC motor through flexible coupling.This setup allowed for efficient and reliable monitoring and control of motor performance.The power and speed controller, shown in Figure 10, effectively regulated the motor's operation.Additionally, using the non-contact SKF optical tachometer TMOT6, shown in Figure 11, provided accurate measurements of the motor's rotational speed.The CMSS 2200 accelerometer, which has a sensitivity of 100 mV/g, was securely attached to the bearing housing using a magnetic component, making it suitable for various applications.
SKF Microlog CMX80, shown in Figure 12, is a valuable tool for machine health monitoring and predictive maintenance, particularly for research purposes.The comparative efficiency, accuracy, and reliability of SKF Microlog CMX80 in modern industrial environments underscore its advantages over other methods.Furthermore, the 100 mV/g sensitivity of the CMSS 2200 accelerometer enables it to measure vibrations in multiple applications, making it a versatile tool for detecting and monitoring vibrations in various systems.Combining the vibration measurement device and the power and speed controller provided an efficient and reliable means of monitoring and controlling motor performance.
er and speed controller, shown in Figure 10, effectively regulated the motor's opera Additionally, using the non-contact SKF optical tachometer TMOT6, shown in Figur provided accurate measurements of the motor's rotational speed.The CMSS 2200 a erometer, which has a sensitivity of 100 mV/g, was securely attached to the bea housing using a magnetic component, making it suitable for various applications.SKF Microlog CMX80, shown in Figure 12, is a valuable tool for machine h monitoring and predictive maintenance, particularly for research purposes.The parative efficiency, accuracy, and reliability of SKF Microlog CMX80 in modern in trial environments underscore its advantages over other methods.Furthermore, th mV/g sensitivity of the CMSS 2200 accelerometer enables it to measure vibratio multiple applications, making it a versatile tool for detecting and monitoring vibra in various systems.Combining the vibration measurement device and the power speed controller provided an efficient and reliable means of monitoring and contro motor performance.

Vibration Measurement Procedure
Extensive standardizations of the experimental apparatus were initiated befor ginning any experimental pursuit.The machine operation was started for 15 min, du which the healthy test bearing was secured on the shaft, from which the vibration nals were collected.The electric motor was activated, and a speed controller was ployed to regulate the speed of the shaft.A CMSS 2200 piezoelectric sensor or a erometer was used to capture the vibration signals from the bearing housing conne to a magnetic base, as shown in Figure 13.Accelerometers are designed to measure chanical motion and convert these data into analog signals, which are subsequ transmitted to a frequency analyzer.The FFT analyzer digitized the recorded signal the computer received and stored the discrete signal for subsequent analyses.

Vibration Measurement Procedure
Extensive standardizations of the experimental apparatus were initiated before beginning any experimental pursuit.The machine operation was started for 15 min, during which the healthy test bearing was secured on the shaft, from which the vibration signals were collected.The electric motor was activated, and a speed controller was employed to regulate the speed of the shaft.A CMSS 2200 piezoelectric sensor or accelerometer was used to capture the vibration signals from the bearing housing connected to a magnetic base, as shown in Figure 13.Accelerometers are designed to measure mechanical motion and convert these data into analog signals, which are subsequently transmitted to a frequency analyzer.The FFT analyzer digitized the recorded signal, and the computer received and stored the discrete signal for subsequent analyses.
ployed to regulate the speed of the shaft.A CMSS 2200 piezoelectric sensor or accelerometer was used to capture the vibration signals from the bearing housing connected to a magnetic base, as shown in Figure 13.Accelerometers are designed to measure mechanical motion and convert these data into analog signals, which are subsequently transmitted to a frequency analyzer.The FFT analyzer digitized the recorded signal, and the computer received and stored the discrete signal for subsequent analyses.A new study conducted by [45].usingcylindrical roller bearings highlighted the importance of sensor placement in accurately identifying bearing faults.The study emphasized that the same fault can cause variations in recorded signals at different places in the bearing housing, owing to the nature of vibration transmission.This fluctuation is explained by the uneven thickness and weakening of the vibration signal from the source, which results in variable vibration signals due to different levels of structural rigidity [15].Research by [45] highlighted the significant influence of load and speed on the creation of defects.This study seeks to deepen our understanding of these effects to theoretically guide sensor placement and enhance the precision of bearing defect diagnosis precision under varying operational conditions.It is vital to understand the generation of structural vibration within the bearing housing and the transmission of fault ex- A new study conducted by [45].usingcylindrical roller bearings highlighted the importance of sensor placement in accurately identifying bearing faults.The study emphasized that the same fault can cause variations in recorded signals at different places in the bearing housing, owing to the nature of vibration transmission.This fluctuation is explained by the uneven thickness and weakening of the vibration signal from the source, which results in variable vibration signals due to different levels of structural rigidity [15].Research by [45] highlighted the significant influence of load and speed on the creation of defects.This study seeks to deepen our understanding of these effects to theoretically guide sensor placement and enhance the precision of bearing defect diagnosis precision under varying operational conditions.It is vital to understand the generation of structural vibration within the bearing housing and the transmission of fault excitation to determine the optimal sensor installation sites.The schematic in Figure 14 provides a structured approach to conducting experiments, with each path representing a different experimental condition.This allowed for extensive data collection and subsequent analyses.The results obtained from this procedure would be beneficial for predictive maintenance and bearing design as they would provide information on how different defects and loads affect bearing lifespan and performance.
Vibration 2024, 7, FOR PEER REVIEW 13 citation to determine the optimal sensor installation sites.The schematic in Figure 14 provides a structured approach to conducting experiments, with each path representing a different experimental condition.This allowed for extensive data collection and subsequent analyses.The results obtained from this procedure would be beneficial for predictive maintenance and bearing design as they would provide information on how different defects and loads affect bearing lifespan and performance.
For loaded and unloaded conditions, the bearings were tested at various speeds, ranging from 500 to 2500 RPM, broken down into three ranges: 500, 1000, 1500, 2000, and 2500 RPM.The load was measured instantaneously using a load sensor connected to the HBM Spider 8 and a computer with Catman v5 software to apply the exact load to the bearing housing.The following procedures were used to collect healthy bearings: 1. To begin collecting data, a healthy SFK 22209EK spherical roller bearing was installed into the bearing housing of the test rig.2. The level of the rig, the balance of the machine components, and electrical connections were tested.3. The accelerometer was connected to the SKF Microlog Analyser CMX80 using a cable after being magnetically attached to the bearing housing, as shown in Figure 15.For loaded and unloaded conditions, the bearings were tested at various speeds, ranging from 500 to 2500 RPM, broken down into three ranges: 500, 1000, 1500, 2000, and 2500 RPM.The load was measured instantaneously using a load sensor connected to the HBM Spider 8 and a computer with Catman v5 software to apply the exact load to the bearing housing.
The following procedures were used to collect healthy bearings: 1.
To begin collecting data, a healthy SFK 22209EK spherical roller bearing was installed into the bearing housing of the test rig.

2.
The level of the rig, the balance of the machine components, and electrical connections were tested.

3.
The accelerometer was connected to the SKF Microlog Analyser CMX80 using a cable after being magnetically attached to the bearing housing, as shown in Figure 15. 4.
The SKF Microlog Analyser and accelerometer were used in both directions (vertical and horizontal) after 20 min of operation of the healthy bearing, with five rotation speeds on the test rig.6.
The acceleration vibration signature was captured in the time domain in a frequency range of 10,000 HZ, and the frequency sampling was 25,600 HZ. 7.
Defective bearing parts, with an inner race, an outer race, and a spherical roller, were replaced for both loaded and unloaded conditions, and the bearings were tested at various speeds ranging from 500 to 2500 RPM. 8.
The hydraulic pump applied pressure to the load sensor during load scenarios.The sensor, connected to a data logger, precisely sent the load values to the Catman V5 software until the load reached the defined values of 500N and 1000N.9.
The operation was carried out at different rotational speeds, ranging from 500 to 2500 RPM, for each load and unload situation.This procedure was implemented for every faulty component and undamaged bearing, as illustrated in Figure 14.10.Finally, vibration signatures were collected, as shown in Figure 15, and the data were transferred to a computer for data analysis, where the Analysis and Report Manager (ARM) version 2.4 software was installed.
Vibration 2024, 7, FOR PEER REVIEW 14 10.Finally, vibration signatures were collected, as shown in Figure 15, and the data were transferred to a computer for data analysis, where the Analysis and Report Manager (ARM) version 2.4 software was installed.

Data Preprocessing
Once the raw data were collected, they underwent a preprocessing stage, where they were first converted into a time-frequency representation using a scalogram filter bank.This conversion is crucial for capturing the nonlinear and non-stationary nature of bearing signals.Data are categorized into a coding scheme in Table 3, where each combination of load and rotation speed is represented as follows.

Transfer Learning Model Data Preprocessing
Once the raw data were collected, they underwent a preprocessing stage, where they were first converted into a time-frequency representation using a scalogram filter bank.This conversion is crucial for capturing the nonlinear and non-stationary nature of bearing signals.Data are categorized into a coding scheme in Table 3, where each combination of load and rotation speed is represented as follows.
The method was developed to convert vibration signals into images for processing using scalogram filter bank 1.The processed signals were then segmented according to the rotational speed of the bearings.This segmentation ensured that each image generated from the scalogram corresponded to a full bearing rotation, which was essential for capturing the complete behavior of the bearing within each operational cycle (Table 4), as illustrated in Figure 16.The Scattergram Filter Bank 1 image was generated using MATLAB's scattergram function, specifically designed for wavelet-scattering representations.The function was invoked as follows: scattergram (sf, U, filterBank, 1), where sf represents the wavelet scattering object, and U denotes the cell array of wavelet scattering coefficients [46].
In the context of diagnosing and detecting faults in rolling element bearings, labels play a crucial role in identifying the type and size of defects present within bearing components.Each label name corresponds to a specific defect type and size, facilitating precise categorization and analysis. The label "HB" stands for healthy bearing, indicating a bearing that is free from defects and operates under normal conditions. Outer race defect (ORF): The label "ORF", followed by a numerical value, denotes a defect on the outer race of the bearing, with the number indicating the diameter of the defect in millimeters.For instance, "ORF0.5"signifies a defect on the outer race with a diameter of 0.5 mm, "ORF1" for a 1 mm defect, and "ORF2" for a 2 mm defect.


Inner race defect (IRF): Similarly, "IRF" labels are used to denote defects on the inner race of the bearing."IRF0.5"refers to a defect with a 0.5 mm diameter, "IRF1" to a 1 mm defect, and "IRF2" to a 2 mm defect. Spherical element defect (RF): The label "RF" is used for defects on the spherical elements (rollers) of the bearing."RF0.5"indicates a defect of 0.5 mm diameter, "RF1" for a 1 mm diameter defect, and "RF2" for a 2 mm diameter defect.
(a) The segmented scalograms were adjusted in size or padded to match the input dimensions specified by ResNet-101, VGG16, VGG19, ResNet-18, and ResNet-50 architectures.The standard input size for the VGG16 and VGG19 models was 224 × 224 pixels, which was applied to DenseNet-201.On the contrary, TL_ResNet-101 often required an input dimension of 256 × 256 pixels.Resizing or padding is essential to guarantee that the images meet each network's specific input layer criteria.Furthermore, the pixel values of the images were normalized across all models to standardize the scale of the input data.This is advantageous for the learning process because it guarantees uniformity.Subsequently, the prepared photos were partitioned into training, validation, and testing sets, a customary machine learning approach to evaluate the model's performance and ability to apply knowledge to unfamiliar data.The research utilized pre-trained models such as ResNet-101, VGG16, VGG19, ResNet-18, and ResNet-50.The upper layers of these models were eliminated to customize them for transfer learning, making them suitable for the precise classification of bearing defects.In the transfer learning configuration, additional top layers were incorporated into each model to classify bearing defects.In the early layers of TL_ResNet-101, four additional models (VGG-16, VGG19, ResNet-18, and ResNet-50) were kept fixed to retain the acquired features from the pretraining stage, 2.8.2 Model Training and Evaluation.

The total number of samples was 32,768
To calculate the number of sampling points for one cycle N, the formula is as follows: where fs denotes the sampling frequency (Hz), n is the bearing speed (in RPM), The time is 1.28 s, The sampling frequency (in Hz), totaling 25,600.
The Scattergram Filter Bank 1 image was generated using MATLAB's scattergram function, specifically designed for wavelet-scattering representations.The function was invoked as follows: scattergram (sf, U, filterBank, 1), where sf represents the wavelet scattering object, and U denotes the cell array of wavelet scattering coefficients [46].
In the context of diagnosing and detecting faults in rolling element bearings, labels play a crucial role in identifying the type and size of defects present within bearing compo-nents.Each label name corresponds to a specific defect type and size, facilitating precise categorization and analysis.
• The label "HB" stands for healthy bearing, indicating a bearing that is free from defects and operates under normal conditions.

•
Outer race defect (ORF): The label "ORF", followed by a numerical value, denotes a defect on the outer race of the bearing, with the number indicating the diameter of the defect in millimeters.For instance, "ORF0.5"signifies a defect on the outer race with a diameter of 0.5 mm, "ORF1" for a 1 mm defect, and "ORF2" for a 2 mm defect.

•
Inner race defect (IRF): Similarly, "IRF" labels are used to denote defects on the inner race of the bearing."IRF0.5"refers to a defect with a 0.5 mm diameter, "IRF1" to a 1 mm defect, and "IRF2" to a 2 mm defect.

•
Spherical element defect (RF): The label "RF" is used for defects on the spherical elements (rollers) of the bearing."RF0.5"indicates a defect of 0.5 mm diameter, "RF1" for a 1 mm diameter defect, and "RF2" for a 2 mm diameter defect.
The segmented scalograms were adjusted in size or padded to match the input dimensions specified by ResNet-101, VGG16, VGG19, ResNet-18, and ResNet-50 architectures.The standard input size for the VGG16 and VGG19 models was 224 × 224 pixels, which was applied to DenseNet-201.On the contrary, TL_ResNet-101 often required an input dimension of 256 × 256 pixels.Resizing or padding is essential to guarantee that the images meet each network's specific input layer criteria.Furthermore, the pixel values of the images were normalized across all models to standardize the scale of the input data.This is advantageous for the learning process because it guarantees uniformity.Subsequently, the prepared photos were partitioned into training, validation, and testing sets, a customary machine learning approach to evaluate the model's performance and ability to apply knowledge to unfamiliar data.The research utilized pre-trained models such as ResNet-101, VGG16, VGG19, ResNet-18, and ResNet-50.The upper layers of these models were eliminated to customize them for transfer learning, making them suitable for the precise classification of bearing defects.In the transfer learning configuration, additional top layers were incorporated into each model to classify bearing defects.In the early layers of TL_ResNet-101, four additional models (VGG-16, VGG19, ResNet-18, and ResNet-50) were kept fixed to retain the acquired features from the pre-training stage, 2.8.2 Model Training and Evaluation.
The model was compiled using an appropriate optimizer loss function and metrics tailored to the classification task.The system was then trained using the training and validation sets to monitor performance and adjust as needed.The model was trained using MATLAB R2023, with the computer specifications listed in Table 5.Finally, the performance of the model was evaluated using a test set.Relevant metrics were calculated to quantify the model's accuracy, precision, recall, and F1 score, among others, to fully understand its diagnostic capabilities, as shown in Formulas ( 2 where TP is the true positive, TN is the true negative, FP is the false positive, and FN is the false negative. The model evaluated the effectiveness using the IMS public data set (Intelligent Maintenance Systems), and the data for the Intelligent Maintenance Systems (IMS) study were generated using a test rig with four bearings installed on a shaft.Lee et al. (2007) conducted this study previously.This study used a test apparatus equipped with four Rexnord ZA-2115 double row bearings attached to a shaft [35].The bearings were greased under pressure and the rotational speed was maintained at 2000 RPM.High-sensitivity quartz ICP accelerometers (model PCB 353B33) were placed on the bearing housing to collect data.The IMS data set was made up of three separate data sets that reflect unique test-to-failure studies.Each data set consisted of 20,480 data points.Test 1 revealed the presence of inner race faults in bearing 3 and roller element defects in bearing 4. Bearings 1 and 3 had defects in the outer race, as shown in Figure 17c-e.Vibration data were acquired at 20 min intervals using a National Instruments DAQCaRFTM-6062E instrument.Data were collected formally using the National Instruments LabVIEW version 8.5 software [35].Figure 17b illustrates the Rexnord ZA-2115 double row bearings on a single shaft.The bearings consisted of 16 rollers in each row.Table 6 presents the test setups and results of the three experiments with Rexnord ZA-2115 bearings.Table 6 includes the number of tests, the arrangement of accelerometers in each bearing, and the condition of the bearings after the tests were completed.

Results and Discussion
This section presents the results of the experiments conducted with SKF 2209EK spherical roller bearings and the model's ability to classify and predict various bearing faults under different operational conditions, transfer learning, and ResNet-101 in bearing fault diagnosis, with improved accuracy and reliability compared to traditional methods.

Results of the Experiment
The load and speed variations significantly affect the vibration signatures of the bearings.Faults in different parts of the bearings (such as the inner race, outer race, and spherical roller) alter the vibration patterns, and these alterations are more pronounced at higher speeds due to increased dynamic effects on the measured data in the vertical (V) and horizontal (H) directions.Raw vibration data for three cases, L0-RS1, L1-RS1, and L2-RS1, in the vertical and horizontal directions, are presented in Figure 18.The example of a time domain data scenario for one case is illustrated in Figure 18, which shows distinct vibration characteristics on the vertical and horizontal axes for healthy and defective bearings, where each image represents the vertical amplitude in the upper layer and the horizontal amplitude under it.Our experiments reveal that faults in different bearing components (inner race, outer race, spherical roller) significantly alter vibration patterns.This effect becomes even more pronounced at higher speeds due to increased dynamic forces.It was observed that there is some similarity between the vertical and horizontal amplitudes.Key observations include variations in amplitude and any notable patterns or anomalies in vibration data.These vibration data exhibit distinct vibration characteristics along both the vertical and horizontal axes.

Transfer Learning Model Results
In deep learning, particularly in the context of transfer learning, selecting hyperparameters is critical for determining a model's effectiveness and efficiency.Table 7 presents a detailed overview of the hyperparameters selected to implement transfer learning in ResNet-101, VGG16, VGG19, ResNet-18, and ResNet-50 architecture.This table is pivotal because it outlines the specific settings used to fine-tune the pre-trained ResNet-101, VGG16, VGG19, ResNet-18, and ResNet-50 models for the task at hand, which, in this case, is likely related to bearing fault detection.

Max epochs 15
Vibration 2024, 7, FOR PEER REVIEW 20 ent bearing components (inner race, outer race, spherical roller) significantly alter vibration patterns.This effect becomes even more pronounced at higher speeds due to increased dynamic forces.It was observed that there is some similarity between the vertical and horizontal amplitudes.Key observations include variations in amplitude and any notable patterns or anomalies in vibration data.These vibration data exhibit distinct vibration characteristics along both the vertical and horizontal axes.

Transfer Learning Model Results
In deep learning, particularly in the context of transfer learning, selecting hyperparameters is critical for determining a model's effectiveness and efficiency.Table 7 presents a detailed overview of the hyperparameters selected to implement transfer learning in ResNet-101, VGG16, VGG19, ResNet-18, and ResNet-50 architecture.This table is pivotal because it outlines the specific settings used to fine-tune the pre-trained ResNet-101, VGG16, VGG19, ResNet-18, and ResNet-50 models for the task at hand, which, in this case, is likely related to bearing fault detection.

Case Study 1 (L0)
The confusion matrices allowed us to analyze the performance of ResNet-101, VGG16, VGG19, ResNet-18, and ResNet-50 on the experimental data in more detail.By examining off-diagonal elements, we can identify which classes are often misclassified and gain insight into potential areas for improvement.Comparing this confusion matrix with previous ones helps us to understand whether the model's performance has improved or deteriorated over time.Ten labels were used to train each model, corresponding to different classes or types of bearing faults.HB was used for healthy bearing, ORF was used for outer fault, IRF was used for inner race fault, and RF was used for roller fault, the numbers (0.5, 1, 2) indicating the size of the fault.The confusion matrices provided show the performance of the classification model in five different data sets or under different conditions.True classes appear to represent the health status of bearings with varying degrees of defects in the inner race, outer race, and rollers.Figure 19 illustrates Case Study 1 for five rotation speeds and a clear increase in data samples, as indicated by the increasing numbers along the diagonal.This increase typically represents the model's performance better and can lead to more accurate predictions.cation between IRF1 and other fault classifications.The second matrix exhibits superior precision, demonstrating enhanced model prediction capabilities.Within the third matrix, there is discernible ambiguity about the distinction between ORF0.5 and IRF2, indicating a potential scope for enhancing the model.The fourth matrix demonstrates robust performance, albeit with some misclassifications, particularly between IRF1 and HB.The final matrix demonstrates the highest level of classification accuracy with very few errors.This indicates a robust model for detecting faults; however, there are some instances when IRF 0.5 and RF 0.5 are incorrectly classified.These matrices are vital instruments for evaluating the accuracy of problem detection, which is crucial for the upkeep and dependability of mechanical systems.The confusion matrices illustrate the classification results of a machine learning model designed to detect various bearing problems.The initial matrix demonstrates a predom-inantly elevated level of precision, although it occasionally shows a misclassification between IRF1 and other fault classifications.The second matrix exhibits superior precision, demonstrating enhanced model prediction capabilities.Within the third matrix, there is discernible ambiguity about the distinction between ORF0.5 and IRF2, indicating a potential scope for enhancing the model.The fourth matrix demonstrates robust performance, albeit with some misclassifications, particularly between IRF1 and HB.The final matrix demonstrates the highest level of classification accuracy with very few errors.This indicates a robust model for detecting faults; however, there are some instances when IRF 0.5 and RF 0.5 are incorrectly classified.These matrices are vital instruments for evaluating the accuracy of problem detection, which is crucial for the upkeep and dependability of mechanical systems.
Table 8 presents the training accuracy of the four diagnostic models TL_VGG-16, TL_VGG-19, TL_ ResNet-18, TL_ResNet-50, and TL_ResNet-101 across different rotational speed scenarios (RS1 to RS5) under no load (L0) conditions.The TL_VGG-16 model shows varied performance at different speeds, with the highest accuracy observed at L0-RS1 (93.68%) and the lowest at L0-RS2 (80%).The fluctuation in accuracy suggests that, while TL_VGG-16 is generally effective, its performance may be affected by changes in the rotational speed, and the TL_VGG-19 model consistently demonstrates high accuracy.TL_VGG-19 excels, particularly at L0-RS2 (95.20%) and L0-RS5 (95.47%).Its performance indicates robustness across the range of speeds, suggesting an adequate adaptation of the model to varying conditions without load.The TL_ResNet-101 model shows an impressive ability to maintain high accuracy, peaking at L0-RS2 (98.40%).The consistency in performance across speeds indicates that TL_ ResNet-50 is highly reliable under no-load conditions; TL_ResNet-101 exhibits the highest accuracy among the models, demonstrating exceptional proficiency in fault diagnosis under no-load conditions.The graph in Figure 20 shows the training accuracy of each diagnostic model (VGG-16, VGG-19, ResNet-18, ResNet-50, and ResNet-101) under no-load conditions.
The accuracy percentages for each model were plotted for five different study repetitions (L0-RS1-L0-RS5).This graph helps us to understand how each model performs without load, indicating its robustness and reliability under ideal conditions.Variations across repetitions indicate the consistency of each model.The results highlight the importance of selecting an appropriate model based on operational conditions.The high accuracy of TL_ResNet-101 with no load at different speeds makes it preferable for fault diagnosis in these scenarios.Figure 20 presents the training accuracy of various transfer learning models across different rotational speed scenarios under no-load conditions.Models such as TL_VGG-16, TL_VGG-19, TL_ResNet-18, TL_ResNet-50, and TL_ResNet-101 are evaluated.The accuracy varies between different speeds, with TL_ResNet-101 achieving the highest accuracy at L0-RS4.The graph in Figure 20 illustrates the training accuracy of each model, which helps us to understand their performance and robustness under ideal conditions.The results underscore the importance of choosing the right model for specific operational conditions, with TL_ResNet-101 being preferred for fault diagnosis in no-load scenarios due to its high accuracy.
peaking at L0-RS2 (98.40%).The consistency in performance across speeds indicates that TL_ ResNet-50 is highly reliable under no-load conditions; TL_ResNet-101 exhibits the highest accuracy among the models, demonstrating exceptional proficiency in fault diagnosis under no-load conditions.The graph in Figure 20   The accuracy percentages for each model were plotted for five different study repetitions (L0-RS1-L0-RS5).This graph helps us to understand how each model performs without load, indicating its robustness and reliability under ideal conditions.Variations across repetitions indicate the consistency of each model.The results highlight the importance of selecting an appropriate model based on operational conditions.The high accuracy of TL_ResNet-101 with no load at different speeds makes it preferable for fault diagnosis in these scenarios.Figure 20 presents the training accuracy of various transfer learning models across different rotational speed scenarios under no-load conditions.Models such as TL_VGG-16, TL_VGG-19, TL_ResNet-18, TL_ResNet-50, and TL_ResNet-101 are evaluated.The accuracy varies between different speeds, with TL_ResNet-101 achieving the highest accuracy at L0-RS4.The graph in Figure 20 illustrates the training accuracy of each model, which helps us to understand their performance and robustness under ideal conditions.The results underscore the importance of choosing the right model for specific operational conditions, with TL_ResNet-101 being preferred for fault diagnosis in no-load scenarios due to its high accuracy.

Case Study 2 (L1)
The matrices show that the model generally has high accuracy, as shown in Appendix A, from 1 to 5, with most predictions falling on the diagonal for classes representing healthy bearings and those with varying fault degrees.However, certain types of faults, particularly those with minor defects, exhibit more frequent misclassifications,

Case Study 2 (L1)
The matrices show that the model generally has high accuracy, as shown in Appendix A, from 1 to 5, with most predictions falling on the diagonal for classes representing healthy bearings and those with varying fault degrees.However, certain types of faults, particularly those with minor defects, exhibit more frequent misclassifications, indicating a challenge for the model to differentiate between subtle fault characteristics.Despite the high precision, some consistent confusion between closely related fault types (e.g., RF1 and RF2) suggests that the model may struggle to distinguish between similar fault severities.This is particularly evident in classes like RF0.5 and ORF0.5, which may share similar vibration signatures or extracted features, leading to misclassification.
Table 9 illustrates the training accuracy percentages for different transfer learning models applied to Case Study 2, where the load condition is 500N.Models include variations in VGG and ResNet architecture, specifically VGG-16, VGG-19, ResNet-18, ResNet-50, and ResNet-101.The performance is evaluated under five run scenarios (L1-RS1 to L1-RS5).The ResNet-101 model shows the highest accuracy in most scenarios, particularly excelling in L1-RS3 with a 96.80% accuracy rate.VGG-16, while showing the lowest accuracy in L1-RS2, performs consistently well in other scenarios.Data indicate that while all models are highly effective, specific models such as ResNet-101 may be more suitable for this diagnostic task under the given load condition.The graph in Figure 21   The graph in Figure 21

Case Study 3 (L2)
The accuracy of the model can be observed through the high number of true positives along the diagonals of the matrices, as illustrated in Appendix A, from 6 to 10.However, some consistent misclassifications are notable, particularly between certain fault classes with similar characteristics, such as RF0.5, RF1, IRF0.5, and HB, where the model seems to confuse one for the other.This pattern suggests that the model may benefit from further refinement in its ability to discern subtle differences between fault types.Despite the high accuracy, misclassifications highlight specific areas for improvement, such as feature engineering, data augmentation, and even re-visiting the model architecture.Fine-tuning the model to address these misclassifications is crucial to enhance its diagnostic performance and reliability in real-world applications for fault detection in rolling element bearings.
Table 10 presents the training accuracy of the five diagnostic models, TL_VGG-16, TL_VGG-19, TL_ResNet-18, TL_ResNet-50, and TL_ResNet-101, in different rotational speed scenarios (RS1-RS5) under a load condition of 1000 N (L2).The multiclass confusion matrix presents a precise and unambiguous depiction of errors and their corresponding positions for each category in a multiclassification task.The vertical axis represents the actual labels, whereas the horizontal axis represents the predicted outcomes.The elements on the main diagonal represent the count of accurate judgment samples, whereas the remaining points indicate the number of incorrect judgments.Figure 23 shows the multiclass confusion matrix of TL_ResNet-101 when evaluating the model on unseen data from the experiment, specifically from L0-RS1 to L0-RS5.
The series of normalized confusion matrices reflects the classification performance of a machine learning model for bearing fault detection.We observed a high classification accuracy for most classes in these matrices, indicating a robust model.However, a closer examination revealed some areas of misclassification, suggesting opportunities for model improvement.For example, there is recurring confusion between classes, such as HB with IRF0.5, RF0.5, and ORF0.5, implying challenges in differentiating between  The multiclass confusion matrix presents a precise and unambiguous depiction of errors and their corresponding positions for each category in a multiclassification task.The vertical axis represents the actual labels, whereas the horizontal axis represents the predicted outcomes.The elements on the main diagonal represent the count of accurate judgment samples, whereas the remaining points indicate the number of incorrect judgments.Figure 23 shows the multiclass confusion matrix of TL_ResNet-101 when evaluating the model on unseen data from the experiment, specifically from L0-RS1 to L0-RS5.Table 11 presents the performance metrics of a diagnostic model for rolling element bearings in five test cases (L0-RS1 to L0-RS5).It includes precision, recall, specificity, F1 score, and accuracy for different bearing conditions.Precision measures the proportion of true positives against all positive predictions, indicating the accuracy when it predicts a fault.Recall, or sensitivity, assesses the model's ability to identify actual faults correct- The series of normalized confusion matrices reflects the classification performance of a machine learning model for bearing fault detection.We observed a high classification accuracy for most classes in these matrices, indicating a robust model.However, a closer examination revealed some areas of misclassification, suggesting opportunities for model improvement.For example, there is recurring confusion between classes, such as HB with IRF0.5, RF0.5, and ORF0.5, implying challenges in differentiating between healthy bearings and those with minor defects.Furthermore, as the severity of the defects increases, the model appears to struggle more to distinguish between different severity levels, which is evident in the confusion between IRF2 and IRF1 and between RF2 and RF1.L0-RS1 demonstrates exceptional precision, effectively eliminating misclassifications.L0-RS2 and L0-RS3 still show high accuracy with minor confusions between certain fault types, such as healthy bearing and inner race fault size 0.5.L0-RS4 continues the trend of high performance with slight confusion, primarily between outer race fault size 0.5 and inner race fault size 1.Finally, L0-RS5 indicates some misclassifications, notably with respect to the 0.5 fault size of the roller.These matrices highlight the high reliability and minor areas for improvement in fault diagnosis.
Table 11 presents the performance metrics of a diagnostic model for rolling element bearings in five test cases (L0-RS1 to L0-RS5).It includes precision, recall, specificity, F1 score, and accuracy for different bearing conditions.Precision measures the proportion of true positives against all positive predictions, indicating the accuracy when it predicts a fault.Recall, or sensitivity, assesses the model's ability to identify actual faults correctly.The F1 score is a balanced measure of precision and recall, useful for uneven class distributions.Specificity measures the model's ability to correctly identify true negatives, which is crucial for avoiding false alarms.Together, these metrics provide a comprehensive understanding of diagnostic accuracy.Starting with the L0-RS1 case, the model achieves an exceptional 99% overall accuracy, demonstrating perfect precision, recall, F1 score, and specificity across all classes.This indicates an extraordinary ability to accurately diagnose faults, identify healthy bearings without errors, and minimize false positives and negatives.However, a change is observed in the L0-RS2 case, where the accuracy slightly drops to 97.38%.Here, the precision and recall for classes like IRF0.5 and IRF1 show minor declines, hinting at challenges in accurately identifying smaller inner race faults.Despite this, the high specificity across all classes maintains the model's credibility in avoiding false positives.particularly in the balanced F1 score for IRF1, indicating room for improvement in the model's classification of these fault types.L2-RS3 sees a slight increase in overall accuracy at 98.83%.Precision maintains a high standard across all classes, with a minor decrease for RF1 and RF2.Recall and F1 scores remain consistently high, but there is a noticeable drop in ORF2 recall.Specificity remains high, reinforcing the model's reliability in ruling out non-fault conditions.For L2-RS4, the accuracy drops slightly to 96.43%.There is a decrease in precision for IRF0.5 and a significant reduction in recall for RF0.5, with corresponding effects on the F1 score, reflecting the need for better identification of these fault types.Despite this, the specificity is still strong.Finally, L2-RS5 shows an overall precision of 94.64%, with the lowest precision for RF0.5 and the lowest recall for IRF0.5, RF1, and RF2 among the runs.The F1 score and specificity also take a hit, suggesting that this run's model performance is the weakest, particularly in distinguishing between these fault types.
The confusion matrices shown in Appendix C represent the classification test results for Case Study 2 over five runs (L2-RS1 through L2-RS5).In L2-RS1, the matrix suggests excellent model performance with perfect classification for most classes, except for a slight confusion between RF1 and RF2.The L2-RS2 matrix reveals more variability with noticeable misclassifications, such as between IRF1 and other fault types, suggesting difficulty distinguishing between specific inner race fault sizes.In L2-RS3, the model appears to have high accuracy again with minor confusion, such as slightly misclassifying ORF0.5 as ORF2.
The L2-RS4 matrix shows a high degree of precision, but with some misclassification between RF0.5 and other fault types, indicating specific areas where the model could improve.
Lastly, L2-RS5 shows excellent overall performance, with minor confusion between IRF1 and IRF2, and between RF1 and other faults, pointing to potential difficulties in distinguishing similar fault conditions.

Testing the Model on Unseen Data for IMS
To evaluate the efficacy of the proposed technique using the IMS dataset, the data were initially transformed into images using Scattergram Filter Bank 1.Subsequently, the MATLAB script provided in Appendix B was used to test the TL_ResNet-101 model.The model, saved as 'TL_Model.mat'following the training phase, was then ready for use in the testing phase.
The TL_ResNet-101 model can classify new images.This process involves loading a new image, modifying its size to match the input criteria of the model, and then applying the model to determine its category.The MATLAB code for prediction is illustrated in Appendix D. This code iterates through each image in the specified folder, resizes it, classifies it using the TL_ResNet-101 model, and stores the filename, predicted class, and classification confidence in a table.After the script, the results were reviewed using MATLAB.The highest predictions of the segment scores are presented in Table 12.The model demonstrates high predictive accuracy in most cases, particularly in the L2-RS4 scenario in all fault positions and directions.This indicates its robustness and effectiveness in diagnosing bearing faults in various load and speed conditions.Variable accuracy levels, particularly in the L0-RS4 and L1-RS4 scenarios, suggest that the fault's specific operational conditions or characteristics may influence the model's performance.The moderate accuracy in some cases (L0-RS4 for B3-CHX) indicates potential areas for model refinement.Table 13 presents the results using transfer learning with TL_ResNet-101 to predict the type of fault in Set No.2 of the IMS data set.The model achieved exceptionally high accuracy (99.979% and 99.99%) in predicting the type of fault labeled ORF2 for the early fault (L0-RS4 and L1-RS4, respectively).This indicates that ResNet-101, when combined with transfer learning, is highly effective for early fault detection, which is a critical aspect of preventive maintenance.For the advanced fault stage (L2-RS4), the accuracy dropped to 96.028%, with the predicted label changing to RF0.5.This decrease in accuracy can be attributed to the increased complexity of the fault characteristics at this stage.This suggests that although the model is robust in early detection, its performance varies with the progression of fault severity.
Table 14 presents the test results on Set No.3 of the IMS dataset, focusing on bearing 3 (B3) across three levels of fault severity (L0-RS4, L1-RS4, and L2-RS4).The experiment utilized transfer learning on TL_ResNet-101 to analyze data from the CHX sensor channel in the vertical direction.The model achieved high precision in the early (L0-RS4) and intermediate (L1-RS4) fault stages with scores of 99.871% and 99.949%, respectively.Both stages were predicted as ORF2.This high accuracy indicates the robustness of the model in detecting early-to mid-stage faults, which is crucial for timely maintenance interventions.

Comparison of this Work with Previous Research
Table 15 shows a range of models, from knowledge addition-based transfer learning, seen in Kumar et al. (2023) [36], to more complex frameworks such as deep transfer learning with ResNet-101, VGG-16, VGG19, ResNet-18, and ResNet-50, used in the current work.This diversity indicates that a broad spectrum of methodologies is being explored in the field, each with unique strengths.The accuracies achieved in these studies were impressive, with current work reaching up to 99%.This high level of precision, especially compared to earlier works such as [36] which demonstrated a 93% precision rate, indicates continuous improvements in fault diagnosis techniques, and the use of advanced deep transfer learning models in the current work could contribute to this high accuracy.Some studies, such as those of Li et al. [40], integrated CNN with the pseudo-Wigner-Ville empirical mode decomposition distribution, highlighting innovative combinations of techniques to improve diagnostic capabilities.Similarly, integrating multiple advanced architectures, such as ResNet-101, VGG-16, VGG19, ResNet-18, and ResNet-50, in the current work, demonstrates a trend toward complex, multifaceted approaches.The variable operating speed ranges and high accuracy indicate the potential of these models for real-world applications.As in the current study, models that operate effectively across a wide range of speeds are important for diverse industrial settings.The evolution from simpler models to more complex and accurate systems indicates a trend towards integrating multiple deep learning techniques.

Conclusions
This study aims to fill the gap in spherical bearing fault diagnosis methods by presenting an innovative approach that combines advanced machine learning techniques with practical application considerations.This study successfully demonstrated the efficacy of using transfer learning, specifically the ResNet-101 model, in the fault diagnosis of spherical roller bearings under varying operating conditions.The key findings of this study indicate that transfer learning can significantly improve the accuracy and efficiency of fault diagnosis in mechanical bearings, which is a critical component in numerous industrial applications.Experiments conducted with SKF 2209EK spherical roller bearings yield promising results, showing the model's ability to identify and diagnose faults accurately.This performance is particularly notable given the challenges of varying operating conditions and the need for robust diagnostic tools in dynamic industrial environments.
The findings of this study underscore the potential of advanced machine learning techniques, specifically transfer learning coupled with ResNet-101, in the field of mechanical fault diagnosis.The high precision of the model can be attributed to the effective signalto-image transformation using Scattergram Filter Bank 1, which allows for a detailed representation of vibration signals.This approach addresses the challenge of limited labeled data and improves diagnostic accuracy under varying operational conditions.Furthermore, the adoption of a multidirectional vibration data analysis provides a more nuanced understanding of bearing conditions, surpassing the limitations of traditional single-direction analysis.The methodology and results could serve as a benchmark for future research in this domain, especially in spherical bearings, to address the challenge of limited labeled data and improve diagnostic accuracy under varying operational conditions.Additionally, the adoption of a multidirectional vibration data analysis provides a more nuanced understanding of bearing conditions, surpassing the limitations of traditional single-direction analysis.
In conclusion, this study presents a significant step forward in applying advanced machine learning techniques for mechanical fault diagnosis.It offers a foundation upon which future research can be built, with the aim to further expand the capabilities of such technologies in industrial maintenance.

Figure 1 .
Figure 1.Schematic diagram of the experimental test rig.

Figure 1 .
Figure 1.Schematic diagram of the experimental test rig.

Figure 4 .
Figure 4. Creating defective components of experimental work using an EDM: (a) inner race, (b) outer race, and (c) roller.

Figure 8 .
Figure 8.The alignment procedure using the Fixturlaser XA system.

Figure 10 .
Figure 10.Data acquisition and power control.

Figure 12 .
Figure 12.The measurement of vibrations using the SKF Microlog loading case.

Figure 14 .
Figure 14.Schematic diagram of the experimental procedure.

Figure 14 .
Figure 14.Schematic diagram of the experimental procedure.

Figure 15 .
Figure 15.The vibration measurements within the unloading case.

Figure 15 .
Figure 15.The vibration measurements within the unloading case.

Figure 16 .
Figure 16.(a) Segmentation of the signal, and (b) Scalogram filter bank 1 for each bearing condition.

Figure 17 .Figure 17 .
Figure 17.(a) IMS bearing test rig, (b) Rexnord ZA-2115 double row bearing, (c) inner race defect in bearing 3, test 1, (d) roller element defect in bearing 4, test 1, and (e) outer race defect in bearing 1, test 2 [35].Table 6. IMS test setup and results for three different sets of experiments.Test No. Type of Bearing Arrangement of the Accelerometer on the Bearing Type of Fault at the End of the Test Sensor

Figure 18 .
Figure 18.Raw vibration data for L0-RS1 in vertical and horizontal directions.

Figure 18 .
Figure 18.Raw vibration data for L0-RS1 in vertical and horizontal directions.

Figure 20 .
Figure 20.Transfer learning model accuracy for Case Study 1.

Figure 20 .
Figure 20.Transfer learning model accuracy for Case Study 1.
illustrates the training accuracy percentages of five different transfer learning models in five separate runs for Case Study 2, under a load of 500N.Models tested include VGG-16, VGG-19, ResNet-18, ResNet-50, and ResNet-101.Each bar represents a model's accuracy in a particular run, with colors denoting different runs (L1-RS1 through L1-RS5).The graph in Figure 21 shows that ResNet-101 generally achieves the highest accuracy on most runs, while VGG-16 and VGG-19 exhibit some variability.This visual representation helps to compare the consistency and reliability of each model's performance in the given scenario.
illustrates the training accuracy percentages of five different transfer learning models in five separate runs for Case Study 2, under a load of 500N.Models tested include VGG-16, VGG-19, ResNet-18, ResNet-50, and ResNet-101.Each bar represents a model's accuracy in a particular run, with colors denoting different runs (L1-RS1 through L1-RS5).The graph in Figure 21 shows that ResNet-101 generally achieves the highest accuracy on most runs, while VGG-16 and VGG-19 exhibit some variability.This visual representation helps to compare the consistency and reliability of each model's performance in the given scenario.

Figure 21 .
Figure 21.Training accuracy for Case Study 2.Figure 21.Training accuracy for Case Study 2.

Figure 21 .
Figure 21.Training accuracy for Case Study 2.Figure 21.Training accuracy for Case Study 2.

3. 3 .
Testing Model on Unseen Data 3.3.1.Testing Model on Unseen Data from the Experiment

3. 3 .
Testing Model on Unseen Data 3.3.1.Testing Model on Unseen Data from the Experiment

Table 2 .
Dimensions of defective bearings in EDM experiments.

Table 2 .
Dimensions of defective bearings in EDM experiments.

Table 3 .
Coding scheme for each case study with different loads and rotation speeds.

Table 3 .
Coding scheme for each case study with different loads and rotation speeds.

Table 4 .
N values for each rotation speed.

Table 4 .
N values for each rotation speed. N

Table 6 .
IMS test setup and results for three different sets of experiments.

Table 7 .
Hyperparameters for transfer learning.

Table 7 .
Hyperparameters for transfer learning.

Table 10
displays the training accuracy for various transfer learning models in Case Study

Table 12 .
Test results of Set No.1 from the IMS dataset.

Table 13 .
Testing in Set No.2 of the IMS dataset.

Table 14 .
Testing in Set No.3 of the IMS dataset.

Table 15 .
Comparison of previous research with this work.