XGBoost Method-Based Gearbox Fault Diagnosis Using Time-Domain Signal Under Road Vehicle Characteristics
Abstract
1. Introduction
- Real-world time-domain wheel-speed signals from four wheels under diverse driving conditions (multiple speeds, hill grades, and steering pad) were collected to support gearbox fault diagnosis. These signals inherently contain noise, dynamic disturbances, and nonlinear interactions representative of actual vehicle operation. The proposed XGBoost model was validated using this comprehensive, real-world dataset, demonstrating robustness in a highly challenging diagnostic scenario.
- Twelve statistical time-domain features combined with vehicle operational parameters were extracted from the wheel-speed signals. These features were selected to capture the dynamic behaviors associated with gearbox health states. By leveraging wheel-speed-based features rather than traditional sensors, this study offers a practical, diagnostic approach suitable for on-board vehicle implementations.
- The proposed XGBoost-based diagnostic model achieved superior accuracy compared with benchmark tree-based methods such as Decision Tree and Random Forest. The enhanced performance results from XGBoost’s ability to capture nonlinear feature interactions, handle noisy real-world data. These advantages make XGBoost particularly suitable for gearbox diagnostics in real driving environments where conventional methods struggle.
2. Methodology
2.1. Experimental Setup
2.1.1. System Configuration
2.1.2. Driving Scenarios
- High-Speed Circuit: Conducted on a closed track combining straight and curved sections. Driving speeds of 20 kph (low), 60 kph (medium), and 80 kph (high) were tested, with three repetitions at each speed.
- Uphill Road: Designed to examine the reducer’s response to varying load conditions. Uphill and downhill driving were performed on slopes of 6%, 12%, 18%, and 30%. Vehicles drove at a constant speed of 20 kph. Measurements were consistent with the high-speed circuit.
- Steering Pad: Implemented to analyze the torque imbalance between left and right wheels during steering. The vehicle was driven in a circular path for one minute at a constant speed of 40 kph to evaluate load transfer and vibration behavior.
2.2. Data Preprocessing
2.3. Experimental Data Setup
2.4. Model Development
2.5. Optimization Framework
3. Performance Evaluations
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Kumar, P.; Hirani, H. Misalignment effect on gearbox failure: An experimental study. Measurement 2021, 169, 108492. [Google Scholar] [CrossRef]
- Goswami, P.; Rai, R.N. A systematic review on failure modes and proposed methodology to artificially seed faults for promoting PHM studies in laboratory environment for an industrial gearbox. Eng. Fail. Anal. 2023, 146, 107226. [Google Scholar] [CrossRef]
- Latiffianti, E.; Sheng, S.; Ding, Y. Wind turbine gearbox failure detection through cumulative sum of multivariate time series data. Front. Energy Res. 2022, 10, 904622. [Google Scholar] [CrossRef]
- Li, F.; Ding, N.; Song, M.; Zhong, Z.; Liu, L.; Shen, X.; Xu, H.; Tian, L.; Xu, N.; Guo, W.; et al. A review on research of gear failures in mechanical equipment. J. Fail. Anal. Prev. 2025, 25, 612–648. [Google Scholar] [CrossRef]
- Wu, J.; Zhou, Y.; Jiang, W.; Chen, X. Time-domain dynamic modeling and analysis of complex heavy-duty gearbox considering floating effect. Appl. Sci. 2021, 11, 6876. [Google Scholar] [CrossRef]
- Zhang, J.; Zhang, Q.; Feng, W.; Qin, X.; Sun, Y. Gearbox fault diagnosis based on frequency-domain Gramian angular difference field and deep convolutional neural network. Proc. Inst. Mech. Eng. C J. Mech. Eng. Sci. 2023, 237, 5187–5202. [Google Scholar] [CrossRef]
- Qiu, T.; Wang, L.; Huang, W. Time-frequency domain feature enhanced sparse matrix and singular value vector optimization for gearbox fault diagnosis. In Proceedings of the 2024 International Conference on Sensing, Measurement & Data Analytics in the Era of Artificial Intelligence (ICSMD), Huangshan, China, 15–31 August 2024; pp. 1–4. [Google Scholar]
- Başaran, M.; Fidan, M. Gearbox fault classification by using frequency-based feature extraction. Eskisehir Tech. Univ. J. Sci. Technol. A Appl. Sci. Eng. 2020, 21, 101–107. [Google Scholar] [CrossRef]
- Chen, Y.; Zuo, M. A sparse multivariate time series model-based fault detection method for gearboxes under variable speed condition. Mech. Syst. Signal Process. 2022, 167, 108539. [Google Scholar] [CrossRef]
- Han, G.; Wan, S.; Lv, Z.; Liu, R.; Wang, J.; Tang, G. The analysis of gearbox fault diagnosis research based on the EMD and Hilbert envelope demodulation. Adv. Mater. Res. 2014, 926–930, 1800–1805. [Google Scholar] [CrossRef]
- Gelman, L.; Soliński, K.; Ball, A. Novel higher-order spectral cross-correlation technologies for vibration sensor-based diagnosis of gearboxes. Sensors 2020, 20, 5131. [Google Scholar] [CrossRef]
- Alhams, A.; Abdelhadi, A.; Badri, Y.; Sassi, S.; Renno, J. Enhanced bearing fault diagnosis through trees ensemble method and feature importance analysis. J. Vib. Eng. Technol. 2024, 12, 109–125. [Google Scholar] [CrossRef]
- Tang, G.; Pang, B.; Tian, T.; Zhou, C. Fault diagnosis of rolling bearings based on improved fast spectral correlation and optimized random forest. Appl. Sci. 2018, 8, 1859. [Google Scholar] [CrossRef]
- Han, T.; Jiang, D. Rolling bearing fault diagnostic method based on VMD–AR model and random forest classifier. Shock Vib. 2016, 2016, 5132046. [Google Scholar] [CrossRef]
- Zhang, R.; Li, B.; Jiao, B. Application of XGBoost algorithm in bearing fault diagnosis. IOP Conf. Ser. Mater. Sci. Eng. 2019, 490, 072062. [Google Scholar] [CrossRef]
- Pani, S.R.; Bera, P.K.; Kumar, V. Detection and classification of internal faults in power transformers using tree-based classifiers. In Proceedings of the 2020 IEEE International Conference on Power Electronics, Drives and Energy Systems (PEDES), Jaipur, India, 16–19 December 2020; pp. 1–6. [Google Scholar]
- Zhang, M.; Fang, C.; Ji, S. Research on fault diagnosis for power transformer based on random forests and wavelet transform. Int. J. Wirel. Mob. Comput. 2024, 26, 354–360. [Google Scholar] [CrossRef]
- Wang, L. Fault diagnosis of transformer based on XGBoost. Appl. Comput. Eng. 2024, 90, 107–111. [Google Scholar] [CrossRef]
- Kundu, P.; Darpe, A.K.; Kulkarni, M.S. An ensemble decision tree methodology for remaining useful life prediction of spur gears under natural pitting progression. Struct. Health Monit. 2020, 19, 854–872. [Google Scholar] [CrossRef]
- Ikhlef, B.; Rahmoune, C.; Toufik, B.; Benazzouz, D. Gearboxes fault detection under operation varying condition based on MODWPT, ant colony optimization algorithm and random forest classifier. Adv. Mech. Eng. 2021, 13, 168781402110430. [Google Scholar] [CrossRef]
- Spirto, M.; Melluso, F.; Nicolella, A.; Malfi, P.; Cosenza, C.; Savino, S.; Niola, V. A comparative study between SDP-CNN and time–frequency-CNN based approaches for fault detection. J. Dyn. Monit. Diagn. 2025, in press. [Google Scholar] [CrossRef]
- Spirto, M.; Nicolella, A.; Melluso, F.; Malfi, P.; Cosenza, C.; Savino, S.; Niola, V. Enhancing SDP-CNN for gear fault detection under variable working conditions via multi-order tracking filtering. J. Dyn. Monit. Diagn. 2025, in press. [Google Scholar] [CrossRef]
- Hoang, N.P.; Nguyen, T.D.; Nguyen, T.H.; Pham, D.H.; Nguyen, P.D.; Nguyen, T.V.H. A data-driven framework for fault diagnostics in gearbox health monitoring under non-stationary conditions. Processes 2025, 13, 1663. [Google Scholar] [CrossRef]
- Bastianin, A. Robust measures of skewness and kurtosis for macroeconomic and financial time series. Appl. Econ. 2020, 52, 637–670. [Google Scholar] [CrossRef]
- Smith, J. Root mean square (RMS) vibration measurement and analysis. J. Sound Vib. 2017, 403, 265–274. [Google Scholar]
- Brys, G.; Hubert, M.; Struyf, A. A robust measure of skewness. J. Comput. Graph. Stat. 2004, 13, 996–1017. [Google Scholar] [CrossRef]
- Dutta, A.; Chakraborty, N. On the use of crest factor in machine condition monitoring: An overview. Measurement 2016, 87, 147–160. [Google Scholar]
- Randall, R.B. Vibration-Based Condition Monitoring: Industrial, Aerospace and Automotive Applications; Wiley: Chichester, UK, 2010. [Google Scholar]
- Antoni, J. The spectral kurtosis: A useful tool for characterising non-stationary signals. Mech. Syst. Signal Process. 2006, 20, 282–307. [Google Scholar] [CrossRef]
- Wu, H.; Zhang, P.; Zhu, R.; Kong, F. Kurtosis-based fault detection in rolling element bearings under varying operating conditions. IEEE Trans. Instrum. Meas. 2019, 68, 4319–4329. [Google Scholar]
- Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Aisy, R. Residual XGBoost regression—Based individual moving window GDP forecasting. PLoS ONE 2025, 20, e0321660. [Google Scholar] [CrossRef] [PubMed]
- Cortes, C.; Vapnik, V.N. Support vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
- Huang, H.-Y.; Lin, C.-J. Linear and kernel classification: When to use which? In Proceedings of the SIAM International Conference on Data Mining, Miami, FL, USA, 5–7 May 2016; pp. 216–224. [Google Scholar]
- Scabini, L.F.S.; Bruno, O.M. Structure and performance of fully connected neural networks: Emerging complex network properties. Phys. A Stat. Mech. Its Appl. 2023, 615, 128585. [Google Scholar] [CrossRef]









| Equation Name | Equation Formula | |
|---|---|---|
| Root square mean (RMS) | (1) | |
| Standard deviation (STD) | (2) | |
| Mean | (3) | |
| Variance | (4) | |
| Peak | (5) | |
| Peak to peak (P2P) | (6) | |
| Shape factor | (7) | |
| Clearance factor | (8) | |
| Np4 factor | (9) | |
| Crest factor (CRSF) | (10) | |
| Kurtosis (KURT) | (11) | |
| Skewness (SKEW) | (12) |
| Hyperparameter | Grid Search |
|---|---|
| n estimators | |
| Max depth | |
| Learning rate | |
| Subsampling |
| Health State | Normal | Abnormal | Total |
|---|---|---|---|
| Number of experiments | 224 | 228 | 452 |
| Model Type | Training Time (Seconds) | Testing Time (Seconds) |
|---|---|---|
| Proposed XGBoost | ∼0.44 | ∼0.0034 |
| Decision Tree | ∼0.02 | ∼0.0011 |
| Random Forest | ∼0.37 | ∼0.015 |
| FNN | ∼19.38 | ∼0.22 |
| Nonlinear SVM | ∼0.78 | ∼0.0029 |
| Linear SVM | ∼0.03 | ∼0.0012 |
| Heath State | Normal | Abnormal | Overall |
|---|---|---|---|
| Random Forest (%) | 71.11 | 73.91 | 72.53 |
| Decision Tree (%) | 80.00 | 71.74 | 75.82 |
| Proposed XGBoost (%) | 86.67 | 78.26 | 82.42 |
| Health State | Precision | Recall | F1-Score |
|---|---|---|---|
| Normal | 0.84 ± 0.04 | 0.80 ± 0.08 | 0.82 ± 0.05 |
| Abnormal | 0.82 ± 0.06 | 0.85 ± 0.04 | 0.83 ± 0.03 |
| Average | 0.83 ± 0.05 | 0.82 ± 0.07 | 0.82 ± 0.04 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Tuyet-Doan, V.-N.; Choi, M.; Park, G. XGBoost Method-Based Gearbox Fault Diagnosis Using Time-Domain Signal Under Road Vehicle Characteristics. Electronics 2025, 14, 4736. https://doi.org/10.3390/electronics14234736
Tuyet-Doan V-N, Choi M, Park G. XGBoost Method-Based Gearbox Fault Diagnosis Using Time-Domain Signal Under Road Vehicle Characteristics. Electronics. 2025; 14(23):4736. https://doi.org/10.3390/electronics14234736
Chicago/Turabian StyleTuyet-Doan, Vo-Nguyen, Mooryong Choi, and Giseo Park. 2025. "XGBoost Method-Based Gearbox Fault Diagnosis Using Time-Domain Signal Under Road Vehicle Characteristics" Electronics 14, no. 23: 4736. https://doi.org/10.3390/electronics14234736
APA StyleTuyet-Doan, V.-N., Choi, M., & Park, G. (2025). XGBoost Method-Based Gearbox Fault Diagnosis Using Time-Domain Signal Under Road Vehicle Characteristics. Electronics, 14(23), 4736. https://doi.org/10.3390/electronics14234736

