A Novel Kolmogorov–Arnold Attention Allocation Network for Cutting Tool Remaining Useful Life Prediction
Abstract
1. Introduction
- (1)
- ST is applied to multi-sensor signals to extract time-frequency features, providing KA-AAN with as much processing information as possible, thereby avoiding overfitting caused by insufficient features and samples.
- (2)
- The Attentional Feature Extraction Network (AFEN) allocates attention to extracting attentional features concerning both tool degradation and working conditions, which effectively overcomes the problem of poor feature generalization in milling.
- (3)
- The KAN regression block maps from attentional features to RUL, learning the most appropriate way to activate and combine features based on the specific circumstances of the tool and machining process. The nonlinear fusion of the attentional features overcomes the problem of insufficient prediction accuracy of tool RUL under different working conditions.
2. Problem Definition
3. Methodologies
3.1. Time-Frequency Feature Based on S-Transform
3.2. Kolmogorov–Arnold Network
3.3. KA-AAN
4. Experimental Setup and Prediction Results
4.1. Experimental Setup
4.2. Results and Discussions
5. Conclusions
- (1)
- The innovative use of AFEN in the method allocates importance to the features in the time-frequency domain. This captures the complex nonlinear mapping relationship between time-frequency features and RUL, effectively overcoming the problem of the limited generalization ability of milling features.
- (2)
- KAN is used to fuse the attentional features after attention redistribution. By replacing the traditional activation function with a learnable spline curve and optimizing the weight parameters of the spline curve, the nonlinear relationship between the signal features and tool RUL is accurately captured, enabling precise prediction of tool RUL under different working conditions.
- (3)
- RUL prediction experiments conducted using the milling tool life dataset validate the effectiveness of the method. Comparative analysis shows that the errors associated with the proposed method are relatively small, with the MSE and MAPE between predicted RUL and actual RUL reaching 2.91 and 18.63%, respectively. The average MAPE of KA-AAN is about 20% lower than that of Time–Space Attention and about 18% lower than CSBLSTM-TSAM (Notes: The performance metrics reported in the abstract and conclusion for ‘same working conditions’ and ‘different working conditions’ represent the average results over Cases 1–3 and Cases 4–6, respectively. Case 0, in contrast, evaluates model performance under a random train-test split across the entire dataset, regardless of operating conditions.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Zhu, K.; Zhang, Y. A generic tool wear model and its application to force modeling and wear monitoring in high speed milling. Mech. Syst. Signal Process. 2019, 115, 147–161. [Google Scholar] [CrossRef]
- Li, Z.; Liu, R.; Wu, D. Data-driven smart manufacturing: Tool wear monitoring with audio signals and machine learning. J. Manuf. Process. 2019, 48, 66–76. [Google Scholar] [CrossRef]
- Wang, C.; Bao, Z.; Zhang, P.; Ming, W.; Chen, M. Tool wear evaluation under minimum quantity lubrication by clustering energy of acoustic emission burst signals. Measurement 2019, 138, 256–265. [Google Scholar] [CrossRef]
- Deutsch, J.; He, D. Using Deep Learning-Based Approach to Predict Remaining Useful Life of Rotating Components. IEEE Trans. Syst. Man Cybern. Syst. 2018, 48, 11–20. [Google Scholar] [CrossRef]
- Mo, Y.; Wu, Q.; Li, X.; Huang, B. Remaining useful life estimation via transformer encoder enhanced by a gated convolutional unit. J. Intell. Manuf. 2021, 32, 1997–2006. [Google Scholar] [CrossRef]
- Yousuf, S.; Khan, S.A.; Khursheed, S. Remaining useful life (RUL) regression using Long–Short Term Memory (LSTM) networks. Microelectron. Reliab. 2022, 139, 114772. [Google Scholar] [CrossRef]
- Cao, L.; Zhang, H.; Meng, Z.; Wang, X. A parallel GRU with dual-stage attention mechanism model integrating uncertainty quantification for probabilistic RUL prediction of wind turbine bearings. Reliab. Eng. Syst. Saf. 2023, 235, 109197. [Google Scholar] [CrossRef]
- Zhang, X.; Shi, B.; Feng, B.; Liu, L.; Gao, Z. A hybrid method for cutting tool RUL prediction based on CNN and multistage wiener process using small sample data. Measurement 2023, 213, 112739. [Google Scholar] [CrossRef]
- Zhao, R.; Yan, R.; Chen, Z.; Mao, K.; Wang, P.; Gao, R.X. Deep learning and its applications to machine health monitoring. Mech. Syst. Signal Process. 2019, 115, 213–237. [Google Scholar] [CrossRef]
- Li, G.; Xu, S.; Jiang, R.; Liu, Y.; Zhang, L.; Zheng, H.; Sun, L.; Sun, Y. Physics-informed inhomogeneous wear identification of end mills by online monitoring data. J. Manuf. Process. 2024, 132, 759–771. [Google Scholar] [CrossRef]
- Nasir, V.; Dibaji, S.; Alaswad, K.; Cool, J. Tool wear monitoring by ensemble learning and sensor fusion using power, sound, vibration, and AE signals. Manuf. Lett. 2021, 30, 32–38. [Google Scholar] [CrossRef]
- Li, G.; Shang, X.; Yang, L.; Xie, D.; Sun, L.; Si, M.; Zhou, H. Milling tool wear condition monitoring based on physics-informed autoregression transformation of audio signals. IEEE Sensors J. 2025. [Google Scholar] [CrossRef]
- Cao, X.-C.; Chen, B.-Q.; Yao, B.; He, W.-P. Combining translation-invariant wavelet frames and convolutional neural network for intelligent tool wear state identification. Comput. Ind. 2019, 106, 71–84. [Google Scholar] [CrossRef]
- Chen, Y.; Jin, Y.; Jiri, G. Predicting tool wear with multi-sensor data using deep belief networks. Int. J. Adv. Manuf. Technol. 2018, 99, 1917–1926. [Google Scholar] [CrossRef]
- Azmi, A.I. Monitoring of tool wear using measured machining forces and neuro-fuzzy modelling approaches during machining of GFRP composites. Adv. Eng. Softw. 2015, 82, 53–64. [Google Scholar] [CrossRef]
- Wei, X.; Liu, X.; Yue, C.; Wang, L.; Liang, S.Y.; Qin, Y. Tool wear state recognition based on feature selection method with whitening variational mode decomposition. Robot. Comput.-Integr. Manuf. 2022, 77, 102344. [Google Scholar] [CrossRef]
- Cho, S.; Binsaeid, S.; Asfour, S. Design of multisensor fusion-based tool condition monitoring system in end milling. Int. J. Adv. Manuf. Technol. 2010, 46, 681–694. [Google Scholar] [CrossRef]
- Zhang, X.; Lu, X.; Li, W.; Wang, S. Prediction of the remaining useful life of cutting tool using the hurst exponent and CNN-LSTM. Int. J. Adv. Manuf. Technol. 2021, 112, 2277–2299. [Google Scholar] [CrossRef]
- Ahsan, M.; Salah, M.M. Similarity index of the STFT-based health diagnosis of variable speed rotating machines. Intell. Syst. Appl. 2023, 20, 200270. [Google Scholar] [CrossRef]
- Laddada, S.; Si-Chaib, M.O.; Benkedjouh, T.; Drai, R. Tool wear condition monitoring based on wavelet transform and improved extreme learning machine. Proc. Inst. Mech. Eng. Part C J. Mech. Eng. Sci. 2020, 234, 1057–1068. [Google Scholar] [CrossRef]
- Chen, X.; Li, B. Acoustic emission method for tool condition monitoring based on wavelet analysis. Int. J. Adv. Manuf. Technol. 2007, 33, 968–976. [Google Scholar] [CrossRef]
- Mustafa, D.; Yicheng, Z.; Minjie, G.; Jonas, H.; Jürgen, F. Motor current based misalignment diagnosis on linear axes with short-time fourier transform (STFT). Procedia CIRP 2022, 106, 239–243. [Google Scholar] [CrossRef]
- Yang, Q.; Tang, B.; Deng, L.; Zhu, P.; Ming, Z. WTFormer: RUL prediction method guided by trainable wavelet transform embedding and lagged penalty loss. Adv. Eng. Inform. 2024, 62, 102710. [Google Scholar] [CrossRef]
- Feng, T.; Guo, L.; Gao, H.; Chen, T.; Yu, Y.; Li, C. A new time–space attention mechanism driven multi-feature fusion method for tool wear monitoring. Int. J. Adv. Manuf. Technol. 2022, 120, 5633–5648. [Google Scholar] [CrossRef]
- Guo, X.; Wang, K.; Yao, S.; Fu, G.; Ning, Y. RUL prediction of lithium ion battery based on CEEMDAN-CNN BiLSTM model. Energy Rep. 2023, 9, 1299–1306. [Google Scholar] [CrossRef]
- Stockwell, R.G.; Mansinha, L.; Lowe, R.P. Localization of the complex spectrum: The S transform. IEEE Trans. Signal Process. 1996, 44, 998–1001. [Google Scholar] [CrossRef]
- Xu, X.; Wang, J.; Zhong, B.; Ming, W.; Chen, M. Deep learning-based tool wear prediction and its application for machining process using multi-scale feature fusion and channel attention mechanism. Measurement 2021, 177, 109254. [Google Scholar] [CrossRef]
- Zhou, J.-T.; Zhao, X.; Gao, J. Tool remaining useful life prediction method based on LSTM under variable working conditions. Int. J. Adv. Manuf. Technol. 2019, 104, 4715–4726. [Google Scholar] [CrossRef]
- Li, Y.; Wang, X.; He, Y.; Wang, Y.; Wang, S.; Wang, S. Deep spatial-temporal feature extraction and lightweight feature fusion for tool condition monitoring. IEEE Trans. Ind. Electron. 2022, 69, 7349–7359. [Google Scholar] [CrossRef]
- Li, X.; Liu, X.; Yue, C.; Wang, L.; Liang, S.Y. Data-model linkage prediction of tool remaining useful life based on deep feature fusion and wiener process. J. Manuf. Syst. 2024, 73, 19–38. [Google Scholar] [CrossRef]
- Schmidt-Hieber, J. The kolmogorov–arnold representation theorem revisited. Neural Netw. 2021, 137, 119–126. [Google Scholar] [CrossRef] [PubMed]
- Guo, H.; Zhang, Y.; Zhu, K. Interpretable deep learning approach for tool wear monitoring in high-speed milling. Comput. Ind. 2022, 138, 103638. [Google Scholar] [CrossRef]
- Liu, Z.; Ma, P.; Wang, Y.; Matusik, W.; Tegmark, M. KAN 2.0: Kolmogorov-arnold networks meet science. arXiv 2024, arXiv:2408.10205. [Google Scholar] [CrossRef]
- Liu, Z.; Wang, Y.; Vaidya, S.; Ruehle, F.; Halverson, J.; Soljačić, M.; Hou, T.Y.; Tegmark, M. KAN: Kolmogorov-arnold networks. arXiv 2024, arXiv:2404.19756. Available online: http://arxiv.org/abs/2404.19756 (accessed on 1 October 2025).
- ISO 8688-2; Tool Life Testing in Milling; Part 2: End Milling. ISO: Vernier, Geneva, 1989.














| Sensor | Manufacturers | Model Number |
|---|---|---|
| Milling tool | Yu Ling, China | D10*30*75*4T |
| Three-directional force measuring instrument | Kistler, Switzerland | 9129AA |
| Three-axis accelerometer | Kistler, Switzerland | 8763B100BB |
| Uniaxial accelerometer | Kistler, Switzerland | 8702B100 |
| Acoustic emission sensor | Kistler, Switzerland | 8152C |
| Microphone | GRAS, Denmark | 46AE |
| Microscope | INSIZE, China | 5301-D400 |
| Tool No. | Depth of Cut (mm) | Speed (r/min) | Feed (mm/min) | Width of Cut (mm) | Material | Workpiece 1 Dimension (mm) | Workpiece 2 Dimension (mm) |
|---|---|---|---|---|---|---|---|
| T01-T10 | 1.6 | 6000 | 720 | 1.6 | Carbon-steel (0.45%C) | 300 × 160 × 40 | 90 × 60 × 30 |
| T11-T20 | 1000 | 1.6 | |||||
| T21-T30 | 720 | 1.0 |
| Case | Train Dataset | Test Dataset |
|---|---|---|
| 0 | 80% randomization in all datasets | 20% randomization in all datasets |
| 1 | 80% T01~T10 | 20% T01~T10 |
| 2 | 80% T11~T20 | 20% T11~T20 |
| 3 | 80% T21~T30 | 20% T21~T30 |
| 4 | T01~T10 and T11~T20 | T21~T30 |
| 5 | T01~T10 and T21~T30 | T11~T20 |
| 6 | T11~T20 and T21~T30 | T01~T10 |
| Parameters | Settings |
|---|---|
| Epoch | 100 |
| Loss function | MSE |
| Shape of features (C*F*R) | 11 × 5000 × 2500 |
| dK | 100 |
| KAN in the attention allocation block | [100, 512, 100] |
| KAN regression block | [70,000, 64, 32, 1] |
| The grid of the B-spline | g = 5 |
| The order of the B-spline | k = 3 |
| Optimizer | Adam |
| Learning rate | 0.0001 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Guo, D.; Liu, Y.; Sun, L.; Li, G. A Novel Kolmogorov–Arnold Attention Allocation Network for Cutting Tool Remaining Useful Life Prediction. Appl. Sci. 2025, 15, 11549. https://doi.org/10.3390/app152111549
Guo D, Liu Y, Sun L, Li G. A Novel Kolmogorov–Arnold Attention Allocation Network for Cutting Tool Remaining Useful Life Prediction. Applied Sciences. 2025; 15(21):11549. https://doi.org/10.3390/app152111549
Chicago/Turabian StyleGuo, Dingli, Yinfei Liu, Li Sun, and Guochao Li. 2025. "A Novel Kolmogorov–Arnold Attention Allocation Network for Cutting Tool Remaining Useful Life Prediction" Applied Sciences 15, no. 21: 11549. https://doi.org/10.3390/app152111549
APA StyleGuo, D., Liu, Y., Sun, L., & Li, G. (2025). A Novel Kolmogorov–Arnold Attention Allocation Network for Cutting Tool Remaining Useful Life Prediction. Applied Sciences, 15(21), 11549. https://doi.org/10.3390/app152111549

