1. Introduction
In high-speed cutting, precision machining, and automated production, the development of tool wear significantly influences the surface quality and machining accuracy of the workpiece [1,2,3,4]. High-performance evaluation and prediction of tool wear can help reduce production downtime, tool replacement frequency, and costs, and enhance overall machining efficiency [5]. With advancements in intelligent manufacturing and Industry 4.0, data-driven techniques for tool wear prediction have emerged as a research and application focus [6]. Conventional methods, including empirical formulas, physical modeling, and statistical analysis, can provide off-line predictions of tool wear conditions, but they struggle to adapt to the complexities of practical machining environments with high-dimensional feature data [7,8,9]. In high-precision and high-efficiency automated manufacturing, tool wear is influenced by a combination of cutting parameters, material properties, and coolant/lubrication types [10,11,12]. The nonlinear and dynamic nature of these factors makes it difficult for conventional methods to meet the accuracy and efficiency requirements of practical applications [13].
In response to these challenges, machine learning (ML)-based models for tool wear prediction have gained considerable traction, offering distinct advantages in predictive accuracy and efficiency. Regression techniques [14,15] and support vector machines (SVM) [16,17] provide basic predictive capabilities but require predefined model assumptions, which are less effective in handling intricate nonlinear relationships and high-dimensional datasets. For example, linear regression assumes a direct linear correlation between variables, which fails to capture the complex and dynamic nature of tool wear. Although SVM can address nonlinear patterns, it remains highly sensitive to parameter tuning and is inefficient in high-dimensional data processing. To overcome these limitations, deep learning (DL) methods have emerged as a powerful alternative for tool wear prediction. Convolutional neural networks (CNN) [18,19] and deep neural networks (DNN) [20,21] have proven highly effective in automatic feature extraction and pattern recognition. CNNs excel at identifying spatial characteristics in images or visualized data, while DNNs leverage multi-layer nonlinear mappings to model complex input–output relationships, making them particularly well suited for tool wear prediction. However, deep learning models require large amounts of labeled training data [22] and come with high computational costs, posing challenges related to data availability and processing resources in industrial applications. Recently, Gaussian Process Regression (GPR) [23] has gained significant attention due to its non-parametric nature and Bayesian inference framework. Unlike traditional regression methods, GPR does not rely on rigid model assumptions; instead, it leverages kernel functions to automatically capture complex nonlinear relationships. Compared with DL algorithms, GPR is better suited to limited datasets, as it does not require large amounts of labeled data, and with appropriate kernel functions it can effectively model diverse nonlinear relationships [24]. However, standard GPR methods suffer from prediction instability and noise sensitivity when applied to dynamically evolving wear data. Enhancing GPR with dynamic processing mechanisms and improving its predictive performance therefore remain critical research challenges.
According to the reviewed studies, online tool wear monitoring currently faces two pivotal challenges: (1) In practical machining, sensor-captured signals—such as cutting forces and vibrations—are frequently corrupted by a variety of noise sources, e.g., mechanical vibrations, electromagnetic interference, and sensor background noise. Designing effective filtering strategies that extract wear-relevant information remains a core technical hurdle in enhancing the accuracy and reliability of monitoring systems. (2) Existing monitoring models often struggle to adapt to complex machining scenarios, largely because mainstream approaches rely on single-kernel mappings between input signals and wear states. This restricts their ability to accurately capture the nonlinear evolution of tool wear and ultimately limits their generalizability and robustness.
To tackle the critical challenges of severe signal noise interference and limited model adaptability in online tool wear monitoring, this study introduces a Gaussian Process Regression (GPR) model enhanced with a machining-parameter-based nonlinear mean function and a composite kernel function, together with an LSTM-enhanced particle filter algorithm for preprocessing the input signals. The main contributions of this work are summarized as follows:
- (1) A nonlinear mean function that incorporates key machining parameters such as spindle speed, feed rate, and cutting depth is designed, which overcomes the limitations imposed by the linear assumptions inherent in traditional GPR models.
- (2) To further enhance the model's capability in handling complex wear patterns, a composite kernel function is constructed by integrating a Gaussian kernel with a Matern 5/2 kernel, which is effective in capturing abrupt changes and non-stationary behaviors.
- (3) An LSTM-enhanced particle filter algorithm is developed to address the degradation of state estimation under high-noise industrial conditions, adaptively correcting the particle distribution through the LSTM's gating mechanisms.
Through the synergistic integration of these techniques, the proposed approach significantly advances the accuracy and reliability of tool wear prediction, providing a precise and robust solution for tool condition monitoring in complex manufacturing scenarios.
2. Experiment and Data Collection
The experiment was performed on a VDL600A CNC machining center (Figure 1) for dry face milling of Inconel 718 alloy (Table 1). The workpiece had dimensions of 80 × 80 × 20 mm³. In compliance with the ISO 3685 standard [25], a four-flute TiAlN-coated carbide end mill (Φ6 mm) was used, with the machining parameters listed in Table 2. To leverage the high sensitivity of cutting force signals in tool condition monitoring, this study employed a YDCB-III05 tri-axial piezoelectric dynamometer (Ningbo Lingyuan Measurement and Control Engineering Co., Ltd., Ningbo, China) to acquire dynamic force signals in real time at a 20 kHz sampling rate. This sensor offers significantly better resistance to environmental interference than conventional temperature and acoustic emission sensors. During machining, the flank wear (VB) of the cutting edge was measured every 200 s using a VHX-7000 microscope system (Keyence Corporation, Osaka, Japan), with the average wear across the four flutes serving as the characteristic value. Experimental observations indicated that edge chipping occurred once VB exceeded 0.10 mm. Consequently, the tool life criterion was set to VB = 0.15 mm, 50% below the standard threshold. Over a total machining duration of 2000 s, a mapping database was established that correlates the time-frequency domain features of the force signals with tool wear for use by the machine learning algorithms.
In the cutting experiments, force signals were acquired with the piezoelectric dynamometer, and 30,000 data points were selected as the sampling dataset for each cutting trial. The raw signals were then processed using the LSTM-optimized particle filtering algorithm to extract effective metrics reflecting the development of tool wear. Time-domain features were extracted from the filtered force signals, including the mean value, standard deviation, root mean square, skewness, kurtosis, and peak value. For the frequency-domain metrics, spectral features such as the spectral variance, spectral skewness, spectral kurtosis, and center frequency were calculated through Fast Fourier Transform (FFT) analysis. Additionally, a three-level wavelet packet decomposition was performed using the db1 wavelet basis, and eight wavelet packet energy features were extracted. In total, 54 features were derived from the cutting force signals along the three axes to construct the initial feature space. Subsequently, the Pearson correlation coefficient between each feature and the tool wear was calculated, and features with a correlation coefficient greater than 0.6 were selected as input variables for tool wear monitoring.
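As a concrete illustration of this feature extraction pipeline, the following Python sketch computes the listed time-domain, frequency-domain, and wavelet packet energy features for a single force channel and applies the Pearson-based selection. The function names and defaults (e.g., the 20 kHz sampling rate) are illustrative and not taken from the original implementation.

```python
import numpy as np
import pywt
from scipy.fft import rfft, rfftfreq
from scipy.stats import skew, kurtosis, pearsonr

def time_domain_features(x):
    # mean, standard deviation, RMS, skewness, kurtosis, peak value
    return np.array([x.mean(), x.std(), np.sqrt(np.mean(x**2)),
                     skew(x), kurtosis(x), np.max(np.abs(x))])

def frequency_domain_features(x, fs=20_000):
    spec = np.abs(rfft(x))
    freqs = rfftfreq(len(x), d=1.0 / fs)
    p = spec / spec.sum()                        # normalized spectrum as weights
    fc = np.sum(freqs * p)                       # center frequency
    var = np.sum((freqs - fc) ** 2 * p)          # spectral variance
    sk = np.sum((freqs - fc) ** 3 * p) / var**1.5   # spectral skewness
    ku = np.sum((freqs - fc) ** 4 * p) / var**2     # spectral kurtosis
    return np.array([var, sk, ku, fc])

def wavelet_packet_energies(x, wavelet="db1", level=3):
    # three-level decomposition -> 2**3 = 8 frequency-ordered terminal nodes
    wp = pywt.WaveletPacket(data=x, wavelet=wavelet, maxlevel=level)
    nodes = wp.get_level(level, order="freq")
    energies = np.array([np.sum(np.square(n.data)) for n in nodes])
    return energies / energies.sum()             # normalized band energies

def select_features(F, wear, threshold=0.6):
    # F: (n_samples, n_features); keep columns with |Pearson r| > threshold
    keep = [j for j in range(F.shape[1])
            if abs(pearsonr(F[:, j], wear)[0]) > threshold]
    return F[:, keep], keep
```

Applied to each of the three force axes, the 6 time-domain, 4 frequency-domain, and 8 wavelet packet features reproduce the 54-dimensional initial feature space described above.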
4. Experimental Results and Analysis
Figure 3 illustrates the performance of different filtering methods, highlighting the limitations of conventional approaches. The wavelet filter exhibits noticeable fluctuations due to its thresholding strategy, which struggles to differentiate between sharp transients in the signal and noise. When significant variations occur, improper threshold selection compromises denoising effectiveness, preventing the complete restoration of high-frequency components.
The Gaussian filter suffers from excessive smoothing, resulting in signal distortion—especially at pulse peaks—where amplitude attenuation leads to the loss of critical high-frequency features. Meanwhile, the Kalman filter reveals its inherent limitations when applied to complex dynamic systems. As a linear predictive model, it fails to handle nonlinear or non-Gaussian noise effectively. Over time, accumulated errors exacerbate phase shifts, causing a marked decline in performance.
By comparison, the LSTM particle filter demonstrates a substantial advantage over traditional methods in both mean absolute error (MAE) and signal-to-noise ratio (SNR). As shown in Table 4, the LSTM particle filter achieves a 47.6% reduction in MAE compared to the wavelet filter, 54.2% compared to the Gaussian filter, and 62.2% compared to the Kalman filter. Additionally, it increases the SNR by approximately 15.4%, significantly outperforming other filtering techniques. These results highlight the superior denoising precision and signal reconstruction capability of the LSTM particle filter. The LSTM network, with its advanced gating mechanisms, effectively captures long-term temporal dependencies, allowing it to model intricate variations within the signal. Meanwhile, the particle filter dynamically updates particle weights, ensuring more accurate tracking of the true system state. This integration of deep learning and probabilistic filtering techniques enhances both estimation accuracy and robustness, making the LSTM particle filter a highly effective solution for complex signal-processing tasks.
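The paper's exact implementation of the LSTM-enhanced particle filter is not reproduced here, but a minimal sketch of the general idea, in which an LSTM proposes the state transition for each particle before likelihood weighting and resampling, might look as follows. All names, the window length, and the noise levels are illustrative assumptions, and in practice the transition network would first be trained on recorded force signals.

```python
import numpy as np
import torch
import torch.nn as nn

class TransitionLSTM(nn.Module):
    """Hypothetical network that learns a one-step state transition from a short history."""
    def __init__(self, hidden=32):
        super().__init__()
        self.lstm = nn.LSTM(input_size=1, hidden_size=hidden, batch_first=True)
        self.head = nn.Linear(hidden, 1)

    def forward(self, window):                     # window: (batch, steps, 1)
        out, _ = self.lstm(window)
        return self.head(out[:, -1])               # predicted next state: (batch, 1)

def lstm_particle_filter(obs, model, n_particles=500,
                         proc_std=0.02, obs_std=0.05, win=10):
    """Filter a noisy 1-D signal `obs` with an LSTM-guided particle filter."""
    est = np.zeros(len(obs))
    particles = np.full(n_particles, obs[0], dtype=np.float64)
    history = np.tile(particles[:, None], (1, win))          # per-particle state history
    for t, z in enumerate(obs):
        with torch.no_grad():
            x = torch.tensor(history, dtype=torch.float32).unsqueeze(-1)
            proposed = model(x).squeeze(-1).numpy()          # LSTM-proposed transition
        particles = proposed + np.random.normal(0.0, proc_std, n_particles)
        w = np.exp(-0.5 * ((z - particles) / obs_std) ** 2)  # Gaussian observation likelihood
        w = (w + 1e-300) / np.sum(w + 1e-300)                # normalize weights
        est[t] = np.sum(w * particles)                       # weighted state estimate
        idx = np.random.choice(n_particles, n_particles, p=w)  # multinomial resampling
        particles, history = particles[idx], history[idx]
        history = np.roll(history, -1, axis=1)
        history[:, -1] = particles                           # append resampled states
    return est
```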
As illustrated in Figure 4, the Mean Squared Error (MSE) of the LSTM particle filter increased by 75% compared with Figure 3, while the other filtering methods exhibited significantly larger increases: 111% for the wavelet filter, 105% for the Gaussian filter, and 126% for the Kalman filter. These results indicate that the LSTM particle filter possesses superior generalization ability, maintaining stable performance across varying noise conditions. Unlike traditional methods, the LSTM's gradient optimization mechanism dynamically adjusts the state transition parameters, reducing the bias introduced by manual tuning. Additionally, its nonlinear activation functions enhance the model's ability to distinguish intricate signal variations, ensuring stable performance even in high-noise environments.
By integrating deep learning's advanced feature extraction with the probabilistic framework of particle filtering, the LSTM particle filter simultaneously optimizes both error reduction and signal-to-noise ratio (SNR) in complex noise scenarios, as shown in Table 5. The key advantages of this approach are robust dynamic modeling, which allows it to accurately capture long-term temporal dependencies, and superior nonlinear adaptability, which enables effective handling of complex signal fluctuations and varying noise characteristics.
Compared to conventional filtering techniques, the LSTM particle filter surpasses the limitations of fixed parameter assumptions and linear model constraints, delivering a highly flexible and precise state estimation approach for nonlinear, non-Gaussian systems. Experimental results confirm that the LSTM particle filter provides a superior solution for state estimation in complex systems, demonstrating exceptional performance in dynamic environments.
As depicted in Figure 5, the nonlinear mean function exhibits superior performance in both mean absolute error (MAE) and Root Mean Squared Error (RMSE). Compared with the constant mean function, linear mean function, and neighboring mean function, the MAE decreases by 41.78%, 47.72%, and 31.09%, respectively, while the RMSE is reduced by 50.47%, 58.45%, and 42.61%, respectively. These results underscore the effectiveness of the nonlinear mean function in improving data fitting and significantly reducing prediction errors.
Figure 6 further highlights the limitations of traditional mean functions. The constant mean function can only approximate a fixed value, failing to reflect trends and complex variations in data, leading to poor predictive accuracy. The linear mean function, although capable of basic linear fitting, lacks the flexibility to model nonlinear relationships, limiting its applicability. While the neighboring mean function provides some improvement in fitting, it remains restricted to localized similarity patterns and simple approximations, preventing it from capturing global nonlinear structures in the data. The nonlinear mean function overcomes these limitations by incorporating machining parameters, which have a direct physical correlation with tool wear. By embedding these parameters into the model, the nonlinear mean function not only enhances the representation of real machining processes but also minimizes prediction errors, providing a more accurate and reliable tool wear prediction model.
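Since the exact formulation of the parameter-based mean function is not reproduced here, one practical way to realize the same idea with a standard library is to fit a nonlinear mean over the machining parameters and let the GPR model the residual wear. The scikit-learn sketch below uses a degree-2 polynomial as a stand-in for the paper's mean function; all variable names and settings are assumptions for illustration only.

```python
import numpy as np
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

# P: machining parameters per sample [spindle_speed, feed_rate, cutting_depth]
# F: selected force-signal features (Section 2);  y: measured flank wear VB

def fit_gpr_with_parameter_mean(P, F, y):
    # Nonlinear mean m(P): a degree-2 polynomial in the machining parameters,
    # standing in for the parameter-based mean function described above.
    mean_model = make_pipeline(PolynomialFeatures(degree=2), Ridge(alpha=1.0))
    mean_model.fit(P, y)
    residual = y - mean_model.predict(P)

    # The GPR then captures the residual wear left unexplained by the mean.
    gpr = GaussianProcessRegressor(kernel=RBF(length_scale=1.0),
                                   alpha=1e-3, normalize_y=True)
    gpr.fit(F, residual)
    return mean_model, gpr

def predict_wear(mean_model, gpr, P, F):
    # Final prediction = nonlinear parameter-based mean + GPR residual estimate
    return mean_model.predict(P) + gpr.predict(F)
```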
As shown in Figure 7, the multi-kernel function significantly outperforms the other kernel functions in Gaussian Process Regression (GPR). Compared to the Gaussian kernel, Matern 5/2 kernel, and rational quadratic kernel, the composite kernel reduces mean absolute error (MAE) by 67.2%, 58.7%, and 60.5%, respectively, and Root Mean Squared Error (RMSE) by 66.7%, 64.5%, and 66.3%, respectively.
Figure 8 demonstrates that the Gaussian kernel produces larger fluctuations in the predictions, particularly where the data exhibits strong nonlinear characteristics, leading to more noticeable prediction errors. On the other hand, the Matern kernel delivers smoother predictions, especially across broader data ranges. Its smoothness helps mitigate the overfitting issue commonly associated with the Gaussian kernel. However, the Matern kernel still experiences larger errors at the prediction endpoints, indicating challenges in handling boundary conditions or regions with substantial data fluctuations. The Rational Quadratic (RQ) kernel shows larger discrepancies in some localized regions, especially where sharp changes occur in the data. This results in difficulty fitting accurate predictions, leading to potential error accumulation.
In contrast, the multi-kernel function, by combining the advantages of both kernels, achieves a better balance between global and local features. It minimizes fluctuations, improves the smoothness of the predictions, and maintains lower errors across a larger data range. Notably, the composite kernel performs better than individual kernels in boundary regions, offering a more robust and accurate solution for tool wear prediction.
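For reference, a composite kernel of this form can be assembled directly from standard kernel implementations. The scikit-learn sketch below sums a Gaussian (RBF) component and a Matern 5/2 component, adds a white-noise term, and optimizes the hyperparameters by marginal likelihood; the bounds and initial values are illustrative assumptions rather than the settings used in the paper.

```python
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, Matern, ConstantKernel, WhiteKernel

# Gaussian (RBF) term for smooth global trends plus a Matern 5/2 term for rougher,
# locally varying behavior; the WhiteKernel absorbs residual observation noise.
composite_kernel = (ConstantKernel(1.0, (1e-2, 1e2)) * RBF(length_scale=1.0)
                    + ConstantKernel(1.0, (1e-2, 1e2)) * Matern(length_scale=1.0, nu=2.5)
                    + WhiteKernel(noise_level=1e-3))

# Kernel hyperparameters are tuned by maximizing the log marginal likelihood,
# restarted from several initializations to avoid poor local optima.
gpr = GaussianProcessRegressor(kernel=composite_kernel,
                               n_restarts_optimizer=5, normalize_y=True)
# Usage: gpr.fit(F_train, vb_train); vb_pred, vb_std = gpr.predict(F_test, return_std=True)
```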
The performance of the proposed multi-kernel GPR is compared with that of representative ML algorithms: SVM, Random Forest (RF), and Backpropagation Neural Network (BPNN). For SVM, a nonlinear regression framework is established using a Gaussian kernel function, with model complexity and generalization ability balanced through a regularization parameter. To enhance robustness, an ε-insensitive loss function is employed, and the kernel bandwidth is optimized in a data-driven manner. RF constructs a strong learner by integrating multiple regression trees. To mitigate overfitting, a minimum sample constraint at leaf nodes is applied, while full-dimensional feature sampling is retained to effectively capture multi-source feature interactions related to tool wear. For BPNN, a dual-hidden-layer topology is designed to strengthen nonlinear mapping capabilities. Bayesian regularization is introduced to autonomously optimize model complexity, and an early stopping criterion is used to dynamically control the training process. Optimal hyperparameters for each algorithm are determined through parameter sensitivity analysis to ensure a fair comparison.
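The baseline configurations described above can be approximated with off-the-shelf estimators. In the scikit-learn sketch below, the hyperparameter grids are assumed for illustration, and L2 weight decay with early stopping substitutes for Bayesian regularization, which MLPRegressor does not provide.

```python
from sklearn.svm import SVR
from sklearn.ensemble import RandomForestRegressor
from sklearn.neural_network import MLPRegressor
from sklearn.model_selection import GridSearchCV

# Gaussian-kernel SVR with an epsilon-insensitive loss; C and the kernel bandwidth
# are tuned on the data to balance complexity and generalization.
svm = GridSearchCV(SVR(kernel="rbf", epsilon=0.01),
                   {"C": [1, 10, 100], "gamma": ["scale", 0.1, 1.0]}, cv=5)

# Random forest with a minimum-leaf constraint to curb overfitting; all features
# are considered at each split to retain multi-source feature interactions.
rf = RandomForestRegressor(n_estimators=200, min_samples_leaf=3, max_features=None)

# Two-hidden-layer network standing in for the BPNN, with L2 weight decay and
# early stopping controlling model complexity and the training process.
bpnn = MLPRegressor(hidden_layer_sizes=(32, 16), alpha=1e-3,
                    early_stopping=True, max_iter=2000)
```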
Figure 9 presents the predicted results of the different algorithms. Although SVM can achieve nonlinear modeling through the Gaussian kernel and balance complexity via regularization, it exhibits a limited dynamic response to severe cutting force fluctuations. RF benefits from ensemble learning and feature interaction capturing; however, it still suffers from monitoring lag during the rapid tool wear phase. BPNN enhances nonlinear expressiveness through its dual-hidden-layer structure but remains constrained by convergence to local optima. As summarized in Table 6, the modified GPR achieves substantial improvements over the traditional models, with reductions in mean absolute error (MAE) of 47.67%, 61.30%, and 44.51%, and reductions in Root Mean Square Error (RMSE) of 51.08%, 66.25%, and 42.62%, respectively. Particularly during the rapid wear stage near the end of tool life, the multi-kernel GPR demonstrates superior adaptability, with its prediction curve exhibiting a 0.29–0.99% higher Pearson correlation coefficient (PCC) than the other models. To further validate the generalizability of the modified GPR, verification was conducted on the standardized and publicly available PHM2010 dataset, with datasets C1 and C4 selected for evaluation. As shown in Figure 10, the model achieved mean absolute errors of 1.264 and 2.176, Mean Squared Errors of 2.895 and 6.906, Root Mean Square Errors of 1.7015 and 2.628, and Pearson correlation coefficients of 0.9984 and 0.9976, respectively. These results demonstrate that the proposed model maintains excellent adaptability across varying operational conditions.
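The evaluation metrics reported here (MAE, MSE, RMSE, and PCC) can be reproduced from the predicted and measured wear curves with a few lines; the helper below is a generic sketch, not the authors' evaluation script.

```python
import numpy as np
from scipy.stats import pearsonr
from sklearn.metrics import mean_absolute_error, mean_squared_error

def wear_metrics(vb_true, vb_pred):
    """Return the four metrics used to assess the wear prediction models."""
    mse = mean_squared_error(vb_true, vb_pred)
    return {"MAE": mean_absolute_error(vb_true, vb_pred),
            "MSE": mse,
            "RMSE": float(np.sqrt(mse)),
            "PCC": float(pearsonr(vb_true, vb_pred)[0])}
```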
5. Conclusions
This study proposes a tool wear monitoring system based on the fusion of multi-kernel Gaussian Process Regression (GPR) and intelligent filtering, achieving a notable breakthrough in the engineering applicability of process condition monitoring in manufacturing. By addressing the noise suppression limitations and nonlinear modeling deficiencies of traditional approaches, the proposed system demonstrates significant engineering value:
(1) To accommodate the complex industrial conditions commonly encountered in workshops—such as vibration and electromagnetic interference—this study employs an LSTM-enhanced particle filter algorithm. By learning temporal features, the algorithm adaptively adjusts particle weights in real time. Experimental results show that under typical industrial environments, this method reduces the mean absolute error (MAE) of state estimation by at least 47.6% relative to the traditional wavelet, Gaussian, and Kalman filters and improves the signal-to-noise ratio (SNR) by approximately 15.4%. This significant improvement greatly enhances the reliability of online monitoring for CNC equipment and provides robust technical support for real-time condition tracking in industrial production.
(2) A nonlinear mean function was constructed to model the mathematical relationships between key process parameters—such as cutting depth, spindle speed, and feed rate—and tool wear. The results indicate that the proposed model outperforms the conventional constant, linear, and neighboring mean functions, achieving MAE reductions of at least 31.09%. This advancement provides a solid theoretical foundation for process optimization and supports accurate control and maintenance decision-making in industrial machining operations.
(3) By combining Gaussian and Matern kernels, the system not only maintains stability in modeling smooth signal trends but also significantly improves the capability to capture abrupt feature changes. In the context of challenging tasks such as high-hardness material machining, the prediction model achieves reductions of 58.7% in MAE and 64.5% in RMSE relative to the single Matern 5/2 kernel, greatly enhancing its stability and reliability for real-world industrial applications.
This research offers a scalable and engineering-practical approach to predictive maintenance in intelligent manufacturing. The proposed algorithmic framework is versatile and adaptable to various CNC machining scenarios. However, several real-world challenges must still be addressed in practical applications, such as physical constraints in sensor deployment and the impact of fluctuating data quality on model performance. To further enhance the system’s stability and generalization capability in complex industrial environments, future work will focus on the fast reconstruction of adaptive kernel functions and model updating strategies under dynamic operating conditions. In addition, we will explore the deep integration of heterogeneous multi-source sensor data—including vibration, acoustic emission, and current signals—to construct a more robust multi-modal perception framework. This will strengthen the model’s adaptability to unknown and extreme conditions, ultimately enabling stable deployment and long-term operation in real production environments.