Next Article in Journal
Dual-Mode Manhole Cover Alarm Based on Triboelectric Nanogenerators for Smart City Infrastructure Monitoring
Previous Article in Journal
A Verifiable Steady-State Frequency–Velocity Mapping for Desktop FDM Printers Based on an Electromechanical Coupling Framework
Previous Article in Special Issue
A Digital Twin-Enabled Framework for Agrivoltaic System Design, Simulation, Monitoring and Control
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction of Groove Depth in Femtosecond Laser Ablation via Attention Mechanism and Monotonic Constraint

1
School of Mechanical Engineering, Guangxi University, Nanning 530004, China
2
School of Engineering, RMIT University, Melbourne, VIC 3000, Australia
3
School of Aerospace Engineering, Xiamen University, Xiamen 361005, China
4
Department of Finance, Nankai University Binhai College, Tianjin 300270, China
*
Author to whom correspondence should be addressed.
Machines 2026, 14(5), 509; https://doi.org/10.3390/machines14050509
Submission received: 14 February 2026 / Revised: 15 April 2026 / Accepted: 23 April 2026 / Published: 3 May 2026

Abstract

Femtosecond laser ablation (FLA) is efficient for the machining of micro-groove arrays on the surface of ultrahard cutting tools. The depth of the groove determines the precision and efficiency of ablation. In this study, an “Attention-based Monotonic Physics-Guided Neural Network” (AM-PGNN) algorithm is proposed to accurately predict groove depth in the FLA of tungsten carbide (WC). The new algorithm incorporates machining parameters directly governing the energy deposition and thermal accumulation, thereby determining the prediction of the micro-groove depth generation. By embedding the physics-guided monotonic relationships of parameter depth into the learning process, a dedicated physical loss coupled with an attention mechanism to enable adaptive feature weighting is constructed, which strengthens the representation of causal dependencies. Experimental data for training and testing are obtained from the FLA of WC with different machining parameters. Comparison between AM-PGNN and typical algorithms, including a Support Vector Machine (SVM), Deep Neural Network (DNN), Convolutional Neural Network (CNN), Gradient Boosting Decision Tree (GBDT), and a conventional PGNN, demonstrates that the proposed AM-PGNN achieves superior prediction accuracy. Moreover, AM-PGNN attains a physical consistency degree (PCD) of 100%, indicating strict adherence to monotonicity consistent with the actual situation. AM-PGNN also exhibits enhanced robustness to input perturbations, as reflected by reduced standard deviation (Std) and normalized absolute deviation (NAD). Finally, AM-PGNN is shown to be applicable in the FLA of different materials through additional experiments on Cu and SiC, achieving R2 values above 0.93 while maintaining a PCD of 100%.

1. Introduction

Tungsten carbide (WC) tools are widely used in industry, due to their high hardness, wear resistance, and strength retention at high temperatures [1,2]. To further enhance the performance, microstructures such as micro-dents or micro-grooves are fabricated on the tool surface, which can significantly improve the friction characteristics and wear resistance and enhance anti-adhesion capabilities [3]. However, the high hardness and brittleness of WC limit the machining efficiency and surface finish when fabricating microstructures via conventional machining methods. With the development of laser technology, femtosecond laser ablation (FLA) is emerging as a promising solution to the high-performance micromachining of ultrahard materials, because of its high precision, minimal surface damage, and non-contact processing characteristics [4,5,6]. The ultrashort pulse duration of FLA enables the near“cold processing” material removal mechanism, which reduces or even eliminates the heat-affected zone (HAZ) [7].
The dimensions of the microstructures determine the precision of FLA, which is governed by the material removal mechanism at different machining parameters. The FLA of WC involves complex processes such as multi-parameter coupling, multi-scale physical evolution, and nonlinear responses; furthermore, the ablation behavior of the WC phase and Co phase is different, which further increases the uncertainty of machining responses [8,9]. Currently, physics-based modeling and machine learning (ML)-based modeling are typical methods for analyzing FLA processes. Physical models elucidate microscopic mechanisms of laser/solid interactions from first principles. For example, Li et al. [10] revealed the evolution of electron temperature and lattice temperature based on the two-temperature model, establishing a theoretical model for FLA of fused SiO2. Chen et al. [11] developed a multi-physics model with transient material properties to investigate the FLA of Ti6Al4V. Song et al. [12] developed a phenomenological laser ablation model to predict the ablation morphology of fused SiO2 under single-pulse and multi-pulse FLA processes. Wang et al. [13] developed a numerical model to reveal the influence of focus position on the surface morphology during multi-pulse FLA. These physical models provide profound theoretical insights; however, substantial computational resources are required for the simulation of multi-field coupling, which is not efficient for real-time optimization or online monitoring. Furthermore, most physical models simplify complex factors such as material heterogeneity and plasma shielding effects, which impacts prediction accuracy.
In comparison, ML-based methods are increasingly applied in the monitoring or prediction of laser machining responses [14,15,16]. For example, Campanelli et al. [17] simulated the laser milling process at maximum machining speed via artificial neural networks (ANNs), with actual errors below 5%. Zhang et al. [18] developed an ANN model with laser power, scanning interval, and scanning speed as input, achieving 98% accuracy in the prediction of material removal efficiency during femtosecond laser bone drilling. Li et al. [19] proposed a method that integrates machine learning with multi-objective optimization for predicting and optimizing the surface parameters of CFRP following femtosecond laser processing. However, large amounts of datasets are required for the training of NN models, and it is time-consuming and less cost-effective to obtain high-quality datasets through FLA experiments requiring precision equipment and strict environmental control [20]. Under limited sample conditions, complex models are prone to overfitting, while simple models struggle to capture nonlinear relationships. Moreover, ML-based models can achieve good fitting results on specific datasets, meanwhile completely ignoring considerations of physical processes such as laser energy transfer and phase transformation, leading to the lack of interpretability regarding physical mechanisms [21,22]. The absence of physical information not only increases the model’s dependency on data volume, but also limits the exploration of physical laws during parameter optimization. Therefore, developing ML-based algorithms that incorporate physical prior knowledge and establishing multi-scale coupling models between process parameters and machining responses has become a key breakthrough for enhancing model generalization and mechanistic interpretability. Physics-Informed Neural Networks (PINNs), which integrate mechanistic principles with data-driven modeling, address the limitations of traditional physical models in accurately capturing complex scenarios and those of pure data-driven models that often violate physical constraints [23]. PINNs convert validated physical laws into constraint conditions and embed these constraints into an NN [24], which can effectively capture causal relationships and correlations, thereby achieving interpretable prediction results with high accuracy. Furthermore, the integration of physical laws enhances the model’s generalizability [25,26]. Recently, scholars have embedded various physical models into NNs [27,28]. Masi et al. [29] proposed a Thermodynamics-based Artificial Neural Network (TANN), which can generate physically consistent predictions. Xie et al. [30] embedded heat transfer laws into the loss function of an NN to simulate the temperature field in single-layer and multi-layer DED, achieving a mean relative error of 4.83%.
If physical mechanisms can be expressed analytically, PINN can significantly enhance the accuracy and interpretability of ML models. However, complex mechanisms involved in FLA of WC currently defy a comprehensive mathematical description, limiting the application of typical PINN. To overcome the challenge of the mathematical expression of complex physical processes, introducing physical monotonicity constraints provides an innovative approach. Zhu et al. [31] applied monotonicity constraints to an Extreme Learning Machine (ELM) model, enhancing its generalization ability. Wang et al. [32] introduced the physical relationship between superheated steam temperature (SST) and spray water flow as inequality constraints, and embedded the constraint into the loss function of a Long Short-Term Memory (LSTM) network, improving the prediction accuracy of SST. Ren et al. [33] improved the prediction accuracy of syngas composition by incorporating physical monotonicity constraints during training. Xie et al. [34] established an NN algorithm guided by extreme events and monotonic relationships to simulate the rainfall–runoff process, achieving significantly improved performance compared to a conventional NN. Zhu et al. [35] embedded three typical monotonic relationships in boiler NOx emissions into an NN model; the proposed NN outperformed traditional models in accuracy, interpretability, and generalization. Therefore, the model proposed in this study adopts the Physics-Guided Neural Network (PGNN) method, which was first proposed by Karpatne et al. [36] and fully defined by Karniadakis et al. [37] in Nature Reviews Physics. Integrating physical monotonicity as a loss term into the NN model, rather than relying on massive data to reduce costs, is a strategy to enhance model robustness in sample-limited scenarios and is particularly suitable for laser processing. In FLA, the relationships between key process parameters including laser fluence, repetition frequency, scanning speed and micro-groove depth are governed by fundamental physical mechanisms. Specifically, higher laser fluence increases energy deposition per unit area, leading to higher MRR and deeper grooves; higher repetition frequency promotes thermal accumulation, thereby increasing ablation efficiency; higher scanning speed reduces pulse overlap and energy input, resulting in the reduction in groove depth. The proposed PGNN model is designed to incorporate physical relationships into the learning process. This feature is useful in engineering problems where experimental data are often limited. By embedding such monotonicity constraints, the model can avoid unphysical predictions and improve generalizability.
In this study, a Physics-Guided Neural Network based on monotonicity constraints and an attention mechanism (AM-PGNN) is developed to achieve high-precision prediction of micro-groove depth in FLA. The key innovation of this algorithm lies in incorporating the monotonicity constraints between the principal process parameters and the micro-groove depth into the learning objective, with these constraints being directly embedded in the loss function. This integration ensures the model’s physical consistency, even when training on limited datasets. Furthermore, an attention mechanism is introduced to dynamically identify key input parameters, thereby enhancing the model’s ability to capture parameter interactions. A physical inconsistency loss function is added to the optimization objective, and Particle Swarm Optimization (PSO) is employed for training. The performance of AM-PGNN and other ML algorithms is compared to validate their advantages in prediction accuracy and model interpretability, and validation experiments on Cu and SiC further demonstrate their applicability to the FLA of different materials. The findings of this study underline significant advantages of AM-PGNN in both predictive accuracy and the potential for practical applications.

2. Methodology

2.1. Femtosecond Laser Ablation (FLA) Experiment and Data Processing

The FLA of micro-grooves was performed on a femtosecond laser platform (Figure 1). This system consists of a fiber femtosecond laser source (YSL Photonics Company Ltd., Wuhan, China) and a high-precision three-axis motion table (Precision Industry, Dongguan, China). The table has a positional accuracy of 1 μm in three directions. Table 1 details the laser specifications. Prior to experimentation, all samples (50 mm × 50 mm × 3 mm) were polished. Subsequently, the samples underwent 20 min ultrasonic cleaning. Laser fluence, repetition frequency, and scanning speed are the major machining parameters in FLA, and the range of parameters was optimized in our preliminary experiments. As listed in Table 2, laser fluence was set at eight levels (0.98–5.53 J); repetition frequency was set at five levels (0.025–0.3 MHz); and scanning speed was set at five levels (10–100 mm/s). The ranges of the parameters are determined following the criteria that the ablation be implemented with limited effects of plasma shielding, and the proposed monotonic constraints are applicable in such a process window. This is because FLA exhibits more complex behaviors out of the parameter range in Table 2, such as threshold effects, shielding effects, incubation effects, and saturation effects, and these effects could limit the machining efficiency of FLA. The depths of the micro-grooves were measured using a digital microscope system (VHX-7000, Osaka, Japan). The depth of each micro-groove was calculated as the average of ten measurements. The measurement uncertainty is estimated to be ±0.1 μm based on calibration standards. This uncertainty is negligible compared to the depth range of 0.06–31 μm. This ensures the reliability of the data for model training and validation. High measurement accuracy minimizes noise in the dataset. This allows the model to learn physical relationships effectively without significant bias from experimental errors. This study focuses on predicting the depth of micro-grooves from a single laser scan. Target depth in practical machining is often achieved through multiple passes. However, the removal behavior of a single scan is the physical basis for the cumulative effect.
The dataset contains 200 micro-grooves fabricated with different parameter sets. This includes 130 samples for training and 70 samples for validation. Preliminary calculations were conducted using different dataset splits, specifically 140/60, 130/70, and 120/80. For each split, the accuracy and generalization performance after repeated training were compared. The 130/70 split was ultimately selected as it showed the most stable results. Five-fold cross-validation was adopted to ensure robustness and training sufficiency. Normalization was applied to the raw data to enhance training stability. To assess reproducibility, an independent repeated-machining verification experiment was further conducted for six representative parameter groups, as described in Section 4.1. This maintains consistent data distribution and mitigates issues such as gradient explosion. Furthermore, normalization accelerates convergence and improves generalization by standardizing feature scales [38]. Min–Max Normalization is a widely adopted method that linearly maps data to a fixed interval. This ensures all features are on the same standardized scale. The formula is expressed as follows:
x = x x m i n x m a x x m i n
where x is the original value, x m i n is the minimum value of the variable in the dataset, x m a x is the maximum value of the variable in the dataset, and x is the normalized data. In this study, linear mapping was adopted to normalize the data to a fixed interval ([0, 1]). This preprocessed data improves quality and facilitates the modeling of relationships between input and output variables. Consequently, prediction reliability and accuracy are enhanced.

2.2. Micro-Groove Depth Prediction Scheme

Most NN-based models require large volumes of high-quality experimental data. However, acquiring such data is often challenging and costly. With limited data, NN-based models cannot be adequately trained. They become susceptible to overfitting. This leads to substantial prediction deviations during testing. To address these issues, monotonicity constraints between inputs and outputs are incorporated into the NN. Additionally, an attention mechanism is introduced. This mechanism accounts for the varying influence of each input on the output during training. Consequently, a Physics-Guided Neural Network model integrated with both an attention mechanism and monotonicity constraints was developed. It is expressed by the following equation:
y ^ = f p h y s i c s θ , η , λ
where f p h y s i c s is the proposed NN model; y ^ represents the predicted value; θ denotes the model parameters; η is the physical consistency constraint parameter; λ is the weighting factor for each term in the loss function.
The AM-PGNN balances the influence of training data volume and physical consistency constraints on the model’s learning ability. This algorithm addresses the fundamental limitations of two conventional models. First, physical models rely strictly on theoretical assumptions. They often neglect experimental data, resulting in limited expressiveness. These models fail to capture complex nonlinear dynamics in femtosecond laser processing (e.g., plasma shielding effects). This leads to insufficient prediction accuracy. Second, NN-based models possess strong fitting capabilities. However, they depend heavily on massive datasets and lack physical constraints. With limited data, generalizability reduces and prediction errors increase. Outputs often violate basic physical laws. This results in poor physical consistency and a lack of interpretability. AM-PGNN resolves these issues by adaptively correcting theoretical deviations through data integration. This enhances the accuracy of physical models. Furthermore, physical loss functions with enforced monotonicity constraints are embedded. This eliminates physical distortions in data-driven models. Consequently, AM-PGNN achieves high-precision predictions while ensuring physical consistency, interpretability, and strong generalization. The scheme of AM-PGNN is shown in Figure 2.

3. Attention-Based Monotonic Physics-Guided Neural Network (AM-PGNN) Network Architecture

3.1. Monotonicity-Constrained Physics-Guided Neural Network

The key innovation of this study lies in embedding fundamental monotonic relationships—dictated by critical physical mechanisms such as laser energy deposition and material removal—as strong prior knowledge into the model. This approach significantly enhances the accuracy, interpretability, and generalizability of predictions, overcoming the tendency of traditional data-driven models to diverge from actual physical processes.

3.1.1. Basic Architecture

The core of AM-PGNN remains an ANN model, structured with one input layer, one hidden layer, and an output layer. This can be denoted as an m n 1 architecture, where m and n represent the number of neurons in the input and hidden layers. Define N training samples as { ( x ( i ) , y ( i ) ) } i = 1 N , i = 1 , , N , x ( i ) R n . Here, x denotes the input vector of a sample, and y is the corresponding output. The model’s prediction for a given input can be expressed as
y ^ i = f x i = β g w x i + b
where y ( i ) represents the model’s predicted value, f ( · ) denotes the mapping function embodied by the neural network model, w is the weight matrix connecting the input layer to the hidden layer, β is the weight vector connecting the hidden layer to the output layer, g ( · ) is the sigmoid activation function, and b is the hidden layer’s bias vector.
The training process of an NN involves optimizing the internal parameters ( w , β , and b ) through backpropagation. The goal is to minimize the error L r between the predicted output y ^ i and the actual target value y i , defined as
L r = i = 1 N y i y ^ i 2

3.1.2. Formulation and Analysis of Physical Monotonic Relationships

During FLA, micro-groove depth exhibits inherent monotonic relationships with the three machining parameters, as determined by underlying physical mechanisms. From a mathematical perspective, the monotonic relationship between the model output y ^ and an input variable x j can be formally evaluated through the sign of the corresponding partial derivative y ^ x j . Given an observed dataset, a strictly positive value of this derivative indicates that y ^ increases monotonically with respect to x j , whereas a negative value implies a decreasing monotonic trend. Under the proposed model structure, the analytical form of the partial derivative of y ^ with respect to the j -th input variable is derived as follows:
y ^ x j = β g w x + b x j = β 1 g w 11 x 1 + + w 1 n x n + b 1 + + β N g w N 1 x 1 + + w N n x n + b N x j = k = 1 N β k w k j g w k x + b k 1 g w k x + b k , j = 1,2 , , n
The physical mechanisms of FLA were analyzed to determine the monotonic relationships between the micro-groove depth and laser parameters. These parameters are laser fluence, repetition frequency, and scanning speed. This analysis establishes specific monotonicity formulations.
Figure 3 illustrates the physical principles of FLA. Laser fluence (LF) determines the energy density (J/cm2) deposited per unit area. Increasing fluence raises the photon density. This enhances multiphoton absorption and avalanche ionization. Consequently, the density of excited free electrons in the material surface layer increases. These electrons transfer energy to the lattice via electron–phonon coupling. The local temperature rises rapidly when the lattice energy density exceeds the ablation threshold. This triggers non-equilibrium phase transitions. Higher fluence heats and removes more material per unit area. Removal occurs through mechanisms such as thermal expansion, phase explosion, or Coulomb explosion. This process expands the molten and vaporized regions and deepens the micro-groove (Figure 3b). This monotonic relationship is mathematically expressed as
y ^ L F x > 0
Repetition frequency (RF) determines the number of pulses irradiating a single point per unit time. High RF results in pulse intervals shorter than the thermal diffusion time. Consequently, heat from a previous pulse does not fully diffuse before the next pulse arrives. This causes continuous heat accumulation in the processing zone. Accumulated heat raises the local temperature significantly. Subsequent pulses reach the ablation threshold with lower energy density. This effectively reduces the apparent ablation threshold. Furthermore, sustained high temperatures maintain the molten region for a longer duration. This expands the molten scope and promotes secondary ablation. Thermal accumulation increases the effective energy deposition per pulse. Therefore, increasing RF enhances material removal efficiency and increases micro-groove depth (Figure 3c). This monotonic relationship is mathematically expressed as
y ^ R F x > 0
Scanning speed (SS) determines the velocity of the laser focus across the surface. It directly affects the spatial overlap of adjacent pulses and the total laser interaction time. Lower scanning speed results in longer laser dwell time per unit length. This significantly increases the number of pulses at the same spatial location:
N = D · f v
where D is the beam spot diameter, f is the repetition frequency, and v is the scanning speed.
More pulses result in a higher total energy deposition. At low scanning speeds, pulses overlap significantly. Energy is deposited multiple times in the same area. This intensifies thermal accumulation and material removal. Extended interaction time allows molten material to be expelled from the ablation crater. Surface tension and vapor pressure drive this expulsion. Consequently, recasting and redeposition are reduced. Subsequent pulses act effectively on fresh material layers. This deepens the micro-groove (Figure 3d). Conversely, high scanning speeds result in low pulse overlapping. The energy received per unit area decreases drastically. The interaction time is insufficient for effective material expulsion before resolidification. This increases the recast layer thickness and limits ablation depth. Thermal accumulation is also weakened. Therefore, a monotonically decreasing relationship exists between scanning speed and micro-groove depth:
y ^ S S x < 0
To verify the monotonic relationships among laser fluence, repetition frequency, scanning speed, and groove depth within the selected process window, a controlled variable analysis was conducted for LF, RF, and SS, respectively. Specifically, when examining the effect of a single variable on groove depth, the remaining variables were held constant. The data were then partitioned into several subsequences, and the corresponding output sequence ({ y i }) was arranged in ascending order. The first-order differences were subsequently computed as
y i = y i + 1 y i , i = 1 , 2 , , n 1
The consistency of the signs of the differences is used as the criterion for monotonicity. When all y i > 0 (or y i < 0 ), the relationship between the variable and the output is considered to be monotonically increasing (or decreasing). In this context, indicates a positive correlation between the variable and the groove depth, whereas indicates a negative correlation. As shown in Table 3, all groups exhibit monotonicity, indicating that each variable exhibits a stable monotonic relationship with the groove depth ( y ).
In addition, several groups were selected for visualization. As shown in Figure 4, the results are consistent with the conclusions drawn from the overall analysis.
The AM-PGNN model incorporates these physical relationships into the monotonicity formulation (Equation (5)).

3.1.3. Physical Information Loss Function

Building upon conventional neural networks, this study fully incorporates prior monotonic relationships between the input parameters and the output. Two additional constraints are introduced into the loss function during training: structural loss L s and physical inconsistency loss L p , which enhance the model’s performance and interpretability when predicting results for unknown samples, as described by Equations (11) and (12):
L s = j = 1 n k = 1 N ( w j k ) 2 + k = 1 N ( β k ) 2
L p = j = 1 n i = 1 T j 1 m j i × sign y ^ x j x i 2
where sign θ = 1 if θ < 0 , and sign θ = 1 if θ 0 . x ( i ) denotes the i -th training sample; m j ( i ) 1 , 1 is the monotonicity factor indicating the expected direction between the j -th input and output: 1 for positive monotonic relationships and −1 for the negative; T j is the number of samples considered for the monotonic constraint associated with input x j . Intuitively, if s i g n y ^ x j agrees with the prescribed direction m j ( i ) , the corresponding penalty is suppressed; otherwise, the penalty is activated, thereby discouraging violations of the physical monotonic trend. The total loss function for the AM-PGNN can be expressed as Equation (13):
L = L d a t a + λ s L s + λ p L p = i = 1 N ( y ^ i y i ) 2 + λ s j = 1 n k = 1 N ( w j k ) 2 + k = 1 N ( β k ) 2 + λ p j = 1 n i = 1 T j 1 m j i × sign y ^ x j x i 2
where λ s and λ p denote the weightings associated with structural loss and physical inconsistency loss, respectively. Data-driven regression loss L d a t a quantifies the discrepancy between observed and predicted outcomes. Structural loss L s functions as a regularization component, aiming to mitigate overfitting while improving the model’s generalizability. In addition, physical inconsistency loss L p quantifies the extent to which the learned model violates the prescribed physical monotonicity constraints.

3.2. Attention Mechanism

In many operating regimes, the input variables contribute unequally to the target response. To account for such heterogeneous feature contributions, we incorporate a feature-wise attention gating module inspired by Cai et al. [39] into the physics-guided monotonic neural network. Specifically, given the input vector X = [ x 1 , x 2 , , x M ] , the gate produces non-negative attention coefficients α 0 , 1 M and yields a reweighted representation X = α X , which is subsequently fed into the predictive network. In contrast to monotonicity-based constraints, which enforce a consistent input–output trend without assigning weights to individual input features, the proposed gating mechanism enables adaptive assignment of feature weights, whereas monotonicity constraints maintain directionality consistent with the underlying physical meaning. The combination improves predictive accuracy and robustness, especially under limited-data settings.
It is important to note that, while the weighted feature vector X serves as the input for predicting y ^ , physical inconsistency constraints are still based on the partial derivatives of original inputs. Although X undergoes numerical changes, the model maintains three independent channels for LF, RF, and SS. For example, when calculating y ^ L F , the network gradient can still backpropagate to the original LF through α 1 L F .

3.3. Training the AM-PGNN Model

Preliminary calculations demonstrated that the non-continuous and non-differentiable nature of the loss function compromised the stability of gradient-based optimizers such as Adam. As shown in Figure 5, the gradient-free optimization algorithm (PSO) provided more stable convergence. Similar strategies have been applied to optimization problems with non-differentiable constraints [40]. The final configuration determined through five-fold cross-validation consists of a network structure of 3-22-1, with a structural loss weight λ s of 0.001, a physical inconsistency loss weight λ p of 0.06, a population size of 100 particles, and a maximum of 1000 iterations. An early-stopping strategy was applied when the objective function converged, which may terminate the optimization before reaching the maximum number of iterations. This approach allows the model to converge to a solution that balances data fitting accuracy with physical constraint requirements.
In summary, physically monotonic relationships between laser fluence, repetition frequency, scanning speed, and micro-groove depth are embedded as physical consistency constraints in the proposed prediction model, together with an adaptive feature-weighting attention mechanism. The overall AM-PGNN architecture is shown in Figure 6.

3.4. Metrics for the Evaluation of the Model’s Performance

To evaluate the performance of the AM-PGNN, this study benchmarked typical algorithms: SVM, DNN, CNN, GBDT, and PGNN. The hyperparameters for SVM, GBDT, PGNN, and AM-PGNN were optimized using PSO, while DNN and CNN were optimized using the Adam optimizer. Furthermore, DNN, PGNN, and AM-PGNN share the same underlying network architecture. Each model was optimized using the corresponding strategy to achieve its best performance. To ensure a fair and reliable comparison, each model was trained and evaluated ten times with different random initializations. The average values of the evaluation metrics were reported. In addition, the Std of the results from the ten runs was calculated to reflect the stability of the models. Numerical computations were carried out in MATLAB R2023b, and prediction accuracy is assessed using three standard statistical indicators:
Coefficient of determination ( R 2 ) reflects the model’s fitting accuracy to experimental data. A higher R 2 indicates better alignment between the model and experimental data. R 2 is defined as
R 2 = 1 i = 1 N y i y ^ i 2 i = 1 N y i y ¯ 2
where N represents the total number of samples; y i and y ^ i correspond to the measured and predicted responses of the i -th sample, respectively; and y ¯ denotes the mean value of the measured observations.
Root Mean Square Error ( R M S E ) represents the absolute fit between the model and the data and possesses the property of using the same units as the response variable. R M S E is defined as follows
R M S E = 1 N i = 1 N y i y ^ i 2
The standard deviation ( S t d ) is used to evaluate the stability of each model. It is defined as follows
S t d = 1 n 1 i = 1 n y i y ¯ 2
The physical consistency degree (PCD) evaluates the agreement between the model’s predictions and the actual physical monotonicity rules. PCD can be calculated using the following equation:
P C D = 1 T i = 1 T 1 β j × s i g n o x j x i 2 × 100 %
where x i refers to the i -th data sample and T denotes the number of samples that satisfy the monotonic relationship between the j -th input variable and output. β j is defined to encode the direction of physical monotonicity: −1 for positive monotonic relationships and 1 for the negative.
For computational convenience, the partial differential equation is solved using finite difference approximation:
o x j l i m Δ x j 0 f x 1 , x 2 , , x j 1 , x j + Δ x j , x j + 1 , , x n f x 1 , x 2 , , x j 1 , x j , x j + 1 , , x n Δ x j
where Δ x j denotes a small perturbation approaching zero.

4. Results and Discussion

This section first evaluates the reproducibility of the experimental data and the predictability of the model. Subsequently, the performance of the developed model is comprehensively evaluated from four perspectives: (1) three metrics are employed to analyze models’ predictive ability; (2) the interpretability and generalization ability of the AM-PGNN model are investigated, focusing on the monotonic behavior; (3) a robustness analysis is performed to assess model’s sensitivity to input perturbations and parameter uncertainty, where the stability of the predictions is quantitatively characterized; (4) additional FLA experiments are conducted to further validate the generalization ability of AM-PGNN under different machining scenarios.

4.1. Statistical Analysis of the Measurement of Micro-Groove Depth

To rigorously validate the reproducibility of the experimental data and the predictability of the model, repeated-machining verification was performed using representative process parameters (Table 4) [41]. Under identical laser parameter combinations, three micro-grooves were machined for each set, designated G1–G3. For each groove, 30 measurement points ( i 1 ,   2 ,   ,   30 ) were selected at equal intervals from the starting point to the endpoint. A measurement strategy consistent with the dataset construction was employed to minimize stochastic reading fluctuations.
For each groove, the longitudinal coordinate is normalized as s 0 , 1 . The depth of the i th measurement is denoted as D k ( s i ) (where k = 1 , 2 , 3 corresponds to G1–G3). The representative overall depth for each groove is defined as the arithmetic mean along the groove D ¯ k :
D ¯ k = 1 30 i = 1 30 D k s i
Depth variations along the groove are governed by both the average MRR and factors including thermal accumulation, material heterogeneity (e.g., WC/Co phase differences), molten recast, and spatter redeposition, all of which induce local fluctuations. To determine whether these fluctuations constitute “random noise” or “systematic patterns,” a first-order difference analysis is performed on the depth sequence D k ( s i ) for each groove:
Δ D k i = D k s i + 1 D k s i , i = 1,2 , , 29
Figure 7 illustrates the results for P1, P3, and P5, which are selected based on their representative nature in covering the range of process parameters across shallow and deep grooves. Results show that, for all three grooves under the same parameters, Δ D k exhibits zero-centered random fluctuations. The mean values are close to zero, and the difference distributions are approximately symmetric. Furthermore, the Autocorrelation Function (ACF) shows no significant peaks at non-zero lags, indicating that the depth fluctuations along the groove are primarily zero-mean random perturbations, with no evidence of systematic drift or periodic regularity. These findings suggest that local depth variations are mainly caused by microscopic disturbances and do not constantly increase or decrease along the scanning direction. Consequently, D ¯ k serves as a stable, representative scalar response, and the fluctuations in depth can be considered zero-mean random errors superimposed on the actual removal depth, providing a statistical basis for subsequent reproducibility assessment and model predictability verification.
To unify the comparison standards between the repeated experimental results and outputs, this study defines the experimental benchmark depth as the grand mean of the three grooves’ depths:
D r e f = 1 3 k = 1 3 D ¯ k
Machining reproducibility is quantified by the inter-groove standard deviation:
σ i n t e r = 1 2 k = 1 3 D ¯ k D r e f 2
Fluctuation along the groove is characterized by the intra-groove average standard deviation:
σ i n t r a = 1 3 k = 1 3 1 29 i = 1 30 D k s i D ¯ k 2
To facilitate comparisons across different depth magnitudes, the coefficient of variation ( C V ) is introduced:
C V = σ D r e f
Statistical analysis is performed on the three repeated micro-grooves G1–G3. As shown in Table 5, the D r e f values for the six parameters sets are 1.067, 4.12, 4.921, 21.211, 22.898, and 30.575 µm. σ i n t e r ranges from 0.046 to 0.663 µm, and σ i n t r a , ranges from 0.106 to 0.916 µm. Normalizing these values using the coefficient of variation, the C V i n t e r values are between 0.012 and 0.043, and the C V i n t r a values are between 0.027 and 0.099. With the exception of the shallow groove condition (P1), C V i n t e r generally remains within the 0.012–0.022 range, indicating excellent consistency in average depth among repeated grooves under identical parameters. The relatively large C V i n t r a for the P1 group (0.099) is primarily attributed to the increased relative impact of measurement uncertainty at smaller depths. In contrast, for grooves with larger depth, although the absolute fluctuation amplitude ( σ i n t r a ) increases with depth, the normalized C V i n t r a remains stable at approximately 0.028–0.033, suggesting that random fluctuations along the path remain proportional to the depth magnitude.
In the predictability verification, the benchmark depth D r e f derived from three repeated grooves serves as the ground truth, and the model output is denoted as D p r e d . The prediction error is defined as
ε = D p r e d D r e f
Both the absolute error | ε | and the relative error ε % are reported:
ε % = | ε | D r e f × 100 %
An engineering depth control tolerance of ±2 µm is adopted as the acceptance criterion [42]; predictions are considered satisfactory if ε < 2 µm.
The results in Table 6 demonstrate that model predictions for all six parameter groups satisfy the ±2 µm tolerance. For P1–P3, the absolute errors ε are 0.031, 0.034, and 0.082 µm, indicating that the model maintains high absolute accuracy at small depth magnitudes. It is worth noting that the groove with small depth (e.g., P1) exhibits higher sensitivity in terms of relative error (2.89%), which is mainly due to the amplification of absolute deviations of the same order of magnitude when divided by a small benchmark depth. Therefore, ε % serves to quantify the degree of relative deviation across different depth magnitudes, aiding in assessing model sensitivity under shallow conditions. For P4–P6, errors range from 0.588 to 1.466 µm, which are significantly below the ±2 µm tolerance, with relative errors between 2.57% and 6.91%. This means that the model possesses good predictability even under conditions of intensive energy input. In summary, the comparison between the experimentally defined D r e f and the model-predicted D p r e d validates the model’s capability to achieve depth prediction within engineering tolerances across the full parameter range. Furthermore, the analysis of relative error ε % provides a sensitivity assessment for different depth magnitudes, facilitating refined judgment for engineering design and experimental control.
The rigorous experimental verification of reproducibility and model predictability confirms the high reliability and precision of the proposed machining process. The results of repeated machining under identical parameter sets demonstrate excellent consistency in groove depths. This indicates that the machining process produces highly reproducible results under varying process parameters. Additionally, the model’s predictability is verified through comparison with the experimental benchmark depth. The model successfully predicts groove depths within the engineering tolerance of ±2 µm across all parameter groups, demonstrating its robust accuracy and ability to maintain precision even under extreme parameter conditions. The relative error analysis further highlights the model’s capability to handle both shallow and deep groove conditions, ensuring reliability for diverse applications.

4.2. Performance Analysis and Comparison of Models

Figure 8 shows a comparison of the prediction performance of SVM, DNN, CNN, GBDT, PGNN, and AM-PGNN during both training and testing. Specifically, the prediction accuracy of SVM and CNN was significantly lower than that of other algorithms during training, with most predictions falling outside the deviation lines, while DNN, GBDT, PGNN, and AM-PGNN all demonstrated strong fitting capabilities. In testing, AM-PGNN showed significant advantages, with the vast majority of its prediction points located within the deviation lines. In contrast, the prediction accuracy of other models was relatively lower, with their prediction points showing varying degrees of dispersion and some exceeding the deviation lines (Figure 8b–d). Table 7 provides detailed information on R2 and RMSE for all six models, demonstrating that the AM-PGNN model achieves excellent prediction accuracy for both the training and test sets.
By incorporating an attention mechanism, AM-PGNN achieves adaptive weighting of critical information within the input features, thereby enhancing its capability to identify and model causal relationships between inputs and outputs. Figure 9 presents the attention weights of the three input parameters as computed by the attention mechanism. The results indicate that SS has the highest weight at 40.1%, followed by RF at 34.5%, and LF at 25.4%. Furthermore, as listed in Table 7, AM-PGNN achieves an R2 of 93.7% and an RMSE of 1.573   μ m on the test set, significantly outperforming the PGNN model without the attention mechanism (R2 of 91.5% and RMSE of 1.822   μ m ). This means that the introduction of an attention mechanism substantially enhances model accuracy and predictive performance.
To further demonstrate the prediction performance of PGNN, Figure 10 comprehensively evaluates the prediction performance of each model on both training and test using the Taylor diagram. This analytical method simultaneously displays three metrics through trigonometric geometry: correlation coefficient, standard deviation, and root mean square deviation (RMSD). These metrics share a key geometric relationship derived from the cosine theorem.
The correlation coefficient R corresponds to the cosine of angle θ :
R = cos θ
where θ is the angle of the model point relative to the observation point (in polar coordinates). Through the cosine theorem, the relationship between RMSD, R2, and Std is given by
R M S D 2 = σ f 2 + σ o 2 2 σ f σ o R
where σ f is the standard deviation of the model predictions, and σ o is the standard deviation of the observed data. In the Taylor diagram, the observation point is located at ( σ o ,0), while the model point is located at ( σ f cos θ , σ f sin θ ). RMSD is the straight-line distance between these two points, and the cosine theorem describes the relationship between this distance and σ & θ . An ideal model point should be close to the reference point (small RMSD), with a high R and an appropriate σ (close to σ o ).
The results for the training are shown in Figure 10a. The standard deviation of the observed data is 6.90   μ m . The standard deviation of DNN is closest to the observed value of 6.51 μm, while GBDT shows the smallest standard deviation of 4.68 μm. In terms of correlation, CNN, GBDT, and AM-PGNN show the highest correlation of 0.97, while SVM shows the lowest correlation of 0.94. Also, the RMSDs of DNN and AM-PGNN are the lowest, 1.69   μ m and 1.68   μ m , respectively, while SVM and GBDT have the highest RMSDs 2.40   μ m . For testing (Figure 10b), AM-PGNN’s three metrics are closest to the observation point R = 0.97 , S t d = 5.71   μ m , R M S D = 1.52   μ m . SVM performs the worst due to its significantly lower R2 and the largest RMSD. Overall, AM-PGNN model achieves the highest prediction accuracy. By incorporating the attention mechanism, its accuracy shows a slight improvement compared to PGNN, confirming the initial expectation that AM-PGNN can effectively predict the depth of micro-grooves processed by femtosecond laser.

4.3. Analysis of Interpretability and Generalizability

Section 3.4 details the quantification of input–output monotonicity using PCD. Figure 11 presents the results. Both PGNN and AM-PGNN achieve a PCD value of 100% during training and testing. These models fully satisfy the prescribed monotonic constraints over the evaluated samples. For SVM, the PCD values for the three inputs in the test results are 67.14%, 81.43%, and 81.43%. The overall PCD is 76.67%. Approximately one quarter of the predictions violate the actual physical monotonicity rules. For the DNN model, the PCD for SS is 100%. However, the overall PCD is 94.76%. A small number of samples fail to satisfy the monotonicity rules. Both CNN and GBDT models exhibit an approximate 17% of data points that violate the actual physical monotonicity rules. The prediction accuracy of DNN and CNN is comparable to PGNN and AM-PGNN. However, their interpretability and generalization are inferior. This reveals a drawback of conventional models. They often sacrifice physical consistency for high-precision fitting. The results in Figure 11 demonstrate the limitations of relying solely on statistical metrics. The PCD method can effectively determine whether the model is consistent with the true physical priors, thereby enabling an assessment of its interpretability and generalization capability.
To further validate the physical consistency of the proposed model in capturing input–output relationships, this study employs partial dependence plots (PDPs) to analyze the significance of target features on prediction outcomes during testing. PDPs isolate the independent effect of feature x s on micro-groove depth predictions by fixing x s and marginalizing the output over the joint distribution of other features x c . PDPs consider the target response as a function of x s [43]:
f ^ s x s = E x c f ^ x s , x c = f ^ x s , x c d P x c
Analysis of the monotonic relationships between micro-groove depth and laser parameters using univariate PDPs (Figure 12) reveals significant differences in the ability of different models to represent physical mechanisms. The prediction results of SVM (Figure 12a) show that micro-groove depth first increases and then decreases with increasing laser fluence, which contradicts the physical mechanism of FLA. Higher laser fluence enhances energy per unit area, thereby intensifying melting and vaporization effects and ultimately increasing ablation depth. Although the overall trend of DNN, CNN, and GBDT (Figure 12b–d) reflects the monotonic relationship of micro-groove depth and laser fluence, some data points still deviate from physical monotonicity, indicating that these data points fail to strictly adhere to physical rules. In contrast, due to the incorporation of monotonicity constraints, the prediction results of PGNN and AM-PGNN (Figure 12e,f) demonstrate consistency with the actual physical monotonicity rules: micro-groove depth increases with laser fluence, but the incremental depth gradually decreases as fluence rises. This aligns with the effect of plasma shielding. The increased laser fluence leads to higher plasma density, causing partial energy to be shielded by the plasma and not fully reach the bottom of the groove, thereby reducing the incremental depth [8]. Additionally, micro-groove depth increases regularly with a higher repetition frequency and lower scanning speed. These observations prove that the proposed model can extrapolate learned monotonic relationships, thereby enhancing its generalizability.
As shown in Figure 13, a comparison of the training processes between the two models reveals that although the total loss of DNN gradually decreases during training, the loss induced by physical inconsistency persists. This means that DNN struggles to achieve physical consistency. In contrast, while the total loss of AM-PGNN converges, its physical loss remains consistently zero. This strictly ensures physical consistency and makes its predictions fully compliant with the constraints imposed by physical laws.
To further investigate the synergistic effects of the laser parameters on micro-groove depth, this study employs bivariate partial dependence plots for visualizing parameter interactions. Figure 14 illustrates the performance differences among the six models through contour plots. The predictions of SVM, DNN, and CNN (Figure 14a–c) show that micro-groove depth increases monotonically with higher repetition frequency. However, in high-laser-fluence regions, the depth exhibits non-monotonic fluctuations, and the predictions of both models deviate from the physical mechanisms of FLA. The GBDT model exhibits non-monotonic fluctuations in both low- and high-laser-fluence regions. In contrast, the predictions of the PGNN and AM-PGNN models (Figure 14e,f) demonstrate that micro-groove depth increases with both higher laser fluence and repetition frequency.
Further analysis of the parameter interactions (Figure 15) reveals that both PGNN and AM-PGNN accurately capture the synergistic enhancement effect of increasing laser fluence and decreasing scanning speed on micro-groove depth. This aligns with the physical mechanism that higher laser fluence and lower scanning speed collectively contribute to larger micro-groove depth (Figure 15e,f). In contrast, SVM fails to demonstrate a significant correlation between laser fluence and micro-groove depth (Figure 15a), while the contour distributions predicted by DNN, CNN, and GBDT exhibit non-monotonic characteristics in specific regions (Figure 15b–d). Experimental results confirm that, by incorporating physical monotonicity, both PGNN and AM-PGNN enhance interpretability and generalizability, making the predictions consistent with the actual physical monotonicity rules.

4.4. Analysis of Sensitivity and Robustness

In practical laser processing, the nominal input parameters inevitably deviate from their setpoints due to equipment uncertainty and process fluctuations. To evaluate the sensitivity and robustness of the proposed AM-PGNN with respect to such perturbations, we perform a Monte Carlo perturbation test on the held-out dataset. Specifically, for each test sample X , we generate perturbed inputs X by injecting small zero-mean Gaussian noise to each parameter, as expressed by
x j = x j 1 + ϵ j
where ϵ j N 0 , σ 2 and σ is chosen such that the perturbation magnitude approximately corresponds to a ± 1 % fluctuation. For each original sample, K perturbed realizations are evaluated to obtain an output distribution y ^ ( k ) k = 1 K . The robustness of the model is quantified using two metrics: (i) the prediction standard deviation S t d y ^ across perturbations and (ii) the normalized absolute deviation ( N A D ), given by
N A D = 1 K k = 1 K y ^ k y ^ y ^ + ε
where y ^ is the baseline prediction and ε is a small stabilizer. A lower S t d y ^ and N A D indicate reduced noise amplification and improved robustness.
As shown in Figure 16, when subjected to the same ± 1 % input perturbation, AM-PGNN demonstrates noticeably smaller fluctuations in its predictions than both DNN, CNN, GBDT, and PGNN without an attention mechanism. In particular, AM-PGNN maintains low values of both the S t d and the N A D under perturbation, and the corresponding variation ranges remain narrow, indicating a stable response to input uncertainties. The SVM model is not included in this comparison because its robustness was insufficient for meaningful evaluation under the same perturbation setting. Collectively, these results confirm the value of introducing attention, as it mitigates sensitivity to perturbations and improves overall robustness. More specifically, the attention-based reweighting strategy, together with the imposed monotonicity constraints, appears to enhance the model’s tolerance to parameter uncertainty encountered in practical applications.

4.5. Multi-Scenario Ablation Experiments

To further validate that AM-PGNN is applicable to different materials, this study fabricated micro-grooves on Cu and SiC using FLA. The datasets for the ablation of Cu and SiC were obtained from the experiment, and the sample size and parameter ranges are consistent with those used in the WC experiments. Because the physical properties of Cu and SiC are different from those of the WC, the model was retrained separately for each workpiece material. However, within the studied parameter range, the monotonic relationships between the processing parameters and the ablation depth remain consistent across the studied materials. Figure 17 shows the prediction of the micro-groove depth in the FLA of Cu. Specifically, the majority of prediction points of AM-PGNN fall within the deviation lines and are close to the 1:1 line, while the other five models exhibit varying degrees of prediction points exceeding the deviation lines. For a more intuitive comparison of model performance, quantitative analysis was conducted using R2, RMSE, and PCD metrics (Figure 18). AM-PGNN achieved the highest R2 of 93.2% and 91.8% on training and testing, and the lowest RMSE values of 2.83 and 2.76, with PCD consistently at 100%, indicating that the predictions align with physical mechanisms. R2 values of SVM, DNN, CNN, and GBDT algorithms on both the training and test sets are significantly lower than those of the AM-PGNN, and some predictions violated physical monotonicity. As for the PGNN model, although its PCD is 100% and the predictions satisfy physical monotonicity, the R2 values on training and testing are 90.3% and 90.1%, respectively, which are lower than those of AM-PGNN.
In the ablation of SiC (Figure 19), SVM, DNN, CNN, GBDT, and PGNN exhibited varying degrees of prediction points exceeding the deviation lines, while most of AM-PGNN’s predictions remained within the deviation lines and closely aligned with the 1:1 line. In terms of evaluation metrics (Figure 20), SVM performed the worst with the lowest R2 and highest RMSE. In contrast, AM-PGNN achieved the highest R2, the lowest RMSE, and maintained a PCD of 100% on both the training and test sets, demonstrating the best overall performance.

5. Conclusions

This study proposes an effective ensemble method for the field of femtosecond laser ablation, integrating physics-guided constraints with neural network learning. The proposed algorithm integrates monotonic constraints and an attention mechanism, achieving accurate predictions based on the available experimental dataset. The main conclusions are summarized as follows:
(1)
Integration of domain knowledge and attention mechanism: a physics-guided architecture was established by translating key domain knowledge into physical constraints embedded into the NN, which incorporates the positive monotonic relationships of LF and RF, and the negative monotonic relationship of SS, with micro-groove depth. Additionally, an attention mechanism allows for the adaptive weighting of input features, prioritizing critical physical parameters during training.
(2)
Improved accuracy and physical consistency: AM-PGNN significantly outperforms other ML methods in prediction accuracy. The embedded monotonicity constraints ensure that the predictions strictly follow the prescribed monotonicity relationships, thereby preserving the physical consistency of the overall trend, which is essential for the accurate modeling, control, and optimization of FLA.
(3)
Enhanced robustness: the proposed model exhibits superior stability compared to conventional models. Under identical ±1% input perturbations, the predictions of AM-PGNN demonstrate smaller fluctuations than those models.
(4)
Applicable to different materials: in the FLA experiments on Cu and SiC, the proposed AM-PGNN framework can be applied to different materials after training on a specific material. This result shows that the framework is applicable in the FLA of different materials and can achieve accurate predictions by incorporating physical monotonicity rules rather than relying solely on statistical data. The method offers a solution with low data requirements and high interpretability, making it particularly suitable for high-cost experiments.
The proposed AM-PGNN model achieves high-accuracy groove depth prediction under limited sample conditions and improves physical consistency through monotonicity constraints. Additional results on other materials indicate that, after retraining for a specific material, the framework can be extended to different material systems. This indicates the algorithm has potential in the application of the digital twin architecture for FLA. On this basis, future efforts can be directed toward integrating the model with in situ real-time sensing modalities, such as laser-induced plasma diagnostics and optical profilometry. This development is expected to facilitate real-time optimization and online monitoring of FLA. Furthermore, the current study is confined to the prediction of micro-groove depth under single-scan conditions, while the cumulative material removal behavior in multi-pass machining and the characterization of complex three-dimensional surface topographies remain to be investigated. Therefore, future work will extend the AM-PGNN framework by describing multi-pass ablation dynamics based on physical principles, developing a strongly differentiable optimization strategy, and expanding its predictive scope from one-dimensional depth estimation to full three-dimensional surface topography prediction.

Author Contributions

Conceptualization, G.L.; methodology, G.L.; software, L.D. and M.L.; validation, H.X.; formal analysis, G.L., L.D. and M.L.; investigation, G.L. and L.D.; resources, S.D.; data curation, M.L. and H.X.; writing—original draft preparation, G.L., L.D. and M.L.; writing—review and editing, H.X. and S.D.; visualization, L.D.; supervision, G.L.; project administration, G.L. All authors have read and agreed to the published version of the manuscript.

Funding

The authors gratefully acknowledge the UK Engineering and Physical Sciences Research Council (EPSRC)’s funding of The Future Advanced Metrology Hub for Sustainable Manufacturing (Grant Ref: EP/Z53285X/1) and the UKRI-funded Advanced Machinery and Productivity Initiative (84646).

Data Availability Statement

The data supporting the findings of this study are not publicly available due to confidentiality agreements stemming from business collaborations. However, they may be made available by the authors upon reasonable request for restricted purposes. Access to the data will require approval from the Centre for Precision Technology at the University of Huddersfield and may be subject to additional conditions.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
FLAFemtosecond Laser Ablation
AM-PGNNAttention-Based Monotonic Physics-Guided Neural Network
WCTungsten Carbide
SVMSupport Vector Machine
DNNDeep Neural Network
CNNConvolutional Neural Network
GBDTGradient Boosting Decision Tree
NADNormalized Absolute Deviation
HAZHeat-Affected Zone
MLMachine Learning
ANNsArtificial Neural Networks
PINNPhysics-Informed Neural Network
ELMExtreme Learning Machine
TANNThermodynamics-Based Artificial Neural Network
SSTSuperheated Steam Temperature
LSTMLong Short-Term Memory
PSOParticle Swarm Optimization
PCDPhysical Consistency Degree
LFLaser Fluence
RFRepetition Frequency
SSScanning Speed
RMSERoot Mean Square Error
RMSDRoot Mean Square Deviation
StdStandard Deviation
ACFAutocorrelation Function
PDPsPartial Dependence Plots

References

  1. Kresse, T.; Meinhard, D.; Bernthaler, T.; Schneider, G. Hardness of WC-Co hard metals: Preparation, quantitative microstructure analysis, structure-property relationship and modelling. Int. J. Refract. Met. Hard Mater. 2018, 75, 287–293. [Google Scholar] [CrossRef]
  2. Xiang, D.; Su, Z.; Li, Y.; Zhang, Z. The effect of laser texture on adhesion and tribological properties of boron-doped diamond film on WC-Co cemented carbide. Diam. Relat. Mater. 2022, 130, 109469. [Google Scholar] [CrossRef]
  3. Arslan, A.; Masjuki, H.H.; Kalam, M.A.; Varman, M.; Mufti, R.A.; Mosarof, M.H.; Khuong, L.S.; Quazi, M.M. Surface Texture Manufacturing Techniques and Tribological Effect of Surface Texturing on Cutting Tool Performance: A Review. Crit. Rev. Solid State Mater. Sci. 2016, 41, 447–481. [Google Scholar] [CrossRef]
  4. Singh, R.; Alberts, M.J.; Melkote, S.N. Characterization and prediction of the heat-affected zone in a laser-assisted mechanical micromachining process. Int. J. Mach. Tools Manuf. 2008, 48, 994–1004. [Google Scholar] [CrossRef]
  5. Singh, R.; Melkote, S.N. Characterization of a hybrid laser-assisted mechanical micromachining (LAMM) process for a difficult-to-machine material. Int. J. Mach. Tools Manuf. 2007, 47, 1139–1150. [Google Scholar] [CrossRef]
  6. Yu, Y.; Bai, S.; Wang, S.; Hu, A. Ultra-Short Pulsed Laser Manufacturing and Surface Processing of Microdevices. Engineering 2018, 4, 779–786. [Google Scholar] [CrossRef]
  7. Zhou, Z.; Feng, S.; Yang, C.; Ji, C.; Long, Y.; Li, G. Femtosecond laser machining of the novel superhydrophobic microstructure for the oil-water separation. Surf. Interfaces 2024, 45, 103873. [Google Scholar] [CrossRef]
  8. Li, G.; Fan, R.; Long, Y.; Zhou, Z.; Ding, S. Material removal mechanism and surface quality in low-fluence femtosecond laser ablation of polycrystalline diamonds. J. Mater. Process. Technol. 2025, 337, 118731. [Google Scholar] [CrossRef]
  9. Li, G.; Li, X.; He, G.; Fan, R.; Li, F.; Ding, S. Surface Quality and Material Removal Rate in Fabricating Microtexture on Tungsten Carbide via Femtosecond Laser. Micromachines 2023, 14, 1143. [Google Scholar] [CrossRef] [PubMed]
  10. Li, T.; Yin, Z.; Chen, M.; Cheng, J.; Zhao, L.; Zhang, T. Ablation profile prediction of hard brittle fused silica optics irradiated by single-pulse femtosecond laser. Appl. Surf. Sci. 2025, 688, 162319. [Google Scholar] [CrossRef]
  11. Chen, Y.; Sun, H.; Lin, G.; Song, S.; Ji, P. Multiphysics modeling femtosecond laser ablation of Ti6Al4V with material transient properties. J. Alloys Compd. 2024, 1002, 175360. [Google Scholar] [CrossRef]
  12. Song, S.; Jiang, L.; Ji, P. Phenomenological modeling for femtosecond laser processing of fused silica. J. Manuf. Process. 2024, 120, 365–377. [Google Scholar] [CrossRef]
  13. Wang, W.; Mei, X.; Jiang, G.; Lei, S.; Yang, C. Effect of two typical focus positions on microstructure shape and morphology in femtosecond laser multi-pulse ablation of metals. Appl. Surf. Sci. 2008, 255, 2303–2311. [Google Scholar] [CrossRef]
  14. Summers, A.J.; Yin, H.; Fischer, R.D.; Prorok, B.C.; Lou, X.; He, Q.P. Constructing process maps for pulsed wave laser additive manufacturing with interpretable machine learning. J. Manuf. Process. 2023, 104, 138–149. [Google Scholar] [CrossRef]
  15. McDonnell, M.D.T.; Arnaldo, D.; Pelletier, E.; Grant-Jacob, J.A.; Praeger, M.; Karnakis, D.; Eason, R.W.; Mills, B. Machine learning for multi-dimensional optimisation and predictive visualisation of laser machining. J. Intell. Manuf. 2021, 32, 1471–1483. [Google Scholar] [CrossRef]
  16. Herzog, T.; Brandt, M.; Trinchi, A.; Sola, A.; Molotnikov, A. Process monitoring and machine learning for defect detection in laser-based metal additive manufacturing. J. Intell. Manuf. 2024, 35, 1407–1437. [Google Scholar] [CrossRef]
  17. Campanelli, S.L.; Casalino, G.; Ludovico, A.D.; Bonserio, C. An artificial neural network approach for the control of the laser milling process. Int. J. Adv. Manuf. Technol. 2013, 66, 1777–1784. [Google Scholar] [CrossRef]
  18. Zhang, Q.; Zhang, X.; Zhou, Y.; Hai, Y.; Wang, B.; Guan, Y. Intelligent femtosecond laser bone drilling via online monitoring and machine learning. J. Manuf. Process. 2024, 117, 224–231. [Google Scholar] [CrossRef]
  19. Li, K.; Xu, J.; Sun, S.; Tang, Z.; Qiu, R.; Hu, J. Prediction and optimization of femtosecond laser processing parameters for CFRP surface microgrooves based on machine learning. Opt. Lasers Eng. 2026, 198, 109547. [Google Scholar] [CrossRef]
  20. Zhang, Z.; Jia, M.; Wang, L.; Yu, Y.; Yang, Z.; Wang, J.; Wang, Y.; Wang, C.; Lv, Z.; Xu, W. Physical model-guided machine learning for accelerating laser induced plasma micro-machining process optimization. Opt. Laser Technol. 2025, 183, 112402. [Google Scholar] [CrossRef]
  21. Karpatne, A.; Atluri, G.; Faghmous, J.H.; Steinbach, M.; Banerjee, A.; Ganguly, A.; Shekhar, S.; Samatova, N.; Kumar, V. Theory-Guided Data Science: A New Paradigm for Scientific Discovery from Data. IEEE Trans. Knowl. Data Eng. 2017, 29, 2318–2331. [Google Scholar] [CrossRef]
  22. Azodi, C.B.; Tang, J.; Shiu, S.-H. Opening the Black Box: Interpretable Machine Learning for Geneticists. Trends Genet. 2020, 36, 442–455. [Google Scholar] [CrossRef] [PubMed]
  23. Raissi, M.; Yazdani, A.; Karniadakis, G.E. Hidden fluid mechanics: Learning velocity and pressure fields from flow visualizations. Science 2020, 367, 1026–1030. [Google Scholar] [CrossRef]
  24. Cuomo, S.; Di Cola, V.S.; Giampaolo, F.; Rozza, G.; Raissi, M.; Piccialli, F. Scientific Machine Learning Through Physics–Informed Neural Networks: Where we are and What’s Next. J. Sci. Comput. 2022, 92, 88. [Google Scholar] [CrossRef]
  25. Alber, M.; Buganza Tepole, A.; Cannon, W.R.; De, S.; Dura-Bernal, S.; Garikipati, K.; Karniadakis, G.; Lytton, W.W.; Perdikaris, P.; Petzold, L.; et al. Integrating machine learning and multiscale modeling—Perspectives, challenges, and opportunities in the biological, biomedical, and behavioral sciences. npj Digit. Med. 2019, 2, 115. [Google Scholar] [CrossRef]
  26. Rueden, L.v.; Mayer, S.; Beckh, K.; Georgiev, B.; Giesselbach, S.; Heese, R.; Kirsch, B.; Pfrommer, J.; Pick, A.; Ramamurthy, R.; et al. Informed Machine Learning—A Taxonomy and Survey of Integrating Prior Knowledge into Learning Systems. IEEE Trans. Knowl. Data Eng. 2023, 35, 614–633. [Google Scholar] [CrossRef]
  27. Kim, K.M.; Hurley, P.; Duarte, J.P. Physics-informed machine learning-aided framework for prediction of minimum film boiling temperature. Int. J. Heat Mass Transf. 2022, 191, 122839. [Google Scholar] [CrossRef]
  28. Halder, R.; Damodaran, M.; Khoo, B.C. Deep learning-driven nonlinear reduced-order models for predicting wave-structure interaction. Ocean Eng. 2023, 280, 114511. [Google Scholar] [CrossRef]
  29. Masi, F.; Stefanou, I.; Vannucci, P.; Maffi-Berthier, V. Thermodynamics-based Artificial Neural Networks for constitutive modeling. J. Mech. Phys. Solids 2021, 147, 104277. [Google Scholar] [CrossRef]
  30. Xie, J.; Chai, Z.; Xu, L.; Ren, X.; Liu, S.; Chen, X. 3D temperature field prediction in direct energy deposition of metals using physics informed neural network. Int. J. Adv. Manuf. Technol. 2022, 119, 3449–3468. [Google Scholar] [CrossRef]
  31. Zhu, H.; Tsang, E.C.C.; Wang, X.-Z.; Aamir Raza Ashfaq, R. Monotonic classification extreme learning machine. Neurocomputing 2017, 225, 205–213. [Google Scholar] [CrossRef]
  32. Wang, P.; Si, F.; Cao, Y.; Shao, Z.; Ren, S. Prediction of superheated steam temperature for thermal power plants using a novel integrated method based on the hybrid model and attention mechanism. Appl. Therm. Eng. 2022, 203, 117899. [Google Scholar] [CrossRef]
  33. Ren, S.; Wu, S.; Weng, Q. Physics-informed machine learning methods for biomass gasification modeling by considering monotonic relationships. Bioresour. Technol. 2023, 369, 128472. [Google Scholar] [CrossRef]
  34. Xie, K.; Liu, P.; Zhang, J.; Han, D.; Wang, G.; Shen, C. Physics-guided deep learning for rainfall-runoff modeling by considering extreme events and monotonic relationships. J. Hydrol. 2021, 603, 127043. [Google Scholar] [CrossRef]
  35. Zhu, B.; Ren, S.; Weng, Q.; Si, F. A physics-informed neural network that considers monotonic relationships for predicting NOx emissions from coal-fired boilers. Fuel 2024, 364, 131026. [Google Scholar] [CrossRef]
  36. Karpatne, A.; Watkins, W.; Read, J.S.; Kumar, V. Physics-guided Neural Networks (PGNN): An Application in Lake Temperature Modeling. arXiv 2017, arXiv:1710.11431. [Google Scholar]
  37. Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics-informed machine learning. Nat. Rev. Phys. 2021, 3, 422–440. [Google Scholar] [CrossRef]
  38. Singh, D.; Singh, B. Investigating the impact of data normalization on classification performance. Appl. Soft Comput. 2020, 97, 105524. [Google Scholar] [CrossRef]
  39. Cai, J.; Wang, Y. Short-term wind power prediction based on adaptive Stacking integrated learning model. Phys. Scr. 2025, 100, 066004. [Google Scholar] [CrossRef]
  40. Weng, Q.; Ren, S.; Zhu, B.; Si, F. Biomass gasification modeling based on physics-informed neural network with constrained particle swarm optimization. Energy 2025, 320, 135392. [Google Scholar] [CrossRef]
  41. Kuo, J.-F.; Cheng, C.-W. Fabrication of grooves on 4H–SiC using femtosecond laser vector beam. Opt. Commun. 2025, 574, 131200. [Google Scholar] [CrossRef]
  42. Dumitru, G.; Romano, V.; Weber, H.P.; Sentis, M.; Hermann, J.; Bruneau, S.; Marine, W.; Haefke, H.; Gerbig, Y. Metallographical analysis of steel and hard metal substrates after deep-drilling with femtosecond laser pulses. Appl. Surf. Sci. 2003, 208–209, 181–188. [Google Scholar] [CrossRef]
  43. Zhao, S.; Li, J.; Chen, C.; Yan, B.; Tao, J.; Chen, G. Interpretable machine learning for predicting and evaluating hydrogen production via supercritical water gasification of biomass. J. Clean. Prod. 2021, 316, 128244. [Google Scholar] [CrossRef]
Figure 1. (a) Experimental setup and (b) schematic diagram of the femtosecond laser ablation (FLA) platform.
Figure 1. (a) Experimental setup and (b) schematic diagram of the femtosecond laser ablation (FLA) platform.
Machines 14 00509 g001
Figure 2. Proposed prediction scheme.
Figure 2. Proposed prediction scheme.
Machines 14 00509 g002
Figure 3. The femtosecond laser machining process: (a) principle of femtosecond laser machining; (b) effect of laser fluence on groove depth; (c) effect of repetition frequency on groove depth; (d) effect of scanning speed on groove depth.
Figure 3. The femtosecond laser machining process: (a) principle of femtosecond laser machining; (b) effect of laser fluence on groove depth; (c) effect of repetition frequency on groove depth; (d) effect of scanning speed on groove depth.
Machines 14 00509 g003
Figure 4. Groove depth variation with different variables: (a) LF (b) RF (c) SS.
Figure 4. Groove depth variation with different variables: (a) LF (b) RF (c) SS.
Machines 14 00509 g004
Figure 5. Convergence curves under different optimizers: (a) Adam; (b) PSO.
Figure 5. Convergence curves under different optimizers: (a) Adam; (b) PSO.
Machines 14 00509 g005
Figure 6. Framework of the algorithm for micro-groove depth prediction.
Figure 6. Framework of the algorithm for micro-groove depth prediction.
Machines 14 00509 g006
Figure 7. Verification of randomness in depth variations along the micro-groove: (a1a3) variation of D i with position index i ; (b1b3) histogram of the distribution of D i , where data1, data2, and data3 represent the arithmetic mean along the groove in three repeated experiments; (c1c3) ACF of D i with 95% confidence bounds.
Figure 7. Verification of randomness in depth variations along the micro-groove: (a1a3) variation of D i with position index i ; (b1b3) histogram of the distribution of D i , where data1, data2, and data3 represent the arithmetic mean along the groove in three repeated experiments; (c1c3) ACF of D i with 95% confidence bounds.
Machines 14 00509 g007
Figure 8. Prediction results of various algorithms: (a) SVM; (b) DNN; (c) CNN; (d) GBDT; (e) PGNN; (f) AM-PGNN.
Figure 8. Prediction results of various algorithms: (a) SVM; (b) DNN; (c) CNN; (d) GBDT; (e) PGNN; (f) AM-PGNN.
Machines 14 00509 g008
Figure 9. Attention weights of LF, RF, and SS in the prediction of groove depth.
Figure 9. Attention weights of LF, RF, and SS in the prediction of groove depth.
Machines 14 00509 g009
Figure 10. Taylor plots for various models: (a) training set; (b) test set.
Figure 10. Taylor plots for various models: (a) training set; (b) test set.
Machines 14 00509 g010
Figure 11. PCD statistics for various models used in the training and test sets: (a) LF; (b) RF; (c) SS; (d) total.
Figure 11. PCD statistics for various models used in the training and test sets: (a) LF; (b) RF; (c) SS; (d) total.
Machines 14 00509 g011
Figure 12. Partial dependence plots of micro-groove depth at different laser fluences: (a) SVM; (b) DNN; (c) CNN; (d) GBDT; (e) PGNN; (f) AM-PGNN.
Figure 12. Partial dependence plots of micro-groove depth at different laser fluences: (a) SVM; (b) DNN; (c) CNN; (d) GBDT; (e) PGNN; (f) AM-PGNN.
Machines 14 00509 g012
Figure 13. Physical inconsistencies in the training processes: (a) DNN; (b) AM-PGNN.
Figure 13. Physical inconsistencies in the training processes: (a) DNN; (b) AM-PGNN.
Machines 14 00509 g013
Figure 14. Partial dependence plots of micro-groove depth at different laser fluences and repetition frequencies: (a) SVM; (b) DNN; (c) CNN; (d) GBDT; (e) PGNN; (f) AM-PGNN.
Figure 14. Partial dependence plots of micro-groove depth at different laser fluences and repetition frequencies: (a) SVM; (b) DNN; (c) CNN; (d) GBDT; (e) PGNN; (f) AM-PGNN.
Machines 14 00509 g014
Figure 15. Partial dependence plots of micro-groove depth at different laser fluences and scanning speeds: (a) SVM; (b) DNN; (c) CNN; (d) GBDT; (e) PGNN; (f) AM-PGNN.
Figure 15. Partial dependence plots of micro-groove depth at different laser fluences and scanning speeds: (a) SVM; (b) DNN; (c) CNN; (d) GBDT; (e) PGNN; (f) AM-PGNN.
Machines 14 00509 g015
Figure 16. Comparison of robustness among different models: (a) standard deviation of prediction across perturbations; (b) normalized absolute deviation.
Figure 16. Comparison of robustness among different models: (a) standard deviation of prediction across perturbations; (b) normalized absolute deviation.
Machines 14 00509 g016
Figure 17. Performance of different models in FLA of Cu: (a) SVM; (b) DNN; (c) CNN; (d) GBDT; (e) PGNN; (f) AM-PGNN.
Figure 17. Performance of different models in FLA of Cu: (a) SVM; (b) DNN; (c) CNN; (d) GBDT; (e) PGNN; (f) AM-PGNN.
Machines 14 00509 g017
Figure 18. Metrics of different models in FLA of Cu: (a) R2; (b) RMSE; (c) PCD.
Figure 18. Metrics of different models in FLA of Cu: (a) R2; (b) RMSE; (c) PCD.
Machines 14 00509 g018
Figure 19. Performance of different models in FLA of SiC: (a) SVM; (b) DNN; (c) CNN; (d) GBDT; (e) PGNN; (f) AM-PGNN.
Figure 19. Performance of different models in FLA of SiC: (a) SVM; (b) DNN; (c) CNN; (d) GBDT; (e) PGNN; (f) AM-PGNN.
Machines 14 00509 g019
Figure 20. Metrics of different models in FLA of SiC: (a) R2; (b) RMSE; (c) PCD.
Figure 20. Metrics of different models in FLA of SiC: (a) R2; (b) RMSE; (c) PCD.
Machines 14 00509 g020
Table 1. Relevant parameters of the laser processing platform.
Table 1. Relevant parameters of the laser processing platform.
Wavelength (λ)Pulse Width ( t p )Focused Spot Radius ( r 0 )Lens Focal Length (f)Rayleigh Range (ZR)
nmfsμmmmmm
1030300131000.524
Table 2. Design of experimental parameters.
Table 2. Design of experimental parameters.
VariableDescription/UnitVariation RangeStandard
InputsLaser fluence (LF)Energy transferred by laser per unit area/J/cm2 [0.98, 5.53]8
Repetition frequency (RF)Number of laser pulses emitted per second/MHz [0.025, 0.3]5
Scanning speed (SS)Moving speed of the laser beam/mm/s [10, 100]5
OutputsDepth (DP)Micro-groove depth/μm [0.06, 31]
Table 3. Monotonic relationships between each variable and groove depth.
Table 3. Monotonic relationships between each variable and groove depth.
VariableTotal Number of GroupsMonotonic Group NumberMonotonic Direction
LF2525
RF4040
SS4040
Table 4. Experimental parameter design for repeated-machining verification.
Table 4. Experimental parameter design for repeated-machining verification.
Param GroupLF (J/cm2)RF (MHz)SS (mm/s)
Minor-parameter regimeP13.980.0250.02
P25.250.050.01
P33.80.10.02
High-parameter regimeP44.330.20.01
P55.450.20.01
P64.20.30.01
Table 5. Statistical analysis of depth, reproducibility, and fluctuation for different parameter sets.
Table 5. Statistical analysis of depth, reproducibility, and fluctuation for different parameter sets.
Parameter Set D r e f σ i n t e r σ i n t r a C V i n t e r C V i n t r a
P11.0670.0460.1060.0430.099
P24.120.0740.1290.0180.031
P34.9210.0980.1330.020.027
P421.2110.2750.5970.0130.028
P522.8980.2790.750.0120.03
P630.5750.6630.9160.0220.03
Table 6. Model prediction error analysis for different parameter sets.
Table 6. Model prediction error analysis for different parameter sets.
Param Group D r e f D p r e d ε ε ε % Within ± 2 μm
P11.0671.0980.0310.0312.89Yes
P24.124.085−0.0340.0340.834Yes
P34.9215.0030.0820.0821.671Yes
P421.21122.6771.4661.4666.913Yes
P522.89823.4860.5880.5882.57Yes
P630.57531.520.9450.9453.09Yes
Table 7. Comparison of performance among various models.
Table 7. Comparison of performance among various models.
AlgorithmR2RMSE (μm)
TrainTestTrainTest
SVM0.878 ± 0.0980.841 ± 0.1042.412 ± 0.5672.495 ± 1.051
DNN0.938 ± 0.0450.920 ± 0.0491.714 ± 0.3321.775 ± 0.699
CNN0.926 ± 0.0180.910 ± 0.0291.823 ± 0.1331.941 ± 0.580
GBDT0.896 ± 0.0590.829 ± 0.0781.801 ± 0.3872.068 ± 0.667
PGNN0.923 ± 0.0230.915 ± 0.0311.909 ± 0.1091.822 ± 0.358
AM-PGNN0.940 ± 0.0140.937 ± 0.0201.695 ± 0.0881.573 ± 0.165
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, G.; Ding, L.; Liu, M.; Xie, H.; Ding, S. Prediction of Groove Depth in Femtosecond Laser Ablation via Attention Mechanism and Monotonic Constraint. Machines 2026, 14, 509. https://doi.org/10.3390/machines14050509

AMA Style

Li G, Ding L, Liu M, Xie H, Ding S. Prediction of Groove Depth in Femtosecond Laser Ablation via Attention Mechanism and Monotonic Constraint. Machines. 2026; 14(5):509. https://doi.org/10.3390/machines14050509

Chicago/Turabian Style

Li, Guangxian, Luyang Ding, Meng Liu, Hui Xie, and Songlin Ding. 2026. "Prediction of Groove Depth in Femtosecond Laser Ablation via Attention Mechanism and Monotonic Constraint" Machines 14, no. 5: 509. https://doi.org/10.3390/machines14050509

APA Style

Li, G., Ding, L., Liu, M., Xie, H., & Ding, S. (2026). Prediction of Groove Depth in Femtosecond Laser Ablation via Attention Mechanism and Monotonic Constraint. Machines, 14(5), 509. https://doi.org/10.3390/machines14050509

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop