1. Introduction
Friction stir welding (FSW) is a solid-state joining technique extensively adopted in the aerospace, automotive, and shipbuilding industries for aluminum alloys, owing to its low heat input, the absence of fusion defects such as blowholes and solidification cracks, and superior joint quality—defined by higher tensile strength, better fatigue performance, lower residual stress, and a finer, more uniform microstructure—compared with conventional fusion welding methods such as gas metal arc welding (GMAW) and TIG welding [
1,
2,
3,
4]. These characteristics result in enhanced structural integrity and service life under cyclic loading. Despite these advantages, welded structures remain susceptible to fatigue fracture under service loading, making the accurate determination of fatigue crack initiation sites critical for structural integrity and life prediction [
5,
6,
7].
Previous studies have shown that fatigue fracture locations in metal joints depend on microstructural variations, welding parameters, loading conditions, and environmental factors. Experimental investigations by Aydin et al. [
8] demonstrated that welding speed and stress range affect crack initiation zones, while Maggiolini et al. [
9] and Weng et al. [
10] analyzed crack paths in aluminum pipes and NiCrMoV steel joints under different loading regimes. Dorbane et al. [
11] further highlighted the role of microstructure and temperature in AZ31 magnesium plates. Complementary simulation work by Ren et al. [
12] employed crystal-plasticity finite element models to link texture distribution with fracture localization. However, these approaches rely on extensive experiments or complex physical models, limiting their efficiency and generalizability [
13].
Machine learning (ML) has recently emerged as a powerful tool for fatigue damage analysis, capable of handling nonlinear and high-dimensional data [
14,
15,
16]. Applications include the life prediction of gray cast iron using neural networks and random forests [
17], the fatigue performance of corroded steel wires [
18], and multiaxial fatigue modeling via image-based feature extraction [
19]. Related to joint integrity, Katarzyna [
20] applied ANN to predict the tightness of aluminum alloy heat exchanger joints in automotive applications, demonstrating the adaptability of ML for welding-related prediction tasks. Nonetheless, ML methods—particularly artificial neural networks—often require large data sets and suffer from interpretability and class-imbalance issues [
21,
22,
23,
24].
Although several recent studies have used machine learning to estimate the fatigue life or strength of welded or corroded components, most focus on scalar life prediction rather than the spatial localization of fracture zones. For example, the shear strength of Ag-Al bonded joints with surface finish was predicted by DL models [
25], and Chen et al. [
26] developed a method to quantify CF/ER bolted joint friction via surface topography—but neither addressed the spatial prediction of failure regions. To date, no known work integrates both fracture location classification and physics-informed regional modeling in aluminum FSW joints using a compact feature set.
In this work, we introduce a novel Quadratic Classification Neural Network (QCNN) for predicting fatigue fracture locations in 7075-T651 FSW joints. Unlike conventional three-layer back-propagation ANNs or single-stage classifiers (e.g., decision trees, logistic regression), our Quadratic Classification Neural Network (QCNN) employs a two-stage grouping strategy coupled with weight inheritance. The first (“prelude”) network coarsely partitions fracture regions into two macro-classes, then two specialized “rear” networks—with inherited prelude weights—resolve fine-grained class boundaries. This architecture improves class separation (especially for interlaced BM vs. TMAZ data) and accelerates convergence by reusing learned feature representations. Building on QCNN outputs and relevant material-property parameters, we derive a Regional Fracture Prediction Formula (RFPF) based on a Fourier-series quadratic expansion. Finally, we demonstrate the practical utility of the QCNN + RFPF framework for the rapid, physics-informed assessment of FSW-structure service life.
This paper is organized as follows.
Section 2 describes experimental procedures;
Section 3 presents the prediction of fracture locations based on machine learning;
Section 4 introduces our QCNN model for fracture-location prediction;
Section 5 evaluates model performance;
Section 6 and
Section 7 explore input-parameter influences and derive the Regional Fracture Prediction Formula.
5. Model Evaluation
The reliability of a model can be evaluated by such parameters as accuracy, precision, recall, and specificity. These parameters are the classification indicators obtained from the confusion matrix. The confusion matrix of the QCNN is listed in
Table 8. Among them, the vertical axis represents the predicted value, and the horizontal axis represents the actual value. The equations for precision rate, recall rate, and specificity are shown in Equations (14)–(16).
where FPR represents the false positive rate.
In the above equation, TP indicates that the real category of the sample is positive, and the result of model recognition is also positive. FN means that the real category of the sample is positive, but the model identifies it as negative. FP indicates that the real category of the sample is negative, but the model identifies it as positive. TN indicates that the real category of the sample is negative, and the model identifies it as negative.
According to the equations, some parameters of the QCNN can be calculated: The precision of the WNZ is 100%, the recall rate is 100%, and the specificity is 1. The precision rate of the HAZ is 100%, the recall rate is 100%, and the specificity is 1. The precision rate of the BM is 50%, the recall rate is 100%, and the specificity is 0.85. The precision rate of the TMAZ is 100%, the recall rate is 66.7%, and the specificity is 1.
The precision rate indicates the percentage of positive samples identified as positive by the model. The higher the precision ratio, the better the model effects. The recall rate indicates the ratio of the number of positive samples correctly identified by the model to the total number of positive samples. The higher the recall rate, the more positive samples the model predicted correctly. Specificity represents the ratio of the number of negative samples identified by the model to the total number of negative samples. The greater the specificity, the better the model effects [
38]. It can be seen from the above results that the model has a good effect in predicting WNZ, HAZ, and TMAZ but is less sensitive to BM.
Under the same experimental conditions, after conducting fatigue tests on the FSW joints of other materials, the QCNN was used to predict them, and the results are shown in
Table 9. The QCNN has a prediction accuracy of 75% and a loss value of 0.7210459 for the two major categories of manually divided fracture positions in the front network. The final prediction accuracy is 68.75%. It can be seen that the QCNN has a certain degree of inclusiveness, and it also has reasonable accuracy in predicting the fracture location of other materials.
The neural analysis results of this network are shown in
Figure 10. The circle-solid line represents the true value. Because the classification model uses one-hot code, the true value is always 1. The squared-solid line represents its prediction probability, and the other dotted lines represent the output of each neuron, which can be considered a form of probability.
The ordinate of
Figure 10 represents the output value of neurons, and the abscissa represents different prediction groups. It can be seen that the neuron output of the WNZ and the HAZ is relatively stable, which means the output of each neuron is at the same level. However, the neuron output of the BM and the TMAZ is relatively disordered. Only some neurons play a key role in prediction, and nearly half of them play no active role. By comparing the two areas, it can be found that although the neurons in the TMAZ are messy, the overall output is more accurate and closer to the true value than that in the BM. Therefore, the prediction effect in the TMAZ is better than that in the BM.
6. Influence of Input Parameters on Fracture Location
For the QCNN, the weight of input parameters is determined by trying to shuffle the input parameters. After each feature is processed, its influence is judged by the corresponding loss value. The larger the loss value, the more influential the feature is.
This network includes a prelude network and two rear networks. It is necessary to conduct comprehensive judgment and analysis on these three groups of networks. To ensure stability, 200 reshuffles were carried out for each input feature of the three networks, The corresponding loss value obtained from each reshuffle was calculated and recorded, as shown in
Figure 11. After that, repeated 100 times, all network loss values were added and divided. The results are shown in
Figure 12. The importance of the maximum stress, stress amplitude, and stress ratio to the fracture location was determined.
Figure 11 only shows one result. The three colored bars represent the influence of stress ratio, stress amplitude, and maximum stress on the accuracy of output parameters from bottom to top. The horizontal axis represents the corresponding loss value, and the vertical axis represents different networks. As can be seen from
Figure 11, rear network 2 (the network that distinguishes BM and TMAZ in the rear network) has a higher average loss value than other networks when the network structure is similar to other networks, so its prediction effect is worse. It may be one of the reasons why this network cannot predict BM accurately. In addition, compared with other networks, rear network 2 is more sensitive to input parameters, so the above problems may be solved by improving reasonable learning data sets.
The overall importance analysis results of the network are shown in
Figure 12. The impact of the three input parameters on the results is relatively average, and there is no significant difference. However, the maximum stress accounts for a considerable proportion of the input parameters, which can determine the location of the fracture.
The influence of the maximum stress, stress amplitude, and stress ratio on the fracture location is shown in
Figure 13. The fracture location prediction data in the maximum stress range of 175~350 MPa and the stress ratio of −1~1 are simulated by using the quadratic classification network model.
As can be seen from
Figure 13a, for the maximum stress within a specific range, the fracture locations of the FSW joints of 7075-T651 aluminum alloy will shift from the HAZ to the WNZ through the BM and the TMAZ with the increase in the maximum stress without considering other factors and other input parameters. There is no apparent transfer relationship between stress amplitude and stress ratio. But it can be seen that the TMAZ is more sensitive, which means that the TMAZ is more prone to fracture for the same input parameters. To intuitively analyze the influence of parameter correlation on the fracture location based on this model, the variable influence matrix is shown in
Figure 14. The diagonal panels (a, e, i) quantify each parameter’s direct effect; off-diagonals illustrate interaction effects and the shifting fracture zone envelopes.
The right diagonal is a bar graph in grid format (
Figure 14a,e,i), and the rest of the graphs are symmetrical at the diagonal, indicating the influence of the interaction of two input parameters on the fracture location. Its graph is a contour map. The density of the lines indicates the amount of data. The closed graph sealed with the same color indicates the same data. As can be seen from
Figure 14g, no matter what parameter the ordinate is, with the increase in the maximum stress, the fracture location changes from a small area of the BM through the HAZ, the BM, and the TMAZ to the WNZ.
Figure 15 shows the enlarged line figure of
Figure 14d. The influence of these two parameters on fracture location can be seen more clearly. Within a certain range of stress ratio, the fracture location has a relatively obvious distribution law. In order to better divide the areas, the outliers can be considered as the negligible factors caused by fatigue experiment error or network classification error, and the inevitable fracture location area under a certain fixed stress ratio can also be temporarily excluded. Therefore, the region equation of the fatigue fracture location of the 7075-T651 aluminum alloy FSW joint can be proposed using the data set of fracture position boundary.
7. Regional Fracture Prediction Formula (RFPF)
From the network model, the fracture location shows an obvious regionality with the change in maximum stress and stress amplitude. Therefore, the regional partition points are extracted separately, as shown in
Figure 16. The color band represents the value of the stress ratio. The lighter the color, the greater the stress ratio.
In the range of maximum stress, 175 MPa to 350 MPa, and a stress ratio of −1 to 1, the boundary points of each region show a good linear relationship. It is possible to establish the regional equation with the data.
The region shown in
Figure 17a is a HAZ. The region line consists of three parts: the solid line segment representing the data boundary, the dotted line segment representing the boundary between the first area of the BM and the HAZ, and the dashed line segment representing the boundary between the HAZ and the second area of the BM. Although it is impossible to determine the accuracy outside the data boundary line, it can be roughly determined that the stress ratio range is −1 to 1. The range equation according to the Fourier series quadratic expansion and the boundary data is shown in Equation (17).
where
σa is the stress amplitude,
σmax is the maximum stress, and
R is the stress ratio.
The region shown in
Figure 17b is the second area of the BM. The area consists of a dotted line segment representing the boundary between the HAZ and the second area of the BM and a dashed line segment representing the boundary between the second area of the BM and the TMAZ. The stress ratio ranges from −1 to 0.8. The range equation is shown in Equation (18).
Figure 17c is the boundary of the TMAZ. The dotted line segment represents the boundary between the second area of the BM and the TMAZ, while the dashed line segment represents the boundary between the TMAZ and the WNZ. The stress ratio ranges from −1 to 1. The range equation is shown in Equation (19).
The area shown in
Figure 17d is the WNZ. The area line consists of a dotted line segment representing the boundary between the TMAZ and the WNZ and a solid line segment representing the data boundary line. The stress ratio ranges from −0.6 to 1.0. The range equation is shown below.
All the region equations are based on the Fourier series quadratic expansion shown in Equation (21).
where
ω2 is twice as much as
ω1 in general.
The microstructure in each area of FSW has different mechanical property parameters [
39]. The material properties of each area are listed in
Table 10. AS and RS in the table are the advancing side and retreating side of the FSW joint, respectively. In order to facilitate classification calculation, this paper does not consider the influence of AS and RS and sets them as the same area.
For the above regional equation, after several calculations, combined with the fracture strength and elastic modulus in
Table 10, the RFPF is proposed as Equation (22).
where
σmax represents the maximum stress in MPa.
η represents one of the characteristic values of the region, which is named as the lower limit of regional fracture in this paper.
A and
B are first-order matrices, which are used to represent the quadratic expansion of the Fourier series.
Equation (22) represents the lower limit of the fracture area, which is the leftmost segment of the area in
Figure 17. The regional fracture location rule is HAZ, BM, TMAZ, and WNZ. The upper limit of one area is the lower limit of the next area. Cases beyond the above the maximum stress and stress ratio setting range will not be considered.
A and
B are shown in Equation (23).
where
A represents the parameters of each region, and each region has its own regional parameters whose values are dimensionless.
B represents a triangular function matrix of stress amplitude, which consists of a sine function and a cosine function with coefficients corresponding to
p,
q,
m, and
n in
A, respectively.
σa represents the stress amplitude in MPa.
μ1 and
μ2 are dimensionless stress amplitude adjustment factors, and
μ1 = 2
μ2.
μ is related to the corresponding fracture strength and the modulus of elasticity for the region, as shown in Equation (24).
where
E represents the elastic modulus of the region in MPa.
σf indicates the fracture strength of the corresponding area in MPa.