Stress–Strain Hysteresis Loop-Based Machine Learning Models for Predicting Metal Fatigue Life Under Uncertainty

Zhong, Xian-Ci; Luo, Zhi-Yong; Zhang, Ke-Shi

doi:10.3390/ma18184336

Open AccessArticle

Stress–Strain Hysteresis Loop-Based Machine Learning Models for Predicting Metal Fatigue Life Under Uncertainty

by

Xian-Ci Zhong

^1,2,*

,

Zhi-Yong Luo

^1,2 and

Ke-Shi Zhang

^1,2

¹

School of Civil Engineering and Architecture, Guangxi University, Nanning 530004, China

²

Key Laboratory of Disaster Prevention and Structural Safety of Ministry of Education, Guangxi University, Nanning 530004, China

^*

Author to whom correspondence should be addressed.

Materials 2025, 18(18), 4336; https://doi.org/10.3390/ma18184336

Submission received: 24 April 2025 / Revised: 1 September 2025 / Accepted: 8 September 2025 / Published: 16 September 2025

(This article belongs to the Section Metals and Alloys)

Download

Browse Figures

Versions Notes

Abstract

This paper reports machine learning models for predicting metal fatigue life under uncertainty by extracting stress–strain data from hysteresis loops. First, the hysteresis loops of Q235B under strain-controlled constant amplitude loading are analyzed. The values of stress and strain in six key points are extracted from each hysteresis loop at the earliest stages of the fatigue process, and transformed into polar coordinates. Second, the uncertainty is quantified by extending the applied strain amplitude and the selected stress–strain values to intervals. A great deal of data are generated randomly in each interval for coping with the challenge of a small fatigue test dataset. Third, three machine learning models are constructed, where the parameters of the back-propagation neural network model are optimized by using the leave-one-out cross-validation technique, and the models of support vector regression and random forest are selected carefully. The point and interval predictions of the low-cycle-fatigue life of Q235B are reported to reveal the feasibility and advantage of the proposed models. The results help to identify how to understand the fatigue behavior of materials by combining machine learning models and stress–strain hysteresis loops.

Keywords:

fatigue life prediction; stress–strain hysteresis loop; polar coordinate; machine learning; uncertainty

1. Introduction

In practical engineering, fatigue is one of the key factors leading to component damage and failure [1]. To reasonably evaluate the service life of components, it is crucial to have a deep understanding of the fatigue mechanism and to construct a reliable fatigue life-prediction model [2]. Since metal is a commonly used engineering material, it is interesting to explain fatigue behaviors and predict fatigue life of metallic materials [3,4]. Recently, machine learning (ML) models have been used widely to predict metal fatigue life [5,6,7]. It is worth noting that the stress–strain hysteresis loop is an important foundation for characterizing and studying the fatigue behavior of metallic materials under cyclic loading. Thus, in the present study we develop ML models driven by reasonably selecting stress–strain data of hysteresis loops. This constitutes a novel attempt to understand the fatigue behavior of metallic materials according to a combination of ML techniques and hysteresis loops.

In machine learning models for predicting metal fatigue life, one of the important issues is to construct the training set. Initially, the training set is made by only considering the geometric and loading factors of fatigue specimens. For example, the geometric and loading data of 10 specimens with various welding defects were used to train a neural network for evaluating the fatigue life in [8]. The parameters of shot peening process and materials together with stress amplitude were considered to train a back-propagation neural network to predict the fatigue life of carbon steels in [9]. The location and size of critical defects inside additively manufactured metals were measured by using the high cycle fatigue postmortem examination and synchrotron X-ray tomography, and combined with the morphology of specimens to train a support vector machine (SVM) for fatigue life prediction in [10]. Additive manufacturing process parameters and fatigue loadings were used as the input data of machine learning models for predicting the fatigue life of printed SS 316L with defects in [11]. Fatigue, creep and creep-fatigue data of 316 stainless steel were integrated to train three machine learning models and a deep neural network for life prediction in [12]. For predicting multiaxial fatigue life, the training set was composed of the data derived from the multiaxial path and temperature in [13].

It is seen that the above-mentioned works only focus on the point predictions of fatigue life based on a fatigue test dataset. There are two other questions that should be addressed. One is that fatigue test datasets are usually small due to cost and time constraints. This means that ML models may not have been adequately trained. The other is the dispersion of fatigue life under uncertainty [14], which cannot be characterized according to point predictions. The former is the challenge of small fatigue test datasets, which can be coped with to a certain degree by augmenting and/or expanding the fatigue test dataset. For instance, the initial fatigue dataset was augmented by using the methods of inverse transform sampling and multivariate radial basis function (RBF) interpolation in [15]. A damage mechanics model was adopted to generate data such that the extreme gradient boosting (XGBoost) ML model was trained to predict the high-cycle-fatigue life of ZM6 in [16]. A great deal of data were randomly generated to expand the fatigue test dataset for training ML models in [17,18]. Moreover, it is revealed that physics-informed neural network models exhibit the advantages of addressing the challenge of small dataset [19,20,21]. The latter is the challenge of dispersion to fatigue life prediction, which can be addressed by considering uncertainty quantification to some extent [17,18,22]. Thus, the statistical distribution, mean, and standard variance of fatigue life have been predicted [23,24,25,26,27,28].

In addition, the fatigue experiments of metallic materials show that the stress–strain hysteresis loop is an important characterization of the fatigue process. The analysis of hysteresis loops contributes to the explanation of the fatigue mechanism and the prediction of fatigue life [29,30]. The data of force peaks at the earliest stages of the fatigue process have been extracted to train ML models and improve the accuracy of fatigue life predictions [17,18]. Lots of hysteresis-loop data were adopted in an ML model for predicting the low-cycle-fatigue life of wrought Mg alloys [31], where five features of 548 hysteresis loops were considered such as the loading direction, strain amplitude, number of loading cycles, strain in a hysteresis loop, and corresponding stress. To address the challenges of small a fatigue test dataset and uncertainty in fatigue life prediction, there are some research gaps in combining the data of hysteresis loops and ML techniques.

(1): Hysteresis loop reflects the energy dissipation and plastic-deformation behavior of materials under alternating stress. There is a close relationship between hysteresis loops and fatigue life. How can data be selected from each hysteresis loop such that the features affecting fatigue life predictions can be extracted? Although the stress–strain data of hysteresis loops have been considered in predicting fatigue life [31], the above-mentioned question should still be investigated in depth.
(2): There are too many hysteresis loops and stress–strain values in a hysteresis loop. How many hysteresis loops and stress–strain values should be selected to train ML models for reasonably predicting metal fatigue life? It is seen from the existing works in [17,18] that the earlier 200 cycles have been considered with a step size. When considering multiple stress–strain data, the number of hysteresis loops should still be addressed carefully.
(3): Uncertainty inevitably exists in geometry, environment, and microstructure of specimens. How can the uncertainty be quantified by selecting the stress and strain data from hysteresis loops such that the dispersion of fatigue life can be characterized? The uncertainty has been quantified as an interval with probability distributions and the statistical property of fatigue life prediction has been considered [23,24,25,26,27]. However, when the stress–strain data are extracted from hysteresis loops, the uncertainty should still be considered for characterizing the dispersion of fatigue life.

This paper focuses on the above important issues to propose ML models for point and interval predictions of fatigue life. Some novel contributions to the literature are worth mentioning below:

(a): The influence of maximum stress, minimum stress, mean stress, and residual stress on uniaxial fatigue life is considered. The corresponding stress–strain data are selected from each hysteresis loop as the input data of ML models.
(b): It is considered that the hysteresis loops at the earliest stages of the fatigue process play an important role in predicting fatigue life. The early 10 hysteresis loops are selected and analyzed to obtain the optimal dataset.
(c): By considering multiple specimens, the selected stress–strain values from hysteresis loops are extended to intervals. A great deal of randomly generated data are used to train ML models and predict fatigue life with dispersion. In addition, the leave-one-out cross-validation (LOOCV) method is used to optimize the parameters of ML models.

The structure of this paper is as follows. Section 2 focuses on the method of constructing the dataset according to stress–strain hysteresis loops under uncertainty. In Section 3, we elaborate on the models of back-propagation (BP) neural network, support vector regression (SVR), and random forest (RF). The parameter optimizations of ML models are addressed. Section 4 reports the point prediction and interval prediction of the low-fatigue life of Q235B. Some comparisons are offered to reveal the feasibility of the developed models. The conclusion and some research directions are provided in Section 5.

2. Dataset Establishment

In the following, we elaborate on the method of extracting the stress–strain data from hysteresis loops according to fatigue experiments of Q235B under strain-controlled constant amplitude.

2.1. Experimental Data

The metallic material of Q235B steel is used to conduct fatigue tests under constant strain amplitudes. The mechanical properties and main chemical composition are given in Table 1 and Table 2 [32], respectively.

The geometry of the Q235B steel smooth specimen is shown in Figure 1 [32]. Based on the standard of ASTM E466 [33], the fatigue test is carried out on the MTS809 electro-hydraulic servo tensile and torsion testing machine (MTS Systems Corporation, Eden Prairie, MN, USA) at room temperature. Under the strain-controlled constant amplitudes of 0.004, 0.005, 0.006, and 0.008, the logarithmic fatigue lives are shown in Table 3 [32], where 3 specimens are used for each strain amplitude by following the standard of ASTM E466 [33]. It should be pointed out that specimens are acted on by tensile and compressive loadings, where the loading waveform is a sinusoidal wave with a maximum frequency of 2 Hz. There is no rotation during the process of a fatigue test.

Here, we focus in particular on the stress–strain hysteresis loops such as those in Figure 2, Figure 3, Figure 4 and Figure 5 by using the earliest 10 cycles. It can be seen from Figure 2, Figure 3, Figure 4 and Figure 5 that the strain amplitude is unchanged during the fatigue process under each strain loading. The stress and corresponding strain are the main physical quantities to characterize the fatigue process. The maximum stress and the minimum stress together with the mean stress are the basic parameters to describe a hysteresis loop. In the present study, the stress–strain values for describing a hysteresis loop should be carefully selected to effectively train an ML model.

2.2. Compilation of the Dataset

The establishment of a dataset depends on stress–strain hysteresis loops and uncertainty quantification. A scheme of a dataset compilation process is shown in Figure 6.

First, we focus on the question of how to extract data from stress–strain hysteresis loops. It is seen that the controlled strain amplitude of fatigue tests corresponds to the points of the maximum and minimum strains. This means that the two points with maximum strains should be selected. In addition, we can see from stress–strain hysteresis loops that the sudden changes in the slopes of the curves are near the points with half strain amplitude. For the sake of simplicity, the points corresponding to half strain amplitude should be selected. That is, we select six key points by considering the characteristics of hysteresis loops as shown in Figure 7. Hereafter,

P_{1} - P_{6}

correspond to the maximum strain, half strain amplitude, negative half strain amplitude, and minimum strain, which are expressed as

(ε_{m a x}, σ_{m a x}),

(ε_{h}, σ_{h l}),

(ε_{n h}, σ_{n h l}),

(ε_{m i n}, σ_{m i n}),

(ε_{n h}, σ_{n h u})

, and

(ε_{h}, σ_{h u}),

respectively. Moreover, by considering the normalization of input data for an ML model, the stress values in six points are further rewritten as dimensionless

σ_{m a x} / E,

σ_{h l} / E,

σ_{n h l} / E,

σ_{m i n} / E,

σ_{n h u} / E

, and

σ_{h u} / E,

where E stands for the Young’s modulus of Q235B. It can be seen from the six points in Figure 7 that the strain value of

ε_{n h}

or

ε_{h}

corresponds to two points. If the stress and the corresponding strain in Cartesian coordinates are directly used as the input data of ML models, the points cannot be distinguished for characterizing hysteresis loops. In addition, the input data of ML models are always normalized to improve their performance, meaning that the negative strain values should be addressed. Therefore, we change the Cartesian coordinates of the six points to polar coordinates as shown in Figure 8. That is, we have the following transformations:

\begin{matrix} P_{1} : (ε_{m a x}, \frac{σ_{m a x}}{E}) \to (\sqrt{ε_{m a x}^{2} + \frac{σ_{m a x}^{2}}{E^{2}}}, arctan \frac{σ_{m a x}}{ε_{m a x} E}), \end{matrix}

(1)

\begin{matrix} P_{2} : (ε_{h}, \frac{σ_{h l}}{E}) \to (\sqrt{ε_{h}^{2} + \frac{σ_{h l}^{2}}{E^{2}}}, 2 π + arctan \frac{σ_{h}}{ε_{h} E}), \end{matrix}

(2)

\begin{matrix} P_{3} : (ε_{n h}, \frac{σ_{n h l}}{E}) \to (\sqrt{ε_{n h}^{2} + \frac{σ_{n h l}^{2}}{E^{2}}}, π + arctan \frac{σ_{n h l}}{ε_{n h} E}), \end{matrix}

(3)

\begin{matrix} P_{4} : (ε_{m i n}, \frac{σ_{m i n}}{E}) \to (\sqrt{ε_{m i n}^{2} + \frac{σ_{m i n}^{2}}{E^{2}}}, π + arctan \frac{σ_{m i n}}{ε_{m i n} E}), \end{matrix}

(4)

\begin{matrix} P_{5} : (ε_{n h}, \frac{σ_{n h u}}{E}) \to (\sqrt{ε_{n h}^{2} + \frac{σ_{n h u}^{2}}{E^{2}}}, π + arctan \frac{σ_{n h u}}{ε_{n h} E}), \end{matrix}

(5)

\begin{matrix} P_{6} : (ε_{h}, \frac{σ_{h u}}{E}) \to (\sqrt{ε_{h}^{2} + \frac{σ_{h u}^{2}}{E^{2}}}, arctan \frac{σ_{h}}{ε_{h} E}) . \end{matrix}

(6)

When the early k hysteresis loops under a strain amplitude are considered, we can obtain

6 k

points. In addition, since four strain amplitudes are used for fatigue tests of Q235B here, there are a total of

4 \times 6 k

points.

Second, the existing uncertainty in fatigue tests is quantified as an interval. On the one hand, we consider that the axial sensitivity of the extensometer of length 25 mm for the used MTS809 is

\pm 0.002

mm. Therefore, each strain amplitude is extended to an interval as given in Table 4 [17]. On the other hand, it is considered that three specimens are tested under each strain amplitude. The interval logarithmic fatigue lives are determined and given in Table 4 by using the maximum–minimum method. Similarly, the maximum–minimum method is used to expand the

6 k

points in hysteresis loops under each strain amplitude to

2 \times 6 k

intervals, which are written as

{\bar{D}}_{1} - {\bar{D}}_{4}

in Table 4 by considering the four strain amplitudes, respectively.

Third, we randomly generate a great deal of data in each interval to construct the dataset. For example, by considering the strain amplitude

0.004

and the normal distribution, 50 points are randomly generated in a strain amplitude interval

[0.00392, 0.00408],

each polar coordinate interval in

{\bar{D}}_{1}

, and fatigue life interval

[3.9129, 3.9946],

respectively. Then, 50 vectors are constructed and represented in the following form:

{\vec{C}}_{1} = (s_{a}^{1}; x_{1}^{1}, g_{1}^{1}, \dots, x_{6 k}^{1}, g_{6 k}^{1}; N_{f}^{1}) .

(7)

Here, the symbol

s_{a}^{1}

is the randomly generated strain amplitude;

x_{i}^{1}

and

g_{i}^{1}

(i \in {1, 2, \dots, 6 k})

stand for the values of the polar axis and the polar angle;

N_{f}^{1}

denotes the fatigue life. Similarly, when we consider the strain amplitude

0.005, 0.006

, and

0.008,

50 vectors can also be randomly generated, and represented by

{\vec{C}}_{2},

{\vec{C}}_{3}

and

{\vec{C}}_{4},

respectively. Thus, the dataset for ML models is constructed and written in the following form:

X = \{{\vec{C}}_{1}, {\vec{C}}_{2}, {\vec{C}}_{3}, {\vec{C}}_{4}\} .

(8)

When 50 points are randomly generated in each interval, there are 200 samples in

X

due to four strain amplitudes.

3. Machine Learning Models

Once the dataset is established by selecting feature points in stress–strain hysteresis loops under uncertainty, three ML models are developed for point predictions and interval predictions of fatigue life.

3.1. Back-Propagation Neural Network

The BP neural network is the most commonly used model and has good nonlinear mapping ability. There are always the input layer, the hidden layers, and the output layer in a BP neural network, as shown in Figure 9. Each layer contains a certain number of neurons, which are not connected to each other. The neurons in adjacent layers are fully connected with weights and activated by using an activation function

f (x)

such as the Sigmoid function:

f (x) = \frac{1}{1 + e^{- c x}},

(9)

where the term c is a positive constant. The calculation of the BP neural network mainly consists of two parts: the forward-propagation of the input information and the back-propagation of the calculation error. The output of a neuron is expressed as:

y_{j} = f (\sum_{i = 1}^{m} w_{i j} x_{i} + b_{j}),

(10)

where

w_{i j}

is the weight of the j-th neuron connected to the previous layer;

x_{i}

is the value of the i-th neuron in the previous layer, and

b_{j}

is the threshold.

To quantify the prediction accuracy, the coefficient of determination

R^{2}

and mean squared error (MSE) are usually used. The formulae of

R^{2}

and MSE are given below:

\begin{matrix} R^{2} (y, y^{p r e}) & = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - y_{i}^{p r e})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - y_{m e a n})}^{2}}, \end{matrix}

(11)

\begin{matrix} M S E (y, y^{p r e}) & = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - y_{i}^{p r e})}^{2} . \end{matrix}

(12)

Here, the term

y_{i}

stands for the i-th experimental value;

y_{i}^{p r e}

is the i-th predicted value, and

y_{m e a n}

is the mean value of experimental values. It can be seen that the closer the value of

R^{2}

is to 1, or the smaller the value of MSE, the more accurate the predictions are. In addition, the coverage ratio of prediction interval (PICR) is utilized to describe the ratio for the predicted values that belong to a specific interval:

P I C R = \frac{1}{K} \sum_{i = 1}^{K} φ_{i},

(13)

where K is the number of specimens and

φ_{i} = 1

means that the predicted value falls into the interval; otherwise,

φ_{i} = 0 .

Prior to the application of the BP neural network model, we should determine the numbers of hidden layers and neurons. In the present study, by considering the small dataset

X,

leave-one-out cross validation (LOOCV) is used to give a stable and effective BP neural network. That is, when we consider 200 samples in

X,

the LOOCV has the following steps:

Step 1:: One sample is selected to construct the test set and the others are used to construct the training set.
Step 2:: The BP neural network model is carried out to give the MSE.
Step 3:: When each sample has been used as the test set, the average value of 200 MSEs is considered as the performance evaluation of the model.
Step 4:: The above steps are used for different BP neural network models and the best one is selected according to their performance evaluations.

Based on the above procedure, it is found that when the number of hidden layers is three, the numbers of neurons are determined as 18, 13, and 21 from the first hidden layer to the third hidden layer. The obtained neural network model will be used to predict the low-cycle-fatigue life of Q235B in the following section.

3.2. Support Vector Regression

SVR has obvious advantages in nonlinear mapping and high-dimensional pattern recognition. The key idea of SVR is to find a hyperplane that minimizes the distances from all sample points to this plane. Since the output of SVR for predicting fatigue life is a continuous value, an

ε

-insensitive loss function is introduced to achieve regression analysis. As shown in Figure 10, the

ε

-SVR determines a function

g (\vec{x}) = {\vec{ω}}^{T} \cdot \vec{x} + b

such that each sample

({\vec{x}}_{i}, y_{i})

in the training set is as close as possible to

g (\vec{x}) .

Letting the maximum allowable error be

ε,

the

ε

-insensitive loss function

l_{ε}

is given as follows:

l_{ε} = \{\begin{matrix} 0, & | g ({\vec{x}}_{i}) - y_{i} | < ε, \\ | g ({\vec{x}}_{i}) - y_{i} | - ε, & otherwise . \end{matrix}

(14)

Therefore, an optimization model is formed below:

min_{\vec{ω}, b} \frac{1}{2} {∥ \vec{ω} ∥}^{2} + C \sum_{i = 1}^{N} l_{ε},

(15)

where N is the number of samples in the training set and C is the regularization parameter or penalty coefficient. Once the weight vector

\vec{ω}

and b are determined, the regression function

g (\vec{x})

is used for prediction. When the training set is not linearly separable, a nonlinear kernel function is always introduced to transform the dataset to a high-dimensional space to make it linearly separable. In the present study, the radial basis function is selected as the kernel. In order to determine the parameters

ε

and

C,

we consider the variations of

R^{2}

in Figure 11. It is found that when

ε = 0.01

and

C = 10,

the prediction accuracy is the highest.

3.3. Random Forest

RF is an ML algorithm based on statistical theory and ensemble learning strategy. The network structure of an RF model is shown in Figure 12. The Bootstrap sampling method is used to extract multiple sample sets from the training set to construct multiple decision trees. Each decision tree is applied to give an independent prediction. The final prediction is determined by voting or averaging. In the present study, assume that the training set of each decision tree is randomly selected as

T_{k} = {({\vec{x}}_{1}^{k}, y_{1}^{k}), \dots, ({\vec{x}}_{l}^{k}, y_{l}^{k})}

, where

{\vec{x}}_{i}^{k} = (x_{i}^{1}, x_{i}^{2}, \dots, x_{i}^{n})

is an input vector with n elements, and

y_{i}^{k}

is the output value. If RF contains

K \geq 1

decision trees and the predicted values are

y_{i}^{p r e}

(i = 1, 2, \dots, K)

, the final predicted value

y^{p r e}

is calculated as:

y^{p r e} = \frac{1}{K} \sum_{i = 1}^{K} y_{i}^{p r e} .

(16)

When RF is used, the number of trees

n_{t}

and the depth of trees

n_{d}

should be selected carefully. Figure 13 is drawn to show the effects of

n_{t}

and

n_{d}

on MSE. It is found from Figure 13 that when the depth of trees is fixed, the values of MSE decrease with the increasing number of trees under

n_{t} \leq 100 .

When

n_{t} \geq 100,

the value of MSE is changed slightly for a fixed depth of trees. Moreover, when the number of trees is fixed as

n_{t} \leq 100,

the values of MSE drop down. Observation reveals that the values of

n_{t} = 100

and

n_{d} = 15

are optimal by considering the computational cost. For convenience, the hyperparameters used in the three ML models are summarized in Table 5.

4. Results and Discussion

Based on the developed ML models, we predict the low-cycle-fatigue life of Q235B by using the constructed dataset

X .

As shown in Table 4, the training dataset is formed by considering the strain amplitudes

0.004,

0.005

and

0.006;

and the test dataset is based on the strain amplitude

0.008 .

In order to illustrate the effects of the input data on the predictions, four cases in Table 6 are considered. In case 1, the input data include the strain amplitude and the selected six points at each hysteresis loop of the earliest 10 cycles of fatigue process. This implies that the dimension of input data in case 1 is 61. Similarly, we can see that the dimensions of input data in cases 2, 3, and 4 are 31, 60, and 30, respectively. Moreover, in each interval of

X,

50 data points are randomly generated to strain the developed ML models and carry out the point/interval predictions of the low-cycle-fatigue life of Q235B.

4.1. Point and Interval Predictions Based on BP Neural Network Model

In the following, the BP neural network model is used, where there are three hidden layers with 18, 13, and 21 neurons. The performance of the strained BP model on the training set is shown in Figure 14 by comparing the predicted values and the experimental data of fatigue life. It is seen from Figure 14 that all the points for four cases are located in the two-fold error band and approximate to the diagonal. This means that there is good accuracy for the training set when using the BP neural network model. A comparison between the four cases reveals that a point in case 4 is further away from the diagonal than the other points. The phenomenon shows that the performance of the trained BP model for case 4 is slightly worse than the other cases.

Furthermore, we use the trained BP neural network models for the four cases to predict the low-cycle-fatigue lives of specimens A10–A12 under the strain amplitude of 0.008. Figure 15 is drawn to show the comparison between the predicted values and the experimental data by using the trained BP neural network models for cases 1–4, respectively. We can find from Figure 15 that the predicted values all belong to the two-fold error band. Observation reveals that the hysteresis loop-based BP neural network model is effective with regards to giving an acceptable prediction of fatigue life. We further compare cases 1–4 and find that the obtained results of case 1 are the most accurate. A comparison between case 1 and case 2 shows that the use of the early 10 hysteresis loops is better than the early 5 hysteresis loops. A comparison between case 1 and case 3 reveals that the strain amplitude plays an important role in the prediction of fatigue life. When the influence of the strain amplitude on the prediction of fatigue life is neglected, the results of cases 3 and 4 are still acceptable for the developed models.

At the end, we focus on the dispersion of fatigue life since the same specimen configuration yields different fatigue test lives under the same strain amplitude. Based on the uncertainty quantification as shown in Table 4, the randomly generated data are used to predict the interval-valued logarithm fatigue lives of specimens A10–A12 as given in Figure 16. It is found from Figure 16 that the mean values of the predicted logarithm fatigue lives are all located in the interval

[3.1452, 3.4] .

Comparisons between cases 1–4 indicate that the mean value of case 1 is less than those of cases 2–4. The variability for the four cases is significant, since the values of many points are less than 3.1452 or more than 3.4. The underlying reason is that the input data are completely randomly generated. In order to more accurately characterize the dispersion of predictions, some physical knowledge should be used to capture the relationships between the input data and the fatigue life.

4.2. Point and Interval Predictions Based on SVR Model

Now, the SVR model with

ε = 0.01

and

C = 10

is adopted to give point and interval predictions of the low-cycle-fatigue life of Q235B. First, the training set including specimens A1–A9 is used to construct the SVR models under cases 1–4, respectively, and the performance is given in Figure 17. A comparison between the predicted values and the experimental data for cases 1–4 shows that the fatigue lives of A1–A9 have been evaluated correctly by using the SVR models. This means that the developed SVR models have been constructed reasonably and can be used for predictions.

Second, the constructed SVR models under cases 1–4 are used to predict the fatigue lives of specimens A10–A12 in the test set. The obtained results are given in Figure 18, where the predicted values and the experimental data are compared. One can see that the predicted results under case 1 perform the best and those under case 4 are the worst. This phenomenon is in agreement with the finding for the BP neural network model. By comparing Figure 15 and Figure 18, it is found that the prediction accuracy of the SVR model is higher than the BP neural network model.

Third, we apply the constructed SVR models under cases 1–4 to give interval predictions of the low-cycle-fatigue lives of specimens A10–A12. Figure 19 is depicted to show the probability distributions of the predicted values under the four cases. It can be seen from Figure 19 that the dispersion of fatigue life is still high due to the randomness of generated data. This observation is similar to the findings in Figure 16 based on the BP neural network model. The difference is the mean values of the predicted fatigue life in Figure 19, which are less than those in Figure 16 under cases 1–4, respectively.

4.3. Point and Interval Predictions Based on RF Model

In what follows, the RF model with

n_{t} = 100

and

n_{d} = 15

is selected to predict the fatigue lives of specimens A10–A12. First, the performance of the trained RF model on the training set is investigated under cases 1–4 and is shown in Figure 20. The obtained results show that the points are distributed around the diagonal. This means a good accuracy of the computed fatigue lives of specimens A1–A9 for cases 1–4, respectively. As compared to Figure 14 and Figure 17, it is revealed that the ML models can all be trained effectively.

Second, we predict the fatigue lives of specimens A10–A12 based on the RF models under cases 1–4. As shown in Figure 21, the predicted fatigue lives are all located at the two-fold error band, meaning that the RF models are effective. A comparison between cases 1–4 reveals that the predicted values under case 1 are the most accurate. The observation is in accordance with the findings based on the BP neural network model and the SVR model. The novel finding in Figure 21 is that the predicted results under cases 1 and 3 are better than those under cases 2 and 4. This means that the RF model is more influenced by the dimensionality of input data. In addition, we compare Figure 21 with Figure 15 and Figure 18 to find that the accuracy of the RF model is higher than the BP neural network model and the SVR model for case 1.

Third, we provide the interval predictions of fatigue lives of specimens A10–A12 based on the RF models in Figure 22. One can see from Figure 22 that the dispersion of fatigue life is captured by considering the uncertainty quantification and using the randomly generated data. The dispersion degree of the predicted fatigue lives is approximate to those in Figure 16 under the BP neural network model and Figure 19 under the SVR model. To further show the difference between the developed ML models, the mean values and variances of probability distributions in interval predictions are given in Table 7. This finding reveals that the randomness of generated data is the main factor affecting the variability of predictions. The appropriate physical knowledge should be used to improve the characterization of dispersion of fatigue life predictions.

In summary, the developed ML models can be used to effectively give point predictions of fatigue life of Q235 by constructing the hysteresis loop-based dataset. By considering the uncertainty quantification, the interval predictions are provided to characterize the dispersion in fatigue life prediction. As compared to the models in [17], the novelty comes from the method of constructing the dataset. Different from the approach in [31], here we only use the 10 hysteresis loops at the earliest stages of the fatigue process to give a good prediction. As shown in [34], it is convenient to compare the performances of three models for point predictions of fatigue life in Table 8. The changes in performance of ML models are statistically significant by considering the influences of the data extracted from stress–strain hysteresis loops. The basic reason is that the dispersion of fatigue life is attributed to the difference in stress–strain hysteresis loops for the same specimen. The statistically significant can be used for future studies, when the implicit physical mechanism in stress–strain hysteresis loops is characterized by the extracted data.

5. Conclusions

Understanding fatigue behavior of metallic materials plays an important role in engineering. The fatigue strength is an important parameter to characterize the fatigue behavior. Under different fatigue loadings, the fatigue life of materials should be evaluated for the safety of engineering structures. In this paper, we have considered the dependence of metal fatigue life on the stress–strain hysteresis loops. Three machine learning models—the back-propagation neural network model, the support vector regression model, and the random forest model—were developed to predict the low life fatigue of Q235B. Some interesting conclusions are worth mentioning below:

(a): The constructed dataset based on the early 10 hysteresis loops is effective at characterizing the fatigue features under different strain amplitudes.
(b): The challenge of small fatigue test datasets has been addressed to a certain degree according to uncertainty quantification.
(c): Random forest is more accurate than the other two models for point predictions of fatigue life by considering 10 hysteresis loops and strain loadings.

Our work has provided a novel perspective for predictions of metal fatigue life by considering the dependence on the stress–strain hysteresis loops at the earliest stages of the fatigue process. However, there are still some opportunities for further research concerning the following: (1) How to physically interpret the randomly generated data to enhance the reliability of the trained machine learning models. (2) How to characterize the dependence of fatigue life on the feature points used in machine learning models. (3) How to develop physics-informed machine learning models to capture the dispersion in fatigue life prediction. (4) How to reveal the fatigue mechanism according to hysteresis loop-based machine learning models for predicting fatigue life.

Author Contributions

Conceptualization, X.-C.Z. and Z.-Y.L.; methodology, X.-C.Z. and Z.-Y.L.; software, Z.-Y.L.; validation, X.-C.Z., Z.-Y.L. and K.-S.Z.; formal analysis, X.-C.Z. and Z.-Y.L.; data curation, Z.-Y.L.; writing—original draft preparation, X.-C.Z. and Z.-Y.L.; writing—review and editing, K.-S.Z.; project administration, X.-C.Z.; funding acquisition, X.-C.Z. All authors have read and agreed to the published version of the manuscript.

Funding

The work was supported by the National Natural Science Foundation of China (No. 11872155).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

The authors would like to thank the anonymous reviewers for providing valuable and constructive suggestions to improve the quality of the paper.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

ML	Machine learning
BP model	Back-propagation neural network model
MSE	Mean squared error
PICP	The coverage ratio of prediction interval
RF	Random forest
SVM	Support vector machine
SVR	Support vector regression

References

Suresh, S. Fatigue of Materials; Cambridge University Press: Cambridge, UK, 1998. [Google Scholar]
Schütz, W. A history of fatigue. Eng. Fract. Mech. 1996, 54, 263–300. [Google Scholar] [CrossRef]
Stinville, J.C.; Charpagne, M.A.; Cervellon, A.; Hemery, S.; Wang, F.; Callahan, P.G.; Valle, V.; Pollock, T.M. On the origins of fatigue strength in crystalline metallic materials. Science 2022, 377, 1065–1071. [Google Scholar] [CrossRef] [PubMed]
Pineau, A.; McDowell, D.L.; Busso, E.P.; Antolovich, S.D. Failure of metals II: Fatigue. Acta Mater. 2016, 107, 484–507. [Google Scholar] [CrossRef]
Chen, J.; Liu, Y.M. Fatigue modeling using neural networks: A comprehensive review. Fatigue Fract. Eng. Mater. Struct. 2022, 45, 945–979. [Google Scholar] [CrossRef]
Wang, H.J.; Li, B.; Gong, J.G.; Xuan, F.Z. Machine learning-based fatigue life prediction of metal materials: Perspectives of physics-informed and data-driven hybrid methods. Eng. Fract. Mech. 2023, 284, 109242. [Google Scholar] [CrossRef]
Xu, Q.W.; Song, G.Q.; Liu, X.W.; Wang, Y.J.; Sha, A.X.; Wei, Y.Y.; Hao, W.F. Low-cycle fatigue life prediction of titanium-based intermetallic alloys using machine learning and finite element analysis. Materials 2025, 18, 1887. [Google Scholar] [CrossRef]
Han, X.L. Artificial neural network technology as a method to evaluate the fatigue life of weldments with welding defects. Int. J. Pres. Ves. Pip. 1995, 63, 205–209. [Google Scholar] [CrossRef]
Maleki, E.; Unal, O.; Kashyzadeh, K.R. Fatigue behavior prediction and analysis of shot peened mild carbon steels. Int. J. Fatigue 2018, 116, 48–67. [Google Scholar] [CrossRef]
Bao, H.; Wu, S.; Wu, Z.; Kang, G.; Peng, X.; Withers, P.J. A machine-learning fatigue life prediction approach of additively manufactured metals. Eng. Fract. Mech. 2021, 242, 107508. [Google Scholar] [CrossRef]
Zhan, Z.X.; Li, H. Machine learning based fatigue life prediction with effects of additive manufacturing process parameters for printed SS 316L. Int. J. Fatigue 2021, 142, 105941. [Google Scholar] [CrossRef]
Zhang, X.C.; Gong, J.G.; Xuan, F.Z. A deep learning based life prediction method for components under creep, fatigue and creep-fatigue conditions. Int. J. Fatigue 2021, 148, 106236. [Google Scholar] [CrossRef]
Yang, J.Y.; Kang, G.Z.; Liu, Y.J.; Kan, Q.H. A novel method of multiaxial fatigue life prediction based on deep learning. Int. J. Fatigue 2021, 151, 106356. [Google Scholar] [CrossRef]
Giannella, V. Uncertainty quantification in fatigue crack-growth predictions. Int. J. Fracture 2022, 235, 179–195. [Google Scholar] [CrossRef]
Horňas, J.; Bĕhal, J.; Homola, P.; Doubrava, R.; Holzleitner, M.; Senck, S. A machine learning based approach with an augmented dataset for fatigue life prediction of additively manufactured Ti-6Al-4V samples. Eng. Fract. Mech. 2023, 293, 109709. [Google Scholar] [CrossRef]
Gao, T.Z.; Zhan, Z.X.; Hu, W.P.; Meng, Q.C. A novel damage mechanics and XGBoost based approach for HCF life prediction of cast magnesium alloy considering internal defect characteristics. Int. J. Fatigue 2024, 182, 108220. [Google Scholar] [CrossRef]
Zhong, X.C.; Xie, R.K.; Qin, S.H.; Zhang, K.S. A process-data-driven BP neural network model for predicting interval-valued fatigue life of metals. Eng. Fract. Mech. 2022, 276, 108918. [Google Scholar] [CrossRef]
Xie, R.K.; Zhong, X.C.; Qin, S.H.; Zhang, K.S.; Wang, Y.R.; Wei, D.S. Predicting multiaxial fatigue life of FGH96 superalloy based on machine learning models by considering failure process and loading paths. Int. J. Fatigue 2023, 175, 107730. [Google Scholar] [CrossRef]
Zhang, X.C.; Gong, J.G.; Xuan, F.Z. A physics-informed neural network for creep-fatigue life prediction of components at elevated temperatures. Eng. Fract. Mech. 2021, 258, 108130. [Google Scholar] [CrossRef]
Chen, D.; Li, Y.Z.; Liu, K.; Li, Y. A physics-informed neural network approach to fatigue life prediction using small quantity of samples. Int. J. Fatigue 2023, 166, 107270. [Google Scholar] [CrossRef]
Jing, G.X.; Ma, T.; Wang, Z.Q.; Fu, Y.H.; Chen, G.; Ma, T.; Sun, X.X. Physical hierarchical neural network for low-cycle-fatigue life prediction of compacted graphite cast iron based on small data. Int. J. Fatigue 2024, 188, 108509. [Google Scholar] [CrossRef]
Avoledo, E.; Tognan, A.; Salvati, E. Quantification of uncertainty in a defect-based physics-informed neural network for fatigue evaluation and insights on influencing factors. Eng. Fract. Mech. 2023, 292, 109595. [Google Scholar] [CrossRef]
Tognan, A.; Patanè, A.; Laurenti, L.; Salvati, E. A Bayesian defect-based physics-guided neural network model for probabilistic fatigue endurance limit evaluation. Comput. Methods Appl. Mech. Eng. 2024, 418, 116521. [Google Scholar] [CrossRef]
Zhou, T.T.; Jiang, S.; Han, T.; Zhu, S.P.; Cai, Y.N. A physically consistent framework for fatigue life prediction using probabilistic physics-informed neural network. Int. J. Fatigue 2023, 166, 107234. [Google Scholar] [CrossRef]
Wang, X.X.; Chen, H.F.; Xuan, F.Z. Neural network-assisted probabilistic creep-fatigue assessment of hydrogenation reactor with physics-based surrogate model. Int. J. Pres. Ves. Pip. 2023, 206, 105051. [Google Scholar] [CrossRef]
Yi, F.; Lei, H.; Lv, Q.F.; Zhang, Y. Coupling physics in artificial neural network to predict the fatigue behavior of corroded steel wire. Int. J. Fatigue 2025, 190, 108669. [Google Scholar] [CrossRef]
Chen, J.; Liu, Y.M. Probabilistic physics-guided machine learning for fatigue data analysis. Expert Syst. Appl. 2021, 168, 114316. [Google Scholar] [CrossRef]
Wang, H.J.; Li, B.; Lei, L.M.; Xuan, F.Z. Uncertainty-aware fatigue-life prediction of additively manufactured Hastelloy X superalloy using a physics-informed probabilistic neural network. Reliab. Eng. Syst. Saf. 2024, 243, 109852. [Google Scholar] [CrossRef]
Dallmeier, J.; Denk, J.; Huber, O.; Saage, H.; Eigenfeld, K. A phenomenological stress–strain model for wrought magnesium alloys under elastoplastic strain-controlled variable amplitude loading. Int. J. Fatigue 2015, 80, 306–323. [Google Scholar] [CrossRef]
Cao, J.F.; Guo, W.L. The hysteresis loop on the near-threshold fatigue crack growth curves generated by stepped load reduction and constant-amplitude loading methods in a Ni-based superalloy. Int. J. Fatigue 2025, 191, 108698. [Google Scholar] [CrossRef]
Yu, J.; Lee, S.H.; Cheon, S.; Park, S.H.; Lee, T. Alternative predictive approach for low-cycle fatigue life based on machine learning and energy-based modeling. J. Magnesium Alloys 2024, 12, 4075–4084. [Google Scholar] [CrossRef]
Qin, S.H.; Xiong, Z.Y.; Ma, Y.S.; Zhang, K.S. Low-cycle-fatigue life evaluation of notched specimens considering strain gradient. Materials 2020, 13, 1001. [Google Scholar] [CrossRef]
E466-21; Standard Practice for Conducting Force Controlled Constant Amplitude Axial Fatigue Tests of Metallic Materials. ASTM International: West Conshohocken, PA, USA, 2021.
Asgarkhani, N.; Kazemi, F.; Jankowski, R. Machine learning-based prediction of residual drift and seismic risk assessment of steel moment-resisting frames considering soil-structure interaction. Comput. Struct. 2023, 289, 107181. [Google Scholar] [CrossRef]

Figure 1. Geometry of smooth Q235B specimen (unit: mm).

Figure 2. The stress–strain hysteresis loops of a specimen under strain amplitude 0.004.

Figure 3. The stress–strain hysteresis loops of a specimen under strain amplitude 0.005.

Figure 4. The stress–strain hysteresis loops of a specimen under strain amplitude 0.006.

Figure 5. The stress–strain hysteresis loops of a specimen under strain amplitude 0.008.

Figure 6. Scheme of dataset compilation process.

Figure 7. Six key points by considering the characteristics of a hysteresis loop.

Figure 8. Six key points of a hysteresis loop in polar coordinates.

Figure 9. BP neural network model driven by the dataset

X .

Figure 9. BP neural network model driven by the dataset

X .

Figure 10. Flowchart of SVR.

Figure 11. Variations of

R^{2}

versus

ε

by selecting

C = 1, 10, 20

, and

50,

respectively.

Figure 11. Variations of

R^{2}

versus

ε

by selecting

C = 1, 10, 20

, and

50,

respectively.

Figure 12. Network structure of RF.

Figure 13. The effects of the number of trees

n_{t}

and the depth of trees

n_{d}

on MSE.

Figure 13. The effects of the number of trees

n_{t}

and the depth of trees

n_{d}

on MSE.

Figure 14. Comparison between the predicted values and the experimental results of the training set based on the trained BP neural network models for cases 1–4, respectively.

Figure 15. Comparison between the predicted values and the experimental results of the test set based on the trained BP neural network models for cases 1–4, respectively.

Figure 16. Interval predictions of logarithm fatigue life for specimens A10–A12 based on the BP neural network models for cases 1–4, respectively. The blue points are the predicted values based on the randomly generated input data.

Figure 17. Comparison between the predicted values and the experimental results of the training set based on the constructed SVR models for cases 1–4, respectively.

Figure 18. Comparison between the predicted values and the experimental results of the test set based on the constructed SVR models for cases 1–4, respectively.

Figure 19. Interval predictions of logarithm fatigue life for specimens A10–A12 based on the SVR models for cases 1–4, respectively. The blue points are the predicted values based on the randomly generated input data.

Figure 20. Comparison between the predicted values and the experimental data of the training set based on the selected RF models for cases 1–4, respectively.

Figure 21. Comparison between the predicted values and the experimental data of the test set based on the selected RF models for cases 1–4, respectively.

Figure 22. Interval predictions of logarithm fatigue life for specimens A10–A12 based on the RF models for cases 1–4, respectively. The blue points are the predicted values based on the randomly generated input data.

Table 1. Mechanical properties of Q235B steel.

Yield Strength/Mpa	Ultimate Strength/Mpa	Young’s Modulus/Mpa	Poisson’s Ratio	Elongation
260	550	193900	0.277	20%

Table 2. Main chemical compositions of Q235B steel (%).

C	$Si$	$Mn$	P	S	$As$	$Cep$
0.14	0.032	0.4	0.03	0.019	0.031	0.42

Table 3. Experimental data of Q235B.

Axial Strain Amplitude
0.004		0.005		0.006		0.008
Specimen	$lg N_{f}$	Specimen	$lg N_{f}$	Specimen	$lg N_{f}$	Specimen	$lg N_{f}$
A1	3.9227	A4	3.7782	A7	3.4905	A10	3.2586
A2	3.9129	A5	3.7455	A8	3.464	A11	3.1452
A3	3.9946	A6	3.7269	A9	3.4984	A12	3.4

Table 4. Interval-valued data of Q235B smooth specimens.

Class	Specimen	Strain Amplitude Interval	Hysteresis Loop	Fatigue Life (lg)
Training set	A1, A2, A3	[0.00392, 0.00408]	${\bar{D}}_{1}$	[3.9129, 3.9946]
	A4, A5, A6	[0.00492, 0.00508]	${\bar{D}}_{2}$	[3.7269, 3.7455]
	A7, A8, A9	[0.00592, 0.00608]	${\bar{D}}_{3}$	[3.464, 3.4984]
Test set	A10, A11, A12	[0.00792, 0.00808]	${\bar{D}}_{4}$	[3.1452, 3.4]

Table 5. Optimized hyperparameters in ML models.

ML Model	Hyperparameter	Value
BP	Number of hidden layers	3
	Numbers of neurons in hidden layers	18, 13, 20
	Learning rate	0.005
	Activation function	Sigmoid
SVR	Kernel	Radial basis function
	Maximum allowable error	$ε = 0.01$
	Regularization parameter	$C = 10$
RF	Number of trees	$n_{t} = 100$
RF	Depth of trees	$n_{d} = 15$

Table 6. Four cases of input data.

Case	Strain Amplitude?	Number of Early Hysteresis Loops	Dimension of Input Data
Case 1	Yes	10	61
Case 2	Yes	5	31
Case 3	No	10	60
Case 4	No	5	30

Table 7. Mean values and variances of probability distributions in interval predictions.

Case	BP Model		SVR Model		RF Model
Case	Mean	Variance	Mean	Variance	Mean	Variance
Case 1	3.28	0.032	3.27	0.034	3.23	0.030
Case 2	3.27	0.036	3.29	0.038	3.22	0.038
Case 3	3.30	0.042	3.25	0.041	3.31	0.036
Case 4	3.23	0.064	3.22	0.068	3.30	0.062

Table 8. Performances of three models for fatigue life point predictions of Q235B.

Case	BP Model	SVR Model	RF Model
Case	MSE	MSE	MSE
Case 1	0.0011	0.0041	0.0002
Case 2	0.0058	0.0069	0.0149
Case 3	0.0108	0.0119	0.0025
Case 4	0.0260	0.0155	0.0202

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhong, X.-C.; Luo, Z.-Y.; Zhang, K.-S. Stress–Strain Hysteresis Loop-Based Machine Learning Models for Predicting Metal Fatigue Life Under Uncertainty. Materials 2025, 18, 4336. https://doi.org/10.3390/ma18184336

AMA Style

Zhong X-C, Luo Z-Y, Zhang K-S. Stress–Strain Hysteresis Loop-Based Machine Learning Models for Predicting Metal Fatigue Life Under Uncertainty. Materials. 2025; 18(18):4336. https://doi.org/10.3390/ma18184336

Chicago/Turabian Style

Zhong, Xian-Ci, Zhi-Yong Luo, and Ke-Shi Zhang. 2025. "Stress–Strain Hysteresis Loop-Based Machine Learning Models for Predicting Metal Fatigue Life Under Uncertainty" Materials 18, no. 18: 4336. https://doi.org/10.3390/ma18184336

APA Style

Zhong, X.-C., Luo, Z.-Y., & Zhang, K.-S. (2025). Stress–Strain Hysteresis Loop-Based Machine Learning Models for Predicting Metal Fatigue Life Under Uncertainty. Materials, 18(18), 4336. https://doi.org/10.3390/ma18184336

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Stress–Strain Hysteresis Loop-Based Machine Learning Models for Predicting Metal Fatigue Life Under Uncertainty

Abstract

1. Introduction

2. Dataset Establishment

2.1. Experimental Data

2.2. Compilation of the Dataset

3. Machine Learning Models

3.1. Back-Propagation Neural Network

3.2. Support Vector Regression

3.3. Random Forest

4. Results and Discussion

4.1. Point and Interval Predictions Based on BP Neural Network Model

4.2. Point and Interval Predictions Based on SVR Model

4.3. Point and Interval Predictions Based on RF Model

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI