1. Introduction
Ti
2AlNb-based alloys, commonly termed O-phase alloys, were first identified by Banerjee et al. [
1] in 1988 during their studies on heat treatments of Ti
3Al alloys. They observed a newly stabilized orthorhombic phase (O phase) emerging due to the addition of Nb. Compared with TiAl-based alloys, Ti
2AlNb alloys display enhanced room-temperature ductility, fracture toughness, and improved resistance to crack propagation. Furthermore, these alloys significantly reduce component weight compared with traditional nickel-based superalloys. Their outstanding properties at high temperatures enable prolonged service at temperatures ranging from 550 °C to 750 °C [
2,
3]. Consequently, Ti
2AlNb alloys are increasingly adopted as lightweight, high-temperature structural materials in aeroengine components like casings and combustor liners. However, due to their sensitivity in terms of chemical composition and microstructure during processing, additive manufacturing (AM) methods offer superior control over traditional casting and forming processes in fabricating these alloys [
4].
Ti
2AlNb-based alloy is a lightweight, high-temperature-resistant intermetallic compound with three distinct phases: the β/B
2 phase, α
2 phase, and O phase. The constituent phases of these alloys can exist as a single phase, or two or three phases can coexist, depending on the heat treatment temperature and alloy composition. Zhou and Gussone et al. [
5,
6] investigated the phase transformation behavior of Ti
2AlNb-based alloys under different solution heat treatment (SHT) and aging treatment (AT) temperatures. At 920 °C, the microstructure primarily consisted of lath-like O + α
2 phases and discontinuous O + α
2 phases at the grain boundaries. When the temperature increased to 1000 °C, needle-like O phases became dominant, and the tensile strength at 650 °C improved from 690 MPa to 820 MPa. However, above 1050 °C, the content of needle-like O phases decreased, and the B
2 phase became the predominant constituent. During aging, the precipitation of O phases consumed a significant amount of Nb from the B
2 matrix. With a prolonged aging time, the O phase tended toward saturation, leading to the formation of O/O phase interfaces, which act as crack initiation sites. Li and Zhang et al. [
7,
8] also reported that a substantial amount of needle-like O phases precipitate during aging in Ti
2AlNb-based alloys, which contributes to enhanced tensile strength. Comparative studies under different heat treatment conditions demonstrated that the formation of a B
2 matrix phase is beneficial for improving ductility and toughness, whereas the hexagonally close-packed α
2 phase helps to suppress B
2 grain coarsening. A higher fraction of the orthorhombic O phase was shown to enhance the alloy’s creep resistance and high-temperature strength [
9]. Moreover, Ti
2AlNb-based alloys generally exhibit high hardness (typically in the range of 400–500 HV). Zhang et al. [
10] further found that the addition of Mo to Ti
2AlNb-based alloys leads to the formation of well-aligned O + B
2 microstructures. With appropriate solution and aging treatments, the hardness of Mo-modified Ti
2AlNb-based alloys can reach up to 620 HV.
Bäumel and Seeger [
11] first emphasized the necessity of employing different fatigue parameters for accurately predicting fatigue life in various metallic materials. They developed specific expressions for fatigue parameters for steel, titanium, and aluminum alloys based on Seeger’s generalized slope method. Yang et al. [
12] further confirmed the accuracy of Seeger’s method through comparative analyses with other fatigue prediction approaches. Maliheh et al. [
13] applied continuous damage mechanics (CDM) combined with Lemaitre’s ductile damage model to accurately forecast damage evolution in metals under different loading conditions. Liu et al. [
14] integrated the Eshelby-Kroner localization principle into the Lemaitre fatigue damage model, introducing randomized parameters optimized through inverse analysis. Zhou et al. [
15,
16,
17] proposed a refined Lemaitre damage evolution model with parameters specifically fitted for precise simulation of low-cycle fatigue damage in Ti-6Al-4V alloys.
Machine learning methods have increasingly been employed for fatigue behavior analysis and prediction in metallic materials [
18]. Wang et al. [
19] proposed a high-accuracy and high-stability, low-cycle fatigue life prediction method for Ti
2AlNb-based alloy components based on continuum damage mechanics (CDM) theory and incorporating BP-ANN and GABP-ANN models. Heng et al. [
20] introduced a novel hybrid neural network model combining convolutional neural networks (CNNs) with long short-term memory (LSTM) networks to predict fatigue life under both uniaxial and proportional multiaxial loads across different metallic materials. Duan et al. [
21] compared several neural network models and identified LSTM as exhibiting superior predictive accuracy and generalization ability for high-cycle fatigue life in 304 stainless steel. The extreme learning machine (ELM), notable for its efficiency, simplicity, and robust generalization, has been widely applied across various fields [
22]. Pan et al. [
23] proposed a two-stage method based on ELMs for the fast and accurate prediction of the remaining service life of rolling bearings, demonstrating the ELM’s high accuracy and fast response capabilities even under small sample conditions. Duan et al. [
24] compared support vector machines (nu-SVRs) and ELMs in predicting low-cycle fatigue life in 316 stainless steel, emphasizing the efficiency and effectiveness of ELMs.
Overall, research on the low-cycle fatigue life of open-hole structural components fabricated from Ti2AlNb-based alloys remains limited. This paper presents experimental investigations and fatigue life predictions for Ti2AlNb-based open-hole components under a stress ratio of 0.05, thereby addressing the current research gap. Additionally, preliminary calibration of fatigue parameters for Ti2AlNb-based alloys was performed, providing essential reference data for future studies. Finally, three conventional machine learning models were applied to predict fatigue life and analyze performance, offering a rapid and accurate method for predicting the fatigue life of titanium alloy structural components.
4. Three Machine Learning Models
In this section, the high-temperature, low-cycle fatigue life of Ti2AlNb-based alloy structural components is predicted using a long short-term memory (LSTM) network, extreme learning machine (ELM), and partial least squares (PLS) algorithms. The prediction performance of the three machine models is also compared.
4.1. Long Short-Term Memory
The long short-term memory (LSTM) neural network is a refined version of the conventional recurrent neural network (RNN), developed to mitigate the limitations of standard RNNs in capturing long-term dependencies. Traditional RNNs, composed of basic recurrent units, struggle to maintain relevant information when the sequence length increases significantly. To address this challenge, Hochreiter et al. [
27] introduced the LSTM architecture, which incorporates specialized components called “gates” within each memory cell. These include input, output, and forget gates, as well as a cell state acting as the hidden layer’s memory. The LSTM model framework adopted in this study is illustrated in
Figure 6. The functionality of the three primary gates is described below.
The forget gate regulates which information from the previous cell state should be discarded. It is computed as follows:
where
is the forget gate’s output,
denotes the sigmoid activation function,
is the weight matrix associated with the forget gate,
is the hidden state from the previous time step,
is the current input, and
is the bias term. The forget gate output is then element-wise multiplied with the previous cell state
, allowing the network to retain only relevant information.
The input gate controls which new information is added to the cell state. Its computation is defined as follows:
where
represents the gate’s output, while
denotes the vector of candidate values to be added to the cell state.
The output gate controls which portion of the updated cell state will be exposed as the hidden state for the current time step and passed on to the next one. It plays a critical role in determining the network’s output at each stage in the sequence:
where
denotes the gate’s output,
is the hidden state at the current time step, and
is the cell state of the current time step.
The cell state
is updated through a combination of the outputs from the forget and input gates, which together determine what information is discarded and what is newly stored:
For training the LSTM model, the rectified linear unit (ReLU) activation function was employed in the activation layer, while the Adam optimizer was utilized to perform gradient-based optimization. Hyperparameter tuning is essential to the model’s effectiveness. The initial learning rate was set to 0.01, with a decay factor of 0.1 applied to reduce the learning rate as training progressed. To enhance data randomness and prevent overfitting, the randperm function was used to shuffle the dataset after each epoch. The number of training iterations is a critical hyperparameter. Five different maximum iteration values were tested. For each configuration, the model was trained 10 times, and the mean regression coefficient was calculated to assess performance consistency. The training outcomes are illustrated in
Figure 7.
4.2. Extreme Learning Machine
The extreme learning machine (ELM) is a rapid and efficient training method for single hidden layer feedforward neural networks (SLFNs), originally introduced by Huang et al. [
22] and extensively studied in later works. Unlike conventional learning approaches, the ELM achieves model training by simultaneously minimizing the training error and optimizing the output weights. This method significantly improves the training speed and generalization capability, making it highly suitable for applications such as fatigue life prediction and fault detection. The core steps of the ELM algorithm are summarized as follows.
The input training set is expressed by
where
is the input feature and
is the label.
The SLFNs with activation functions and the number of hidden neurons are modeled as follows:
where
represents the total number of hidden neurons and
is the activation function. The parameters
and
refer to the output weight and input weight of the i-th hidden neuron, respectively. Similarly,
is the bias term associated with the output layer, and
denotes the output of the j-th neuron. If the single hidden layer feedforward neural network (SLFN) is capable of perfectly fitting the training samples with zero error, then there exists a set of parameters
,
, and
that satisfies the following equation:
The above relationship can be expressed as .
For ELM,
and
can be arbitrary values, and
can be calculated by the least squares solution:
The above equations can be completed using the Moor–Penrose (MP) generalized inverse:
where
is the generalized inverse.
In the ELM model, the sigmoid activation function is selected to map input data into the (0, 1) range, thereby embedding nonlinearity into the output response. An essential hyperparameter in ELM is the number of neurons in the hidden layer, which has a substantial impact on the model’s predictive performance. In this study, after evaluating multiple datasets, the optimal number of hidden layer nodes was determined to be 40. A schematic illustration of the ELM network architecture is provided in
Figure 8, while the results of hyperparameter tuning and training performance are displayed in
Figure 9.
4.3. Partial Least Squares Algorithm
The core principle of the partial least squares (PLS) algorithm is to construct a predictive model by identifying new orthogonal projection directions—referred to as principal components—that maximize the covariance between the independent and dependent variables in the projected space. Unlike many other machine learning methods, PLS can deliver reliable prediction results even when working with limited sample sizes. A simplified schematic of the PLS algorithm workflow is illustrated in
Figure 10.
The first step involves normalizing both the independent and dependent variable matrices, followed by decomposition of the independent variable matrix:
Let represent the matrix of independent variables, where is the score matrix, is the load matrix, and is the residual matrix.
The matrix product
can be expressed as the sum of the products of the score vector
and the load vector
, and thus we have
The dependent variable matrix
can be decomposed to
where
is the score matrix,
is the load matrix, and
is the residual matrix, while
can be expressed as the sum of the product of the score vector
and the load vector
, as shown in the following equation:
Let , where is the regression coefficient. Then, , which ultimately yields
PLS reduces the dimensionality of the features by extracting the principal components from the input data. The number of principal components retained determines the dimensionality of the feature space after reduction. Before performing PLS regression, the data are normalized. The optimization results, along with the experimental and predicted lifetimes, are shown in
Figure 11.
4.4. Data Analysis and Discussion
In
Section 3, high-temperature, low-cycle fatigue tests and finite element simulations of Ti
2AlNb-based alloy structural components were conducted. By comparing the simulation data with experimental results, the errors between the fatigue life predictions from Seeger’s theory and the actual test data for three types of structural components (with a = 2.4, a = 3, and a = 4) were 11.7%, 6.7%, and 8.7%, respectively. The errors for predictions based on the improved Lemaitre damage evolution theory were 11%, 17.7%, and 9%, respectively. These errors fell within a reasonable range, demonstrating the accuracy and reliability of both theoretical methods. Therefore, these methods can be used to evaluate the fatigue performance of related components in future studies.
This study systematically investigated the influence of temperature, applied stress, and geometric characteristics on the high-temperature, low-cycle fatigue life of Ti
2AlNb-based alloy structural components through controlled testing and simulation. According to
Table 1 and
Table 2, the fatigue life tended to decrease with rising temperatures and stress levels. Finite element simulations further showed that fatigue resistance diminished significantly during the final stages of the loading cycle, highlighting increased vulnerability to damage under prolonged loading. The stress concentration factor (kt) was used to quantify the intensity of localized stress within a structural component. For the three types of structural members with a = 2.4, a = 3, and a = 4, the stress concentration factors were 3.52, 3.06, and 2.55, respectively, as shown in
Table 3. When the structural member had an elliptical opening, and the length of the axial opening was the same, the longer the length of the
x-axis opening, the more relieved the stress concentration of the structural member was, and the more enhanced the ability to resist fatigue damage was.
In practical engineering applications, fatigue life assessment through physical testing and numerical simulation is often time-consuming and costly. Machine learning provides a promising alternative by enabling the learning of underlying physical relationships directly from data without requiring explicit prior knowledge. This makes it a valuable approach for predicting the fatigue life of titanium alloys. In this study, three machine learning techniques—long short-term memory (LSTM) networks, the extreme learning machine (ELM), and partial least squares (PLS) regression—were utilized to forecast the high-temperature, low-cycle fatigue life of Ti
2AlNb-based alloy structural components. Each model underwent hyperparameter optimization to enhance performance, and the prediction outcomes are summarized in
Table 4, Overall, the ELM model demonstrated the best performance among the three, with notable advantages in training speed, structural simplicity, and robustness. It achieved an average R
2 of 0.892 and an RMSE of 0.177 across 10 runs, indicating high predictive accuracy. The LSTM model followed closely with an average R
2 of 0.873, also showing strong predictive capability. In contrast, the PLS model yielded an average R
2 of only 0.7357, suggesting limited effectiveness in capturing the nonlinear relationships inherent in the data.
The ELM model eliminates the need for iterative training by randomly assigning weights and biases between the input and hidden layers, distinguishing it from conventional machine learning approaches. This significantly enhances computational efficiency and the prediction speed. However, this study also found that improper selection of hyperparameters can lead to severe overfitting. The LSTM model possesses long-term memory capabilities and is well suited for handling complex data structures. In this study, a single-feature time step reconstruction approach was employed, treating each input feature as an independent time step. However, it was observed that the model required a large volume of training data to perform effectively, which may have contributed to its suboptimal predictive performance. The PLS model is appropriate for small-sample datasets, but it struggles with the highly complex and nonlinear nature of high-temperature fatigue behavior in titanium alloys. Its limited capacity to capture nonlinear relationships restricts its predictive performance in this context.
5. Conclusions
In this study, three representative structural components made of Ti2AlNb-based alloys were designed. Fatigue testing alongside numerical simulations was carried out at elevated temperatures of 550 °C and 650 °C, yielding satisfactory predictive accuracy. Based on the experimental and finite element method (FEM) results, three machine learning models—long short-term memory (LSTM) networks, the extreme learning machine (ELM), and partial least squares (PLS) regression—were applied to predict the fatigue life of the alloy components. These models demonstrated reliable and stable prediction performance while significantly reducing computation times. The findings offer valuable insights for fatigue life estimation of other alloys such as Ti-6Al-4V and TC4, especially for components with complex geometries. The primary conclusions of this work are as follows:
(1) In the finite element simulations of the three Ti2AlNb-based alloy structural components, Seeger’s fatigue life model produced prediction errors of 11.7%, 6.7%, and 8.7% compared with the experimental values, while the improved Lemaitre damage evolution model yielded errors of 11%, 17.7%, and 9%, respectively. These deviations are within an acceptable error margin, confirming the applicability of both models.
(2) The errors for the two fatigue life theories were 16.1%, 14.2%, and 17.4%, respectively. The reasonableness of both methods was validated by comparing the results with the experimental values.
(3) The impact of the hole geometry on fatigue life was clearly demonstrated. For the three elliptical hole structures investigated in this study, it was found that appropriately increasing the hole length in the x direction can effectively alleviate local stress concentrations, thereby enhancing fatigue life. Compared with conventional flat plates, incorporating surface structural reinforcement significantly improves the component’s resistance to fatigue failure.
(4) After hyperparameter optimization, the ELM model achieved the best prediction performance (R2 = 0.8919, RMSE = 0.17736), outperforming the other two models. The LSTM model ranked second, demonstrating strong scalability and the potential for improved performance with larger datasets. In contrast, the PLS model showed relatively weaker predictive capability and limited applicability. Overall, the ELM is more suitable for medium- and small-scale modeling tasks in the current study, while LSTM is better suited for complex scenarios involving strong temporal dependencies.