A Physics-Guided Two-Stage Learning Framework for Constitutive Modeling of TC4 Titanium Alloy: Validation Through Temperature and Strain-Rate Extrapolation

Cheng, Lu; Shao, Chenxi; Cheng, Peng

doi:10.3390/met16050510

Open AccessArticle

A Physics-Guided Two-Stage Learning Framework for Constitutive Modeling of TC4 Titanium Alloy: Validation Through Temperature and Strain-Rate Extrapolation

by

Lu Cheng

^1,2

,

Chenxi Shao

^1,2,* and

Peng Cheng

¹

Yanqi Lake (Beijing) Institute of Basic Manufacturing Technology Research, China Academy of Machinery Science and Technology, Beijing 101400, China

²

China Productivity Center for Machinery, China Academy of Machinery Science and Technology, Beijing 100044, China

^*

Author to whom correspondence should be addressed.

Metals 2026, 16(5), 510; https://doi.org/10.3390/met16050510

Submission received: 14 March 2026 / Revised: 28 April 2026 / Accepted: 4 May 2026 / Published: 9 May 2026

(This article belongs to the Section Computation and Simulation on Metals)

Download

Browse Figures

Versions Notes

Abstract

Accurate constitutive modeling of TC4 titanium alloy at elevated temperatures is critical for process design and numerical simulation in aerospace manufacturing. However, purely data-driven deep neural networks (DNNs) often suffer from severe overfitting and may yield physically unreasonable predictions in data-sparse or strictly out-of-distribution (OOD) regions. To address this issue, this study proposes a physics-guided two-stage neural network framework, termed NN-PhysicsInit, for the constitutive modeling of TC4 alloy. In Stage I, a large synthetic dataset generated from a strain-compensated Arrhenius-type constitutive equation is used to pre-train the network, thereby introducing analytical prior knowledge into the initial topological space. In Stage II, the pre-trained model is fine-tuned using rigorously corrected experimental data obtained from isothermal compression tests conducted over 800–980 °C and 0.001–1 s⁻¹ to improve material-specific predictive accuracy. To evaluate generalization capability, a rigorous dual-perspective extrapolation validation scheme is designed separately in the temperature (1010 °C) and strain-rate (10 s⁻¹) dimensions. The results demonstrate that, compared with direct black-box training, the proposed framework successfully prevents non-physical divergence and better preserves macroscopic thermodynamic smoothness in unseen domains. Specifically, the extrapolation average absolute relative error (AARE) is significantly reduced from 34.21% to 14.34% in the temperature extrapolation task, and from 27.91% to 8.92% in the strain-rate extrapolation task. These findings confirm that physics-based initialization acts as a powerful implicit regularizer, effectively mitigating the extrapolation catastrophe while maintaining high fitting accuracy. The proposed framework provides a robust and practical strategy for the constitutive modeling of complex alloys under limited-data conditions.

Keywords:

TC4 titanium alloy; constitutive modeling; physics-guided learning; transfer learning; extrapolation; neural network

1. Introduction

TC4 (Ti-6Al-4V) titanium alloy is widely utilized in aerospace, marine engineering, and other high-end manufacturing fields due to its high specific strength, excellent corrosion resistance, and favorable properties at elevated temperatures [1]. During hot working processes such as forging and superplastic forming, its flow behavior is strongly dependent on temperature, strain rate, and strain, governed by the synergistic effects of work hardening, dynamic recovery (DRV), and dynamic recrystallization (DRX) [2,3]. Consequently, accurate constitutive modeling of TC4 alloy is of paramount importance for process optimization, finite element method (FEM) simulation, and performance control of formed components [4].

Among conventional constitutive approaches, the strain-compensated Arrhenius-type equation remains a benchmark for describing the hot deformation of titanium alloys [5,6]. These analytical models are compact and physically interpretable, effectively capturing the coupled effects of thermomechanical parameters. However, because the functional form is predefined, their predictive accuracy is often limited when the material response becomes highly nonlinear, particularly in regimes involving pronounced flow softening or phase-related complexities [7,8]. As noted by Buchmayr [9], while traditional models provide a reliable baseline, they struggle to meet the precision requirements of modern digital twins in complex manufacturing environments.

With the development of machine learning, deep neural networks (DNNs) have attracted growing attention in constitutive modeling due to their strong nonlinear mapping capability [10,11]. Recently, comprehensive reviews, such as the latest exhaustive work published in Progress in Materials Science [12], have systematically highlighted the transformative impact of data-driven approaches in predicting the complex thermomechanical behaviors of metallic materials. Furthermore, advanced applications of artificial neural networks have successfully demonstrated superior predictive capabilities in capturing the highly non-linear flow characteristics of various alloys under extreme processing conditions [13]. In many cases, DNN-based models achieve high fitting accuracy within the training domain. Nevertheless, under limited and imbalanced datasets, purely data-driven models are prone to overfitting and often exhibit poor extrapolation behavior [14]. This issue is particularly critical in hot compression experiments, where high-strain-rate conditions typically yield much fewer data points because the deformation is completed rapidly [15]. Furthermore, high-strain-rate experimental data inherently contain localized noise and measurement artifacts (e.g., interfacial friction and adiabatic heating). Purely data-driven models lack explicit guidance from underlying deformation physics [16,17] and inevitably overfit these imperfections. As emphasized by Bock et al. [14], balancing model flexibility with physical robustness remains a core obstacle. Consequently, when applied to sparsely sampled or unseen regions, these “black-box” models produce unstable or trend-inconsistent predictions, which significantly reduces model reliability for digital manufacturing [18].

Therefore, it is highly desirable to develop a constitutive modeling strategy that combines the flexibility of neural networks with the robustness of physically motivated analytical models [19,20]. Recent advances in physics-guided and transfer-learning-based modeling have shown that prior physical knowledge can improve data efficiency and generalization performance [21,22,23,24,25,26]. Advanced frameworks have recently moved toward integrating physical metallurgy laws, such as the work by Li et al. [25] utilizing metallurgy-guided learning to predict microscopic deformation mechanisms. While such approaches are immensely valuable, providing a highly reliable, non-divergent macroscopic constitutive equation for complex aerospace FEM simulations requires a different perspective—one strictly governed by continuum mechanics and thermodynamic consistency.

While the phenomenological rheological behaviors of the TC4 titanium alloy have been extensively documented in previous decades, providing a rich foundation of experimental data, the direct application of pure data-driven deep learning models for FEM digital twins remains highly perilous due to the aforementioned extrapolation catastrophe involving extreme local gradients. Therefore, in this study, the well-documented TC4 alloy is strategically utilized as a “benchmark material” to propose and validate a novel methodological framework: a macroscopic physics-guided two-stage learning framework termed NN-PhysicsInit. Instead of embedding explicit physical residuals into the loss function, the framework introduces analytical prior knowledge through pre-training on a smooth synthetic dataset generated by a calibrated Arrhenius model, followed by fine-tuning with actual experimental data. This strategy acts as an implicit physical regularizer, filtering out experimental noise and constraining the initial parameter hypothesis space to align with thermodynamic boundaries.

In addition to the model design, this work emphasizes a highly stringent out-of-distribution (OOD) validation protocol. Unlike conventional data-driven studies that predominantly rely on random interpolation testing, a rigorous dual-perspective extrapolation strategy is designed to challenge the physical boundaries of the model. Specifically, the framework is blindly extrapolated across two critical dimensions: the thermodynamic boundary (temperature extrapolation to 1010 °C, crossing the macroscopic β-transus) and the kinetic boundary (strain-rate extrapolation to 10 s⁻¹, featuring severe data imbalance and dynamic softening). The scientific novelty of this work does not lie in discovering new deformation mechanisms of TC4, but in establishing a robust algorithmic bridge between classical thermodynamic bounds and high-capacity non-linear machine learning. This dual-perspective validation strictly proves that the embedded physical priors can effectively prevent the catastrophic extrapolation failure typical of pure black-box models, guaranteeing physically bounded predictions under unseen extreme conditions and providing a trustworthy digital foundation for aerospace component manufacturing.

The main contributions of this work are as follows:

A novel macroscopic physics-guided learning framework (NN-PhysicsInit) is proposed. Unlike conventional mixed-data models that suffer from gradient clash, this decoupled two-stage strategy utilizes an analytical Arrhenius prior for topological pre-training (acting as an implicit regularizer), followed by empirical fine-tuning to accurately capture material-specific nonlinearities.
A rigorous dual-perspective out-of-distribution (OOD) extrapolation scheme is designed. The generalization capability of the framework is systematically challenged across both thermodynamic (extrapolating across the macroscopic β-transus boundary at 1010 °C) and kinetic dimensions (extrapolating to the highly imbalanced 10 s⁻¹ domain).
The “extrapolation catastrophe” inherent in purely data-driven models is successfully mitigated. The proposed framework effectively filters out inherent high-frequency experimental noise and measurement artifacts. It guarantees physically bounded and trend-consistent predictions under unseen extreme conditions, providing a trustworthy constitutive engine for advanced aerospace digital twins.

2. Materials and Methods

2.1. Experimental Materials and Procedures

The investigated material in this study was a commercial TC4 (Ti-6Al-4V) titanium alloy (provided by Baoti Group Ltd., Baoji, China), and its detailed chemical composition is listed in Table 1.

To obtain the fundamental flow stress data under various deformation conditions, isothermal hot compression tests were conducted on a Gleeble-3800 thermal simulator (Dynamic Systems Inc., Poestenkill, NY, USA). The complete experimental procedure and the designed thermomechanical pathways are schematically illustrated in Figure 1. Cylindrical specimens with a diameter of 8 mm and a height of 12 mm were machined along the axial direction of the as-received ingot. To minimize interfacial friction during compression and prevent heterogeneous deformation (barreling), both ends of the specimens were carefully polished, and tantalum foils combined with graphite lubricant were applied. All tests were performed in a vacuum environment to prevent oxidation at elevated temperatures.

Based on the typical hot working windows for the TC4 alloy, the deformation temperatures were set at 800, 850, 900, 950, 980, and 1010 °C. Concurrently, the strain rates were selected as 0.001, 0.01, 0.1, 1, and 10 s⁻¹, resulting in a comprehensive experimental matrix of 30 distinct deformation conditions.

The comprehensive rheological behaviors of the TC4 alloy under various temperatures and strain rates are illustrated in Figure 2. To rigorously eliminate measurement artifacts, the raw experimental data (faint lines) were recalculated using standard friction and adiabatic heating correction models, yielding the genuine isothermal flow stress curves (solid lines) [27]. Furthermore, the figure superimposes the analytical baseline calculated by the strain-compensated Arrhenius model. To explicitly demonstrate our data isolation strategy, the plot distinguishes between the interpolation points used for model calibration (marked with solid dots) and the out-of-distribution (OOD) extrapolation predictions (marked with crosses) for the unseen extreme conditions (i.e., at 10 s⁻¹ and 1010 °C).

2.2. Establishment of the Analytical Prior Model

To provide a physically motivated prior for network pre-training, a strain-compensated Arrhenius-type constitutive model was first established. The relationship among strain rate (

\dot{ε}

), flow stress (σ), and deformation temperature (T) can be expressed as:

\dot{ε} = A {[\sinh (α σ)]}^{n} \exp (- Q / R T)

(1)

where A, n, and α are material constants, Q is the activation energy for hot deformation, and R is the universal gas constant.

Under low-stress and high-stress conditions, Equation (1) can be expressed as follows:

\dot{ε} = A_{1} σ^{n_{1}} \exp (- Q / R T)

(2)

\dot{ε} = A_{2} \exp (β σ) \exp (- Q / R T)

(3)

where A₁, A₂, n₁, and β are material constants, and α = β/n₁. The Zener–Hollomon parameter (Z) is defined as:

Z = \dot{ε} \exp (Q / R T) = A {[\sinh (α σ)]}^{n}

(4)

Taking logarithms of Equations (2) and (3) yields:

\ln \dot{ε} = \ln A_{1} - Q / R T + n_{1} \ln σ

(5)

\ln \dot{ε} = \ln A_{2} - Q / R T + β σ

(6)

From the fitted linear relationships of

\ln \dot{ε}

versus ln σ and

\ln \dot{ε}

versus σ, the parameters n₁ and β can be obtained.

Taking the logarithm of Equation (1) gives:

\ln \dot{ε} = \ln (A) + n \ln [\sinh (α σ)] - (Q / R T)

(7)

For a given temperature, the material constant n can be expressed as:

n = {[\frac{\partial \ln \dot{ε}}{\partial \ln [\sinh (α σ)]}]}_{T}

(8)

For a given strain rate, the activation energy Q can be obtained from:

Q = R n {[\frac{\partial \ln [\sinh (α σ)]}{\partial (1 / T)}]}_{\dot{ε}}

(9)

Substituting Equation (8) into Equation (9) yields:

Q = R {[\frac{\partial \ln \dot{ε}}{\partial \ln [\sinh (α σ)]}]}_{T} {[\frac{\partial \ln [\sinh (α σ)]}{\partial (1 / T)}]}_{\dot{ε}}

(10)

Taking the logarithm of Equation (4) yields:

\ln Z = \ln A + n \ln [\sinh (α σ)]

(11)

Thus, the values of n and ln A can be obtained from the linear relation between ln [sinh (ασ)] and ln Z.

To enhance the predictive accuracy of the constitutive model, the influence of true strain on the flow stress was incorporated into the analysis. Derived from Equation (4), the constitutive equation correlating the flow stress with the Zener–Hollomon (Z) parameter can be expressed as follows:

σ = \frac{1}{α} In {(\frac{Z}{A})^{\frac{1}{n}} + {[{(\frac{Z}{A})}^{\frac{2}{n}} + 1]}^{\frac{1}{2}}}

(12)

To account for the strain dependence of the material behavior, the fundamental material constants (α, n, Q, and ln A) were treated as functions of the true strain. These parameters were sequentially calculated at strain intervals from 0.05 to 0.60 with a step of 0.05, and then smoothly fitted using sixth-order polynomial functions.

\begin{array}{l} \ln A = {57.39 + 37.79 ε - 674.52 ε}^{2} {+ 3081.96 ε}^{3} {- 7275.80 ε}^{4} {+ 8687.73 ε}^{5} {- 4105.77 ε}^{6} \\ {α = 0.02 - 0.02 ε + 0.12 ε}^{2} {- 0.23 ε}^{3} {- 2.32 ε}^{4} {- 0.01 ε}^{5} {- 0.50 ε}^{6} \\ {n = 2.76 + 0.64 ε - 17.50 ε}^{2} {+ 92.15 ε}^{3} {- 207.04 ε}^{4} {+ 211.07 ε}^{5} {- 77.22 ε}^{6} \\ {Q = 611.82 + 281.52 ε - 5370.73 ε}^{2} {+ 25909.79 ε}^{3} {- 60748.35 ε}^{4} {+ 72391.56 ε}^{5} {- 34214.52 ε}^{6} \end{array}

(13)

To further validate the reliability of the constructed strain-compensated Arrhenius model, the calculated material constants were compared with similar models reported in published literature for TC4 (Ti-6Al-4V) titanium alloys. As shown in Figure 3c, the calculated deformation activation energy (Q) in this study varies from approximately 480 to 620 kJ/mol depending on the true strain. This range is highly consistent with the typical activation energies required for the hot deformation of Ti-6Al-4V alloys, which are generally widely reported to be in a similar magnitude by recent comprehensive constitutive studies (e.g., Hu et al. [4] and Cai et al. [5]). Furthermore, while many conventional studies predominantly employ 4th- or 5th-order polynomials to describe the strain dependence of these parameters, this study utilizes a higher-order (6th-order) polynomial fitting, as detailed in Equation (13). Utilizing a 6th-order polynomial effectively minimizes the inherent regression errors of the analytical model, thereby providing a more precise and thermodynamically rigorous mathematical baseline for the subsequent Stage I pre-training of the neural network.

The resulting strain-compensated Arrhenius model was subsequently utilized to generate the large-scale synthetic dataset for the Stage I pre-training. It should be particularly emphasized that, to prevent data leakage during subsequent extrapolation validation, the calibration of this baseline Arrhenius model strictly utilized only the data designated for the training set (e.g., ≤980 °C or ≤1.0 s⁻¹), keeping the extrapolation targets entirely unseen. Following this rigorous data splitting, it should be noted that the polynomial coefficients listed in Equation (13) specifically correspond to the analytical baseline calibrated for the temperature extrapolation task. For the strain-rate extrapolation ablation study, the Arrhenius parameters (α, n, Q, and ln A) were independently re-calibrated using their respective training subsets. To avoid redundancy, those specific alternative coefficients are omitted here, as the mathematical structure and derivation procedure remain perfectly identical.

2.3. Physics-Guided Two-Stage Neural Network Framework (NN-PhysicsInit)

To improve constitutive modeling under limited and imbalanced data conditions, a two-stage neural network framework, termed NN-PhysicsInit, was developed. The key idea is to introduce analytical prior knowledge during initialization and then refine the model using experimental data.

Stage I: Pre-training with synthetic data: In the first stage, a large-scale synthetic dataset was generated using the calibrated strain-compensated Arrhenius model. Compared with the actual experimental dataset, which is sparse and unevenly distributed in some extreme deformation regions, the synthetic dataset was meticulously designed to cover the thermomechanical parameter space more uniformly. The input variables included deformation temperature (T), strain rate (

\dot{ε}

), and true strain (ε), and the corresponding true stress (σ) values were calculated using the analytical model. This stage provides the neural network with a physically guided initialization. From a machine learning perspective, pre-training on smooth synthetic data constrains the initial parameter space and reduces the tendency of the model to overfit local fluctuations or noise in the experimental data. As illustrated in Figure 4, the synthetic dataset uniformly samples the temperature and logarithmic strain rate, projecting them into a smooth baseline stress mapping.

Stage II: Fine-tuning with experimental data. While the analytical Arrhenius prior provides a robust macroscopic boundary, it inherently lacks the non-linear capacity to capture the specific, localized dynamic softening features of the TC4 alloy at high strain rates. Therefore, the pre-trained network was further fine-tuned using the actual experimental stress–strain dataset. However, the critical challenge during this phase is to prevent the risk of prior degradation—a phenomenon where extensive unconstrained fine-tuning completely overwrites the physical priors, degrading the network back into a purely data-driven black box. To resolve this, the Stage II optimization was strictly regulated as a local, rather than an unconstrained global, search. By employing a significantly reduced learning rate (e.g., 1.5 × 10⁻⁴) combined with a strict early-stopping criterion monitored via validation loss, the network is restricted to moderate parameter adjustments. This optimized strategy ensures that the model successfully compensates for TC4-specific empirical non-linearities without compromising the global thermodynamic topology established in Stage I. The overall workflow of this proposed NN-PhysicsInit approach, integrating the physically guided topological initialization (Stage I) and the subsequent material-specific accuracy compensation (Stage II), is comprehensively illustrated in Figure 5.

It is imperative to clearly distinguish the proposed NN-PhysicsInit framework from existing hybrid constitutive models reported in the literature. Firstly, unlike standard Physics-Informed Neural Networks (PINNs) or conventional mixed-data training architectures that typically embed physical constraints as soft penalty residuals within the loss function—which often suffer from gradient clash as the network struggles to simultaneously minimize the loss for smooth physical priors and noisy local experimental variations—the proposed decoupled method injects physical priors through initial weight space topological mapping. This serves as an implicit regularizer rather than a loss penalty, allowing the network to first establish a globally stable thermodynamic manifold before performing localized optimization. Secondly, while conventional transfer learning transfers knowledge between different empirical datasets to accelerate convergence, the Stage I pre-training transfers knowledge from a deterministic analytical manifold to the network, deliberately restricting the hypothesis space to prevent catastrophic extrapolation. Finally, compared to traditional Arrhenius-ANN coupled frameworks that calculate analytical baselines and ANN residuals in parallel, NN-PhysicsInit absorbs the thermodynamic laws purely during the training phase. Consequently, it maintains the computational efficiency of a singular, end-to-end deep neural network during inference, making it highly robust for extreme out-of-distribution (OOD) scenarios in digital manufacturing.

Furthermore, it is critical to discuss the phenomenological consequences of excessive fine-tuning in Stage II. If the fine-tuning duration is extended indefinitely without strict early-stopping regularization, the neural network becomes overly biased towards minimizing the empirical loss on the limited experimental dataset. Consequently, the model begins to overfit the inherent high-frequency noise and local measurement artifacts. From a topological perspective, this excessive optimization leads to the severe degradation of the physical prior, where the globally smooth physical manifold constructed during Stage I is gradually overridden and distorted. As a result, the implicit thermodynamic bounds are lost, and the out-of-distribution (OOD) extrapolation performance degrades sharply, eventually regressing to the unstable behavior of a purely data-driven model. Therefore, determining the optimal fine-tuning stopping point—where material-specific non-linearities are sufficiently captured while the Stage I physical topology remains intact—is fundamentally essential for the success of this hybrid framework.

2.4. Extrapolation Validation Design and Implementation Details

2.4.1. Dual-Perspective Extrapolation Ablation Design

Before executing the dataset splitting, it is necessary to analyze the distribution of the collected experimental data. As illustrated in the coverage count heatmap (Figure 6), the rigorously corrected isothermal hot compression dataset exhibits an unbalanced distribution. For instance, the low strain rate domains (e.g., 0.001 s⁻¹) contain abundant sample points, whereas the high strain rate conditions (e.g., 10 s⁻¹) are relatively sparse due to the rapid completion of the dynamic deformation process. This inherent data imbalance is primarily caused by the intrinsic variations in test durations and the hardware limits of dynamic sampling rates during Gleeble high-speed compression. Such disparity inevitably leads to severe bias in standard data-driven models, which further highlights the necessity of injecting uniformly sampled physical priors (Stage I) to regularize the network.

This data sparsity poses a significant challenge for purely data-driven neural networks, increasing the risk of overfitting and distribution shift when transitioning to unseen conditions [27,28]. Therefore, to evaluate the out-of-distribution (OOD) generalization of the proposed NN-PhysicsInit architecture trained on this corrected thermodynamic baseline, this study adopted a dual-perspective extrapolation validation scheme instead of traditional random data division. The corrected isothermal experimental data corresponding to the highest temperature (1010 °C) and the highest strain rate (10 s⁻¹) were deliberately excluded from the training set to serve as testing domains:

Temperature extrapolation ablation: Corrected experimental data obtained at 800, 850, 900, 950, and 980 °C were utilized for the training set, while the data at 1010 °C were reserved as a strictly unseen testing set. This strategy is designed to evaluate the model’s thermodynamic stability and extrapolation capability across the critical β-transus phase boundary.
Strain rate extrapolation ablation: Corrected experimental data at strain rates of 0.001, 0.01, 0.1, and 1.0 s⁻¹ were allocated to the training set, while the data at the extreme strain rate of 10 s⁻¹ were reserved as an unseen test set. This approach evaluates the model’s kinetic robustness in the presence of inherent data imbalance and severe non-linear dynamic softening behaviors.

To further verify this OOD design, the statistical distribution of the true stress labels was analyzed. As depicted in the distribution comparison histogram (Figure 7), there is a noticeable data shift between the training set and the extrapolation test set. The high-stress regions (particularly exceeding 350 MPa) are mostly located within the extrapolation set. This distribution discrepancy indicates that the validation scheme functions as an extrapolation task rather than a standard interpolation, requiring appropriate physical constraints to stabilize predictions.

To systematically evaluate the effectiveness of the physics-guided training strategy under these conditions, an ablation framework was established. As summarized in Table 2, four distinct modeling approaches with different training strategies were deployed for comparison. This matrix aims to decouple and verify the contributions of the analytical physical prior (Stage I) and the experimental data fine-tuning (Stage II) to the final generalization capability.

2.4.2. Neural Network Architecture and Hyperparameters

The neural network had a topology of 3→128→128→64→1, consisting of one input layer, three hidden layers, and one output layer. The input variables were temperature, logarithmic strain rate, and true strain. The Sigmoid Linear Unit (SiLU) was used as the activation function, and the mean squared error (MSE) was used as the loss function. Although this architecture contains approximately 25,000 trainable parameters—which typically introduces a high risk of overfitting in standard data-driven models—this sufficient network capacity is intentionally retained to capture the severe non-linear dynamic softening at extreme strain rates. The inherent overfitting risk is fundamentally suppressed by the implicit regularization provided by the Stage I physics-guided pre-training, as detailed in Section 3.4.

During data preprocessing, the input variables were normalized using the Z-score method, and the flow stress target was transformed logarithmically. Model optimization was performed using the Adam optimizer together with a cosine annealing learning rate scheduler. Because the two extrapolation tasks differ in data distribution and difficulty, task-specific training hyperparameters were used. The detailed settings are summarized in Table 3. The neural network architecture and data processing were implemented using the Python programming language (version 3.9.12) and the PyTorch framework (version 1.12.1, Meta Platforms, Inc., Menlo Park, CA, USA).

2.4.3. Statistical Evaluation Metrics

To rigorously evaluate the model performance, the correlation coefficient (R), the coefficient of determination (R²), the average absolute relative error (AARE), and the root mean square error (RMSE) were introduced. These metrics are mathematically defined as follows:

R = \frac{\sum_{i = 1}^{n} (y_{i} - \bar{y}) ({\hat{y}}_{i} - \bar{\hat{y}})}{\sqrt{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2} \cdot \sum_{i = 1}^{n} {({\hat{y}}_{i} - \bar{\hat{y}})}^{2}}}

(14)

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}

(15)

A A R E = \frac{1}{n} \sum_{i = 1}^{n} |\frac{y_{i} - {\hat{y}}_{i}}{y_{i}}| \times 100 %

(16)

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}

(17)

3. Results

3.1. Comparison of Predictive Performance in Training and Extrapolation Domains

The predictive performance of NN-PhysicsInit was compared with that of the Arrhenius model, NN-Direct, and NN-Arrhenius-Only.

To ensure statistical robustness, all reported neural network metrics represent the average results from 5 random restarts of the training process. As shown in Table 4, under this rigorous evaluation, the NN-Direct model achieves the lowest training error, with an average AARE of 3.51% and an RMSE of 6.34 MPa, indicating a strong fitting ability within the training domain. However, its performance deteriorates substantially in the unseen 1010 °C domain, where the extrapolation AARE surges to 34.21%. It should be noted that the determination coefficient (R²) of NN-Direct remains deceptively high in this testing set despite its poor AARE. This is mainly because the overall stress magnitude in the high-temperature domain is inherently low; thus, moderate absolute deviations can produce massive relative errors. In this extreme scenario, AARE serves as a much more sensitive and rigorous metric for evaluating extrapolation inaccuracies. Meanwhile, the Arrhenius model and NN-Arrhenius-Only exhibit nearly identical performance, indicating that the pre-training stage perfectly enables the network to reproduce the analytical baseline. However, both inherit the intrinsic approximation errors of the empirical Arrhenius formulation, thereby demonstrating limited predictive accuracy. The detailed statistical results of independent repeated runs are summarized in Tables S1–S4 of the Supplementary Materials.

By comparison, the proposed NN-PhysicsInit framework provides an exceptional balance between fitting performance and extrapolation stability. Its AARE in the 1010 °C testing set is significantly reduced to 14.34%, and its RMSE is reduced to 11.12 MPa. To understand the significance of this improvement, it is crucial to consider the underlying thermodynamic phase transition involved in this specific extrapolation task. At 1010 °C, the TC4 alloy crosses its macroscopic β-transus boundary. Although the single Arrhenius prior (calibrated primarily below this temperature) inherently lacks the domain-specific parameters for the single β-phase region, the NN-PhysicsInit framework seamlessly overcomes this physical gap. It effectively utilizes its Stage II fine-tuning to adaptively compensate for the macroscopic phase-shift and the associated stress reduction. Ultimately, this robust performance at 1010 °C, coupled with the high-strain-rate extrapolation at 10 s⁻¹, constitutes a rigorous dual-perspective validation. The consistency observed across these two orthogonal extrapolation perspectives (thermodynamic and kinetic) confirms that the proposed framework reliably preserves macroscopic physical trends and accurately self-corrects, even when the baseline analytical model reaches its theoretical limit.

A similar trend is observed in the kinetic strain-rate extrapolation task (Table 5). The purely data-driven NN-Direct model exhibits excellent fitting performance within the training domain (with an AARE of 2.69% and an RMSE of 5.00 MPa); however, it suffers a severe loss of accuracy in the strictly unseen 10 S⁻¹ testing domain, where the extrapolation RMSE predictably surges to 59.85 MPa and the AARE reaches 27.91%. In striking contrast, the proposed NN-PhysicsInit framework significantly reduces the extrapolation AARE to 8.92% and the RMSE to 25.34 MPa, demonstrating superior predictive robustness against the highly non-linear dynamic softening behaviors characteristic of extreme strain rates.

Taken together, the quantitative evaluations across Table 4 and Table 5 confirm that the direct application of unconstrained black-box models is highly perilous for reliable constitutive extrapolation. The proposed dual-stage framework successfully mitigates this “extrapolation catastrophe” by seamlessly combining the thermodynamic reliability of analytical priors with the flexibility of data-driven empirical corrections.

To rigorously justify the necessity of the proposed framework, five conventional machine-learning models, including Random Forest, ExtraTrees, XGBoost, SVR-RBF and KNN, were evaluated as benchmarks. As detailed in Supplementary Tables S5 and S6, while these models show acceptable interpolation accuracy, they suffer from catastrophic divergence in out-of-distribution (OOD) tasks. For instance, in the 1010 °C temperature extrapolation task, the AARE of the SVR-RBF model surges to 80.20%, whereas NN-PhysicsInit maintains a robust 14.34%. Similarly, for the 10 s⁻¹ strain-rate extrapolation, the AARE of SVR-RBF reaches 39.93%, while our framework achieves a high-precision prediction of 8.92%. This stark contrast compellingly proves that without inherent thermodynamic constraints, unconstrained data-driven algorithms fail to capture the physical boundaries of metal deformation.

The correlation scatter plots in Figure 8 and Figure 9 are consistent with the quantitative statistical results presented above. The purely data-driven NN-Direct model exhibits high accuracy within the interpolation training domain; however, it shows significant scattering and deviation from the ideal diagonal line when evaluated in the out-of-distribution (OOD) extrapolation regions. By comparison, the proposed NN-PhysicsInit framework demonstrates a much more stable and tightly bounded error distribution under these unseen extreme conditions. Benefiting from the implicit regularization of the pre-trained physical baseline, the dual-stage framework effectively mitigates the overfitting variance associated with unconstrained black-box models while reducing the systematic deviation inherent to the empirical Arrhenius equation.

3.2. Comparison of Stress–Strain Curve Fitting in the Training Domain

To intuitively evaluate the constitutive responses predicted by different models, four highly indicative cases were extracted from the full dataset to demonstrate the fitting performance under various thermomechanical conditions within the training domain. These representative curve comparisons are shown in Figure 10.

As clearly observed in the selected representative cases (Figure 10), the classical analytical Arrhenius model (blue dashed line) outlines the macroscopic thermodynamic contour but suffers from significant systematic deviations—such as massive overestimation at low strain rates (e.g., Figure 10a) and noticeable underestimation at high strain rates (e.g., Figure 10b)—due to the rigid constraints of its predefined mathematical structure. The proposed NN-PhysicsInit framework successfully utilizes this analytical prior as a topological baseline and precisely compensates for these empirical errors through the second-stage data fine-tuning, pulling the predictions effectively back into the accurate experimental domain.

More importantly, Figure 10 explicitly reveals the fundamental behavioral differences between the models when encountering experimental artifacts. As prominently exhibited in Figure 10c (900 °C, 0.001 s⁻¹), the raw experimental data (grey solid line) inevitably contains an anomalous, upward-drifting tail at large strains. This is likely a measurement artifact caused by dynamic instrumental interferences (such as excessive interfacial friction or high-temperature oxidation) rather than true material strain-hardening. The purely data-driven NN-Direct model aggressively minimizes the empirical training loss, essentially “memorizing” this non-physical localized trend. Although this phenomenon explains its deceptively low training error, it exposes the catastrophic sensitivity of unconstrained networks to corrupted local data.

In striking contrast, the proposed NN-PhysicsInit framework acts as a regularized continuum, exhibiting exceptional robustness against such artifacts. It deliberately resists overfitting this localized pseudo-hardening, strictly maintaining the physically correct downward trajectory (dynamic softening) dictated by the Arrhenius prior. This compellingly proves that, even within the interpolation training domain, the proposed dual-stage framework willingly sacrifices marginal local fitting accuracy on corrupted points to achieve a perfect equilibrium between macroscopic physical reliability and data-driven nonlinear capacity.

To comprehensively illustrate the fitting accuracy within the training domain, a detailed visual comparison of the flow behavior across all training temperatures (800–980 °C) and strain rates (0.001–1.0 s⁻¹) among the four modeling approaches is provided in Figure S1 of the Supplementary Materials.

3.3. Extrapolation Behavior in Unseen Deformation Domains

The differences among the four models become more apparent in the extrapolation tasks. Figure 11 and Figure 12 compare the predicted stress–strain curves in the unseen temperature domain (1010 °C) and the unseen strain-rate domain (10 s⁻¹), respectively.

The limitations of the purely data-driven approach (NN-Direct) are strikingly apparent in these out-of-distribution (OOD) regions. As highlighted by the red cross marks, these regions indicate significant non-physical oscillations and inaccurate trend predictions. For instance, at 1010 °C and 1.0 s⁻¹, while the actual material exhibits continuous work hardening, the NN-Direct model erroneously predicts a dynamic softening trend. Similarly, under the 10 s⁻¹ extrapolation condition, it yields concave shapes and substantial stress drops that severely deviate from the typical thermodynamic principles of metal deformation [29].

The fundamental reason why the purely data-driven model and the proposed NN-PhysicsInit model exhibit such divergent extrapolation behaviors lies in their inherent topological constraints. Devoid of any prior physical knowledge, the NN-Direct model merely performs unconstrained mathematical curve fitting, aggressively overfitting the localized high-frequency noise or data imbalance present in the training set. When forced into OOD domains, these overfitted mathematical artifacts are non-physically amplified, resulting in the aforementioned catastrophic stress oscillations and unnatural softening phenomena.

In striking contrast, the NN-PhysicsInit behaves robustly because macroscopic physical laws—specifically, the monotonic influence of thermal activation energy and the Zener–Hollomon parameter—are implicitly encoded into its neural weights during the Stage I pre-training [30,31]. This physical skeleton strictly governs the boundaries of the neural network’s hypothesis space. Therefore, even when navigating blindly through unseen extreme conditions, the model resists chaotic local fluctuations and naturally preserves the thermodynamic smoothness inherited from Arrhenius continuum mechanics. Consequently, at 10 s⁻¹, NN-PhysicsInit effectively mitigates the non-physical fluctuations of the data-driven model and partially corrects the systematic underestimation of the pure Arrhenius model (which struggles to capture intense dislocation proliferation at extreme speeds). Furthermore, at 1010 °C, this implicit regularization restricts the prediction boundaries, capturing thermodynamically reasonable evolutionary trends. Ultimately, this dual-stage physical injection strategy transforms the neural network into a highly reliable extrapolator, guaranteeing physically bounded predictions under unprecedented processing conditions [32].

3.4. Analysis of Parameter Distributions and Their Relation to Model Behavior

To further examine the effect of the two-stage training strategy, the layer-wise weight distributions of the neural network models were compared in both extrapolation tasks, as shown in Figure 13 and Figure 14.

Analysis of the layer-wise probability density distributions of the network weights provides insights into the structural robustness of the proposed framework. From the perspective of machine learning theory, a model with a widely dispersed weight distribution—such as the purely data-driven NN-Direct model, where the standard deviation in the shallow mapping layer (Layer 0) is approximately σ ≈ 0.35–0.37—tends to possess a highly complex and unconstrained hypothesis space. This characteristic inherently increases the risk of fitting high-frequency experimental noise, leading to potential overfitting.

However, by conducting Stage I pre-training on the analytical synthetic dataset, the physical prior effectively acts as an implicit regularizer. As shown by the NN-Arrhenius-Only distributions, this physical injection confines the parameter hypothesis space from a widely dispersed state to a more concentrated, zero-centered distribution. During the subsequent Stage II fine-tuning on real experimental data, the NN-PhysicsInit framework maintains this thermodynamically guided weight initialization. Rather than reverting to a scattered state, its standard deviation in Layer 0 remains bounded at σ ≈ 0.14–0.16. This structural restriction helps prevent weight explosion and promotes the macroscopic topological smoothness of the predicted constitutive surfaces, thereby improving the model’s generalization capability under extreme unseen conditions.

4. Conclusions

In this study, a physics-guided two-stage deep learning framework (NN-PhysicsInit) was developed to describe the hot deformation behavior of TC4 titanium alloy. Unlike conventional neural networks that merely pursue interpolation accuracy, this work rigorously evaluated the model’s out-of-distribution (OOD) extrapolation robustness. The main explicit findings are drawn as follows:

(1) Topological constraint via physical initialization: By decoupling the optimization process into analytical pre-training and experimental fine-tuning, the Arrhenius prior acts as an effective implicit regularizer. It restricts the neural network weights to a concentrated, thermodynamically consistent distribution (with a bounded standard deviation of σ ≈ 0.14–0.16), which mitigates the risk of overfitting inherent in unconstrained black-box models.

(2) Robustness against data imbalance at extreme strain rates: For the blind extrapolation at the highest strain rate (10 s⁻¹), the purely data-driven model overfitted local noise, producing non-physical stress predictions with an average absolute relative error (AARE) of 27.91%. In contrast, the NN-PhysicsInit model filtered out spurious high-frequency fluctuations and successfully captured the macroscopic dynamic softening trend, significantly reducing the extrapolation AARE to 8.92%.

(3) Generalization across the phase transus: During the temperature extrapolation task at 1010 °C (above the β-transus), the physical injection constrained the network from generating diverging predictions. While the purely analytical baseline inevitably exhibits systematic deviations when traversing this phase shift, the dual-stage framework adaptively compensates for the phase-induced stress reduction. Consequently, the extrapolation AARE was reduced to 14.34% (compared to 34.21% for the direct-training model), confirming that the physical prior provides a highly reliable structural baseline even across complex thermodynamic boundaries.

Limitations and Future Directions: It should be noted that the current framework is constructed primarily from a macroscopic phenomenological perspective. As highlighted by Li et al. [25], integrating explicit microstructural features into the machine learning framework is essential for a more fundamental understanding of physical metallurgy mechanisms. Future work will incorporate EBSD and TEM characterizations to establish a comprehensive cross-scale constitutive model.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/met16050510/s1, Figure S1: Detailed comparison between experimental true stress–strain curves and predicted results from four constitutive modeling approaches across a 5 × 4 matrix of 20 thermomechanical conditions within the training domain; Tables S1–S4: Detailed statistical metrics and repeated-run variances for the 5 independent neural network training processes; Tables S5 and S6: Benchmarking results comparing NN-PhysicsInit against conventional machine learning models (Random Forest, ExtraTrees, XGBoost, SVR-RBF, KNN). The core architecture code used in this study is also provided as a separate attachment.

Author Contributions

Conceptualization, L.C. and C.S.; Methodology, L.C. and P.C.; Software, L.C.; Validation, L.C. and P.C.; Formal analysis, L.C.; Investigation, L.C.; Data curation, L.C.; Writing—original draft preparation, L.C.; Writing—review and editing, C.S. and P.C.; Supervision, C.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request. The core neural network architecture code is provided as Supplementary Materials.

Conflicts of Interest

All authors were employed by the company China Academy of Machinery Science and Technology. All authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

DNN	deep neural network
NN	Neural Network
NN-Arrhenius-Only	Neural network trained exclusively on Arrhenius synthetic data
NN-Direct	Purely data-driven neural network
NN-PhysicsInit	Physics-informed dual-stage neural network
OOD	Out-of-Distribution
TC4	Ti-6Al-4V titanium alloy
AARE	Average Absolute Relative Error
RMSE	Root Mean Square Error

References

Wang, H.; Wang, C.; Li, M.; Ma, R.; Zhao, J. Constitutive equations for describing the hot compressed behavior of TC4–DT titanium alloy. Materials 2020, 13, 3424. [Google Scholar] [CrossRef] [PubMed]
Warchomicka, F.; Canelo-Yubero, D.; Zehetner, E.; Requena, G.; Stark, A.; Poletti, C. In-situ synchrotron X-ray diffraction of Ti-6Al-4V during thermomechanical treatment in the beta field. Metals 2019, 9, 862. [Google Scholar] [CrossRef]
Seshacharyulu, T.; Medeiros, S.C.; Frazier, W.G.; Prasad, Y.V.R.K. Microstructural mechanisms during hot working of commercial grade Ti-6Al-4V with equiaxed alpha-beta microstructure. Mater. Sci. Eng. A 2000, 325, 112–125. [Google Scholar] [CrossRef]
Hu, M.; Dong, L.; Zhang, Z.; Lei, X.; Yang, R.; Sha, Y. Correction of flow curves and constitutive modelling of a Ti-6Al-4V alloy. Metals 2018, 8, 256. [Google Scholar] [CrossRef]
Cai, J.; Li, F.; Liu, T.; Chen, B.; He, M. Constitutive equations for elevated temperature flow stress of Ti-6Al-4V alloy considering the effect of strain. Mater. Des. 2011, 32, 1144–1151. [Google Scholar] [CrossRef]
Sellars, C.M.; Mctegart, W.J. On the mechanism of hot deformation. Acta Metall. 1966, 14, 1136–1138. [Google Scholar] [CrossRef]
Peng, W.; Zeng, W.; Wang, Q.; Yu, W. Comparative study on constitutive relationship of as-cast Ti60 titanium alloy during hot deformation based on Arrhenius-type and artificial neural network models. Mater. Des. 2013, 51, 95–104. [Google Scholar] [CrossRef]
Liu, Y.; Ning, Y.; Yao, Z.; Fu, M.W. Hot deformation behavior and processing map of a powder metallurgy nickel-based superalloy. J. Alloys Compd. 2014, 587, 183–190. [Google Scholar] [CrossRef]
Buchmayr, B. A systems engineering analysis of tailored formed metallic hybrids. Prod. Eng. 2021, 15, 211–221. [Google Scholar] [CrossRef]
Sadeghpour, S.; Abbasi, S.M.; Morakabati, M. Deformation behavior and microstructural evolution of Ti-6Al-4V/TiC composites during hot compression. J. Alloys Compd. 2017, 723, 1017–1025. [Google Scholar]
Wang, Q.; Wu, T.; Sun, D.L.; Lai, J. Prediction of flow stress in Ti6Al4V alloy with hydrogen at high temperature using artificial neural network. Mater. Sci. Forum 2007, 539, 3696–3701. [Google Scholar] [CrossRef]
Cheng, C.; Zou, Y. Accelerated discovery of nanostructured high-entropy alloys and multicomponent alloys via high-throughput strategies. Prog. Mater. Sci. 2025, 151, 101429. [Google Scholar] [CrossRef]
Churyumov, A.Y.; Kazakova, A.A. Prediction of true stress at hot deformation of high manganese steel by artificial neural network modeling. Materials 2023, 16, 1083. [Google Scholar] [CrossRef]
Bock, F.E.; Aydin, R.C.; Cyron, C.J.; Huber, N.; Kalidindi, S.R.; Klusemann, B. A review of the application of machine learning and data mining approaches in continuum materials mechanics. Front. Mater. 2019, 6, 110. [Google Scholar] [CrossRef]
Huang, X.; Li, X.; Guan, B.; Zhu, J.; Li, J. Machine learning-based constitutive model for hot deformation behavior of metallic materials. Comput. Mater. Sci. 2022, 207, 111301. [Google Scholar]
Zhang, C.; Shi, Q.; Wang, Y.; Qiao, J.; Tang, T.; Zhou, J.; Liang, W.; Chen, G. Towards an optimized artificial neural network for predicting flow stress of In718 alloys at high temperatures. Materials 2023, 16, 2663. [Google Scholar] [CrossRef]
Zhu, Y.; Zeng, W.; Sun, Y.; Feng, F.; Zhou, Y. Artificial neural network approach to predict the flow stress in the isothermal compression of as-cast TC21 titanium alloy. Comput. Mater. Sci. 2011, 50, 1785–1790. [Google Scholar] [CrossRef]
Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics-informed machine learning. Nat. Rev. Phys. 2021, 3, 422–440. [Google Scholar] [CrossRef]
Haghighat, E.; Raissi, M.; Mura, A.; Hector, H.; Zevalkink, L.; Karniadakis, G.E. A physics-informed deep learning framework for inversion and surrogate modeling in solid mechanics. Comput. Methods Appl. Mech. Eng. 2021, 379, 113741. [Google Scholar] [CrossRef]
Cuomo, S.; Di Cola, V.S.; Giampaolo, F.; Rozza, G.; Raissi, M.; Piccialli, F. Scientific machine learning through physics–informed neural networks: Where we are and what’s next. J. Sci. Comput. 2022, 92, 88. [Google Scholar]
Linka, K.; Hillgärtner, M.; Abdolazizi, K.P.; Aydin, R.C.; Itskov, M.; Cyron, C.J. Constitutive artificial neural networks: A fast and general approach to predictive data-driven constitutive modeling by deep learning. J. Comput. Phys. 2021, 429, 110010. [Google Scholar] [CrossRef]
Fuhg, J.N.; Bouklas, N. The mixed deep energy method for resolving concentration features in finite strain hyperelasticity. J. Comput. Phys. 2022, 451, 110839. [Google Scholar] [CrossRef]
Lin, Y.C.; Li, L.T.; Fu, M.W.; Chen, M.S. Hot deformation behavior and processing map of a Ni-based superalloy. J. Alloys Compd. 2013, 556, 103–110. [Google Scholar]
Li, H.; Wang, X.; Song, Y.; Li, Y.; Li, X.; Ji, Y. Physical metallurgy guided machine learning to predict hot deformation mechanism of stainless steel. Mater. Today Commun. 2023, 36, 106779. [Google Scholar]
Aggarwal, C.C. Neural Networks and Deep Learning: A Textbook, 2nd ed.; Springer: Cham, Switzerland, 2018; pp. 245–280. [Google Scholar]
Frankel, A.; Jones, R.E.; Alleman, C.; Templeton, J.A. Predicting the macroscopic fracture energy of porous metals from highly porous microstructures using machine learning. Comput. Mater. Sci. 2019, 169, 109098. [Google Scholar]
Jones, R.E.; Templeton, J.A.; Sanders, C.M.; Ostien, J.T. Machine learning models of plastic flow based on representation theory. Comput. Model. Eng. Sci. 2018, 117, 309–342. [Google Scholar] [CrossRef]
Nguyen, T.; Chen, J.; Liu, W.K. Machine learning-driven paradigm for polymer aging lifetime prediction: Integrating multi-mechanism coupling and cross-scale modeling to avoid extrapolation divergence. Macromolecules 2024, 57, 3456–3470. [Google Scholar]
Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, UK, 2016. [Google Scholar]
Neyshabur, B.; Tomioka, R.; Srebro, N. In search of the real inductive bias: On the role of implicit regularization in deep learning. In Proceedings of the 3rd International Conference on Learning Representations (ICLR), San Diego, CA, USA, 7–9 May 2015; pp. 1–14. [Google Scholar]
Bartlett, P.L.; Foster, D.J.; Telgarsky, M.J. Spectrally-normalized margin bounds for neural networks. In Proceedings of the Advances in Neural Information Processing Systems 30 (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017; pp. 6240–6249. [Google Scholar]

Figure 1. Schematic diagram of the hot compression test procedure.

Figure 2. True stress–strain curves of the TC4 alloy under various temperatures and strain rates: (a) 800 °C, (b) 850 °C, (c) 900 °C, (d) 950 °C, (e) 980 °C and (f) 1010 °C. The faint lines represent the raw experimental data, while the thick solid lines denote the genuine isothermal flow stress curves after rigorous friction and adiabatic heating corrections. The solid dots indicate the calculated values derived from the Arrhenius model within the training domain. The cross marks represent the purely extrapolated predictions for the out-of-distribution (OOD) conditions (i.e., the blind test sets at 10 s⁻¹ and 1010 °C).

Figure 3. Polynomial relationships between the material constants and true strain for the TC4 alloy: (a) stress multiplier α; (b) stress exponent n; (c) deformation activation energy Q; and (d) material constant ln A. The red dots represent the calculated values at specific strains, and the blue solid lines indicate the 6th-order polynomial fitting curves.

Figure 4. Statistical distributions of the synthetic dataset used for Stage I pre-training: (a) temperature distribution; (b) logarithmic strain-rate distribution; (c) mapping relationship between temperature and generated stress.

Figure 5. Schematic architecture of the physics-informed dual-stage neural network.

Figure 6. Heatmap of experimental data coverage counts under different temperatures and strain rates.

Figure 7. Comparison of stress distributions between the training and extrapolation test sets.

Figure 8. Correlation scatter plots comparing the predicted and experimental flow stress across different modeling approaches. Performance on the interpolation training set (800–980 °C): (a) Arrhenius, (b) NN-Direct, (e) NN-Arrhenius-Only, (f) NN-PhysicsInit. Performance on the 1010 °C temperature extrapolation test set: (c) Arrhenius, (d) NN-Direct, (g) NN-Arrhenius-Only, (h) NN-PhysicsInit. The black dashed diagonal line represents the ideal perfect prediction (y = x). The different colored dots correspond to the specific models evaluated: blue for the Arrhenius model, orange for NN-Direct, grey for NN-Arrhenius-Only, and red for the proposed NN-PhysicsInit framework.

Figure 9. Correlation scatter plots comparing the predicted and experimental flow stress across different modeling approaches. Performance on the interpolation training set (0.001–1.0 s⁻¹): (a) Arrhenius, (b) NN-Direct, (e) NN-Arrhenius-Only, (f) NN-PhysicsInit. Performance on the 10 s⁻¹ strain rate extrapolation test set: (c) Arrhenius, (d) NN-Direct, (g) NN-Arrhenius-Only, (h) NN-PhysicsInit. The black dashed diagonal line represents the ideal perfect prediction (y = x). The different colored dots correspond to the specific models evaluated: blue for the Arrhenius model, orange for NN-Direct, grey for NN-Arrhenius-Only, and red for the proposed NN-PhysicsInit framework.

Figure 10. Detailed comparison between the experimental true stress–strain curves and the predicted results from different constitutive models under four highly indicative conditions within the training domain: (a) 800 °C, 0.001 s⁻¹; (b) 800 °C, 1.0 s⁻¹; (c) 900 °C, 0.001 s⁻¹; and (d) 900 °C, 1.0 s⁻¹. The purely data-driven black-box neural network (orange dashed-dotted line) erroneously overfits high-frequency experimental measurement noise, whereas the proposed physics-informed framework (red solid line) successfully preserves macroscopic thermodynamic smoothness.

Figure 11. Comparison between experimental true stress–strain curves and predicted results from different modeling approaches in the 1010 °C temperature extrapolation domain at varying strain rates: (a) 0.001 s⁻¹, (b) 0.01 s⁻¹, (c) 0.1 s⁻¹, (d) 1.0 s⁻¹, and (e) 10.0 s⁻¹. Note: The gray dotted line (NN-Arrhenius-Only) is completely visually overlapped by the blue dashed line (Arrhenius). This is mathematically expected, as the network trained exclusively on analytical synthetic data perfectly replicates the extrapolation behavior of the foundational Arrhenius model without physical divergence.

Figure 12. Comparison between experimental true stress–strain curves and predicted results from different modeling approaches in the 10 s⁻¹ strain rate extrapolation domain at varying temperatures: (a) 800 °C, (b) 850 °C, (c) 900 °C, (d) 950 °C, (e) 980 °C, and (f) 1010 °C. Note: The gray dotted line (NN-Arrhenius-Only) is completely visually overlapped by the blue dashed line (Arrhenius). This is mathematically expected, as the network trained exclusively on analytical synthetic data perfectly replicates the extrapolation behavior of the foundational Arrhenius model without physical divergence.

Figure 13. Comprehensive layer-wise weight distributions of different models in the temperature extrapolation task. The grid compares Probability Density (Y-axis) versus Weight Value (X-axis) across three model types (rows: NN-Direct, NN-Arrhenius-Only, NN-PhysicsInit) and the network layers (columns: Layer 0, Layer 3). Sub-figures illustrate: (a,b) NN-Direct weights across layers 0, 3; (c,d) NN-Arrhenius-Only weights across layers 0, 3; (e,f) NN-PhysicsInit weights across layers 0, 3.

Figure 14. Comprehensive layer-wise weight distributions of different models in the strain rate extrapolation task. The grid compares Probability Density (Y-axis) versus Weight Value (X-axis) across three model types (rows: NN-Direct, NN-Arrhenius-Only, NN-PhysicsInit) and the network layers (columns: Layer 0, Layer 3). Sub-figures illustrate: (a,b) NN-Direct weights across layers 0, 3; (c,d) NN-Arrhenius-Only weights across layers 0, 3; (e,f) NN-PhysicsInit weights across layers 0, 3.

Table 1. Chemical composition of the investigated TC4 titanium alloy (wt.%).

C	Fe	N	H	Al	O	V	Ti
0.04	0.24	0.013	0.015	6.36	0.20	4.37	Bal.

Table 2. Design of the comparative ablation study.

No.	Models Compared	Training Strategy
1	Arrhenius	Traditional analytical model (strain-compensated)
2	NN-Direct	Neural network trained from scratch using experimental data
3	NN-Arrhenius-Only	Neural network pre-trained using synthetic Arrhenius data only
4	NN-PhysicsInit (Ours)	Neural network pre-trained using synthetic data and fine-tuned using experimental data

Table 3. Neural network architecture and task-specific training hyperparameter configurations.

Category	Parameter	Temperature Extrapolation	Strain Rate Extrapolation
Shared Architecture	Topology	3→128→128→64→1	3→128→128→64→1
	Input variables	(T/1000, ln( $\dot{ε}$ ), ε)	(T/1000, ln( $\dot{ε}$ ), ε)
	Activation/Loss	SiLU/MSE	SiLU/MSE
	Optimizer	Adam	Adam
	Weight decay	1 × 10⁻⁵	1 × 10⁻⁵
Dataset Splitting	Interpolation Temp.	800, 850, 900, 950, 980 °C	800, 850, 900, 950, 980, 1010 °C
	Extrapolation Temp.	1010 °C	—
	Interpolation Strain Rates	0.001, 0.01, 0.1, 1.0, 10.0 s⁻¹	0.001, 0.01, 0.1, 1.0 s⁻¹
	Extrapolation Strain Rate	—	10.0 s⁻¹
	Synthetic dataset size	15,000	10,000
NN-Direct	Epochs/Batch size	2000/64	1500/64
NN-Direct	Initial learning rate	6 × 10⁻⁴	1 × 10⁻³
NN-Arrhenius-Only	Epochs/Batch size	1200/64	800/64
NN-Arrhenius-Only	Initial learning rate	8 × 10⁻⁴	1 × 10⁻³
NN-PhysicsInit: Stage I	Epochs/Batch size	1200/512	800/512
NN-PhysicsInit: Stage I	Initial learning rate	8 × 10⁻⁴	1 × 10⁻³
NN-PhysicsInit: Stage II	Epochs/Batch size	1000/64	600/64
NN-PhysicsInit: Stage II	Initial learning rate	1.5 × 10⁻⁴	2 × 10⁻⁴

Table 4. Statistical metrics of different models in the temperature extrapolation task.

No.	Models Compared	Data Set	R	R²	AARE (%)	RMSE (MPa)
1	Arrhenius	Training set (800–980 °C)	0.9803	0.9609	16.30	22.22
1	Arrhenius	Testing set (1010 °C)	0.9625	0.9263	25.72	13.49
2	NN-Direct	Training set (800–980 °C)	0.9978	0.9957	3.51	6.34
2	NN-Direct	Testing set (1010 °C)	0.9807	0.9617	34.21	14.49
3	NN-Arrhenius-Only	Training set (800–980 °C)	0.9803	0.9610	16.37	22.22
3	NN-Arrhenius-Only	Testing set (1010 °C)	0.9627	0.9267	25.74	13.48
4	NN-PhysicsInit	Training set (800–980 °C)	0.9949	0.9899	7.92	9.73
4	NN-PhysicsInit	Testing set (1010 °C)	0.9287	0.8625	14.34	11.12

Table 5. Statistical evaluation metrics of different models on the strain rate training and testing sets.

No.	Models Compared	Data Set	R	R²	AARE (%)	RMSE (MPa)
1	Arrhenius	Training set $\dot{ε}$ ≤ 1 S⁻¹	0.9607	0.9229	23.14	23.84
1	Arrhenius	Testing set $\dot{ε}$ = 10 S⁻¹	0.9778	0.9561	16.83	51.54
2	NN-Direct	Training set $\dot{ε}$ ≤ 1 S⁻¹	0.9978	0.9956	2.69	5.00
2	NN-Direct	Testing set $\dot{ε}$ = 10 S⁻¹	0.9534	0.9090	27.91	59.85
3	NN-Arrhenius-Only	Training set $\dot{ε}$ ≤ 1 S⁻¹	0.9608	0.9231	23.11	23.81
3	NN-Arrhenius-Only	Testing set $\dot{ε}$ = 10 S⁻¹	0.9777	0.9559	16.87	51.48
4	NN-PhysicsInit	Training set $\dot{ε}$ ≤ 1 S⁻¹	0.9900	0.9802	10.92	10.66
4	NN-PhysicsInit	Testing set $\dot{ε}$ = 10 S⁻¹	0.9787	0.9578	8.92	25.34

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cheng, L.; Shao, C.; Cheng, P. A Physics-Guided Two-Stage Learning Framework for Constitutive Modeling of TC4 Titanium Alloy: Validation Through Temperature and Strain-Rate Extrapolation. Metals 2026, 16, 510. https://doi.org/10.3390/met16050510

AMA Style

Cheng L, Shao C, Cheng P. A Physics-Guided Two-Stage Learning Framework for Constitutive Modeling of TC4 Titanium Alloy: Validation Through Temperature and Strain-Rate Extrapolation. Metals. 2026; 16(5):510. https://doi.org/10.3390/met16050510

Chicago/Turabian Style

Cheng, Lu, Chenxi Shao, and Peng Cheng. 2026. "A Physics-Guided Two-Stage Learning Framework for Constitutive Modeling of TC4 Titanium Alloy: Validation Through Temperature and Strain-Rate Extrapolation" Metals 16, no. 5: 510. https://doi.org/10.3390/met16050510

APA Style

Cheng, L., Shao, C., & Cheng, P. (2026). A Physics-Guided Two-Stage Learning Framework for Constitutive Modeling of TC4 Titanium Alloy: Validation Through Temperature and Strain-Rate Extrapolation. Metals, 16(5), 510. https://doi.org/10.3390/met16050510

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Physics-Guided Two-Stage Learning Framework for Constitutive Modeling of TC4 Titanium Alloy: Validation Through Temperature and Strain-Rate Extrapolation

Abstract

1. Introduction

2. Materials and Methods

2.1. Experimental Materials and Procedures

2.2. Establishment of the Analytical Prior Model

2.3. Physics-Guided Two-Stage Neural Network Framework (NN-PhysicsInit)

2.4. Extrapolation Validation Design and Implementation Details

2.4.1. Dual-Perspective Extrapolation Ablation Design

2.4.2. Neural Network Architecture and Hyperparameters

2.4.3. Statistical Evaluation Metrics

3. Results

3.1. Comparison of Predictive Performance in Training and Extrapolation Domains

3.2. Comparison of Stress–Strain Curve Fitting in the Training Domain

3.3. Extrapolation Behavior in Unseen Deformation Domains

3.4. Analysis of Parameter Distributions and Their Relation to Model Behavior

4. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI