A Cooperative Soft-Hard PINN Framework for Decoupling the Thermoelasticity and Thermal Convection Multiphysics

Liu, Yuxin; Zhou, Chuyu; Xin, Guoguo; Nan, Pengyu; Yang, Hangzhou

doi:10.3390/app16041885

Open AccessArticle

A Cooperative Soft-Hard PINN Framework for Decoupling the Thermoelasticity and Thermal Convection Multiphysics

by

Yuxin Liu

¹,

Chuyu Zhou

¹,

Guoguo Xin

^1,2,*,

Pengyu Nan

^1,2

and

Hangzhou Yang

^1,2,*

¹

School of Physics, Northwest University, Xi’an 710100, China

²

Fundamental Discipline Research Center for Quantum Science and Technology of Shaanxi Province, Xi’an 710049, China

^*

Authors to whom correspondence should be addressed.

Appl. Sci. 2026, 16(4), 1885; https://doi.org/10.3390/app16041885

Submission received: 22 January 2026 / Revised: 10 February 2026 / Accepted: 11 February 2026 / Published: 13 February 2026

Download

Browse Figures

Versions Notes

Abstract

Physics-informed neural networks (PINNs) often struggle to balance multiple loss terms in thermally coupled multiphysics problems. We propose Cooperative Soft-Hard PINNs (s-hPINN/s-HB-PINN), which apply soft constraints to fields with Neumann conditions while enforcing hard constraints on others to balance exact boundary enforcement with training stability. Validated on thermoelasticity and thermal convection, our method reduces training time by approximately 56%. In thermal convection experiments, incorporating partial data further reduces velocity errors by up to 78% compared to standard PINNs. We subsequently assessed the framework’s robustness against varying relative Gaussian white noise levels and different data sampling locations. The result demonstrate that s-HB-PINN maintains high-fidelity predictions even under noise interference, consistently outperforming baseline methods. This confirms that the proposed collaborative strategy offers a superior trade-off between accuracy, efficiency, and robustness in complex multiphysics environments.

Keywords:

physics-informed neural network; deep learning; multi-physics coupling; thermoelasticity; thermal convection

1. Introduction

Thermally coupled multiphysics problems are ubiquitous in both natural phenomena [1,2,3] and engineering practice [4,5,6], involving complex interactions among heat conduction, fluid flow, and solid deformation across diverse applications from aerospace propulsion systems and nuclear reactor cooling devices to thermal management of electronic equipment. A canonical example is the conjugate heat transfer problem, where heat conduction in the solid domain and convective heat transfer in the fluid domain are interconnected through interface coupling conditions, forming a complex bidirectional coupled system. Accurate simulation of such systems is critical for engineering design, safety assessment, and performance optimization.

Currently, numerical methods such as the Finite Difference Method (FDM) [7] and the Finite Element Method (FEM) [8] are predominant in the field of heat conduction. For instance, Ye et al. integrated FDM with Monte Carlo simulations to predict the effective thermal conductivity of fractal porous media characterized by rough surfaces [9]. regarding mesh adaptability, a polygonal FEM based on Wachspress shape functions has been successfully applied to two-dimensional steady-state heat conduction problems, demonstrating significantly enhanced robustness in handling irregular boundaries and singularities [10]. Furthermore, a novel polyhedral Scaled Boundary Finite Element Method (PSBFEM) incorporating octree meshes offers an efficient solution for transient heat conduction within complex three-dimensional geometries [11]. Additionally, by introducing fractal calculus, an effective thermal conductivity (TC) model rooted in fractal theory has been established to address solid–liquid two-phase heat transport in rough fracture networks [12].

However, traditional numerical methods still face significant challenges in handling high-dimensional parameter spaces, complex geometric boundaries, and inverse problems. These include the high computational cost of mesh generation for intricate geometries [13] and limited interpolation accuracy [14]. Furthermore, modern engineering systems often require reconstructing full-field physical quantities or identifying key model parameters from sparse experimental data, thereby transforming the problem from a conventional forward formulation into a more challenging inverse problem and further increasing computational complexity.

With the rapid advancement of artificial intelligence, deep learning has attracted growing interest for solving heat transfer problems. Although traditional deep neural networks have been applied to model heat transfer phenomena [15,16], they rely heavily on large amounts of high-quality labeled data. This dependence severely limits their performance in high-dimensional settings, multi-physics coupled systems, or scenarios where experimental data are scarce.

To address these limitations in physical modeling, Raissi et al. introduced Physics-Informed Neural Networks (PINNs) [17]. Unlike conventional machine learning methods, PINNs embed the governing physical equations directly into the network architecture. As a result, they can produce physically consistent solutions even without training data, guided solely by the underlying physics.

In recent years, PINNs have seen rapid adoption across a wide range of physical problems and have been successfully applied to heat transfer applications. For example, temperature measurements from a limited number of sensors can be used as inputs to train a real-time surrogate model that accurately predicts the full temperature field and the corresponding heat flux [18]. Similarly, Cai et al. combined sparse observational data with PINNs to reconstruct both temperature and velocity fields in forced and mixed convection cases across multiple regions [19]. Shang et al. used PINN to identify boundary heat fluxes and thermal conductivity [20]. Wei et al. [21] propose a Conditioned Adaptive Physics Network framework for efficient reconstruction of three-dimensional transient temperature fields in thermal protection systems of hypersonic vehicles. Kalpana et al. proposed a novel hybrid model by combining convolutional neural networks to predict heat transfer coefficients, showing higher accuracy compared with traditional and purely machine-learning approaches [22].

However, when applied to complex thermal systems, PINNs still face several challenges. These include difficulty in effectively balancing multiple loss terms, slow convergence in strongly nonlinear problems, and potential instability in long-term predictions [23]. To address these issues, researchers have developed a range of improvement strategies.

One approach adaptively adjusts loss weights during training to balance the contributions of different physical terms. For example, McClenny et al. used inverse gradient optimization to dynamically balance loss minimization and weight assignment [24]. Another strategy modifies the network architecture to enforce boundary conditions by construction, thereby reducing the number of loss terms that need to be balanced. Lu et al. proposed the hard-constrained PINN (hPINN) [25], which builds trial functions or analytical particular solutions that satisfy boundary conditions exactly. This ensures strict compliance with boundaries through the network structure itself, eliminating the loss of conflicts typical of soft-constrained formulations. Building on this idea, Zhou et al. introduced the Hybrid Boundary PINN (HB-PINN) [26], which adds smooth residual terms near boundaries to improve transition stability and adaptability.

In addition, domain decomposition techniques such as eXtended PINNs have been widely explored [27]. These methods split the computational domain into multiple subdomains, each modeled by an independent neural network, enhancing both scalability and local solution accuracy. Collectively, these advances have greatly improved the capability of PINNs for thermal multiphysics problems.

For instance, Ma et al. applied hPINN to electro-thermal coupling problems and demonstrated high prediction accuracy [28]. Wu et al. enhanced the Fourier neural operator with transfer learning to accelerate multi-objective stochastic optimization for heat exchanger systems, achieving better computational efficiency and predictive performance while maintaining a favorable trade-off between cost and accuracy [29]. Abueidda et al. developed a Physics-Informed Temporal Convolutional Network (PI-TCN) for thermoelasticity and integrated it into a finite element framework to exploit the fast inference capability of neural networks. Their results showed that PI-TCN outperforms standard PINNs [30]. More recently, Qiu et al. proposed an adaptive PINN for three-dimensional transient thermo-mechanical coupling. Their method uses automatic differentiation to embed the full set of governing equations and boundary conditions into a unified loss function and applies an adaptive balancing strategy to address scale mismatches among equations. While these advancements have shown promise in their respective domains, applying PINN-based frameworks to even more complex, multi-scale coupled systems remains a formidable challenge. In such scenarios, researchers frequently encounter persistent issues inherited from the standard PINN architecture, such as slow convergence [31], training instability [32], or limited accuracy [32] due to the inherent complexity of the loss landscape [23,32,33,34].

Despite recent advances in improving the accuracy and convergence of PINNs, significant challenges remain in effectively and accurately solving complex thermal multiphysics problems, especially those involving special boundary conditions.

In numerical simulations of thermally coupled multiphysics systems, heat transfer is typically more sensitive to interfaces than other physical fields, and boundary heat flux often directly participates in the coupling mechanism. As a result, thermal boundary conditions require more careful treatment and have a greater impact on overall solution accuracy. Neumann boundary conditions, which are specified in terms of derivatives, rely on gradient information obtained through automatic differentiation. This makes them particularly sensitive to noise and harder to fit accurately during training, which can significantly degrade both model stability and boundary precision.

In response to these limitations, this paper proposes a soft-hard constraint collaborative training method to solve multiphysics systems with complex coupling behavior more efficiently and accurately. The core idea is to treat different physical variables separately based on the complexity and analytical tractability of their boundary conditions, and to apply differentiated boundary constraint strategies. This approach combines the high boundary accuracy of hard constraints with the training stability of soft constraints.

Focusing on representative multiphysics problems involving thermal coupling, this study selects two benchmark thermal models as test cases: a thermoelasticity model and a thermal convection model. To systematically evaluate the effectiveness of the proposed soft-hard constraint collaborative training methods (s-hPINN and s-HB-PINN), we compare them against three established PINN approaches: the standard PINN (sPINN), hPINN, and HB-PINN. Numerical experiments on both coupled models enable a comparative assessment of each method in terms of solution accuracy, training stability, and boundary treatment performance, thereby revealing their respective strengths and limitations under varying coupling conditions. The main contributions of this work are as follows:

(1): We propose a novel collaborative training strategy that combines soft and hard constraints. This method achieves higher accuracy than existing approaches for multiphysics problems.
(2): We conduct comprehensive validation on multiple multiphysics systems, including thermoelasticity and thermal convection models, and evaluate how different PINN formulations perform with respect to coupling strength and spatial complexity of the physical fields.
(3): We demonstrate the advantages of the collaborative constraint strategy for steady-state multiphysics problems. Experimental results show that the proposed approach maintains physical consistency while achieving a better balance between accuracy and training efficiency, offering improved generalization and strong potential for practical engineering applications.

2. Methods

2.1. Physics-Informed Neural Networks

PINNs embed the physical constraints of partial differential equations directly into neural network training by constructing a loss function that incorporates residuals from the governing equations, boundary conditions, and initial conditions. Unlike traditional numerical methods, PINNs do not require spatial discretization. Instead, they approximate PDE solutions through continuous function representation. This framework provides excellent scalability and geometric flexibility, making it especially well-suited for modeling high-dimensional systems, multiphysics coupled problems, and physical scenarios with complex boundary conditions.

A system of coupled partial differential equations can generally be expressed as follows [35]:

\begin{matrix} N (u (x); ψ) = 0 x \in Ω \\ B (u (x)) = g (x), x \in \partial Ω \end{matrix}

(1)

Among them,

N

represents the differential operator,

u

represents the physical field variable that needs to be obtained, and

ψ

is the set of given parameters. The governing equation is defined on the

Ω

field, where

x

represents the input vector

[x_{1}, x_{2}, \dots, x_{n}]

.

B (\cdot)

is an operator that can represent various types of boundaries,

g (x)

is a boundary condition that can be a constant value or function, and

\partial Ω

represents the boundary domain.

To solve the multiphysics coupling problems described by PDE, this paper uses the PINN as a unified framework to approximate the solutions. PINN employ a fully connected neural network architecture to approximate

u (x)

, where the network takes coordinates

(x)

as input and yields

\hat{u} (x; θ)

as output. Here,

θ

represents the trainable parameters within the network. The neural network comprises multiple hidden layers, where the input of each hidden layer

X = [x_{1}, x_{2}, \dots, x_{j}]

and the outputs

Y = [y_{1}, y_{2}, . . ., y_{j}]

are propagated through the network as

y_{j} = σ (w_{i, j} x_{i} + b_{j})

(2)

where

σ (\cdot)

represents the activation function of a simple nonlinear trans- formation,

w_{i, j}

and

b_{j}

denote the trainable weights and biases, respectively. The parameters of the network can be trained by minimizing a composite loss function, which takes the following form:

L_{P D E} = \frac{1}{n_{c}} \sum_{i \in n_{c}} ∥ N (\hat{u} (x; θ)) ∥_{2}^{2}

(3)

L_{B C} = \frac{1}{n_{b c}} \sum_{i \in n_{b c}} ∥ B (\hat{u} (x; θ) - g (x)) ∥_{2}^{2}

(4)

here,

L_{P D E}

refers to the loss associated with the residuals of PDE, and

n_{c}

represents the set point of the equation.

∥ \cdot ∥_{2}^{2}

represents mean squared error (MSE).

L_{B C}

is the loss of boundary conditions, and

n_{b c}

represents the number of boundary points. Since the initial conditions are specific forms of boundary conditions, their losses are the same as Equation (4). The total loss is now expressed as

L_{t o t a l} = λ_{p d e} L_{P D E} + λ_{b c} L_{B C}

(5)

The parameters

λ_{p d e}

and

λ_{b c}

denote the weights assigned to the PDE loss and the boundary loss, respectively. These weights are introduced to balance gradient magnitudes across different loss components, accelerate convergence, and enhance the accuracy of the computed solution.

Figure 1 illustrates the standard PINN architecture. Input variables are passed through a neural network, which outputs dimensionless field variables via activation functions. To accurately evaluate the derivatives required by the governing equations, boundary conditions, and initial conditions, automatic differentiation is applied during forward propagation.

Residuals for the coupled system of partial differential equations, boundary conditions, and initial conditions are then computed using collocation points

n_{c}

and boundary points

n_{b c}

. These residuals measure the mismatch between the neural network’s predictions and the underlying physical constraints. The training process proceeds iteratively: it terminates when either the total residual loss falls below a predefined tolerance or the maximum number of iterations is reached; otherwise, optimization continues until one of these criteria is satisfied.

In PINN, boundary condition treatment critically affects solution accuracy. Traditional soft constraints incorporate boundary conditions as penalty terms in a composite loss, often causing conflicts that reduce accuracy. To address this, a hard constraint approach that enforces boundary conditions exactly by designing the network output structure, avoiding penalty optimization. This method introduces auxiliary functions satisfying boundary properties to weight and combine the network output, ensuring the prediction inherently meets boundary values. Typically, it involves three networks:

N_{P}, N_{D} and N_{H}

[36].

\hat{u} (x) = u_{p} (x) + D (x) ⊙ u_{h} (x)

(6)

Let

\hat{u} (x)

denote the target physical quantity,

u_{p} (x)

is known particular solution satisfying initial/boundary conditions, and

N_{D}

is the distance function. To handle complex geometries, two low-capacity auxiliary networks,

N_{P}

and

N_{D}

, approximate

u_{p}

and

D

, yielding

{\hat{u}}_{p} (x)

and

\hat{D} (x)

. After pretraining

u_{p} (x)

and

D (x)

, they are combined with

N_{H}

according to Equation (6) to form the final solution

\hat{u} (x)

. In the final training stage, we optimize only the parameters of

N_{H}

, ensuring that the remaining constraints are satisfied and the residuals of the governing partial differential equations are effectively controlled.

However, this PINN approach faces notable limitations. The hard constraint method relies on two key components: a boundary solution network and a distance function network. During training, the boundary solution network takes only boundary coordinates as input to guarantee exact satisfaction of boundary conditions. When the boundary geometry or conditions are complex, however, the deep neural network’s output in the interior domain may lose smoothness or exhibit spurious oscillations. This can severely slow convergence or even prevent the main network from converging, limiting the applicability of hard constraints to problems with intricate boundaries.

To address this challenge, Zhou et al. proposed the HB-PINN, which introduces structural and loss-function innovations that significantly improve both representational capacity and training stability under complex boundary conditions [26]. Key differences include: (1)

N_{P}

incorporates residual loss of PDE and adjusts the weight of the loss of the boundary to improve training near the boundaries; (2)

N_{D}

uses a power distance function as a training label, imposing the output to approach zero near the boundaries and rapidly increase away from them, ensuring

N_{H}

dominates training in the interior.

2.2. Methods for Collaborative Soft-Hard Constraints

In numerical simulations of thermal engineering problems, accurate enforcement of boundary conditions is essential for ensuring the physical fidelity of the solution. The temperature field, as a canonical scalar field, is typically subject to two types of boundary conditions: Dirichlet boundaries, which prescribe temperature values directly, and Neumann boundaries, which specify the normal heat flux or, equivalently, the normal gradient of temperature.

When solving such problems with PINNs, imposing hard constraints on Dirichlet boundaries by structurally ensuring that the network output exactly matches the prescribed temperature at the boundary generally yields higher accuracy and improved training stability compared to soft constraint approaches. This strategy is especially effective for fixed-temperature boundary scenarios.

However, the performance of hard constraint methods deteriorates significantly when applied to Neumann boundaries involving prescribed heat flux or dissipative thermal conditions. The fundamental reason lies in Neumann conditions: they constrain the normal derivative of the temperature field, which governs the heat flux. Unlike pointwise temperature values, this derivative-based condition exerts a nonlocal influence, shaping the temperature distribution not only at the boundary but also across adjacent interior regions. Enforcing such a condition through a rigid structural modification of the network restricts its ability to adaptively resolve the spatial variation in temperature near the boundary. Consequently, conflicts arise between the optimization objectives of satisfying the governing equations in the domain and matching the prescribed heat flux at the boundary.

These challenges are exacerbated in coupled multiphysics settings, often leading to error accumulation near boundaries, loss of solution smoothness, poor convergence behavior, or even numerical divergence. As a result, it becomes difficult to achieve the dual accuracy required in engineering practice for both temperature and heat flux.

Therefore, for the broad class of thermal problems featuring complex boundary conditions, developing computational frameworks that can effectively coordinate Dirichlet boundary and Neumann boundary constraints is critical to improving the robustness and practical utility of PINNs.

In this study, for multi-physical problems related to thermal fields, we adopt a collaborative soft-hard constraint strategy. The allocation of soft and hard constraints follows a key principle: hard constraints are preferentially applied to physical quantities whose boundary conditions can be accurately and stably approximated by a low-capacity network or simple analytical functions, thereby reducing conflicts between boundary loss terms and PDE loss terms during model training. Conversely, for thermal quantities with boundary conditions involving complex derivative relationships (such as Neumann boundaries of the temperature field), soft constraints are applied to ensure training stability and overall prediction accuracy.

Assume that the physical field to be obtained comprises variables

m

and

n

. We employ hard constraints to approximate the variable

m

and utilize soft constraints networks for the variable

n

. The variables are subsequently linked through a system of coupled equation losses. In the final training of the composite PINN, the weights and biases of the network

N_{H}

approximating

m_{h} (x)

and the soft constraints network

N_{S}

approximating

n_{s} (x)

will be the trainable variables exposed to the optimizer.

f (\hat{D} (x; θ_{d})) = 1 - (1 - \hat{D} (x; θ_{d}) / m a x (\hat{D} (x; θ_{d})))^{a}

(7)

In Equation (7), the hyperparameter a is typically set to 5, 10, or 15 [26]. To compare the impact of different distance networks, the distance networks of h-PINN and HB-PINN are, respectively, incorporated into Equation (7). When the hyperparameter a is set to 1, Equation (7) reduces to the traditional distance function. These networks are embedded into s-hPINN and s-HB-PINN, and their overall architectures are illustrated in Figure 2.

In this work, the choice between hard and soft boundary constraints is guided by three practical criteria rather than solely by the mathematical type of boundary condition.

(1): Boundary Embeddability: Hard constraints are preferred when boundary conditions can be precisely and stably embedded into the neural network architecture via simple analytical functions or low-capacity networks, such as Dirichlet conditions. Conversely, Neumann or Robin conditions involving directional derivatives are difficult to implement stably due to the requirement for additional differentiation operations and are thus unsuitable for hard-coding.
(2): Gradient Stability: Enforcing hard constraints on derivative-based boundaries introduces high-order automatic differentiation, which significantly amplifies gradient noise. In gradient-sensitive problems such as heat conduction, this often leads to training instability.
(3): Multi-physics Coupling Sensitivity: In strongly coupled systems like thermal convection, hard constraints may propagate local setup errors to other field variables, thereby compromising global consistency. In contrast, soft constraints allow the network to adaptively coordinate the relationships between different physical quantities during training, enhancing the overall physical plausibility of the solution.

Given that the thermal-related multi-physics problems investigated in this study often involve Neumann-type temperature boundaries (e.g., prescribed heat flux), while variables such as velocity and displacement typically have clear Dirichlet boundaries or can avoid derivative constraints through variable reconstruction (e.g., directly predicting stress), we adopt a hybrid strategy. Specifically, we apply soft constraints to the temperature field and hard constraints to other field variables. It should be noted that this strategy is problem-dependent and may not be universally optimal. To assist researchers in making a priori judgments for new problems, we provide a reference framework in Figure 3 and Algorithm 1.

Algorithm 1 Soft-Hard Collaborative Physics-Informed Neural Network
Input: spatiotemporal coordinates $x$
Output: multiphysics variables of the thermal coupled system, with Dirichlet boundaries variables denoted by m and the temperature field with Neumann boundaries denoted by n
Step 1: Pretrained networks $N_{P}$ and $N_{D}$
Initialize the parameters $θ_{p} and θ_{d}$ for the networks $N_{P}$ and $N_{D}$ , respectively.
$L_{p a r t} = \frac{1}{n_{b c}} \sum_{i \in n_{b c}} {∥ B ({\hat{m}}_{p} (x; θ_{p})) - g (x) ∥}_{2}^{2}$ (8)
$L_{d i s} = \frac{1}{n_{d}} \sum_{i \in n_{d}} {∥ D_{m} (x) - f ({\hat{D}}_{m} (x; θ_{d})) ∥}_{2}^{2}$ (9)
Freeze the parameters $θ_{p} and θ_{d}$ after training.
Step 2: Train networks $N_{H}$ and $N_{S}$
Initialize parameters $θ_{h} and θ_{s}$ for networks $N_{H}$ and $N_{S}$ , respectively. for epochs = 1, 2, … do
(a) Compute the soft boundary loss for Neumann boundaries:
$L_{b c_s o f t} = \frac{1}{n_{b c}} \sum_{i \in n_{b c}} ∥ B ({\hat{n}}_{s} (x; θ_{s}) - g (x)) ∥_{2}^{2}$ (10)
(b) Compute the PDE residual loss:
$L_{p d e} = \frac{1}{n_{c}} \sum_{i \in n_{c}} ∥ N (({\hat{m}}_{h} (x; θ_{c}) + ({\hat{n}}_{s} (x; θ_{s}) ∥_{2}^{2}$ (11)
(c) Compute the total loss:
${L_{t o t a l} = L}_{p d e} + L_{b c_s o f t}$ (12)
(d) Backpropagation and update parameters $N_{H}$ and $N_{S}$ via gradient descent. end

3. Results and Discussion

To validate and demonstrate the effectiveness of the proposed method, this study selects two representative scenarios for in-depth analysis and experimentation. A systematic comparison is conducted among the standard PINN, hPINN, HB-PINN, and the proposed soft-hard constraint collaborative approach. Through comprehensive comparative experiments, the feasibility and potential of the collaborative strategy are evaluated. Flowcharts of the different methods are illustrated in Figure 4 below.

This study uses CFD simulation results as the reference benchmark to compare the performance of sPINN, hPINN, HB-PINN, and the proposed cooperative soft-hard PINN (s-hPINN and s-HB-PINN). To address the sensitivity of PINN to network initialization, a fixed random seed (77777) was used to initialize both PyTorch 2.10.0 and NumPy 2.3.5 in all experiments, ensuring the reproducibility of the results.

In terms of network architecture design in Table 1, network

N_{P}

uses a DNN with four hidden layers, each containing 20 neurons; network

N_{D}

uses four hidden layers with 64 neurons each; networks

N_{H}

and

N_{S}

use six hidden layers with 128 neurons each. The tanh activation function was used throughout all experiments, and Xavier initialization [37,38,39,40] was applied to all trainable parameters. The Adam optimizer was selected for the optimization process [41], combined with a cosine annealing learning rate schedule for training, with specific hyperparameters adjusted according to the model characteristics. The hyperparameter

a

for the distance metric function was set to 10, following the suggestion of Zhou et al. [26]. All PINN models presented in this paper were trained on an NVIDIA RTX 4090 GPU.

3.1. Steady-State Thermoelastic Problem

In this study, we have investigated a linear quasistatic thermoelastic problem assuming that the materials are isotropic, homogeneous and neglecting inertia. The solution involves simultaneously solving the heat conduction equation, momentum conservation equation, and thermoelastic constitutive relations. The force equilibrium and heat conduction equations are given as follows [42,43,44]:

\nabla \cdot σ = 0

(13)

\nabla \cdot q = 0

(14)

where

σ

represents the Cauchy stress tensor and

q

is the thermal flux vector. The constitutive elastic equation is expressed as [37]:

σ = 2 μ ε + (λ t r (ε) - α (3 λ + 2 μ) (T - T_{o})) I

(15)

here, µ and λ denote the Lamé constants, which characterize the mechanical behavior of isotropic elastic materials; α represents the coefficient of thermal expansion; ε signifies the infinitesimal strain tensor, expressed as:

ϵ = \frac{1}{2} (\nabla u + (\nabla u)^{T})

(16)

The first case studies a 2D quarter plate with a defect. A prescribed temperature is applied on the defect boundary, while traction and heat flux conditions are set on the outer boundaries in Figure 5. Material parameters are listed in Table 2. The network takes 2D spatial coordinates as input and predicts displacement, stress, and temperature fields. The soft-hard constraint scheme applies hard constraints for displacement and stress, and soft constraints for temperature. Sampling includes

n_{c} = 10,000

interior points

1000

outer boundary points, and

3000

defect boundary points.

Figure 6 shows the evolution of different loss terms with respect to training iterations in the solution of the steady state thermoelasticity problem. The PDE loss for all methods decreases steadily and eventually converges. However, their boundary losses exhibit distinct behaviors.

As training progresses, HB-PINN and the proposed s-HB-PINN further reduce their PDE losses, indicating a stronger capacity to capture the coupled thermoelastic physics. In contrast, sPINN converges quickly but attains a relatively high final boundary loss, as shown in Figure 6. This arises from competition between the boundary and PDE loss terms, which hinders their simultaneous minimization and may degrade prediction accuracy.

For hPINN, Figure 6 reveals persistent difficulty in reducing boundary loss, particularly under Neumann boundary conditions, even as training proceeds. This behavior highlights a fundamental limitation of purely hard-constrained formulations when applied to multiphysics problems involving complex or mixed boundary conditions. In such cases, the strict enforcement of boundary constraints can restrict the network’s flexibility, leading to optimization imbalance and degraded boundary satisfaction.

Although HB-PINN achieves stable overall convergence, with both PDE residuals and boundary losses reduced to acceptable levels, its Neumann boundary loss remains noticeably higher than those of s-hPINN and s-HB-PINN. This residual discrepancy may adversely affect solution accuracy in regions adjacent to Neumann boundaries, where thermoelastic responses are particularly sensitive to boundary condition enforcement.

In contrast, s-hPINN and s-HB-PINN consistently drive boundary losses to lower values while maintaining stable and smooth convergence throughout the training process. This improved behavior indicates that the coordinated use of soft and hard constraints provides a more balanced optimization mechanism, allowing the network to satisfy both governing equations and complex boundary conditions more effectively. Consequently, the proposed collaborative constraint strategy offers a robust and reliable approach for addressing the challenges associated with thermoelastic coupling problems featuring complex boundary configurations.

In this work, we focus not only on the numerical discrepancy between predicted and reference solutions, such as the relative L2 error, but also on the intrinsic physical consistency of the predictions. To this end, we compute the residual field of the governing equations

\nabla \cdot σ = 0

for all methods and perform pointwise verification of the predicted stress fields using automatic differentiation.

Figure 7 shows the two-dimensional distribution of PDE residuals for different methods in the thermoelastic problem, visualized with a logarithmic color scale to highlight small-magnitude regions. As can be seen, the residual map of the proposed s-HB-PINN method exhibits the lightest colors, with values predominantly in the range of

10^{- 3}

to

10^{- 4}

, whereas other baseline methods display prominent dark red regions indicating much larger residuals, up to

10^{- 1}

to

10^{0}

.

Such a low-magnitude residual distribution provides robust evidence that the displacement and stress fields predicted by s-HB-PINN are highly self-consistent and strictly satisfy the conservation laws within the computational domain. Consequently, these results validate the reliability of the proposed synergistic strategy in addressing complex multi-physics problems from the perspective of fundamental physical mechanisms.

From Figure 8 and Figure 9, both sPINN and hPINN demonstrate notable limitations in their predictive results, struggling to accurately forecast multiple physical quantities simultaneously. Specifically, in the temperature contour plot of hPINN, the predictions are significantly influenced by the Neumann boundary condition. HB-PINN demonstrates good agreement with the finite element solution in temperature and displacement predictions, but stress predictions at the notch are significantly underestimated, as further confirmed in Figure 10. In contrast, s-hPINN and s-HB-PINN significantly improve the prediction accuracy of all physical quantities, with s-HB-PINN showing the highest agreement; its heat maps closely match the finite element results and accurately capture the variation in physical quantities from the boundary to the interior.

In addition, the comparison of stress distribution along the centerline between the finite element solution and different PINN methods is shown in Figure 10. The s-HB-PINN results show significant agreement with the finite element solution. In Figure 8, the normal stress distributions predicted by s-HB-PINN and the finite element method are very similar, accurately capturing the stress gradient from the edge to the center of the region. This visual similarity is quantitatively confirmed along the centerline in Figure 10. Among the methods, s-HB-PINN most closely reflects the curve variation of the finite element solution. s-hPINN and hPINN capture the general trend of the curve but show slightly larger numerical deviations. HB-PINN overestimates or underestimates stress changes at curve inflection points, leading to noticeable deviation from the finite element solution. sPINN performs poorly in the quantitative analysis, almost failing to capture the curve variation, which further indicates that sPINN is not suitable for complex multiphysics problems.

Table 3 and Figure 11 summarize the training time and L2 errors of various physical quantities for each method in the thermoelasticity problem. The results indicate that s-HB-PINN achieves the lowest errors across all quantities, followed by s-hPINN. Both methods consistently outperform the others, with the most substantial improvement observed in temperature prediction. hPINN yields the largest temperature error due to its inability to adequately resolve boundary temperature variations. In contrast, HB-PINN strikes a more effective balance between enforcing complex boundary conditions and learning the coupled field equations, resulting in more stable predictions overall.

The standard PINN generally exhibits higher errors, and its displacement field shows signs of overfitting as training proceeds. Under an identical and modest number of training iterations, the stress L2 error of s-HB-PINN is approximately 60% lower than that of the standard PINN, demonstrating its enhanced capability in modeling complex multiphysics coupling. The systematic sensitivity analysis of coupling strength is detailed in Appendix A.

3.2. Transient Thermoelastic Coupling

Building upon the steady state analysis, we further evaluated the proposed methods on a transient thermoelastic problem. The framework illustrated in Figure 5 was extended by incorporating a temporal dimension. The plate was assumed stationary at the initial time, with a total simulation duration of 1 s. Given that the sPINN already exhibited unsatisfactory performance in the steady state case, the transient study focuses exclusively on improved methods that incorporate hard constraints.

Figure 12 and Figure 13 show the displacement and temperature fields, along with their corresponding error maps, after 200,000 training iterations for the different PINN methods. All methods produce displacement fields in general agreement with the CFD reference solution. In contrast, larger discrepancies are observed in the temperature predictions. The proposed soft-hard constraint collaborative methods yield substantially lower temperature errors than the other approaches. Furthermore, the error maps reveal that the temperature errors of hPINN and HB-PINN are primarily localized near Neumann boundaries. This observation highlights a critical challenge in thermoelastic simulations: inaccurate or overly rigid enforcement of Neumann conditions can introduce significant local biases that propagate into the coupled solution.

Figure 14 shows the evolution of the relative L2 error in temperature prediction with respect to both physical time and training iterations for each method. As training progresses, the error for all methods generally decreases, indicating gradual convergence of the neural networks. However, a slight but consistent increase in error is observed over physical time across all approaches. This temporal error growth primarily stems from the accumulation of approximation errors in transient simulations. During long-term evolution, small inaccuracies at early time steps are continuously propagated and amplified through the dynamics governed by the PDEs.

Among all compared methods, hPINN consistently exhibits the highest temperature error throughout the entire simulation period, suggesting that enforcing hard constraints alone without adaptive data guidance or geometric prior regularization is insufficient to maintain high accuracy in time-dependent multiphysics problems. In contrast, the error curves of s-hPINN and s-HB-PINN nearly overlap and remain stably at the lowest level throughout training. This close agreement not only demonstrates their superior accuracy in temperature prediction but also reflects exceptional stability in capturing the spatiotemporal evolution of the thermal field.

3.3. Steady-State Thermal Convection

The problem of forced convection is extensively encountered in practical applications such as engineering heat transfer and fluid mechanics, where the solution typically involves the coupling between velocity, pressure, and temperature fields. The governing equations for this problem are the incompressible Navier–Stokes equations, and the corresponding equations are [45,46]:

\frac{\partial T}{\partial t} + (u \cdot \nabla) θ = \frac{1}{P e} \nabla^{2} T

(17)

\frac{\partial u}{\partial t} + (u \cdot \nabla) u = - \nabla p + \frac{1}{R e} \nabla^{2} u + f + R i T

(18)

\nabla \cdot u = 0

(19)

here,

T

,

u = (u, ν)^{T}

, and

p

represent the dimensionless temperature field, velocity field, and pressure field, respectively. The symbol

\nabla

denotes the gradient operator. Pe, Re, and Ri stand for the Péclet number, Reynolds number, and Richardson number, respectively. In the context of the problem discussed herein, Ri is set to zero, and the external force

f

is not considered.

We consider 2D steady forced convection in a closed domain (in Figure 15). The network takes spatial coordinates as input and outputs velocity, temperature, and pressure. A semi-circular heat source at the bottom is set to

T = 1

, while cold fluid enters from the left (

u = 1

,

v = 0

) and exits through the right. All other boundaries are subject to no-slip conditions with

T = 0

. and the right boundary is thermally insulated with

p = 0

. In this study, we adopt

R e = 50

and

P e = 36

, a widely used parameter set in classical thermally driven cavity benchmarks [19]. This combination represents a standard laminar regime with moderate nonlinearity (

R e = 50

) and convection-dominated heat transfer (

P e = 36

), corresponding to

P r \approx 0.72

, close to that of air). The choice aims not at quantitative replication of specific results, but at evaluating the proposed constraint strategy under a physically representative benchmark setting. In the cooperative soft-hard PINN Framework, velocity and pressure are enforced via hard constraints, and temperature via soft constraints.

Figure 16 shows the evolution of the different loss terms during training for each method. As the number of iterations increases, the PDE and boundary losses of s-hPINN and s-HB-PINN decrease steadily, indicating stable training and consistent convergence. Although sPINN converges early and attains a low boundary loss, its PDE loss remains large, leading to a higher total loss than the other methods and implying limited predictive accuracy. For hPINN, the Neumann boundary loss increases with training iterations, which drives the overall rise in boundary loss and reflects degraded accuracy near Neumann boundaries. In contrast, HB-PINN incorporates PDE residuals into the training of boundary conditions, effectively suppressing the fluctuations typically caused by hard constraints. This yields smoother loss decay and improved boundary prediction accuracy over successive iterations.

As shown in Figure 17 and further quantified in Figure 18, the high complexity of the thermal convection model limits the ability of sPINN to accurately predict the spatial distribution of physical quantities. Although hPINN captures the general trends, its accuracy near complex boundaries remains low. HB-PINN improves the hard constraint formulation by incorporating PDE residuals at boundaries, which reduces Neumann boundary errors and enhances overall prediction accuracy. Among all methods, s-hPINN and s-HB-PINN achieve the highest fidelity: their predicted fields show close agreement with the CFD reference solution and reproduce fine-scale features of the complex flow structures.

Figure 19 presents a quantitative comparison of the velocity, temperature, and pressure distributions along the horizontal centerline obtained using different PINN-based methods. The reference solution is provided by a high-fidelity CFD simulation, against which all predictions are evaluated.

Except for the standard PINN, all methods capture the general trends of these quantities. The standard PINN fails to reproduce the velocity profile and yields only coarse approximations for temperature and pressure, with large deviations from the reference. Both hPINN and HB-PINN represent the overall spatial variations along the centerline but exhibit limited resolution of fine-scale features. These limitations suggest that while hard enforcement improves boundary fidelity, the treatment of Neumann-type conditions via purely hard or unbalanced soft constraints may still introduce inaccuracies in strongly coupled regions. In contrast, s-hPINN and s-HB-PINN accurately resolve both the global trends and local details. The cooperative soft–hard constraint strategy effectively balances boundary enforcement and interior PDE satisfaction. Their predictions show close agreement with the CFD reference solution, as evidenced by the small discrepancies in the figure.

To further assess the accuracy of each method, L2 errors between the predicted solutions and the finite element reference were computed for the steady state thermal convection model. Table 4 and Figure 20 summarize the training time and L2 errors for all physical quantities across the different methods. The results show that s-hPINN and s-HB-PINN achieve, with a low number of training iterations, an accuracy level that the standard PINN does not reach, even at its maximum iteration count. At comparable accuracy, the proposed methods reduce total training time by approximately 56%. Moreover, they exhibit lower wall clock time per iteration, indicating improved computational efficiency. As training progresses, the L2 errors of all methods decrease; however, s-hPINN and s-HB-PINN consistently yield the lowest errors. This reflects their enhanced capability to resolve the coupled dynamics among velocity, temperature, and pressure. The sustained reduction in L2 error demonstrates the effectiveness of these methods in modeling complex multiphysics systems.

3.4. Steady-State Thermal Convection with Data

To improve the network’s ability to fit real physical behavior and enhance prediction accuracy in complex regions, this section adds several measurement points based on Figure 15. As shown in Figure 21, four points are added at x = 6 with y = 0.25, 0.5, 0.75, and 1, recording the data (u, T and p) at each location. By introducing sparse observational data during training, the differences in performance of each method under data-assisted guidance are further compared.

Figure 22 shows the predicted distributions of velocity, temperature, and pressure obtained by different PINN methods after incorporating sparse observational points. The inclusion of these data improves prediction accuracy across all methods. In particular, hPINN and HB-PINN show noticeable gains in boundary accuracy, consistent with the quantitative centerline results in Figure 23. These observations suggest that even a limited number of measurements can help stabilize PINN training in regions with complex or rapidly varying boundaries, leading to more accurate and consistent predictions overall.

Although the inclusion of observational data improves sPINN to some extent, its predictions still deviate considerably from the finite element reference solution. In contrast, s-hPINN and s-HB-PINN exhibit more stable training behavior when augmented with sparse observational data. Both methods achieve further reductions in L2 error, and their centerline predictions for velocity, temperature, and pressure show close agreement with the finite element results. Specifically, the L2 error in velocity decreases by approximately 78% for s-hPINN and 56% for s-HB-PINN shown in Table 5. These results suggest that the collaborative soft-hard constraint strategy, which combines soft penalty terms with hard enforcement, improves the imposition of physical boundary conditions and effectively integrates limited observational information. Consequently, the approach maintains a consistent trade-off between accuracy and numerical stability, demonstrating its suitability for complex multiphysics problems.

In this study, we further investigate the impact of the spatial layout of sparse observation points on model predictive performance. Comparative experiments were conducted by placing observation points in regions with distinct flow characteristics, including areas near the inlet (

x = 0

), in close proximity to the heat source (

x = 2

), and within the downstream sections of the computational domain (

x = 6

), while keeping the vertical coordinates fixed at y = 0.25, 0.5, 0.75, and 1.0 across all configurations.

The results in Table 6 demonstrate that the proposed s-HB-PINN method exhibits exceptional robustness across all tested regions. Whether the sensors are deployed in the near-obstacle zones, where the flow field undergoes drastic changes, or in the downstream regions where the flow is fully developed, the model maintains consistent and stable predictive accuracy. This low sensitivity to observation locations indicates that the method can effectively leverage local sparse data to achieve accurate reconstruction of the global field through PDEs.

Specifically, for heat convection problems where sensor quantity or installation conditions are restricted, we recommend prioritizing the placement of observation points at downstream cross-sections where the flow field tends to be steady and far from heat sources or obstacles. These locations are not only easier to deploy in practical engineering scenarios but also provide observational values that encapsulate the integrated information of upstream physical evolution. Consequently, such a strategy can more efficiently drive high-fidelity reconstruction across the entire computational domain, achieving optimal global inference with minimal measurements.

Following the validation of the model’s spatial robustness across different cross-sections where the far-field developed region at

x = 6

demonstrated the most stable predictive performance, we further examined the framework’s resilience against observational uncertainties. In practical engineering applications, sensor data are inevitably contaminated by measurement noise, which can significantly challenge the convergence and accuracy of standard PINN architectures. To systematically evaluate this, we introduced 1% and 5% relative Gaussian white noise specifically to the sparse data at the

x = 6

location. As summarized in the corresponding error Table 7, while the predictive discrepancies of baseline methods such as sPINN and hPINN escalate sharply, with temperature errors even exceeding 1.0 under 5% noise, the proposed s-HB-PINN maintains remarkable stability. Even at the higher 5% noise level, s-HB-PINN keeps the relative L2 errors for velocity, temperature, and pressure fields within the

10^{- 2}

magnitude, showing only marginal degradation compared to the noise-free baseline.

4. Conclusions

This study presents a novel PINN approach for solving thermal multiphysics problems, evaluated on representative thermoelastic and thermal convection systems. The proposed collaborative soft-hard constraint strategies, specifically s-hPINN and s-HB-PINN, demonstrate an enhanced capability in predicting the evolution of coupled physical fields, particularly in regions governed by Neumann boundary conditions where prediction accuracy is markedly improved. Compared with the standard PINN, these methods attain higher accuracy with reduced training time, and they more effectively suppress errors arising from challenging temperature boundary conditions relative to hPINN and HB-PINN. Two additional experiments further validate the method’s effectiveness through transient simulations and sparse observational data, showing that the framework maintains stable and accurate solutions even in time-dependent scenarios or with limited measurements.

While the collaborative strategy effectively captures interdependent physical processes, its current implementation is primarily tailored for thermal multiphysics problems where only the temperature field involves Neumann boundaries while other variables typically possess Dirichlet conditions. In more generalized multiphysics systems where multiple variables must simultaneously handle Neumann boundaries, introducing independent soft constraints and auxiliary networks for each variable may significantly increase the number of model parameters and computational overhead. Furthermore, it should be noted that the current framework has not been tested on Robin boundary conditions, and the existing collaborative logic may not be directly applicable to systems governed by inequality constraints, such as contact mechanics or phase-change problems. Nevertheless, given that Robin boundary conditions inherently combine Dirichlet and Neumann components, they could in principle be accommodated within the proposed cooperative constraint framework by treating the Neumann part with a soft penalty and embedding the Dirichlet part via hard enforcement, suggesting a viable path for future extension. Although the proposed strategies demonstrate superior convergence behavior in the numerical examples presented, a formal theoretical analysis of the method’s stability or convergence in a generalized setting is still lacking. Additionally, the current study has not yet provided a systematic quantification of computational resource consumption, such as memory footprint and per-step execution time, which are critical for large-scale industrial applications. In high-dimensional or transient scenarios, the added complexity of the hybrid architecture might weaken network performance or hinder training convergence.

Given that many real-world applications involve unsteady, multiscale multiphysics phenomena with strong temporal dynamics, future work will focus on addressing these constraints. This includes exploring the robustness of the framework under Robin boundaries, developing more compact parameter-sharing architectures, and incorporating penalty methods for inequality constraints. We also aim to conduct a rigorous analysis of computational efficiency and provide a more robust mathematical foundation for the stability of collaborative constraint strategies, thereby advancing the role of PINNs in intelligent multiphysics simulation.

Author Contributions

Writing—original draft, visualization, validation, software, methodology, Y.L.; writing—review & editing, methodology, C.Z.; supervision, writing—review & editing, G.X., P.N. and H.Y. All authors have read and agreed to the published version of the manuscript.

Funding

The authors would like to acknowledge the financial support received from National Natural Science Foundation of China (12304346), Scientific Research Foundation of Education Department of Shaan’xi Province, China (21JK0945), Key Research and Development Projects of Shaanxi Province (2025CY-YBXM-073), the financial support received from Donghai Laboratory (2024SSYS0091), Open Fund of Beijing Key Laboratory of Advanced Optical Remote Sensing Technology (AORS202408).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to patent preparation.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Investigating the impact of coupling strength is crucial for evaluating the robustness of multiphysics PINNs. In response, we have conducted a systematic sensitivity analysis by varying the thermal expansion coefficient

α = 2.31 \times 10^{- 5} K^{- 1}

(

α = 0.5 α_{0}

,

α = α_{0}

, and

α = 2 α_{0}

) in the steady-state thermoelastic problem. Because thermally induced displacements in such problems are inherently small, the differences in displacement fields across these

α

values are subtle. To highlight the influence of

α

, we have provided a deviation plot (see Figure A1) showing

Δ u = u_{α} - u_{1.0}

.

Figure A1. Incremental displacement profiles relative to the baseline coupling (

α = α_{0}

).

Figure A1. Incremental displacement profiles relative to the baseline coupling (

α = α_{0}

).

Although these variations are relatively small due to the low magnitude of

α

itself, Figure A2 displays the predicted displacement distributions along the horizontal midline for different

α

values. It can be clearly observed that our proposed method remains the most consistent with the high-fidelity reference (CFD) across all cases. This demonstrates the model’s ability to sensitively capture even minor trends in deformation caused by changes in physical parameters.

Figure A2. Comparison of displacement u at the centerline (y = x) across varying

α

.

Figure A2. Comparison of displacement u at the centerline (y = x) across varying

α

.

As

α

increases, the interaction between different physical fields becomes stronger, which often leads to gradient imbalances during the training of sPINN, affecting their convergence. As shown in the updated Table A1, while the prediction error of sPINN rises significantly as the coupling strengthens (e.g., the temperature error increases from

1.88 \times 10^{- 2}

to

3.65 \times 10^{- 2}

), the error of the proposed s-HB-PINN remains nearly constant at a very low level. This proves that our strategy is highly robust and maintains stable predictions even when the physical environment becomes more complex.

Table A1. The L2 error of u, T, and σ_xx calculated using different methods at different thermal expansion coefficient.

$α$	Method	$Times (10^{3} s)$	$u$ $(10^{- 1})$	$T$ $(10^{- 1})$	$σ_{x x}$ $(10^{- 1})$
$α = 0.5 α_{0}$	sPINN	7.56	4.50	0.19	2.71
	hPINN	13.00	0.34	3.49	1.78
	HB-PINN	18.90	0.62	0.10	1.18
	s-hPINN	17.70	0.13	0.01	1.17
	s-HB-PINN	12.90	0.09	0.01	0.97
$α = α_{0}$	sPINN	8.98	2.70	0.27	2.59
	hPINN	14.00	0.13	3.17	1.18
	HB-PINN	20.90	0.62	0.11	1.18
	s-hPINN	19.80	0.13	0.01	1.19
	s-HB-PINN	13.00	0.09	0.01	0.97
$α = 2 α_{0}$	sPINN	8.99	4.40	0.37	2.72
	hPINN	15.10	0.37	5.48	1.68
	HB-PINN	20.40	0.62	0.11	1.17
	s-hPINN	19.90	0.14	0.05	1.20
	s-HB-PINN	13.20	0.08	0.05	0.96

References

Chen, Y.; Ma, W.; He, J.; Zhao, W.; Sun, M.; Ma, Y. Contrasting mechanisms of cross-regional heavy precipitation induced by an eastward-moving Tibetan Plateau vortex: Dynamical dominance versus thermodynamic maintenance. Atmos. Res. 2026, 331, 108634. [Google Scholar] [CrossRef]
Urban, P.; Králík, T.; Musilová, V.; Skrbek, L. Modulated turbulent convection: A benchmark model for large scale natural flows driven by diurnal heating. Sci. Rep. 2024, 14, 13892. [Google Scholar] [CrossRef]
Ouyang, Z.; Wang, Q.; Li, K.; Wen, B.; Ding, Z. Touching the classical scaling in penetrative convection. Proc. Natl. Acad. Sci. USA 2025, 122, e2418468122. [Google Scholar] [CrossRef]
Wei, X.; Jin, K.; Nan, P.; Zhou, F.; Xin, G.; Lim, K.-S.; Ahmad, H.; Zhang, Y.; Yang, H. Global rapid thermomechanical decoupling method based on adaptive localized method of fundamental solutions and sparse embedded FBG in thermal protection materials for aerospace vehicles. Int. J. Therm. Sci. 2025, 218, 110166. [Google Scholar] [CrossRef]
Lipiński, W.; Abbasi-Shavazi, E.; Chen, J.; Coventry, J.; Hangi, M.; Iyer, S.; Kumar, A.; Li, L.; Li, S.; Pye, J.; et al. Progress in heat transfer research for high-temperature solar thermal applications. Appl. Therm. Eng. 2021, 184, 116137. [Google Scholar] [CrossRef]
Zhao, X.; Jin, K.; Yan, M.; Nan, P.; Zhou, F.; Xin, G.; Lim, K.S.; Ahmad, H.; Zhang, Y.; Yang, H. Inverse heat transfer for real-time thermal evaluation of aircraft thermal protection structure with embedded FBG sensors. Appl. Therm. Eng. 2025, 260, 124869. [Google Scholar] [CrossRef]
Wole, A.; Lobo, M.; Ginting, K.B. The application of finite difference method on 2-D heat conductivity problem. J. Phys. Conf. Ser. 2021, 2017, 012009. [Google Scholar] [CrossRef]
Chen, H.; Liu, D. Formulation of a nonlocal discrete model for anisotropic heat conduction problems. Int. J. Therm. Sci. 2022, 182, 107816. [Google Scholar] [CrossRef]
Ye, S.; Liu, Y.; Xiao, Y.; Chen, W.; Gong, Y.; Lv, R.; Yang, Y.; Huang, W. Predicting effective thermal conductivity of unsaturated fractal porous media with rough surfaces using fractal-Monte Carlo simulations. Fractals 2025, 33, 25402315. [Google Scholar] [CrossRef]
Wu, C.-T.; Wu, S.-W.; Niu, R.-P.; Jiang, C.; Liu, G.R. The polygonal finite element method for solving heat conduction problems. Eng. Anal. Bound. Elem. 2023, 155, 935–947. [Google Scholar] [CrossRef]
Yan, M.; Yang, Y.; Su, C.; Zhang, Z.; Duan, Q.; Hao, D.; Zhou, J. A polyhedral scaled boundary finite element method solving three-dimensional heat conduction problems. Eng. Anal. Bound. Elem. 2025, 175, 106191. [Google Scholar] [CrossRef]
Yang, S.; Sheng, Q.; Zou, M.; Wang, M.; Cui, R.; Chen, S.; Zheng, Q. Fractal study on heat transfer characteristics of fractured dual porous media with rough surface. J. Porous Media 2024, 27, 55–71. [Google Scholar] [CrossRef]
Bai, J.; Zhou, Y.; Ma, Y.; Jeong, H.; Zhan, H.; Rathnayaka, C.; Sauret, E.; Gu, Y. A general neural particle method for hydrodynamics modeling. Comput. Methods Appl. Mech. Eng. 2022, 393, 114740. [Google Scholar] [CrossRef]
Schiassi, E.; Furfaro, R.; Leake, C.; De Florio, M.; Johnston, H.; Mortari, D. Extreme theory of functional connections: A fast physics-informed neural network method for solving ordinary and partial differential equations. Neurocomputing 2021, 457, 334–356. [Google Scholar] [CrossRef]
Ma, C.; Gui, H.; Liu, J. Self learning-empowered thermal error control method of precision machine tools based on digital twin. J. Intell. Manuf. 2021, 34, 695–717. [Google Scholar] [CrossRef]
Qureshi, H.; Alakhras, A.I. AI-driven multilayer modeling of Tetra-Hybrid Casson nanofluid flow with thermal radiation: Implications for solar energy and energy conversion. Results Phys. 2025, 79, 108505. [Google Scholar] [CrossRef]
Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
Go, M.-S.; Lim, J.H.; Lee, S. Physics-informed neural network-based surrogate model for a virtual thermal sensor with real-time simulation. Int. J. Heat Mass Transf. 2023, 214, 124392. [Google Scholar] [CrossRef]
Cai, S.; Wang, Z.; Wang, S.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks for heat transfer problems. J. Heat Transf. 2021, 143, 062101. [Google Scholar] [CrossRef]
Shang, Y.; Ban, H.; Liu, D. Simultaneous identification of boundary heat flux and thermal conductivity in inverse heat conduction problems using physics-informed neural networks. Therm. Sci. Eng. Prog. 2025, 65, 103905. [Google Scholar] [CrossRef]
Wei, X.; Nan, P.; Xin, G.; Zhang, Y.; Yang, H. Efficient real-time reconstruction of transient temperature fields in some hypersonic thermal protection systems with a conditioned adaptive physics network. Aerosp. Sci. Technol. 2026, 168, 111320. [Google Scholar] [CrossRef]
Kalpana, V.; Sujana, G.J.; Thyagarajan, K.; Lalitha, R.V.S.; Talasila, V.; Jadhav, M.M. Enhancing heat transfer coefficient predictions in complex geometries through hybrid machine learning approaches. Therm. Sci. Eng. Prog. 2024, 55, 103017. [Google Scholar] [CrossRef]
Wang, S.; Teng, Y.; Perdikaris, P. Understanding and mitigating gradient flow pathologies in physics-informed neural networks. SIAM J. Sci. Comput. 2021, 43, A3055–A3081. [Google Scholar] [CrossRef]
McClenny, L.D.; Braga-Neto, U.M. Self-adaptive physics-informed neural networks. J. Comput. Phys. 2023, 474, 111722. [Google Scholar] [CrossRef]
Lu, L.; Pestourie, R.; Yao, W.; Wang, Z.; Verdugo, F.; Johnson, S.G. Physics-informed neural networks with hard constraints for inverse design. SIAM J. Sci. Comput. 2021, 43, B1105–B1132. [Google Scholar] [CrossRef]
Zhou, C.; Li, T.; Lan, C.; Du, R.; Xin, G.; Li, W.; Wang, G.; Liu, X.; Yang, H. Hybrid Boundary Physics-Informed Neural Networks for Solving Navier-Stokes Equations with Complex Boundary. In Proceedings of the Thirty-Ninth Annual Conference on Neural Information Processing Systems, San Diego, CA, USA, 2–7 December 2025. [Google Scholar]
Jagtap, A.D.; Karniadakis, G.E. Extended physics-informed neural networks (XPINNs): A generalized space-time domain decomposition based deep learning framework for nonlinear partial differential equations. Commun. Comput. Phys. 2020, 28, 2002–2041. [Google Scholar] [CrossRef]
Ma, Y.; Xu, X.; Yan, S.; Ren, Z. A Preliminary Study on the Resolution of Electro-Thermal Multi-Physics Coupling Problem Using Physics-Informed Neural Network (PINN). Algorithms 2022, 15, 53. [Google Scholar] [CrossRef]
Wu, Z.; Zhang, B.; Yu, H.; Ren, J.; Pan, M.; He, C.; Chen, Q. Accelerating heat exchanger design by combining physics-informed deep learning and transfer learning. Chem. Eng. Sci. 2023, 282, 119285. [Google Scholar] [CrossRef]
Abueidda, D.W.; Mobasher, M.E. I-FENN for thermoelasticity based on physics-informed temporal convolutional network (PI-TCN). Comput. Mech. 2024, 74, 1229–1259. [Google Scholar] [CrossRef]
Zhu, Y.; Zabaras, N.; Koutsourelakis, P.-S.; Perdikaris, P. Physics-constrained deep learning for high-dimensional surrogate modeling and uncertainty quantification without labeled data. J. Comput. Phys. 2019, 394, 56–81. [Google Scholar] [CrossRef]
Wang, S.; Yu, X.; Perdikaris, P. When and why PINNs fail to train: A neural tangent kernel perspective. J. Comput. Phys. 2022, 449, 110768. [Google Scholar] [CrossRef]
Fuks, O.; Tchelepi, H.A. Limitations of physics informed machine learning for nonlinear two-phase transport in porous media. J. Mach. Learn. Model. Comput. 2020, 1, 19–37. [Google Scholar] [CrossRef]
Raissi, M.; Karniadakis, G.E. Hidden physics models: Machine learning of nonlinear partial differential equations. J. Comput. Phys. 2018, 357, 125–141. [Google Scholar] [CrossRef]
Eshkofti, K.; Hosseini, S.M. A new modified deep learning technique based on physics-informed neural networks (PINNs) for the shock-induced coupled thermoelasticity analysis in a porous material. J. Therm. Stress. 2024, 47, 798–825. [Google Scholar] [CrossRef]
Rao, C.; Sun, H.; Liu, Y. Physics-informed deep learning for computational elastodynamics without labeled data. J. Eng. Mech. 2021, 147, 04021043. [Google Scholar] [CrossRef]
Farhani, G.; Dashtbayaz, N.H.; Kazachek, A.; Wang, B. A simple remedy for failure modes in physics-informed neural networks. Neural Netw. 2025, 183, 106963. [Google Scholar] [CrossRef]
Wang, Z.; Karniadakis, G.E.; Chalfant, J.; Chryssostomidis, C.; Babaee, H. High-fidelity modeling and optimization of conjugate heat transfer in arrays of heated cables. In Proceedings of the 2017 IEEE Electric Ship Technologies Symposium (ESTS), Arlington, VA, USA, 14–17 August 2017; pp. 557–563. [Google Scholar] [CrossRef]
Markidis, S. The old and the new: Can physics-informed deep-learning replace traditional linear solvers? Front. Big Data 2021, 4, 669097. [Google Scholar] [CrossRef]
Lu, L.; Meng, X.; Mao, Z.; Karniadakis, G.E. DeepXDE: A deep learning library for solving differential equations. SIAM Rev. 2021, 63, 208–228. [Google Scholar] [CrossRef]
Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. In Proceedings of the 3rd International Conference on Learning Representations (ICLR 2015), San Diego, CA, USA, 7–9 May 2015; Available online: https://arxiv.org/abs/1412.6980 (accessed on 10 February 2026).
Carlson, D.E. Linear thermoelasticity. In Linear Theories of Elasticity and Thermoelasticity; Springer: Berlin/Heidelberg, Germany, 1973; pp. 297–345. [Google Scholar] [CrossRef]
Verhás, J. Thermoelasticity and entropy flow. Proc. Est. Acad. Sci. 2008, 57, 142–144. [Google Scholar] [CrossRef]
Chang, C.H. Mechanics of Elastic Structures with Inclined Members: Analysis of Vibration, Buckling and Bending of X-Braced Frames and Conical Shells; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar] [CrossRef]
Greenkorn, R. (Ed.) Heat transfer models. In Momentum, Heat, and Mass Transfer Fundamentals; CRC Press: Boca Raton, FL, USA, 2018; pp. 549–874. [Google Scholar] [CrossRef]
Bejan, A.; Kraus, A.D. (Eds.) Heat Transfer Handbook; John Wiley & Sons: Hoboken, NJ, USA, 2003. [Google Scholar]

Figure 1. Schematic diagram of a standard neural network.

Figure 2. Diagram of a coordinating soft and hard constraints neural network architecture.

Figure 3. Guideline for assigning hard and soft constraints in PINN.

Figure 4. Schematic diagram of the solution procedures for thermoelastic coupling and thermal convection problems using different methods.

Figure 5. Schematic representation of the geometry and boundary conditions of a 2D plate with a hole issue.

Figure 6. Evolution of PDE loss, Neumann BC loss, and total boundary condition losses during the solution of the thermo-mechanical coupling process.

Figure 7. Residual plots of equations

\nabla \cdot σ = 0

using different methods.

Figure 7. Residual plots of equations

\nabla \cdot σ = 0

using different methods.

Figure 8. Comparison of different methods at epochs = 200,000 (a) u (b) T (c) σ_xx.

Figure 9. Comparison of errors between different PINN methods and finite element methods at epoch = 200,000: (a) u (b) T (c) σ_xx.

Figure 10. Quantitative analysis of the stress field distribution along the centerline (y = x) using different computational methods.

Figure 11. Comparison of L2 error calculations for u, T and σ_xx using different methods.

Figure 12. Comparison of x-direction displacement u at t = 0.5 s using different methods.

Figure 13. Comparison of temperature T at t = 0.5 s using different methods.

Figure 14. L2 norm of temperature errors computed by different methods.

Figure 15. Forced convection in an enclosed space.

Figure 16. Evolution of PDE loss, Neumann BC loss, and all boundary condition losses during the solution of the thermal convection process.

Figure 17. Comparison of different methods at epochs = 200,000. (a) u (b) T(c) p.

Figure 18. Comparison of error calculations for (a) u, (b) T and (c) p using different methods.

Figure 19. Quantitative analysis of the distribution of each physical quantity along the centerline using different methods.

Figure 20. Comparison of L2 error calculations for u, T and p using different methods.

Figure 21. Enhancement of forced convection within a confined space by increasing selected measurement points.

Figure 22. Comparison of different methods at epochs = 200,000. (a) u results (b) T results (c) p results.

Figure 23. Quantitative analysis of the distribution of each physical quantity along the centerline using different methods after adding observational data.

Table 1. The architecture of the neural network (Depth × Width).

Method	$N_{P}$	$N_{D}$	$N_{H}$	$N_{S}$
sPINN	/			6 × 128
hPINN	4 × 20	4 × 64	6 × 128	/
HB-PINN	4 × 20	4 × 64	6 × 128	/
s-hPINN	4 × 20	4 × 64	6 × 128	6 × 128
s-HB-PINN	4 × 20	4 × 64	6 × 128	6 × 128

Table 2. Parameters of typical metallic materials.

Parameter	Unit	Value
Temperature $Θ = T - T_{0}$	K	$T_{0} = 273.15$
Young’s modulus $E$	MPa	20
Poisson’s ratio $μ$	-	0.25
Thermal expansion coefficient $α$	$m /^{\circ} C$	$2.31 \times 10^{- 5}$
Thermal conductivity $k$	$W /^{\circ} C$	$1$

Table 3. The L2 error of u, T and σ_xx is calculated using different methods.

Epochs	Method	$Times (10^{3} s)$	$u$ $(10^{- 1})$	$T$ $(10^{- 1})$	$σ_{x x}$ $(10^{- 1})$
50,000	sPINN	2.18	1.41	0.69	2.76
	hPINN	3.86	0.22	3.37	1.46
	HB-PINN	3.88	0.62	0.10	1.29
	s-hPINN	3.83	0.22	0.02	1.47
	s-HB-PINN	3.90	0.13	0.02	1.14
100,000	sPINN	4.86	1.73	0.49	2.58
	hPINN	7.63	0.16	3.18	1.30
	HB-PINN	9.94	0.62	0.11	1.20
	s-hPINN	7.76	0.17	0.01	1.30
	s-HB-PINN	10.40	0.10	0.01	1.02
200,000	sPINN	8.98	2.77	0.27	2.59
	hPINN	14.00	0.13	3.17	1.18
	HB-PINN	20.90	0.63	0.11	1.18
	s-hPINN	19.80	0.13	0.01	1.19
	s-HB-PINN	13.00	0.09	0.01	0.97

Table 4. The L2 error of u, T and p is calculated using different methods.

Epochs	Method	$Times (10^{3} s)$	$u$ $(10^{- 1})$	$T$ $(10^{- 1})$	$p$ $(10^{- 1})$
50,000	sPINN	1.96	9.35	21.00	12.10
	hPINN	4.23	2.39	14.00	2.92
	HB-PINN	4.19	1.40	6.01	2.15
	s-hPINN	3.89	0.70	1.21	1.08
	s-HB-PINN	3.90	0.25	0.85	0.43
100,000	sPINN	4.09	9.30	20.10	12.10
	hPINN	6.87	2.48	13.40	6.47
	HB-PINN	8.86	1.20	5.79	1.44
	s-hPINN	8.08	0.32	0.55	0.51
	s-HB-PINN	8.06	0.14	0.29	0.15
200,000	sPINN	9.00	9.29	19.50	12.40
	hPINN	17.90	3.25	10.90	2.98
	HB-PINN	17.80	1.12	5.80	1.29
	s-hPINN	16.50	0.15	0.29	0.17
	s-HB-PINN	16.00	0.14	0.28	0.15

Table 5. The L2 error of u, T, and p calculated using different methods assisted by sparse observational data.

Epochs	Method	$Times (10^{3} s)$	$u$ $(10^{- 1})$	$T$ $(10^{- 1})$	$p$ $(10^{- 1})$
50,000	sPINN	$2.33$	$7.73$	$1.38$	$10.90$
	hPINN	$4.32$	$1.61$	$9.44$	$4.88$
	HB-PINN	$4.39$	$0.37$	$5.58$	$1.23$
	s-hPINN	$3.83$	$0.15$	$0.56$	$0.37$
	s-HB-PINN	$3.94$	0.11	0.38	0.17
100,000	sPINN	$4.50$	$7.59$	$1.36$	$10.80$
	hPINN	$8.89$	$1.06$	$8.19$	$3.56$
	HB-PINN	$9.03$	$0.40$	$5.63$	$1.31$
	s-hPINN	$8.08$	$0.12$	$0.36$	$0.15$
	s-HB-PINN	$7.98$	0.11	0.30	0.14
200,000	sPINN	$9.93$	$7.52$	$13.40$	$11.00$
	hPINN	$16.60$	$0.99$	$7.48$	$3.37$
	HB-PINN	$17.80$	$0.31$	$5.59$	$0.79$
	s-hPINN	$16.40$	$0.12$	$0.28$	$0.14$
	s-HB-PINN	$16.20$	$0.12$	$0.25$	$0.14$

Table 6. The L2 error of u, T, and p calculated using different methods at different sensor locations.

Position	Method	$Times (10^{3} s)$	$u$ $(10^{- 1})$	$T$ $(10^{- 1})$	$p$ $(10^{- 1})$
$x = 0$	sPINN	$9.92$	$8.26$	$9.46$	$19.50$
	hPINN	$16.40$	$1.00$	$7.23$	$3.41$
	HB-PINN	$17.80$	$1.00$	$5.61$	$1.02$
	s-hPINN	$16.60$	$0.13$	$0.29$	$0.15$
	s-HB-PINN	$16.30$	0.13	0.29	0.16
$x = 2$	sPINN	$9.91$	$7.84$	$8.06$	$13.58$
	hPINN	$16.50$	$1.71$	$8.84$	$2.83$
	HB-PINN	$17.80$	$0.89$	$5.67$	$0.72$
	s-hPINN	$16.50$	$0.13$	$0.30$	$0.17$
	s-HB-PINN	$16.10$	0.13	0.29	0.17
$x = 6$	sPINN	$9.93$	$7.52$	$13.40$	$11.00$
	hPINN	$16.60$	$0.99$	$7.48$	$3.37$
	HB-PINN	$17.80$	$0.31$	$5.59$	$0.79$
	s-hPINN	$16.40$	$0.12$	$0.28$	$0.14$
	s-HB-PINN	$16.20$	$0.12$	$0.25$	$0.14$

Table 7. The L2 error of u, T, and p calculated using different methods under different levels of relative Gaussian white noise.

Noise(%)	Method	$Times (10^{3} s)$	$u$ $(10^{- 1})$	$T$ $(10^{- 1})$	$p$ $(10^{- 1})$
0	sPINN	$9.93$	$7.52$	$13.40$	$11.00$
	hPINN	$16.60$	$0.99$	$7.48$	$3.37$
	HB-PINN	$17.80$	$0.31$	$5.59$	$0.79$
	s-hPINN	$16.40$	$0.12$	$0.28$	$0.14$
	s-HB-PINN	$16.20$	0.12	0.25	0.14
1	sPINN	$9.94$	$7.73$	$11.70$	$10.90$
	hPINN	$18.60$	$1.02$	$7.50$	$3.53$
	HB-PINN	$18.30$	$0.31$	$5.59$	$0.83$
	s-hPINN	$16.50$	$0.12$	$0.25$	$0.15$
	s-HB-PINN	$16.70$	0.12	0.25	0.14
5	sPINN	$9.99$	$7.94$	$11.90$	$11.40$
	hPINN	$22.10$	$1.11$	$8.01$	$3.67$
	HB-PINN	$22.10$	$0.37$	$5.63$	$1.10$
	s-hPINN	$16.90$	$0.12$	$0.26$	$0.15$
	s-HB-PINN	$17.00$	$0.12$	$0.26$	$0.14$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, Y.; Zhou, C.; Xin, G.; Nan, P.; Yang, H. A Cooperative Soft-Hard PINN Framework for Decoupling the Thermoelasticity and Thermal Convection Multiphysics. Appl. Sci. 2026, 16, 1885. https://doi.org/10.3390/app16041885

AMA Style

Liu Y, Zhou C, Xin G, Nan P, Yang H. A Cooperative Soft-Hard PINN Framework for Decoupling the Thermoelasticity and Thermal Convection Multiphysics. Applied Sciences. 2026; 16(4):1885. https://doi.org/10.3390/app16041885

Chicago/Turabian Style

Liu, Yuxin, Chuyu Zhou, Guoguo Xin, Pengyu Nan, and Hangzhou Yang. 2026. "A Cooperative Soft-Hard PINN Framework for Decoupling the Thermoelasticity and Thermal Convection Multiphysics" Applied Sciences 16, no. 4: 1885. https://doi.org/10.3390/app16041885

APA Style

Liu, Y., Zhou, C., Xin, G., Nan, P., & Yang, H. (2026). A Cooperative Soft-Hard PINN Framework for Decoupling the Thermoelasticity and Thermal Convection Multiphysics. Applied Sciences, 16(4), 1885. https://doi.org/10.3390/app16041885

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Cooperative Soft-Hard PINN Framework for Decoupling the Thermoelasticity and Thermal Convection Multiphysics

Abstract

1. Introduction

2. Methods

2.1. Physics-Informed Neural Networks

2.2. Methods for Collaborative Soft-Hard Constraints

3. Results and Discussion

3.1. Steady-State Thermoelastic Problem

3.2. Transient Thermoelastic Coupling

3.3. Steady-State Thermal Convection

3.4. Steady-State Thermal Convection with Data

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI