Article

Dual Adaptive Neural Network for Solving Free-Flow Coupled Porous Media Models Under Unique Continuation Problem

Kunhao Liu 1 and Jibing Wu 2,*

1 College of Mathematics and Data Science, Shaanxi University of Science and Technology, Xi’an 710021, China
2 Laboratory for Big Data and Decision, College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
* Author to whom correspondence should be addressed.
Computation 2025, 13(10), 228; https://doi.org/10.3390/computation13100228
Submission received: 3 September 2025 / Revised: 15 September 2025 / Accepted: 16 September 2025 / Published: 1 October 2025
(This article belongs to the Section Computational Engineering)

Abstract

The core challenge of the unique continuation (UC) problem lies in inferring the solution across an entire domain from limited observational data, which holds significant practical implications for multiphysics coupled models. Recently, physics-informed neural networks (PINNs) have shown considerable promise in addressing the UC problem. However, the reliance on a fixed activation function and a fixed weighted loss function prevents PINNs from adequately representing the multiphysics characteristics embedded in coupled models. To overcome these limitations, we propose a novel dual adaptive neural network (DANN) algorithm. This approach integrates trainable adaptive activation functions and an adaptively weighted loss scheme, enabling the network to dynamically balance the observational data and the governing physics. Our method is applicable not only to the UC problem but also to general forward problems governed by partial differential equations. Furthermore, we provide a theoretical foundation for the algorithm by deriving a generalization error estimate, which explains why neural networks are able to solve this problem. Extensive numerical experiments, including 3D cases, demonstrate the superior accuracy and effectiveness of the proposed DANN framework in solving the UC problem compared to standard PINNs.

1. Introduction

Free-flow coupled porous media models represent an important class of multiphysics models. They are widely employed to describe shale oil recovery, reservoir development and groundwater flow [1,2]. Free-flow coupled porous media models can exhibit fractional characteristics [3] and are described by the Navier–Stokes equations coupled with porous media flow equations. For the modeling and numerical computation of such models, the finite element method (FEM) has become the main traditional algorithm [4,5,6,7]. However, the porous media domain often exhibits complex physical behaviors, such as those found in dual-porosity [8] or triple-porosity [9] models, which present significant challenges for traditional numerical approaches.
With the rapid advancement of scientific computing and data-driven modeling, neural network methods have emerged as powerful tools for tackling complex steady and unsteady physical problems, in particular for solving partial differential equations (PDEs) [10,11,12,13,14,15,16,17]. In practice, the limited availability of observational data significantly intensifies the challenge of solving PDE models with neural networks. This challenge is referred to as the unique continuation (UC) problem, which aims to infer the distribution of physical information across the computational domain from observational data [18]. Furthermore, addressing the UC problem can enhance solution accuracy and extend the applicability of the solution.
Solving the UC problem requires integrating observational data with physical laws to reconstruct the model's information from limited data. Consequently, physics-informed neural networks (PINNs) have emerged as an effective approach to the UC problem, since they naturally combine observational data with physical principles [19]. Recently, Mishra et al. [20] derived a generalization error estimate for neural network solutions, establishing a theoretical basis for applying PINNs to the UC problem. Mai et al. [21] reconstructed the entire temperature field of an engine using limited observational data. However, the conventional PINN method faces two major limitations when applied to the UC problem, especially in free-flow coupled porous media models:
  • Multiphysics coupled models contain multiple quantities to be solved. A fixed activation function may prevent the neural network from accurately capturing the variations of the different physical quantities, and may even lead to issues such as vanishing gradients.
  • In the UC problem, observational data near the domain boundary is usually unavailable. This causes a loss function with fixed, uniform weights to develop bias during training, ultimately preventing it from being trained to optimality.
To overcome the limitations of conventional PINNs and improve the solution of the UC problem, it is crucial to develop a unified framework applicable to free-flow coupled porous media models. Therefore, this study proposes the dual adaptive neural network (DANN) algorithm to solve steady free-flow coupled porous media models, focusing on a class of practically significant UC problems. The algorithm enhances the nonlinear representation capability of the network and better approximates various PDE models. The main contributions of this work are summarized as follows:
  • An adaptive Swish activation function is introduced to dynamically modulate its nonlinearity, thereby enhancing the capability of neural networks to represent physical information.
  • An adaptively weighted loss function is proposed for the UC problem, which dynamically balances observational data and physical constraints to improve training convergence and solution accuracy.
This paper conducts 2D/3D and cavity flow numerical experiments to validate the performance of the DANN algorithm on the steady dual-porosity–Navier–Stokes model and the steady triple-porosity–Navier–Stokes model. We also provide a theoretical analysis of the DANN algorithm. This work verifies the effectiveness of the DANN algorithm on steady free-flow coupled porous media models.
This paper is structured as follows: Section 2 formulates the free-flow coupled porous media models under the UC problem. The design principles of the DANN algorithm are introduced in Section 3. The theoretical analysis in Section 4 provides the feasibility basis for the proposed approach. Section 5 describes the numerical experiments. Section 6 concludes with a summary and future research directions.

2. Unique Continuation Problem

In this section, the main focus is on modeling the steady free-flow coupled porous media model under the UC problem. The computational domain $D$ consists of the free-flow domain $D_s$, the porous media domain $D_d$, the interfaces $\Gamma_i\ (i = 1, 2, 3)$ and the smooth boundaries $\partial D_s$, $\partial D_d$. The known observational data $g$ is provided on the observational domains $D'_s \subset D_s$ and $D'_d \subset D_d$, which also have smooth boundaries $\partial D'_s$, $\partial D'_d$. The unit normal vectors are denoted by $n_s$ and $n_d$.
In fact, the PDEs are solved in $D_s \setminus D'_s$ and $D_d \setminus D'_d$ with boundaries $\partial D_s$, $\partial D_d$, $\partial D'_s$, $\partial D'_d$ and interfaces $\Gamma_i\ (i = 2, 3)$, as shown in Figure 1. This means that the UC problem amounts to solving PDEs based on the known observational data $g$. Therefore, the UC problem is also called the data assimilation inverse problem [20].
In the UC problem, the fluid in the free-flow domain is described by the Navier–Stokes equations as follows:

$$-\nu \Delta u_s + (u_s \cdot \nabla) u_s + \nabla p_s = f_s \quad \text{in } D_s \setminus D'_s,$$
$$\nabla \cdot u_s = 0 \quad \text{in } D_s \setminus D'_s,$$
$$u_s = g \quad \text{on } D'_s.$$

The viscosity of the fluid is represented by $\nu$, while the external force function is denoted by $f_s$. Notably, the observational data $g$ satisfies the PDEs in $D'_s$. In the porous media domain, the model is generally classified into different types according to the fracture structure, cf. Figure 1. In this paper, we mainly consider the following two types of classical free-flow coupled porous media models.

2.1. The Steady Dual-Porosity–Navier–Stokes Model

The dual-porosity–Navier–Stokes model is commonly used to describe shale oil [5]. It assumes that the porous media domain contains only major fractures and matrix. Equations (1)–(3) are used in the free-flow domain, while the following PDEs govern the physics in the porous media domain:

$$-\nabla \cdot \left( \frac{k_m}{\mu} \nabla \varphi_m \right) = -\frac{\sigma k_m}{\mu} \left( \varphi_m - \varphi_F \right) \quad \text{in } D_d \setminus D'_d,$$
$$\varphi_m = g \quad \text{on } D'_d,$$
$$-\nabla \cdot \left( \frac{k_F}{\mu} \nabla \varphi_F \right) = \frac{\sigma k_m}{\mu} \left( \varphi_m - \varphi_F \right) + f_d \quad \text{in } D_d \setminus D'_d,$$
$$\varphi_F = g \quad \text{on } D'_d,$$

where $\mu$ denotes the dynamic viscosity and $\sigma$ is the shape factor associated with the rock matrix and major fracture system. The parameters $k_m$ and $k_F$ represent the permeabilities of the matrix and the major fractures, respectively. The source term is denoted by $f_d$. The observational data $g$ satisfies the dual-porosity–Navier–Stokes model in $D'_d$.
At the interfaces $\Gamma_i\ (i = 2, 3)$, the following conditions are derived from the fundamental properties of this model:

$$-\frac{k_m}{\mu} \nabla \varphi_m \cdot n_d = 0 \quad \text{on } \Gamma_i\ (i = 2, 3),$$
$$u_s \cdot n_s = \frac{k_F}{\mu} \nabla \varphi_F \cdot n_d \quad \text{on } \Gamma_i\ (i = 2, 3),$$
$$p_s - \nu\, n_s \cdot \nabla u_s \cdot n_s = \frac{\varphi_F}{\rho} \quad \text{on } \Gamma_i\ (i = 2, 3),$$
$$-\nu\, \tau_i \cdot \nabla u_s \cdot n_s = \frac{\alpha \nu \sqrt{d}}{\sqrt{\mathrm{tr}(\Pi)}}\, u_s \cdot \tau_i, \quad 1 \le i \le d - 1, \quad \text{on } \Gamma_i\ (i = 2, 3).$$

Here $\rho$ represents the fluid density, $\alpha$ is a constant parameter determined by the properties of the porous medium and $d$ denotes the spatial dimension. The unit tangential vectors are denoted by $\tau_i\ (i = 1, \ldots, d - 1)$. The intrinsic permeability of the fracture medium is expressed as $\Pi = k_F I$, where $I$ denotes the unit tensor.

2.2. The Steady Triple-Porosity–Navier–Stokes Model

The hydraulic fracturing system is characterized by the triple-porosity–Navier–Stokes model [9], which describes the behavior of porous media more realistically. For the UC problem, the model is posed in $D_d \setminus D'_d$ as follows:

$$-\nabla \cdot \left( \frac{k_F}{\mu} \nabla \varphi_F \right) + \frac{\sigma^* k_F}{\mu} \left( \varphi_F - \varphi_f \right) = q_F \quad \text{in } D_d \setminus D'_d,$$
$$\varphi_F = g \quad \text{on } D'_d,$$
$$-\nabla \cdot \left( \frac{k_f}{\mu} \nabla \varphi_f \right) + \frac{\sigma^* k_f}{\mu} \left( \varphi_f - \varphi_F \right) + \frac{\sigma k_m}{\mu} \left( \varphi_f - \varphi_m \right) = q_f \quad \text{in } D_d \setminus D'_d,$$
$$\varphi_f = g \quad \text{on } D'_d,$$
$$-\nabla \cdot \left( \frac{k_m}{\mu} \nabla \varphi_m \right) + \frac{\sigma k_m}{\mu} \left( \varphi_m - \varphi_f \right) = q_m \quad \text{in } D_d \setminus D'_d,$$
$$\varphi_m = g \quad \text{on } D'_d,$$

where $\varphi_F$, $\varphi_f$ and $\varphi_m$ represent the pressures in the major fractures, micro-fractures and matrix, respectively. The shape factors of the major fractures and micro-fractures are denoted by $\sigma^*$ and $\sigma$, and $q_F$, $q_f$ and $q_m$ are source terms. The PDEs (1)–(3) remain applicable in the triple-porosity–Navier–Stokes model.
On the interfaces $\Gamma_i\ (i = 2, 3)$, it is necessary to add a no-fluid-exchange condition (18) between the micro-fractures and the free flow. The remaining conditions are the same as for the dual-porosity–Navier–Stokes model:

$$-\frac{k_f}{\mu} \nabla \varphi_f \cdot n_d = 0 \quad \text{on } \Gamma_i\ (i = 2, 3),$$
$$-\frac{k_m}{\mu} \nabla \varphi_m \cdot n_d = 0 \quad \text{on } \Gamma_i\ (i = 2, 3),$$
$$u_s \cdot n_s = \frac{k_F}{\mu} \nabla \varphi_F \cdot n_d \quad \text{on } \Gamma_i\ (i = 2, 3),$$
$$p_s - \nu\, n_s \cdot \nabla u_s \cdot n_s = \frac{\varphi_F}{\rho} \quad \text{on } \Gamma_i\ (i = 2, 3),$$
$$-\nu\, \tau_i \cdot \nabla u_s \cdot n_s = \frac{\alpha \nu \sqrt{d}}{\sqrt{\mathrm{tr}(\Pi)}}\, u_s \cdot \tau_i, \quad 1 \le i \le d - 1, \quad \text{on } \Gamma_i\ (i = 2, 3).$$
It is not hard to see that the boundary conditions on $\partial D_s$ and $\partial D_d$ are completely unknown, which makes this an ill-posed inverse problem. Theoretically, the origins of the UC problem can be traced back to the ill-posedness of the elliptic Cauchy problem [22,23]. Over the years, numerous methods have been developed to address this inverse problem; among them, the quasi-reversibility method [24] and the penalty method [25] are two classical numerical approaches. Fundamentally, the purpose of the UC problem is to extend the solution from the observational domain to the whole computational domain. In this paper, we propose to solve PDEs under this problem using the DANN algorithm.

3. Dual Adaptive Neural Network Algorithm

In the DANN algorithm, we employ a feed-forward neural network with input $x \in D$ and depth $K$. The network consists of one input layer, $K - 1$ hidden layers and one output layer. The $k$th hidden layer contains $N_k$ neurons; it receives the output $x^{k-1} \in \mathbb{R}^{N_{k-1}}$ of the previous layer and applies an affine linear transformation $L_k$ of the form

$$L_k(x^{k-1}) := W^k x^{k-1} + b^k, \quad W^k \in \mathbb{R}^{N_k \times N_{k-1}},\ b^k \in \mathbb{R}^{N_k},$$

where $W^k$ and $b^k$ denote the weights and biases of the neural network, respectively. The activation function $\sigma$ introduces nonlinearity, enabling the neural network to better capture the complex behaviors inherent in PDEs. The resulting neural network can be represented as

$$u_\theta(x) = L_K \circ \sigma \circ L_{K-1} \circ \sigma \circ \cdots \circ \sigma \circ L_1(x).$$

Here, $\circ$ denotes function composition. The trainable parameters of the neural network are collected in $\theta = \{W^k, b^k\}_{k=1}^{K}$, and the network output $u_\theta$ is determined by $\theta$.
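To make the construction concrete, the following is a minimal sketch of the feed-forward map $u_\theta(x) = L_K \circ \sigma \circ \cdots \circ \sigma \circ L_1(x)$. It is our illustration in plain NumPy rather than the authors' TensorFlow 1.14 code; the layer sizes and Xavier initialization are assumptions.

```python
# Minimal sketch of u_theta(x) = L_K ∘ σ ∘ L_{K-1} ∘ ... ∘ σ ∘ L_1(x).
# NumPy stand-in for illustration only; sizes and Xavier init are our assumptions.
import numpy as np

def init_params(layer_sizes, seed=0):
    """Create (W^k, b^k) for the affine maps L_k, k = 1..K."""
    rng = np.random.default_rng(seed)
    params = []
    for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:]):
        W = rng.normal(0.0, np.sqrt(2.0 / (n_in + n_out)), size=(n_out, n_in))
        params.append((W, np.zeros(n_out)))
    return params

def swish(x, beta=1.0):
    """Swish activation σ(x) = x · Sigmoid(βx) (standard form, β = 1)."""
    return x / (1.0 + np.exp(-beta * x))

def forward(params, x):
    """Compose L_1, σ, ..., σ, L_K; the output layer L_K stays affine."""
    for W, b in params[:-1]:
        x = swish(W @ x + b)
    W, b = params[-1]
    return W @ x + b

# 2D input (x, y) -> 4 outputs (u_s components, p_s, φ), 3 hidden layers of 32 neurons.
params = init_params([2, 32, 32, 32, 4])
print(forward(params, np.array([0.5, 1.5])))
```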
In the basic framework of deep learning [26], a neural network solves PDEs by finding the optimal $\theta$ through minimizing a loss function. However, the loss function in the free-flow coupled porous media model contains many terms, which can cause the loss components to become unbalanced during convergence. Especially in the UC problem, incomplete data information exacerbates this issue.

3.1. Adaptively Weighted Loss Function

As noted in the literature [27,28], the above problem arises from the tendency of gradient descent to prioritize loss components associated with larger weights while neglecting those with smaller weights. Such an imbalance ultimately compromises both the solution accuracy and the numerical stability of the neural network. To address this problem and ensure adequate convergence of the loss function, we propose a loss function with an adaptively weighted method as follows:

$$J(x; \theta, \lambda_s, \lambda_{bs}, \lambda_d, \lambda_{bd}, \lambda_i) = J_{\Omega_s}(x; \theta, \lambda_s) + J_{\partial \Omega_s}(x; \theta, \lambda_{bs}) + J_{\Omega_d}(x; \theta, \lambda_d) + J_{\partial \Omega_d}(x; \theta, \lambda_{bd}) + J_{\Gamma}(x; \theta, \lambda_i),$$

where the weighted parameters are given by $\lambda_p = (\lambda_p^1, \ldots, \lambda_p^{N_p})$ $(p = s, bs, d, bd, i)$. Here, $N_p$ $(p = s, d, bs, bd, i)$ denotes the number of training points in each respective domain. The weighted parameters $\lambda_p$ are learnable and constrained to be non-negative, which ensures the stability of gradient computations during neural network training. Taking the dual-porosity–Navier–Stokes model as an example, the loss function is written explicitly as
$$J_{\Omega_s}(x; \theta, \lambda_s) = \frac{1}{N_s} \sum_{i=1}^{N_s} \lambda_s^i \Big[ \big| -\nu \Delta u_s(x_i, y_i; \theta) + \big( u_s(x_i, y_i; \theta) \cdot \nabla \big) u_s(x_i, y_i; \theta) + \nabla p_s(x_i, y_i; \theta) - f_s(x_i, y_i) \big|^2 + \big| \nabla \cdot u_s(x_i, y_i; \theta) \big|^2 \Big],$$

$$J_{\Omega_d}(x; \theta, \lambda_d) = \frac{1}{N_d} \sum_{i=1}^{N_d} \lambda_d^i \Big[ \big| -\nabla \cdot \big( \tfrac{k_m}{\mu} \nabla \varphi_m \big) + \tfrac{\sigma k_m}{\mu} ( \varphi_m - \varphi_F ) \big|^2 + \big| -\nabla \cdot \big( \tfrac{k_F}{\mu} \nabla \varphi_F \big) - \tfrac{\sigma k_m}{\mu} ( \varphi_m - \varphi_F ) - f_d \big|^2 \Big],$$

$$J_{\Gamma}(x; \theta, \lambda_i) = \frac{1}{N_i} \sum_{i=1}^{N_i} \lambda_i^i \Big[ \big| \tfrac{k_m}{\mu} \nabla \varphi_m \cdot n_d \big|^2 + \big| u_s \cdot n_s - \tfrac{k_F}{\mu} \nabla \varphi_F \cdot n_d \big|^2 + \big| p_s - \nu\, n_s \cdot \nabla u_s \cdot n_s - \tfrac{\varphi_F}{\rho} \big|^2 + \big| \nu\, \tau_i \cdot \nabla u_s \cdot n_s + \tfrac{\alpha \nu \sqrt{d}}{\sqrt{\mathrm{tr}(\Pi)}}\, u_s \cdot \tau_i \big|^2 \Big],$$

$$J_{\partial \Omega_s}(x; \theta, \lambda_{bs}) = \frac{1}{N_{bs}} \sum_{i=1}^{N_{bs}} \lambda_{bs}^i \big| u_s(x_i, y_i; \theta) - g(x_i, y_i) \big|^2,$$

$$J_{\partial \Omega_d}(x; \theta, \lambda_{bd}) = \frac{1}{N_{bd}} \sum_{i=1}^{N_{bd}} \lambda_{bd}^i \Big[ \big| \varphi_m(x_i, y_i; \theta) - g(x_i, y_i) \big|^2 + \big| \varphi_F(x_i, y_i; \theta) - g(x_i, y_i) \big|^2 \Big],$$

where the residuals in $J_{\Omega_d}$ and $J_{\Gamma}$ are likewise evaluated at the training points $(x_i, y_i)$. Here, $\lambda_p^i$ $(p = s, d, bs, bd, i)$ is the weight assigned to each point by the weighted parameter $\lambda_p$. In Section 5, we visualize the values of $\lambda_p$ at the end of pre-training.
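As a concrete illustration of one such term, the sketch below shows per-point trainable weights $\lambda^i$ applied to squared residuals. It uses PyTorch purely for illustration (the authors' implementation is TensorFlow 1.14), and the softplus reparameterization is our assumed way of keeping the weights non-negative.

```python
# Hedged sketch of one adaptively weighted loss term J(x; θ, λ): every training
# point i carries its own trainable, non-negative weight λ^i.
import torch

N_s = 100                                        # number of interior sample points
# softplus(0.5413) ≈ 1, mimicking the paper's initialization λ^i = 1
raw_lam = torch.full((N_s,), 0.5413, requires_grad=True)

def weighted_term(residuals):
    """residuals: shape (N_s,) tensor of PDE residuals at the sample points."""
    lam = torch.nn.functional.softplus(raw_lam)  # keeps λ^i >= 0 for stable gradients
    return (lam * residuals.pow(2)).mean()
```

During pre-training, $\lambda$ is updated by gradient ascent (see the updates in the next subsection), so points with large residuals receive larger weights and are no longer neglected by the optimizer.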

3.2. Adaptive Activation Function

Traditional activation functions (ReLU, Sigmoid, Tanh) suffer from issues such as vanishing and exploding gradients, which limit the performance of neural networks in complex physical modeling. In recent studies, the design of effective activation functions has become an active area of research [29,30]. To address these challenges, Jagtap et al. [31] proposed incorporating learnable parameters into traditional activation functions.
Recently, researchers have proposed a new activation function called Swish [32]. It has shown superior performance compared to the ReLU function in image and language processing tasks [33]. The standard Swish activation function is defined as

$$\mathrm{Swish}(x) := x \cdot \mathrm{Sigmoid}(\beta x),$$

where $\beta = 1$. However, for problems involving complex physical models, such as the UC problem discussed in this paper, the fixed-parameter form of the Swish activation function lacks flexibility. Thus, this paper proposes an adaptive Swish activation function with learnable parameters $\beta_i\ (i = s, p, f, F, m)$ such that $0.1 < \beta_i < 10$:

$$\sigma(x) = x \cdot \mathrm{Sigmoid}(\beta_i x) = \frac{x}{1 + e^{-\beta_i x}}, \quad 0.1 < \beta_i < 10,$$

where each $\beta_i$ corresponds to one of the quantities to be solved in the models of Section 2. In this way, the activation function has a different "nonlinear strength" for each quantity, which is equivalent to introducing a self-controlled gating mechanism. The specific range for the adaptive parameter $\beta_i$ is chosen for the following reasons:
  • If $\beta_i$ is too small, $\mathrm{Sigmoid}(\beta_i x) \approx 0.5$, causing the activation function to behave nearly linearly and lose its nonlinear characteristics.
  • If $\beta_i$ is too large, $\mathrm{Sigmoid}(\beta_i x)$ approaches a step function, leading to gradient explosion or training instability during the training process.
The values of $\beta_i$ for the different experiments are presented in Section 5. In the UC problem, this flexibility allows the neural network to switch freely between different physical quantities and domains, which is more consistent with the physical characteristics of coupled PDE models.
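A minimal module-level sketch of this adaptive Swish follows (PyTorch for illustration; clamping is one possible way, assumed by us, to enforce $0.1 < \beta_i < 10$):

```python
# Adaptive Swish σ(x) = x · Sigmoid(β_i x) with a learnable β_i kept in (0.1, 10).
# PyTorch illustration; clamping is our assumed way of enforcing the range.
import torch

class AdaptiveSwish(torch.nn.Module):
    def __init__(self, beta_init=1.0):        # β_i initialized to 1 as in Section 5
        super().__init__()
        self.beta = torch.nn.Parameter(torch.tensor(beta_init))

    def forward(self, x):
        beta = self.beta.clamp(0.1, 10.0)      # enforce 0.1 < β_i < 10
        return x * torch.sigmoid(beta * x)
```

Instantiating one such module per physical quantity ($\beta_s$, $\beta_p$, $\beta_F$, $\beta_m$, ...) reproduces the per-quantity "nonlinear strength" described above.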
The dual adaptive neural network (DANN) algorithm is the fusion of the adaptively weighted loss function and the adaptive activation function. The solving process of the UC problem is shown in Figure 2.
For the DANN algorithm, we employ the commonly used Adam optimizer [34] for pre-training in order to accelerate convergence and obtain optimal adaptive parameters $\beta_i$, $\lambda_p$. BFGS [35] is used immediately afterward until the loss function converges completely. In the Adam pre-training process, gradient descent is applied to $\theta$ and gradient ascent to $\beta_i$ and $\lambda_p$:

$$\theta^{k+1} = \theta^k - \eta_k \nabla_{\theta} J(\theta^k, \beta_i^k, \lambda_p^k), \qquad \beta_i^{k+1} = \beta_i^k + \eta_k \nabla_{\beta_i} J(\theta^k, \beta_i^k, \lambda_p^k), \qquad \lambda_p^{k+1} = \lambda_p^k + \eta_k \nabla_{\lambda_p} J(\theta^k, \beta_i^k, \lambda_p^k).$$

In the BFGS iteration, $\lambda_p$ and $\beta_i$ are fixed and $\theta$ is updated by gradient descent until the end:

$$\theta^{k+1} = \theta^k - \alpha_k \nabla_{\theta} J(\theta^k, \beta_i, \lambda_p).$$

The learning rates of the two training stages are denoted by $\eta$ and $\alpha$, respectively.
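The two-stage procedure can be sketched as follows. This is our PyTorch illustration, not the authors' code: we approximate BFGS with the library's L-BFGS, and the sign flip on the gradients of $\beta_i$ and $\lambda_p$ implements the ascent updates above.

```python
# Sketch of the two-stage optimization: Adam pre-training with descent on θ and
# ascent on (β_i, λ_p), followed by an (L-)BFGS refinement of θ alone.
import torch

def pretrain_adam(model, adaptive_params, loss_fn, steps, lr=1e-3):
    opt = torch.optim.Adam(list(model.parameters()) + adaptive_params, lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        loss_fn().backward()
        for p in adaptive_params:          # flip sign: gradient ascent on β_i, λ_p
            if p.grad is not None:
                p.grad.neg_()
        opt.step()

def refine_bfgs(model, loss_fn, max_iter=500):  # β_i, λ_p frozen, θ only
    opt = torch.optim.LBFGS(model.parameters(), max_iter=max_iter)
    def closure():
        opt.zero_grad()
        loss = loss_fn()
        loss.backward()
        return loss
    opt.step(closure)
```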
Theoretically, the core of the DANN for solving PDEs is the approximation of integrals by an abstract quadrature rule [10], as follows. Assume that the quantity of interest is the integral over $D$ of a function $F$,

$$\overline{F} = \int_{D} F(x)\, dx,$$

where $dx$ denotes the $d$-dimensional Lebesgue measure. To solve with a neural network, it is necessary to select quadrature points (sample points) $x_i \in D$ for $1 \le i \le N$, where $N$ denotes their total number. Subsequently, the quadrature approximation can be established:

$$\overline{F}_N = \sum_{i=1}^{N} w_i F(x_i),$$

where $w_i \in \mathbb{R}^+$ are the quadrature weights. We further assume that the quadrature error is bounded as

$$|\overline{F} - \overline{F}_N| \le C N^{-\alpha},$$

for some rate $\alpha > 0$. The quadrature points and weights depend on the order of the underlying quadrature rule, and $\alpha$ depends on the regularity of the integrand [36]. Thus, we can employ the above standard quadrature rule when solving PDEs with the DANN.
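For intuition, the following small check (our example, not from the paper) verifies the bound for uniform random sample points with equal weights $w_i = 1/N$, for which the expected rate is $\alpha \approx 1/2$:

```python
# Monte Carlo instance of the quadrature rule: w_i = 1/N, x_i uniform in D = [0,1]^2.
# The error |F̄ - F̄_N| decays roughly like N^(-1/2); the test function is our choice.
import numpy as np

rng = np.random.default_rng(0)
f = lambda p: np.sin(np.pi * p[:, 0]) * np.sin(np.pi * p[:, 1])
exact = (2.0 / np.pi) ** 2        # ∫_{[0,1]^2} sin(πx) sin(πy) dx dy = (2/π)^2

for N in (100, 10_000, 1_000_000):
    pts = rng.random((N, 2))      # quadrature (sample) points x_i
    print(N, abs(exact - f(pts).mean()))
```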

4. Theoretical Analysis

In this section, we obtain generalization error estimates through a conditional stability analysis of the UC problem, thereby providing a theoretical basis for the DANN algorithm. For ease of understanding, we write $\hat{D} = (D_s \setminus D'_s) \cup (D_d \setminus D'_d)$, and the following conditional stability holds.
Theorem 1.
For the dual-porosity–Navier–Stokes model, let $f_s \in L^2(\hat{D})$, $f_d \in L^2(\hat{D})$ and $g \in H^1(\partial D')$, where $\partial D' = \partial D'_s \cup \partial D'_d$. Let $u_s \in H^1(\hat{D})$, $p_s \in L^2(\hat{D})$, $\varphi_F \in H^1(\hat{D})$ and $\varphi_m \in H^1(\hat{D})$ hold for all test functions $v_s \in H_0^1(\hat{D})$, $q_s \in L_0^2(\hat{D})$, $v_F \in H_0^1(\hat{D})$ and $v_m \in H_0^1(\hat{D})$. We have the global stability estimate

$$\|u_s\|_{H^1(\hat{D})} + \|p_s\|_{L^2(\hat{D})} + \|\varphi_F\|_{H^1(\hat{D})} + \|\varphi_m\|_{H^1(\hat{D})} \le C \left( \|f_s\|_{L^2(\hat{D})} + \|f_d\|_{L^2(\hat{D})} + \|g\|_{H^1(\partial D')} \right).$$

The boundary of $\hat{D}$ consists of the boundary $\partial D$, the boundary $\partial D'$ and the interfaces $\Gamma_i\ (i = 2, 3)$.
Proof of Theorem 1.
Firstly, we multiply (4) by $v_m \in H_0^1(\hat{D})$ and (6) by $v_F \in H_0^1(\hat{D})$ and use the interface conditions (8) and (9). Then, integrating over $\hat{D}$ gives the following weak forms:

$$\frac{k_m}{\mu} \int_{\hat{D}} \nabla \varphi_m \cdot \nabla v_m \, d\hat{D} - \frac{k_m}{\mu} \int_{\partial D} \nabla \varphi_m \cdot n_d \, v_m \, ds + \frac{k_m}{\mu} \int_{\partial D'} \nabla \varphi_m \cdot n_d \, v_m \, ds + \frac{\sigma k_m}{\mu} \int_{\hat{D}} (\varphi_m - \varphi_F)\, v_m \, d\hat{D} = 0,$$

and

$$\frac{k_F}{\mu} \int_{\hat{D}} \nabla \varphi_F \cdot \nabla v_F \, d\hat{D} - \frac{k_F}{\mu} \int_{\partial D} \nabla \varphi_F \cdot n_d \, v_F \, ds + \frac{k_F}{\mu} \int_{\partial D'} \nabla \varphi_F \cdot n_d \, v_F \, ds - \int_{\Gamma_i} (u_s \cdot n_s)\, v_F \, ds + \frac{\sigma k_m}{\mu} \int_{\hat{D}} (\varphi_F - \varphi_m)\, v_F \, d\hat{D} = \int_{\hat{D}} f_d \, v_F \, d\hat{D}.$$

Let $v_F = \varphi_F$ and $v_m = \varphi_m$. According to the Cauchy inequality and the Poincaré inequality, we can obtain

$$\|\nabla \varphi_m\|_{L^2(\hat{D})}^2 + \|\varphi_m\|_{L^2(\hat{D})}^2 - \|\varphi_F\|_{L^2(\hat{D})} \|\varphi_m\|_{L^2(\hat{D})} \le C_m \|g\|_{L^2(\partial D')} \|\varphi_m\|_{L^2(\hat{D})},$$
$$\|\varphi_m\|_{H^1(\hat{D})}^2 + \|\varphi_m\|_{L^2(\hat{D})}^2 - \|\varphi_F\|_{L^2(\hat{D})} \|\varphi_m\|_{L^2(\hat{D})} \le C_m \|g\|_{H^1(\partial D')} \|\varphi_m\|_{L^2(\hat{D})},$$
$$\|\varphi_m\|_{H^1(\hat{D})} + \|\varphi_m\|_{L^2(\hat{D})} - \|\varphi_F\|_{L^2(\hat{D})} \le C_m \|g\|_{H^1(\partial D')},$$

and

$$\|\nabla \varphi_F\|_{L^2(\hat{D})}^2 - \|u_s\|_{L^2(\Gamma_i)} \|\varphi_F\|_{L^2(\hat{D})} + \|\varphi_F\|_{L^2(\hat{D})}^2 - \|\varphi_m\|_{L^2(\hat{D})} \|\varphi_F\|_{L^2(\hat{D})} \le C_F \left( \|f_d\|_{L^2(\hat{D})} + \|g\|_{L^2(\partial D')} \right) \|\varphi_F\|_{L^2(\hat{D})},$$
$$\|\varphi_F\|_{H^1(\hat{D})}^2 - \|u_s\|_{L^2(\Gamma_i)} \|\varphi_F\|_{L^2(\hat{D})} + \|\varphi_F\|_{L^2(\hat{D})}^2 - \|\varphi_m\|_{L^2(\hat{D})} \|\varphi_F\|_{L^2(\hat{D})} \le C_F \left( \|f_d\|_{L^2(\hat{D})} + \|g\|_{H^1(\partial D')} \right) \|\varphi_F\|_{L^2(\hat{D})},$$
$$\|\varphi_F\|_{H^1(\hat{D})} - \|u_s\|_{L^2(\Gamma_i)} + \|\varphi_F\|_{L^2(\hat{D})} - \|\varphi_m\|_{L^2(\hat{D})} \le C_F \left( \|f_d\|_{L^2(\hat{D})} + \|g\|_{H^1(\partial D')} \right).$$
Combining the results of (29) and (30) gives

$$\|\varphi_F\|_{H^1(\hat{D})} + \|\varphi_m\|_{H^1(\hat{D})} \le C \left( \|f_d\|_{L^2(\hat{D})} + \|g\|_{H^1(\partial D')} + \|u_s\|_{L^2(\Gamma_i)} \right).$$

Then, multiplying (1) by $v_s \in H_0^1(\hat{D})$ and (2) by $q \in L_0^2(\hat{D})$ and taking the integrals similarly, we obtain the weak form in the conduit domain:

$$\nu \int_{\hat{D}} \nabla u_s : \nabla v_s \, d\hat{D} - \nu \int_{\partial D} \nabla u_s \cdot n_d \cdot v_s \, ds + \nu \int_{\partial D'} \nabla u_s \cdot n_d \cdot v_s \, ds + c(u_s, u_s, v_s)_{\hat{D}} + \frac{1}{\rho} \int_{\Gamma_i} \varphi_F \, v_s \cdot n_s \, ds + \sum_{i=1}^{d-1} \int_{\Gamma_i} \frac{\alpha \nu \sqrt{d}}{\sqrt{\mathrm{tr}(\Pi)}} (u_s \cdot \tau_i)(v_s \cdot \tau_i) \, ds = \int_{\hat{D}} f_s \cdot v_s \, d\hat{D}, \qquad \int_{\hat{D}} (\nabla \cdot u_s)\, q \, d\hat{D} = 0,$$

where the nonlinear form is $c(u_s, u_s, v_s)_{\hat{D}} := ((u_s \cdot \nabla) u_s, v_s)_{\hat{D}}$. Let $v_s = u_s$. According to the standard property of the nonlinear term [37], the nonlinear form can be estimated as

$$((u_s \cdot \nabla) u_s, v_s)_{\hat{D}} \le C_N \|u_s\|_{L^2(\hat{D})}^{1/2} \|\nabla u_s\|_{L^2(\hat{D})}^{1/2} \|\nabla u_s\|_{L^2(\hat{D})} \|\nabla v_s\|_{L^2(\hat{D})} \le C_N \|u_s\|_{H^1(\hat{D})}^3.$$

From the interface conditions (10) and (11), the Cauchy inequality, (32) and the Poincaré inequality, the following conclusions can be drawn:

$$\|\nabla u_s\|_{L^2(\hat{D})} \|u_s\|_{H^1(\hat{D})} + \|\varphi_F\|_{L^2(\Gamma_i)} \|u_s\|_{H^1(\hat{D})} + \|u_s\|_{L^2(\Gamma_i)} \|u_s\|_{H^1(\hat{D})} \le C_s \left( \|u_s\|_{H^1(\hat{D})}^3 + \|f_s\|_{L^2(\hat{D})} \|u_s\|_{H^1(\hat{D})} + \|g\|_{L^2(\partial D')} \|u_s\|_{H^1(\hat{D})} \right),$$
$$\|u_s\|_{H^1(\hat{D})} + \|\varphi_F\|_{L^2(\Gamma_i)} + \|u_s\|_{L^2(\Gamma_i)} \le C_s \left( \|u_s\|_{H^1(\hat{D})}^2 + \|f_s\|_{L^2(\hat{D})} + \|g\|_{H^1(\partial D')} \right),$$
$$\|u_s\|_{H^1(\hat{D})} + \|u_s\|_{L^2(\Gamma_i)} \le C_s \left( \|u_s\|_{H^1(\hat{D})}^2 + \|f_s\|_{L^2(\hat{D})} + \|g\|_{H^1(\partial D')} \right).$$
For a positive constant $C_1$, the following inf-sup condition holds [38]:

$$C_1 \le \sup_{0 \ne v \in (H_0^1(\Omega))^d} \frac{(\operatorname{div} v, q)}{\|v\|_{(H_0^1(\Omega))^d} \|q\|_{L^2(\Omega)}}, \qquad \forall q \in L_0^2(\Omega),$$

where $d$ is the spatial dimension. Thus, according to Equation (1), there exists $p_s \in L_0^2(\hat{D})$ such that, for $v_s \in (H_0^1(\hat{D}))^d$ and using the nonlinear property, the pressure can be estimated as

$$\|p_s\|_{L^2(\hat{D})} \le C_1 \sup_{0 \ne v_s \in (H_0^1(\hat{D}))^d} \frac{1}{\|v_s\|_{H^1(\hat{D})}} \left[ \nu \int_{\hat{D}} \nabla u_s : \nabla v_s \, d\hat{D} - \nu \int_{\partial D'} \nabla g \cdot n_d \cdot v_s \, ds + c(u_s, u_s, v_s) + \sum_{i=1}^{d-1} \int_{\Gamma_i} \frac{\alpha \nu \sqrt{d}}{\sqrt{\mathrm{tr}(\Pi)}} (u_s \cdot \tau_i)(v_s \cdot \tau_i) \, d\Gamma_i + \frac{1}{\rho} \int_{\Gamma_i} \varphi_F \, v_s \cdot n_s \, d\Gamma_i - \int_{\hat{D}} f_s \cdot v_s \, d\hat{D} \right]$$
$$\le C_1 \left( \|\nabla u_s\|_{L^2(\hat{D})} + \|\varphi_F\|_{L^2(\Gamma_i)} + \|u_s\|_{L^2(\Gamma_i)} + \|u_s\|_{H^1(\hat{D})}^2 + \|f_s\|_{L^2(\hat{D})} + \|g\|_{H^1(\partial D')} \right).$$

Adding up the conclusions of (31), (33) and (34) leads to

$$\|u_s\|_{H^1(\hat{D})} + \|p_s\|_{L^2(\hat{D})} + \|\varphi_m\|_{H^1(\hat{D})} + \|\varphi_F\|_{H^1(\hat{D})} \le C \left( \|f_d\|_{L^2(\hat{D})} + \|f_s\|_{L^2(\hat{D})} + \|g\|_{H^1(\partial D')} \right). \qquad \square$$
Theorem 2.
For the triple-porosity–Navier–Stokes model, let $f_s \in L^2(\hat{D})$, $f_d \in L^2(\hat{D})$ and $g \in H^1(\partial D')$. Let $u_s \in H^1(\hat{D})$, $p_s \in L^2(\hat{D})$, $\varphi_F \in H^1(\hat{D})$, $\varphi_f \in H^1(\hat{D})$ and $\varphi_m \in H^1(\hat{D})$ hold for all test functions $v_s \in H_0^1(\hat{D})$, $q_s \in L_0^2(\hat{D})$, $v_F \in H_0^1(\hat{D})$, $v_f \in H_0^1(\hat{D})$ and $v_m \in H_0^1(\hat{D})$. We have the global stability estimate

$$\|u_s\|_{H^1(\hat{D})} + \|p_s\|_{L^2(\hat{D})} + \|\varphi_F\|_{H^1(\hat{D})} + \|\varphi_f\|_{H^1(\hat{D})} + \|\varphi_m\|_{H^1(\hat{D})} \le C \left( \|f_s\|_{L^2(\hat{D})} + \|f_d\|_{L^2(\hat{D})} + \|g\|_{H^1(\partial D')} \right).$$

The proof of Theorem 2 is very similar to that of Theorem 1 and is easily obtained. For the generalization error of each physical quantity, we let $u_s^*$, $p_s^*$, $\varphi_m^*$, $\varphi_F^*$ and $\varphi_f^*$ denote the neural network solutions obtained by the DANN, and $f_d^*$ and $f_s^*$ the right-hand-side terms satisfied by the neural network solution. The observational data approximated by the DANN is denoted by $g^*$. Define the errors $\varepsilon_s = \|u_s - u_s^*\|_{H^1(\hat{D})}$, $\varepsilon_p = \|p_s - p_s^*\|_{L^2(\hat{D})}$, $\varepsilon_m = \|\varphi_m - \varphi_m^*\|_{H^1(\hat{D})}$, $\varepsilon_F = \|\varphi_F - \varphi_F^*\|_{H^1(\hat{D})}$ and $\varepsilon_f = \|\varphi_f - \varphi_f^*\|_{H^1(\hat{D})}$.
Remark 1.
According to the conclusions of Theorems 1 and 2, the global generalization error between the analytical solution and the neural network solution satisfies

$$\varepsilon := \varepsilon_s + \varepsilon_p + \varepsilon_m + \varepsilon_F + \varepsilon_f \le C \left( \|f_d - f_d^*\|_{L^2(\hat{D})} + \|f_s - f_s^*\|_{L^2(\hat{D})} + \|g - g^*\|_{H^1(\partial D')} \right),$$

where $\varepsilon$ denotes the global generalization error.
According to [20], the generalization error estimate is also determined by the training error $E$, defined from the loss function (24) as

$$E := J_{\Omega_s}(x; \theta, \lambda_s)^{1/2} + J_{\partial \Omega_s}(x; \theta, \lambda_{bs})^{1/2} + J_{\Omega_d}(x; \theta, \lambda_d)^{1/2} + J_{\partial \Omega_d}(x; \theta, \lambda_{bd})^{1/2} + J_{\Gamma}(x; \theta, \lambda_i)^{1/2}.$$
The training error E can be easily computed from the loss function at the end of the training. Thus, we can estimate the global generalization error in the following result.
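In practice, $E$ is assembled directly from the five converged loss components; a trivial helper (our sketch, with assumed variable names) reads:

```python
# Training error E: sum of square roots of the five loss components at the end
# of training (our helper; argument names are assumptions).
import math

def training_error(j_s, j_bs, j_d, j_bd, j_gamma):
    return sum(math.sqrt(j) for j in (j_s, j_bs, j_d, j_bd, j_gamma))
```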
Theorem 3.
According to the abstract quadrature formulas, the global generalization error ε can be further estimated by the following form:
$$\varepsilon \le C E + C_{\mathrm{int}} N_{\mathrm{int}}^{-\alpha}.$$

Here, $C$ and $C_{\mathrm{int}}$ are non-negative constants, and $N_{\mathrm{int}}$ is determined by the numbers of sample points in the domain $D$.
Proof of Theorem 3.
According to Remark 1, $\varepsilon$ can be bounded by the training error $E$ and the quadrature error (27):

$$\varepsilon \le C \left( \|f_d - f_d^*\|_{L^2(\hat{D})} + \|f_s - f_s^*\|_{L^2(\hat{D})} + \|g - g^*\|_{H^1(\partial D')} \right) \le C E + C_{f_s} N_{f_s}^{-\alpha} + C_{f_d} N_{f_d}^{-\alpha} + C_{\Gamma} N_{\Gamma}^{-\alpha} + C_{bou_s} N_{bou_s}^{-\alpha} + C_{bou_d} N_{bou_d}^{-\alpha} \le C E + C_{\mathrm{int}} N_{\mathrm{int}}^{-\alpha},$$

where $N_{\mathrm{int}} = \min \{ N_{f_s}, N_{f_d}, N_{\Gamma}, N_{bou_s}, N_{bou_d} \}$. □
Remark 2.
Theorem 3 explains the mechanism for solving the UC problem using the DANN because it decomposes the source of error into the following parts:
  • The DANN algorithm uses an adaptively weighted loss function. This makes the training adequate, meaning that the training error E becomes small enough. We report the loss values and plot the adaptive weights in the experiments in Section 5.
  • The DANN algorithm uses an adaptive activation function, which performs adaptive computation for the different physical quantities. In Section 5, the adaptive Swish activation function outperforms Sigmoid and Tanh.
  • The error estimate in Theorem 3 relies on the conditional stability of the UC problem. Thus, the generalization error is efficiently controlled through the DANN and the conditional stability of the PDEs.

5. Numerical Experiment

In this section, we conduct numerical experiments on the two types of free-flow coupled porous media models introduced in Section 2. The network parameters of the DANN algorithm are summarized in Table 1, where $NN$ denotes the number of neurons in each hidden layer, $\eta$ is the learning rate for Adam pre-training and $\alpha$ is the learning rate for the BFGS iteration. $N_p$ $(p = bs, bd, i, s, d)$ denotes the number of training points on the observational boundaries, at the interface and in the interior, respectively. Notably, for ease of understanding and training, all learnable parameters $\lambda_p$ and $\beta_i$ are initialized to 1. The numerical experiments in this section are based on TensorFlow 1.14; the code is written in Python 3.7 and the CPU is an Intel(R) Core(TM) i7-12700H.

5.1. Dual-Porosity–Navier–Stokes Model with Analytical Solution

In this example, the computational domain D is composed of two subdomains: the free-flow domain $D_s = [0, 1] \times [1, 2]$ and the porous media domain $D_d = [0, 1] \times [0, 1]$, with interface $\Gamma = [0, 1] \times \{1\}$. For computational simplicity, all physical parameters, including $\nu$, $\mu$, $\sigma$, $k_m$, $k_F$, $\rho$ and $\alpha$, are set to 1. The spatial dimension is $d = 2$. The source terms in the dual-porosity–Navier–Stokes model are determined by the following analytical solution:

$$u_s = \begin{pmatrix} x^2 (y - 1)^2 + y \\ -\frac{2}{3} x (y - 1)^3 + 2 - \pi \sin(\pi x) \end{pmatrix}, \qquad p_s = \left( 2 - \pi \sin(\pi x) \right) \sin\left( \frac{\pi}{2} y \right),$$
$$\varphi_F = \left( 2 - \pi \sin(\pi x) \right) \left( 1 - y - \cos(\pi y) \right), \qquad \varphi_m = \left( 2 - \pi \sin(\pi x) \right) \cos(\pi (1 - y)).$$
In the UC problem, observational data can only be obtained from $D'_s = [0.1, 0.9] \times [1, 1.9]$ and $D'_d = [0.1, 0.9] \times [0.1, 1]$, as shown in Figure 3 Left. The data on and near the boundary, shown in Figure 3 Right, are unknown.
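A hedged sketch of how such observational training data can be generated follows; uniform sampling and the point counts are our assumptions, and $g$ is evaluated from the analytical solution above (here the matrix pressure $\varphi_m$).

```python
# Sampling observational data in D'_s = [0.1,0.9]x[1,1.9] and D'_d = [0.1,0.9]x[0.1,1].
# Uniform sampling and the counts are our assumptions; g comes from the analytical solution.
import numpy as np

rng = np.random.default_rng(1)

def sample_box(lo, hi, n):
    lo, hi = np.asarray(lo), np.asarray(hi)
    return lo + (hi - lo) * rng.random((n, 2))

obs_s = sample_box([0.1, 1.0], [0.9, 1.9], 4000)   # points in D'_s
obs_d = sample_box([0.1, 0.1], [0.9, 1.0], 4000)   # points in D'_d

def phi_m(x, y):
    """Matrix pressure (2 - π sin(πx)) cos(π(1 - y)) from the analytical solution."""
    return (2.0 - np.pi * np.sin(np.pi * x)) * np.cos(np.pi * (1.0 - y))

g_d = phi_m(obs_d[:, 0], obs_d[:, 1])              # observational data g on D'_d
```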
At the end of training, the adaptively weighted values are shown in Figure 4, and the values of the adaptive activation parameters are $\beta_s = 1.43$, $\beta_p = 1.58$, $\beta_F = 1.21$ and $\beta_m = 1.14$.
First, we perform ablation experiments to show that the DANN algorithm obtains accurate results. In Table 2, "Non-adaptive" refers to the algorithm with no adaptive techniques applied, while "Adaptive Swish" and "Adaptively weighted" denote the use of the adaptive activation function of Section 3.2 and the adaptively weighted loss of Section 3.1 alone, respectively. To isolate each change, both "Non-adaptive" and "Adaptively weighted" use the standard Swish activation function with $\beta_i = 1$, and each method employs three hidden layers.
From the results of solving the UC problem in Table 2, the DANN algorithm obtains the majority of the best results, and its second-best results are close to the best ones. Furthermore, the DANN algorithm attains the minimum loss function value, which means it achieves the minimum training error. According to Theorem 3, the DANN algorithm therefore also achieves the smallest generalization error bound.
Similar to [29], we discuss the advantages of the adaptive Swish activation function over the Sigmoid and Tanh activation functions commonly used for solving PDEs. Figure 5 shows the loss function trained using three hidden layers, where "ASwish" stands for the adaptive Swish activation function. The use of adaptive Swish results in the minimum loss function value which, according to Theorem 3, indicates that its error bound is optimal.
To ensure that only the activation function changes, the loss function uses the adaptively weighted loss described in Section 3.1. Regarding the accuracy of solving the UC problem of the dual-porosity–Navier–Stokes model, Table 3 shows the $L^1$ and $L^2$ errors obtained with different activation functions. From the results, the DANN algorithm achieves the smallest $L^1$ and $L^2$ errors. Based on the above results, the adaptive Swish activation function is the most suitable.
Figure 6 and Figure 7 illustrate the analytical solution from observational data and the neural network solution of the DANN algorithm using three hidden layers. The results clearly show that the DANN algorithm is still able to compute the solution over the entire domain with incomplete data.
To confirm the accuracy of the DANN algorithm in the unknown domain, we randomly selected unknown points, as shown in Figure 3 Right, to verify the error between the neural network solution and the analytical solution. The errors are nearly zero everywhere. The results in Figure 8 show that the DANN algorithm successfully solves the UC problem of the dual-porosity–Navier–Stokes model and effectively extends the prediction range.
In free-flow coupled porous media models, investigating fluid flow within complex domains is also crucial [39]. Therefore, we examine the proposed DANN algorithm’s ability to solve the UC problem within complex regions. The specific complex domain design is illustrated in Figure 9 Left.
After training, the values of $\lambda_p$ in the adaptively weighted loss are as shown in Figure 9 Right. The values $\beta_s = 1.20$, $\beta_p = 1.16$, $\beta_F = 1.41$ and $\beta_m = 1.43$ are used in the adaptive activation function. The analytical solution, the solution computed by the DANN and the error between them are plotted in Figure 10. The neural network solution closely matches that in Figure 7. It can be observed that the DANN approach can also solve the UC problem in complex domains, effectively recovering the solution in unknown regions.

5.2. Cavity Flow Test of Dual-Porosity–Navier–Stokes Model

Based on recent literature [40,41,42], we explore the flow of free-flowing fluids without relying on analytical solutions. Specifically, we examine the case where fluids in a coupled porous medium model flow through an interface. Bo et al. [42] use the non-equilibrium extrapolation scheme to set boundary conditions for the experiment and simulate the flow patterns of the fluid.
In this subsection, the cavity flow simulation of the dual-porosity–Navier–Stokes model is considered under the UC problem. Let the computational domains be $D_s = [0, 1] \times [1, 1.25]$ and $D_d = [0, 1] \times [0, 1]$, with observational domain $D'_d = [0.1, 0.9] \times [0.1, 1]$, as shown in Figure 11 Left. The condition imposed on the free-flow domain is

$$u_s = (\sin(\pi x),\ 0),$$

and the Dirichlet condition is applied at the boundary [40,41]. The pressure in the major fractures is set to $\varphi_F = 0$, and the pressure in the matrix is $\varphi_m = 0$. The remaining parameters, including $d = 2$, $\nu$, $\mu$, $\sigma$, $k_m$, $k_F$, $\rho$ and $\alpha$, are all set to 1. The external force terms $f_s$ and $f_d$ are both equal to 0.
In this experiment, the neural network uses three hidden layers. At the end of pre-training, the visualization of the results for $\lambda_p$ in the adaptively weighted loss function is shown in Figure 11 Right, and $\beta_s = 1.14$, $\beta_p = 1.09$, $\beta_F = 1.02$ and $\beta_m = 1.02$ in the adaptive Swish activation function.
The simulation results are shown in Figure 12 and are similar to the experimental results in [40]. In Figure 13, the DANN algorithm with the adaptive Swish activation function also obtains the minimum loss function value in this experiment without an analytical solution; in other words, it obtains the optimal results.

5.3. Triple-Porosity–Navier–Stokes Model with Analytical Solution

In this subsection, we discuss the UC problem of the triple-porosity–Navier–Stokes model. Let $D_s = [0, 1] \times [1, 2]$, $D_d = [0, 1] \times [0, 1]$ and the interface $\Gamma = [0, 1] \times \{1\}$. We set the following analytical solution:

$$u_s = \begin{pmatrix} x^2 (y - 1)^2 + y \\ -\frac{2}{3} x (y - 1)^3 + 2 - \pi \sin(\pi x) \end{pmatrix}, \qquad p_s = \left( 2 - \pi \sin(\pi x) \right) \sin\left( \frac{\pi}{2} y \right),$$
$$\varphi_F = \left( 2 - \pi \sin(\pi x) \right) \left( 1 - y - \cos(\pi y) \right), \qquad \varphi_f = \left( 2 - \pi \sin(\pi x) \right) \cos(\pi (1 - y)), \qquad \varphi_m = \left( 2 - \pi \sin(\pi x) \right) \sin\left( 3y - \frac{3}{2} y^2 \right).$$

All the parameters $k_i\ (i = F, f, m)$, $\sigma$, $\sigma^*$, $\mu$, $\rho$, $\nu$ and $\alpha$ are set to 1. Furthermore, in the UC problem, the observational domains are $D'_s = [0.1, 0.9] \times [1, 1.9]$ and $D'_d = [0.1, 0.9] \times [0.1, 1]$. The training points obtained from the observational domains are shown in Figure 3 Left. The adaptively weighted values are shown in Figure 14. The values of $\beta_i$ for the adaptive Swish are $\beta_s = 1.27$, $\beta_p = 1.41$, $\beta_F = 1.53$, $\beta_f = 1.36$ and $\beta_m = 1.13$, with the neural network using three hidden layers.
Similar to Section 5.1, we also discuss the reasons for choosing the adaptive Swish activation function in the triple-porosity–Navier–Stokes model. Based on the comparative results in Figure 15 and the error results in Table 4, the DANN algorithm with the adaptive Swish activation function proposed in this paper achieves the best loss function value and the smallest $L^1$ and $L^2$ errors.
Figure 16 illustrates the analytical solution on the observational domain, and Figure 17 plots the neural network solution over the computational domain obtained by the DANN algorithm. It shows that the DANN algorithm effectively solves the UC problem and successfully extends the prediction range.
As in Figure 3 Right, we randomly selected test points to verify the accuracy of the DANN in the unknown domain. From Figure 18, it can be concluded that the DANN algorithm has successfully expanded the range of predictions.

5.4. Triple-Porosity–Navier–Stokes Model in 3D

In the 3D experiment, let the computational domain be $D = [0, 1] \times [0, 1] \times [-0.25, 0.75]$ and the observational domain be $D' = [0.1, 0.9] \times [0.1, 0.9] \times [-0.15, 0.65]$, with $D_d = \{(x, y, z) \in D \mid z \le 0\}$, $D_s = \{(x, y, z) \in D \mid z \ge 0\}$ and $\Gamma = \{(x, y, z) \in D \mid z = 0\}$. The distribution of the observational data is shown in Figure 19 Left.
We utilize the analytical solution from [37], given below. The physical parameters of this model are again simply set; i.e., $k_i\ (i = F, f, m)$, $\sigma$, $\sigma^*$, $\mu$, $\rho$, $\nu$ and $\alpha$ are all equal to 1:

$$p_m = z + e^{z} \sin(xy) \cos(z), \qquad p_f = z + e^{z} \sin(xy) \cos(z), \qquad p_F = z + \left( 8 - x^2 - y^2 \right) \sin(xy) \cos(z),$$
$$u_c = \begin{pmatrix} 2x \sin(xy) + y \left( x^2 + y^2 - 8 \right) \cos(xy) \\ 2y \sin(xy) + x \left( x^2 + y^2 - 8 \right) \cos(xy) \\ 1 + \left( x^2 + y^2 \right) \left( x^2 + y^2 - 8 \right) \sin(xy) - 4 \sin(xy) - 8xy \cos(xy) \end{pmatrix},$$
$$p_s = 16xy \cos(xy) + \left( x^2 + y^2 + z^2 - 8 \right) \left( 2x^2 + 2y^2 + 2z^2 - 1 \right) \sin(xy) - 8 \sin(xy).$$
In this experiment, the neural network uses three hidden layers. At the end of training, the values in the adaptive activation function are $\beta_s = 1.32$, $\beta_p = 1.21$, $\beta_F = 1.56$, $\beta_f = 1.17$ and $\beta_m = 1.19$, and the adaptively weighted values are shown in Figure 19 Right. The results of the 3D UC problem are shown in Figure 20. The DANN algorithm accurately solves the UC problem in 3D and effectively extends the prediction range, with errors within acceptable limits.
The loss function uses the adaptive weighting of Section 3.1. Figure 21 and Table 5 demonstrate that adaptive Swish is also optimal in 3D: the best results are obtained using the DANN algorithm with adaptive Swish. This further confirms that the DANN is the most appropriate algorithm for solving the UC problem.
We select test points from $D \setminus D'$, as shown in Figure 19 Left. From the errors in the unknown domain in Figure 22, it can be concluded that the DANN algorithm effectively solves the UC problem in 3D.

6. Conclusions

In this paper, we propose a dual adaptive neural network (DANN) algorithm for solving free-flow coupled porous media models under the unique continuation (UC) problem. The proposed algorithm is applicable to both the UC problem and general forward problems for solving PDEs. The experimental results show that the DANN algorithm achieves strong performance on the UC problem for two classical free-flow coupled porous media models. The DANN algorithm combines an adaptively weighted loss and an adaptive activation function, and the numerical experiments indicate that the more complex the model, the more pronounced the accuracy advantage of the DANN algorithm. In the theoretical analysis, we provide a generalization error estimate for the UC problem and explain the rationale behind the feasibility of solving this problem using neural networks. Future research directions include the following:
  • Explore the integration of combined activation functions (e.g., adaptive Swish, Tanh and Sigmoid) within an adaptive framework to further enhance the performance and flexibility of the network.
  • Investigate unique continuation under time-dependent problems and propose neural network or machine learning algorithms that outperform the traditional Kalman filter method.
  • Explore whether deep operator networks (DeepONets) possess the capability to solve the UC problem and compare their performance with that of physics-informed neural networks (PINNs).
  • Discuss the application of the UC problem in practical real-world scenarios. We will actively explore how to incorporate real observational data into our framework to further validate the practicality of the proposed method beyond analytical solution benchmarks.

Author Contributions

Investigation, Conceptualization, Methodology, Software, Writing—original draft preparation, K.L.; Writing—reviewing and editing, Validation, Supervision, J.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Gao, L.; Li, J. A decoupled stabilized finite element method for the dual-porosity–Navier–Stokes fluid flow model arising in shale oil. Numer. Methods Partial Differ. Equ. 2021, 37, 2357–2374.
  2. Kashani, E.; Mohebbi, A.; Monfared, A.E.F.; de Vries, E.T.; Raoof, A. Lattice Boltzmann simulation of dissolution patterns in porous media: Single porosity versus dual porosity media. Adv. Water Resour. 2024, 188, 104712.
  3. Yadav, M.P.; Agarwal, R.; Purohit, S.D.; Kumar, D.; Suthar, D.L. Groundwater flow in karstic aquifer: Analytic solution of dual-porosity fractional model to simulate groundwater flow. Appl. Math. Sci. Eng. 2022, 30, 598–608.
  4. Li, J.; Gao, Z.Y.; Cao, L.L.; Chen, Z.X. A local parallel fully mixed finite element method for superposed fluid and porous layers. J. Comput. Appl. Math. 2026, 472, 116798.
  5. Hou, J.Y.; Hu, D.; Li, X.J.; He, X.M. Modeling and a domain decomposition method with finite element discretization for coupled dual-porosity flow and Navier–Stokes flow. J. Sci. Comput. 2023, 95, 67.
  6. Hou, J.Y.; Qiu, M.L.; He, X.M.; Guo, C.H.; Wei, M.Z.; Bai, B.J. A dual-porosity–Stokes model and finite element method for coupling dual-porosity flow and free flow. SIAM J. Sci. Comput. 2016, 38, B710–B739.
  7. Cao, L.L.; He, Y.N.; Li, J. A parallel Robin–Robin domain decomposition method based on modified characteristic FEMs for the time-dependent dual-porosity–Navier–Stokes model with the Beavers–Joseph interface condition. J. Sci. Comput. 2022, 90, 16.
  8. Li, R.; Zhang, C.S.; Chen, Z.X. A Stokes–dual-porosity–poroelasticity model and discontinuous Galerkin method for the coupled free flow and dual porosity poroelastic medium problem. J. Sci. Comput. 2025, 102, 41.
  9. Nasu, N.J.; Al Mahbub, M.A.; Zheng, H.B. A new coupled multiphysics model and partitioned time-stepping method for the triple-porosity–Stokes fluid flow model. J. Comput. Phys. 2022, 466, 111397.
  10. Mishra, S.; Molinaro, R. Estimates on the generalization error of physics-informed neural networks for approximating PDEs. IMA J. Numer. Anal. 2023, 43, 1–43.
  11. Hou, Q.Z.; Li, Y.X.; Singh, V.P.; Sun, Z.W. Physics-informed neural network for diffusive wave model. J. Hydrol. 2024, 637, 131261.
  12. Dieva, N.; Aminev, D.; Kravchenko, M.; Smirnov, N. Overview of the application of physically informed neural networks to the problems of nonlinear fluid flow in porous media. Computation 2024, 12, 69.
  13. Frey, R.; Köck, V. Deep neural network algorithms for parabolic PIDEs and applications in insurance and finance. Computation 2022, 10, 201.
  14. Seabe, P.L.; Moutsinga, C.R.B.; Pindza, E. Forecasting cryptocurrency prices using LSTM, GRU, and bi-directional LSTM: A deep learning approach. Fractal Fract. 2023, 7, 203.
  15. Yuan, S.H.; Liu, Y.Q.; Yan, L.M.; Zhang, R.F.; Wu, S.J. Neural networks-based analytical solver for exact solutions of fractional partial differential equations. Fractal Fract. 2025, 9, 541.
  16. Pang, G.F.; Lu, L.; Karniadakis, G.E. fPINNs: Fractional physics-informed neural networks. SIAM J. Sci. Comput. 2019, 41, A2603–A2626.
  17. Dockhorn, T. A discussion on solving partial differential equations using neural networks. arXiv 2019, arXiv:1904.07200.
  18. Chen, N. Stochastic Methods for Modeling and Predicting Complex Dynamical Systems: Uncertainty Quantification, State Estimation, and Reduced-Order Models; Springer Nature: Berlin/Heidelberg, Germany, 2023.
  19. Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707.
  20. Mishra, S.; Molinaro, R. Estimates on the generalization error of physics-informed neural networks for approximating a class of inverse problems for PDEs. IMA J. Numer. Anal. 2022, 42, 981–1022.
  21. Mai, J.A.; Li, Y.; Long, L.; Huang, Y.; Zhang, H.L.; You, Y.C. Two-dimensional temperature field inversion of turbine blade based on physics-informed neural networks. Phys. Fluids 2024, 36, 037114.
  22. Bellassoued, M.; Imanuvilov, O.; Yamamoto, M. Carleman estimate for the Navier–Stokes equations and an application to a lateral Cauchy problem. Inverse Probl. 2016, 32, 025001.
  23. Chaves-Silva, F.W.; Zhang, X.; Zuazua, E. Controllability of evolution equations with memory. SIAM J. Control Optim. 2017, 55, 2437–2459.
  24. Bourgeois, L. A mixed formulation of quasi-reversibility to solve the Cauchy problem for Laplace's equation. Inverse Probl. 2005, 21, 1087.
  25. Badra, M.; Caubet, F.; Dardé, J. Stability estimates for Navier–Stokes equations and application to inverse problems. arXiv 2016, arXiv:1609.03819.
  26. Sirignano, J.; Spiliopoulos, K. DGM: A deep learning algorithm for solving partial differential equations. J. Comput. Phys. 2018, 375, 1339–1364.
  27. Wang, S.F.; Teng, Y.J.; Perdikaris, P. Understanding and mitigating gradient flow pathologies in physics-informed neural networks. SIAM J. Sci. Comput. 2021, 43, A3055–A3081.
  28. McClenny, L.D.; Braga-Neto, U.M. Self-adaptive physics-informed neural networks. J. Comput. Phys. 2023, 474, 111722.
  29. Aghaee, A.; Khan, M.O. Performance of Fourier-based activation function in physics-informed neural networks for patient-specific cardiovascular flows. Comput. Methods Programs Biomed. 2024, 247, 108081.
  30. Wang, H.H.; Lu, L.; Song, S.J.; Gao, H. Learning specialized activation functions for physics-informed neural networks. arXiv 2023, arXiv:2308.04073.
  31. Jagtap, A.D.; Kawaguchi, K.; Karniadakis, G.E. Adaptive activation functions accelerate convergence in deep and physics-informed neural networks. J. Comput. Phys. 2020, 404, 109136.
  32. Ramachandran, P.; Zoph, B.; Le, Q.V. Swish: A self-gated activation function. arXiv 2017, arXiv:1710.05941.
  33. Sunkari, S.; Sangam, A.; Raman, R.; Rajalakshmi, R. A refined ResNet18 architecture with Swish activation function for diabetic retinopathy classification. Biomed. Signal Process. Control 2024, 88, 105630.
  34. Jais, I.K.M.; Ismail, A.R. Adam optimization algorithm for wide and deep neural network. Knowl. Eng. Data Sci. 2019, 2, 10.
  35. Zhou, W.J. A modified BFGS type quasi-Newton method with line search for symmetric nonlinear equations problems. J. Comput. Appl. Math. 2020, 367, 112454.
  36. Stoer, J.; Bulirsch, R. Introduction to Numerical Analysis; Texts in Applied Mathematics, Vol. 12; Springer: New York, NY, USA, 2002.
  37. Cao, L.L.; Li, J.; Chen, Z.X.; Du, G.Z. A local parallel finite element method for superhydrophobic proppants in a hydraulic fracturing system based on a 2D/3D transient triple-porosity Navier–Stokes model. arXiv 2023, arXiv:2311.05170.
  38. Bathe, K.-J. The inf-sup condition and its evaluation for mixed finite element methods. Comput. Struct. 2001, 79, 243–252.
  39. Kashefi, A.; Mukerji, T. Prediction of fluid flow in porous media by sparse observations and physics-informed PointNet. Neural Netw. 2023, 167, 80–91.
  40. Li, J.; Li, S.X.; Yue, J. MC-CDNNs: The Monte Carlo-coupled deep neural networks approach for stochastic dual-porosity-Stokes flow coupled model. Comput. Math. Appl. 2025, 181, 1–20.
  41. Yue, J.; Li, J.; Zhang, W.; Chen, Z.X. The coupled deep neural networks for coupling of the Stokes and Darcy–Forchheimer problems. Chin. Phys. B 2023, 32, 010201.
  42. Bo, A.N.; Mellibovsky, F.; Bergadà, J.M.; Sang, W.M. Towards a better understanding of wall-driven square cavity flows using the lattice Boltzmann method. Appl. Math. Model. 2020, 82, 469–486.
Figure 1. The observational domains $D'_s$, $D'_d$ and the unknown domains $D_s \setminus D'_s$, $D_d \setminus D'_d$ in the unique continuation problem.
Figure 2. A schematic of the DANN. The left part indicates training the neural network from incomplete observational data. The middle and right parts plot the neural network subject to PDEs and adaptive parameters.
Figure 3. (Left): the observational data. (Right): the data used to test the ability of neural networks to solve the UC problem in an unknown domain.
Figure 4. The adaptively weighted parameter values at the end of training.
Figure 5. Loss function of the neural network trained with different activation functions.
Figure 6. The analytical solutions in the UC problem.
Figure 7. The neural network solutions of the DANN.
Figure 8. Absolute error between the analytical solution and the neural network solution on the unknown domain.
Figure 9. (Left): the distribution of training sample points in a 2D complex domain. (Right): the values in the adaptively weighted loss in a 2D complex domain.
Figure 10. (Left): the analytical solutions under the UC problem. (Middle): the neural network solutions. (Right): the $L^1$ error between the analytical solutions and the neural network solutions.
Figure 11. (Left): the observational data in the square cavity test. (Right): the adaptively weighted parameter values at the end of training.
Figure 12. Velocity streamlines in the free-flow domain and the porous media domain.
Figure 13. Loss function of the neural network trained with different activation functions.
Figure 14. The adaptively weighted parameter values at the end of training.
Figure 15. Loss function of the neural network trained with different activation functions.
Figure 16. The analytical solutions in the UC problem.
Figure 17. The neural network solutions of the DANN.
Figure 18. Absolute error between the analytical solution and the neural network solution on the unknown domain.
Figure 19. (Left): the distribution of training sample points in the 3D DA framework. (Right): the adaptively weighted values in 3D.
Figure 20. (Left): the analytical solutions under the UC problem. (Middle): the neural network solutions. (Right): the $L^1$ error between the analytical solutions and the neural network solutions.
Figure 21. Loss function of the neural network trained with different activation functions.
Figure 22. Absolute error between the analytical solution and the neural network solution on the unknown domain in the 3D case.
Table 1. The parameters of the neural network.

| $N_{bs} = N_{bd}$ | $N_i$ | $N_s = N_d$ | $NN$ | $\eta$ | $\alpha$ | Max Iterations |
|---|---|---|---|---|---|---|
| 800 | 800 | 4000 | 32 | 0.001 | $1 \times 10^{-8}$ | 40,000 |
Table 2. Loss function and $L^2$ error results for different methods. The best results are highlighted in bold.

| | Non-Adaptive | Adaptive Swish | Adaptively Weighted | DANN |
|---|---|---|---|---|
| Loss | $5.16 \times 10^{-6}$ | $4.02 \times 10^{-6}$ | $4.72 \times 10^{-6}$ | **$3.97 \times 10^{-6}$** |
| $\|u_s - u_s^*\|_{L^2}$ | $1.76 \times 10^{-6}$ | $2.21 \times 10^{-6}$ | $1.81 \times 10^{-6}$ | **$1.20 \times 10^{-6}$** |
| $\|p_s - p_s^*\|_{L^2}$ | $6.46 \times 10^{-5}$ | $6.67 \times 10^{-5}$ | $5.03 \times 10^{-5}$ | **$4.98 \times 10^{-5}$** |
| $\|\varphi_F - \varphi_F^*\|_{L^2}$ | $5.20 \times 10^{-6}$ | $3.87 \times 10^{-6}$ | $2.36 \times 10^{-6}$ | **$2.29 \times 10^{-6}$** |
| $\|\varphi_m - \varphi_m^*\|_{L^2}$ | $4.28 \times 10^{-4}$ | $4.14 \times 10^{-4}$ | $1.03 \times 10^{-6}$ | **$5.77 \times 10^{-7}$** |
Table 3. The $L^1$ and $L^2$ errors of each quantity to be solved in the UC problem using three hidden layers. The best results are highlighted in bold. Here, $\varepsilon_s = u_s - u_s^*$, $\varepsilon_p = p_s - p_s^*$, $\varepsilon_m = \varphi_m - \varphi_m^*$ and $\varepsilon_F = \varphi_F - \varphi_F^*$.

| | DANN (Adaptive Swish) | | Tanh | | Sigmoid | |
|---|---|---|---|---|---|---|
| | $L^1$ Error | $L^2$ Error | $L^1$ Error | $L^2$ Error | $L^1$ Error | $L^2$ Error |
| $\varepsilon_s$ | **$7.01 \times 10^{-7}$** | **$1.76 \times 10^{-6}$** | $1.32 \times 10^{-6}$ | $3.01 \times 10^{-6}$ | $1.53 \times 10^{-6}$ | $4.17 \times 10^{-6}$ |
| $\varepsilon_p$ | **$2.72 \times 10^{-5}$** | **$6.46 \times 10^{-5}$** | $4.72 \times 10^{-5}$ | $1.02 \times 10^{-4}$ | $7.27 \times 10^{-5}$ | $1.60 \times 10^{-4}$ |
| $\varepsilon_F$ | **$1.03 \times 10^{-6}$** | **$3.20 \times 10^{-6}$** | $1.61 \times 10^{-6}$ | $4.17 \times 10^{-6}$ | $1.99 \times 10^{-6}$ | $4.24 \times 10^{-6}$ |
| $\varepsilon_m$ | **$7.13 \times 10^{-7}$** | **$1.77 \times 10^{-6}$** | $1.34 \times 10^{-6}$ | $4.18 \times 10^{-6}$ | $8.73 \times 10^{-7}$ | $3.21 \times 10^{-6}$ |
Table 4. The $L^1$ and $L^2$ errors of each quantity to be solved in the UC problem using three hidden layers. The best results are highlighted in bold. Here, $\varepsilon_s = u_s - u_s^*$, $\varepsilon_p = p_s - p_s^*$, $\varepsilon_m = \varphi_m - \varphi_m^*$, $\varepsilon_f = \varphi_f - \varphi_f^*$ and $\varepsilon_F = \varphi_F - \varphi_F^*$.

| | DANN (Adaptive Swish) | | Tanh | | Sigmoid | |
|---|---|---|---|---|---|---|
| | $L^1$ Error | $L^2$ Error | $L^1$ Error | $L^2$ Error | $L^1$ Error | $L^2$ Error |
| $\varepsilon_s$ | **$8.01 \times 10^{-7}$** | **$1.99 \times 10^{-6}$** | $9.16 \times 10^{-7}$ | $2.28 \times 10^{-6}$ | $9.80 \times 10^{-7}$ | $2.88 \times 10^{-6}$ |
| $\varepsilon_p$ | **$2.43 \times 10^{-5}$** | **$4.69 \times 10^{-5}$** | $3.13 \times 10^{-5}$ | $7.07 \times 10^{-5}$ | $3.59 \times 10^{-5}$ | $8.23 \times 10^{-5}$ |
| $\varepsilon_F$ | **$1.45 \times 10^{-6}$** | **$4.18 \times 10^{-6}$** | $2.37 \times 10^{-6}$ | $1.08 \times 10^{-5}$ | $2.89 \times 10^{-6}$ | $1.10 \times 10^{-5}$ |
| $\varepsilon_f$ | **$4.21 \times 10^{-7}$** | **$1.84 \times 10^{-6}$** | $9.72 \times 10^{-7}$ | $3.29 \times 10^{-6}$ | $1.02 \times 10^{-6}$ | $4.93 \times 10^{-6}$ |
| $\varepsilon_m$ | **$4.34 \times 10^{-6}$** | **$7.36 \times 10^{-6}$** | $5.45 \times 10^{-6}$ | $8.94 \times 10^{-6}$ | $5.31 \times 10^{-6}$ | $8.81 \times 10^{-6}$ |
Table 5. The $L^1$ and $L^2$ errors of each quantity to be solved in the 3D UC problem using three hidden layers. The best results are highlighted in bold. Here, $\varepsilon_s = u_s - u_s^*$, $\varepsilon_p = p_s - p_s^*$, $\varepsilon_m = \varphi_m - \varphi_m^*$, $\varepsilon_f = \varphi_f - \varphi_f^*$ and $\varepsilon_F = \varphi_F - \varphi_F^*$.

| | DANN (Adaptive Swish) | | Tanh | | Sigmoid | |
|---|---|---|---|---|---|---|
| | $L^1$ Error | $L^2$ Error | $L^1$ Error | $L^2$ Error | $L^1$ Error | $L^2$ Error |
| $\varepsilon_s$ | **$1.30 \times 10^{-4}$** | **$1.76 \times 10^{-4}$** | $2.76 \times 10^{-4}$ | $3.28 \times 10^{-4}$ | $1.31 \times 10^{-3}$ | $1.78 \times 10^{-3}$ |
| $\varepsilon_p$ | **$5.68 \times 10^{-4}$** | **$8.47 \times 10^{-4}$** | $6.29 \times 10^{-4}$ | $1.07 \times 10^{-3}$ | $5.27 \times 10^{-3}$ | $7.74 \times 10^{-3}$ |
| $\varepsilon_F$ | **$1.04 \times 10^{-6}$** | **$1.06 \times 10^{-5}$** | $2.81 \times 10^{-6}$ | $1.53 \times 10^{-5}$ | $1.16 \times 10^{-5}$ | $1.42 \times 10^{-5}$ |
| $\varepsilon_f$ | **$1.68 \times 10^{-6}$** | **$2.65 \times 10^{-6}$** | $2.47 \times 10^{-6}$ | $4.07 \times 10^{-6}$ | $7.47 \times 10^{-6}$ | $1.07 \times 10^{-5}$ |
| $\varepsilon_m$ | **$1.29 \times 10^{-6}$** | **$2.19 \times 10^{-6}$** | $1.48 \times 10^{-6}$ | $3.38 \times 10^{-6}$ | $4.07 \times 10^{-6}$ | $5.88 \times 10^{-6}$ |