A Stable Generalized Finite Element Method Coupled with Deep Neural Network for Interface Problems with Discontinuities

Jiang, Ying; Nian, Minghui; Zhang, Qinghui

doi:10.3390/axioms11080384

by Ying Jiang ¹, Minghui Nian ¹ and Qinghui Zhang ²,*

¹ School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou 510006, China

² School of Science, Harbin Institute of Technology, Shenzhen 518000, China

* Author to whom correspondence should be addressed.

Axioms 2022, 11(8), 384; https://doi.org/10.3390/axioms11080384

Submission received: 29 June 2022 / Revised: 3 August 2022 / Accepted: 3 August 2022 / Published: 5 August 2022

(This article belongs to the Special Issue Numerical Computation, Approximation of Functions and Applied Mathematics)


Abstract

The stable generalized finite element method (SGFEM) is an improved version of the generalized or extended FEM (GFEM/XFEM), which (i) uses simple and unfitted meshes, (ii) reaches optimal convergence orders, and (iii) is stable and robust in the sense that its conditioning is of the same order as that of the FEM and does not deteriorate as interfaces approach the boundaries of elements. This paper designs an SGFEM for the discontinuous interface problem (DIP) by coupling it with a deep neural network (DNN). The main idea is to construct a function using the DNN that captures the discontinuous interface condition, and to transform the DIP into an (approximately) equivalent continuous interface problem (CIP) based on the DNN function, so that the SGFEM for CIPs can be applied. The SGFEM for the DIP is a conforming method that maintains features (i)–(iii) of the SGFEM and is free from penalty terms. The approximation error of the proposed SGFEM is analyzed mathematically and is split into an SGFEM error for the CIP and a learning error of the DNN. The learning dimension of the DNN is one less than that of the domain, so the training can be implemented efficiently. It is known that the DNN enjoys advantages in nonlinear approximations and high-dimensional problems. Therefore, the proposed SGFEM coupled with the DNN has great potential for high-dimensional interface problems with interfaces of complex geometries. Numerical experiments verify the efficiency and optimal convergence of the proposed method.

Keywords: GFEM/XFEM; SGFEM; interface; discontinuous condition; deep neural network

MSC: 65N12; 65N30; 68T07

1. Introduction

Generalized or extended finite element methods (GFEM/XFEM) augment the standard FEM with special functions that mimic local features of exact solutions in order to solve complicated non-smooth engineering problems [1,2,3,4]. Applications of GFEM/XFEM to typical fields, such as crack problems, interface problems, and material failure, can be found in [3,5,6,7,8,9,10,11,12,13]. Both GFEM and XFEM are based on a partition of unity [14,15,16] to “paste” the local special functions together. For simplicity, we use GFEM to denote GFEM/XFEM below. The GFEM has been extensively applied to interface problems, including elliptic interface problems [11,17,18,19,20,21,22,23,24,25,26,27,28] and time-dependent interface problems [9,29,30,31,32,33,34]. Meshes used in the GFEM are simple, fixed, and independent of the location of the interfaces, so the remeshing or mesh refinement required by the FEM is avoided.

It was realized quite early that the GFEM has a conditioning difficulty: the condition numbers of its stiffness matrices may be much larger than those of the standard FEM, and the conditioning may become extremely bad as the interfaces approach the boundaries of elements. The bad conditioning is mainly caused by almost linear dependence between the FE functions and the added special functions. Many interesting ideas have been proposed to improve the conditioning of the GFEM, such as locally adapting the positions of either nodes or interface curves [32,35], preconditioning the stiffness matrices by employing domain decomposition [36], local Cholesky decomposition [5], condensing DOFs at certain nodes [37], and modifying the enrichment functions based on orthogonalization [38]. Recently, a stable GFEM (SGFEM) was introduced in [1,6,39] to improve the conditioning of the GFEM. The main idea of the SGFEM is to modify the enrichment functions by subtracting their FE interpolants. The SGFEM has been applied to crack problems [6,7,8] and interface problems [1,17,20,23,32,40]. The SGFEM for interface problems is proven in [1,20] to have the optimal convergence order, and its conditioning is of the same order as that of the FEM and does not deteriorate as the interfaces approach the boundaries of elements.

To the best of our knowledge, the SGFEM has been developed only for continuous interface problems. Interface problems can be categorized as continuous and discontinuous interface problems (CIPs/DIPs), characterized by

{[u]}_{Γ} = 0

and

{[u]}_{Γ} \neq 0

, respectively, where u is a solution to an interface problem with interface

Γ

, and

{[u]}_{Γ}

is a jump of u across the interface

Γ

. Both cases have many applications (see [11,22,23,32,34,41,42] for CIPs and [10,30,43,44,45,46] for DIPs). The shape functions in the SGFEM (and in some GFEMs [3,9,11,17,20,23,25,31,32,34,41,42,47]) for the interface problem are continuous. Thus, the SGFEM can provide conforming approximations for CIPs. However, continuous shape functions cannot be used directly for the DIP, and penalty techniques typically need to be employed [19,30,48]. We stress that many unfitted FEMs and GFEMs introduce penalty techniques and parameters even to solve the CIP, such as [11,34,41,42]. It is possible to construct an SGFEM for DIPs by changing the enrichments and using the penalty technique mentioned above. However, the penalty parameters are generally problem-dependent and difficult to choose in a uniform way.

In this paper, we aim to design an SGFEM for DIPs that is free from penalty terms and parameters. The main idea is to transform a DIP into an equivalent CIP [45,46,49]. This can be achieved by deriving a function

ϑ

such that the modified solution

U = u - ϑ

satisfies the continuity condition

{[U]}_{Γ} = 0

. Unfortunately, such a

ϑ

is hard to construct, especially in the high dimensions where the interface has complex geometric shapes [45,49]. We construct such

ϑ

using the DNN. Then, the SGFEM for the CIP coupled with such

ϑ

is proposed for the DIP. This idea is motivated by recent pioneering research on solving partial differential equations (PDEs) using the DNN [50,51,52,53,54,55,56,57]. We mention that the DNN has also been applied to many engineering problems, for instance, energy approaches [58] and failure models and predictions [59,60,61]. We take a Poisson equation

- Δ u = f

in a domain

Ω

, for example, which has an essential boundary condition

\bar{u}

on

\partial Ω

. The DNN approach for PDEs [53,54,55,57] minimizes a loss function

Loss = \frac{1}{2} \int_{Ω} \nabla u \cdot \nabla u - \int_{Ω} f u + β \int_{\partial Ω} {(u - \bar{u})}^{2}

(1)

using a certain DNN architecture, where

β

is a parameter that balances the energy terms against the boundary term in (1). The minimization problem based on (1) is not convex, and the method does not exhibit a clear convergence rate [53,55]. In addition, the essential boundary condition affects the accuracy of the DNN [53], and

β

has to be selected carefully. To resolve the essential boundary condition, in [52,54,55], a small DNN is used to minimize the boundary term in (1), and then the PDE is transformed to an equivalent PDE with the (approximately) zero boundary condition, which is solved by another DNN. The small DNN function is defined in the whole domain

Ω

, but only the values on

\partial Ω

are needed in the training process. Thus, such a DNN has a dimension number that is one dimension less than that of

Ω

and can approximate the boundary function efficiently, especially when the boundary

\partial Ω

has complex geometries [52,54,55]. This idea is adopted in this paper to address the discontinuous interface condition, i.e.,

{[u]}_{Γ} \neq 0

.

Specifically, we first use a small DNN to produce a function

ϑ

(defined in

Ω

) by learning the discontinuous condition

{[u]}_{Γ}

such that

{[ϑ]}_{Γ} \approx {[u]}_{Γ}

. As reported in [52,54,55], the jump

{[u]}_{Γ}

can be “learned” accurately and efficiently by the DNN. Then, the DIP is transformed to an (approximately) equivalent CIP based on

ϑ

, which can be solved by the SGFEM for the CIP. The DNN function

ϑ

can be efficiently incorporated into the SGFEM framework because its derivatives are computed by automatic differentiation. Since

{[ϑ]}_{Γ}

does not equal

{[u]}_{Γ}

exactly, there is a conforming error in the SGFEM solution (combined with the DNN function

ϑ

). This error is analyzed mathematically, and the analysis indicates that the proposed SGFEM yields the optimal convergence for the DIP if the interface condition is learned accurately enough. A set of numerical experiments verifies the theoretical results. It is known that the DNN possesses remarkable merits in solving high-dimensional problems and in nonlinear approximation. Therefore, the SGFEM proposed in this paper has great potential for the three-dimensional DIP with interfaces of complex geometries.

The paper is organized as follows. The model problem is described in Section 2. The conventional GFEM and SGFEM are reviewed in Section 3. The SGFEM coupled with the DNN is proposed for the DIP in Section 4. In Section 5, we analyze the approximation error of the proposed method mathematically. Numerical experiments and concluding remarks are presented in Sections 6 and 7, respectively.

2. Model Problem

For a domain

Δ

in

R^{2}

, an integer m, and

1 \leq q \leq \infty

, we denote the usual Sobolev spaces as

W^{m, q} (Δ)

with norm

{∥ \cdot ∥}_{W^{m, q} (Δ)}

and semi-norm

{| \cdot |}_{W^{m, q} (Δ)}

. The space

W^{m, q} (Δ)

will be represented by

H^{m} (Δ)

for

q = 2

and

L^{q} (Δ)

when

m = 0

, respectively.

We consider a bounded and simply connected domain

Ω \subset R^{2}

with a piecewise smooth boundary

\partial Ω

. Let

Γ

be an interface that divides

Ω

into two domains

Ω_{0}

and

Ω_{1}

such that

\bar{Ω} = {\bar{Ω}}_{0} \cup {\bar{Ω}}_{1}

,

Ω_{0} \cap Ω_{1} = \emptyset

, and

Γ = {\bar{Ω}}_{0} \cap {\bar{Ω}}_{1}

. In this study, we consider smooth interfaces

Γ

, as shown in Figure 1.

A point in the Cartesian coordinate system of

R^{2}

is denoted as

P = (x, y)

. Let a be a positive, piecewise-constant function given by

a (P) = \begin{cases} a_{0}, & P \in Ω_{0}, \\ a_{1}, & P \in Ω_{1}, \end{cases}

(2)

where

a_{0}

and

a_{1}

are positive constants,

0 < ζ_{0} \leq a_{r} \leq ζ_{1} < \infty, r = 0, 1

, and

ζ_{0}, ζ_{1} \in R

. Clearly, a is discontinuous along the interface

Γ

.

We are interested in the solution u of the interface problem:

\begin{aligned} - \nabla \cdot (a \nabla u) & = f, & & \text{in } Ω, \\ a \frac{\partial u}{\partial \vec{n}_{b}} & = g, & & \text{on } \partial Ω, \end{aligned}

(3)

subject to non-homogeneous jump conditions on the interface

Γ

{[u]}_{Γ} = ϕ, \quad \text{on } Γ,

(4)

{[a \frac{\partial u}{\partial \vec{n}_{Γ}}]}_{Γ} = ψ, \quad \text{on } Γ,

(5)

where

{\vec{n}}_{b}

and

{\vec{n}}_{Γ}

denote the unit outward normal to the boundary

\partial Ω

and the unit normal to the interface

Γ

directed towards

Ω_{0}

, respectively. The notation

{[v]}_{Γ} : = v_{1} - v_{0}

defines the jump of a quantity v across the interface

Γ

, where

v_{r} : = v |_{\bar{Ω}_{r}}, r = 0, 1

. We mention that if the boundary

\partial Ω

contains a re-entrant corner, u may be singular and may not belong to

H^{2} (Ω_{r}), r = 0, 1

. However, we do not discuss such a situation in this paper and limit the setting to the case

u \in H^{2} (Ω_{r}), r = 0, 1

, as in [42,45,47]. Therefore, we assume that the boundary

\partial Ω

and the data

f, g, ϕ

,

ψ

are given such that the solution

u \in M_{2}

, where

M_{2}

is defined by

M_{2} : = \{u : u |_{Ω_{r}} \in H^{2} (Ω_{r}), r = 0, 1, \text{ and } ∥ \partial^{α} u ∥_{L^{\infty} (Γ)} < \infty, | α | \leq 1\}

(6)

with a norm

∥ u ∥_{M_{2}} = ∥ u_{0} ∥_{H^{2} (Ω_{0})} + ∥ u_{1} ∥_{H^{2} (Ω_{1})} + \sum_{| α | \leq 1} ∥ \partial^{α} u ∥_{L^{\infty} (Γ)}, \forall u \in M_{2} .

The interface condition (4) has an essential effect on the features of the solution and constructions of numerical algorithms. The problem (3)–(5) is referred to as a CIP and a DIP if

ϕ

in (4) is zero and nonzero, respectively. Both the CIP and DIP have many applications, see [11,22,23,32,34,41,42] for the CIP and [10,30,43,44,45,46] for the DIP. The SGFEM was proposed for the CIP to provide conforming approximations [17,20,23,27,40]. In this paper, the SGFEM is generalized to the DIP in a conforming approach that is free from any penalty terms.

3. Conventional GFEM and SGFEM for Interface Problems

We begin with a quasi-uniform finite element mesh

T_{h} = {e_{s}}

of the domain

Ω

with mesh size

0 < h < 1

, where the finite elements

e_{s}

can be triangles or quadrilaterals. We note that the mesh

T_{h}

does not need to match the interface

Γ

. Let

{P_{i}}_{i \in I_{h}}

be the set of finite element nodes associated with the mesh

T_{h}

, where

I_{h}

is the index set of the nodes. For every

i \in I_{h}

, we consider the standard linear (bilinear for quadrilateral element) finite element hat function

ϕ_{i}

. The closure of support of

ϕ_{i}

is denoted by

ω_{i}

, which is called the patch associated with the node

P_{i}

. Since the mesh is quasi-uniform, we assume that

∥ ϕ_{i} ∥_{L^{\infty} (Ω)} = ∥ ϕ_{i} ∥_{L^{\infty} (ω_{i})} \leq 1, ∥ \nabla ϕ_{i} ∥_{L^{\infty} (Ω)} = {∥ \nabla ϕ_{i} ∥}_{L^{\infty} (ω_{i})} \leq C h^{- 1},

(7)

where the positive constant C is independent of h and i. It is well known that

{ϕ_{i}}_{i \in I_{h}}

form a partition of unity (PU) [14,15,16] subordinate to the patches

{ω_{i}}_{i \in I_{h}}

.

The standard FEM subspace to approximate the solution of (3) is given by

S_{h} : = S_{F E M} = span {ϕ_{i} (x) : i \in I_{h}} .

(8)

The FEM yields highly accurate approximations only if the underlying variational problem has a smooth solution. However, it is also well known that an FEM with a quasi-uniform mesh cannot approximate the solution of the interface problem efficiently [23,47].

The generalized or extended FEM (GFEM) [2,3] is a typical technique to approximate the non-smooth solutions to variational problems. The approximation space

S_{h}

of GFEM is obtained by augmenting the finite element space

S_{F E M}

by non-polynomial enrichment space

S_{E N R}

using the enrichment functions

Π_{i}

as follows:

S_{h} = S_{F E M} \oplus S_{E N R} \quad \text{and} \quad S_{E N R} = \text{span} \{ϕ_{i} Π_{i} : i \in I_{h, e n r} \text{ and } I_{h, e n r} \subset I_{h}\},

(9)

where the enrichment functions

Π_{i}

are problem-dependent and mimic the non-smooth exact solutions of the underlying variational problem. The nodes indexed by

I_{h, e n r}

, are called the enriched nodes. The choice of

I_{h, e n r} \subset I_{h}

can vary and may also be problem-dependent.

The enrichment functions

Π_{i}

used in the GFEM to approximate the solution of an interface problem with a smooth interface are generally based on a distance function (or the absolute value of a level set function) [3,18,23]

D (P) = dist (P, Γ) .

In a conventional GFEM for the interface problems, the approximate subspace is

S_{h} = S_{F E M} \oplus S_{E N R} \quad \text{and} \quad S_{E N R} = \text{span} \{ϕ_{i} D : i \in I_{h, e n r} = I_{h, e n r}^{Γ}\},

(10)

and

I_{h, e n r}^{Γ} = \{i \in I_{h} : P_{i} \in e_{s} \text{ where } \mathring{e}_{s} \cap Γ \neq \emptyset\} .

It is easy to see that

Card (I_{h, e n r}^{Γ}) : = \text{cardinality of } I_{h, e n r}^{Γ} = O (h^{- 1}) .

(11)
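The growth rate in (11) can be checked numerically. The following is a minimal sketch, assuming a uniform mesh of square elements on the unit square and a circular interface; the function name `enriched_nodes`, the circle center, and the radius are hypothetical choices made for illustration, and a cut element is detected by a sign change of the level set at its vertices, which only approximates the exact intersection test in the definition above.

```python
import numpy as np

def enriched_nodes(n, center=(0.503, 0.497), radius=0.3):
    """Return (i, j) indices of nodes that belong to elements cut by a circle.

    The unit square is meshed by n x n square elements. An element is flagged
    as cut when the level set changes sign among its four vertices, which is
    an approximation of "the interior of e_s intersects Gamma".
    """
    xs = np.linspace(0.0, 1.0, n + 1)
    X, Y = np.meshgrid(xs, xs, indexing="ij")
    level = np.hypot(X - center[0], Y - center[1]) - radius  # signed distance
    # Level-set values at the four vertices of every element.
    corners = np.stack([level[:-1, :-1], level[1:, :-1],
                        level[:-1, 1:], level[1:, 1:]])
    cut = (corners.min(axis=0) < 0.0) & (corners.max(axis=0) > 0.0)
    enriched = np.zeros((n + 1, n + 1), dtype=bool)
    # Every vertex of a cut element becomes an enriched node.
    enriched[:-1, :-1] |= cut
    enriched[1:, :-1] |= cut
    enriched[:-1, 1:] |= cut
    enriched[1:, 1:] |= cut
    return np.argwhere(enriched)

# Number of enriched nodes for successively halved mesh sizes h = 1/n.
counts = {n: len(enriched_nodes(n)) for n in (16, 32, 64, 128)}
```

Halving h should roughly double the entries of `counts`, whereas the total number of nodes grows by a factor of about four.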

The GFEM based on

I_{h, e n r}^{Γ}

is referred to as a topological GFEM [3]. It was known early (e.g., [23]) that the topological GFEM only produces the convergence order

O (\sqrt{h})

in the energy norm, which falls short of the optimal order

O (h)

. The optimal convergence can be attained by the geometric GFEM [3,23] and corrected GFEM [3,22]. However, the geometric GFEM introduces many more enriched degrees of freedom (DOFs) than the topological ones, and its conditioning is of

O (h^{- 4})

[23], which is much larger than that of the FEM. Meanwhile, the conditioning of the corrected GFEM may not be stable because it may “blow up” as the interface

Γ

approaches the nodes of the mesh [27].

Recently, a simple local procedure of subtracting the interpolant was introduced in [1,6,17,20,23,39,40] to address the bad conditioning of GFEM, and the modified GFEM is referred to as a stable GFEM (SGFEM). Specifically, the approximate subspace of the SGFEM for the interface problems is given by

S_{h} = S_{F E M} \oplus S_{E N R} \quad \text{and} \quad S_{E N R} = \text{span} \{ϕ_{i} (D - I_{h} D) : i \in I_{h, e n r} = I_{h, e n r}^{Γ}\},

(12)

where

I_{h} f

is the FEM interpolant of a continuous function f based on

S_{F E M}

. It was shown in [17] that the SGFEM (12) for the elliptic interface problem (a) reaches the optimal convergence order

O (h)

in energy norm, (b) has a scaled condition number (SCN) of stiffness matrices

O (h^{- 2})

that is of the same order as the FEM, and (c) is robust in that the convergence and SCN do not depend on the relative positions of the mesh and interfaces.
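The effect of subtracting the interpolant in (12) is easiest to see in one dimension. The sketch below is an illustrative 1D analogue, not the paper's 2D implementation: the interface is a single hypothetical point `gamma`, the distance function is D(x) = |x - gamma|, and `np.interp` plays the role of the piecewise-linear FE interpolant I_h.

```python
import numpy as np

def sgfem_enrichment(x, nodes, gamma):
    """Evaluate D - I_h D at the points x, where D(x) = |x - gamma|."""
    D = np.abs(x - gamma)
    # Piecewise-linear interpolation of the nodal values of D (the FE interpolant).
    I_h_D = np.interp(x, nodes, np.abs(nodes - gamma))
    return D - I_h_D

nodes = np.linspace(0.0, 1.0, 11)   # uniform mesh with h = 0.1
gamma = 0.537                       # hypothetical interface location
x = np.linspace(0.0, 1.0, 2001)     # fine evaluation grid
w = sgfem_enrichment(x, nodes, gamma)
```

In this setting the modified enrichment `w` vanishes at every node and is nonzero only inside the element that contains `gamma`, because D is linear, and hence reproduced exactly by the interpolant, on all other elements. This locality is what removes the near linear dependence between the FE part and the enrichment part.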

Remark 1.

We mention that there are other options for the enrichments of the GFEM of the interface problems, such as [11,41,48], where

Π_{i} = H

is used as the enrichment, and H is the Heaviside function taking the value 1 in

Ω_{0}

and

- 1

in

Ω_{1}

. This scheme leads to a non-conforming formulation because of the discontinuity of H, and a penalty technique needs to be developed to deal with the discontinuity. This paper presents conforming methods in which the variational formulations are standard, without any penalty terms or parameters.

We next illustrate the scaled condition number (SCN) of the stiffness matrices of the GFEM or SGFEM. For simplicity, we re-arrange the order of the shape functions so that the stiffness or mass matrices

B

of GFEM have the form

B = \begin{bmatrix} B_{11} & B_{12} \\ B_{12}^{T} & B_{22} \end{bmatrix},

(13)

where

B_{11}

and

B_{22}

are associated with the FE part and the enrichment part of GFEM or SGFEM, respectively. We note that

B_{11}

is the standard finite element matrix with respect to the standard finite element triangulation used to define the GFEM. Consider the matrix

D = \begin{bmatrix} D_{11} & 0 \\ 0 & D_{22} \end{bmatrix},

where

D_{11}

and

D_{22}

are diagonal matrices with

{(D_{11})}_{i i} = {(B_{11})}_{i i}^{- 1 / 2} and {(D_{22})}_{i i} = {(B_{22})}_{i i}^{- 1 / 2} .

(14)

We define

\hat{B} : = D B D

and

{\hat{B}}_{11} : = D_{11} B_{11} D_{11}

. The SCNs of

B

and

B_{11}

are defined by

K : = κ (\hat{B}), K_{F E M} : = κ ({\hat{B}}_{11}),

respectively, where

κ (B)

is a condition number of a matrix

B

. It is known that

K_{F E M} = O (h^{- 2})

for the stiffness matrices of FEM.
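The diagonal scaling in (13) and (14) is straightforward to reproduce. The following minimal sketch computes the SCN of a symmetric positive definite matrix; as a stand-in for B_{11} it uses the 1D linear FE stiffness matrix of -u'' on (0, 1) with homogeneous Dirichlet ends, which is an assumption made here so that the matrix is nonsingular (the model problem (3) itself carries a Neumann condition). The helper names are hypothetical.

```python
import numpy as np

def scaled_condition_number(B):
    """Return kappa(D B D) with D = diag(B_ii^{-1/2}) for an SPD matrix B."""
    d = 1.0 / np.sqrt(np.diag(B))
    B_hat = d[:, None] * B * d[None, :]   # D B D without forming D explicitly
    return np.linalg.cond(B_hat)          # 2-norm condition number

def fem_stiffness_1d(n):
    """Linear FE stiffness matrix of -u'' on (0, 1), n elements, Dirichlet ends."""
    h = 1.0 / n
    main = 2.0 / h * np.ones(n - 1)
    off = -1.0 / h * np.ones(n - 2)
    return np.diag(main) + np.diag(off, 1) + np.diag(off, -1)

# SCN for successively halved mesh sizes h = 1/n.
scn = {n: scaled_condition_number(fem_stiffness_1d(n)) for n in (16, 32, 64)}
```

Halving h should multiply the SCN by roughly four, in line with the O(h^{-2}) behavior quoted above. The SCN is also invariant under any symmetric positive diagonal rescaling of B, which is why it is a convenient measure when FE and enrichment shape functions have very different magnitudes.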

The relevant difficulties of GFEM consist of (i) stability:

K

may be much bigger than

K_{F E M}

, and (ii) robustness:

K

may blow up as the interfaces are close to the boundaries of elements [1,2,3,23]. These are caused by the almost linear dependence of subspaces

S_{F E M}

and

S_{E N R}

. The SGFEM is a stable and robust GFEM, and it has been applied successfully to crack and interface problems [1,6,17,20,23,39,40].

The convergence of SGFEM for the CIP

The SGFEM for the CIP has been studied in [17,23,27,40], and the associated variational formulation based on the SGFEM space (12) is as follows:

Find u_{S G, h} \in S_{h} such that B (u_{S G, h}, v_{h}) = L (v_{h}), \forall v_{h} \in S_{h},

(15)

where

B (u, v) : = \int_{Ω} a \nabla u \cdot \nabla v \, d P, \quad L (v) : = \int_{Ω} f v \, d P + \int_{\partial Ω} g v \, d s + \int_{Γ} ψ v \, d s .

(16)

We define

E (Ω)

to be the energy space with respect to the CIP given by

E (Ω) : = \{u \in H^{1} (Ω) : ∥ u ∥_{E (Ω)}^{2} : = B (u, u) < \infty \text{ and } {[u]}_{Γ} = 0 \text{ on } Γ\} .

(17)

It is obvious that

S_{h}

in (12) is a subspace of

E (Ω)

. The convergence of

u_{S G, h}

to u was proven in [17,27,40], and we present it here without its proof.

Theorem 1.

Suppose that

u \in M_{2}

is the solution of the CIP ((3)–(5) with

ϕ = 0

), and

u_{S G, h}

is the SGFEM solution of (15) based on the finite-dimensional subspace

S_{h}

(12), then there exists

C > 0

independent of h such that

∥ u - u_{S G, h} ∥_{E (Ω)} \leq C h {∥ u ∥}_{M_{2}} .

(18)

In the next section, the SGFEM of the CIP is generalized to the DIP in a conforming approach that is free from any penalty terms by coupling a DNN.

4. SGFEM Coupled with DNN for DIP

We first employ the DNN to learn a function that approximates the discontinuous interface condition. Then, by coupling with the DNN function, we reformulate the DIP into an (approximately) equivalent continuous model, which can be solved by SGFEM.

Let

ϑ_{0}

be a function in

H^{2} (Ω_{0})

with

ϑ_{0} |_{Γ} = - ϕ .

(19)

Define

\tilde{u}

to be a function on

Ω

as

\tilde{u} = \begin{cases} ϑ_{0}, & \text{in } \bar{Ω}_{0}, \\ 0, & \text{in } Ω_{1}, \end{cases}

(20)

then

\tilde{u}

satisfies the interface condition (4) because

{[\tilde{u}]}_{Γ} = 0 - ϑ_{0} |_{Γ} = 0 - (- ϕ) = ϕ

. Define a function

\tilde{U} = u - \tilde{u}

, then

\tilde{U} = \begin{cases} u_{0} - ϑ_{0}, & \text{in } Ω_{0}, \\ u_{1}, & \text{in } Ω_{1} . \end{cases}

(21)

It is easy to check that

\tilde{U}

satisfies the continuous interface problem, i.e.,

{[\tilde{U}]}_{Γ} = {[u]}_{Γ} - {[\tilde{u}]}_{Γ} = 0

. The model problem (3) with the discontinuous interface conditions (4) and (5) is thus transformed into an equivalent problem for

\tilde{U}

with the continuous interface condition:

- \nabla \cdot (a \nabla \tilde{U}) = f + \nabla \cdot (a \nabla \tilde{u}), \quad \text{in } Ω,

(22)

a \frac{\partial \tilde{U}}{\partial \vec{n}_{b}} = g - a \frac{\partial \tilde{u}}{\partial \vec{n}_{b}}, \quad \text{on } \partial Ω,

(23)

{[\tilde{U}]}_{Γ} = 0, \quad \text{on } Γ,

(24)

{[a \frac{\partial \tilde{U}}{\partial \vec{n}_{Γ}}]}_{Γ} = ψ - {[a \frac{\partial \tilde{u}}{\partial \vec{n}_{Γ}}]}_{Γ} = ψ + a_{0} \frac{\partial \tilde{u}}{\partial \vec{n}_{Γ}}, \quad \text{on } Γ .

(25)
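The lifting used in (19)–(21) can be illustrated with a one-dimensional toy case. The sketch below is only a sanity check under made-up data: the interface point `gamma`, the jump value `phi`, the "solution" `u`, and the linear choice of `theta0` are all hypothetical, with Ω_0 = (0, gamma) and Ω_1 = (gamma, 1).

```python
import numpy as np

gamma = 0.4    # hypothetical interface point
phi = 1.5      # prescribed jump [u] = u_1 - u_0 at gamma

def u(x):
    """A made-up discontinuous function with jump phi across gamma."""
    return np.where(x < gamma, np.sin(x), np.sin(x) + phi)

def theta0(x):
    """Any smooth function with theta0(gamma) = -phi works; this one is linear."""
    return -phi * x / gamma

def u_tilde(x):
    """The lifting (20): theta0 in Omega_0 and zero in Omega_1."""
    return np.where(x < gamma, theta0(x), 0.0)

def U_tilde(x):
    """The shifted function (21), which should be continuous across gamma."""
    return u(x) - u_tilde(x)

eps = 1e-9
jump_u = u(gamma + eps) - u(gamma - eps)              # approximately phi
jump_U = U_tilde(gamma + eps) - U_tilde(gamma - eps)  # approximately zero
```

Only the value of `theta0` at the interface matters for removing the jump, which is the reason the DNN introduced later is trained on interface samples alone.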

We note that the RHSs of (22)–(25) can be simplified because

\tilde{u} = 0

in

Ω_{1}

, for instance, in the boundary condition (23)

g - a \frac{\partial \tilde{u}}{\partial {\vec{n}}_{b}} = g

if

Γ \subset Ω

. The variational formulation of (22)–(25) is the following:

Find \tilde{U} \in E (Ω) such that B (\tilde{U}, V) = \tilde{L} (V), \forall V \in E (Ω),

(26)

where

\begin{aligned}
\tilde{L} (V) & = \int_{Ω} f V \, d P + \int_{Ω_{0}} (a_{0} Δ \tilde{u}) V \, d P + \int_{\partial Ω} g V \, d s - \int_{\partial Ω \cap \partial Ω_{0}} a_{0} \frac{\partial \tilde{u}}{\partial \vec{n}_{b}} V \, d s + \int_{Γ} \left( ψ - {\left[ a \frac{\partial \tilde{u}}{\partial \vec{n}_{Γ}} \right]}_{Γ} \right) V \, d s \\
& = \int_{Ω} f V \, d P + \int_{\partial Ω} g V \, d s + \int_{Γ} ψ V \, d s + \left[ \int_{Ω_{0}} (a_{0} Δ \tilde{u}) V \, d P - \int_{\partial Ω \cap \partial Ω_{0}} a_{0} \frac{\partial \tilde{u}}{\partial \vec{n}_{b}} V \, d s + \int_{Γ} a_{0} \frac{\partial \tilde{u}}{\partial \vec{n}_{Γ}} V \, d s \right] \\
& = \int_{Ω} f V \, d P + \int_{\partial Ω} g V \, d s + \int_{Γ} ψ V \, d s - \int_{Ω_{0}} a_{0} \nabla \tilde{u} \cdot \nabla V \, d P,
\end{aligned}

(27)

and

B (\cdot, \cdot)

and

E (Ω)

are defined in (16) and (17), respectively. Note that in the second equality of (27)

{\vec{n}}_{Γ}

is directed towards

Ω_{0}

.
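The last equality in (27) follows from Green's formula on Ω_0. The step can be written out as below, assuming \tilde{u} is smooth enough on Ω_0 (for instance ϑ_0 ∈ H^2(Ω_0)) and using that the outward normal of Ω_0 equals \vec{n}_b on ∂Ω ∩ ∂Ω_0 and -\vec{n}_Γ on Γ, since \vec{n}_Γ is directed towards Ω_0.

```latex
% Green's formula on \Omega_0, with outward normal \vec{n}_b on
% \partial\Omega \cap \partial\Omega_0 and -\vec{n}_\Gamma on \Gamma:
\begin{aligned}
\int_{\Omega_0} (a_0 \Delta \tilde{u}) V \, dP
 &= -\int_{\Omega_0} a_0 \nabla \tilde{u} \cdot \nabla V \, dP
    + \int_{\partial\Omega \cap \partial\Omega_0} a_0 \frac{\partial \tilde{u}}{\partial \vec{n}_b} V \, ds
    - \int_{\Gamma} a_0 \frac{\partial \tilde{u}}{\partial \vec{n}_\Gamma} V \, ds .
\end{aligned}
% Inserting this into the bracket in the second line of (27), the two
% boundary integrals cancel and only the volume term remains:
\begin{aligned}
\int_{\Omega_0} (a_0 \Delta \tilde{u}) V \, dP
 - \int_{\partial\Omega \cap \partial\Omega_0} a_0 \frac{\partial \tilde{u}}{\partial \vec{n}_b} V \, ds
 + \int_{\Gamma} a_0 \frac{\partial \tilde{u}}{\partial \vec{n}_\Gamma} V \, ds
 = -\int_{\Omega_0} a_0 \nabla \tilde{u} \cdot \nabla V \, dP .
\end{aligned}
```

The final form of \tilde{L} therefore needs only first derivatives of \tilde{u} in Ω_0, and no second derivatives or boundary traces of its normal derivative.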

We note that

\tilde{L}

in variational formula (26) depends on the unknown

\tilde{u}

, and cannot be calculated. It is easy to see that the evaluations of

\tilde{u}

on

Γ

(19) rather than in

Ω

are essential for the derivation of the equivalent CIP (22)–(25). This leads us to construct a function

{\tilde{u}}^{θ_{*}}

(defined in

Ω

) using a DNN such that

{\tilde{u}}^{θ_{*}}

mimics the condition (19) with high precision, where

θ_{*}

are parameters in the DNN algorithm. Such a

{\tilde{u}}^{θ_{*}}

is available, and the associated variational formula is solvable.

To this end, we take

N_{Γ}

sampling points

Q_{i}

uniformly distributed on

Γ

. Let

X = {[Q_{i}]}_{i = 1}^{N_{Γ}}

and

Y = {[ϕ (Q_{i})]}_{i = 1}^{N_{Γ}}

be the input and output sets, respectively, for training the DNN. The loss function for training the DNN is defined by

Loss (X; θ) = \frac{1}{N_{Γ}} \sum_{i = 1}^{N_{Γ}} {[ϑ^{θ} (Q_{i}) - (- ϕ (Q_{i}))]}^{2}, \forall ϑ^{θ} \in S_{D N N},

(28)

where

S_{D N N}

is a subspace (defined in

Ω

) generated by a DNN with parameters

θ = {W_{i}, b_{i}}

;

W_{i}, b_{i}

are the weights and biases in the DNN, respectively. A DNN function

ϑ^{θ_{*}}

with certain parameter

θ_{*}

is obtained by solving the following minimization problem based on the loss function (28):

min_{ϑ^{θ} \in S_{D N N}} Loss (X; θ) .

(29)

ϑ^{θ_{*}}

belongs to

C^{2} (\bar{Ω})

if the activation function is chosen as the sigmoid function [62,63].
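To make the training problem (28) and (29) concrete, the following is a minimal NumPy sketch under several assumptions that are not taken from the paper: the interface is a hypothetical circle, the jump data `phi` is made up, the network is a single hidden sigmoid layer standing in for the ResNet described later, and full-batch gradient descent with hand-written gradients replaces SGD with automatic differentiation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Sampling points Q_i, uniformly spaced on a hypothetical circular interface.
N_gamma = 200
t = np.linspace(0.0, 2.0 * np.pi, N_gamma, endpoint=False)
Q = np.column_stack([0.5 + 0.3 * np.cos(t), 0.5 + 0.3 * np.sin(t)])
phi = Q[:, 0] + Q[:, 1]     # hypothetical jump data phi(Q_i)
target = -phi               # the network should reproduce -phi on Gamma, see (19)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# One hidden layer of width m (a simplified stand-in for the paper's ResNet).
m = 10
params = {"W1": rng.normal(size=(m, 2)), "b1": np.zeros(m),
          "w2": np.zeros(m), "b2": 0.0}

def forward(p, pts):
    """Return the network values at pts and the hidden activations."""
    a = sigmoid(pts @ p["W1"].T + p["b1"])
    return a @ p["w2"] + p["b2"], a

def loss_and_grads(p):
    """Discrete loss (28) and its gradient with respect to all parameters."""
    out, a = forward(p, Q)
    r = out - target
    loss = np.mean(r ** 2)
    d_out = 2.0 * r / N_gamma
    d_z = np.outer(d_out, p["w2"]) * a * (1.0 - a)   # chain rule through sigmoid
    grads = {"W1": d_z.T @ Q, "b1": d_z.sum(axis=0),
             "w2": a.T @ d_out, "b2": d_out.sum()}
    return loss, grads

initial_loss, _ = loss_and_grads(params)
for _ in range(3000):                                # plain gradient descent
    _, g = loss_and_grads(params)
    for key in params:
        params[key] = params[key] - 0.05 * g[key]
final_loss, _ = loss_and_grads(params)
```

The training set lives on the curve Γ only, so its size scales with the length of the interface rather than with the area of the domain, while the trained function can still be evaluated, together with its gradient, anywhere in Ω_0 when assembling the right-hand side.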

Similarly to the definition of

\tilde{u}

(20) we define a function on

Ω

as

{\tilde{u}}^{θ_{*}} = \begin{cases} ϑ^{θ_{*}}, & \text{in } Ω_{0}, \\ 0, & \text{in } Ω_{1}, \end{cases}

(30)

and the variational formula based on

{\tilde{u}}^{θ_{*}}

is proposed as follows:

Find {\tilde{U}}^{θ_{*}} \in E (Ω) such that B ({\tilde{U}}^{θ_{*}}, V) = {\tilde{L}}^{θ_{*}} (V), \forall V \in E (Ω),

(31)

where

\begin{aligned}
{\tilde{L}}^{θ_{*}} (V) & = \int_{Ω} f V \, d P + \int_{Ω_{0}} (a_{0} Δ {\tilde{u}}^{θ_{*}}) V \, d P + \int_{\partial Ω} g V \, d s - \int_{\partial Ω \cap \partial Ω_{0}} a_{0} \frac{\partial {\tilde{u}}^{θ_{*}}}{\partial \vec{n}_{b}} V \, d s + \int_{Γ} \left( ψ - {\left[ a \frac{\partial {\tilde{u}}^{θ_{*}}}{\partial \vec{n}_{Γ}} \right]}_{Γ} \right) V \, d s \\
& = \int_{Ω} f V \, d P + \int_{\partial Ω} g V \, d s + \int_{Γ} ψ V \, d s - \int_{Ω_{0}} a_{0} \nabla {\tilde{u}}^{θ_{*}} \cdot \nabla V \, d P,
\end{aligned}

(32)

and the function

u^{θ_{*}} : = {\tilde{U}}^{θ_{*}} + {\tilde{u}}^{θ_{*}}

serves as the approximate solution to u. We stress that (31) is not exactly equivalent to the initial problem (3) because, according to (28) and (29),

{[{\tilde{u}}^{θ_{*}}]}_{Γ} = - ϑ^{θ_{*}} |_{Γ} \approx ϕ \quad (\text{but } \neq ϕ \text{ in general}), \quad \text{on } Γ .

This conforming error will be analyzed in the next section.

Unlike the variational problem (26), the problem (31) is computable because

{\tilde{u}}^{θ_{*}}

is obtained using the DNN. The problem (31) is discretized using the SGFEM subspace for the CIP, and the associated variational problem is

Find {\tilde{U}}_{h}^{θ_{*}} \in S_{h} such that B ({\tilde{U}}_{h}^{θ_{*}}, V_{h}) = {\tilde{L}}^{θ_{*}} (V_{h}), \forall V_{h} \in S_{h} .

(33)

Finally,

u_{h}^{θ_{*}} : = {\tilde{U}}_{h}^{θ_{*}} + {\tilde{u}}^{θ_{*}}

serves as the approximate solution to the initial problem (3), and the approximation error of

u_{h}^{θ_{*}}

will be analyzed mathematically in the next section. The formulation (33) is called the SGFEM coupled with the DNN for the DIP because it involves the DNN function

{\tilde{u}}^{θ_{*}}

.

Remark 2.

The idea to transform the DIP into a CIP using the DNN is motivated by [52,54,55]. In [52,54,55], non-homogeneous boundary conditions are transformed into homogeneous ones using a shallow DNN, and the associated (approximately) homogeneous equations are solved by another DNN. This approach has advantages over conventional methods (e.g., FEM, GFEM) in that it is meshless and can solve problems with boundaries of complex geometries, and is powerful for high-dimensional problems. We couple the SGFEM for the CIP with the shallow DNN to address the DIP in this paper. We achieve that (a) all the advantages of SGFEM are maintained for the DIP, (b) the proposed SGFEM for the DIP is a conforming method free from any penalty terms, and (c) the method has great potential for geometrically complex interfaces, especially in three dimensions (reported in a forthcoming study).

We now discuss the computational cost of the proposed method (33). First, the stiffness matrices of (33) are exactly the same as those of the SGFEM for the CIP, and only the RHS (32) of (33) needs special treatment. Therefore, the assembly of the stiffness matrices and the construction of the RHS can be carried out separately or in parallel. Second, the computational dimension of learning

ϑ^{θ_{*}}

based on (29) is one less than the space dimension because the sampling points are located on the interface curve, and the learning time is small in comparison with the assembly of the stiffness matrices. Moreover, automatic differentiation is available in existing DNN frameworks and saves computational time. Therefore, the proposed SGFEM coupled with the DNN is computationally efficient.

At the end of this section, we describe the structure of the DNN used in (28) and (29). The DNN in this paper is a deep residual network (ResNet) [53,64,65] based on fully connected layers. Such a network structure was adopted in [53] to solve PDEs. The ResNet is an improvement of the conventional DNN. The ResNet can fit high-dimensional functions better, and its fitting ability is not affected by the network width. The ResNet can significantly speed up training, increase precision, reduce network degradation, and improve the representation ability of the network.

The ResNet was formally proposed in [64] and is obtained by stacking residual blocks successively. Each residual block consists of two fully connected layers, and its output is obtained by adding the output of the last layer to the input of the residual block. This structure leads to significant improvement in the training speed and approximation error. Let l be a positive integer, and suppose that each layer of the ResNet has l neurons. Let

z_{i} \in R^{l}

be the vector consisting of the activation values of the l neurons in the i-th layer,

W_{i}

be an

l \times l

weight matrix, and

b_{i} \in R^{l}

be a real vector. Furthermore, let

σ

be an activation function. In a ResNet, the vectors

z_{i + 1}

and

z_{i}

satisfy

z_{i + 1} = σ (W_{i} z_{i} + b_{i}),

where

b_{i}

is called the bias term. Let

φ_{i} (z) : = σ (W_{i} z + b_{i})

,

z \in R^{l}

. Let

B_{j}

be the output of the j-th residual block, with B_{0} denoting the input of the first block, so that

B_{j} : = B_{j - 1} + φ_{2 j} \circ φ_{2 j - 1} (B_{j - 1}) .

Let

ψ_{j} (z) : = z + φ_{2 j} \circ φ_{2 j - 1} (z)

,

z \in R^{l}

. The relationship between the input X and output Y of the ResNet with N residual blocks is

Y = φ_{2 N + 1} \circ ψ_{N} \circ \cdots \circ ψ_{2} \circ ψ_{1} (X) .

In our computations, we use a ResNet with three such blocks, see Figure 2. The parameters

W_{i}

and

b_{i}

for

i \in {1, 2, \dots, 2 N + 1}

are derived by solving the minimization problem (29) based on the loss function (28). These parameters are updated using the back propagation algorithm based on gradients of the loss function with respect to the parameters. In this paper, we use the stochastic gradient descent (SGD) method [66,67] for solving (29). The sigmoid function [62,63] serves as the activation function.
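The composition of residual blocks described above can be sketched as a forward pass in NumPy. Two details below are assumptions added for illustration because the text does not specify them: the 2D input point is zero-padded to the layer width l, and the last layer is taken to be a linear map to a scalar instead of an l × l sigmoid layer, so that the output can take arbitrary real values. The helper names and the random initialization are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def make_resnet(l, n_blocks, rng):
    """Random parameters: two l x l layers per block plus one output layer."""
    blocks = [[(rng.normal(size=(l, l)) / np.sqrt(l), np.zeros(l)) for _ in range(2)]
              for _ in range(n_blocks)]
    out = (rng.normal(size=(1, l)) / np.sqrt(l), np.zeros(1))
    return blocks, out

def residual_block(z, layers):
    """Compute z + phi_b(phi_a(z)) with phi(z) = sigmoid(W z + b)."""
    y = z
    for W, b in layers:
        y = sigmoid(W @ y + b)
    return z + y                      # skip connection adds the block input

def resnet_forward(point, blocks, out, l):
    z = np.zeros(l)
    z[:len(point)] = point            # assumption: zero-pad the input to width l
    for layers in blocks:
        z = residual_block(z, layers)
    W, b = out
    return (W @ z + b)[0]             # assumption: linear scalar output layer

rng = np.random.default_rng(1)
l, N = 10, 3                          # width l and N = 3 residual blocks
blocks, out = make_resnet(l, N, rng)
value = resnet_forward(np.array([0.8, 0.5]), blocks, out, l)
```

Each block changes its input by a vector with entries in (0, 1), since the correction is the output of a sigmoid layer; the skip connection passes the input through unchanged, which is the mechanism usually credited with easing the training of deeper networks.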

5. Convergence Analysis

We prove the convergence of the SGFEM solution coupled with the DNN,

u_{h}^{θ_{*}}

, in this section. The approximation error of

u_{h}^{θ_{*}}

involves two parts: the SGFEM error and the learning error of DNN. For any

ϑ \in M_{2}

(see (6)), let

ϑ_{r} : = ϑ |_{Ω_{r}}, r = 0, 1,

then

ϑ_{r} \in H^{2} (Ω_{r}), r = 0, 1

. Since

Γ

is smooth,

ϑ_{0}

and

ϑ_{1}

can be continuously extended to the whole domain

Ω

to obtain functions

{\tilde{ϑ}}_{0}

and

{\tilde{ϑ}}_{1}

in

H^{2} (Ω)

such that

{\tilde{ϑ}}_{r} = ϑ_{r} \text{ on } Ω_{r} \quad \text{and} \quad ∥ {\tilde{ϑ}}_{r} ∥_{H^{2} (Ω)} \leq C ∥ ϑ_{r} ∥_{H^{2} (Ω_{r})}, r = 0, 1,

(34)

where C is a positive constant independent of h (see Theorem 1.4.5 in [68]).

Let

ε

represent the learning error level of

{\tilde{u}}^{θ_{*}}

, i.e.,

∥ {\tilde{u}}^{θ_{*}} - \tilde{u} ∥_{L^{\infty} (Γ)} = {∥ ϑ^{θ_{*}} - (- ϕ) ∥}_{L^{\infty} (Γ)} \leq ε .

(35)

We first establish a relevant approximation result.

Lemma 1.

Suppose

w \in H^{k + 1} ({\bar{Ω}}_{0}), k \geq 1

and

{∥ w ∥}_{L^{\infty} (Γ)} \leq ε

, then there is

\hat{w} \in H^{1} ({\bar{Ω}}_{0})

satisfying

\hat{w} |_{Γ} = 0

such that

∥ w - \hat{w} ∥_{H^{1} ({\bar{Ω}}_{0})} \leq C ε^{\frac{k}{k + 1}} {| w |}_{H^{k + 1} ({\bar{Ω}}_{0})}^{\frac{1}{k + 1}},

(36)

where C is a constant independent of ε and k.

Proof.

We divide

Ω_{0}

using a quasi-uniform mesh fitting the interface

Γ

. The mesh-size parameter is denoted by l. We note that such a mesh is used only for obtaining associated estimates, and not used for actual computation. Let

O_{j}

and

ψ_{j}

,

j \in χ_{l}

be the FE nodes and FE functions of degree k associated with the mesh, respectively, and

I_{l} w = \sum_{j \in χ_{l}} w (O_{j}) ψ_{j}

be the standard FE interpolant of w. The index set

χ_{l}

is divided into

χ_{l}^{'}

and

χ_{l}^{''}

, which consist of indices of nodes on the interface

Γ

and in the interior of

Ω_{0}

, respectively. It is easy to see that

I_{l}^{''} w : = \sum_{j \in χ_{l}^{''}} w (O_{j}) ψ_{j}

belongs to

H^{1} ({\bar{Ω}}_{0})

and satisfies

I_{l}^{''} {w |}_{Γ} = 0

. Based on the error estimate of FEM interpolation [68] we have

\begin{matrix} {∥ w - I_{l}^{''} w ∥}_{H^{1} ({\bar{Ω}}_{0})} & \leq & {∥ w - I_{l} w ∥}_{H^{1} ({\bar{Ω}}_{0})} + {∥\sum_{j \in χ_{l}^{'}} w (O_{j}) ψ_{j}∥}_{H^{1} ({\bar{Ω}}_{0})} \\ & \leq & C l^{k} {| w |}_{H^{k + 1} ({\bar{Ω}}_{0})} + \sum_{j \in χ_{l}^{'}} | w (O_{j}) | {∥ ψ_{j} ∥}_{H^{1} ({\bar{Ω}}_{0})} \\ & \leq & C l^{k} {| w |}_{H^{k + 1} ({\bar{Ω}}_{0})} + ε \sum_{j \in χ_{l}^{'}} {∥ ψ_{j} ∥}_{H^{1} ({\bar{Ω}}_{0})} \\ & \leq & C l^{k} {| w |}_{H^{k + 1} ({\bar{Ω}}_{0})} + ε \sum_{j \in χ_{l}^{'}} C^{'} \leq C l^{k} {| w |}_{H^{k + 1} ({\bar{Ω}}_{0})} + C^{'} ε \frac{1}{l}, \end{matrix}

(37)

where the last inequality holds because the number of nodes on

Γ

,

| χ_{l}^{'} |

, is

O (\frac{1}{l})

. Then letting

l = {(\frac{ε}{{| w |}_{H^{k + 1} ({\bar{Ω}}_{0})}})}^{\frac{1}{k + 1}}

in (37) yields

{∥ w - I_{l}^{''} w ∥}_{H^{1} ({\bar{Ω}}_{0})} \leq C ε^{\frac{k}{k + 1}} {| w |}_{H^{k + 1} ({\bar{Ω}}_{0})}^{\frac{1}{k + 1}} + C^{'} ε^{\frac{k}{k + 1}} {| w |}_{H^{k + 1} ({\bar{Ω}}_{0})}^{\frac{1}{k + 1}} \leq C ε^{\frac{k}{k + 1}} {| w |}_{H^{k + 1} ({\bar{Ω}}_{0})}^{\frac{1}{k + 1}} .

(38)

Let

\hat{w} = I_{l}^{''} w

, then we obtain the desired estimate (36) from (38). □
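To make the choice of l in the proof explicit, the following short calculation shows that it balances the two terms on the right-hand side of (37). The shorthand A for the seminorm of w is introduced here only for readability and does not appear in the paper.

```latex
% Shorthand (introduced here for illustration): A := |w|_{H^{k+1}(\bar{\Omega}_0)}.
\begin{aligned}
l^{k} A = \frac{\varepsilon}{l}
  \;&\Longleftrightarrow\; l^{k+1} = \frac{\varepsilon}{A}
  \;\Longleftrightarrow\; l = \Big(\frac{\varepsilon}{A}\Big)^{\frac{1}{k+1}}, \\
l^{k} A &= \Big(\frac{\varepsilon}{A}\Big)^{\frac{k}{k+1}} A
  = \varepsilon^{\frac{k}{k+1}} A^{\frac{1}{k+1}}, \\
\frac{\varepsilon}{l} &= \varepsilon \Big(\frac{A}{\varepsilon}\Big)^{\frac{1}{k+1}}
  = \varepsilon^{\frac{k}{k+1}} A^{\frac{1}{k+1}} .
\end{aligned}
```

With this l, both terms in (37) are of the same order, which is exactly the bound stated in (38).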

Theorem 2.

Suppose that

u \in M_{2} \cap E (Ω)

is the solution to (3)–(5), and

u_{h}^{θ_{*}} : = {\tilde{U}}_{h}^{θ_{*}} + {\tilde{u}}^{θ_{*}}

is the SGFEM approximation of u, where

{\tilde{U}}_{h}^{θ_{*}}

is the solution of the associated CIP (33), and

{\tilde{u}}^{θ_{*}}

is the DNN function (30) that approximates the discontinuous jump ϕ. The learning error level of

{\tilde{u}}^{θ_{*}}

is given in (35). Suppose that

\tilde{u}

defined in (20) belongs to

H^{k + 1} ({\bar{Ω}}_{0})

. Then there is a constant C that is independent of h, k, and ε such that

{∥u - u_{h}^{θ_{*}}∥}_{E (Ω)} \leq C h {∥{\tilde{U}}^{θ_{*}}∥}_{M_{2}} + C ε^{\frac{k}{k + 1}} {|\tilde{u} - {\tilde{u}}^{θ_{*}}|}_{H^{k + 1} ({\bar{Ω}}_{0})}^{\frac{1}{k + 1}},

(39)

where

{\tilde{U}}^{θ_{*}}

is defined in (31).

Proof.

Recalling that

\tilde{U} = u - \tilde{u}

we have

\begin{matrix} {∥u - u_{h}^{θ_{*}}∥}_{E (Ω)} & = & {∥(\tilde{U} + \tilde{u} - {\tilde{U}}^{θ_{*}} - {\tilde{u}}^{θ_{*}}) + ({\tilde{U}}^{θ_{*}} - {\tilde{U}}_{h}^{θ_{*}})∥}_{E (Ω)} \\ & \leq & {∥\tilde{U} + \tilde{u} - {\tilde{U}}^{θ_{*}} - {\tilde{u}}^{θ_{*}}∥}_{E (Ω)} + {∥{\tilde{U}}^{θ_{*}} - {\tilde{U}}_{h}^{θ_{*}}∥}_{E (Ω)} . \end{matrix}

(40)

According to (26) and (31) we obtain

B (\tilde{U} + \tilde{u}, V) = B ({\tilde{U}}^{θ_{*}} + {\tilde{u}}^{θ_{*}}, V) = 0, \forall V \in E (Ω),

and thus

B (\tilde{U} + \tilde{u} - {\tilde{U}}^{θ_{*}} - {\tilde{u}}^{θ_{*}}, V) = 0, \forall V \in E (Ω) .

Therefore, for arbitrary

V \in E (Ω)

\begin{matrix} {∥\tilde{U} + \tilde{u} - {\tilde{U}}^{θ_{*}} - {\tilde{u}}^{θ_{*}}∥}_{E (Ω)}^{2} & = & B (\tilde{U} + \tilde{u} - {\tilde{U}}^{θ_{*}} - {\tilde{u}}^{θ_{*}}, \tilde{U} + \tilde{u} - {\tilde{U}}^{θ_{*}} - {\tilde{u}}^{θ_{*}}) \\ & = & B (\tilde{U} + \tilde{u} - {\tilde{U}}^{θ_{*}} - {\tilde{u}}^{θ_{*}}, \tilde{u} - {\tilde{u}}^{θ_{*}} - V) \\ & \leq & {∥\tilde{U} + \tilde{u} - {\tilde{U}}^{θ_{*}} - {\tilde{u}}^{θ_{*}}∥}_{E (Ω)} {∥\tilde{u} - {\tilde{u}}^{θ_{*}} - V∥}_{E (Ω)} . \end{matrix}

(41)

Eliminating

{∥\tilde{U} + \tilde{u} - {\tilde{U}}^{θ_{*}} - {\tilde{u}}^{θ_{*}}∥}_{E (Ω)}

on both sides of (41) we have

{∥\tilde{U} + \tilde{u} - {\tilde{U}}^{θ_{*}} - {\tilde{u}}^{θ_{*}}∥}_{E (Ω)} \leq {∥\tilde{u} - {\tilde{u}}^{θ_{*}} - V∥}_{E (Ω)}, \forall V \in E (Ω) .

(42)

Let

\hat{V} = \{\begin{matrix} V, & in {\bar{Ω}}_{0}, \\ 0, & in Ω_{1}, \end{matrix}

which also belongs to

E (Ω)

. Hence, we obtain from (42) that

\begin{matrix} {∥\tilde{U} + \tilde{u} - {\tilde{U}}^{θ_{*}} - {\tilde{u}}^{θ_{*}}∥}_{E (Ω)} \leq {∥\tilde{u} - {\tilde{u}}^{θ_{*}} - \hat{V}∥}_{E (Ω)} = {∥\tilde{u} - {\tilde{u}}^{θ_{*}} - \hat{V}∥}_{E ({\bar{Ω}}_{0})} \leq C {∥\tilde{u} - {\tilde{u}}^{θ_{*}} - \hat{V}∥}_{H^{1} ({\bar{Ω}}_{0})} . \end{matrix}

(43)

Based on the learning error (35) and Lemma 1, there is a

\hat{V}

such that

{∥\tilde{U} + \tilde{u} - {\tilde{U}}^{θ_{*}} - {\tilde{u}}^{θ_{*}}∥}_{E (Ω)} \leq C ε^{\frac{k}{k + 1}} {|\tilde{u} - {\tilde{u}}^{θ_{*}}|}_{H^{k + 1} ({\bar{Ω}}_{0})}^{\frac{1}{k + 1}} .

(44)

On the other hand, it is noted that

{\tilde{U}}_{h}^{θ_{*}}

in (33) is the SGFEM solution to the CIP (31). Then according to the approximation result (18) for the CIP in Theorem 1 we have

{∥{\tilde{U}}^{θ_{*}} - {\tilde{U}}_{h}^{θ_{*}}∥}_{E (Ω)} \leq C h {∥{\tilde{U}}^{θ_{*}}∥}_{M_{2}} .

(45)

Finally, we obtain from the estimates (40), (44), and (45)

{∥u - u_{h}^{θ_{*}}∥}_{E (Ω)} \leq C h {∥{\tilde{U}}^{θ_{*}}∥}_{M_{2}} + C ε^{\frac{k}{k + 1}} {|\tilde{u} - {\tilde{u}}^{θ_{*}}|}_{H^{k + 1} ({\bar{Ω}}_{0})}^{\frac{1}{k + 1}},

which is the desired result (39). □

In (39), the function

\tilde{u}

is prescribed only on

Γ

, i.e.,

\tilde{u} |_{Γ} = - ϕ

, while its values in

Ω_{0}

are not specified. Therefore, the term

{|\tilde{u} - {\tilde{u}}^{θ_{*}}|}_{H^{k + 1} ({\bar{Ω}}_{0})}

in (39) can be replaced by

\min_{\tilde{u} \in H^{k + 1} ({\bar{Ω}}_{0}), \tilde{u} |_{Γ} = - ϕ} {|\tilde{u} - {\tilde{u}}^{θ_{*}}|}_{H^{k + 1} ({\bar{Ω}}_{0})},

which could be small. In (39), k is at least 1, i.e.,

\tilde{u} \in H^{2} ({\bar{Ω}}_{0})

. In fact, let

\tilde{u} |_{{\bar{Ω}}_{0}} = [{\tilde{u}}_{0} - {\tilde{u}}_{1}] |_{{\bar{Ω}}_{0}}

, where

{\tilde{u}}_{0}

and

{\tilde{u}}_{1}

are the extensions of

u_{0}

and

u_{1}

(see (34)), respectively; then

\tilde{u} \in H^{2} ({\bar{Ω}}_{0})

and

\tilde{u} |_{Γ} = - ϕ

. We mention that for the interface problem where

Γ

and

ϕ

are relatively smooth,

\tilde{u}

could have higher smoothness in

{\bar{Ω}}_{0}

, i.e.,

\tilde{u} \in H^{k + 1} ({\bar{Ω}}_{0}), k > 1

. Theorem 2 implies that the proposed SGFEM coupled with the DNN attains the optimal energy error

O (h)

if the DNN function

{\tilde{u}}^{θ_{*}}

learns

\tilde{u}

(

ϕ

) on the interface

Γ

(not in the domains

Ω_{0}

and

Ω_{1}

) with sufficient accuracy; see (35). In the numerical experiments below, the optimal errors

O (h)

are observed for the same discretization parameters h as those in the SGFEM for the CIP.
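To see what "sufficient accuracy" means quantitatively, one can ask how small ε must be for the learning term in (39) not to dominate the O(h) term. The sketch below assumes, purely for illustration, that the constant C and the seminorm factor in (39) are both equal to 1 (they are not known explicitly), so the condition reduces to ε^{k/(k+1)} ≤ h. The function name is hypothetical.

```python
def required_learning_error(h, k):
    """Largest eps with eps**(k/(k+1)) <= h, i.e. eps = h**((k+1)/k).

    Assumption: the constant C and the seminorm factor in (39) are set to 1.
    """
    if h <= 0 or k < 1:
        raise ValueError("need h > 0 and k >= 1")
    return h ** ((k + 1) / k)


# Mesh sizes h = 1/n for a few values of n, with k = 1 and k = 2.
for n in (4, 16, 64, 256):
    h = 1.0 / n
    print(n, required_learning_error(h, 1), required_learning_error(h, 2))
```

Under these assumptions, k = 1 requires ε of order h^2, while higher smoothness (larger k) relaxes the requirement toward ε of order h.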

6. Numerical Results

We consider the model problem (3) in a domain

Ω = {(0, 1)}^{2}

with straight and curved interfaces

Γ

for the numerical experiments. We test two cases of coefficients

a (P)

: (i)

a_{0} = 1

and

a_{1} = 10

, and (ii)

a_{0} = 1

and

a_{1} = 100

. Their contrasts are

c = 10

and

c = 100

, respectively. The manufactured exact solution u of (3) will be employed in the tests. The loading functions

f, g

of (3), the jump

ϕ

(4), and the flux

ψ

(5) can be calculated by using Equation (3) and the manufactured exact solution u.

The uniform

n \times n

square FE mesh is used to discretize the domain

Ω = {(0, 1)}^{2}

with the mesh parameter

h = 1 / n

. The nodes associated with the mesh are denoted by

{P_{i}}_{i \in I_{h}}

, where

I_{h}

is the index set. Let

{ϕ_{i}}_{i \in I_{h}}

be the bilinear FE functions associated with the nodes

{P_{i}}_{i \in I_{h}}

.

We will test the standard FEM (8) and SGFEM (12) on the square mesh for the DIP, based on the variational formulation (31) incorporating the DNN function (20). We do not present the results of other GFEMs, such as the geometric and corrected GFEM. In the geometric GFEM, the SCN is of order

O (h^{- 4})

[23], which is much bigger than that of SGFEM. The corrected GFEM is not robust in the sense that the SCN gets bad as interface curves approach boundaries of elements [27]. We compute and compare the relative error of these methods in the energy norm (EE). The SCN of SGFEM has been shown to be of order

O (h^{- 2})

in [27], and we do not repeat it in this paper because the stiffness matrices for the DIP are the same as those for the CIP.

Setting for ResNet. As described at the end of Section 4, we use the ResNet for the DNN coupling in the tests. The ResNet structure consists of three residual blocks, each of which contains two fully connected layers and one residual connection, where each layer contains 20 neurons; see Figure 2. The sigmoid function [62,63] serves as the activation function. We use the stochastic gradient descent (SGD) method [66,67] to solve (29) in the learning process. The learning rate, the batch number, and the number

N_{Γ}

of sampling points on

Γ

for training the ResNet will be specified in the following subsections.

Integration for discontinuous enrichments. We describe the numerical integration formula used in the computations. For elements that do not intersect the interface, we employ the standard

4 \times 4

Gaussian rule. For elements cut by the interface, we connect the intersection points of the interface and the element boundary by a straight line and decompose the element into 4 to 6 sub-triangles, on each of which the standard 12-point Gaussian rule for triangles is employed. See Figure 3 for an example. A systematic study of the effect of numerical integration is not the objective of this work; we refer to [3,5,22,69] for more details on numerical integration for interface problems.
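The distinction between cut and uncut elements can be sketched as follows. This is an illustrative assumption, not the authors' implementation: the interface is taken to be the zero level set of a user-supplied function, and an element of the uniform n × n mesh is flagged as cut when the level-set values at its four corners change sign. The circle used in the example is a hypothetical test geometry, and the corner test can miss an interface that crosses the same edge twice.

```python
def classify_elements(n, level_set):
    """Split the n x n square elements of (0,1)^2 into cut and uncut ones.

    An element is flagged as cut if level_set takes both negative and
    positive values at its four corner nodes (corner-sign test).
    """
    h = 1.0 / n
    vals = [[level_set(i * h, j * h) for j in range(n + 1)] for i in range(n + 1)]
    cut, uncut = [], []
    for i in range(n):
        for j in range(n):
            corners = (vals[i][j], vals[i + 1][j], vals[i][j + 1], vals[i + 1][j + 1])
            if min(corners) < 0.0 < max(corners):
                cut.append((i, j))    # would use the sub-triangle rule
            else:
                uncut.append((i, j))  # would use the tensor Gaussian rule
    return cut, uncut


def circle(x, y, cx=0.5, cy=0.5, radius=0.3):
    """Hypothetical level set: negative inside the circle, positive outside."""
    return (x - cx) ** 2 + (y - cy) ** 2 - radius ** 2


cut, uncut = classify_elements(4, circle)
print(len(cut), len(uncut))
```

Only the elements in the `cut` list would need the sub-triangle decomposition; all others can use the tensor-product rule directly.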

We now present our numerical results in the following subsections.

6.1. A Straight Interface Situation

We first consider a straight interface

Γ

, given by the equation

y = tan (θ_{0}) (x - 1 - d) + 1

with

θ_{0} = \frac{π}{6}

and

d = \frac{1}{π}

. The manufactured solution of (3) is as follows:

u = \{\begin{matrix} M r^{α_{0}} cos (α_{0} (θ + π - θ_{0})) + N \frac{a_{0}}{a_{1}} r^{α_{0}} sin (α_{0} (θ + π - θ_{0})) + sin (x y), & y > tan (θ_{0}) (x - 1 - d) + 1 (Ω_{1}), \\ r^{α_{0}} cos (α_{0} (θ + π - θ_{0})) + r^{α_{0}} sin (α_{0} (θ + π - θ_{0})) + sin (x y), & y \leq tan (θ_{0}) (x - 1 - d) + 1 (Ω_{0}), \end{matrix}

where

α_{0} = \frac{4}{3}

, and

(r, θ)

are the polar coordinates centered at

(1 + d, 1)

. It can be checked that u is continuous across the interface

Γ

when

M = N = 1

. In this example we take

M = 100

and

N = 100

, and u is discontinuous across

Γ

, i.e.,

{[u]}_{Γ} = ϕ \neq 0

(see (4)) and also the jump of flux

ψ \neq 0

(5). The mesh on the domain

[0, 1] \times [0, 1]

is refined with

h^{- 1} = n = 2^{j + 1}, j = 1, 2, \dots, 7

. The interface

Γ

and a mesh with

n = 16

are shown in Figure 4 Left, and the exact solution u with

a_{0} = 1

and

a_{1} = 100

is drawn in Figure 4 Right.

The number of uniformly distributed sampling points for training the ResNet,

N_{Γ}

, is taken as 200 in this situation. The learning rate is

η = 0.05

, and the batch number is 4096. The maximum relative error of the DNN function

{\tilde{u}}^{θ_{*}}

(30) to the jump

ϕ

on

Γ

, defined as

\frac{{∥ {\tilde{u}}^{θ_{*}} - ϕ ∥}_{L^{\infty} (Γ)}}{{∥ ϕ ∥}_{L^{\infty} (Γ)}},

(46)

with respect to the iteration steps of SGD, is shown in Figure 5 Left. It shows that the error of the DNN function

{\tilde{u}}^{θ_{*}}

is reduced as the number of SGD iterations increases. We observed the same behavior for other learning rates and batch numbers; these results are not shown here. Therefore, it is easy to learn a DNN function

{\tilde{u}}^{θ_{*}}

to reach the desired error level

ε

(35) required in Theorem 2; see (39). Such an

ε

is small enough for the SGFEM to obtain the optimal energy error convergence order

O (h)

, see below for the energy errors.
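A discrete version of the maximum relative error (46) can be sketched as follows. The sketch samples the straight interface y = tan(θ_0)(x − 1 − d) + 1 at points equally spaced in x (which is also equally spaced in arc length for a straight line) and replaces the L^∞(Γ) norms by maxima over the samples. The target and approximant below are hypothetical stand-ins, not the jump ϕ or the trained DNN of the paper, and the sampling rule is an assumption.

```python
import math

THETA0 = math.pi / 6
D = 1.0 / math.pi


def sample_straight_interface(n_points):
    """Points on y = tan(theta0) * (x - 1 - d) + 1 for x equally spaced in [0, 1]."""
    points = []
    for i in range(n_points):
        x = i / (n_points - 1)
        y = math.tan(THETA0) * (x - 1.0 - D) + 1.0
        points.append((x, y))
    return points


def max_relative_error(approx, target, points):
    """Sampled analogue of (46): max |approx - target| / max |target|."""
    num = max(abs(approx(x, y) - target(x, y)) for x, y in points)
    den = max(abs(target(x, y)) for x, y in points)
    return num / den


def target(x, y):
    """Hypothetical stand-in for the function to be learned on the interface."""
    return 1.0 + x * y


def approx(x, y):
    """Hypothetical approximant: the target plus a small perturbation."""
    return target(x, y) + 1e-3 * math.sin(5.0 * x)


points = sample_straight_interface(200)
err = max_relative_error(approx, target, points)
print(err)
```

In practice the evaluation points should be finer than the training points, since a maximum over the training samples alone can underestimate the true L^∞(Γ) error.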

The energy errors with respect to h of the FEM and SGFEM coupled with the DNN, are presented in Figure 5 Right for different contrasts c (10 and 100), where

h^{- 1} = n = 2^{j + 1}, j = 1, 2, \dots, 7

. It is shown in Figure 5 Right that the convergence orders of the FEM and SGFEM are

O (h^{0.5})

and

O (h)

, respectively. Therefore, this set of numerical experiments indicates that the proposed SGFEM reaches the optimal convergence order for such a DIP, as predicted by Theorem 2 (when

ε

is small).

6.2. A Curved Interface Situation

We next consider a curved interface

Γ

with an equation

{(x - x_{0})}^{2} + {(y - y_{0})}^{2} = r_{0}^{2}

, where

x_{0} = \frac{1}{\sqrt{5}}, y_{0} = \frac{1}{\sqrt{3}}, r_{0} = \frac{1}{\sqrt{10}}

. In this case we consider the manufactured solution of (3) as follows:

u = \{\begin{matrix} \frac{2 a_{1} - a_{1} ρ r_{0}^{2}}{(a_{1} - a_{0}) r_{0}^{4}} r^{2} cos (2 θ), & r < r_{0} (Ω_{0}), \\ \frac{a_{1} + a_{0} - a_{0} ρ r_{0}^{2}}{(a_{1} - a_{0}) r_{0}^{4}} r^{2} cos (2 θ) + r^{- 2} cos (2 θ), & r \geq r_{0} (Ω_{1}), \end{matrix}

where

(r, θ)

are the polar coordinates centered at

(x_{0}, y_{0})

. It can be verified that u is continuous across

Γ

for

ρ = 0

. In this test, we take

ρ = (a_{0} + a_{1}) / 2

, and

{[u]}_{Γ} = ϕ

is non-zero. The interface

Γ

and a mesh with

n = 16

are shown in Figure 6 Left, and the exact solution u with

a_{0} = 1

and

a_{1} = 100

is drawn in Figure 6 Right. The mesh on the domain

[0, 1] \times [0, 1]

is refined with

h^{- 1} = n = 2^{j + 1}, j = 1, 2, \dots, 7

.
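The statement that u is continuous across Γ for ρ = 0 and discontinuous otherwise can be checked numerically from the two branches of the manufactured solution. In the sketch below, the function names are hypothetical, and the quantity computed is simply the outer trace minus the inner trace at r = r_0; the sign convention of ϕ in (4) is not assumed. A direct calculation from the two branches shows that this difference equals ρ cos(2θ).

```python
import math


def u_inside(r, theta, a0, a1, rho, r0):
    """Branch of the manufactured solution for r < r0 (Omega_0)."""
    coef = (2.0 * a1 - a1 * rho * r0 ** 2) / ((a1 - a0) * r0 ** 4)
    return coef * r ** 2 * math.cos(2.0 * theta)


def u_outside(r, theta, a0, a1, rho, r0):
    """Branch of the manufactured solution for r >= r0 (Omega_1)."""
    coef = (a1 + a0 - a0 * rho * r0 ** 2) / ((a1 - a0) * r0 ** 4)
    return coef * r ** 2 * math.cos(2.0 * theta) + r ** -2 * math.cos(2.0 * theta)


def trace_difference(theta, a0, a1, rho, r0):
    """Outer trace minus inner trace of u on the circle r = r0."""
    return u_outside(r0, theta, a0, a1, rho, r0) - u_inside(r0, theta, a0, a1, rho, r0)


a0, a1 = 1.0, 100.0
r0 = 1.0 / math.sqrt(10.0)
rho = (a0 + a1) / 2.0
for theta in (0.0, 0.3, 1.0, 2.5):
    diff = trace_difference(theta, a0, a1, rho, r0)
    print(theta, diff, rho * math.cos(2.0 * theta))
```

The two printed columns agree up to rounding, and setting rho = 0.0 makes the difference vanish, consistent with the continuity claim.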

The number of uniformly distributed sampling points for training the ResNet,

N_{Γ}

, is taken as 500 in this situation. The learning rate

η = 0.08

, and the batch number is 4096. The maximum relative error (46) of the DNN function

{\tilde{u}}^{θ_{*}}

(30) to the jump

ϕ

on

Γ

with respect to the iteration steps of SGD, is shown in Figure 7 Left. It shows that the error of the DNN function is reduced as the number of SGD iterations increases. We observed the same behavior for other learning rates and batch numbers; these results are not shown here. Therefore, it is easy to obtain a DNN function

{\tilde{u}}^{θ_{*}}

to reach the small error level

ε

in Theorem 2.

We note that training the DNN is a stochastic procedure because of the SGD. In this example, to test the robustness of the proposed method, we run the learning algorithm (29) five times to generate the DNN functions

{\tilde{u}}_{i}^{θ_{*}}

(30),

i = 1, 2, \dots, 5

. For each

{\tilde{u}}_{i}^{θ_{*}}

, the energy errors

{EE}_{i, h}

with respect to h of the coupled FEM and SGFEM are computed from (33). The means and standard deviations (STD) of these errors are defined by

{\bar{EE}}_{h} : = \frac{1}{5} \sum_{i = 1}^{5} {EE}_{i, h}, \text{ and } {STD}_{h} : = \sqrt{\frac{1}{5} \sum_{i = 1}^{5} {({EE}_{i, h} - {\bar{EE}}_{h})}^{2}} .

{\bar{EE}}_{h}

with respect to h of the FEM and SGFEM are presented in Figure 7 Right for different contrasts c (10 and 100).

{\bar{EE}}_{h}

and

{STD}_{h}

with respect to h of SGFEM are listed in Table 1.
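The mean and STD defined above use the 1/5 normalization, i.e., the population (not the sample) standard deviation. A minimal sketch with Python's standard library follows; the five error values are hypothetical placeholders, not data from the paper.

```python
import statistics


def mean_and_std(errors):
    """Mean and population standard deviation (1/n normalization)."""
    mean = statistics.fmean(errors)
    std = statistics.pstdev(errors)  # divides by n, matching the definition above
    return mean, std


# Hypothetical energy errors from five independent training runs.
sample_errors = [0.1101, 0.1104, 0.1106, 0.1102, 0.1105]
mean, std = mean_and_std(sample_errors)
print(mean, std)
```

Using `statistics.stdev` instead would divide by n − 1 and give a slightly larger value, which would not match the definition used here.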

Figure 7 Right clearly shows that the convergence orders of the error means of FEM and SGFEM are still

O (h^{0.5})

and

O (h)

, respectively, as predicted. It is observed in Table 1 that the STDs of

{EE}_{i, h}

are at relatively low levels. These results indicate that the optimal convergence order can also be obtained by the proposed SGFEM in this case, and moreover, that the proposed SGFEM is robust with respect to the randomness of the DNN training. We also obtained similar results for other curved interfaces with discontinuous solutions; they are not presented here.
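The convergence order can be estimated from consecutive mesh levels as log(E_1/E_2)/log(h_1/h_2). The sketch below applies this to the mean errors for the three coarsest mesh sizes listed in Table 1; the function name is hypothetical, and orders observed on such coarse meshes are pre-asymptotic and need not equal 1 exactly.

```python
import math


def observed_orders(hs, errors):
    """Orders log(e_i / e_{i+1}) / log(h_i / h_{i+1}) between consecutive levels."""
    return [math.log(errors[i] / errors[i + 1]) / math.log(hs[i] / hs[i + 1])
            for i in range(len(hs) - 1)]


# Mean energy errors of the SGFEM for the three coarsest meshes in Table 1.
hs = [2.5000e-01, 1.2500e-01, 6.2500e-02]
means_c10 = [2.1241e-01, 1.1036e-01, 5.7113e-02]
means_c100 = [2.3789e-01, 1.3384e-01, 7.4277e-02]

print(observed_orders(hs, means_c10))
print(observed_orders(hs, means_c100))
```

For these three levels the estimated orders lie between 0.8 and 1 for both contrasts and increase under refinement.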

7. Conclusions and Comments

This paper proposed an SGFEM coupled with a DNN for solving elliptic interface problems with a discontinuous interface condition. In this scheme, the DNN is used to learn a function that (approximately) satisfies the interface condition, which allows us to reformulate the original problem with a discontinuous interface condition as one with a continuous interface condition. Based on this, the SGFEM for the CIP is applied to the DIP straightforwardly, and no penalty terms are needed. All the merits of the SGFEM for the CIP are maintained, such as optimal convergence, stability, and robustness. The approximation error of the proposed SGFEM coupled with the DNN was analyzed mathematically. For comparison, we performed numerical experiments with the FEM and SGFEM. The FEM converges with an order

O (h^{0.5})

only, while the SGFEM converges with the optimal order

O (h)

. The proposed method has great potential for the DIP with interfaces of complex geometries due to the meshless features of DNN. The extension of the results in this paper to the three-dimensional and parabolic interface problems will be investigated in future studies.

Author Contributions

Methodology, Q.Z.; validation, Q.Z., Y.J., and M.N.; data curation, M.N. and Y.J.; supervision, Q.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially supported by Key-Area Research and Development Program of Guangdong Province under grant 2021B0101190003 and Guangdong Provincial Natural Science Foundation of China under grant 2022A1515010831. This research was partially supported by the Natural Science Foundation of China under grant 11471343 and Guangdong Provincial Natural Science Foundation of China under grant 2022A1515011187.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Babuška, I.; Banerjee, U. Stable generalized finite element method. Comput. Methods Appl. Mech. Eng. 2011, 201–204, 91–111.
2. Babuška, I.; Banerjee, U.; Osborn, J. Survey of meshless and generalized finite element methods: A unified approach. Acta Numer. 2003, 12, 1–125.
3. Fries, T.P.; Belytschko, T. The extended/generalized finite element method: An overview of the method and its applications. Int. J. Numer. Meth. Eng. 2010, 84, 253–304.
4. Efendiev, Y.; Hou, T.Y. Multiscale Finite Element Methods: Theory and Applications; Springer: Cham, Switzerland, 2009.
5. Béchet, E.; Minnebo, H.; Moës, N.; Burgardt, B. Improved implementation and robustness study of the X-FEM method for stress analysis around cracks. Int. J. Numer. Meth. Eng. 2005, 64, 1033–1056.
6. Gupta, V.; Duarte, C.A.; Babuška, I.; Banerjee, U. A Stable and optimally convergent generalized FEM (SGFEM) for linear elastic fracture mechanics. Comput. Methods Appl. Mech. Eng. 2013, 266, 23–39.
7. Cui, C.; Zhang, Q. Stable generalized finite element methods (SGFEM) for elasticity crack problems. Int. J. Numer. Methods Eng. 2020, 121, 3066–3082.
8. Zhang, Q. DOF-gathering stable generalized finite element methods (SGFEM) for crack problems. Numer. Methods Partial. Differ. Equ. 2020, 36, 1209–1233.
9. Chessa, J.; Belytschko, T. An extended finite element method for two-phase fluids. J. Appl. Mech. 2003, 70, 10–17.
10. Hou, T.; Li, Z.; Osher, S.; Zhao, H. A hybrid method for moving interface problems with application to the Hele-Shaw flow. J. Comput. Phys. 1997, 134, 236–252.
11. Hansbo, A.; Hansbo, P. An unfitted finite element method, based on Nitsche’s method, for elliptic interface problems. Comput. Methods Appl. Mech. Eng. 2002, 191, 5537–5552.
12. Shahzamanian, M.M.; Lin, M.; Kainat, M.; Yoosef-Ghodsi, N.; Adeeb, S. Systematic literature review of the application of extended finite element method in failure prediction of pipelines. J. Pipeline Sci. Eng. 2021, 1, 241–251.
13. Elyasi, N.; Shahzamanian, M.; Lin, M.; Westover, L.; Li, Y.; Kainat, M.; Yoosef-Ghodsi, N.; Adeeb, S. Prediction of Tensile Strain Capacity for X52 Steel Pipeline Materials Using the Extended Finite Element Method. Appl. Mech. 2021, 2, 209–225.
14. Babuška, I.; Melenk, J.M. The partition of unity finite element method. Int. J. Numer. Meth. Eng. 1997, 40, 727–758.
15. Duarte, C.A.; Oden, J.T. An h-p adaptive method using clouds. Comput. Methods Appl. Mech. Eng. 1996, 139, 237–262.
16. Melenk, J.M.; Babuška, I. The partition of unity finite element method: Theory and application. Comput. Methods Appl. Mech. Eng. 1996, 139, 289–314.
17. Babuška, I.; Banerjee, U.; Kergrene, K. Strongly Stable Generalized Finite Element Method: Application to interface problems. Comput. Methods Appl. Mech. Eng. 2017, 327, 58–92.
18. Moës, N.; Cloirec, M.; Cartraud, P.; Remacle, J.F. A computational approach to handle complex microstructure geometries. Comput. Methods Appl. Mech. Eng. 2003, 192, 3163–3177.
19. Harari, I.; Dolbow, J. Analysis of an efficient finite element method for embedded interface problems. Comput. Math. 2010, 46, 205–211.
20. Zhang, Q.; Banerjee, U.; Babuška, I. Strongly Stable Generalized Finite Element Method (SSGFEM) for a non-smooth interface problem. Comput. Methods Appl. Mech. Eng. 2019, 344, 538–568.
21. Aragon, A.M.; Duarte, C.A.; Geubelle, P.H. Generalized finite element enrichment functions for discontinuous gradient fields. Int. J. Numer. Meth. Eng. 2010, 82, 242–268.
22. Cheng, K.W.; Fries, T.P. Higher-order XFEM for curved strong and weak discontinuities. Int. J. Numer. Meth. Eng. 2010, 82, 564–590.
23. Kergrene, K.; Babuška, I.; Banerjee, U. Stable Generalized Finite Element Method and associated iterative schemes: Application to interface problems. Comput. Methods Appl. Mech. Eng. 2016, 305, 1–36.
24. Legrain, G.; Moës, N.; Huerta, A. Stability of incompressible formulations enriched with X-FEM. Comput. Meth. Appl. Mech. Eng. 2008, 197, 1835–1849.
25. Kirchhart, M.; Gross, S.; Reusken, A. Analysis of an XFEM discretization for Stokes interface problems. SIAM J. Sci. Comput. 2016, 38, A1019–A1043.
26. Díez, P.; Cottereau, R.; Zlotnik, S. A stable extended FEM formulation for multi-phase problems enforcing the accuracy of the fluxes through Lagrange multipliers. Int. J. Numer. Meth. Eng. 2013, 96, 303–322.
27. Zhu, P.; Zhang, Q.; Liu, T. Stable generalized finite element method (SGFEM) for parabolic interface problems. J. Comput. Appl. Math. 2020, 367, 112475.
28. Zhang, Q.; Cui, C.; Banerjee, U.; Babuška, I. A condensed generalized finite element method (CGFEM) for interface problems. Comput. Methods Appl. Mech. Eng. 2022, 391, 114537.
29. Zilian, A.; Legay, A. The enriched space-time finite element method (EST) for simultaneous solution of fluid-structure interaction. Int. J. Numer. Meth. Eng. 2008, 75, 305–334.
30. Lehrenfeld, C.; Reusken, A. Analysis of a Nitsche XFEM-DG discretization for a class of two-phase mass transport problems. SIAM J. Numer. Anal. 2013, 51, 958–983.
31. Sauerland, H.; Fries, T.P. The extended finite element method for two-phase and free-surface flows: A systematic study. J. Comput. Phys. 2011, 230, 3369–3390.
32. Sauerland, H.; Fries, T.P. The stable XFEM for two-phase flows. Comput. Fluids 2013, 87, 41–49.
33. Fries, T.P.; Zilian, A. On time integration in the XFEM. Int. J. Numer. Meth. Eng. 2009, 79, 69–93.
34. Gross, S.; Ludescher, T.; Olshanskii, M.; Reusken, A. Robust preconditioning for XFEM applied to time-dependent Stokes problems. SIAM J. Sci. Comput. 2016, 38, A3492–A3514.
35. Loehnert, S. A stabilization technique for the regularization of nearly singular extended finite elements. Comput. Mech. 2014, 54, 523–533.
36. Menk, A.; Bordas, S.P.A. A robust preconditioning technique for the extended finite element method. Int. J. Numer. Meth. Eng. 2011, 85, 1609–1632.
37. Lang, C.; Makhija, D.; Doostan, A.; Maute, K. A simple and efficient preconditioning scheme for Heaviside enriched XFEM. Comput. Mech. 2014, 54, 1357–1374.
38. Schweitzer, M.A. Stable enrichment and local preconditioning in the particle-partition of unity method. Numer. Math. 2011, 118, 137–170.
39. Zhang, Q.; Banerjee, U.; Babuška, I. High order stable generalized finite element methods. Numer. Math. 2014, 128, 1–29.
40. Zhang, Q.; Babuška, I. A stable generalized finite element method (SGFEM) of degree two for interface problems. Comput. Methods Appl. Mech. Eng. 2020, 363, 112889.
41. Hansbo, P.; Larson, M.G.; Zahedi, S. A cut finite element method for a Stokes interface problem. Appl. Numer. Math. 2014, 85, 90–114.
42. Lin, T.; Lin, Y.; Zhang, X. Partially penalized immersed finite element methods for elliptic interface problems. SIAM J. Numer. Anal. 2015, 53, 1121–1144.
43. Afraites, L.; Dambrine, M.; Kateb, D. Shape methods for the transmission problem with a single measurement. Numer. Funct. Anal. Optim. 2007, 28, 519–551.
44. LeVeque, R.; Li, Z. Immersed interface methods for Stokes flow with elastic boundaries or surface tension. SIAM J. Sci. Comput. 1997, 18, 709–735.
45. Li, Z.; Wang, W.; Chern, I.L.; Lai, M. New formulations for interface problems in polar coordinates. SIAM J. Sci. Comput. 2003, 25, 224–245.
46. Adjerid, S.; Babuška, I.; Guo, R.; Lin, T. An enriched immersed finite element method for interface problems with nonhomogeneous jump conditions. arXiv 2021, arXiv:2004.13244.
47. Barrett, J.W.; Elliott, C.M. Fitted and unfitted finite-element methods for elliptic equations with smooth interfaces. IMA J. Numer. Anal. 1987, 7, 283–300.
48. Huang, P.; Wu, H.; Xiao, Y. An unfitted interface penalty finite element method for elliptic interface problems. Comput. Methods Appl. Mech. Eng. 2017, 323, 439–460.
49. Zhu, L.; Zhang, Z.; Li, Z. An immersed finite volume element method for 2D PDEs with discontinuous coefficients and non-homogeneous jump conditions. Comput. Math. Appl. 2015, 70, 89–103.
50. Sirignano, J.; Spiliopoulos, K. DGM: A deep learning algorithm for solving partial differential equations. J. Comput. Phys. 2018, 375, 1339–1364.
51. Dockhorn, T. A discussion on solving partial differential equations using neural networks. arXiv 2019, arXiv:1904.07200.
52. Berg, J.; Nyström, K. A unified deep artificial neural network approach to partial differential equations in complex geometries. Neurocomputing 2018, 317, 28–41.
53. E, W.; Yu, B. The deep Ritz method: A deep learning-based numerical algorithm for solving variational problems. Commun. Math. Stat. 2018, 8, 1–12.
54. Sheng, H.; Yang, C. PFNN: A penalty-free neural network method for solving a class of second order boundary-value problems on complex geometries. J. Comput. Phys. 2021, 428, 110085.
55. Wang, Z.; Zhang, Z. A mesh-free method for interface problems using the deep learning approach. J. Comput. Phys. 2020, 400, 108963.
56. Liao, Y.; Ming, P. Deep Nitsche method: Deep Ritz method with essential boundary conditions. Commun. Comput. Phys. 2021, 29, 1365–1384.
57. Anitescu, C.; Atroshchenko, E.; Alajlan, N.; Rabczuk, T. Artificial neural network methods for the solution of second order boundary value problems. Comput. Mater. Contin. 2019, 59, 345–359.
58. Samaniego, E.; Anitescu, C.; Goswami, S.; Nguyen-Thanh, V.M.; Guo, H.; Hamdia, K.; Zhuang, X.; Rabczuk, T. An energy approach to the solution of partial differential equations in computational mechanics via machine learning: Concepts, implementation and applications. Comput. Methods Appl. Mech. Eng. 2020, 362, 112790.
59. Krishnapriyan, A.; Gholami, A.; Zhe, S.; Kirby, R.; Mahoney, M.W. Characterizing possible failure modes in physics-informed neural networks. Adv. Neural Inf. Process. Syst. 2021, 34, 26548–26560.
60. Yang, Z.; Baraldi, P.; Zio, E. A multi-branch deep neural network model for failure prognostics based on multimodal data. J. Manuf. Syst. 2021, 59, 42–50.
61. Meyes, R.; Donauer, J.; Schmeing, A.; Meisen, T. A Recurrent Neural Network Architecture for Failure Prediction in Deep Drawing Sensory Time Series Data. Procedia Manuf. 2019, 34, 789–797.
62. Cybenko, G. Approximation by superposition of a sigmoidal function. Math. Control. Signals Syst. 1989, 2, 303–314.
63. Chandra, P.; Singh, Y. Feedforward sigmoidal networks-equicontinuity and fault-tolerance properties. IEEE Trans. Neural Netw. 2004, 15, 1350–1366.
64. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778.
65. Li, H.; Zhang, Q.; Chen, X. Deep Learning-Based Surrogate Model for Flight Load Analysis. Cmes Comput. Model. Eng. Sci. 2021, 128, 605–621.
66. Bottou, L. Large-scale machine learning with stochastic gradient descent. In Proceedings of COMPSTAT 2010; Springer: Cham, Switzerland, 2010; pp. 177–186.
67. Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980.
68. Brenner, S.C.; Scott, L.R. The Mathematical Theory of Finite Element Methods, 3rd ed.; Springer: New York, NY, USA, 2008.
69. Fries, T.P.; Omerović, S. Higher-order accurate integration of implicit geometries. Int. J. Numer. Meth. Eng. 2016, 106, 323–371.

Figure 1. The domains with a curved interface (Left) and a straight interface (Right).

Figure 2. The ResNet used in the tests.

Figure 3. Numerical integrations on an element

[a, b] \times [c, d]

cut by the interface

Γ

, where the element is divided into several triangles, and the Gaussian rule is used on each triangle.

Figure 4. The domains with a straight interface (Left) and a discontinuous solution u (Right) of DIP ($a_{0} = 1, a_{1} = 100$).

Figure 5. The DIP with a straight interface: the maximum relative errors (46) of DNN function ${\tilde{u}}^{\theta_{*}}$ (Left) and the energy errors with respect to h of FEM and SGFEM (Right); the contrasts c are 10 and 100.

Figure 6. The domains with a curved interface (Left) and a discontinuous solution u (Right) of DIP ($a_{0} = 1, a_{1} = 100$).

Figure 7. The DIP with a curved interface: the maximum relative errors (46) of DNN function ${\tilde{u}}^{\theta_{*}}$ (Left) and the means of energy errors with respect to h of FEM and SGFEM (Right); the contrasts c are 10 and 100.

Table 1. The means and STDs of errors with respect to h of SGFEM: the curved interface situation.

h	Mean ($c = 10$)	STD ($c = 10$)	Mean ($c = 100$)	STD ($c = 100$)
$2.5000 \times 10^{-1}$	$2.1241 \times 10^{-1}$	$4.0512 \times 10^{-4}$	$2.3789 \times 10^{-1}$	$7.9756 \times 10^{-3}$
$1.2500 \times 10^{-1}$	$1.1036 \times 10^{-1}$	$3.0925 \times 10^{-4}$	$1.3384 \times 10^{-1}$	$8.4100 \times 10^{-3}$
$6.2500 \times 10^{-2}$	$5.7113 \times 10^{-2}$	$1.4862 \times 10^{-4}$	$7.4277 \times 10^{-2}$	$5.5238 \times 10^{-3}$
$3.1250 \times 10^{-2}$	$2.9479 \times 10^{-2}$	$8.1748 \times 10^{-5}$	$3.8006 \times 10^{-2}$	$2.9354 \times 10^{-3}$
$1.5625 \times 10^{-2}$	$1.5016 \times 10^{-2}$	$4.3178 \times 10^{-5}$	$1.9107 \times 10^{-2}$	$1.4815 \times 10^{-3}$
$7.8125 \times 10^{-3}$	$7.5967 \times 10^{-3}$	$2.0323 \times 10^{-5}$	$9.5761 \times 10^{-3}$	$7.4419 \times 10^{-4}$
$3.9063 \times 10^{-3}$	$3.8242 \times 10^{-3}$	$1.0329 \times 10^{-5}$	$4.8098 \times 10^{-3}$	$3.7257 \times 10^{-4}$
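
One way to read Table 1 is to estimate the observed convergence order between consecutive mesh sizes, $p_i = \log(e_i / e_{i+1}) / \log(h_i / h_{i+1})$. The following minimal sketch does this for the mean errors copied from the table; the helper name `observed_orders` is hypothetical and is not part of the paper.

```python
import math

# Mesh sizes and mean errors copied from Table 1 (curved interface).
h = [2.5000e-01, 1.2500e-01, 6.2500e-02, 3.1250e-02,
     1.5625e-02, 7.8125e-03, 3.9063e-03]
mean_c10 = [2.1241e-01, 1.1036e-01, 5.7113e-02, 2.9479e-02,
            1.5016e-02, 7.5967e-03, 3.8242e-03]
mean_c100 = [2.3789e-01, 1.3384e-01, 7.4277e-02, 3.8006e-02,
             1.9107e-02, 9.5761e-03, 4.8098e-03]


def observed_orders(h, err):
    """Observed order between each pair of consecutive mesh sizes."""
    return [math.log(err[i] / err[i + 1]) / math.log(h[i] / h[i + 1])
            for i in range(len(h) - 1)]


orders_c10 = observed_orders(h, mean_c10)
orders_c100 = observed_orders(h, mean_c100)

for label, orders in (("c = 10", orders_c10), ("c = 100", orders_c100)):
    print(label, [round(p, 3) for p in orders])
```

For both contrasts the estimated orders approach one on the finer meshes, which is consistent with first-order convergence of the tabulated errors in h.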

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jiang, Y.; Nian, M.; Zhang, Q. A Stable Generalized Finite Element Method Coupled with Deep Neural Network for Interface Problems with Discontinuities. Axioms 2022, 11, 384. https://doi.org/10.3390/axioms11080384
