Next Article in Journal
Inference of Reliability Analysis for Type II Half Logistic Weibull Distribution with Application of Bladder Cancer
Next Article in Special Issue
Optimality of the Approximation and Learning by the Rescaled Pure Super Greedy Algorithms
Previous Article in Journal
A Brief Overview and Survey of the Scientific Work by Feng Qi
Previous Article in Special Issue
Some Upper Bounds for RKHS Approximation by Bessel Functions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Stable Generalized Finite Element Method Coupled with Deep Neural Network for Interface Problems with Discontinuities

1
School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou 510006, China
2
School of Science, Harbin Institute of Technology, Shenzhen 518000, China
*
Author to whom correspondence should be addressed.
Axioms 2022, 11(8), 384; https://doi.org/10.3390/axioms11080384
Submission received: 29 June 2022 / Revised: 3 August 2022 / Accepted: 3 August 2022 / Published: 5 August 2022

Abstract

:
The stable generalized finite element method (SGFEM) is an improved version of generalized or extended FEM (GFEM/XFEM), which (i) uses simple and unfitted meshes, (ii) reaches optimal convergence orders, and (iii) is stable and robust in the sense that conditioning is of the same order as that of FEM and does not get bad as interfaces approach boundaries of elements. This paper designs the SGFEM for the discontinuous interface problem (DIP) by coupling a deep neural network (DNN). The main idea is to construct a function using the DNN, which captures the discontinuous interface condition, and transform the DIP to an (approximately) equivalent continuous interface problem (CIP) based on the DNN function such that the SGFEM for CIPs can be applied. The SGFEM for the DIP is a conforming method that maintains the features (i)–(iii) of SGFEM and is free from penalty terms. The approximation error of the proposed SGFEM is analyzed mathematically, which is split into an error of SGFEM of the CIP and a learning error of the DNN. The learning dimension of DNN is one dimension less than that of the domain and can be implemented efficiently. It is known that the DNN enjoys advantages in nonlinear approximations and high-dimensional problems. Therefore, the proposed SGFEM coupled with the DNN has great potential in the high-dimensional interface problem with interfaces of complex geometries. Numerical experiments verify the efficiency and optimal convergence of the proposed method.
MSC:
65N12; 65N30; 68T07

1. Introduction

Generalized or Extended Finite Element Methods (GFEM/XFEM) augment the standard FEM with special functions that mimic local features of exact solutions to solve complicated non-smooth engineering problems [1,2,3,4]. The applications of GFEM/XFEM to typical fields, such as crack problems, interface problems, and material failures are referred to [3,5,6,7,8,9,10,11,12,13]. Both GFEM and XFEM are based on a partition of unity [14,15,16] to “paste” the local special functions. We will use GFEM to represent the GFEM/XFEM below for simplicity. The GFEM has been extensively applied to the interface problem, including elliptic interface problems [11,17,18,19,20,21,22,23,24,25,26,27,28] and time-dependent interface problems [9,29,30,31,32,33,34]. Meshes used in the GFEM are simple, fixed, and independent of the location of interfaces so that remeshing or mesh refinement for the FEM are avoided in the GFEM.
It was realized quite early that the GFEM has a conditioning difficulty in that condition numbers of stiffness matrices may be much larger than those of the standard FEM, and the conditioning may get extremely bad as the interfaces approach the boundaries of elements. The bad conditioning is mainly caused by almost linear dependence between the FE functions and added special functions. Many interesting ideas have been proposed for the conditioning of GFEM, such as (a) locally adapting positions of either nodes or interface curves [32,35], preconditioning the stiffness matrices by employing domain decomposition [36], local Cholesky decomposition [5], condensing DOFs at certain nodes [37], and modifying the enriched functions based on orthogonalization [38]. Recently, a stable GFEM (SGFEM) was introduced in [1,6,39] to improve the conditioning of GFEM. The main idea of SGFEM is to modify enriched functions by subtracting its FE interpolant. The SGFEM has been applied to the crack problem [6,7,8] and interface problems [1,17,20,23,32,40]. The SGFEM for interface problems is proven to have the optimal convergence in [1,20], and the conditioning is of the same order as that of the FEM and does not deteriorate as the interfaces approach the boundaries of elements.
To the best of our knowledge, the SGFEM has been developed only for continuous interface problems. Interface problems can be categorized as continuous and discontinuous interface problems (CIPs/DIPs), characterized by [ u ] Γ = 0 and [ u ] Γ 0 , respectively, where u is a solution to an interface problem with interface Γ , and [ u ] Γ is a jump of u across the interface Γ . Both cases have many applications (see [11,22,23,32,34,41,42] for CIPs, [10,30,43,44,45,46] for DIPs). The shape functions in the SGFEM (also in some GFEM [3,9,11,17,20,23,25,31,32,34,41,42,47]) for the interface problem are continuous. Thus, the SGFEM can provide conforming approximations for CIPs. However, the continuous shape functions cannot be used directly for the DIP, and penalty techniques need to be employed typically [19,30,48]. We stress that many of the unfitted FEM or GFEM introduce penalty techniques and parameters to solve the CIP, such as [11,34,41,42]. It is possible to propose an SGFEM for DIPs by changing the enrichments and using the penalty technique above-mentioned. However, the penalty parameters are generally problem-dependent and difficult to choose in a uniform approach.
In this paper, we tend to design the SGFEM for DIPs free from penalty terms and parameters. The main idea is to transform a DIP into an equivalent CIP [45,46,49]. This can be achieved by deriving a function ϑ such that the modified solution U = u ϑ satisfies the continuity condition [ U ] Γ = 0 . Unfortunately, such a ϑ is hard to construct, especially in the high dimensions where the interface has complex geometric shapes [45,49]. We construct such ϑ using the DNN. Then, the SGFEM for the CIP coupled with such ϑ is proposed for the DIP. This idea is motivated by recent pioneer research about solving partial differential equations (PDEs) using the DNN [50,51,52,53,54,55,56,57]. We mention that the DNN has also been applied to many engineering problems, for instance, energy approach [58], failure models and predictions [59,60,61]. We take a Poisson equation u = f in a domain Ω , for example, which has an essential boundary condition u ¯ on Ω . The DNN for PDEs [53,54,55,57] is to minimize a loss function
Loss = 1 2 Ω u · u Ω f u + β Ω ( u u ¯ ) 2
using a certain DNN architecture, where β is a parameter to balance the two terms in (1). The DNN method based on (1) is not convex, and does not show the convergence rate [53,55]. In addition, the essential boundary condition affects the accuracy DNN [53], and β has to be selected carefully. To resolve the essential boundary condition, in [52,54,55], a small DNN is used to minimize the boundary term in (1), and then the PDE is transformed to an equivalent PDE with the (approximately) zero boundary condition, which is solved by another DNN. The small DNN function is defined in the whole domain Ω , but only the values on Ω are needed in the training process. Thus, such a DNN has a dimension number that is one dimension less than that of Ω and can approximate the boundary function efficiently, especially when the boundary Ω has complex geometries [52,54,55]. This idea is adopted in this paper to address the discontinuous interface condition, i.e., [ u ] Γ 0 .
Specifically, we first use a small DNN to produce a function ϑ (defined in Ω ) by learning the discontinuous condition [ u ] Γ such that [ ϑ ] Γ [ u ] Γ . As reported in [52,54,55], the jump [ u ] Γ can be “learned” accurately and efficiently by the DNN. Then, the DIP is transformed to an (approximately) equivalent CIP based on ϑ , which the SGFEM can solve for the CIP. The DNN function ϑ can be efficiently incorporated into the SGFEM framework because its differentiations are computed automatically. Since [ ϑ ] Γ does not equal [ u ] Γ exactly, there is a conforming error in the SGFEM solution (combined with the DNN function ϑ ). This error is analyzed and proven mathematically, which indicates that the proposed SGFEM yields the optimal convergence for the DIP if the interface condition can be learned exactly enough. A set of numerical experiments verifies the theoretical results. It is known that the DNN possesses remarkable merits in solving high-dimensional problems and nonlinear approximations. Therefore, the proposed SGFEM in this paper has a great potential for the three-dimensional DIP with interfaces of complex geometries.
The paper is organized as follows. The model problem is described in Section 2. Conventional GFEM and SGFEM are reviewed in Section 3. The proposed SGFEM coupled with the DNN is proposed for the DIP in Section 4. In Section 5, we analyze the approximation error of the proposed method mathematically. The numerical experiments and concluding remarks are presented in Section 6 and Section 7, respectively.

2. Model Problem

For a domain Δ in R 2 , an integer m, and 1 q , we denote the usual Sobolev spaces as W m , q ( Δ ) with norm · W m , q ( Δ ) and semi-norm | · | W m , q ( Δ ) . The space W m , q ( Δ ) will be represented by H m ( Δ ) for q = 2 and L q ( Δ ) when m = 0 , respectively.
We consider a bounded and simply connected domain Ω R 2 with a piecewise smooth boundary Ω . Let Γ be an interface that divides Ω into two domains Ω 0 and Ω 1 such that Ω ¯ = Ω ¯ 0 Ω ¯ 1 , Ω 0 Ω 1 = , and Γ = Ω ¯ 0 Ω ¯ 1 . In this study, we consider smooth interfaces Γ , as shown in Figure 1.
A point in the Cartesian coordinate system of R 2 is denoted as P = ( x , y ) . Let a be a positive, piecewise-constant function given by
a ( P ) = a 0 , P Ω 0 , a 1 , P Ω 1 ,
where a 0 and a 1 are positive constants, 0 < ζ 0 a r ζ 1 < , r = 0 , 1 , and ζ 0 , ζ 1 R . Clearly, a is discontinuous along the interface Γ .
We are interested in the solution u of the interface problem:
· ( a u ) = f , in Ω , a u n b = g , on Ω ,
subject to non-homogeneous jump conditions on the interface Γ
u Γ = ϕ , on Γ ,
a u n Γ Γ = ψ , on Γ ,
where n b and n Γ denote the unit outward normal to the boundary Ω and the interface Γ directed towards Ω 0 , respectively. The notation v Γ : = v 1 v 0 defines the jump of a quantity v along the interface Γ , where v r : = v | Ω ¯ r , r = 0 , 1 . We mention that if the boundary Ω contains a re-entrant corner, u may be singular and may not belong to H 2 ( Ω r ) , r = 0 , 1 . However, we do not discuss such a situation in this paper and limit the setting to the case u H 2 ( Ω r ) , r = 0 , 1 , as in [42,45,47]. Therefore, we assume that the boundary Ω and the data f , g , ϕ , ψ are given such that the solution u M 2 , where M 2 is defined by
M 2 : = { u : u | Ω r H 2 ( Ω r ) , r = 0 , 1 and α u L ( Γ ) < , | α | 1 }
with a norm
u M 2 = u 0 H 2 ( Ω 0 ) + u 1 H 2 ( Ω 1 ) + | α | 1 α u L ( Γ ) , u M 2 .
The interface condition (4) has an essential effect on the features of the solution and constructions of numerical algorithms. The problem (3)–(5) is referred to as a CIP and a DIP if ϕ in (4) is zero and nonzero, respectively. Both the CIP and DIP have many applications, see [11,22,23,32,34,41,42] for the CIP and [10,30,43,44,45,46] for the DIP. The SGFEM was proposed for the CIP to provide conforming approximations [17,20,23,27,40]. In this paper, the SGFEM is generalized to the DIP in a conforming approach that is free from any penalty terms.

3. Conventional Gfem and Sgfem for Interface Problems

We begin with a quasi-uniform finite element mesh T h = { e s } of the domain Ω with mesh size 0 < h < 1 , where the finite elements e s can be triangles or quadrilaterals. We note that the mesh T h does not need to match the interface Γ . Let { P i } i I h be the set of finite element nodes associated with the mesh T h , where I h is the index set of the nodes. For every i I h , we consider the standard linear (bilinear for quadrilateral element) finite element hat function ϕ i . The closure of support of ϕ i is denoted by ω i , which is called the patch associated with the node P i . Since the mesh is quasi-uniform, we assume that
ϕ i L ( Ω ) = ϕ i L ( ω i ) 1 , ϕ i L ( Ω ) = ϕ i L ( ω i ) C h 1 ,
where the positive constant C is independent of h and i. It is well known that { ϕ i } i I h form a partition of unity (PU) [14,15,16] subordinate to the patches { ω i } i I h .
The standard FEM subspace to approximate the solution of (3) is given by
S h : = S F E M = span { ϕ i ( x ) : i I h } .
The FEM yields highly accurate approximations only if the underlying variational problem has a smooth solution. However, it is also well known that an FEM with a quasi-uniform mesh cannot approximate the solution of the interface problem efficiently [23,47].
The generalized or extended FEM (GFEM) [2,3] is a typical technique to approximate the non-smooth solutions to variational problems. The approximation space S h of GFEM is obtained by augmenting the finite element space S F E M by non-polynomial enrichment space S E N R using the enrichment functions Π i as follows:
S h = S F E M S E N R and S E N R = span { ϕ i Π i : i I h , e n r and I h , e n r I h } ,
where the enrichment functions Π i are problem-dependent and mimic the non-smooth exact solutions of the underlying variational problem. The nodes indexed by I h , e n r , are called the enriched nodes. The choice of I h , e n r I h can vary and may also be problem-dependent.
The enrichment functions Π i used in the GFEM to approximate the solution of a smooth interface problem are generally based on a distance function (or absolute of level set function) [3,18,23]
D ( P ) = dist ( P , Γ ) .
In a conventional GFEM for the interface problems, the approximate subspace is
S h = S F E M S E N R and S E N R = span { ϕ i D : i I h , e n r = I h , e n r Γ } ,
and
I h , e n r Γ = { i I h : P i e s where e ˚ s Γ } .
It is easy to know that
Card ( I h , e n r Γ ) : = cardinality   of   I h , e n r Γ = O ( h 1 ) .
The GFEM based on I h , e n r Γ is referred to as a topological GFEM [3]. It was known early (e.g., [23]) that the topological GFEM only produces the convergence order O ( h ) in energy norm, which is not optimal O ( h ) . The optimal convergence can be attained by the geometric GFEM [3,23] and corrected GFEM [3,22]. However, the geometric GFEM introduces many more enriched degrees of freedom (DOFs) than the topological ones, and its conditioning is of O ( h 4 ) [23] that is much higher than that of the FEM. Meanwhile, the conditioning of the corrected GFEM may not be stable because it may “blow up” as the interface Γ is close to the nodes of the mesh [27].
Recently, a simple local procedure of subtracting the interpolant was introduced in [1,6,17,20,23,39,40] to address the bad conditioning of GFEM, and the modified GFEM is referred to as a stable GFEM (SGFEM). Specifically, the approximate subspace of the SGFEM for the interface problems is given by
S h = S F E M S E N R and S E N R = span { ϕ i ( D I h D ) : i I h , e n r = I h , e n r Γ } ,
where I h f is the FEM interpolant of a continuous function f based on S F E M . It was shown [17] that the SGFEM (12) for the elliptic interface (a) reaches the optimal convergence order O ( h ) in energy norm, (b) has a scaled condition number (SCN) of stiffness matrices O ( h 2 ) that is of the same order as the FEM, and (c) is robust in that the convergence and SCN do not depend on the relative positions of the mesh and interfaces.
Remark 1.
We mention that there are other options for the enrichments of the GFEM of the interface problems, such as [11,41,48], where
Π i = H
is used as the enrichments, and H is the Heaviside function 1 in Ω 0 and 1 in Ω 1 . This scheme leads to a non-conforming formulation because of the discontinuity of H. A penalty technique needs to be developed to deal with discontinuity. This paper presents conforming methods in which the variational formulations are standard without any penalty terms or parameters.
We next illustrate the scaled condition number (SCN) of the stiffness matrices of the GFEM or SGFEM. For simplicity, we re-arrange the order of the shape functions so that the stiffness or mass matrices B of GFEM have the form
B = B 11 B 12 B 12 T B 22 ,
where B 11 and B 22 are associated with the FE part and the enrichment part of GFEM or SGFEM, respectively. We note that B 11 is the standard finite element matrix with respect to the standard finite element triangulation used to define the GFEM. Consider the matrix
D = D 11 0 0 D 22 ,
where D 11 and D 22 are diagonal matrices with
( D 11 ) i i = ( B 11 ) i i 1 / 2 and ( D 22 ) i i = ( B 22 ) i i 1 / 2 .
We define B ^ : = D B D and B ^ 11 : = D 11 B 11 D 11 . The SCNs of B and B 11 are defined by
K : = κ ( B ^ ) , K F E M : = κ ( B ^ 11 ) ,
respectively, where κ ( B ) is a condition number of a matrix B . It is known that K F E M = O ( h 2 ) for the stiffness matrices of FEM.
The relevant difficulties of GFEM consist of (i) stability: K may be much bigger than K F E M , and (ii) robustness: K may blow up as the interfaces are close to the boundaries of elements [1,2,3,23]. These are caused by the almost linear dependence of subspaces S F E M and S E N R . The SGFEM is the stable and robust GFEM, and has been applied to crack and interface problems successfully [1,6,17,20,23,39,40].
The convergence of SGFEM for the CIP
The SGFEM for the CIP has been studied in [17,23,27,40], and the associated variational formulation based on the SGFEM space (12) is as follows:
Find u S G , h S h such   that B ( u S G , h , v h ) = L ( v h ) , v h S h ,
where
B ( u , v ) : = Ω a u · v d P , L ( v ) = Ω f v d x + Ω g v d s + Γ ψ v d s .
We define E ( Ω ) to be the energy space with respect to the CIP given by
E ( Ω ) : = { u H 1 ( Ω ) : u E ( Ω ) 2 : = B ( u , u ) < and [ u ] Γ = 0 o n Γ } .
It is obvious that S h (12) belongs to E ( Ω ) . The convergence of u S G , h to u was proven in [17,27,40], and we present it here without its proof.
Theorem 1.
Suppose that u M 2 is the solution of the CIP ((3)–(5) with ϕ = 0 ), and u S G , h is the SGFEM solution of (15) based on the finite-dimensional subspace S h (12), then there exists C > 0 independent of h such that
u u S G , h E ( Ω ) C h u M 2 .
In the next section, the SGFEM of the CIP is generalized to the DIP in a conforming approach that is free from any penalty terms by coupling a DNN.

4. Sgfem Coupled with Dnn for Dip

We first employ the DNN to learn a function that approximates the discontinuous interface condition. Then, by coupling with the DNN function, we reformulate the DIP into an (approximately) equivalent continuous model, which can be solved by SGFEM.
Let ϑ 0 be a function in H 2 ( Ω 0 ) with
ϑ 0 | Γ = ϕ .
Define u ˜ to be a function on Ω as
u ˜ = ϑ 0 , in Ω ¯ 0 , 0 , in Ω 1 ,
then u ˜ satisfies the interface problem (4) because [ u ˜ ] Γ = 0 ϑ 0 | Γ = 0 ( ϕ ) = ϕ . Define a function U ˜ = u u ˜ , then
U ˜ = u 0 ϑ 0 , on Ω 0 , u 1 , on Ω 1 .
It is easy to check that the U ˜ satisfies the continuous interface problem, i.e., [ U ˜ ] Γ = [ u ] Γ [ u ˜ ] Γ = 0 . The model problem (3) with the discontinuous interface conditions (4) and (5) is transformed into an equivalent equation about U ˜ with the continuous interface condition:
· ( a U ˜ ) = f + · ( a u ˜ ) , i n Ω ,
a U ˜ n b = g a u ˜ n b , o n Ω ,
U ˜ Γ = 0 , o n Γ ,
a U ˜ n Γ Γ = ψ a u ˜ n Γ Γ = ψ + a 0 u ˜ n Γ , o n Γ .
We note that the RHSs of (23)–(25) can be simplified because u ˜ = 0 in Ω 1 , for instance, in the boundary condition (24) g a u ˜ n b = g if Γ Ω . The variational formula of (23)–(25) is the following:
Find U ˜ E ( Ω ) such   that B ( U ˜ , V ) = L ˜ ( V ) , V E ( Ω ) ,
where
L ˜ ( V ) = Ω f V d P + Ω 0 ( a 0 Δ u ˜ ) V d P + Ω g V d s Ω Ω 0 a 0 u ˜ n b V d s + Γ ψ a 0 u ˜ n Γ Γ V d s = Ω f V d P + Ω g V d s + Γ ψ V d s + Ω 0 ( a 0 Δ u ˜ ) V d P Ω Ω 0 a 0 u ˜ n b V d s + Γ a 0 u ˜ n Γ V d s = Ω f V d P + Ω g V d s + Γ ψ V d s Ω 0 a 0 u ˜ · V d P ,
and B ( · , · ) and E ( Ω ) are defined in (16) and (17), respectively. Note that in the second equality of (27) n Γ is directed towards to Ω 0 .
We note that L ˜ in variational formula (26) depends on the unknown u ˜ , and cannot be calculated. It is easy to see that the evaluations of u ˜ on Γ (19) rather than in Ω are essential for the derivation of the equivalent CIP (23)–(25). This leads us to construct a function u ˜ θ (defined in Ω ) using a DNN such that u ˜ θ mimics the condition (19) with high precision, where θ are parameters in the DNN algorithm. Such a u ˜ θ is available, and the associated variational formula is solvable.
To this end, we take N Γ sampling points Q i uniformly distributed on Γ . Let X = [ Q i ] i = 1 N Γ and Y = [ ϕ ( Q i ) ] i = 1 N Γ be the input and output sets, respectively, for training the DNN. The loss function for training the DNN is defined
Loss ( X ; θ ) = 1 N Γ i = 1 N Γ ϑ θ ( Q i ) ϕ ( Q i ) 2 , ϑ θ S D N N ,
where S D N N is a subspace (defined in Ω ) generated by a DNN with parameters θ = { W i , b i } ; W i , b i are the weights and bias in the DNN, respectively. A DNN function ϑ θ with certain parameter θ is obtained by solving the following minimization problem-based loss function (28):
min ϑ θ S D N N Loss ( X ; θ ) .
ϑ θ belongs to C 2 ( Ω ¯ ) if the aviation function is chosen as the Sigmoid function [62,63].
Similar with the definition of u ˜ (20) we define a function on Ω as
u ˜ θ = ϑ θ , in Ω 0 , 0 , in Ω 1 ,
and the variational formula based on u ˜ θ is proposed as follows:
Find U ˜ θ E ( Ω ) such   that B ( U ˜ θ , V ) = L ˜ θ ( V ) , V E ( Ω ) ,
where
L ˜ θ ( V ) = Ω f V d P + Ω 0 ( a 0 Δ u ˜ θ ) V d P + Ω g V d s Ω Ω 0 a 0 u ˜ θ n b V d s + Γ ψ a 0 u ˜ θ n Γ Γ V d s = Ω f V d P + Ω g V d s + Γ ψ V d s Ω 0 a 0 u ˜ θ · V d P ,
and the function u θ : = U θ + u ˜ θ serves as the approximation solution to u. We stress that (31) is not equivalent to the initial problem (3) exactly because according to (28) and (29),
[ u ˜ θ ] Γ = ϑ θ ( ) ϕ , on Γ .
This conforming error will be analyzed in the next section.
Unlike the variational problem (26), the problem (31) is computable because u ˜ θ is obtained using the DNN. The problem (31) is discretized using the SGFEM subspace for the CIP, and the associated variational problem is
Find U ˜ h θ S h such   that B ( U ˜ h θ , V h ) = L ˜ θ ( V h ) , V S h .
Finally, u h θ : = U ˜ h θ + u ˜ θ serves as the approximation solution to the initial problem (3), and the approximation error of u h θ will be analyzed mathematically in the next section. The formula (33) is called the SGFEM coupled with the DNN for the DIP due to the introduction of the DNN function u ˜ θ .
Remark 2.
The idea to transform the DIP into a CIP using the DNN is motivated by [52,54,55]. In [52,54,55], non-homogeneous boundary conditions are transformed into homogeneous ones using a shallow DNN, and the associated (approximately) homogeneous equations are solved by another DNN. This approach has advantages over conventional methods (e.g., FEM, GFEM) in that it is meshless and can solve problems with boundaries of complex geometries, and is powerful for high-dimensional problems. We couple the SGFEM for the CIP with the shallow DNN to address the DIP in this paper. We achieve that (a) all the advantages of SGFEM are maintained for the DIP, (b) the proposed SGFEM for the DIP is a conforming method free from any penalty terms, and (c) the method has great potential for geometrically complex interfaces, especially in three dimensions (reported in a forthcoming study).
We analyze the computational costs of the proposed method (33). First, the stiffness matrices of (33) are exactly the same as those of SGFEM for the CIP, and only the RHS (32) of (33) needs to be treated. Therefore, the assembling of stiffness matrices and the construction of RHS can be implemented separately or in parallel. Second, the computational dimension of learning ϑ θ based on (29) is one dimension less than the space dimension because the sampling points are located in the interface curve, and the learning time is very little in comparison with the assembling of stiffness matrices. Moreover, automatic differentiations are available in the existing DNN frameworks to save computational time. Therefore the proposed SGFEM coupled with the DNN is computationally efficient.
At the end of this section, we describe the structure of DNN used in (28) and (29). The DNN in this paper is a deep residual network (ResNet) [53,64,65] based on the full connection layers. Such a network structure was adopted in [53] to solve the PDEs. The ResNet is an improvement of the conventional DNN. The ResNet can fit high-dimensional functions better, and the fitting ability is not affected by network width. The ResNet can significantly speed training, increase pre-precision, reduce network degradation, and improve network characterization ability.
The ResNet was formally proposed in [64], which is obtained by stacking the residual blocks continuously. Each residual block has consisted of two fully connected layers, and its output is obtained by adding the output of the last layer and the input of the residual block. This structure leads to significant improvement in the training speed and approximation error. Let l be a positive integer. We set that there are l neurons in each layer of ResNet. Let z i R l be the vector consisted by excitation values of the l neurons in i-th layer, W i be a l × l weighted matrix, and b i R l be a real vector. Furthermore, let σ be an activation function. In a ResNet, the vectors z i + 1 and z i satisfy
z i + 1 = σ ( W i z i + b i ) ,
where b i is called the bias term. Let φ i ( z ) : = σ ( W i z + b i ) , z R l . Let B j be the output of j-th residual block, which satisfies that
B j + 1 : = B j + φ 2 j φ 2 j 1 ( B j ) .
Let ψ j + 1 ( z ) : = z + φ 2 j φ 2 j 1 ( z ) , z R l . The relationship between the input X and output Y of the ResNet with N residual blocks is
Y = φ 2 N + 1 ψ N ψ 2 ψ 1 ( X ) .
In our computations, we use a ResNet with three such blocks, see Figure 2. The parameters W i and b i for i { 1 , 2 , , 2 N + 1 } are derived by solving the minimization problem (29) based on the loss function (28). These parameters are updated using the back propagation algorithm based on gradients of the loss function with respect to the parameters. In this paper, we use the stochastic gradient descent (SGD) method [66,67] for solving (29). The sigmoid function [62,63] serves as the activation function.

5. Convergence Analysis

We prove the convergence of the SGFEM solution coupled with the DNN, u h θ , in this section. The approximation error of u h θ involves two parts: the SGFEM error and the learning error of DNN. For any ϑ M 2 (see (6)), let
ϑ r : = ϑ | Ω r , r = 0 , 1 ,
then ϑ r H 2 ( Ω r ) , r = 0 , 1 . Since we consider Γ is smooth, ϑ 0 and ϑ 1 can be continuously extended to the whole domain Ω to obtain functions ϑ ˜ 0 and ϑ ˜ 1 in H 2 ( Ω ) such that
ϑ ˜ r = ϑ r o n Ω r and ϑ ˜ r H 2 ( Ω ) C ϑ r H 2 ( Ω k ) , r = 0 , 1 ,
where C is a positive constant independent of h (see Theorem 1.4.5 in [68]).
Let ε represent the learning error level of u ˜ θ , i.e.,
u ˜ θ u ˜ L ( Γ ) = ϑ θ ( ϕ ) L ( Γ ) ε .
We first establish a relevant approximation result.
Lemma 1.
Suppose w H k + 1 ( Ω ¯ 0 ) , k 1 and w L ( Γ ) ε , then there is w ^ H 1 ( Ω ¯ 0 ) satisfying w ^ | Γ = 0 such that
w w ^ H 1 ( Ω ¯ 0 ) C ε k k + 1 | w | H k + 1 ( Ω ¯ 0 ) 1 k + 1 ,
where C is a constant independent of ε and k.
Proof. 
We divide Ω 0 using a quasi-uniform mesh fitting the interface Γ . The mesh-size parameter is denoted by l. We note that such a mesh is used only for obtaining associated estimates, and not used for actual computation. Let O j and ψ j , j χ l be the FE nodes and FE functions of degree k associated with the mesh, respectively, and
I l w = j χ l w ( O j ) ψ j
be the standard FE interpolant of w. The index set χ l is divided into χ l and χ l , which consist of indices of nodes on the interface Γ and in the interior of Ω 0 , respectively. It is easy to know that
I l w : = j χ l w ( O j ) ψ j
belongs to H 1 ( Ω ¯ 0 ) and satisfies I l w | Γ = 0 . Based on the error estimate of FEM interpolation [68] we have
w I l w H 1 ( Ω ¯ 0 ) w I l w H 1 ( Ω ¯ 0 ) + j χ l w ( O j ) ψ j H 1 ( Ω ¯ 0 ) C l k | w | H k + 1 ( Ω ¯ 0 ) + j χ l | w ( O j ) | ψ j H 1 ( Ω ¯ 0 ) C l k | w | H k + 1 ( Ω ¯ 0 ) + ε j χ l ψ j H 1 ( Ω ¯ 0 ) C l k | w | H k + 1 ( Ω ¯ 0 ) + ε j χ l C C l k | w | H k + 1 ( Ω ¯ 0 ) + C ε 1 l ,
where the last inequality is because the number of nodes on Γ , | χ l | , is O ( 1 l ) . Then letting l = ε | w | H k + 1 ( Ω ¯ 0 ) 1 k + 1 in (37) yields
w I l w H 1 ( Ω ¯ 0 ) C ε k k + 1 | w | H k + 1 ( Ω ¯ 0 ) 1 k + 1 + C ε k k + 1 | w | H k + 1 ( Ω ¯ 0 ) 1 k + 1 C ε k k + 1 | w | H k + 1 ( Ω ¯ 0 ) 1 k + 1 .
Let w ^ = I l w , then we obtain the desired estimate (36) from (38). □
Theorem 2.
Suppose that u M 2 E ( Ω ) is the solution to (3)–(5), and u h θ : = U ˜ h θ + u ˜ θ is the SGFEM approximation of u, where U h θ is the solution of associated CIP (33), and u ˜ θ is the DNN function (30) that approximates the discontinuous jump ϕ. The learning error level of u ˜ θ is represented in (35). Let u ˜ defined in (20) belongs to H k + 1 ( Ω ¯ 0 ) . Then there is a constant C that is independent of h, k, and ε such that
u u h θ E ( Ω ) C h U ˜ θ M 2 + C ε k k + 1 u ˜ u ˜ θ H k + 1 ( Ω ¯ 0 ) 1 k + 1 ,
where U ˜ θ is defined in (31).
Proof. 
Remembering that U ˜ = u u ˜ we have
u u h θ E ( Ω ) = ( U ˜ + u ˜ U ˜ θ u ˜ θ ) + ( U ˜ θ U ˜ h θ ) E ( Ω ) ( U ˜ + u ˜ U ˜ θ u ˜ θ ) E ( Ω ) + U θ U ˜ h θ E ( Ω ) .
According to (26) and (31) we obtain
B ( U ˜ + u ˜ , V ) = B ( U ˜ θ + u ˜ θ , V ) = 0 , V E ( Ω ) ,
and thus
B ( U ˜ + u ˜ U ˜ θ u ˜ θ , V ) = 0 , V E ( Ω ) .
Therefore, for arbitrary V E ( Ω )
U ˜ + u ˜ U ˜ θ u ˜ θ E ( Ω ) 2 = B ( U ˜ + u ˜ U ˜ θ u ˜ θ , U ˜ + u ˜ U ˜ θ u ˜ θ ) = B ( U ˜ + u ˜ U ˜ θ u ˜ θ , u ˜ u ˜ θ V ) U ˜ + u ˜ U ˜ θ u ˜ θ E ( Ω ) u ˜ u ˜ θ V E ( Ω ) .
Eliminating U ˜ + u ˜ U ˜ θ u ˜ θ E ( Ω ) on both sides of (41) we have
U ˜ + u ˜ U ˜ θ u ˜ θ E ( Ω ) u ˜ u ˜ θ V E ( Ω ) , V E ( Ω ) .
Let
V ^ = V , in Ω ¯ 0 , 0 , in Ω 1 ,
which also belongs to E ( Ω ) . Hence, we obtain from (42) that
U ˜ + u ˜ U ˜ θ u ˜ θ E ( Ω ) u ˜ u ˜ θ V ^ E ( Ω ) = u ˜ u ˜ θ V ^ E ( Ω ¯ 0 ) C u ˜ u ˜ θ V ^ H 1 ( Ω ¯ 0 ) .
Based on the learning error (35) and Lemma 1, there is a V ^ such that
U ˜ + u ˜ U ˜ θ u ˜ θ E ( Ω ) C ε k k + 1 u ˜ u ˜ θ H k + 1 ( Ω ¯ 0 ) 1 k + 1 .
On the other hand, it is noted that U ˜ h θ in (33) is the SGFEM solution to the CIP (31). Then according to the approximation result (18) for the CIP in Theorem 1 we have
U ˜ θ U ˜ h θ E ( Ω ) C h U ˜ θ M 2 .
Finally, we obtain from the estimates (40), (44), and (45)
u u h θ E ( Ω ) C h U ˜ θ M 2 + C ε k k + 1 u ˜ u ˜ θ H k + 1 ( Ω 0 ) 1 k + 1 ,
which is the desired result (39). □
In (39) u ˜ only needs the evaluations on Γ , i.e., u ˜ | Γ = ϕ , while its evaluations on Ω 0 are not required specifically. Therefore, the term u ˜ u ˜ θ H k + 1 ( Ω 0 ) in (39) can be replaced by
min u ˜ H k + 1 ( Ω ¯ 0 ) , u ˜ | Γ = ϕ u ˜ u ˜ θ H k + 1 ( Ω 0 ) ,
which could be small. In (39) k at least equals to 1, i.e., u ˜ H 2 ( Ω ¯ 0 ) . In fact, let u ˜ | Ω ¯ 0 = [ u ˜ 0 u ˜ 1 ] | Ω ¯ 0 , where u ˜ 0 and u ˜ 1 are the extensions of u 0 and u 1 (34), respectively, then u ˜ H 2 ( Ω ¯ 0 ) and u ˜ | Γ = ϕ . We mention that for the interface problem where Γ and ϕ are relatively smooth, u ˜ could have higher smoothness in Ω ¯ 0 , i.e, u ˜ H k + 1 ( Ω ¯ 0 ) , k > 1 . Theorem 2 means that the proposed SGFEM coupled with the DNN can obtain the optimal energy error O ( h ) if the DNN function u ˜ θ learns u ˜ ( ϕ ) on the interface Γ (not in the domains Ω 0 and Ω 1 ) accurately sufficiently, see (35). In the numerical experiments below, the optimal errors O ( h ) are observed for the same discretization parameters h as those in the SGFEM for the CIP.

6. Numerical Results

We consider the model problem (3) in a domain Ω = ( 0 , 1 ) 2 with straight and curved interfaces Γ for the numerical experiments. We test two cases of coefficients a ( P ) : (i) a 0 = 1 and a 1 = 10 , and (ii) a 0 = 1 and a 1 = 100 . Their contrasts are c = 10 and c = 100 , respectively. The manufactured exact solution u of (3) will be employed in the tests. The loading functions f , g of (3), the jump ϕ (4), and the flux ψ (5) can be calculated by using Equation (3) and the manufactured exact solution u.
The uniform n × n square FE mesh is used to discretize the domain Ω = ( 0 , 1 ) 2 with the mesh parameter h = 1 / n . The nodes associated with the mesh are denoted by { P i } i I h , where I h is the index set. Let { ϕ i } i I h be the bilinear FE functions associated with the nodes { P i } i I h .
We will test the standard FEM (8) and SGFEM (12) on the square mesh for the DIP, based on the variational formulation (31) incorporating the DNN function (20). We do not present the results of other GFEMs, such as the geometric and corrected GFEM. In the geometric GFEM, the SCN is of order O ( h 4 ) [23], which is much bigger than that of SGFEM. The corrected GFEM is not robust in the sense that the SCN gets bad as interface curves approach boundaries of elements [27]. We compute and compare the relative error of these methods in the energy norm (EE). The SCN of SGFEM has been shown to be of order O ( h 2 ) in [27], and we do not repeat it in this paper because the stiffness matrices for the DIP are the same as those for the CIP.
Setting for ResNet. As described at the end of Section 4, we use the ResNet for the DNN coupling in the tests. The ResNet structure consists of three residual blocks, each of which contains two full connection layers and one residual item, where each layer contains 20 neurons, see Figure 2. The sigmoid function [62,63] serves as the activation function. We use the stochastic gradient descent (SGD) method [66,67] for solving (29) in the learning process. The learning rate, the batch number, the number N Γ of sampling points on Γ for training the ResNet will be specified in the following Subsections.
Integration for discontinuous enrichments. We describe the numerical integration formula used in the computations. For elements that do not intersect the interface, we employ the standard 4 × 4 Gaussian rule. For elements cut by the interface, we connect the intersection points of the interface and the boundaries of an element by a straight line, and decompose the element into 4 to 6 sub-triangles, on each of which the standard 12-point Gaussian rule for triangles is employed. See Figure 3 for an example. We mention that a systematic study of the effect of numerical integration is not the objective of this work. We refer to for more details about the numerical integrations for the interface problems [3,5,22,69].
We now present our numerical results in the following sub-sections.

6.1. A Straight Interface Situation

We first consider a straight interface Γ , which has an equation y = tan ( θ 0 ) ( x 1 d ) + 1 with θ 0 = π 6 and d = 1 π . The manufactured solution of (3) is as follows:
u = M r α 0 cos ( α 0 ( θ + π θ 0 ) ) + N a 0 a 1 r α 0 sin ( α 0 ( θ + π θ 0 ) ) + sin ( x y ) , y > tan ( θ 0 ) ( x 1 d ) + 1 ( Ω 1 ) , r α 0 cos ( α 0 ( θ + π θ 0 ) ) + r α 0 sin ( α 0 ( θ + π θ 0 ) ) + sin ( x y ) , y tan ( θ 0 ) ( x 1 d ) + 1 ( Ω 0 ) ,
where α 0 = 4 3 , and ( r , θ ) is the polar coordinate at the center ( 1 + d , 1 ) . It can be checked that u is continuous across the interface Γ when M = N = 1 . In this example we take M = 100 and N = 100 , and u is discontinuous across Γ , i.e., [ u ] Γ = ϕ 0 (see (4)) and also the jump of flux ψ 0 (5). The mesh on the domain [ 0 , 1 ] × [ 0 , 1 ] is refined with h 1 = n = 2 j + 1 , j = 1 , 2 , , 7 . The interface Γ and a mesh with n = 16 are shown in Figure 4 Left, and the exact solution u with a 0 = 1 and a 1 = 100 is drawn in Figure 4 Right.
The number of sampling points uniformly for training the ResNet, N Γ , is taken as 200 in this situation. The learning rate η = 0.05 , and the batch number is 4096. The maximum relative error of DNN function u ˜ θ (30) to the jump ϕ on Γ , defined as
u ˜ θ ϕ L ( Γ ) ϕ L ( Γ ) ,
with respect to the iteration steps of SGD, is shown in Figure 5 Left. It shows that the error of the DNN function u ˜ θ is reduced by increasing the iteration steps of SGD. We also observed this by testing the different learning rates and batch numbers, and we do not exhibit them here. Therefore, it is easy to learn a DNN function u ˜ θ to reach the desired error level ε (35) in the Theorem 2 (39). Such an ε is small enough for the SGFEM to obtain the optimal energy error convergence order O ( h ) , see below for the energy errors.
The energy errors with respect to h of the FEM and SGFEM coupled with the DNN, are presented in Figure 5 Right for different contrasts c (10 and 100), where h 1 = n = 2 j + 1 , j = 1 , 2 , , 7 . It is shown in Figure 5 Right that the convergence orders of the FEM and SGFEM are O ( h 0.5 ) and O ( h ) , respectively. Therefore, it is concluded from this set of numerical experiments that the proposed SGFEM reaches the optimal convergence order for such a DIP, as predicted in the Theorem 2 (when ε is small).

6.2. A Curved Interface Situation

We next consider a curved interface Γ with an equation ( x x 0 ) 2 + ( y y 0 ) 2 = r 0 2 , where x 0 = 1 5 , y 0 = 1 3 , r 0 = 1 10 . In this case we consider the manufactured solution of (3) as follows:
u = 2 a 1 a 1 ρ r 0 2 ( a 1 a 0 ) r 0 4 r 2 cos ( 2 θ ) , r < r 0 ( Ω 0 ) , a 1 + a 0 a 0 ρ r 0 2 ( a 1 a 0 ) r 0 4 r 2 cos ( 2 θ ) + r 2 cos ( 2 θ ) , r r 0 ( Ω 1 ) ,
where ( r , θ ) is the polar coordinate at the center ( x 0 , y 0 ) . It can be verified that u is continuous across Γ for ρ = 0 . In this test, we take ρ = ( a 0 + a 1 ) / 2 , and [ u ] Γ = ϕ is non-zero. The interface Γ and a mesh with n = 16 are shown in Figure 6 Left, and the exact solution u with a 0 = 1 and a 1 = 100 is drawn in Figure 6 Right. The mesh on the domain [ 0 , 1 ] × [ 0 , 1 ] is refined with h 1 = n = 2 j + 1 , j = 1 , 2 , , 7 .
The number of sampling points uniformly for training the ResNet, N Γ , is taken as 500 in this situation. The learning rate η = 0.08 , and the batch number is 4096. The maximum relative error (46) of DNN function u ˜ θ (30) to the jump ϕ on Γ with respect to the iteration steps of SGD, is shown in Figure 7 Left. It shows that the error of the DNN function is reduced by increasing the iteration steps of SGD. We also observe this by testing the different learning rates and batch numbers, and we do not exhibit them here. Therefore, it is easy to obtain a DNN function u ˜ θ to reach the small error level ε in the Theorem 2.
We note that the DNN used in this paper is a stochastic method due to the SGD. In this example, to test the robustness of proposed method, we implement the learning algorithm (29) five times to generate the DNN functions u ˜ i θ (30), i = 1 , 2 , , 5 . For each u ˜ i θ , the energy errors E E i , h with respect to h of the coupled FEM and SGFEM are computed from (33). The means and standard deviations (STD) of these errors are defined by
EE ¯ h : = 1 5 i = 1 5 EE i , h , and S T D h : = 1 5 i = 1 5 ( E E i , h E E ¯ h ) 2 .
EE ¯ h with respect to h of the FEM and SGFEM are presented in Figure 7 Right for different contrasts c (10 and 100). EE ¯ h and S T D h with respect to h of SGFEM are listed in Table 1.
Figure 7 Right clearly shows that the convergence orders of the error means of FEM and SGFEM are still O ( h 0.5 ) and O ( h ) , respectively, as predicted. It is observed in Table 1 that the STDs of E E i , h are at relatively low levels. These mean that the optimal convergence order, in this case, can also be obtained by the proposed SGFEM, and moreover, the proposed SGFEM exhibits nice robustness with respect to the randomness of the DNN method. We also obtained similar results by testing the different curved interfaces with discontinuous solutions, and we do not present them here.

7. Conclusions and Comments

This paper proposed an SGFEM coupled with the DNN for solving elliptic interface problems with a discontinuous interface condition. In this scheme, the DNN is used to learn a function (approximately) satisfying the interface condition, which helps us formulate the original problems with a discontinuous interface condition into the one with a continuous interface condition. Based on this, the SGFEM for the CIP is applied to the DIP straightforwardly, and no penalty terms are needed. All the merits of SGFEM for the CIP are maintained, such as the optimal convergence, stability, and robustness. The approximation error of the proposed SGFEM coupled with the DNN was analyzed mathematically. For comparison, we performed numerical experiments with the FEM and SGFEM. The FEM converges with an order O ( h 0.5 ) only, while the SGFEM converges with the optimal order O ( h ) . The proposed method has great potential for the DIP with interfaces of complex geometries due to the meshless features of DNN. The extension of the results in this paper to the three-dimensional and parabolic interface problems will be investigated in future studies.

Author Contributions

Methodology, Q.Z.; validation, Q.Z., Y.J., and M.N.; data curation, M.N. and Y.J.; supervision, Q.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially supported by Key-Area Research and Development Program of Guangdong Province under grant 2021B0101190003 and Guangdong Provincial Natural Science Foundation of China under grant 2022A1515010831. This research was partially supported by the Natural Science Foundation of China under grant 11471343 and Guangdong Provincial Natural Science Foundation of China under grant 2022A1515011187.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Babuška, I.; Banerjee, U. Stable generalized finite element method. Comput. Methods Appl. Mech. Eng. 2011, 201–204, 91–111. [Google Scholar] [CrossRef] [Green Version]
  2. Babuška, I.; Banerjee, U.; Osborn, J. Survey of meshless and generalized finite element methods: A unified approach. Acta Numer. 2003, 12, 1–125. [Google Scholar] [CrossRef] [Green Version]
  3. Fries, T.P.; Belytschko, T. The extended/generalized finite element method: An overview of the method and its applications. Int. J. Numer. Meth. Eng. 2010, 84, 253–304. [Google Scholar] [CrossRef]
  4. Efendiev, Y.; Hou, T.Y. Multiscale Finite Element Methods: Theory and Applications; Springer: Cham, Switzerland, 2009. [Google Scholar]
  5. Bëchet, E.; Minnebo, H.; Moxexs, N.; Burgardt, B. Improved implementation and robustness study of the X-FEM method for stress analysis around cracks. Int. J. Numer. Meth. Eng. 2005, 64, 1033–1056. [Google Scholar] [CrossRef] [Green Version]
  6. Gupta, V.; Duarte, C.A.; Babuška, I.; Banerjee, U. A Stable and optimally convergent generalized FEM (SGFEM) for linear elastic fracture mechanics. Comput. Methods Appl. Mech. Eng. 2013, 266, 23–39. [Google Scholar] [CrossRef]
  7. Cui, C.; Zhang, Q. Stable generalized finite element methods (SGFEM) for elasticity crack problems. Int. J. Numer. Methods Eng. 2020, 121, 3066–3082. [Google Scholar] [CrossRef]
  8. Zhang, Q. DOF-gathering stable generalized finite element methods (SGFEM) for crack problems. Numer. Methods Partial. Differ. Equ. 2020, 36, 1209–1233. [Google Scholar] [CrossRef]
  9. Chessa, J.; Belytschko, T. An extended finite element method for two-phase fluids. J. Appl. Mech. 2003, 70, 10–17. [Google Scholar] [CrossRef]
  10. Hou, T.; Li, Z.; Osher, S.; Zhao, H. A hybrid method for moving interface problems with application to the hele-shaw flow. J. Comput. Phys. 1997, 134, 236–252. [Google Scholar] [CrossRef] [Green Version]
  11. Hansbo, A.; Hansbo, P. An unfitted finite element method, Based on Nitsche’s method, for elliptic interface problems. Comput. Methods Appl. Mech. Eng. 2002, 191, 5537–5552. [Google Scholar] [CrossRef] [Green Version]
  12. Shahzamaniana, M.M.; Lin, M.; Kainat, M.; Yoosef-Ghodsi, N.; Adeeb, S. Systematic literature review of the application of extended finite element method in failure prediction of pipelines. J. Pipeline Sci. Eng. 2021, 1, 241–251. [Google Scholar] [CrossRef]
  13. Elyasi, N.; Shahzamanian, M.; Lin, M.; Westover, L.; Li, Y.; Kainat, M.; Yoosef-Ghodsi, N.; Adeeb, S. Prediction of Tensile Strain Capacity for X52 Steel Pipeline Materials Using the Extended Finite Element Method. Appl. Mech. 2021, 2, 209–225. [Google Scholar] [CrossRef]
  14. Babuška, I.; Melenk, J.M. The partition of unity finite element method. Int. J. Numer. Meth. Eng. 1997, 40, 727–758. [Google Scholar] [CrossRef]
  15. Duarte, C.A.; Oden, J.T. An h-p adaptive method using clouds. Comput. Methods Appl. Mech. Eng. 1996, 139, 237–262. [Google Scholar] [CrossRef]
  16. Melenk, J.M.; Babuška, I. The partition of unity finite element method: Theory and application. Comput. Methods Appl. Mech. Eng. 1996, 139, 289–314. [Google Scholar] [CrossRef] [Green Version]
  17. Babuška, I.; Banerjee, U.; Kergrene, K. Strongly Stable Generalized Finite Element Method: Application to interface problems. Comput. Methods Appl. Mech. Eng. 2017, 327, 58–92. [Google Scholar] [CrossRef]
  18. Moës, N.; Cloirec, M.; Cartraud, P.; Remacle, J.F. A computational approach to handle complex microstructure geometries. Comput. Methods Appl. Mech. Eng. 2003, 192, 3163–3177. [Google Scholar] [CrossRef]
  19. Harari, I.; Dolbow, J. Analysis of an efficient finite element method for embedded interface problems. Comput. Math. 2010, 46, 205–211. [Google Scholar] [CrossRef]
  20. Zhang, Q.; Banerjee, U.; Babuška, I. Strongly Stable Generalized Finite Element Method (SSGFEM) for a non-smooth interface problem. Comput. Methods Appl. Mech. Eng. 2019, 344, 538–568. [Google Scholar] [CrossRef]
  21. Aragon, A.M.; Duarte, C.A.; Geubelle, P.H. Generalized finite element enrichment functionsfor discontinuous gradient fields. Int. J. Numer. Meth. Eng. 2010, 82, 242–268. [Google Scholar] [CrossRef]
  22. Cheng, K.W.; Fries, T.P. Higher-order XFEM for curved strong and weak discontinuities. Int. J. Numer. Meth. Eng. 2010, 82, 564–590. [Google Scholar] [CrossRef]
  23. Kergrene, K.; Babuška, I.; Banerjee, U. Stable Generalized Finite Element Method and associated iterative schemes: Application to interface problems. Comput. Methods Appl. Mech. Eng. 2016, 305, 1–36. [Google Scholar] [CrossRef] [Green Version]
  24. Legrain, G.; Moës, N.; Huerta, A. Stability of incompressible formulations enriched with X-FEM. Comput. Meth. Appl. Mech. Eng. 2008, 197, 1835–1849. [Google Scholar] [CrossRef] [Green Version]
  25. Kirchhart, M.; Gross, S.; Reusken, A. Analysis of an XFEM discretization for Stokes interface problems. SIAM J. Sci. Comput. 2016, 38, A1019–A1043. [Google Scholar] [CrossRef] [Green Version]
  26. Díez, P.; Cottereau, R.; Zlotnik, S. A stable extended FEM formulation for multi-phase problems enforcing the accuracy of the fluxes through Lagrange multipliers. Int. J. Numer. Meth. Eng. 2013, 96, 303–322. [Google Scholar] [CrossRef] [Green Version]
  27. Zhu, P.; Zhang, Q.; Liu, T. Stable generalized finite element method (SGFEM) for parabolic interface problems. J. Comput. Appl. Math. 2020, 367, 112475. [Google Scholar] [CrossRef]
  28. Zhang, Q.; Cui, C.; Banerjee, U.; Babuska, I. A condensed generalized finite element method (CGFEM) for interface problems. Comput. Methods Appl. Mech. Eng. 2022, 391, 114537. [Google Scholar] [CrossRef]
  29. Zilian, A.; Legay, A. The enriched space-time finite element method (EST) for simultaneous solution of fluid-structure interaction. Int. J. Numer. Meth. Eng. 2008, 75, 305–334. [Google Scholar] [CrossRef] [Green Version]
  30. Lehrenfeld, C.; Reusken, A. Analysis of a Nitsche XFEM-DG discretization for a class of two-phase mass transport problems. SIAM J. Numer. Anal. 2013, 51, 958–983. [Google Scholar] [CrossRef] [Green Version]
  31. Sauerland, H.; Fries, T.P. The extended finite element method for two-phase and free-surface flows: A systematic study. J. Comput. Phys. 2011, 230, 3369–3390. [Google Scholar] [CrossRef]
  32. Sauerland, H.; Fries, T.P. The stable XFEM for two-phase flows. Comput. Fluids 2013, 87, 41–49. [Google Scholar] [CrossRef]
  33. Fries, T.P.; Zilian, A. On time integration in the XFEM. Int. J. Numer. Meth. Eng. 2009, 79, 69–93. [Google Scholar] [CrossRef]
  34. Gross, S.; Ludescher, T.; Olshanskii, M.; Reusken, A. Robust preconditioning for XFEM applied to time-dependent Stokes problems. SIAM J. Sci. Comput. 2016, 38, A3492–A3514. [Google Scholar] [CrossRef] [Green Version]
  35. Loehnert, S. A stabilization technique for the regularization of nearly singular extended finite elements. Comput. Mech. 2014, 54, 523–533. [Google Scholar] [CrossRef]
  36. Menk, A.; Bordas, S.P.A. A robust preconditioning technique for the extended finite element method. Int. J. Numer. Meth. Eng. 2011, 85, 1609–1632. [Google Scholar] [CrossRef] [Green Version]
  37. Lang, C.; Makhija, D.; Doostan, A.; Maute, K. A simple and efficient preconditioning scheme for heaviside enriched XFEM. Comput. Mech. 2014, 54, 1357–1374. [Google Scholar] [CrossRef]
  38. Schweitzer, M.A. Stable enrichment and local preconditioning in the particle-partition of unity method. Numer. Math. 2011, 118, 137–170. [Google Scholar] [CrossRef]
  39. Zhang, Q.; Banerjee, U.; Babuska, I. High order stable generalized finite element methods. Numer. Math. 2014, 128, 1–29. [Google Scholar] [CrossRef]
  40. Zhang, Q.; Babuška, I. A stable generalized finite element method (SGFEM) of degree two for interface problems. Comput. Methods Appl. Mech. Eng. 2020, 363, 112889. [Google Scholar] [CrossRef]
  41. Hansbo, P.; Larson, M.G.; Zahedi, S. A cut finite element method for a Stokes interface problem. Appl. Numer. Math. 2014, 85, 90–114. [Google Scholar] [CrossRef] [Green Version]
  42. Lin, T.; Lin, Y.; Zhang, X. Partially penalized immersed finite element methods for elliptic interface problems. SIAM J. Numer. Anal. 2015, 53, 1121–1144. [Google Scholar] [CrossRef] [Green Version]
  43. Afraites, L.; Dambrine, M.; Kateb, D. Shape methods for the transmission problem with a single measurement. Numer. Funct. Anal. Optim. 2007, 28, 519–551. [Google Scholar] [CrossRef]
  44. LeVeque, R.; Li, Z. Immersed interface methods for Stokes flow with elastic boundaries or surface tension. SIAM J. Sci. Comput. 1997, 18, 709–735. [Google Scholar] [CrossRef] [Green Version]
  45. Li, Z.; Wang, W.; Chern, I.L.; Lai, M. New formulations for interface problems in polar coordinates. SIAM J. Sci. Comput. 2003, 25, 224–245. [Google Scholar] [CrossRef] [Green Version]
  46. Adjerid, S.; Babuska, I.; Guo, R.; Lin, T. An enriched immersed finite element method for interface problems with nonhomogeneous jump conditions. arXiv 2021, arXiv:2004.13244. [Google Scholar]
  47. Barrett, J.W.; Elliott, C.M. Fitted and unfitted finite-element methods for elliptic equations with smooth interfaces. IMA J. Numer. Anal. 1987, 7, 283–300. [Google Scholar] [CrossRef]
  48. Huang, P.; Wu, H.; Xiao, Y. An unfitted interface penalty finite element method for elliptic interface problems. Comput. Methods Appl. Mech. Eng. 2017, 323, 439–460. [Google Scholar] [CrossRef]
  49. Zhu, L.; Zhang, Z.; Li, Z. An immersed finite volume element method for 2D PDEs with discontinuous coefficients and non-homogeneuos jump conditions. Comput. Math. Appl. 2015, 70, 89–103. [Google Scholar] [CrossRef]
  50. Sirignano, J.; Spiliopoulos, K. DGM: A deep learning algorithm for solving partial differential equations. J. Comput. Phys. 2018, 375, 1339–1364. [Google Scholar] [CrossRef] [Green Version]
  51. Dockhorn, T. A discussion on solving partial differential equations using neural networks. arXiv 2019, arXiv:1904.07200. [Google Scholar]
  52. Berg, J.; Nyström, K. A unified deep artificial neural network approach to partial differential equations in complex geometries. Neurocomputing 2018, 317, 28–41. [Google Scholar] [CrossRef] [Green Version]
  53. Yu, W.E.B. The deep ritz method: A deep learning-based numerical algorithm for solving variational problems. Commun. Math. Stat. 2018, 8, 1–12. [Google Scholar]
  54. Sheng, H.; Yang, C. Pfnn: A penalty-free neural network method for solving a class of second order boundary-value problems on complex geometries. J. Comput. Phys. 2021, 428, 110085. [Google Scholar] [CrossRef]
  55. Wang, Z.; Zhang, Z. A mesh-free method for interface problems using the deep learning approach. J. Comput. Phys. 2020, 400, 108963. [Google Scholar] [CrossRef]
  56. Liao, Y.; Ming, P. Deep nitsche method: Deep ritz method with essential boundary conditions. Commun. Comput. Phys. 2021, 29, 1365–1384. [Google Scholar]
  57. Anitescu, C.; Atroshchenko, E.; Alajlan, N.; Rabczuket, T. Artificial neural network methods for the solution of second order boundary value problems. Comput. Mater. Contin. 2019, 59, 345–359. [Google Scholar] [CrossRef] [Green Version]
  58. Samaniego, E.; Anitescu, C.; Goswami, S.; Nguyen-Thanh, V.M.; Guo, H.; Hamdia, K.; Zhuang, X.; Rabczuk, T. An energy approach to the solution of partial differential equations in computational mechanics via machine learning: Concepts, implementation and applications. Comput. Methods Appl. Mech. Eng. 2020, 362, 112790. [Google Scholar] [CrossRef] [Green Version]
  59. Krishnapriyan, A.; Gholami, A.; Zhe, S.; Kirby, R.; Mahoney, M.W. Characterizing possible failure modes in physics-informed neural networks. Adv. Neural Inf. Process. Syst. 2021, 34, 26548–26560. [Google Scholar]
  60. Yang, Z.; Baraldi, P.; Zio, E. A multi-branch deep neural network model for failure prognostics based on multimodal data. J. Manuf. Syst. 2021, 59, 42–50. [Google Scholar] [CrossRef]
  61. Meyes, R.; Donauer, J.; Schmeing, A.; Meisen, T. A Recurrent Neural Network Architecture for Failure Prediction in Deep Drawing Sensory Time Series Data. Procedia Manuf. 2019, 34, 789–797. [Google Scholar] [CrossRef]
  62. Cybenko, G. Approximation by superposition of a sigmoidal function. Math. Control. Signals Syst. 1989, 2, 303–314. [Google Scholar] [CrossRef]
  63. Chandra, P.; Singh, Y. Feedforward sigmoidal networks-equicontinuity and fault-tolerance properties. IEEE Trans. Neural Netw. 2004, 15, 1350–1366. [Google Scholar] [CrossRef] [PubMed]
  64. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and PATTERN recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  65. Li, H.; Zhang, Q.; Chen, X. Deep Learning-Based Surrogate Model for Flight Load Analysis. Cmes Comput. Model. Eng. Sci. 2021, 128, 605–621. [Google Scholar] [CrossRef]
  66. Bottou, L. Large-scale machine learning with stochastic gradient descent. Proceedings of COMPSTAT 2010; Springer: Cham, Switzerland, 2010; pp. 177–186. [Google Scholar]
  67. Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
  68. Brenner, S.C.; Scott, L.R. The Mathematical Theory of Finite Element Methods, 3rd ed.; Springer: New York, NY, USA, 2008. [Google Scholar]
  69. Fries, T.P.; Omerović, S. Higher-order accurate integration of implicit geometries. Int. J. Numer. Meth. Eng. 2016, 106, 323–371. [Google Scholar] [CrossRef]
Figure 1. The domains with a curved interface (Left) and a straight interface (Right).
Figure 1. The domains with a curved interface (Left) and a straight interface (Right).
Axioms 11 00384 g001
Figure 2. The ResNet used in the tests.
Figure 2. The ResNet used in the tests.
Axioms 11 00384 g002
Figure 3. Numerical integrations on an element [ a , b ] × [ c , d ] cut by the interface Γ , where the element is divided into several triangles, and the Gaussian rule is used on each triangle.
Figure 3. Numerical integrations on an element [ a , b ] × [ c , d ] cut by the interface Γ , where the element is divided into several triangles, and the Gaussian rule is used on each triangle.
Axioms 11 00384 g003
Figure 4. The domains with a straight interface (Left) and a discontinuous solution u (Right) of DIP ( a 0 = 1 , a 1 = 100 ).
Figure 4. The domains with a straight interface (Left) and a discontinuous solution u (Right) of DIP ( a 0 = 1 , a 1 = 100 ).
Axioms 11 00384 g004
Figure 5. The DIP with a straight interface: the maximum relative errors (46) of DNN function u ˜ θ (Left) and the energy errors with respect to h of FEM and SGFEM (Right); the contrasts c are 10 and 100.
Figure 5. The DIP with a straight interface: the maximum relative errors (46) of DNN function u ˜ θ (Left) and the energy errors with respect to h of FEM and SGFEM (Right); the contrasts c are 10 and 100.
Axioms 11 00384 g005
Figure 6. The domains with a curved interface (Left) and a discontinuous solution u (Right) of DIP ( a 0 = 1 , a 1 = 100 ).
Figure 6. The domains with a curved interface (Left) and a discontinuous solution u (Right) of DIP ( a 0 = 1 , a 1 = 100 ).
Axioms 11 00384 g006
Figure 7. The DIP with a curved interface: the maximum relative errors (46) of DNN function u ˜ θ (Left) and the means of energy errors with respect to h of FEM and SGFEM (Right); the contrasts c are 10 and 100.
Figure 7. The DIP with a curved interface: the maximum relative errors (46) of DNN function u ˜ θ (Left) and the means of energy errors with respect to h of FEM and SGFEM (Right); the contrasts c are 10 and 100.
Axioms 11 00384 g007
Table 1. The means and STDs of errors with respect to h of SGFEM: the curved interface situation.
Table 1. The means and STDs of errors with respect to h of SGFEM: the curved interface situation.
h c = 10 c = 100
MeanSTDMeanSTD
2.5000 × 10 01 2.1241 × 10 01 4.0512 × 10 04 2.3789 × 10 01 7.9756 × 10 03
1.2500 × 10 01 1.1036 × 10 01 3.0925 × 10 04 1.3384 × 10 01 8.4100 × 10 03
6.2500 × 10 02 5.7113 × 10 02 1.4862 × 10 04 7.4277 × 10 02 5.5238 × 10 03
3.1250 × 10 02 2.9479 × 10 02 8.1748 × 10 05 3.8006 × 10 02 2.9354 × 10 03
1.5625 × 10 02 1.5016 × 10 02 4.3178 × 10 05 1.9107 × 10 02 1.4815 × 10 03
7.8125 × 10 03 7.5967 × 10 03 2.0323 × 10 05 9.5761 × 10 03 7.4419 × 10 04
3.9063 × 10 03 3.8242 × 10 03 1.0329 × 10 05 4.8098 × 10 03 3.7257 × 10 04
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Jiang, Y.; Nian, M.; Zhang, Q. A Stable Generalized Finite Element Method Coupled with Deep Neural Network for Interface Problems with Discontinuities. Axioms 2022, 11, 384. https://doi.org/10.3390/axioms11080384

AMA Style

Jiang Y, Nian M, Zhang Q. A Stable Generalized Finite Element Method Coupled with Deep Neural Network for Interface Problems with Discontinuities. Axioms. 2022; 11(8):384. https://doi.org/10.3390/axioms11080384

Chicago/Turabian Style

Jiang, Ying, Minghui Nian, and Qinghui Zhang. 2022. "A Stable Generalized Finite Element Method Coupled with Deep Neural Network for Interface Problems with Discontinuities" Axioms 11, no. 8: 384. https://doi.org/10.3390/axioms11080384

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop