1. Introduction
The basic assumption about the fractional diffusion phenomenon is that the Brownian motion hypothesis is violated. In the case of isotropic homogeneous media, such processes are described by the fractional Laplacian. A natural and easy-to-comprehend presentation of fractional Laplacian in the whole space
,
can be derived through the Fourier transform. Serious (not only computational) challenges appear when the equation is posed in a bounded domain
, equipped with the correct boundary conditions, and the related boundary value problem is considered. There exist at least two different and not equivalent definitions of fractional Laplacian [
1,
2]. They follow the spectral and the Riesz formulations, respectively. Some recent comparison results between those two approaches are presented in [
3], where the difference of the asymptotes of boundary layers is analyzed in detail.
In this paper, we consider the spectral fractional elliptic equation with power
, which is in the form
where
is a self-adjoint elliptic operator in
, satisfying homogeneous Dirichlet boundary conditions. The non-local operator
is defined via the spectral decomposition of
, namely
where
are the eigenvalues, respectively,
are the corresponding normalized eigenfunctions of
, and
is the
inner product.
Let
be a uniform rectangular mesh, and the finite difference method (FDM) using a standard
-stencil is applied for approximating the operator
. Then, the FDM discretization of (
1) leads to a system of linear algebraic equations with respect to the mesh functions
and
in the form
Here,
is a sparse, symmetric and positive definite (SPD) matrix, and
As in the continuous case, is the spectrum of , and the eigenvectors are normalized with respect to the Euclidean dot product . Similar construction is applicable when the finite element method (FEM), using a quasi uniform triangulation , is applied. In this case, , where and are the FEM stiffness and mass matrices, respectively, and is SPD with respect to the scalar product generated by the mass matrix. In what follows, we assume that linear finite elements are used.
The recent achievements in the numerical solution of spectral fractional elliptic equations are determined by the class of methods that are applicable to multidimensional domains
of general geometry; see [
4,
5], the references therein, as well as the most recent paper [
6]. Although there are several rather different approaches there, all the obtained algorithms reduce the original non-local problem to solving
k auxiliary linear systems with sparse SPD matrices that can be expressed as some diagonal perturbations of
in the form
,
,
. Thus, all these methods can be interpreted as certain rational approximations of degree
k [
4,
7], i.e.,
. In this context, the advantages of the BURA method [
4,
5,
8,
9] follow directly by its definition, as being the best uniform rational approximation
of degree
k for
in
.
Introducing the BURA method [
9], the modified Remez algorithm was applied to derive
. However, it faces serious difficulties regarding the computational stability for larger
k and smaller
. The results obtained in the last few years largely overcome this problem. For example, in [
7], the Chebfun implementation of the adaptive Antoulas-Anderson (AAA) algorithm is applied, where the representation of the rational approximant in barycentric form and greedy selection of the support points is used to approximate
for
. Further progress in this direction has been made in [
10] (see [
11] for the related software implementation). The algorithm (BRASIL) is based on the introduced barycentric rational interpolation. The new algorithms and software tools for stable computation of best uniform rational approximations for larger degrees
k provide promising opportunities with respect to further expansions of the application of the BURA technology and its better understanding and interpretation.
The next four publications [
12,
13,
14,
15] are in the spirit of further development of methods based on best uniform rational approximations.
Solving time-fractional differential equations is considered in [
13], where the spectrum of the fractional Laplace kernel is approximated with a rational function, applying the AAA algorithm. Stability and convergence properties of the proposed numerical scheme are studied. Moreover, the developed algorithm is efficiently applied to a time-fractional Cahn–Hilliard problem.
A class of Reduced Basis Methods (RBM) is studied in [
12]. Based on the developed theory, the equivalence between RBM and Rational Krylov Methods (RKM) is analyzed in [
14]. Then a unified RKM framework for the spectral fractional in space and fractional in time problems is proposed in [
15].
The methods discussed above have a strong impact on the development of efficient solution methods for fractional diffusion problems beyond the scalar elliptic case (see, e.g., [
4]). Such results are presented in the more recent papers [
16,
17], where nonlinear and time-dependent fractional diffusion-in-space problems are considered, respectively. In this spirit, the results are also presented in [
18], where a tensor numerical method for optimal control problems constrained by a fractional elliptic operator is developed.
In many studies, the needed degree
k for obtaining the targeted accuracy of
is considered as a measure for the computational efficiency of the method. This holds true under the assumption that the complexity of the method is
. In the case of a large scale unstructured matrix
, this means that some iterative method of optimal complexity
is used for solving the auxiliary systems. Obviously, these auxiliary systems possess very different properties. More recently [
19], it was found that for larger
k, part of the coefficients
can be extremely large. This observation motivates the proposed reduced sum modification of the additive BURA method, called BURA-AR.
In this paper, we present new theoretical characterizations of the BURA parameters, which gives theoretical justification of the BURA-AR method and the newly proposed reduced product multiplicative BURA-MR method. In this way, a significant improvement in computational complexity is achieved. The presented numerical results can be used as direct practical receipts for optimizing the computational complexity, depending on the fractional power . The new theoretical and experimental results raise the question of whether the current state-of-the-art almost-optimal estimate of the computational complexity of the BURA method in the form can be improved.
The rest of the paper is organized as follows. Some basic definitions, error estimates, and additive and multiplicative representations of the BURA algorithm are given in
Section 2. BURA’s stabilized calculations for larger
k are discussed in the next section. It includes a comparative analysis of the theoretical error estimates and the accuracy of BURA, computed using the BRASIL software. Important theoretical results are presented in
Section 4. The obtained characterization of the BURA parameters includes interlacing inequalities and asymptotic cluster analysis. The BURA-AR and BURA-MR methods are introduced in the next section. It is shown how to optimize the new parameter
l in order to obtain the best computational complexity for a given target accuracy and given a priori estimates of the extreme eigenvalues
and
. The numerical results presented in
Section 6 confirm the efficiency of the developed computational technology. The considered large-scale 3D numerical tests illustrate the influence of the fractional power
depending on the smoothness of the right-hand side
. Brief concluding remarks are given at the end.
2. The BURA Method
The abbreviation BURA stands for best uniform rational approximation. Let us consider the min-max problem: find
such that
where
,
and
are polynomials of degree
k. Then the error
of the
k-BURA element
is defined as
A sharp estimate of
is derived in [
20]:
Following [
5] we introduce the approximation of
in the form
and then
where
is the BURA numerical solution of the linear algebraic system (
3). In other words,
is the BURA approximation of the solution
of the fractional elliptic Equation (
1).
The following error estimate of the BURA method holds true.
Property 1 (see [
5] p. 19).
Let , and let the lumped mass linear finite elements for discretization in space be used. Thenand thereforewith , and C independent of h and k. Here is the FEM function corresponding to the BURA approximation . Remark 1. The equivalence of the lumped linear FEM and FDM discretizations on uniform rectangle meshes allows obtaining analogues of the error estimates (9)–(11) for the case of the finite difference method. As noted in [
5], for
,
, the estimates (
8) and (
9) remain valid provided that
is replaced by
. In this way, in the more interesting three-dimensional case, one can obtain the error bounds
that is
where
.
An additive or multiplicative representation of the rational function
can be used for implementation of the BURA method. Till now, the first one is preferred, but in this paper we will analyze both of them independently. The additive representation relies on the partial fraction decomposition of
, while the multiplicative one deals with the factorization of
:
where
and
. In these ways, the BURA method reads as
Thus, the BURA method for numerical solution of the fractional differential Equation (
1) requires the solving of
k auxiliary linear systems with sparse SPD matrices, which are positive diagonal shifts of the FEM/FDM matrix
.
The unified view of the methods for solving Equation (
1) as a rational approximation [
4,
7] leads to a general understanding of the degree
k as a measure for computational efficiency. Thus, for the computational complexity of the BURA method, we accept that the following asymptotic estimate holds true:
This is valid if a method with optimal computational complexity is used to solve the auxiliary linear systems that appear in (
13). In the multidimensional case, this means that such an iterative solver is applied. In our numerical tests, we used the BoomerAMG implementation from HYPRE [
21] of the algebraic multigrid preconditioner in the PCG framework.
The further analysis is based on the error estimates (
9)–(
11). From there, choosing properly the mesh parameters
h and the BURA order
k, the contributions of the discretization and the BURA approximation to the total error are balanced. In this way, the following almost optimal computational complexity of the BURA algorithms is obtained
Improving the computational efficiency of the BURA method is the central focus of this paper. The analysis of the BURA parameters, and in particular of the coefficients
, determining the diagonal shifts of
in (
13), is behind the developed approach.
3. BURA: Stabilized Computations for Large k
In [
4], a modified Remez algorithm (see [
22]) was used for the computation of the BURA element of a degree up to ten. This algorithm is very time consuming and because of the difficulties related to the computational stability for large
k’s in [
19], we used the Best Rational Approximation by Successive Interval Length (BRASIL) [
11] to get the BURA of degree 85. Here, we computed the error of the rational approximation of the function
in the interval
obtained using the BRASIL software version 1.2.1. We used the following options in baryrat.brasil:
tol | convergence criterion tolerance; |
npi | 30 iterations of golden section search per interval; |
maxiter =10,000 | the maximum number of iterations. |
The BRASIL algorithm is based on a barycentric representation of the rational approximation of the form
In order to obtain the coefficients of the simple additive representation (
12), we first find the roots
and the poles
of the above barycentric representation. This is already implemented in the
baryrat package by solving a corresponding eigenvalue problem using arithmetical precision of 100 decimal places (provided by the
mpmath Python package). Since we are interested in
our roots and poles will be
and
, respectively. Therefore, we can straightforwardly compute
. To find the values of the
coefficients in (
12), we then solve the following linear system
For the solution of the above linear system, we have also used the mpmath package with a precision of 100 decimal places.
Let us also note that here we are using a modified convergence criterion for the BRASIL algorithm to ensure that the algorithm converges for large values of
k without tinkering with the tolerance. The BRASIL algorithm first computes the errors
for each interval between the interpolation points. Convergence is then decided, based on comparing the deviation of the errors in each interval to the tolerance
The issue with the above is that
has to be changed for different values of
k. When the errors become very small, the arithmetical error in computing the deviation becomes significant and the algorithm stops converging for small values of
. Instead of only looking at the deviation, we are checking the product of the maximum error and the deviation
The idea is to automatically get as close to the BURA as double-precision floating point arithmetics would allow.
The errors for the values of
are presented in
Figure 1. We compare these errors with the theoretical estimate (
6) of the BURA error from [
20]. It is easy to see that the computed errors are very close to the theoretical estimate when the estimate is greater than the double-precision accuracy. The comparison shows that the theoretical estimate of the error is sharp.
5. The BURA-AR and the BURA-MR Methods
In
Figure 2 and
Figure 3, we illustrate the behavior of the poles
of
in (
25), where
and
, respectively.
One can see that for , the first part of the poles for consists of extremely large values (for first nine coefficients are bigger than ). For the situation is: first 21 coefficients are bigger than for and for eight coefficients are bigger than . Furthermore, the asymptotic of the poles clustering towards , based on Theorem 1(c), is also well represented, as the order of the first pole for is approximately times larger than the order of the first pole for and approximately times larger than the order of the first pole for .
For extremely large values of
, the condition number of the matrices
, involved in (
13), is practically equal to one. Of course, we do not need any preconditioning for such well-conditioned matrices. Moreover, even solving such systems may not be the right approach. This motivates us to investigate potential reduction on the number of linear systems to be solved without losing BURA accuracy.
Let
be chosen such that
is very large for
. Then we can define the following rational approximations of
, both of degree
:
Lemma 3. The following relations hold true for every , , and :for all intermediate values . Proof. The first two relations
for
are trivial. Now, let
be fixed. First we show that
. Indeed, following the notation from
Section 4.1 let us define
Then
, while
. Applying (
23), we obtain
since
and
for
due to (
17). Therefore
Next, via expressing the two functions in the form
and applying (
23) one more time, we conclude that for all
and every
due to (
27) and
. Finally, for the function
we know that
The proof is completed. □
Let
be given. We can then introduce the error indicators
Both functions and share the same poles and the error analysis performed later in this section indicates that both reduction procedures could be used in practical applications. They do not only stabilize the numerical computations but also improve the efficiency of the BURA solver.
Motivated by the above observation, we replace the
in the BURA method for solving the spectral fractional elliptic Equation (
1) by either
or
and obtain the following BURA-AR and BURA-MR formulations
Using the BURA-AR and BURA-MR methods, we reduce the number of linear systems that are to be solved from k to , and thus we decrease the computational complexity of the method.
5.1. BURA-AR Error Analysis
Throughout this section, let
,
, and
k be fixed. We are interested in analyzing the error indicator
. By triangle inequality, we have
so we need to estimate the second term. Applying (
27) and Lemma 2, we derive
Hence, we proved the following theorem:
Theorem 2. Let and . If , such that , then As a byproduct, we also derived that whenever
meaning that for fixed
and
k, the error indicator
is a monotonically increasing function with respect to
ℓ and a monotonically decreasing function with respect to
. The proof relies on the observation that
,
, proven in an identical way as
above, together with (
27) and (
18) that imply
Note that while
, the reduction error between the BURA-AR method and the original BURA method is dominated by the BURA approximation error
, thus the reduction process does not affect the overall accuracy of the solver in any way. In practice, we observe that due to the large difference in the orders of
for small
i, the multiplier
l can be omitted and
Corollary 2. Let be the solution of the linear algebraic system (
3)
and be the additive reduced BURA solution, corresponding to (
28)
with . Then Example 1. Consider the fractional Laplacian problem with homogeneous Dirichlet boundary conditions in the d dimensional unit cube, . Take the finite difference or lumped finite element discretization of the problem on a uniform mesh with mesh-size h. It is well-known that the condition number of the matrix generated in this way for the corresponding algebraic Equation (
3)
behaves likeindependent of the space dimensionality of the problem. According to (
8)
, in order to balance the discretization and the approximation BURA error, there is no practical point to take larger k than the one that guarantees . Now, in order to balance the reduction error with the approximation one, applying Corollary 2, we conclude that all systems, corresponding to diagonal shifts , such thatcan be reduced in the numerical computation. Here, following (27), we used that . In particular, for and we can reduce all systems corresponding to diagonal shifts that are higher or equal to , respectively. The results are summarized in Table 1. We observe that the BURA-AR method can reduce more than of the systems to be solved without influencing the overall BURA error order when . As α increases, the impact of BURA-AR on the computational efficiency decreases. Thus, for the considered example, the results in Table 1 show that for , the recommended reduction is . This is in agreement with the significantly higher accuracy of BURA for , see Figure 1. In other words, smaller k is needed to ensure a certain accuracy, and then a smaller reduction is applicable. Furthermore, for every , we can reduce all systems with diagonal shifts larger than , as long as , which usually holds true in practical applications.
5.2. BURA-MR Error Analysis
Again, let
,
, and
k be fixed. We are interested in analyzing the error indicator
. By triangle inequality, we have
where
. Since
(see (
27)), and
is strictly monotonically increasing for
we conclude that
where the constants
are uniformly bounded for fixed
k. Moreover, (
19) suggests that the zeros and the poles of
are grouped into pairs, meaning that
, thus
and
. Analogously to
Section 5.1, we proved the following theorem:
Theorem 3. Let and . If , such that , then where the constant depends solely on k.
Again, as a byproduct, we also derived that whenever
meaning that for fixed
and
k, the error indicator
is a monotonically increasing function with respect to
ℓ and a monotonically decreasing function with respect to
.
Corollary 3. Let be the solution of the linear algebraic system (3) and be the multiplicative reduced BURA solution, corresponding to (28) with . Then 5.3. BURA-AR and BURA-MR Comparison
To summarize, the following theoretical estimates and relations were established in
Section 5:
Theorem 4. Let and . Consider the BURA element and take arbitrary , where is the extreme point of before 1. Then
Therefore, from a theoretical approximating point of view, the BURA-AR method always outperforms the BURA-MR method, giving rise to smaller errors for a fixed number of systems to be solved. Furthermore, in the BURA-MR method, apart from the
algebraic systems, there are also
matrix-vector multiplications (see (
28)). On top of that, the BURA-AR method allows for parallel, independent solutions of the algebraic systems, while the BURA-MR method is a sequential one, as the solution of the previous system becomes the right-hand side of the next one.
However, due to the interlacing property (
17) and (
27), the BURA-MR method seems to be numerically stable, independently of the value of
ℓ. Moreover, based on the conducted numerical experiments, we observe that the difference
decays as
ℓ increases, so from an application point of view, the two methods are equally useful. This topic will be addressed in the next section, where various experimental data are generated and analyzed.
6. Numerical Experiments
In this section, two classes of numerical experiments are considered. The first one aims at numerical validation of the theoretical foundations for the error indicator
, derived in
Section 5. Therefore, the experiments are in 1D and involve numerical and visual comparison of
and
in the interval
. A large value for
k, namely
is considered so that the reduction benefits of BURA-AR are better illustrated. The decreasing effect of BURA-AR when
increases has already been commented on; therefore, the focus here is mostly on
. Here, and in what follows, the case
and
are included for completeness. The numerical data are summarized in
Table 2.
For , and the accuracy remains the same for all values of ℓ in the interval . The error doubles around and then starts to exponentially grow. Such a behavior perfectly agrees with Theorem 2 and experimentally confirms that as long as , the reduction process does not affect the order of . Indeed, we have that , thus for we should be safe. Checking the poles of we get that for , while for . For and the accuracy is not changed for all values of ℓ in the interval . Again, this corresponds to , which holds true for all .
For
and
, the observations are the same. For
, as long as
, the reduction does not affect the overall accuracy. This relation holds true for
. Then
for
and
slowly increases. Finally, from the moment
,
starts to exponentially grow. Analogously, for
, the “critical order” for the poles is 16, with
for
, which agrees with the corresponding numbers in
Table 2.
Concerning and , for , the accuracy is preserved for , while for , it does not affect the computational efficiency.
Figure 4 shows a 1D comparison between the behavior of BURA-AR and BURA approximations for
,
, and for
,
.
It can be seen that even in the case , the accuracy for and is and , respectively. The values and are the minimum values that guarantee that all remaining poles of and , respectively, are within the double-precision limit . The monotonically increasing behavior of when ℓ increases, documented in Theorem 4, is well illustrated especially on the right plots, where .
The second class of experiments is devoted to the 3D fractional Laplace problem with homogeneous Dirichlet boundary conditions in the unit cube
. Uniform mesh with mesh size
h is considered, and matrix
in (
3) is generated via lumped mass linear finite elements discretization.
In the implementation of the additive version of BURA, we use equivalent representation of (
13) in the spirit of (
24) allowing us to solve auxiliary systems with sparse SPD matrices only:
where
is the stiffness matrix and
is the lumped mass matrix. Then, the corresponding reduced additive BURA-AR algorithm reads as
For the right-hand side of problem (
1) we investigate two choices, namely
which is a smooth function that agrees with the boundary conditions, and
which gives rise to boundary layers and has been a subject of theoretical and experimental investigations in numerous papers already (e.g., see [
2] or [
24]).
For the first choice, the exact continuous solution can be explicitly derived, thus the error estimate (
11) can be numerically computed (see
Figure 5). For the second choice, the BURA reduction errors
and
have been studied (see
Figure 6 and
Figure 7). The simulations were performed on coarse (
) and finer (
) meshes using for the auxiliary sparse SPD systems the BoomerAMG implementation from HYPRE [
21] of the algebraic multigrid preconditioner in the PCG framework. The values
have been considered. Since the best possible FEM accuracy is
, the BURA method for
is set on degree
, which guarantees
, thus, in all numerical simulations, the BURA error will not dominate over the error of discretization. For the corresponding degrees when
and
, we choose
and
, respectively, motivated by the estimate (
6).
In
Figure 6, we compare the BURA reduction errors for
to both the theoretical estimate from Theorems 2 and 3 and the corresponding pure non-reduction error
, related to
. All results agree with Theorem 4, as
always holds true. Further, we observe that the difference
decreases when
ℓ increases. Finally, the error plots indicate that we can reduce half of the systems (
) for
, one-quarter of the systems (
) for
, and one-eighth of the systems (
) for
, without affecting the order of the overall BURA accuracy.
In
Figure 7, we compare the BURA reduction errors for
with
—the best order of the discretization error estimate. The idea of this experiment is that whenever the BURA reduction error is below
, then the total error estimate (
11) should be dominated by the error of discretization; thus, the reduction process should again not affect the overall accuracy. We observe that for
∼
, we can “safely” reduce 13 systems within both BURA-AR and BURA-MR methods when
. Analogously, we can “safely” reduce 4 systems within both BURA-AR and BURA-MR methods when
, and 3 and 2 systems, respectively, within the BURA-AR and BURA-MR methods, when
. When
∼
, we are within the “optimal” scenario depicted in
Table 1 and, as suggested in the last column of the rows corresponding to
, we witness the possibility of exactly 9 system reductions for
, exactly 2 system reductions for
, and even 1 system reduction for
.
Finally, in
Figure 5, we conduct experiments analogous to
Figure 6 but using the smooth right-hand side
and estimating the total error (
11). The level of potential systems reduction is similar to that in the previous experiment. Moreover, we observe the balance between the BURA error and the error of discretization around level
, especially for finer meshes. This is an indication that the theoretically derived order of discretization of
in certain cases (smooth right-hand sides that agree on the boundary with the boundary conditions and give rise to regular solutions) may be too pessimistic and that the upper order estimate of
, corresponding to classical, non-fractional diffusion, could be achieved, independently of the choice of
.
7. Concluding Remarks
The newly obtained results significantly improve the computational efficiency of the BURA method. The fact that practically all other methods for a numerical solution of multidimensional spectral fractional-in-space diffusion problems in arbitrary domains can be viewed as some rational approximation further enhances the advantages of the new BURA-MR and BURA-AR methods. On the other hand, the proposed reduced product/sum approach can be applied to many of the other existing methods, as well. The new theoretical results and the presented proof-of-concept numerical tests open the window for a wide range of real-life applications, including various problems beyond the scalar elliptic case.
The number of auxiliary sparse SPD systems to be solved is applied as a measure of computational efficiency in [
9], where the BURA method is introduced. Then, this approach is used in the comparative analysis of various other methods, which can be interpreted as a certain rational approximation of degree
k of
. The theory of the BURA method does not depend on the shape of the computational domain. This also applies to the methods for which the unified approach from [
7] is applicable. Based on this, in various papers, one-dimensional numerical tests are considered to be fully representative. In theory, this is correct, but in practice, it is not. The key point in the multidimensional case is that the SPD matrix
is generally unstructured and large-scale. Thus, the implementation of methods based on rational approximation of degree
k naturally involves a certain iterative solver of the related
k auxiliary sparse SPD systems. In this spirit is the presented study on the efficient implementation of the BURA method in the case of higher degrees
k.
The idea of reducing BURA was firstly discussed in the short paper [
19]. Here, we analyzed two alternative methods: the recently proposed BURA-AR together with the new BURA-MR. They are based on the additive and the multiplicative representations of BURA, respectively. Although the two expressions are completely equivalent, the theoretical estimates of the reduced sum BURA-AR and the reduced product BURA-MR, as well as the corresponding numerical results, are different.
Until now, only the additive representation was used in the software implementation of the BURA method for large-scale multidimensional problems. One of the reasons for this is perhaps its better parallelization. Here, we show rather promising results for the multiplicative variant of BURA and related BURA-MR methods. This resumes the discussion of the advantages and disadvantages of both options.
The results of this study contribute to the strengthening of theoretical knowledge and practical skills for a numerical solution of real-life fractional diffusion problems in complex multidimensional domains. At the same time, there are still certain issues related to computational stability that most likely need further understanding, interpretation and treatment.
The development of computationally efficient BURA-AR and BURA-MR methods for reaction-diffusion and time-dependent equations involving fractional-in-space diffusion operators is among the priority topics for further research.
Finally, we observe that under certain assumptions, the computational complexity of the reduced BURA can be improved to .