A Preconditioner for Galerkin–Legendre Spectral All-at-Once System from Time-Space Fractional Diffusion Equation

: As a model that possesses both the potentialities of Caputo time fractional diffusion equation (Caputo-TFDE) and symmetric two-sided space fractional diffusion equation (Riesz-SFDE), time-space fractional diffusion equation (TSFDE) is widely applied in scientiﬁc and engineering ﬁelds to model anomalous diffusion phenomena including subdiffusion and superdiffusion. Due to the fact that fractional operators act on both temporal and spatial derivative terms in TSFDE, efﬁcient solving for TSFDE is important, where the key is solving the corresponding discrete system efﬁciently. In this paper, we derive a Galerkin–Legendre spectral all-at-once system from the TSFDE, and then we develop a preconditioner to solve this system. Symmetry property of the coefﬁcient matrix in this all-at-once system is destroyed so that the deduced all-at-once system is more convenient for parallel computing than the traditional timing-step scheme, and the proposed preconditioner can efﬁciently solve the corresponding all-at-once system from TSFDE with nonsmooth solution. Moreover, some relevant theoretical analyses are provided, and several numerical results are presented to show competitiveness of the proposed method.


Introduction
Classical integer order diffusion equation (IDE) is usually applied to describe normal diffusion phenomena such as the diffusion of particles in liquids or gases [1].However, with further exploration for complicated diffusion phenomena in mechanics and engineering fields [2,3], researchers have encountered challenges in accurately utilizing IDE to simulate the diffusion motion of particles in fractal media (known as anomalous diffusion) [4].Therefore, studies about time-space fractional diffusion equation (TSFDE) of the following form [5] are particularly popular ∂|x| α + g(x, t), (x, t) ∈ (a, b) × (0, T], u(x, 0) = ψ(x), x ∈ (a, b), u(a, t) = u(b, t) = 0, t ∈ (0, T], (1) where κ > 0 is the generalized diffusion coefficient, u(x, t) exhibits the concentration of interest at location x of time t, g(x, t) represents the source item and is a continuous function, ψ where Γ(•) is the Gamma function, and β ∈ (0, 1) and α ∈ (1, 2) are fractional orders of the time and space derivatives, respectively.
For β = 1, α ∈ (1, 2), Equation ( 1) is equivalent to a symmetric case of the two-sided fractional diffusion equation, which is called Riesz-space fractional diffusion equation (Riesz-SFDE) [6].Riesz fractional operator is an effective tool to explore nonlocal effects for engineering and physics applications [7], hence Riesz-SFDE is usually applied to model complicated phenomenon with path dependency and nonlocality [8].Equation ( 1) is reduced to Caputo-time fractional diffusion equation (Caputo-TFDE) if β ∈ (0, 1), α = 2.Note that Caputo derivative is defined pointwise for continuously differentiable functions [9], so that Caputo fractional differential operator can well overcome the supersingularity [10], and then Caputo-TFDE is practical for modeling long term memory phenomenon [11].As a model that possesses both potentialities of Caputo-TFDE and symmetric two-sided space fractional diffusion equation (Riesz-SFDE) [12], Equation ( 1) is usually used to describe subdiffusion phenomenon if 0 < 2β/α < 1, superdiffusion phenomenon if 2β/α > 1, and normal diffusion if α = 2β.In fact, for simulating the anomalous diffusion of particles, coefficients of fractional derivative in Equation (1) are usually used to describe the competitive relationship between long waiting time and large jumps of particles [13].Moreover, Equation ( 1) is usually related to the scaling limits of the continuous time random walks and extensively applied in numerous engineering and scientific fields [14].For instance, in ocean dynamics, Equation (1) can be used to describe the anomalous transport problem that the sticky trajectories within the boundary layers of the islands, where the boundary layers of the islands is regarded as a fractal support [15], and in financial economics, by modeling prices as random variables and varying waiting times between two consecutive transactions into a stochastic fashion.The authors in Ref. [16] utilize Equation (1) to model the behavior of returns and derive a rather general phenomenological theory in financial markets.In addition, as a foundation for more complicated diffusion modeling, Equation (1) can also be developed into variable-order model [17], distributed-order model [18], random-order model [19], nonlinear model [20], and other forms [21] for better application.
Due to the nonlocality of the fractional derivatives, analytical solutions for Equation (1) are usually difficult to obtain explicitly.Accordingly, numerical schemes for their approximate solutions have received much attention [22].As a numerical scheme that typically converges with high order [23], in recent years, spectral methods are well developed for discretizing those FDEs.For example, after applying the generalized Jacobi functions and Fourier-like basis functions on time and space of TFDE respectively, Sheng and Shen [24] obtained a space-time Petrov-Galerkin spectral method for TFDE, this method can reach spectral accuracy in both space and time.To further improve the efficiency in discretization, Li and Xu [25] proposed a new spectral method for TFDE.This method can solve the problem with long-time solution since the "global time dependence" makes the storage requirement considerably relaxed.However, Li and Xu only applied this method to a 1D problem, for the discretization of high-order TFDE, by utilizing Legendre polynomials and their variants to approximate both temporal and spatial variables.Fakhar-Izadi and Shabgard [26] derived a new time-space spectral Galerkin method.This method can be stable for a kind of fourth-order TFDE.In fact, for the inverse problem of TFDE, spectral methods can also reach an exponential convergence.Based on the work of Lin et al. [27], who applied the spectral scheme to approximate TFDE, Ye et al. [28] proposed a spectral optimization method for its inverse problem, under the assumption that the optimal initial condition is smooth.The proposed method can indeed reach the exponential convergence rate.In addition, to further exploit the stability and high convergence of spectral meth-ods in solving different variants of Equation (1), Zaky et al. [29,30] developed a class of semi-implicit Galerkin-Legendre spectral schemes.These methods are stable for nonlinear time-space fractional Schr ödinger equation and can reach spectral accuracy even for those TSFDEs with nonsmooth solutions.Discussions about spectral methods for solving other variants of Equation ( 1) are exploited in Refs.[31][32][33][34][35].The methods in these works are stable and can usually reach spectral accuracy.Moreover, in order to further save the computational cost of those methods, numerous fast algorithms are also designed for solving TFDE [36], TSFDE [37], and their variants [38][39][40].In these methods, by utilizing the sum-ofexponential technique to approximate the kernel function in the tempered Caputo fractional derivative or using the Laplace transform for time integration of a semi-discretized system, these methods are stable and usually have a low computational cost.
One may observe that with the timing-step schemes mentioned above, one should solve the discrete system on each time level sequentially since the right-hand-side vector usually depends on the solution from a previous time level, which may lead to less convenience for parallel computing in the solving of Equation (1).For this reason, people turn to studying the all-at-once technique.At first, the all-at-once technique was proposed for solving the evolutionary equation with integer order.In these works, McDonald [41], Goddard [42], Lin [43], and Wu [44] stacked all the time steps in a vector to obtain an all-at-once system in the discretization process and then solved it.The right-hand-side vectors in these corresponding all-at-once systems only rely on the solutions at the initial time, and this phenomenon is truly convenient for parallel computing.After that, with the exploration of efficient solving for Equation ( 1), the all-at-once technique combined with finite differential method (FDM) or finite element method (FEM) spatial discretization and uniform temporal discretization (such as L1 [45], L1-2 [46], and L2-1 σ [47]) has been well utilized to solve those fractional diffusion equation, and the solving of corresponding all-at-once systems are also well studied.In general, coefficient matrices in these all-at-once systems from the above scheme no longer maintain the symmetry property but turn into Toeplitz block lower triangular matrices with Toeplitz subblocks.In order to reduce the computational complexity in the solving of those kinds of all-at-once systems, Huang,et al. [48] and Jia, et al. [49] proposed two fast methods.These two methods are mainly based on the divide and conquer technique, and they are well applied to Equation (1) and TFDE with space-time-dependent variable order, respectively.
Note that the coefficient matrix in the above all-at-once system is usually ill-conditioned, so utilization of a preconditioners is necessary [50].Block preconditioner is a straightforward method to improve the illness of the coefficient matrix [51].Based on this idea, Chen et al. [52] proposed four block preconditioners for solving the all-at-once system originating from Equation (1).Among these four block preconditioners, the banded block triangular preconditioner usually outperforms the other three if β is close to one.Besides, Zhao [53], Gu [54,55], and Zhu [56] conducted extensive research on block preconditioning strategies for solving the all-at-once system especially those with Toeplitz coefficient matrix.These block preconditioners proposed by them can effectively accelerate the matrix-vector product and can usually reach the linear computing complexity.Furthermore, for the Volterra subdiffusion problem discrete by FDM in space and with combination of graded and uniform time-steps temporal discretization, Zhao et al. [57] combined a block lower tridiagonal preconditioner and a semi-circulant preconditioner to solve the corresponding all-at-once system.This method can also be extended to solve the system arising from semilinear subdiffusion problems.
Based on the above observations, we note that solving Equation (1) using spectral methods typically yields high-order discretization accuracy.However, traditional timingstep schemes are not convenient for parallel computing, and there is limited research on combining spectral methods with the graded L1 scheme and the all-at-once technique to solve Equation (1).Besides, the coefficient matrix of the all-at-once system obtained by spectral-method spatial discretization and graded L1 temporal discretization in Equation (1) no longer maintains the Toeplitz structure, and the preconditioners mentioned above are not appropriate for solving the corresponding Galerkin-Legendre spectral all-at-once system.Therefore, in order to improve the efficiency of solving Equation (1), in this paper, we mainly focus on deriving and solving the corresponding Galerkin-Legendre spectral all-at-once system that arises from Equation (1).The main contribution of the present paper has two aspects: (i).This is the first study to combine the all-at-once technique with the Galerkin-Legendre spectral method and graded L1 method to solve time-space fractional diffusion Equation (1), which is more convenient for parallel computing than the traditional timing-step spectral scheme.(ii).A new preconditioner is proposed to improve the efficiency of solving the corresponding Galerkin-Legendre spectral all-at-once system arising from Equation (1), and relevant theoretical analyses and numerical experiments are also provided to show the effectiveness of the proposed method.
The remaining part of this paper is organized as follows.In the following section, the graded L1-Galerkin-Legendre spectral scheme (GL1-GLS) is derived to approximate Equation ( 1) and then a Galerkin-Legendre spectral all-at-once system is obtained.In Section 3, based on the sparse pattern of the coefficient matrix, a block L-diagonal preconditioner is proposed to accelerate solving the all-at-once system and some relevant theoretical analyses are also provided.Numerical results are given in Section 4 to show the efficiency of the general block L-diagonal preconditioner, and selection of parameter in the proposed preconditioner is also discussed experimentally.Finally, concluding remarks are presented in Section 5.

GL1-GLS Discretization and the All-at-Once System
In this section, the graded L1-Galerkin-Legendre spectral (GL1-GLS) scheme [29] is applied to approximate Equation (1).Then, with several auxiliary symbols, a Galerkin-Legendre spectral all-at-once system is obtained.

The Time-Stepping Discretization
Notice that the solutions of Equation ( 1) may be non-smooth and singular near t = 0 even though the initial conditions and source terms are smooth, and the graded L1 scheme can handle this problem well.Thus, in the following, the graded L1 scheme is utilized for the temporal discretization of Equation (1).

Lemma 1 ([29]). For a positive integer M and i
i=0 be the graded mesh with grading parameter γ ≥ 1, and denote the temporal step-size τ i of time t i by where Since the spectral methods are usually with high-order discretization accuracy, in the following, the Galerkin-Legendre spectral scheme in [29] is used for the spatial discretization of Equation (1).

Lemma 2 ([29]). Given a function space W
, the base function ϕ n of spectral methods can be given by Then, we take Equations ( 2) and (3) into Equation ( 1), and get the following GL1-GLS scheme for Equation (1).
where P N is an appropriate projection operator [29].Denote Then the GL1-GLS scheme can be rewritten into the matrix-vector form as where c α = − 1 2 sec( πα 2 ).According to Ref. [29], under certain assumptions about the regularity of the unique solution for Equation (1), the numerical solution obtained from above GL1-GLS scheme can achieve an optimal time convergence order for the optimal grading parameter γ opt = max{1, (2 − β)/σ}, where σ = [0, 1] ∪ [1,2] is used to reflect the regularity of the solution.Furthermore, more detailed convergence analyses of above GL1-GLS scheme can be found in Ref. [29].
From Equation (4), we find that based on the above GL1-GLS scheme, one needs to solve the discrete system on each time level sequentially since the right-hand-side vector in Equation ( 4) usually depends on the solution on a previous time level and this may lead to less convenience for parallel computing in the solving of Equation (1).For this reason, in the next subsection, the all-at-once technique is considered, and then Equation ( 4) can be transformed into the Galerkin-Legendre spectral all-at-once system, which is more convenient for parallel computing.

The Galerkin-Legendre Spectral All-at-Once System
To simplify the derivation of the Galerkin-Legendre spectral all-at-once system, several auxiliary symbols are introduced: O and I are zero and identity matrices of suitable orders.Furthermore, denote Then, based on Equation (4), we can get the following Galerkin-Legendre spectral all-atonce system Au = b, where Note that the right-hand-side vector in Equation ( 5) only depends on the solution in the initial time level, and this makes Equation ( 5) truly convenient for parallel computing.In addition, from Equation (6), we find that although the block lower triangular coefficient matrix A in Equation ( 5) no longer maintains the symmetry property directly like A n 0 in Equation ( 4), each subblock of A is symmetric.In addition, by "⊗", we denote the Kronecker product.Then the coefficient matrix in Equation ( 5) can be written as From Equation ( 7) and Lemma 1, we find that values of elements in matrix B are mainly affected by the grading parameter γ of temporal graded mesh and the fractional order β of time derivative.Since A = B ⊗ M − I M ⊗ (κc α (S + S T )), γ and β may also affect values of elements in matrix A. Besides, values of elements in matrix A may also be affected by fractional order α of space derivative.Details about the elements of matrix B and A with variant parameters are shown in Figures 1-4, respectively.To solve the linear system, Krylov subspace methods such as BICGSTAB and GMRES are usually efficient if the condition number of the coefficient matrix is small.Notice that the coefficient matrix A in Equation ( 5) is usually ill-conditioned.Therefore, an effective preconditioner, which can reduce the condition number or make the spectral distribution of the coefficient matrix cluster around 1 should be employed [50].In the next section, we will mainly focus on developing such an efficient preconditioner.

The Block L-Diagonal Preconditioner
In this section, we propose a block L-diagonal preconditioner M L to solve Equation ( 5) and some corresponding theoretical analyses will also be provided.As is shown in Figures 1 and 2, the diagonal entries of the coefficient matrix B are decreased.Since A = B ⊗ M − I M ⊗ (κc α (S + S T )), as can be seen from Figures 3 and 4, the main information of A is gathered in the first L-nonzero diagonal blocks, where L is usually larger than 3. Based on the above observation, we can naturally propose a block L-diagonal preconditioner M L = B L ⊗ M − I M ⊗ (κc α (S + S T )) to solve Equation (5), where

Theoretical Analyses of the Block L-Diagonal Preconditioner
For convenience of discussing the nonsingularity of the L-diagonal preconditioner M L , we present the following two lemmas.

Lemma 4 ([29]
).The components of the stiffness matrix S are ), where } is the set of Jacobi-Gauss points and weights with respect to the weight Lemma 3 clarifies that M is symmetric positive definite and Lemma 4 shows detailed components of matrix S. Notice that ãn 0 > 0, c α > 0 and κ is a positive constant.Then, from Lemmas 3 and 4, we can see that all main diagonal blocks A n 0 = ãn 0 M − κc α (S + S T ), 1 ≤ n ≤ M of the coefficient matrix A in Equation (5) is symmetric and invertible.Thus the coefficient matrix A and the preconditioner M L are nonsingular.
In order to analyze the approximation effect of block L-diagonal preconditioner M L on A, we present the estimation for the approximate error between M L and A in the following theorem.
Proof.Notice that Then we have From Lemma 3, we know that , where τ j = t j − t j−1 and ς n j = (t n − t j ) 1−β .Thus we can get This completes the proof.
Theorem 1 gives a clear indication that M L will approximate to A better and better as L gets larger and larger, and M L will be equal to A if L is equal to M. In the following Figures 5 and 6, we will take M = 2 4 and depict the approximation error of M L to A numerically.
Figures 5 and 6 verify that the approximation error ||E || ∞ in Theorem 1 indeed gets smaller and smaller as L gets larger and larger.Furthermore, Figures 5 and 6 also reveal the fact that even with variant α, β and N, the error between M L and A can always stay smaller than 1 as long as L is larger than 4.  Similar to the contents of the bi-diagonal preconditioner in Ref. [54], in order to analyze the preconditioned effect of block L-diagonal preconditioner, we consider the spectral property of the preconditioned coefficient matrix M −1 L A as follows.
Theorem 2. All eigenvalues of M −1 L A are approximate to 1.
Proof.Let * represent the nonzero block matrix.Then we obtain L A are all approximate to 1.
From Theorem 2 and the following Figures 7 and 8, we can observe that the block L-diagonal preconditioner M L can indeed make the eigenvalues of the coefficient matrix A cluster around 1. Besides, Figures 7 and 8 further exhibit numerically that the spectral distribution of M −1 L A clusters closer to 1 as L gets larger and larger.

Numerical Examples
In this section, we present two examples to verify the effectiveness of the block Ldiagonal preconditioner for solving the Galerkin-Legendre spectral all-at-once system originating from Equation (1) with nonsmooth solution in temporal space [29].To solve the preconditioned system by the PGMRES method (right-preconditioned GMRES), we use the built-in function "\" in MATLAB to calculate M −1 L and the iterative process of PGMRES may terminate if the relative residual error satisfies res = b − Au / b < 10 −8 or the iteration number is more than 1000.The initial guess of PGMRES is set as zero vector and for simplicity, "TS" represents that we solve the linear system in timing-step scheme by using MINRES method, "PGMRES (L = 0)" means that we use GMRES without preconditioner and "PGMRES (L = 1, 2, 5, 8, 10)" exhibit that we utilize PGMRES with preconditioner M L (L is equal to 1, 2, 5, 8, 10) to solve the all-at-once system, respectively.Moreover, "cond" stands for the condition number of the coefficient matrix, "Its" and "Time" express the iterative time and total cost in PGMRES, and " †" represents that PGMRES can not converge within 1000 iterations.The unit of time is seconds.
All experiments are conducted in double precision arithmetic on Windows 10 and MATLAB R2017a with Intel (R) Core (TM) i7-8750H CPU@2.2GHz, using a PC with 8GB of memory.
Example 1.Consider the Galerkin-Legendre spectral all-at-once system arising from the following fractional equation In order to investigate the effect of M L on PGMRES, we depict the number of iterations such that res < 10 −8 with variant L for different α, β, and N in the following Figure 9.   Figure 9 indicates that even for different cases with various α, β, and N, PGMRES can always converge faster than GMRES.In addition, PGMRES converges faster and faster as L gets larger and larger, while the improvement of convergence is no longer significant if L is larger than 8.More detailed test results are shown in Table 1.
As can be seen from Table 1, for the improvement of illness in the coefficient matrix, performance of M L gets better and better as L increases.Additionally, Table 1 shows that GMRES cannot always converge within the specified 1000 iterations, while PGMRES can always converge, and this indicates that the preconditioner M L is effective for solving the all-at-once system (5).From Table 1, we can observe that PGMRES (L = 1, 2) usually converges within 60 iterations while PGMRES (L = 5) can converge within 8 iterative steps, PGMRES (L = 8) can converge within 6 iterations, and PGMRES (L = 10) can converge within 5 iterations.This informs us that reduction of the number of iterations in PGMRES is not obvious as L turns larger than 8 or smaller than 5. Accordingly, cost time of PGMRES (L = 5) and PGMRES (L = 8) are usually less than that of PGMRES with other L and this phenomenon is basically unaffected by the variance of α, β, and N. On the other hand, Table 1 also manifests that the total cost of TS is always more than that of PGMRES (L = 5) and PGMRES (L = 8), which reveals the fact that to solve TSFDE in this example, using PGMRES with preconditioner M L to solve the all-at-once system is more efficient than using MINRES to solve the traditional timing-step system, and L = 5-8 is appropriate.
Next, in order to further investigate the effect of parameter L on the application of M L for different cases with various M, we take N = 2 5 and show the test results in the following Table 2. Table 2 states that for different cases with various α, β, and M, M L is always effective.Furthermore, M L is almost equally effective for the cases with different α, β while for the case with large M, the effectiveness of M L is usually more significant.As can be seen from Table 2, for the effectiveness of M L on PGMRES (L), the selection of parameter L is not significantly affected by the temporal discrete size M, and for different cases with various M, preconditioner M L turns out to be more efficient as the parameter L gets around 5 and 8. Therefore, for the sake of an overall performance, L does not need to be taken too large and L = 5-8 is usually appropriate.In addition, in order to illustrate that the numerical solution obtained from the Galerkin-Legendre spectral all-at-once system is reasonable for Equation (1), we depict the numerical solutions for Equation (8) in Figure 10.We note that analytical solution in Equation ( 8) is difficult to obtain, hence we here consider the numerical solution under ra efined grid as the exact solution, and we also compare the numerical solutions obtained by different methods for Equation (8).Besides, in Figure 10, TS represents solving the timing-step scheme by MINRES and AAO(L) stands for solving the all-at-once scheme by PGMRES with preconditioner M L .Colors from light to dark represent the graded temporal mesh divided from 0 to 1.
As can be seen from Figure 10, the numerical solution obtained by using AAO is similar to that obtained by using TS, and they are both approximated to the numerical solution under the refined grid.Therefore, the numerical solution obtained from the Galerkin-Legendre spectral all-at-once system is reasonable for Equation (8).Example 2. Consider the Galerkin-Legendre spectral all-at-once system arising from the following fractional equation Firstly, we take M = 2 4 , N = 2 5 , 2 6 , 2 7 with variant L to explore the effect of preconditioner M L on the number of iterations for PGMRES in the following Figure 11.   Figure 11 claims that PGMRES always converges faster than GMRES, and PGMRES converges faster and faster as L gets larger and larger.Nevertheless, the number of iterations in PGMRES barely decreases as L grows larger than 8, and this phenomenon is barely affected by the variance of α, β, and N.More detailed test results about the condition number of the coefficient matrix and total cost in the solving with different solvers are shown in Table 3. Table 3 recounts that the condition number of the coefficient matrix is reduced and the number of iterations in PGMRES (L) is always less than that in GMRES.This explicates that the preconditioner M L is always effective for solving the all-at-once system (5).Besides, the number of iterations in PGMRES (L) gets smaller and smaller as L gets larger and larger while the total cost in PGMRES (L) does not always decrease as L increases.This phenomenon propagates the fact that an appropriate value of L is important for the application of M L in PGMRES (L).Based on the test results in Table 3, we see that the total cost of PGMRES (L = 5) and PGMRES (L = 8) are far less than that of TS and other PGMRES, which suggests that L = 5-8 is appropriate.
In order to further explore the effect of L on the application of preconditioner M L for different cases with various M, we take N = 2 5 and present the detailed test results in Table 4.  9) with α = 1.8, β = 0.2, M = 2 4 , where N is taken to be 2 7 and 2 5 in (a-c), respectively.

Conclusions
In this paper, we mainly studied an efficient way to solve Equation (1).We combined the all-at-once technique with the Galerkin-Legendre spectral method and the graded L1 method to solve the time-space fractional diffusion in Equation ( 1), and found that this scheme is more convenient for parallel computing than the traditional timing-step spectral scheme.With the observation that the coefficient matrix of the corresponding Galerkin-Legendre spectral all-at-once system is usually a block lower triangular and that the main information of the coefficient matrix is gathered in the first L-nonzero diagonal blocks, we proposed a block L-diagonal preconditioner M L to accelerate solving the Galerkin-Legendre spectral all-at-once system.Based on the analyses of approximation error and spectral distribution of the coefficient matrix, the proposed preconditioner M L can improve the illness of the coefficient matrix and M L usually performs better and better as L increases.Additionally, the numerical experiments further verify the effectiveness of M L and present the appropriate value of parameter L heuristically.
We note that the combination of the all-at-once technique with the Galerkin-Legendre spectral method and graded L1 method is convenient for parallel computing in solving Equation (1), and the proposed block L-diagonal preconditioner M L can accelerate solving the corresponding Galerkin-Legendre spectral all-at-once system.However, similar to many existing works where the parameters can only be selected experimentally, in this paper, we also choose the parameter L heuristically.For future work, we will develop some adaptive techniques to set L adaptively with moderate computational cost.Furthermore, it is important to note that the time-space fractional diffusion Equation (1) considered in this paper can only be utilized to model the linear anomalous diffusion.To simulate more completed anomalous diffusion such as heterogeneous porous, non-Fickian diffusion in porous media and nonlinear diffusion in image denoising media, Equation (1) needs to be further developed into a variable-order model, distributed-order model, and nonlinear model for better application.We expect that the method proposed in this paper could be developed to handle more comprehensive models, and we plan to further explore these applications in future work.

3. 1 .
Derivation of the General Block L-Diagonal Preconditioner M L Firstly, we focus on observing the sparsity pattern of matrix B and the coefficient matrix A = B ⊗ M − I M ⊗ (κc α (S + S T )).
.Number of iterations for N=2

Figure 9 .
Figure 9.The number of iterations for PGMRES in Example 1 with M = 2 4 , and L = 0 represents GMRES without the preconditioner.
.Number of iterations for N=2

Figure 11 .
Figure 11.The number of iterations for PGMRES in Example 2 with M = 2 4 , and L = 0 represents GMRES without the preconditioner.
Approximation error of M L to A with variant α, β and N in Example 1.
Figure 6.Approximation error of M L to A with variant α, β and N in Example 2.