A Robust Approximation of the Schur Complement Preconditioner for an Efficient Numerical Solution of the Elliptic Optimal Control Problems

Muzhinji, Kizito; Shateyi, Stanford

doi:10.3390/computation8030068

Open AccessArticle

A Robust Approximation of the Schur Complement Preconditioner for an Efficient Numerical Solution of the Elliptic Optimal Control Problems

by

Kizito Muzhinji

^†

and

Stanford Shateyi

^*,†

Department of Mathematics and Applied Mathematics, School of Mathematical and Natural Sciences, University of Venda, P Bag X5050, Thohoyandou 0950, South Africa

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Computation 2020, 8(3), 68; https://doi.org/10.3390/computation8030068

Submission received: 4 June 2020 / Revised: 6 July 2020 / Accepted: 9 July 2020 / Published: 27 July 2020

Download

Browse Figures

Versions Notes

Abstract

:

In this paper, we consider the numerical solution of the optimal control problems of the elliptic partial differential equation. Numerically tackling these problems using the finite element method produces a large block coupled algebraic system of equations of saddle point form. These systems are of large dimension, block, sparse, indefinite and ill conditioned. The solution of such systems is a major computational task and poses a greater challenge for iterative techniques. Thus they require specialised methods which involve some preconditioning strategies. The preconditioned solvers must have nice convergence properties independent of the changes in discretisation and problem parameters. Most well known preconditioned solvers converge independently of mesh size but not for the decreasing regularisation parameter. This work proposes and extends the work for the formulation of preconditioners which results in the optimal performances of the iterative solvers independent of both the decreasing mesh size and the regulation parameter. In this paper we solve the indefinite system using the preconditioned minimum residual method. The main task in this work was to analyse the 3 × 3 block diagonal preconditioner that is based on the approximation of the Schur complement form obtained from the matrix system. The eigenvalue distribution of both the proposed Schur complement approximate and the preconditioned system will be investigated since the clustering of eigenvalues points to the effectiveness of the preconditioner in accelerating an iterative solver. This is done in order to create fast, efficient solvers for such problems. Numerical experiments demonstrate the effectiveness and performance of the proposed approximation compared to the other approximations and demonstrate that it can be used in practice. The numerical experiments confirm the effectiveness of the proposed preconditioner. The solver used is robust and optimal with respect to the changes in both mesh size and the regularisation parameter.

Keywords:

elliptic optimal control problems; partial differential equations (PDEs); saddle point problem; block preconditioners; Schur complement; preconditioned minimum residual method (PMINRES)

MSC:

49J20; 65M22; 65M55; 65M60

1. Introduction

The partial differential equation constrained optimisation problems have over the years become an active research area that has arisen in many application areas that include finance, medicine, modern science and engineering. We refer to [1,2] on their theoretical and also to [3,4,5] on their numerical developments. These problems are numerically and analytically challenging to solve. The most clear challenge is that the resulting linear algebraic system is very large such that the only viable way is to get the solution iteratively using specialised methods. The formulation of the linear optimisation problem involves the objective function to minimise subject to the constraints defined by the underlying modelling partial differential equation in a bounded domain

Ω \subset R^{2}

with boundary

\partial Ω

. Consider the following elliptic distributed PDE-optimal control problem

\min_{(u, y)} J (y, u) : = \frac{1}{2} ‖ y - y_{d} ‖_{L^{2} (Ω)}^{2} + \frac{δ}{2} {‖ u ‖}_{L^{2} (Ω)}^{2}

(1)

subject to the constraints

\begin{matrix} - Δ y & = & f + u in Ω \\ y & = & g on \partial Ω \end{matrix}

(2)

with y the state variable,

y_{d}

the desired state known over the domain

\bar{Ω}

and u the control variable on the right hand side. The parameter

δ

is called the regularisation parameter which measures the cost of the control and is supplied and positive. The size of the regularisation parameter is important in the performances of the iterative solvers. The optimal value of the regularisation parameter

δ

is

10^{- 2}

; see [6,7]. The performance of iterative solvers with the decreasing parameter is the central theme of this study. The solution of the state variable y must satisfy the PDE over

Ω

and must be as close as possible to the desired state

y_{d}

so that the objective function can be minimised.

The optimal control problem Equations (1) and (2) has a unique solution

(y, u)

characterised by the following optimality system called the Karush–Kuhn–Tucker (KKT) system [8,9]. The first order optimality system of the PDE-optimal control problems consists of a state equation, an adjoint equation and the control equation which is a saddle point problem as given below

\begin{matrix} - Δ p & = & y - y_{d}, in Ω p = 0 on \partial Ω adjoint equation \end{matrix}

(3)

\begin{matrix} - Δ y & = & f + u, in Ω y = g on \partial Ω state equation \end{matrix}

(4)

\begin{matrix} - δ u & = & p in Ω control equation \end{matrix}

(5)

The optimality system is achieved through the Lagrange multiplier method which partitions the model problem into three equations, namely, the state y, control u and the adjoint, p which form the saddle point problem. For the numerical solution of the elliptic optimal control problem we apply the finite element method to the system ((3)–(5)) to get the linear saddle point problem. The finite element method is the most popular technique for the numerical solution of the PDE-constrained optimisation problems; see [5,6,7] and many more. The finite element method results in the coupled linear algebraic system which has to be solved by the appropriate solvers. The resulting discrete KKT system is

(\begin{matrix} M & O & K \\ O & δ M & - M \\ K & - M & O \end{matrix}) (\begin{matrix} y \\ u \\ p \end{matrix}) = (\begin{matrix} b \\ 0 \\ d \end{matrix})

(6)

where

K \in R^{n \times n}

is a stiffness matrix, and that and the mass matrix

M \in R^{n \times n}

are both symmetric and positive definite. The vector

b = M y_{d} \in R^{n}

is the finite element projection of the desired state

y_{d}

, and

d \in R^{n}

contains the terms arising from the boundaries of the finite element of the state y.

The finite element discretisation produces a large scale linear algebraic system of Equation (6) which is indefinite and has poor spectral properties. The numerical solution of such problems is a computational uphill task and has a lot of challenges such that constructing robust and efficient solvers has preoccupied the computational scientific community for decades. The linear algebraic indefinite system is parameter dependent such that the condition number grows when the mesh size and the regularisation parameter approach zero. The well known Krylov and multigrid iterative solvers perform poorly for such systems. In recent years, much effort has been devoted in developing specialised iterative solvers using suitable preconditioners [3,6,7,10,11,12] and appropriate smoothers for the multigrid solvers [8,9,13,14,15,16,17]. For the development of the efficient multigrid solvers for the distributed optimal control problems, we refer to ([8,9,15,16,18,19]. Several preconditioned strategies have been developed and evolved that accelerate the iterative solvers. These are the block diagonal preconditioner [6,20,21,22] for the minimum residual (MINRES) solver [23,24], and for the generalised minimal residual (GMRES) method, block triangular preconditioners with nonstandard inner products for the conjugate gradient method [7,10,21,22,25] and the constraint preconditioner [7,11,21]. The preconditioners were also developed for the two by two reduced block linear system and three by three block system to solve the optimal control problems of the Stokes [10,26,27], parabolic [28,29] and convection diffusion [30] equations. In most cases the application of preconditioners shows robustness and efficiency with the decreasing mesh size but not with the decreasing regularisation parameter. In this study, we focus on the improvement of the block diagonal preconditioner to enhance the performance of the MINRES solver. The MINRES was developed by [31] and widely used for PDE-constrained optimisation problems. The aim of this paper is to present preconditioners based on the Schur complement approach that seek to achieve an efficient solution of the linear system arising from the discretisation of the PDE- optimal control problems. We give particular attention to preconditioning techniques that achieve robust performances in the MINRES solver with respect to both decreasing mesh size and the regularisation parameter. As the mesh size approaches zero, the dimensions of the problem increase. A mesh-independent performance is always achieved for such systems, but the small values of the regularisation parameter pose a great challenge to the iterative solvers. As the regularisation parameter decreases, the performance of the iterative solver deteriorates, but it improves for large values.

The optimal performance of the preconditioner depends on applying an appropriate approximation of the (1,1) and (3,3)-block entries of the preconditioner. For the preconditioners based on the Schur complement, the (3,3) Schur complement remains one of the challenges to finding a suitable approximation to achieve robustness in terms of parameter changes. The Schur complement approximation developed in [6] and widely used in literature displays optimal performance for large values of the regularisation parameter, but the performance of it deteriorates as the parameter approaches zero. In [21,23] another Schur complement approximation was developed for PDE-constrained optimisation problems which gives convergence of the appropriate iterative method in a number of steps which are independent of the value of the regularisation parameter. The preconditioned iterative solver displays optimal performance for small values of the regularisation parameter. The main contribution of this work is to present a different form of the Schur compliment approximation to achieve an optimal performance of the MINRES solver in terms of iterative counts that are independent of the decreasing mesh size and the regularisation parameter. The clustering of the eigenvalues points to the good convergence properties of the preconditioned solver. We remove the dependency on the regularisation parameters within the preconditioned matrix as in [21]. We derive and investigate the clustering of the eigenvalues for the Schur compliment approximate and the preconditioned system. The numerical experiments’ outcomes of the proposed approximation are compared with the existing approximations of the Schur complement preconditioner to show its efficiency.

This paper is organised as follows. In Section 2 we preview the block diagonal preconditioner and investigate the eigenvalue distribution. In Section 3, we discuss our proposed approximation for the Schur complement and investigate the eigenvalue distribution of the preconditioned system. In Section 4 we present the numerical experiments to demonstrate how well the proposed approximation works and the conclusion is given.

2. Analysis of the Block Diagonal Preconditioner

We consider the preconditioning strategies for solving the saddle point problem, Equation (6). The optimal performance of the preconditioned Krylov subspace methods such as MINRES, GMRES and conjugate gradient rely more on the distribution of the eigenvalues of the coefficient matrix. It is well known in numerical analysis that the spectral radius must be bounded by one to guarantee convergence of the iterative solvers. The coefficient matrix of the system (6) is known to be large, sparse and indefinite with poor spectral properties. This section aims to outline the block diagonal preconditioner that involves the mass matrix and a Schur complement form. We consider a block diagonal preconditioner for the

3 \times 3

block coefficient matrix of the Equation (6). For the

2 \times 2

block preconditioners we refer to [7,22] and the references therein. The block diagonal preconditioner is given by

Q : = (\begin{matrix} M & O & O \\ O & δ M & O \\ O & O & S \end{matrix})

(7)

where

S : = K M^{- 1} K + \frac{1}{δ} M

is the Schur complement form. A good preconditioner must fulfil the assumptions that it is easy to invert, and the linear system

Q x = b

associated with it is easy to solve for any vector

b

. The approximation of the (3,3) block entry poses more challenges and difficulties. One possible widely used approach as in [6,21,24] is to get the approximation of S by discarding the term

\frac{1}{δ} M

to obtain

Q_{S_{1}} = K M^{- 1} K

by dropping a term with

δ^{- 1} M

with an argument such that for all very small values of

δ

the term

K M^{- 1} K

will be dominating. The application of the preconditioner with the approximation

Q_{S_{1}} = K M^{- 1} K

has the shortfall that the iterative solver converges independently of the decreasing mesh size only but not in the decreasing regularisation parameter

δ

. To demonstrate this we need to give the eigenvalue bounds for

Q_{S_{1}}^{- 1} S

as derived in [6,21]. We now use the following results, which are Theorems (3.4) and (3.5) in [6] and used by [21,24].

Theorem 1

([6]). For the problem Equation (6) in

Ω \in R^{2}

with the degree of approximation

Q_{m}

or

P_{m}

with

m \geq 1

the following bounds hold:

α_{1} h^{2} \leq \frac{v^{T} M v}{v^{T} v} \leq α_{2} h^{2}

(8)

where

α_{1}

and

α_{2}

are real constants independent of h but dependent on

m

.

Theorem 2

([6]). For the problem Equation (6) in

Ω \in R^{2}

with the degree of approximation

Q_{m}

or

P_{m}

with

m \geq 1

the following bounds hold:

θ_{1} h^{2} \leq \frac{v^{T} K v}{v^{T} v} \leq θ_{2}

(9)

where

θ_{1}

and

θ_{2}

are real constants independent of h but dependent on

m

.

Theorem 3.

The eigenvalues of

Q_{S_{1}}^{- 1} S

are bounded as

λ (Q_{S_{1}}^{- 1} S) \in [\frac{1}{δ} \tilde{α} h^{4} + 1, \frac{1}{δ} \tilde{θ} h^{4} + 1]

where

\tilde{α}

and

\tilde{θ}

are independent of h and but dependent on δ.

Proof of Theorem 3

To find the eigenvalue distribution of

Q_{S_{1}}^{- 1} S

, let

ω

be the eigenvalue, and then

\begin{matrix} Q_{S_{1}}^{- 1} S y & = & ω y \\ {(K M^{- 1} K)}^{- 1} (K M^{- 1} K + \frac{1}{δ} M) y & = & ω y \\ (I δ + K^{- 1} M K^{- 1} M) y & = & ω δ y \\ {(K^{- 1} M)}^{2} y & = & δ (ω - 1) y \end{matrix}

Let

υ

be an eigenvalue of

K^{- 1} M

then the eigenvalue of

{(K^{- 1} M)}^{2}

is

υ^{2}

. This means that

υ^{2} = δ (ω - 1)

\begin{matrix} {(K^{- 1} M)}^{2} y & = & υ^{2} y \\ (K^{- 1} M) y & = & υ y \\ M y & = & υ K y \end{matrix}

\Rightarrow υ = \frac{y^{T} M y}{y^{T} K y} = \frac{y^{T} M y}{y^{T} y} \cdot \frac{y^{T} y}{y^{T} K y}

. By using Theorems 1 and 2, there exist positive constants

\tilde{α}

and

\tilde{θ}

independent of mesh size; we have

\tilde{α} h^{2} \leq \frac{y^{T} M y}{y^{T} K y} \leq \tilde{θ}

This implies that we have

\begin{matrix} \tilde{α} h^{4} \leq & δ (ω - 1) & \leq \tilde{θ} \\ \frac{1}{δ} \tilde{α} h^{4} \leq & ω - 1 & \leq \frac{1}{δ} \tilde{θ} \\ \frac{1}{δ} \tilde{α} h^{4} + 1 \leq & ω & \leq \frac{1}{δ} \tilde{θ} + 1 \end{matrix}

Thus, it follows that there exist constants

\tilde{α}

and

\tilde{θ}

independent of the mesh size such that

\frac{1}{δ} \tilde{α} h^{4} + 1 \leq \frac{y^{T} S y}{y^{T} Q_{S_{1}} y} \leq \frac{1}{δ} \tilde{θ} + 1

□

It is clear that

Q_{S_{1}}

has no dependence on

δ

but the eigenvalue lower and upper bounds are dependent on

δ

. The eigenvalue bounds demonstrated by the Theorem 3 above show that the parameter

δ

determines the clustering and distribution of eigenvalues. This clearly shows that if

δ

is too small the eigenvalues are not clustered and

Q_{S_{1}}

will not be a better approximation. This entails that large values of

δ

,

Q_{S_{1}}

are a perfect approximation. The current research seeks to construct an approximation whose performance does not rely on both the mesh size and the regularisation parameter. The following theorem in [6] gives the eigenvalue distribution of the preconditioned coefficient matrix with preconditioner

Q_{1}

with the approximation for the Schur complement form

Q_{S_{1}}

where

Q_{1} : = (\begin{matrix} M & O & O \\ O & δ M & O \\ O & O & K M^{- 1} K \end{matrix})

(10)

Theorem 4

([7,24]). For

Q_{1}

approximation, let the preconditioner

Q_{1}

be defined by Equation (10). Assume that λ is an eigenvalue of the preconditioned matrix

Q_{1}^{- 1} K

. Then

λ = 1

or λ satisfies the following bound:

1 + \frac{1}{2 δ} \frac{α_{1}^{2}}{θ_{2}^{2}} h^{4} \leq λ \leq 1 + \frac{1}{2 δ} \frac{α_{1}^{2}}{θ_{2}^{2}}

For the proof and details we refer to [7,24] and references therein.

3. Proposed Schur Complement Preconditioner Approximation

The above Theorem 4 has demonstrated that applying the preconditioner with the Schur complement approximation

Q_{S_{1}}

results in performance and convergence of the preconditioned iterative solver that are independent of the discretisation mesh size and dependent on the regularisation parameter. It is expected that the convergences of the Krylov methods such as MINRES and GMRES are independent of the mesh size. The crucial task is to develop an efficient preconditioner that gives close eigenvalue bounds of the preconditioned system. The scientific literature on preconditioning optimal control systems includes classical Schur complement based approximations. In [21] we see the extension of the preconditioner whose effects are independent of the regularisation parameter. This was also used in [24]. The main task of this section is to briefly discuss the approximation in [21] and the one we propose in this work, which has the same clustering and distribution of eigenvalues and produces a preconditioned iterative solver that is not sensitive to the changes and decreasing regularisation parameter. The following approximation for S was developed in [21]

S = (K + \frac{1}{\sqrt{δ}} M) M^{- 1} (K + \frac{1}{\sqrt{δ}} M) - \frac{2}{\sqrt{δ}} K

which reduces to

S = (K + \frac{1}{\sqrt{δ}} M) M^{- 1} (K + \frac{1}{\sqrt{δ}} M) - \frac{2}{\sqrt{δ}} K \approx (K + \frac{1}{\sqrt{δ}} M) M^{- 1} (K + \frac{1}{\sqrt{δ}} M) : = Q_{S_{2}}

by discarding the much smaller term

\frac{2}{\sqrt{δ}} K

which is a smaller

O (δ^{\frac{- 1}{2}})

term than

O (δ^{- 1})

shown Section 2 above. To get eigenvalue bounds for

Q_{S_{2}}^{- 1} S

, we follow the same derivation and use the Theorems 1–3 above. The preconditioner involving the Schur complement approximation

Q_{S_{2}}

is given as follows

Q_{2} : = (\begin{matrix} M & O & O \\ O & δ M & O \\ O & O & (K + \frac{1}{\sqrt{δ}} M) M^{- 1} (K + \frac{1}{\sqrt{δ}} M) \end{matrix})

(11)

There has been development of similarly structured Schur complement approximations to

Q_{S_{2}}

whose eigenvalue bounds are the same but differ in computational complexities. We have

Q_{S_{c}} = (K + \frac{1}{i \sqrt{δ}} M) M^{- 1} (K - \frac{1}{i \sqrt{δ}} M)

which is complex based developed by Choi and others in [32]. In this paper we propose and analyse a robust approximation for S which has the same eigenvalue bounds but a different eigenvalue distribution within the interval for

Q_{S_{2}}

as the approximation

Q_{S_{3}}

which is expected to accelerate the numerical solution. Our proposed approximation for the Schur complement is also derived from

Q_{S_{1}}

by writing

\begin{matrix} S & = & K M^{- 1} K + δ^{- 1} M \\ = & (\sqrt{δ} K) {(δ M)}^{- 1} (\sqrt{δ} K) + {(δ M)}^{- 1} + 2 {\sqrt{δ}}^{- 1} K - 2 {\sqrt{δ}}^{- 1} K \\ = & [(\sqrt{δ} K) {(δ M)}^{- 1} (\sqrt{δ} K) + δ^{- 1} K] + [{(δ M)}^{- 1} + δ^{- 1} K] - 2 {\sqrt{δ}}^{- 1} K \\ = & \sqrt{δ} K [{(δ M)}^{- 1} (\sqrt{δ} K) + δ^{- 1} I] + δ^{- 1} [\sqrt{δ} K + M] - 2 {\sqrt{δ}}^{- 1} K \\ = & \sqrt{δ} K [({(δ M)}^{- 1} (\sqrt{δ} K) + δ^{- 1} M^{- 1} M] + δ^{- 1} [\sqrt{δ} K + M] - 2 {\sqrt{δ}}^{- 1} K \\ = & \sqrt{δ} K {(δ M)}^{- 1} [\sqrt{δ} K + M] + δ^{- 1} [\sqrt{δ} K + M] - 2 {\sqrt{δ}}^{- 1} K \\ = & [\sqrt{δ} K {(δ M)}^{- 1} + δ^{- 1} I] [\sqrt{δ} K + M] - 2 {\sqrt{δ}}^{- 1} K \\ = & [\sqrt{δ} K {(δ M)}^{- 1} + δ^{- 1} M M^{- 1}] [\sqrt{δ} K + M] - 2 {\sqrt{δ}}^{- 1} K \\ = & [\sqrt{δ} K + M] {(δ M)}^{- 1} [\sqrt{δ} K + M] - 2 {\sqrt{δ}}^{- 1} K \end{matrix}

which similarly reduces to

S = (\sqrt{δ} K + M) {(δ M)}^{- 1} (\sqrt{δ} K + M) - \frac{2}{\sqrt{δ}} K \approx (\sqrt{δ} K + M) {(δ M)}^{- 1} (\sqrt{δ} K + M) = Q_{S_{3}}

by discarding a

O (δ^{- \frac{1}{2}})

term this means that the error committed grows much slower. To motivate our choice of the Schur complement approximation, we illustrate the properties of the preconditioned Schur complement and the preconditioned system. We now discuss the derivation of the eigenvalue distribution of

Q_{S_{3}}^{- 1} S

by following the same derivation in [21].

Theorem 5.

The eigenvalues of

Q_{S_{3}}^{- 1} S

satisfy the following bounds

σ (Q_{S_{3}}^{- 1} S) \in [\frac{1}{2}, 1]

independent of both mesh size h and regularisation parameter δ

Proof of Theorem 5

Q_{S_{3}}

is invertible since M is symmetric and positive definite. Let

σ

be an eigenvalue of

Q_{S_{3}}^{- 1} S

corresponding to eigenvector

y

; then

\begin{matrix} Q_{S_{3}}^{- 1} S y & = & σ y \\ S y & = & σ Q_{S_{3}} y \\ (K M^{- 1} K + \frac{1}{δ} M) y & = & σ (\sqrt{δ} K + M) {(δ M)}^{- 1} (\sqrt{δ} K + M) y \\ (K M^{- 1} K + \frac{1}{δ} M) y & = & σ (K M^{- 1} K + \frac{1}{δ} M + \frac{2}{\sqrt{δ}} K) y \\ (I + \frac{1}{δ} K^{- 1} M K^{- 1} M) y & = & σ (I + \frac{1}{δ} K^{- 1} M K^{- 1} M + \frac{2}{\sqrt{δ}} K^{- 1} M) y \\ (δ I + K^{- 1} M K^{- 1} M) y & = & σ (δ I + K^{- 1} M K^{- 1} M + 2 \sqrt{δ} K^{- 1} M) y \\ (δ I + {(K^{- 1} M)}^{2}) y & = & σ {(\sqrt{δ} I + K^{- 1} M)}^{2} y \\ {(\sqrt{δ} I + K^{- 1} M)}^{- 2} (δ I + {(K^{- 1} M)}^{2}) y & = & σ y \end{matrix}

Let

ν

be the eigenvalue of

K^{- 1} M

; then

{(\sqrt{δ} + ν)}^{- 2} (δ + ν^{2}) y = σ y

This means that

σ = \frac{(δ + ν^{2})}{{(ν + \sqrt{δ})}^{2}}

is an eigenvalue of

Q_{S_{3}}^{- 1} S

. Now since

K^{- 1} M

is similar to the real and symmetric matrix

M^{\frac{1}{2}} K^{- 1} M^{\frac{1}{2}}

, it is diagonalisable; this means that it describes all the eigenvalues of

Q_{S_{3}}^{- 1} S

. It is known that a fraction of the form

\frac{v^{2} + w^{2}}{{(v + w)}^{2}}

is bounded between

\frac{1}{2}

and 1. Hence,

υ = \frac{(δ + ν^{2})}{{(ν + \sqrt{δ})}^{- 2}} = \frac{({\sqrt{δ}}^{2} + ν^{2})}{{(ν + \sqrt{δ} I)}^{- 2}}

which takes the form

\frac{v^{2} + w^{2}}{{(v + w)}^{2}}

. Thus,

\frac{1}{2} \leq σ \leq 1

. By this we conclude that

σ ({\hat{Q}}_{S_{3}}^{- 1} S) \in [\frac{1}{2}, 1]

is clustered and not dependent on the mesh size h and the regularisation parameter

δ

. □

The actual application of the preconditioner

Q_{S_{3}}

in actual computations involving solving two subsystems to accelerate the convergence of Krylov subspace iteration process can be implemented as: Application of Preconditioner

Q_{S_{3}}

.

Algorithm 1.Application of Preconditioner

Q_{S_{3}}

Let

L = \sqrt{δ} K + M

and

w

be any given vector; we can compute the vector

v

and

Q_{S_{3}} v = w

, that is,

L M^{- 1} L v = w

, using the following procedures:

$L y = w$ solve for $y$ ;
$M^{- 1} L v = y$ ; we have $M y = u$ ;
$L v = u$ solve for $v$ .

Since M and K are sparse, L is sparse too, and an added bonus for

Q_{S_{3}}

and

Q_{S_{2}}

is that they maintain

δ

-dependency and their bounds are fairly good. The preconditioner involving the Schur complement approximation

Q_{S_{3}}

is given as follows.

Q_{3} : = (\begin{matrix} M & O & O \\ O & δ M & O \\ O & O & (\sqrt{δ} K + M) {(δ M)}^{- 1} (\sqrt{δ} K + M) \end{matrix})

(12)

We now derive the eigenvalues distribution of the preconditioned coefficient matrix

Q_{3}^{- 1} K

with the modified approximation for the Schur complement form. We will use Proposition (2) in [7] which extends Theorem 4.

Theorem 6

([7]). The eigenvalues of the preconditioned system

Q_{3}^{- 1} K

where

K : = (\begin{matrix} M & O & K \\ O & δ M & - M \\ K & - M & O \end{matrix})

and

Q_{3} = (\begin{matrix} M & O & O \\ O & δ M & O \\ O & O & Q_{S_{3}}^{- 1} \end{matrix})

are in the interval

μ (Q_{3}^{- 1} K) \in [\frac{1 - \sqrt{5}}{2}, \frac{1 - \sqrt{3}}{2}] \cup 1 \cup [\frac{1 + \sqrt{3}}{2}, \frac{1 + \sqrt{5}}{2}]

Proof of Theorem 5

Let

σ

be the eigenvalue of

Q_{S_{3}}^{- 1} S

and

μ

be an eigenvalue of

Q_{3}^{- 1} K

corresponding to the eigenvector

v = {v_{1}, v_{2}, v_{3}}

. We want to find the eigenvalues bounds for

Q_{3}^{- 1} K

. We use a different approach from the one in [21]. We have

Q_{3}^{- 1} = (\begin{matrix} M^{- 1} & O & O \\ O & {(δ M)}^{- 1} & O \\ O & O & Q_{S_{3}}^{- 1} \end{matrix})

The preconditioned coefficient matrix

\begin{matrix} Q_{3}^{- 1} K & = & (\begin{matrix} M^{- 1} & O & O \\ O & {(δ M)}^{- 1} & O \\ O & O & Q_{S_{3}}^{- 1} \end{matrix}) (\begin{matrix} M & O & K \\ O & δ M & - M \\ K & - M & O \end{matrix}) \\ = & (\begin{matrix} I & O & M^{- 1} K \\ O & I & - {(δ)}^{- 1} I \\ Q_{S_{3}}^{- 1} K & - Q_{S_{3}}^{- 1} M & O \end{matrix}) \end{matrix}

We consider the eigenvalue problem

\begin{matrix} Q_{3}^{- 1} K v & = & μ v \\ (\begin{matrix} I & O & M^{- 1} K \\ O & I & - {(δ)}^{- 1} I \\ Q_{S_{3}}^{- 1} K & - Q_{S_{3}}^{- 1} M & O \end{matrix}) (\begin{matrix} v_{1} \\ v_{2} \\ v_{3} \end{matrix}) & = & μ (\begin{matrix} v_{1} \\ v_{2} \\ v_{3} \end{matrix}) \end{matrix}

It follows that,

\begin{matrix} v_{1} + M^{- 1} K v_{3} & = & μ v_{1} \end{matrix}

(13)

\begin{matrix} v_{2} + \frac{1}{δ} I v_{3} & = & μ v_{2} \end{matrix}

(14)

\begin{matrix} Q_{S_{3}}^{- 1} K v_{1} - Q_{S_{3}}^{- 1} M v_{2} & = & μ v_{3} \end{matrix}

(15)

If $μ = 1$ from Equations (13) and () we have $M^{- 1} K v_{3} = 0$ and $\frac{1}{δ} I v_{3} = 0$ , then $v_{3} = 0$ . Therefore, the corresponding eigenvector is

$v = (\begin{matrix} v_{1} \\ v_{2} \\ 0 \end{matrix}), \forall (\begin{matrix} v_{1} \\ v_{2} \end{matrix}) \neq 0$
If $μ \neq 1$ from Equations (13) and () we have

$\begin{matrix} v_{1} & = & \frac{1}{μ - 1} M^{- 1} K v_{3} \end{matrix}$

(16)

$\begin{matrix} v_{3} & = & \frac{1}{δ (μ - 1)} v_{3} \end{matrix}$

(17)

Substituting Equations (16) and () into Equation (), we get

$\begin{matrix} Q_{S_{3}}^{- 1} K (\frac{1}{μ - 1} M^{- 1} K v_{3}) - Q_{S_{3}}^{- 1} M (\frac{1}{δ (μ - 1)} v_{3}) & = & μ v_{3} \\ Q_{S_{3}}^{- 1} K M^{- 1} K v_{3} - \frac{1}{δ} Q_{S_{3}}^{- 1} M v_{3} & = & μ (μ - 1) v_{3} \\ Q_{S_{3}}^{- 1} (K M^{- 1} K v_{3} - \frac{1}{δ} M) v_{3} & = & μ (μ - 1) v_{3} \\ Q_{S_{3}}^{- 1} S v_{3} & = & μ (μ - 1) v_{3} \end{matrix}$

Therefore $μ$ satisfies $Q_{S_{3}}^{- 1} S v_{3} = μ (μ - 1) v_{3}$ with the corresponding eigenvector

$v = (\begin{matrix} \frac{1}{μ - 1} M^{- 1} K v_{3} \\ \frac{1}{δ (μ - 1)} v_{3} \\ v_{3} \end{matrix}), \forall v_{3} \neq 0$

Since

σ

is the eigenvalue of

Q_{S_{3}}^{- 1} S

, then we have

μ (μ - 1) = σ

which is

μ^{2} - μ - σ = 0

(18)

Solving Equation (18) we get

μ = \frac{1 \pm \sqrt{1 + 4 σ}}{2}

The values of

μ

derived above agree with the Proposition (2) in [6] which gives the bounds

μ (Q_{3}^{- 1} K) \in [\frac{1 - \sqrt{1 + 4 σ_{m}}}{2}, \frac{1 - \sqrt{1 + 4 σ_{1}}}{2}] \cup 1 \cup [\frac{1 + \sqrt{1 + 4 σ_{1}}}{2}, \frac{1 + \sqrt{1 + 4 σ_{m}}}{2}]

. We have that if

0 \leq σ_{1} \leq σ_{2} \leq \cdot \cdot \cdot \leq σ_{m}

are eigenvalues of

Q_{S_{3}}^{- 1} K M^{- 1} K + \frac{1}{δ} Q_{S_{3}}^{- 1} M

then the other eigenvalues of

Q_{3}^{- 1} K

are

\begin{matrix} \frac{1 + \sqrt{1 + 4 σ_{1}}}{2} \leq & μ & \leq \frac{1 + \sqrt{1 + 4 σ_{m}}}{2} o r \end{matrix}

(19)

\begin{matrix} \frac{1 - \sqrt{1 + 4 σ_{m}}}{2} \leq & μ & \leq \frac{1 - \sqrt{1 + 4 σ_{1}}}{2} . \end{matrix}

(20)

Now we have

Q_{S_{3}}^{- 1} K M^{- 1} K - \frac{1}{δ} Q_{S_{3}}^{- 1} M = Q_{S_{3}}^{- 1} (K M^{- 1} K + \frac{1}{δ} M) = Q_{S_{3}}^{- 1} S

This means that the eigenvalues of

Q_{S_{3}}^{- 1} K M^{- 1} K - \frac{1}{δ} Q_{S_{3}}^{- 1} M

are the eigenvalues of

Q_{S_{3}}^{- 1} S

. From Theorem 5 we have

σ (Q_{S_{3}}^{- 1} S) \in [\frac{1}{2}, 1]

. Let

σ_{1} = \frac{1}{2}

and

σ_{m} = 1

; substituting in Equations (19) and () we get

\begin{matrix} \frac{1 + \sqrt{1 + 4 σ_{1}}}{2} & \leq μ \leq & \frac{1 + \sqrt{1 + 4 σ_{m}}}{2} \\ \frac{1 + \sqrt{1 + 4 (\frac{1}{2})}}{2} & \leq μ \leq & \frac{1 + \sqrt{1 + 4 (1)}}{2} \\ \frac{1 + \sqrt{3}}{2} & \leq μ \leq & \frac{1 + \sqrt{5}}{2} \end{matrix}

and

\begin{matrix} \frac{1 - \sqrt{1 + 4 σ_{m}}}{2} \leq μ \leq \frac{1 - \sqrt{1 + 4 σ_{1}}}{2} \\ \frac{1 - \sqrt{1 + 4 (1)}}{2} \leq μ \leq \frac{1 - \sqrt{1 + 4 (\frac{1}{2})}}{2} \\ \frac{1 - \sqrt{5}}{2} \leq μ \leq \frac{1 - \sqrt{3}}{2} \end{matrix}

Therefore the eigenvalues of the preconditioned coefficient matrix are

\begin{matrix} μ & = & 1 \\ \frac{1 + \sqrt{3}}{2} \leq & μ & \leq \frac{1 + \sqrt{5}}{2} \\ \frac{1 - \sqrt{5}}{2} \leq & μ & \leq \frac{1 - \sqrt{3}}{2} \end{matrix}

□

The Theorem 6 above clearly shows that the eigenvalues of the preconditioned coefficient matrix are clustered and are independent of both the mesh size and the regularisation parameter. The iterative solver is expected to converge with changes in the problem parameters and discretisation parameter. In the next section we carry out numerical tests to verify the theoretical findings and compare the performance of the MINRES solver with the block diagonal preconditioner associated with the three approximations for the Schur complement form.

4. Numerical Results

In this section we present the results of the numerical experiments for solving the problem (Equation (6)) using the block diagonal preconditioned MINRES method. The main task here is to demonstrate the effectiveness of preconditioner in accelerating the MINRES solver. In this study the numerical experiments began with the

Q_{1}

finite element discretisation of the state y, the adjoint p and the control u equations using uniform grids. All simulations and implementations were performed on a Windows 10 platform with Intel^® Core™ i5-3230M CPU@2.6 GHz 6.00 GB speed Intel^® using Matlab 7 programming language. We used the IFISS matlab package developed in [33] to generate a discrete linear algebraic system. The values of the mesh size h and the dimensions of M, K and

K

are shown in Table 1 below

The exact solutions for the distributed optimality system and

\bar{Ω} = {(- 1, 1)}^{2} \subset R^{2}

are

\begin{matrix} y_{exact} & = & \frac{1}{1 + 4 δ π^{4}} sin (π x_{1}) sin (π x_{2}) \end{matrix}

(21)

\begin{matrix} u_{exact} & = & 2 π^{2} y_{exact} \end{matrix}

(22)

\begin{matrix} y_{d} & = & sin (π x_{1}) sin (π x_{2}) \end{matrix}

(23)

We give a thumbnail outline of the main task of the paper in the application of the approximation of the Schur complement preconditioner

Q_{S_{3}}

. The main dominant operations in the application of our proposed block diagonal preconditioner

Q_{3}

are 10 fixed Chebyshev semi iterations [6,12,29] for the mass matrix M in the (1,1) and (2,2) blocks; the (3,3) block is approximated by two cycles of the algebraic multigrid methods for

\sqrt{δ} K + M

. This means in total we have two Chebyshev and two algebraic multigrid iterations with two pre and post-smoothing steps of the Jacobi method; similarly for

Q_{S_{1}}

and

Q_{S_{2}}

. We test for four regularisation parameters

δ = 10^{- 3}, δ = 10^{- 5}, δ = 10^{- 7}

and

δ = 10^{- 9}

.

We solve the linear algebraic system of saddle point form Equation (6) using the MINRES with the block diagonal preconditioners

Q_{1}

(Equation (10)),

Q_{2}

(Equation (11)) and

Q_{3}

(Equation (12)) with the three different preconditioner approximations for the Schur complement form S. Then we compare the performance of the block diagonal preconditioned MINRES method with the three preconditioner approximations for the Schur complement in terms of the number of iterations, CPU time,

L^{2}

error and the value of the objective function. The numerical solution produced by the MINRES with the preconditioner associated with the three Schur complement approximations is the same. Figure 1 gives the snapshots of the numerical solution,

We now present the eigenvalue distribution of the

Q_{S_{1}}^{- 1} S, Q_{S_{2}}^{- 1} S, Q_{S_{3}}^{- 1} S

.

The Table 2 shows the largest and the smallest eigenvalues of the Schur complement

S, Q_{S_{1}}^{- 1} S, Q_{S_{2}}^{- 1} S, Q_{S_{3}}^{- 1} S

. The values in Table 2 clearly show the effects of the preconditioner approximation on the spectral properties of the Schur complement. The eigenvalues of

Q_{S_{1}}^{- 1} S

are clearly dependent on the parameter

δ

; and the eigenvalues of

Q_{S_{2}}^{- 1} S, Q_{S_{3}}^{- 1} S

are bounded between

\frac{1}{2}

and 1 though their distribution and are different independent of the parameter

δ

. This confirms the theoretical results and is clearly shown on the figures below. Figure 2 shows the distribution of eigenvalues for different values of regularisation parameter.

Figure 2 gives the eigenvalue distribution of

Q_{S_{1}}^{- 1} S, Q_{S_{2}}^{- 1} S, Q_{S_{2}}^{- 1} S

for different values of the regularisation parameter at

h = 2^{- 4}

. Figure 2a,d,g gives the eigenvalue distribution of

Q_{S_{1}}^{- 1} S

; Figure 2b,e,h shows the eigenvalue distribution of

Q_{S_{2}}^{- 1} S

; and Figure 2c,f,i shows the eigenvalue distribution of

Q_{S_{3}}^{- 1} S

for different values of

δ

at

h = 2^{- 4}

. The eigenvalues are sorted form the smallest to the largest. The most important observation is that the eigenvalue distribution of

Q_{S_{1}}^{- 1} S

is widely spread out as the regularisation parameter decreases. This agrees with the theoretical findings that the eigenvalues depends on the regularisation parameter. Hence the preconditioner approximation performance is not independent of

δ

and deteriorates for small values of

δ

. We also observe that for the approximations

Q_{S_{2}}

and

Q_{S_{3}}

the eigenvalue distribution falls with the interval that is given theoretically for the different values of the regularisation parameter. This entails that the clustering of the eigenvalues shown in the Figure 2 above points to the fast convergence of the iterative solver for all the values of the regularisation parameter.

We now give the numerical experiment results from our MINRES iterative solver preconditioned with the block diagonal preconditioner associated with the three approximations. Since the numerical solutions are the same, we concentrate on the performance in terms of the number of iterations and the CPU time in seconds for comparison purposes.

It is clear from Table 3 that all these preconditioners associated with the approximations

Q_{S_{1}}, Q_{S_{2}}, Q_{S_{3}}

are robust with respect to mesh size h, but the approximation

Q_{S_{1}}

is not robust with respect to the regularisation parameter. The preconditioners associated with approximations

Q_{S_{2}}, Q_{S_{3}}

performed efficiently for all the values of h and

δ

, with

Q_{S_{3}}

performing slightly better.

The results in Table 3 and Table 4 demonstrate the practical applicability of the preconditioners explained theoretically in Section 2 and Section 3 and the theoretical prowess of the three approximations of the Schur complement form using the eigenvalue distribution. Table 3 and Table 4 give the number of iterations and the CPU time taken by the MINRES preconditioned by block diagonal preconditioners Equations (10)–(12) with different values of mesh size and the regularisation parameter, and we compare the effectiveness of the three preconditioners with approximations for the Schur compliment preconditioner. The results clearly show that the preconditioner with

Q_{S_{1}}

displays mesh-independent convergence but fails on the decreasing values of the regularisation parameter

δ

. The number of iterations and CPU time increase as

δ \to 0

. This agrees with the poor eigenvalue distribution of

Q_{S_{1}}^{- 1} S

and that of

Q_{1} K

. The number of iterations and CPU times produced by the preconditioners associated with

Q_{S_{2}}

and

Q_{S_{3}}

are competitive and exhibited parameters h and

δ

independent of convergence, while both h and

δ

approached zero. The number of iterations of

Q_{S_{2}}

and

Q_{S_{3}}

decreased by the decrease of

δ

, as expected, and those of

Q_{S_{3}}

decreased further. The results show that these preconditioners are also robust to the mesh size h and the regularisation parameter

δ

. This agrees with the theoretical results that the eigenvalues are more clustered and distributed more independently of the parameters h and

δ

than those of

Q_{S_{1}}^{- 1} S

; see Figure 2. The results of our numerical experiments clearly show that the preconditioners with the approximations

Q_{S_{3}}

and

Q_{S_{2}}

are very effective in improving the MINRES in solving elliptic control problems. When compared to other preconditioner approximations in terms of iteration count and computational time,

Q_{S_{3}}

slightly outperformed them. The preconditioners are more effective with our new approximation, producing more favourable results. We now give the results including the cost functional at

2^{- 6}

for different values of the objective function for the approximation

Q_{S_{3}}

and there are the same outputs with the other approximations.

The results in Table 5 show the behaviour of the cost functional for different values of the regularisation parameter

δ

. It is well known that the

δ

determines how close the state approaches the desired state

y_{d}

. The results provide an interesting observations that

{‖ u ‖}_{2}

stops increasing at

10^{- 6}

with

J (y, u)

and

‖ y - y_{d} ‖_{2}

decreases with the decrease by a constant factor of the decrease of

δ

. This means that the optimal value of

δ

for the problem is

10^{- 6}

, becoming very close to the desired state, and also that the control variable

‖ u ‖

increases as

δ

decreases. This is clear indication that the cost functional will be insensitive to the control variable as

δ

decreases.

5. Conclusions

The finite element discretisation of the elliptic PDE-constrained optimisation problem produces a large block

3 \times 3

linear algebraic system of saddle point form. The main task of this paper was to present a different approach for the approximation of the Schur complement in the preconditioning strategies to get a robust and efficient numerical solver. This was done to achieve the MINRES solver whose performance is independent of the mesh size h and the regularisation parameter

δ

. We have demonstrated both theoretically and numerically that the eigenvalues of both the the preconditioned Schur complement and the preconditioned system involving the proposed approximation are independent of the changes in the mesh size and the regularisation parameter. This gives a robust numerical scheme that is independent of h and

δ

. We have compared numerically the results of the preconditioners involving the proposed approximation of the Schur complement with those that are found in the literature. Numerical results have confirmed the theoretical results and have demonstrated that the proposed preconditioner in this paper can be used practically and can be considered as a viable preconditioner for the problem under investigation.

Author Contributions

All authors contributed equally and significantly in writing this article. All authors have read and agreed to the published version of the manuscript.

Acknowledgments

The authors wish to acknowledge the financial support from the University of Venda.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Abbreviations

The following abbreviations are used in this manuscript:

PDE	Partial Differential Equation
MINRES	Minimum Residual Method
GMRES	Generalized Minimal Residual Method
CPU	Central Processing Unit

References

Lions, J.L. Optimal Control Systems Governed by Partial Differential Equations; Springer: Berlin, Germany, 1971. [Google Scholar]
Tröltzsch, F. Optimal Control of Partial Differential Equations. Theory, Methods and Applications; American Mathematical Society: Berlin, Germany, 2010. [Google Scholar]
Benzi, M.; Golub, G.H.; Liesen, J. Numerical solution of saddle point problems. Acta Numer. 2005, 14, 1–137. [Google Scholar] [CrossRef] [Green Version]
Hackbusch, W. Fast solution of elliptic control problems. J. Optim. Theory Appl. 1980, 31, 565–581. [Google Scholar] [CrossRef]
Hinze, M.; Pinnau, R.; Ulbrich, M.; Ulbrich, S. Optimization with PDE constraints; Springer: New York, NY, USA, 2009. [Google Scholar]
Rees, T.; Dollar, H.S.; Wathen, A.J. Optimal solvers for PDE-constrained optimization. SIAM J. Sci. Comput. 2010, 32, 271–298. [Google Scholar] [CrossRef] [Green Version]
Rees, T.; Stoll, M. Block-triangular preconditioners for PDE-constrained optimization. Numer. Linear Algebra Appl. 2010, 17, 977–996. [Google Scholar] [CrossRef]
Borzi, A.; Schulz, V. Multigrid methods for PDE optimization. SIAM Rev. 2009, 51, 361–395. [Google Scholar] [CrossRef]
Schöberl, J.; Zulehner, W.; Simon, R. A robust multigrid method for elliptic control problems. SIAM J. Numer. Anal. 2011, 49, 1482–1503. [Google Scholar] [CrossRef] [Green Version]
Dollar, H.S.; Nicholas, I.M.; Gould, M.; Stoll, M.; Wathen, A.J. Preconditioning saddle-point systems with applications in optimization. SIAM J. Sci. Comput. 2010, 32, 249–270. [Google Scholar] [CrossRef] [Green Version]
Schöberl, J.; Zulehner, W. Symmetric indefinite preconditioners for saddle point problems with applications to PDE-constrained optimisation problems. J. Matrix Anal. Appl. 2007, 29, 752–773. [Google Scholar] [CrossRef] [Green Version]
Wathen, A.J.; Rees, T. Chebyshev semi-iteration in preconditioning for problems including the mass matrix. Electron. Trans. Numer. Ananlysis 2009, 34, 125–135. [Google Scholar]
Borzi, A.; Kunisch, K. A Multigrid scheme for elliptic constrained optimal control problems. Comput. Optim. Appl. 2005, 31, 309–333. [Google Scholar] [CrossRef]
Borzi, A. Smoother for control-and State- constrained optimal control problems. Comput. Vis. Sci. 2008, 11, 56–66. [Google Scholar] [CrossRef]
Hinze, M.; Kunisch, K. Multigrid methods for control-constrained elliptic optimal control problems. In Numerical Mathematics and Advanced Applications 2009; Springer: Berlin/Heidelberg, Germany, 2010; pp. 883–891. [Google Scholar]
Tachs, S.; Zulehner, W. Convergence analysis of multigrid methods with collective point smoothers for optimal control problems. Comput. Vis. Sci. 2011, 14, 131–141. [Google Scholar]
Vallejos, M.; Borzi, A. Multigrid methods for control-constrained elliptic optimal control problems. Numer. Math. Adv. Appl. 2010, 26, 883–891. [Google Scholar]
Zulehner, W. A class of smoothers for saddle point problems. Computing 2000, 65, 227–246. [Google Scholar] [CrossRef]
Zulehner, W. Analysis of iterative methods for saddle point problems: A unified approach. Math. Comput. 2002, 238, 479–505. [Google Scholar] [CrossRef] [Green Version]
Bai, Z.Z. Block preconditioners for elliptic PDE-constrained optimization problems. Computing 2011, 91, 379–395. [Google Scholar] [CrossRef]
Pearson, J.W.; Wathen, A.J. A new approximation of the Schur complement in preconditioners for PDE-constrained optimization. Numer. Linear Algebra Appl. 2012, 19, 816–829. [Google Scholar] [CrossRef]
Zhang, G.; Zheng, Z. Block-symmetric and block-lower-triangular preconditioners for PDE-constrained optimization problems. J. Comput. Math. 2013, 31, 370–381. [Google Scholar] [CrossRef]
Mirchi, H.; Salkkuyeh, D.K. A new preconditioner for elliptic PDE-constrained optimization problems. Numer. Algorithms 2020, 83, 663–668. [Google Scholar] [CrossRef] [Green Version]
Ke, Y.F.; Ma, C.F. Some preconditioners for elliptic PDE-constrained optimization problems. Comput. Math. Appl. 2018, 75, 2795–2813. [Google Scholar] [CrossRef]
Pestana, J.W.; Wathen, A.J. Combination preconditioning of saddle point systems for positive definiteness. Numer. Linear Algebra Appl. 2013, 20, 785–808. [Google Scholar] [CrossRef] [Green Version]
Rees, T.; Wathen, A.J. Preconditioning iteraive methods for the optimal control of the Stokes equations. SIAM J. Sci. Comput. 2011, 33, 2903–2926. [Google Scholar] [CrossRef]
Stoll, M.; Wathen, A.J. Combination preconditioning and the Bramble-Pasciak⁺ preconditioner. J. Matrix Anal. Appl. 2008, 30, 582–608. [Google Scholar] [CrossRef]
Hackbusch, W. On the fast solving of parabolic boundary control problems. SIAM J. Control Optim. 1979, 17, 231–244. [Google Scholar] [CrossRef]
Pearson, J.W.; Stoll, M.; Wathen, A.J. Robust iterative solution of a class of time-dependent optimal control problems. Proc. Appl. Math. Mech. 2012, 12, 3–6. [Google Scholar] [CrossRef]
Pearson, J.W.; Wathen, A.J. Fast iterative solvers for convection-diffussion optimal control problems. Electron. Trans. Numer. Anal. 2013, 40, 294–310. [Google Scholar]
Paige, C.G.; Saunders, M.A. Solution of sparse indefinite systems of linear equations. SIAM J. Numer. Anal. 1975, 12, 617–629. [Google Scholar] [CrossRef]
Choi, Y.; Farhat, C.; Murray, W.; Saunders, M. A Practical Factorization of a Schur Complement for PDE-Constrained Distributed Optimal Control. J. Sci. Comput. 2015, 65, 576–597. [Google Scholar] [CrossRef] [Green Version]
Elman, H.C.; Ramage, A.; Silvester, D.J. Algorithm 866: IFISS a matlab toolbox for modeling incompressible flow. ACM Trans. Math. Softw. 2007, 33. [Google Scholar] [CrossRef]

Figure 1. Numerical Solution of the state y and control u at

h = 2^{- 6}

and

δ = 10^{- 5}

.

Figure 1. Numerical Solution of the state y and control u at

h = 2^{- 6}

and

δ = 10^{- 5}

.

Figure 2. Ordered eigenvalue distributions of

Q_{S_{1}}^{- 1} S [a, d, g]

,

Q_{S_{2}}^{- 1} S [b, e, h]

and

Q_{S_{3}}^{- 1} S [c, f, i]

for

δ = 1 \times 10^{- 3}, δ = 1 \times 10^{- 5}

and

δ = 1 \times 10^{- 7}

with

h = 2^{- 4}

.

Figure 2. Ordered eigenvalue distributions of

Q_{S_{1}}^{- 1} S [a, d, g]

,

Q_{S_{2}}^{- 1} S [b, e, h]

and

Q_{S_{3}}^{- 1} S [c, f, i]

for

δ = 1 \times 10^{- 3}, δ = 1 \times 10^{- 5}

and

δ = 1 \times 10^{- 7}

with

h = 2^{- 4}

.

Table 1. Mesh sizes and corresponding sizes of the matrices.

Mesh Size (h)	$2^{- 3}$	$2^{- 4}$	$2^{- 5}$	$2^{- 6}$	$2^{- 7}$	$2^{- 8}$	$2^{- 9}$
size of $M (n)$	81	289	1089	4225	16,641	66,049	26,3169
size of $K (n)$	81	289	1089	4225	16,641	66,049	263,169
size of $K (3 n)$	243	867	3267	12,675	49,923	198,147	789,507

Table 2. Maximum and minimum eigenvalues for the Schur complement S and preconditioned Schur complement with preconditioners

Q_{S_{1}}, Q_{S_{2}}, Q_{S_{3}}

for different values of

δ

at

h = 2^{- 4}

.

Table 2. Maximum and minimum eigenvalues for the Schur complement S and preconditioned Schur complement with preconditioners

Q_{S_{1}}, Q_{S_{2}}, Q_{S_{3}}

for different values of

δ

at

h = 2^{- 4}

.

	S		$Q_{S_{1}}^{- 1} S$		$Q_{S_{2}}^{- 1} S$		$Q_{S_{3}}^{- 1} S$
$δ$	min	max	min	max	min	max	min	max
$10^{- 3}$	15.8	4.03 × 10 $^{3}$	1.004	1001	0.5002	0.9594	0.5002	0.9594
$10^{- 5}$	1.3 × 10 $^{3}$	1 × 10 $^{5}$	1.005	1 × 10 $^{5}$	0.5000	0.9937	0.500	0.9979
$10^{- 7}$	2.21 × 10 $^{4}$	1 × 10 $^{7}$	5.49	1 × 10 $^{7}$	0.5643	0.9940	0.7740	1
$10^{- 9}$	1.8 × 10 $^{6}$	1 × 10 $^{9}$	449.9	1 × 10 $^{9}$	0.9139	0.9979	1	1
$10^{- 15}$	1.8 × 10 $^{12}$	1 × 10 $^{15}$	4.5e+8	1 × 10 $^{15}$	0.9999	1	1	1

Table 3. Number of iterations made by MINRES solver with the block diagonal preconditioners

Q_{1}, Q_{2}, Q_{3}

with Schur compliment approximated by

Q_{S_{1}}, Q_{S_{2}}, Q_{S_{3}}

for different values of h and

δ

, tolerance =

10^{- 6}

.

Table 3. Number of iterations made by MINRES solver with the block diagonal preconditioners

Q_{1}, Q_{2}, Q_{3}

with Schur compliment approximated by

Q_{S_{1}}, Q_{S_{2}}, Q_{S_{3}}

for different values of h and

δ

, tolerance =

10^{- 6}

.

	Approximation $Q_{S_{1}}$				Approximation $Q_{S_{2}}$				Approximation $Q_{S_{3}}$
h	$δ$				$δ$				$δ$
	$10^{- 3}$	$10^{- 5}$	$10^{- 7}$	$10^{- 9}$	$10^{- 3}$	$10^{- 5}$	$10^{- 7}$	$10^{- 9}$	$10^{- 3}$	$10^{- 5}$	$10^{- 7}$	$10^{- 9}$
$2^{- 4}$	9	23	35	35	13	5	3	3	13	5	3	3
$2^{- 5}$	11	29	71	99	13	9	3	3	13	9	3	3
$2^{- 6}$	11	33	111	101	13	11	5	3	13	10	5	3
$2^{- 7}$	13	33	133	107	15	11	7	5	15	10	5	3
$2^{- 8}$	17	33	139	103	15	11	7	5	15	10	5	3
$2^{- 9}$	17	39	147	191	17	11	11	7	17	11	5	5

Table 4. CPU time taken by MINRES solver with the block diagonal preconditioners

Q_{1}, Q_{2}, Q_{3}

with Schur complement approximated by

Q_{S_{1}}, Q_{S_{2}}, Q_{S_{3}}

for different values of h and

δ

, tolerance =

10^{- 6}

.

Table 4. CPU time taken by MINRES solver with the block diagonal preconditioners

Q_{1}, Q_{2}, Q_{3}

with Schur complement approximated by

Q_{S_{1}}, Q_{S_{2}}, Q_{S_{3}}

for different values of h and

δ

, tolerance =

10^{- 6}

.

	Approximation $Q_{S_{1}}$				Approximation $Q_{S_{2}}$				Approximation $Q_{S_{3}}$
h	$δ$				$δ$				$δ$
	$10^{- 3}$	$10^{- 5}$	$10^{- 7}$	$10^{- 9}$	$10^{- 3}$	$10^{- 5}$	$10^{- 7}$	$10^{- 9}$	$10^{- 3}$	$10^{- 5}$	$10^{- 7}$	$10^{- 9}$
$2^{- 4}$	0.14	0.266	0.297	0.166	0.188	0.109	0.078	0.078	0.081	0.063	0.078	0.047
$2^{- 5}$	0.283	0.581	1.317	1.457	0.312	0.25	0.141	0.156	0.297	0.325	0.11	0.109
$2^{- 6}$	0.594	1.297	4.168	3.579	0.625	0.530	0.500	0.556	0.510	0.381	0.272	0.532
$2^{- 7}$	2.169	5.171	17.27	15.59	2.54	2.00	1.48	1.64	2.42	2.05	1.45	1.531
$2^{- 8}$	10.58	21.79	78.92	60.50	10.59	8.43	5.97	5.54	8.64	7.47	4.95	4.02
$2^{- 9}$	47.4	100.2	220	347.8	46.7	35.5	33.3	25.1	46.8	35.2	30.1	21.2

Table 5. Numerical results using preconditioner with approximation

Q_{S_{3}}

with

h = 2^{- 6}

.

Table 5. Numerical results using preconditioner with approximation

Q_{S_{3}}

with

h = 2^{- 6}

.

$δ$	$‖ u_{h} ‖_{2}$	$‖ u_{h} {- u ‖}_{2}$	$‖ y_{h} {- y ‖}_{2}$	$‖ y - y_{d} ‖_{2}$	$J (u_{h}, y_{h})$
$10^{- 1}$	1.58 × $10^{1}$	7.63 × $10^{- 4}$	1.56 × $10^{- 3}$	3.13 × $10^{1}$	4.07 × $10^{2}$
$10^{- 2}$	1.29 × $10^{2}$	4.75 × $10^{- 4}$	1.28 × $10^{- 3}$	2.55 × $10^{1}$	4.07 × $10^{2}$
$10^{- 3}$	4.56 × $10^{2}$	3.53 × $10^{- 4}$	4.50 × $10^{- 4}$	8.98 × $10^{0}$	1.44 × $10^{2}$
$10^{- 4}$	6.08 × $10^{2}$	7.43 × $10^{- 4}$	6.03 × $10^{- 5}$	1.20 × $10^{0}$	1.92 × $10^{1}$
$10^{- 5}$	6.29 × $10^{2}$	7.98 × $10^{- 4}$	6.30 × $10^{- 6}$	1.24 × $10^{- 1}$	1.98 × $10^{- 1}$
$10^{- 6}$	6.32 × $10^{2}$	8.03 × $10^{- 4}$	6.29 × $10^{- 7}$	1.25 × $10^{- 2}$	2.0 × $10^{- 2}$
$10^{- 7}$	6.32 × $10^{2}$	8.03 × $10^{- 4}$	6.26× $10^{- 8}$	1.24 × $10^{- 3}$	2.0 × $10^{- 3}$
$10^{- 8}$	6.32 × $10^{2}$	8.03 × $10^{- 4}$	6.32× $10^{- 9}$	1.24 × $10^{- 4}$	2.0 × $10^{- 4}$
$10^{- 9}$	6.32 × $10^{2}$	8.03 × $10^{- 4}$	6.32× $10^{- 10}$	1.24 × $10^{- 5}$	2.0 × $10^{- 5}$

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Muzhinji, K.; Shateyi, S. A Robust Approximation of the Schur Complement Preconditioner for an Efficient Numerical Solution of the Elliptic Optimal Control Problems. Computation 2020, 8, 68. https://doi.org/10.3390/computation8030068

AMA Style

Muzhinji K, Shateyi S. A Robust Approximation of the Schur Complement Preconditioner for an Efficient Numerical Solution of the Elliptic Optimal Control Problems. Computation. 2020; 8(3):68. https://doi.org/10.3390/computation8030068

Chicago/Turabian Style

Muzhinji, Kizito, and Stanford Shateyi. 2020. "A Robust Approximation of the Schur Complement Preconditioner for an Efficient Numerical Solution of the Elliptic Optimal Control Problems" Computation 8, no. 3: 68. https://doi.org/10.3390/computation8030068

APA Style

Muzhinji, K., & Shateyi, S. (2020). A Robust Approximation of the Schur Complement Preconditioner for an Efficient Numerical Solution of the Elliptic Optimal Control Problems. Computation, 8(3), 68. https://doi.org/10.3390/computation8030068

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Robust Approximation of the Schur Complement Preconditioner for an Efficient Numerical Solution of the Elliptic Optimal Control Problems

Abstract

1. Introduction

2. Analysis of the Block Diagonal Preconditioner

3. Proposed Schur Complement Preconditioner Approximation

4. Numerical Results

5. Conclusions

Author Contributions

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI