Abstract
The block conjugate orthogonal conjugate gradient method (BCOCG) is a standard method for solving complex symmetric linear systems with multiple right-hand sides. However, breakdown occurs whenever the right-hand sides are rank deficient. In this paper, based on the orthogonality conditions, we present a breakdown-free BCOCG algorithm with new parameter matrices to handle rank deficiency. To improve the spectral properties of the coefficient matrix A, a preconditioned version of the breakdown-free BCOCG is presented in detail. We also give the corresponding algorithms for the block conjugate A-orthogonal conjugate residual method. Numerical results illustrate that, when breakdown occurs, the breakdown-free algorithms yield faster convergence than their non-breakdown-free counterparts.
Keywords:
COCG; COCR; breakdown-free; complex symmetric matrix; rank deficiency; multiple right-hand sides
MSC:
65F10; 65F50
1. Introduction
Consider the following complex symmetric linear system with multiple right-hand sides:
$$AX = B, \qquad (1)$$
with $A \in \mathbb{C}^{n \times n}$ non-Hermitian but symmetric (i.e., $A^T = A$, $A^H \neq A$), $B \in \mathbb{C}^{n \times p}$, and $X \in \mathbb{C}^{n \times p}$. Such systems arise in many practical and important applications, for example, electromagnetic scattering, quantum chemistry, the complex Helmholtz equation, and so on [,].
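To make the setting concrete, the following minimal NumPy sketch (illustrative values, not from the paper) builds a small complex symmetric matrix, checks that it is symmetric but not Hermitian, and solves a small instance of $AX=B$ directly:

```python
import numpy as np

# A small complex *symmetric* (not Hermitian) matrix: A^T = A holds,
# but A^H = conj(A)^T differs from A because the entries are complex.
n, p = 4, 2
rng = np.random.default_rng(0)
C = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
A = C + C.T                              # symmetrize: A^T = A
assert np.allclose(A, A.T)               # complex symmetric
assert not np.allclose(A, A.conj().T)    # but not Hermitian

B = rng.standard_normal((n, p)) + 1j * rng.standard_normal((n, p))
X = np.linalg.solve(A, B)                # direct reference solve of A X = B
assert np.allclose(A @ X, B)
```

For large sparse or dense systems of this type, a direct solve is too costly, which is what motivates the iterative methods below.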
Owing to their simple computations and modest storage requirements, block Krylov subspace methods are often designed to solve system (1) efficiently [,]. Recently, Tadano et al. presented the block conjugate orthogonal conjugate gradient (BCOCG) method [], which exploits the symmetry of A naturally. BCOCG can also be viewed as a natural generalization of the conjugate orthogonal conjugate gradient (COCG) method [,,] to system (1). Besides COCG-type methods, the COCR method described in [,,] can also exploit the symmetry of A when p = 1 in (1). In [], Gu et al. introduced a block version of the COCR method (BCOCR) by employing the residual orthonormalization technique.
However, rank deficiency is a common problem that can cause block Krylov subspace methods to break down. The main reason is that the block search direction vectors may become linearly dependent on the existing basis as the iteration number increases [,]. Consequently, redundant information degrades the accuracy of the solution and the numerical stability. BCOCG and BCOCR may also encounter this problem, especially when the block size p in (1) is large, and we show this phenomenon in the section of numerical examples. Hence, it is valuable to resolve the rank deficiency problem and thereby enhance the numerical stability of BCOCG and BCOCR. Motivated by [], in this paper we propose a breakdown-free block COCG algorithm (BFBCOCG) and a breakdown-free block COCR algorithm (BFBCOCR) that efficiently resolve the rank deficiency problem of BCOCG and BCOCR, respectively.
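The source of the breakdown can be seen in a few lines of NumPy (illustrative values, not from the paper): when the right-hand sides are rank deficient, so is the initial block residual, and the small Gram matrices that a block method must invert become singular.

```python
import numpy as np

# Illustration of the breakdown source: a rank-deficient right-hand
# side makes the p x p Gram matrix in the COCG bilinear form singular.
rng = np.random.default_rng(1)
n, p = 8, 3
B = rng.standard_normal((n, p)) + 1j * rng.standard_normal((n, p))
B[:, 2] = B[:, 0] + B[:, 1]      # force B to be rank deficient
R = B.copy()                      # initial residual for X0 = 0
G = R.T @ R                       # Gram matrix, bilinear form x^T y
assert np.linalg.matrix_rank(B) == 2
assert np.linalg.matrix_rank(G) < p   # singular: the block update breaks down
```

The breakdown-free variants avoid exactly this singular small system by removing the dependent directions.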
The convergence rate of a Krylov subspace method depends on the spectral properties of the coefficient matrix, for example, the eigenvalue or singular value distribution, the field of values, the conditioning of the eigensystem, and so on. Fortunately, preconditioning can improve these spectral properties []. In this paper, we present the preconditioned versions of BFBCOCG and BFBCOCR in detail.
The rest of this paper is organized as follows. In Section 2, based on the orthogonality conditions, we propose the BFBCOCG and BFBCOCR algorithms and their preconditioned variants with their new parameter matrices. Some numerical examples are listed in Section 3 to show the efficiency of our new algorithms. Finally, some conclusions are given in Section 4.
In this paper, $\mathcal{K}_m(A, r_0)$ denotes the $m$-dimensional Krylov subspace $\mathrm{span}\{r_0, Ar_0, \ldots, A^{m-1}r_0\}$, and $\bar{A}$ denotes the conjugate of $A$.
2. The Breakdown-Free Variants of BCOCG and BCOCR
In this section, we present our main algorithms, i.e., the breakdown-free block COCG algorithm (BFBCOCG) and the breakdown-free block COCR algorithm (BFBCOCR) in detail. We first introduce the block COCG (BCOCG) and block COCR (BCOCR) methods briefly and then give the derivation of BFBCOCG and BFBCOCR with their orthogonality properties, respectively. In the end, the preconditioned variants of BFBCOCG and BFBCOCR are also proposed, denoted by BFPBCOCG and BFPBCOCR, respectively. We use an underscore “_” to distinguish the breakdown-free and the non-breakdown-free algorithms.
2.1. BCOCG and BCOCR
Let $X_0$ be an initial approximation, and let $X_k$ be the $k$th approximate solution of system (1). Hence, the recurrence relations of BCOCG and BCOCR are as follows []:
$$X_{k+1} = X_k + P_k \alpha_k, \quad R_{k+1} = R_k - A P_k \alpha_k, \quad P_{k+1} = R_{k+1} + P_k \beta_k, \qquad (2)$$
where $R_k = B - A X_k$ is the block residual, $P_k$ is the block search direction, and $\alpha_k, \beta_k \in \mathbb{C}^{p \times p}$ are parameter matrices.
The difference between BCOCG and BCOCR lies in the calculation formulas of the parameter matrices $\alpha_k$ and $\beta_k$ in (2), which are derived by applying the following orthogonality conditions with respect to the bilinear form $\langle x, y \rangle = x^T y$:
$$W_j^T R_{k+1} = O, \quad W_j^T A P_{k+1} = O, \quad j = 0, 1, \ldots, k, \qquad (3)$$
where $W_k$ is a test block. Different choices of $W_k$ lead to different algorithms:
- $W_k = R_k$ results in BCOCG.
- $W_k = A R_k$ results in BCOCR.
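The recurrences above can be sketched generically in NumPy. This is a hedged sketch of a block COCG-type iteration using the bilinear form $x^T y$; the function name and the Galerkin-style formulas for $\alpha_k$ and $\beta_k$ are illustrative, not the paper's exact pseudocode.

```python
import numpy as np

def bcocg(A, B, tol=1e-10, maxit=200):
    """Sketch of a block COCG-type iteration for complex symmetric A.
    All inner products use the bilinear form x^T y (no conjugation)."""
    X = np.zeros_like(B)
    R = B - A @ X                        # block residual
    P = R.copy()                         # block search direction
    for _ in range(maxit):
        AP = A @ P
        # alpha solves the small p x p system (P^T A P) alpha = P^T R
        alpha = np.linalg.solve(P.T @ AP, P.T @ R)
        X = X + P @ alpha
        R = R - AP @ alpha
        if np.linalg.norm(R) <= tol * np.linalg.norm(B):
            break
        # beta makes the next direction A-orthogonal to the current one
        beta = -np.linalg.solve(P.T @ AP, AP.T @ R)
        P = R + P @ beta
    return X
```

With a full-rank right-hand side this behaves like standard block CG adapted to the transpose bilinear form; when B is rank deficient, the small $p \times p$ systems above become singular, which is exactly the breakdown this paper addresses.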
2.2. BFBCOCG and BFBCOCR
If the block size p is large, then the block search direction vectors of BCOCG will inevitably become linearly dependent as the iteration number increases; hence rank deficiency occurs. In the following, to overcome this problem, we apply the breakdown-free strategy to BCOCG and propose the breakdown-free block COCG algorithm (BFBCOCG). The rationale of BFBCOCG is to extract an orthonormal basis from the current search space by an orthonormalization operation. Thus, compared with (2), the new recurrence relation becomes
Therefore, again using the orthogonality condition (3), we can obtain Lemma 1. Here, we denote .
Lemma 1.
For all , , , and .
Proof.
Similarly, the following Theorem 1 is obtained to update the parameter matrices $\alpha_k$ and $\beta_k$.
Theorem 1.
Proof.
The following Algorithm 1 is the breakdown-free block COCG.
| Algorithm 1 Breakdown-free block COCG (BFBCOCG) |
|
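Since Algorithm 1's listing is not reproduced here, the following NumPy sketch illustrates the breakdown-free idea under stated assumptions: the search directions are re-orthonormalized each step with a rank-revealing SVD that drops (near-)dependent columns, so the effective block size can shrink when the right-hand sides are rank deficient. Function names and tolerances are illustrative, not the paper's exact pseudocode.

```python
import numpy as np

def orth_trunc(V, drop=1e-12):
    """Orthonormal basis for range(V); columns whose singular value is
    negligible are dropped (the rank-revealing, breakdown-free step)."""
    U, s, _ = np.linalg.svd(V, full_matrices=False)
    return U[:, s > drop * max(s[0], 1.0)]

def bfbcocg(A, B, tol=1e-10, maxit=200):
    """Sketch of a breakdown-free block COCG-type iteration."""
    X = np.zeros_like(B)
    R = B - A @ X
    P = orth_trunc(R)                    # may have fewer than p columns
    for _ in range(maxit):
        AP = A @ P
        alpha = np.linalg.solve(P.T @ AP, P.T @ R)
        X = X + P @ alpha
        R = R - AP @ alpha
        if np.linalg.norm(R) <= tol * np.linalg.norm(B):
            break
        beta = -np.linalg.solve(P.T @ AP, AP.T @ R)
        P = orth_trunc(R + P @ beta)     # re-orthonormalize, drop dead directions
    return X
```

Because the small systems are formed only with the retained (independent) directions, the iteration proceeds even when B is rank deficient, whereas the plain block update would attempt to invert a singular Gram matrix.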
Similar to BFBCOCG, we can also easily obtain BFBCOCR by using the COCR choice in the orthogonality condition (3). The following Algorithm 2 is the breakdown-free block COCR.
| Algorithm 2 Breakdown-free block COCR (BFBCOCR) |
|
2.3. BFPBCOCG and BFPBCOCR
As is well known, if the coefficient matrix has poor spectral properties, then Krylov subspace methods may not be robust, while a preconditioning strategy can improve the situation []. The trick is to precondition (1) with a symmetric positive definite matrix M that approximates the inverse of A; we then obtain the following equivalent system:
Let $M = LL^T$ be the Cholesky decomposition of M. Then system (9) is equivalent to
We add a tilde “˜” to the variables derived from the new system. Then, applying the BFBCOCG method and its recurrence relations (4) to (10), we have the orthogonality conditions
It is easy to see , . The approximate solution is from . Set , then . The new recurrence relation becomes
The orthogonality conditions (11) become
Using the recurrence relation (12) and the orthogonality conditions (13), we can obtain the following Lemma 2 and Theorem 2 to update the parameter matrices. Here, we omit the proofs because they are similar to those of Lemma 1 and Theorem 1. We denote .
Lemma 2.
For all , , , and .
Remark 1.
Under the preconditioned strategy, the relations of the block residuals change from orthogonality for BFBCOCG to M-orthogonality for BFPBCOCG. Here, two vectors x and y are said to be M-orthogonal if $\langle x, y \rangle_M = 0$, i.e., $x^T M y = 0$.
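As a quick sanity check of this notion (illustrative values, not from the paper), one can M-orthogonalize one real vector against another and verify that the bilinear product vanishes:

```python
import numpy as np

# Build a symmetric positive definite M = L L^T (illustrative values).
rng = np.random.default_rng(2)
n = 5
L = np.tril(rng.standard_normal((n, n))) + n * np.eye(n)
M = L @ L.T

x = rng.standard_normal(n)
z = rng.standard_normal(n)
# Remove from z its M-component along x, leaving y M-orthogonal to x.
y = z - (x @ M @ z) / (x @ M @ x) * x
assert abs(x @ M @ y) < 1e-8             # x^T M y = 0 up to roundoff
```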
Theorem 2.
The following Algorithm 3 is the breakdown-free preconditioned block COCG algorithm.
| Algorithm 3 Breakdown-free preconditioned block COCG (BFPBCOCG) |
|
Change the orthogonality conditions (11) to the following conditions:
The breakdown-free preconditioned block COCR (BFPBCOCR) can be deduced with a derivation similar to that of BFPBCOCG. Algorithm 4 shows the pseudocode of BFPBCOCR.
| Algorithm 4 Breakdown-free preconditioned block COCR (BFPBCOCR) |
|
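Since the listings of Algorithms 3 and 4 are not reproduced here, the following NumPy sketch combines the two ingredients of this section under stated assumptions: a symmetric preconditioner applied to the block residual, plus the SVD-based re-orthonormalization of the search directions. The function `bfpbcocg`, the `apply_M` interface, and the Jacobi preconditioner in the usage note are illustrative stand-ins, not the paper's exact pseudocode (the experiments in Section 3 use an IC(3) preconditioner instead).

```python
import numpy as np

def bfpbcocg(A, B, apply_M, tol=1e-10, maxit=200):
    """Sketch of a breakdown-free preconditioned block COCG-type
    iteration.  apply_M applies a symmetric preconditioner M ~ A^{-1}
    to a block of vectors."""
    def orth(V, drop=1e-12):
        # Rank-revealing orthonormalization: drop near-dependent columns.
        U, s, _ = np.linalg.svd(V, full_matrices=False)
        return U[:, s > drop * max(s[0], 1.0)]

    X = np.zeros_like(B)
    R = B - A @ X
    P = orth(apply_M(R))                 # preconditioned search directions
    for _ in range(maxit):
        AP = A @ P
        alpha = np.linalg.solve(P.T @ AP, P.T @ R)
        X = X + P @ alpha
        R = R - AP @ alpha
        if np.linalg.norm(R) <= tol * np.linalg.norm(B):
            break
        Z = apply_M(R)                   # preconditioned residual
        beta = -np.linalg.solve(P.T @ AP, AP.T @ Z)
        P = orth(Z + P @ beta)
    return X
```

For instance, a simple diagonal (Jacobi) preconditioner can be passed as `apply_M = lambda V: V / np.diag(A)[:, None]`.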
At the end of this section, we give the complexity of the six algorithms: the block COCG algorithm (BCOCG), the block COCR algorithm (BCOCR), the breakdown-free block COCG algorithm (BFBCOCG), the breakdown-free block COCR algorithm (BFBCOCR), the breakdown-free preconditioned block COCG algorithm (BFPBCOCG), and the breakdown-free preconditioned block COCR algorithm (BFPBCOCR). The pseudocodes of BCOCG and BCOCR are from []. Table 1 shows the main costs per cycle of the six algorithms. Here, a “block vector” is a matrix of size $n \times p$; “bdp” denotes the number of dot products of two block vectors; “bmv” the number of products of a matrix with a block vector; “bsaxpy” the number of sums of two block vectors, where one of the two is a block vector multiplied by a matrix; “LE” the number of linear systems solved with a coefficient matrix; and “bSC” the storage capacity in block vectors.
Table 1.
Main costs per cycle for six algorithms.
From Table 1, we can see that the last four algorithms need one more dot product of two block vectors per cycle than the original two algorithms, BCOCG and BCOCR. As for the number of matrix–block-vector products, BFBCOCR and BFPBCOCR require twice as many as BFBCOCG and BFPBCOCG, respectively. This may make BFBCOCR and BFPBCOCR more time-consuming, especially for problems with dense matrices. We observe this phenomenon in Example 1.
3. Numerical Examples
In this section, examples are given to demonstrate the effectiveness of our new algorithms. All examples are run in MATLAB 8.4 (R2014b) on a laptop with an Intel Core i5-6200U 2.3 GHz CPU and 8 GB memory under the Windows 10 operating system.
We evaluate the algorithms by the iteration number (), the CPU time (), and the Frobenius norm of the true relative residual (). All algorithms are started with and stopped with . The symbol † means no convergence within 1000 iterations. The bold values in the following tables indicate the shortest CPU time.
Example 1.
In this example, six algorithms without preconditioning are compared: BCOCG, BCOCR, BFBCOCG, BFBCOCR, the block COCG method with residual orthonormalization (BCOCG_rq) [], and the block COCR method with residual orthonormalization (BCOCR_rq) []. Two types of matrices are tested. The first type contains three matrices, , , and , which are all dense, arise from monostatic radar cross-section calculations, and were obtained from the GitHub repository (https://github.com/Hsien-Ming-Ku/Test_matrices/tree/master/Example2). Their dimensions n are 1800, 2016, and 2430, respectively. The second type contains , , and , which all come from the discretization of the Helmholtz equation []. Their dimensions n are 1681, 6561, and 25921, respectively. The right-hand sides are chosen as , so that the block size , and B is rank deficient. Here, . Table 2 and Table 3 give the results.
Table 2.
The numerical results for the first type from Example 1.
Table 3.
The numerical results for the second type from Example 1.
From Table 2 and Table 3, we can see that for these rank-deficient problems, the breakdown-free algorithms outperform the non-breakdown-free algorithms in both CPU time and iteration number. For the first type of problem, although BFBCOCR takes fewer iterations than BFBCOCG, its CPU time is nearly double, because the matrices are all dense and BFBCOCR performs one more matrix–block-vector product than BFBCOCG per iteration. For the first two matrices of the second type, the difference between BFBCOCG and BFBCOCR is not obvious, mainly because the matrices are sparse. However, for the third matrix , only BFBCOCR can solve the problem. This indicates that, for matrices from the discretization of the Helmholtz equation, BFBCOCR is the most robust of the six algorithms.
Example 2.
To make the comparison fair, all algorithms in this example are preconditioned; they are the preconditioned versions of the same six algorithms as in Example 1, denoted by PBCOCG, PBCOCR, BFPBCOCG, BFPBCOCR, PBCOCG_rq, and PBCOCR_rq, respectively. The preconditioning strategy we use is IC(3) from []. IC(3) produces for a complex symmetric A, and if L is nonsingular, then we can use as a preconditioner. We test three matrices. The first is , from the NEP collection [] for electrical engineering, with dimension . The second and third matrices are and , respectively. The right-hand sides are chosen as , so that the block size , and B is rank deficient. Numerical results are shown in Table 4 and Figure 1.
Table 4.
The numerical results for Example 2.
Figure 1.
Relative residual F-norm for matrices and in Example 2.
From the results, we can see that the breakdown-free algorithms perform better than the non-breakdown-free algorithms. In particular, in Figure 1, the convergence curves of the four non-breakdown-free algorithms do not drop, while those of the breakdown-free ones quickly reach the required accuracy. For matrix , Table 4 and Figure 1 show that the convergence curves of BFPBCOCG and BFPBCOCR both trend downward, with BFPBCOCG behaving more smoothly. For matrix , the difference between BFPBCOCG and BFPBCOCR is small. For matrix , BFPBCOCG does not converge, but BFPBCOCR converges quickly. This illustrates that, for matrices from the discretization of the Helmholtz equation, BFPBCOCR is more robust than the other five preconditioned algorithms.
4. Conclusions
In this paper, we presented a breakdown-free block conjugate orthogonal conjugate gradient algorithm for complex symmetric linear systems with multiple right-hand sides. Based on the orthogonality conditions, we gave its two new parameter matrices. A preconditioned version was also proposed in detail. At the same time, we presented the breakdown-free version of the block conjugate A-orthogonal conjugate residual method, together with its preconditioned version. From the numerical examples, we observe that when the right-hand sides are rank deficient, our four new algorithms perform better than the other algorithms. Moreover, for Helmholtz equation problems, BFBCOCR and BFPBCOCR show more stable behavior than BFBCOCG and BFPBCOCG, while for dense matrix problems, BFBCOCG and BFPBCOCG converge faster than BFBCOCR and BFPBCOCR. However, a theoretical analysis of the advantages of BFBCOCG and BFBCOCR, and of BFPBCOCG and BFPBCOCR, is still lacking. All of these require further investigation.
Author Contributions
X.-M.G. guided the process of the whole paper and reviewed the paper; S.-L.Z. provided some innovative advice and reviewed the paper; H.-X.Z. deduced the theory, implemented the algorithms with the numerical examples, and wrote the paper.
Funding
This work was financed by the National Natural Science Foundation of China (11701225 and 11801463), the Fundamental Research Funds for the Central Universities (JBK1902028), and the Natural Science Foundation of Jiangsu Province (BK20170173).
Conflicts of Interest
The authors declare no conflict of interest.
References
- Gu, X.-M.; Carpentieri, B.; Huang, T.-Z.; Meng, J. Block variants of the COCG and COCR methods for solving complex symmetric linear systems with multiple right-hand sides. In Numerical Mathematics and Advanced Applications ENUMATH 2015; Karasözen, B., Manguoǧlu, M., Tezer-Sezgin, M., Göktepe, S., Uǧur, Ö., Eds.; Lecture Notes in Computational Science and Engineering 112; Springer International Publishing: Cham, Switzerland, 2016; pp. 305–313.
- Sogabe, T.; Zhang, S.-L. A COCR method for solving complex symmetric linear systems. J. Comput. Appl. Math. 2007, 199, 297–303.
- Gutknecht, M.H. Block Krylov space methods for linear systems with multiple right-hand sides: An introduction. In Modern Mathematical Models, Methods and Algorithms for Real World Systems; Siddiqi, A.H., Duff, I.S., Christensen, O., Eds.; Anamaya Publishers: New Delhi, India, 2006; pp. 420–447.
- Zhang, J.; Zhao, J. A novel class of block methods based on the block AAT-Lanczos biorthogonalization process for matrix equations. Int. J. Comput. Math. 2013, 90, 341–359.
- Tadano, H.; Sakurai, T. A block Krylov subspace method for the contour integral method and its application to molecular orbital computations. IPSJ Trans. Adv. Comput. Syst. 2009, 2, 10–18. (In Japanese)
- Van der Vorst, H.A.; Melissen, J.B.M. A Petrov-Galerkin type method for solving Ax = b, where A is symmetric complex. IEEE Trans. Mag. 1990, 26, 706–708.
- Gu, X.-M.; Huang, T.-Z.; Li, L.; Li, H.-B.; Sogabe, T.; Clemens, M. Quasi-minimal residual variants of the COCG and COCR methods for complex symmetric linear systems in electromagnetic simulations. IEEE Trans. Microw. Theory Tech. 2014, 62, 2859–2867.
- Gu, X.-M.; Clemens, M.; Huang, T.-Z.; Li, L. The SCBiCG class of algorithms for complex symmetric linear systems with applications in several electromagnetic model problems. Comput. Phys. Commun. 2015, 191, 52–64.
- Broyden, C.G. A breakdown of the block CG method. Optim. Methods Softw. 1996, 7, 41–55.
- Ji, H.; Li, Y. A breakdown-free block conjugate gradient method. BIT Numer. Math. 2017, 57, 379–403.
- Van der Vorst, H.A. Iterative Krylov Methods for Large Linear Systems; Cambridge University Press: Cambridge, UK, 2003; pp. 173–178.
- Bayliss, A.; Goldstein, C.I.; Turkel, E. An iterative method for the Helmholtz equation. J. Comput. Phys. 1983, 49, 443–457.
- Meijerink, J.A.; Van der Vorst, H.A. An iterative solution method for linear systems of which the coefficient matrix is a symmetric M-matrix. Math. Comp. 1977, 31, 148–162.
- Bai, Z.; Day, D.; Demmel, J.; Dongarra, J. A Test Matrix Collection for Non-Hermitian Eigenvalue Problems; Technical Report CS-97-355; University of Tennessee: Knoxville, TN, USA, 1997.
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).