1. Introduction
Consider the linear system of equations
$A x = b, \qquad (1)$
where $A \in \mathbb{R}^{m \times m}$ is a large, symmetric, nonsingular, and indefinite matrix and $b \in \mathbb{R}^{m}$ and $x \in \mathbb{R}^{m}$ are real vectors. Such systems arise in various areas of applied mathematics and engineering. When $A$ is too large to make its factorization feasible or attractive, an iterative solution method has to be employed. Among the best-known iterative methods for solving linear systems of the kind (1) are MINRES and SYMMLQ by Paige and Saunders (see [1,2]). However, neither of these methods allows for easy estimation of the error in the computed iterates, which can make it difficult to decide when to terminate the iterations.
For a symmetric, positive definite matrix $A$, the conjugate gradient method is typically used to solve (1). Various techniques are available to estimate the $A$-norm of the error in the iterates determined by the conjugate gradient method. These techniques exploit the relationship between the conjugate gradient method and Gauss-type quadrature rules applied to integrate the function $f(t) = 1/t$. The quadrature rules are determined with respect to an implicitly defined non-negative measure that depends on the matrix $A$, the right-hand side $b$, and the initial iterate $x_0$ (see, for example, Almutairi et al. [3], Golub and Meurant [4,5], Meurant and Tichý [6], and references therein).
Error estimation for iterates in the case where the matrix $A$ is nonsingular, symmetric, and indefinite has received little attention in the literature. We note that $A$-norm estimates of the error in the iterates are meaningful when $A$ is symmetric and positive definite [7], but not when $A$ is symmetric and indefinite. In this paper, we estimate the Euclidean norm of the error in each iterate produced by the iterative method described below.
Calvetti et al. [8] introduced a Krylov subspace method for the iterative solution of (1). They proposed estimating the Euclidean norm of the error in the iterates generated by their method using pairs of associated Gauss and anti-Gauss quadrature rules. However, the quality of the error norm estimates determined in this manner is mixed: computed examples show that some estimates significantly overestimate the actual error norm of the iterates.
This paper presents new methods for estimating the Euclidean norm of the error in the iterates computed by the iterative method described in Section 2 and in [8]. In particular, the anti-Gauss rule used in [8] is replaced by other quadrature rules.
For notational simplicity, we take the initial approximate solution to be $x_0 = 0$. Consequently, the $k$th approximate solution $x_k$ determined by the iterative method discussed in this paper lives in the Krylov subspace
$\mathbb{K}_k(A, b) := \operatorname{span}\{b, Ab, \ldots, A^{k-1} b\}, \qquad (2)$
i.e.,
$x_k = q_{k-1}(A)\, b \qquad (3)$
for a suitable iteration polynomial $q_{k-1}$ in $\Pi_{k-1}$, where $\Pi_{k-1}$ is the set of all polynomials of degree at most $k-1$. We require that the iteration polynomials satisfy condition (4). Then the representation (5) of $x_k$ fulfills condition (4) for a suitable polynomial in $\Pi_{k-1}$.
We introduce the residual error related to $x_k$ as
$r_k := b - A x_k, \qquad (6)$
and let $x^{*} = A^{-1} b$ denote the solution of (1). Then the error $\varepsilon_k := x^{*} - x_k$ in $x_k$ can be expressed as
$\varepsilon_k = A^{-1} r_k. \qquad (7)$
Equations (6) and (7) yield
$\|\varepsilon_k\|^{2} = b^{t} A^{-2} b - 2\, x_k^{t} A^{-1} b + x_k^{t} x_k, \qquad (8)$
where the superscript $t$ denotes transposition. We may calculate the Euclidean norm of the vector $\varepsilon_k$ by using the terms on the right-hand side of Equation (8). It is straightforward to compute the term $x_k^{t} x_k$, and, using (5), the expression $x_k^{t} A^{-1} b$ can be calculated from quantities generated by the iterative method. Hence, this expression can be evaluated without using $A^{-1}$. The iterative method has to be chosen so that recursion formulas for the iteration polynomials $q_{k-1}$, $k = 1, 2, \ldots$, can be evaluated easily. Finally, we have to estimate the term $b^{t} A^{-2} b$, which, by setting $f(t) := t^{-2}$, can be written as the matrix functional
$b^{t} f(A)\, b. \qquad (9)$
We use Gauss-type quadrature rules determined by the recurrence coefficients of the iterative method to approximate (9).
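As a sanity check of the decomposition (8), the following minimal Python/NumPy snippet compares the squared error norm with the three terms on the right-hand side of (8) for a small random symmetric indefinite matrix; the test matrix, the stand-in iterate, and all variable names are illustrative only.

```python
import numpy as np

rng = np.random.default_rng(0)
m = 50
Q, _ = np.linalg.qr(rng.standard_normal((m, m)))
# Symmetric indefinite test matrix: eigenvalues of both signs, bounded away from 0.
evals = np.concatenate((-rng.uniform(1.0, 2.0, m // 2), rng.uniform(1.0, 2.0, m - m // 2)))
A = Q @ np.diag(evals) @ Q.T
b = rng.standard_normal(m)

x_star = np.linalg.solve(A, b)        # exact solution of (1)
x_k = 0.1 * rng.standard_normal(m)    # stand-in for an approximate solution

# Left-hand side of (8): squared Euclidean error norm.
lhs = np.linalg.norm(x_star - x_k) ** 2
# Right-hand side of (8): b^t A^{-2} b - 2 x_k^t A^{-1} b + x_k^t x_k.
rhs = (b @ np.linalg.solve(A, np.linalg.solve(A, b))
       - 2 * x_k @ np.linalg.solve(A, b)
       + x_k @ x_k)
print(abs(lhs - rhs))  # agrees to rounding error
```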
The structure of this paper is as follows.
Section 2 outlines the iterative Krylov subspace method employed for the solution of (
1). This method was discussed in [
8]. We review the method for the convenience of the reader. Our presentation differs from that in [
8]. The iterative method is designed to facilitate the evaluation of the last two terms on the right-hand side of (
8).
Section 3 explores various Gauss-type quadrature rules that are employed to estimate the first term on the right-hand side of (
8). The quadrature rules used in this study include three kinds of Gauss-type rules, namely averaged and optimally averaged rules by Laurie [
9] and Spalević [
10], respectively, as well as Gauss-Radau rules with a fixed quadrature node at the origin. Additionally, we describe how to update the quadrature rules cost-effectively as the number of iterations grows. This section improves on the quadrature rules considered in [
8].
Section 4 describes the use of these quadrature rules to estimate the Euclidean norm of the errors $\varepsilon_k$, $k = 1, 2, \ldots$.
Section 5 provides three computed examples and
Section 6 contains concluding remarks.
The iterates determined by an iterative method often converge significantly faster when a suitable preconditioner is applied (see, e.g., Saad [
11] for discussions and examples of preconditioners). We assume that the system (
1) is preconditioned when this is appropriate.
2. The Iterative Scheme
This section revisits the iterative method considered in [
8] for solving linear systems of Equation (
1) with a nonsingular, symmetric, and indefinite matrix
A. We begin by discussing some fundamental properties of the method. Subsequently, we describe updating formulas for the approximate solutions $x_k$. This method can be seen as a variation of the SYMMLQ scheme discussed in [
1,
12].
It is convenient to introduce the spectral factorization of the coefficient matrix of (1),
$A = U \Lambda U^{t}, \qquad \Lambda = \operatorname{diag}[\lambda_1, \lambda_2, \ldots, \lambda_m],$
where $\Lambda$ is a diagonal matrix with the eigenvalues $\lambda_i$ of $A$ as diagonal elements and the matrix $U \in \mathbb{R}^{m \times m}$ is orthogonal. The spectral factorization is used for the description of the iterative method but does not have to be computed for the solution of (1). Defining $\hat{b} := U^{t} b = [\hat{b}_1, \hat{b}_2, \ldots, \hat{b}_m]^{t}$, the functional (9) can be written as
$b^{t} f(A)\, b = \hat{b}^{t} f(\Lambda)\, \hat{b} = \sum_{i=1}^{m} f(\lambda_i)\, \hat{b}_i^{2} = \int f(\lambda)\, d\mu(\lambda), \qquad (10)$
where the measure $d\mu$ has jump discontinuities at the eigenvalues $\lambda_i$ of $A$.
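The identity (10) can be illustrated numerically: the functional $b^{t} f(A)\, b$ equals a Stieltjes integral with respect to a discrete measure with jumps $\hat{b}_i^{2}$ at the eigenvalues $\lambda_i$. A minimal sketch in Python with NumPy, using $f(t) = t^{-2}$ and illustrative names:

```python
import numpy as np

rng = np.random.default_rng(1)
m = 30
Q, _ = np.linalg.qr(rng.standard_normal((m, m)))
lam = np.concatenate((-rng.uniform(0.5, 3.0, m // 2), rng.uniform(0.5, 3.0, m - m // 2)))
A = Q @ np.diag(lam) @ Q.T
b = rng.standard_normal(m)

f = lambda t: 1.0 / t**2

# Left-hand side of (10): the matrix functional b^t f(A) b, here f(A) = A^{-2}.
lhs = b @ np.linalg.solve(A, np.linalg.solve(A, b))

# Right-hand side of (10): integral of f with respect to the discrete measure
# with jumps bhat_i^2 at the eigenvalues lambda_i, where bhat = U^t b.
evals, U = np.linalg.eigh(A)
bhat = U.T @ b
rhs = np.sum(f(evals) * bhat**2)
print(abs(lhs - rhs))  # agrees to rounding error
```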
Our iterative method is based on the Lanczos algorithm. Let $I_k$ denote the $k \times k$ identity matrix. By applying $k$ steps of the Lanczos process to $A$ with initial vector $b$, the following Lanczos decomposition is obtained:
$A V_k = V_k T_k + \beta_k v_{k+1} e_k^{t}, \qquad (11)$
where $V_k = [v_1, v_2, \ldots, v_k] \in \mathbb{R}^{m \times k}$ and $v_{k+1} \in \mathbb{R}^{m}$ are such that $V_k^{t} V_k = I_k$, $V_k^{t} v_{k+1} = 0$, and
$v_1 = b / \|b\|. \qquad (12)$
Additionally, $T_k \in \mathbb{R}^{k \times k}$ is a symmetric, tridiagonal matrix, $e_k$ denotes the $k$th column of the identity matrix, and $\|\cdot\|$ represents the Euclidean vector norm. The columns of the matrix $V_k$ span the Krylov subspace (2), i.e.,
$\operatorname{range}(V_k) = \mathbb{K}_k(A, b). \qquad (13)$
It is assumed that all subdiagonal entries of $T_k$ are nonvanishing; otherwise, the recursion formulas of the Lanczos process break down, and the solution of (1) can be expressed as a linear combination of the Lanczos vectors $v_j$ that are available at the time of breakdown. The recursion relation for the columns of $V_k$ is established by Equation (11) and, in conjunction with (12), shows that
$v_{j+1} = p_j(A)\, b, \qquad j = 0, 1, \ldots, k, \qquad (14)$
for certain polynomials $p_j$ of degree $j$.
Theorem 1. The polynomials determined by (14) are orthonormal with respect to the inner product
$\langle p, q \rangle := (p(A)\, b)^{t}\, (q(A)\, b) = \int p(\lambda)\, q(\lambda)\, d\mu(\lambda) \qquad (15)$
induced by the matrix $A$ and the vector $b$; cf. (10). Proof. We have
$\langle p_{i-1}, p_{j-1} \rangle = v_{i}^{t} v_{j} = \delta_{ij}, \qquad 1 \leq i, j \leq k+1,$
because the columns $v_j$, $j = 1, 2, \ldots, k+1$, of the matrix $[v_1, \ldots, v_{k+1}]$ are orthogonal and of unit norm (see (11)). □
We also use the following decomposition, related to (11):
$A V_k = V_{k+1} T_{k+1,k}, \qquad (16)$
where $T_{k+1,k} \in \mathbb{R}^{(k+1) \times k}$ is the leading submatrix of $T_{k+1}$ of order $(k+1) \times k$.
We use the QR factorization of the matrix $T_k$, that is,
$T_k = Q_k R_k, \qquad (17)$
where $Q_k \in \mathbb{R}^{k \times k}$ is orthogonal and the matrix $R_k \in \mathbb{R}^{k \times k}$ is upper triangular. Similarly, we introduce the factorization
$T_{k+1} = Q_{k+1} R_{k+1}, \qquad (18)$
where the $k \times k$ matrix $\bar{R}_k$ is the leading submatrix of $R_{k+1}$ and $\bar{Q}_k$ is the leading submatrix of $Q_{k+1}$.
Theorem 2. Combining the QR factorization (17) with the Lanczos decomposition (11) defines a new iterative process whose iteration polynomials comply with (4). Proof. Let
. By applying the QR factorization (
17) within the Lanczos decomposition (
11), we obtain
Multiplying (
19) by
from the right-hand side, letting
, and defining
, we obtain
The column vectors of the matrix expressed as
are orthonormal, and the matrix
is symmetric and tridiagonal. To expose the relation between the first column
of
and the first column
of
, we multiply (
19) by
from the right. This yields
which simplifies to
where
. For a suitable choice of the sign of
, we have
Since
is tridiagonal, the orthogonal matrix
in the QR factorization (
17) has upper Hessenberg form. As a result, only the last two entries of the vector expressed as
are non-zero. Hence, the decomposition (
20) differs from a Lanczos decomposition by potentially having non-zero entries in the last two columns of the matrix
.
Suppose that the matrix
consists of the first
columns of
. Then,
where
is defined by Equation (
18). Typically,
; additional details can be found in
Section 4. When the last column is removed from each term in (
20), the following decomposition results:
In (
23), the matrix
is the
leading submatrix of
. Furthermore,
, and
. As a result, according to (
21), we have that (
23) is a Lanczos decomposition with the starting vector
of
proportional to the vector
. Similarly to (
13), we have
To determine the iteration polynomials in (3) and the corresponding approximate solutions $x_k$ in (3), we impose the following requirement for certain coefficient vectors:
It follows from (
24) that any polynomial
determined by (
25) fulfills (
4). This completes the proof. □
Remark 1. We choose the coefficient vectors in (25) and, thereby, the iterates $x_k$, in a manner that guarantees that the residual error (6) for the approximate solution $x_k$ of (1) satisfies the Petrov-Galerkin condition
$V_k^{t} r_k = 0, \qquad (26)$
which, according to (12) and the factorization (22), simplifies to (27).
Remark 2. Replacing the matrix in (26) by a different one recovers the SYMMLQ method [1]. However, the iteration polynomial associated with the SYMMLQ method typically does not satisfy condition (4). Our method implements a QR factorization of the tridiagonal Lanczos matrix, akin to the SYMMLQ implementation by Fischer ([12], Section 6.5). In contrast, Paige and Saunders’ [1] implementation of the SYMMLQ method relies on an LQ factorization of this matrix.
Remark 3. Equation (26) shows that the iterative method is a Petrov-Galerkin method. In each step of the method, the dimension of the solution subspace (24) is increased by one, and the residual error is required to be orthogonal to the Krylov subspace $\mathbb{K}_k(A, b)$; cf. (13). This secures convergence of the iterates (25) to the solution of (1) as $k$ increases.
Using Theorems 1 and 2, along with Remark 1, we can simplify the right-hand side of (
7). First, using (
16) and (
18), we obtain
Subsequently, by substituting (
28) into (
27), we obtain
We can evaluate
by forward substitution using (
29). Section 4 presents iterative formulas to efficiently update the approximate solutions $x_k$. The remainder of this section discusses the evaluation of the right-hand side of (8). From (
24) and (
25), it can be deduced that
lives in
. Consequently, there is a vector
such that
Using the decomposition (
16), the
kth iterate generated by our iterative method can be written as
Furthermore, according to (
22) and (
25), we have
Multiplying (
31) by
from the left and using (
18) yields
By successively applying (
12), (
29), (
30), and (
32), we obtain
According to (
25), it follows that
. Combining this with (
33) shows that Equation (
8) can be represented as
The term
can easily be computed using (
29).
Section 3 describes several Gauss-type quadrature rules that are applied in Section 4 to compute estimates of the matrix functional (9).
3. Quadrature Rules
This section considers the approximation of integrals like
$\mathcal{I} f := \int f(\lambda)\, d\mu(\lambda) \qquad (34)$
by Gauss-type quadrature rules, where $f$ is a smooth function and $d\mu$ denotes a non-negative measure with an infinite number of support points such that all moments
$\mu_j := \int \lambda^{j}\, d\mu(\lambda)$
exist for $j = 0, 1, 2, \ldots$. In this section, we assume for notational simplicity that $\mu_0 = 1$. Let $\mathcal{Q}_n$ denote an $n$-node quadrature rule to approximate (34). Then
$\mathcal{I} f = \mathcal{Q}_n f + \mathcal{E}_n f,$
where $\mathcal{E}_n f$ is the remainder term. This term vanishes for all polynomials in $\Pi_d$ for some non-negative integer $d$. The value of $d$ is referred to as the degree of precision of the quadrature rule. It is well known that the maximum value of $d$ for an $n$-node quadrature rule is $2n - 1$. This value is achieved by the $n$-node Gauss rule (see, e.g., [13] for a proof). The latter rule can be written as
$\mathcal{G}_n f := \sum_{j=1}^{n} w_j\, f(\theta_j). \qquad (35)$
The nodes $\theta_j$, $j = 1, 2, \ldots, n$, are the eigenvalues of the symmetric tridiagonal matrix
$T_n = \begin{bmatrix} \alpha_1 & \beta_1 & & \\ \beta_1 & \alpha_2 & \ddots & \\ & \ddots & \ddots & \beta_{n-1} \\ & & \beta_{n-1} & \alpha_n \end{bmatrix}, \qquad (36)$
and the weights $w_j$ are the squares of the first components of the associated normalized eigenvectors.
The entries $\alpha_j$ and $\beta_j$ of $T_n$ are obtained from the recursion formula for the sequence of monic orthogonal polynomials $\{\pi_j\}_{j \geq 0}$ associated with the inner product (15):
$\pi_{j+1}(\lambda) = (\lambda - \alpha_{j+1})\, \pi_j(\lambda) - \beta_j^{2}\, \pi_{j-1}(\lambda), \qquad j = 0, 1, 2, \ldots, \qquad (37)$
where $\pi_0(\lambda) := 1$ and $\pi_{-1}(\lambda) := 0$. The values of $\alpha_{j+1}$ and $\beta_j$ in (37) can be determined from the following formulas (see, e.g., Gautschi [13] for details):
$\alpha_{j+1} = \frac{\langle \lambda\, \pi_j, \pi_j \rangle}{\langle \pi_j, \pi_j \rangle}, \qquad \beta_j^{2} = \frac{\langle \pi_j, \pi_j \rangle}{\langle \pi_{j-1}, \pi_{j-1} \rangle}.$
They can also be computed by the Lanczos process, which is presented in Algorithm 1.
Algorithm 1: The Lanczos algorithm.
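The following is a minimal Python sketch of the standard Lanczos process without reorthogonalization; it computes the recursion coefficients $\alpha_j$, $\beta_j$ and the orthonormal Lanczos vectors of (11). The function name, return format, and breakdown tolerance are illustrative and need not coincide with the exact formulation of Algorithm 1.

```python
import numpy as np

def lanczos(A, b, k):
    """Run k steps of the Lanczos process with starting vector b.

    Returns alpha (diagonal entries of T_k), beta (subdiagonal entries,
    including the trailing coupling entry), and the Lanczos vectors V.
    """
    m = b.shape[0]
    V = np.zeros((m, k + 1))
    alpha = np.zeros(k)
    beta = np.zeros(k)                  # beta[j] couples v_{j+1} and v_{j+2}
    V[:, 0] = b / np.linalg.norm(b)
    for j in range(k):
        w = A @ V[:, j]
        alpha[j] = V[:, j] @ w
        w -= alpha[j] * V[:, j]
        if j > 0:
            w -= beta[j - 1] * V[:, j - 1]
        beta[j] = np.linalg.norm(w)
        if beta[j] < 1e-14:             # breakdown: the Krylov subspace is invariant
            return alpha[: j + 1], beta[:j], V[:, : j + 1]
        V[:, j + 1] = w / beta[j]
    return alpha, beta, V
```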
It is straightforward to demonstrate that
$\mathcal{G}_n f = e_1^{t}\, f(T_n)\, e_1, \qquad (38)$
where $e_1 = [1, 0, \ldots, 0]^{t}$ denotes the first axis vector.
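The Gauss rule (35) and the representation (38) can be evaluated directly from the recursion coefficients: the nodes are the eigenvalues of $T_n$ and the weights are the squared first components of the normalized eigenvectors, scaled by the total mass of the measure when it is not normalized. A sketch using SciPy; the helper names are illustrative:

```python
import numpy as np
from scipy.linalg import eigh_tridiagonal

def gauss_rule(alpha, beta, mu0=1.0):
    """Nodes/weights of the n-node Gauss rule from the recursion coefficients.

    alpha: diagonal of T_n; beta: subdiagonal of T_n (length n-1);
    mu0: total mass of the measure (1 under the normalization of this section).
    """
    nodes, vecs = eigh_tridiagonal(alpha, beta)
    weights = mu0 * vecs[0, :] ** 2      # squared first eigenvector components
    return nodes, weights

def gauss_quad(alpha, beta, f, mu0=1.0):
    """Evaluate G_n f, cf. (35) and (38)."""
    nodes, weights = gauss_rule(alpha, beta, mu0)
    return np.sum(weights * f(nodes))
```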
We are interested in measures $d\mu$ with support in two real intervals $[c_1, d_1]$ and $[c_2, d_2]$, where $d_1 < 0 < c_2$. The following result sheds light on how the nodes of the Gauss rule (35) are allocated for such measures.
Theorem 3. Let $d\mu$ be a non-negative measure with support on the union of the bounded real intervals $[c_1, d_1]$ and $[c_2, d_2]$, where $d_1 < 0 < c_2$. Then the Gauss rule (35) has at most one node in the open interval $(d_1, c_2)$. Proof. The result follows from [
14] (Theorem 3.41.1). □
The following subsection reviews some Gauss-type quadrature rules that are used to estimate the error in approximate solutions
of (
1) that are generated by the iterative method proposed in
Section 2.
Selected Gauss-Type Quadrature Rules
In [9], Laurie introduced anti-Gauss quadrature rules. A recent analysis of anti-Gauss rules was carried out by Díaz de Alba et al. [15]; related investigations can be found in [16,17]. The $(n+1)$-point anti-Gauss rule $\mathcal{A}_{n+1}$, which is associated with the Gauss rule (35), is defined by the property
$(\mathcal{I} - \mathcal{A}_{n+1})\, f = -(\mathcal{I} - \mathcal{G}_n)\, f \qquad \text{for all } f \in \Pi_{2n+1}. \qquad (39)$
The following tridiagonal matrix is used to determine the rule $\mathcal{A}_{n+1}$:
$\tilde{T}_{n+1} = \begin{bmatrix} T_n & \sqrt{2}\,\beta_n e_n \\ \sqrt{2}\,\beta_n e_n^{t} & \alpha_{n+1} \end{bmatrix},$
i.e., $\tilde{T}_{n+1}$ agrees with $T_{n+1}$ except that its last off-diagonal entries are multiplied by $\sqrt{2}$. Similarly to (38), we have
$\mathcal{A}_{n+1} f = e_1^{t}\, f(\tilde{T}_{n+1})\, e_1.$
Moreover, the nodes of $\mathcal{A}_{n+1}$ are the eigenvalues of $\tilde{T}_{n+1}$, and the weights are the squares of the first components of the associated normalized eigenvectors.
Further, Laurie [9] introduced the averaged Gauss quadrature rule associated with $\mathcal{G}_n$. It has $2n+1$ nodes and is given by
$\mathcal{L}_{2n+1} := \tfrac{1}{2}\,(\mathcal{G}_n + \mathcal{A}_{n+1}). \qquad (40)$
The property (39) suggests that the quadrature error for $\mathcal{L}_{2n+1}$ is smaller than the error for $\mathcal{G}_n$. Indeed, it follows from (39) that the degree of precision of $\mathcal{L}_{2n+1}$ is no less than $2n+1$. This implies that the difference
$\mathcal{L}_{2n+1} f - \mathcal{G}_n f \qquad (41)$
can be used to estimate the quadrature error
$(\mathcal{I} - \mathcal{G}_n)\, f. \qquad (42)$
Computed results in [18] illustrate that, for numerous integrands and various values of $n$, the difference (41) provides fairly accurate approximations of the quadrature error (42). The accuracy of these estimates depends both on the integrand and on the value of $n$.
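A sketch of Laurie's construction as described above: the anti-Gauss rule is the Gauss rule of the matrix $\tilde{T}_{n+1}$, whose last off-diagonal entries are multiplied by $\sqrt{2}$, and the averaged rule (40) is the mean of the Gauss and anti-Gauss values. The code reuses the hypothetical helper gauss_quad from the previous sketch and assumes the coefficients alpha and beta are available, e.g., from the Lanczos sketch:

```python
import numpy as np

def anti_gauss_quad(alpha, beta, f, n, mu0=1.0):
    """(n+1)-point anti-Gauss rule A_{n+1} f; needs alpha[:n+1] and beta[:n]."""
    a = np.array(alpha[: n + 1], dtype=float)
    b = np.array(beta[:n], dtype=float)
    b[-1] *= np.sqrt(2.0)                 # last off-diagonal entry scaled by sqrt(2)
    return gauss_quad(a, b, f, mu0)

def averaged_gauss_quad(alpha, beta, f, n, mu0=1.0):
    """Laurie's averaged rule (40): the mean of G_n f and A_{n+1} f."""
    g = gauss_quad(alpha[:n], beta[: n - 1], f, mu0)
    ag = anti_gauss_quad(alpha, beta, f, n, mu0)
    return 0.5 * (g + ag)
```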
In [10], Spalević presented optimal averaged Gauss quadrature rules, which usually have a higher degree of precision than averaged Gauss rules with the same number of nodes. The symmetric tridiagonal matrix for the optimal averaged Gauss quadrature rule $\mathcal{S}_{2n+1}$ with $2n+1$ nodes is defined as follows. Introduce the reverse matrix of $T_n$, which is given by
$T_n' := J_n T_n J_n, \qquad J_n = [e_n, e_{n-1}, \ldots, e_1],$
as well as the concatenated matrix
$\widehat{T}_{2n+1} := \begin{bmatrix} T_n & \beta_n e_n & 0 \\ \beta_n e_n^{t} & \alpha_{n+1} & \beta_{n+1} e_1^{t} \\ 0 & \beta_{n+1} e_1 & T_n' \end{bmatrix}.$
The nodes of the rule $\mathcal{S}_{2n+1}$ are the eigenvalues, and the weights are the squared first components of normalized eigenvectors, of the matrix $\widehat{T}_{2n+1}$. It is worth noting that $n$ of the nodes of $\mathcal{S}_{2n+1}$ agree with the nodes of $\mathcal{G}_n$. Similarly to Equation (38), we have
$\mathcal{S}_{2n+1} f = e_1^{t}\, f(\widehat{T}_{2n+1})\, e_1.$
The degree of precision of this quadrature rule is at least $2n+2$. Analyses of the degree of precision of the rules $\mathcal{S}_{2n+1}$, and of the location of their largest and smallest nodes, for several measures for which explicit expressions for the coefficients $\alpha_j$ and $\beta_j$ are known can be found in [19] and references therein. An estimate of the quadrature error in the Gauss rule (35) is given by
$\mathcal{S}_{2n+1} f - \mathcal{G}_n f. \qquad (43)$
Numerical examples provided in [18] show this estimate to be quite accurate for a wide range of integrands. As the rule $\mathcal{S}_{2n+1}$ typically has a strictly higher degree of precision than Laurie's averaged Gauss rule (40), we expect the quadrature error estimate (43) to generally be more accurate than the estimate (41), particularly for integrands with high-order differentiability.
In the computations, we use the representation of $\mathcal{S}_{2n+1}$ in terms of the eigenvalues and the squared first components of the normalized eigenvectors of the matrix $\widehat{T}_{2n+1}$.
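A sketch of the optimal averaged rule under the construction of $\widehat{T}_{2n+1}$ stated above (concatenation of $T_{n+1}$ with the reverse of $T_n$, coupled by $\beta_{n+1}$); it reuses the hypothetical gauss_quad helper and is meant only to indicate how the rule can be evaluated from the recursion coefficients:

```python
import numpy as np

def optimal_averaged_quad(alpha, beta, f, n, mu0=1.0):
    """Optimal averaged rule with 2n+1 nodes, built from the concatenated
    tridiagonal matrix [T_{n+1} ; reverse(T_n)] coupled by beta_{n+1}.

    Requires alpha[:n+1] and beta[:n+1]; reuses the hypothetical gauss_quad helper.
    """
    a = np.asarray(alpha, dtype=float)
    b = np.asarray(beta, dtype=float)
    diag = np.concatenate((a[: n + 1], a[:n][::-1]))            # length 2n+1
    off = np.concatenate((b[:n], [b[n]], b[: n - 1][::-1]))     # length 2n
    return gauss_quad(diag, off, f, mu0)
```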
We finally consider the Gauss-Radau quadrature rule $\mathcal{R}_{n+1}$, which has $n+1$ nodes, with one node anchored at 0. This rule can be written as
$\mathcal{R}_{n+1} f := \sum_{j=1}^{n} \hat{w}_j\, f(\hat{\theta}_j) + \hat{w}_{n+1}\, f(0). \qquad (44)$
To maximize the degree of precision, which is $2n$, the $n$ nodes $\hat{\theta}_j$, $j = 1, \ldots, n$, are suitably chosen. The rule $\mathcal{R}_{n+1}$ can be expressed as
$\mathcal{R}_{n+1} f = e_1^{t}\, f(T^{R}_{n+1})\, e_1,$
where
$T^{R}_{n+1} = \begin{bmatrix} T_n & \beta_n e_n \\ \beta_n e_n^{t} & \hat{\alpha}_{n+1} \end{bmatrix} \qquad (45)$
and the entry $\hat{\alpha}_{n+1}$ is chosen so that $T^{R}_{n+1}$ has an eigenvalue at the origin. Details on how to determine $\hat{\alpha}_{n+1}$ are provided by Gautschi [13,20] and Golub [21].
Theorem 4. Let the nodes of the rule (44) and the nodes of the rule (35) be ordered according to increasing magnitude. Then the nodes of the Gauss rule (35) strictly interlace the nodes of the Gauss-Radau rule (44); in particular, no node of (35) coincides with the origin.
Proof. The last subdiagonal entry of the matrix (45) is non-vanishing since the measure $d\mu$ in (34) has infinitely many support points. By the Cauchy interlacing theorem, the eigenvalues of the leading principal $n \times n$ submatrix (36) of the symmetric tridiagonal $(n+1) \times (n+1)$ matrix (45) strictly interlace the eigenvalues of the latter. Since one of the eigenvalues of (45) vanishes, the theorem follows. □
The measure $d\mu$ has no support at the origin. We therefore apply the Gauss-Radau rules with the fixed node at the origin in (44).
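A sketch of one way to realize the Gauss-Radau rule (44)-(45): the last diagonal entry of the bordered matrix is chosen by Golub's classical construction so that the matrix becomes singular, which places a node at the origin. The helper name and interface are illustrative:

```python
import numpy as np
from scipy.linalg import eigh_tridiagonal

def gauss_radau_origin_rule(alpha, beta, n, mu0=1.0):
    """Nodes and weights of the (n+1)-node Gauss-Radau rule (44)-(45) with a node at 0.

    alpha[:n] and beta[:n] are recursion coefficients; beta[n-1] is the coupling
    entry in (45). The last diagonal entry alpha_hat is chosen so that the
    bordered matrix (45) is singular (prescribed node z = 0).
    """
    a = np.asarray(alpha[:n], dtype=float)
    b = np.asarray(beta[:n], dtype=float)
    T_n = np.diag(a) + np.diag(b[: n - 1], 1) + np.diag(b[: n - 1], -1)
    rhs = np.zeros(n)
    rhs[-1] = b[-1] ** 2
    delta = np.linalg.solve(T_n, rhs)      # delta = T_n^{-1} (beta_n^2 e_n)
    alpha_hat = delta[-1]                  # last diagonal entry of (45) for z = 0
    nodes, vecs = eigh_tridiagonal(np.append(a, alpha_hat), b)
    weights = mu0 * vecs[0, :] ** 2
    return nodes, weights
```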
Finally, we compute error-norm estimates by using the “minimum rule”, which returns the smallest of the estimates furnished by the quadrature rules described above. Its node set is the union of the node sets of these rules and typically consists of distinct nodes. This rule is justified by the observation that the individual rules sometimes overestimate the error norm.
4. Error-Norm Estimation
This section outlines how the quadrature rules discussed in the previous section can be used to estimate the Euclidean norm of the error in the iterates $x_k$, $k = 1, 2, \ldots$, determined by the iterative method described in Section 2. The initial iterate is assumed to be $x_0 = 0$. Then the iterate $x_k$ lives in the Krylov subspace determined by (24) and (25). The residual corresponding to the iterate $x_k$ is defined in (6). We use the relation (7) to obtain the Euclidean norm of the error, cf. (8). Our task is to estimate the first term on the right-hand side of (8). In this section, the measure is defined by (10). In particular, the measure has support in intervals on the negative and the positive real axis; these intervals exclude an interval around the origin.
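To indicate how the first term on the right-hand side of (8) is estimated in practice, the sketch below approximates $b^{t} A^{-2} b$ from the Lanczos recursion coefficients alone, reusing the hypothetical helpers from the sketches in Section 3; the total mass of the measure (10) is $\|b\|^{2}$. Note that, for an indefinite spectrum, a Gauss node may fall close to the origin, which is precisely the difficulty addressed in this paper.

```python
import numpy as np

rng = np.random.default_rng(2)
m = 200
Q, _ = np.linalg.qr(rng.standard_normal((m, m)))
lam = np.concatenate((-rng.uniform(0.5, 4.0, m // 2), rng.uniform(0.5, 4.0, m - m // 2)))
A = Q @ np.diag(lam) @ Q.T            # symmetric indefinite test matrix
b = rng.standard_normal(m)

f = lambda t: 1.0 / t**2
exact = b @ np.linalg.solve(A, np.linalg.solve(A, b))   # b^t A^{-2} b

n = 12
alpha, beta, _ = lanczos(A, b, n + 1)   # n+1 steps: alpha, beta of length n+1
mu0 = b @ b                              # total mass of the measure (10)
print("Gauss            :", gauss_quad(alpha[:n], beta[: n - 1], f, mu0))
print("optimal averaged :", optimal_averaged_quad(alpha, beta, f, n, mu0))
print("exact            :", exact)
```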
We turn to the computation of the iterates $x_k$ described by (25). The computations can be structured in such a way that only a few $m$-vectors need to be stored. Let the $k \times k$ real symmetric tridiagonal matrix $T_k$ be given by (36) with $n = k$, i.e.,
$T_k = \begin{bmatrix} \alpha_1 & \beta_1 & & \\ \beta_1 & \alpha_2 & \ddots & \\ & \ddots & \ddots & \beta_{k-1} \\ & & \beta_{k-1} & \alpha_k \end{bmatrix}. \qquad (47)$
Based on the discussion following Equation (12), we may assume that the subdiagonal entries $\beta_j$ are nonzero. This ensures that the eigenvalues of $T_k$ are distinct. We can compute the QR factorization (17) of $T_k$ by applying a sequence of $k-1$ Givens rotations (48) to $T_k$. This yields an orthogonal matrix $Q_k$ and an upper triangular matrix $R_k$; cf. (49). For a discussion of Givens rotations, see, e.g., ([2], Chapter 5). In our iterative method, the matrix $Q_k$ is not explicitly formed; instead, we use the representation in (49). Since $T_k$ is tridiagonal, the upper triangular matrix $R_k$ has nonzero entries solely on the diagonal and the two adjacent superdiagonals.
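A generic sketch of the QR factorization of a tridiagonal matrix by Givens rotations, illustrating that the triangular factor has only three nonzero diagonals; this is the textbook construction ([2], Chapter 5) rather than the updating scheme used in the iterative method.

```python
import numpy as np

def tridiag_qr(T):
    """QR factorization of a (tridiagonal) matrix by Givens rotations.

    Returns Q (orthogonal) and R (upper triangular); for a tridiagonal T,
    R has nonzeros only on the diagonal and the first two superdiagonals.
    """
    k = T.shape[0]
    R = T.astype(float).copy()
    Q = np.eye(k)
    for j in range(k - 1):
        a, b = R[j, j], R[j + 1, j]
        r = np.hypot(a, b)
        c, s = a / r, b / r
        G = np.array([[c, s], [-s, c]])
        R[j:j + 2, :] = G @ R[j:j + 2, :]        # annihilate the subdiagonal entry
        Q[:, j:j + 2] = Q[:, j:j + 2] @ G.T      # accumulate the orthogonal factor
    return Q, R
```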
Application of $k$ steps of the Lanczos process to the matrix $A$ with initial vector $b$ results in the matrix $T_k$, as shown in (47). By performing one more step, we obtain a Lanczos decomposition analogous to (11), which we denote by (50). We observe that the last subdiagonal element of the symmetric tridiagonal matrix $T_{k+1}$ can be calculated right after the $k$th Lanczos step is completed. This is convenient when evaluating the tridiagonal matrices (45) associated with Gauss-Radau quadrature rules.
We can express the matrix
in terms of its QR factorization as follows:
whose factors can be computed from
and
in a straightforward manner. Indeed, we have
wherein
is a
real orthogonal matrix;
is a
real matrix;
is defined by (
48); and
is a real
matrix, which is made up of the first
k columns of
.
We express the matrices in terms of their columns to derive updating formulas for the computation of the triangular matrix
in (
51) from
in (
49):
A comparison between (
17) and (
51) yields the following results:
and
Thus, the elements of all these matrices can be calculated in only a few arithmetic floating-point operations (flops) per iteration.
As defined by (
18), the matrix expressed as
is the leading principal submatrix of
of order
k, and differs from
only in its last diagonal entry. From Equation (
53) and the fact that
is nonzero, it follows that
. When
is nonsingular, we obtain that
. Here we assume that the diagonal entries of the upper triangular matrix in all QR factorizations are non-negative.
We turn to the computation of the columns of the matrix
, that is,
from columns of
, where
is obtained by the modified Lanczos scheme (
50), see Algorithm 1, and
is determined by (
52). Substituting (
52) into the right-hand side of (
54) yields
As a result, the initial columns of the matrix correspond to those of . The columns and in are linear combinations obtained from the last columns of both and .
Given the solution
of the linear system (
29) and considering that
is upper triangular, with
as the leading principal submatrix of order
, the computation of the solution
of
is inexpensive. We find that
Note that the computation of
from
only requires the last column of the matrix
.
We are now in a position to compute
from
. Using Equations (
25) and (
55), we obtain
Note that only the last few columns of
and
are required to update the iterate
.
Algorithm 1 shows pseudo-code for computing the nontrivial elements of the matrix $T_k$ in (36) and the matrix of Lanczos vectors in (50). Each iteration requires the evaluation of one matrix-vector product with the matrix $A$ and a few operations with $m$-vectors; the latter only require $O(m)$ flops.