Abstract
For large-scale problems, algorithms with high accuracy and stability are particularly important. In this paper, the Householder bidiagonalization total least squares (HBITLS) algorithm and the nonlinear iterative partial least squares for total least squares (NIPALS-TLS) algorithm are established, and both are shown to compute the same approximate TLS solution. In addition, the propagation of roundoff error through the HBITLS algorithm is analyzed, and the mixed forward–backward stability of the two algorithms is proved. Furthermore, an upper bound on the roundoff error is derived, which gives a more detailed and clearer description of the computed solution.
1. Introduction
Consider estimating from the overdetermined linear system
where errors exist in both the right-hand side b and the data matrix A. In this case, the total least squares (TLS) model is the appropriate one to adopt (cf. [1,2]). The TLS approach finds a perturbation of minimum Frobenius norm that makes system (1) compatible:
The TLS method is widely used in scientific fields such as physics, automatic control, signal processing, statistics, economics, biology, and medicine. In essence, the solution of a TLS problem can be expressed through the singular value decomposition of the augmented matrix. When the dimensions of A are not too large, one can use the truncated SVD (TSVD) method. When the dimensions of A become large, this approach becomes prohibitive because of the complexity of the SVD algorithm. These considerations lead us to Krylov iterative methods, which do not alter the matrix A. Like the Lanczos methods, they have the attractive feature that as n increases, the computed extreme singular elements rapidly become good approximations to the exact ones, and are satisfactorily accurate even when k is far less than n [1]. Moreover, the orthonormality of the Krylov basis strongly supports the use of Householder-matrix-based algorithms. This is particularly true when we must ensure that the perturbed problem being solved preserves certain spectral similarity properties, which is especially relevant when computing approximations to TLS problems. In view of this, we apply the Householder bidiagonalization algorithm and the NIPALS PLS algorithm proposed by Å. Björck [3] to TLS problems, forming the Householder bidiagonalization total least squares (HBITLS) algorithm and the NIPALS-TLS algorithm, respectively. Furthermore, we find that the HBITLS and NIPALS-TLS algorithms compute the same approximate solutions of the TLS problem.
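To make this concrete, here is a minimal NumPy sketch of the SVD-based TLS solution (cf. [1,2]); the function name tls_svd is ours, and the generic case is assumed, i.e., the last component of the smallest right singular vector of the augmented matrix is nonzero.

```python
import numpy as np

def tls_svd(A, b):
    """Classical TLS solution read off from the last right singular
    vector of the augmented matrix [A, b] (generic case assumed)."""
    m, n = A.shape
    C = np.column_stack([A, b])           # augmented matrix [A, b]
    _, _, Vt = np.linalg.svd(C)           # full SVD of the augmented matrix
    v = Vt[-1, :]                         # right singular vector for the smallest sigma
    if abs(v[n]) < np.finfo(float).eps:   # genericity condition violated
        raise ValueError("nongeneric TLS problem")
    return -v[:n] / v[n]                  # x_TLS = -v(1:n) / v(n+1)
```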
In practical problems, the arithmetic is inexact and errors occur at each step of the calculation. Arithmetic operations on a computer have finite precision, so rounding errors arise whenever numerical computations are performed, causing the computed quantities to differ from their theoretical values. One design principle of floating-point arithmetic is that it should encourage experts to develop robust, efficient, and portable numerical programs, enable the handling of arithmetic exceptions, and provide for the development of transcendental functions and high-precision arithmetic [4]. The roundoff error analyses of Lanczos-type methods obtained by Paige [5,6,7] played an important role in interpreting the behavior of the Lanczos method in finite-precision computations. Parlett and Scott [8] used the results of roundoff error analysis as the basis for a modification of the Lanczos method, which they called selective orthogonalization [8,9,10]. In addition, in many practical problems, the stopping criterion can be safely selected on the basis of the rounding error analysis of the original problem, thereby diminishing the need for an extremely precise approximation of the algebraic problem solution [4]. As far as we know, a roundoff error analysis of the approximate TLS solutions obtained via the Householder bidiagonalization procedure has not been systematically performed in the literature. Hence, in this paper, we analyze the propagation of roundoff error through the HBITLS algorithm and find that the HBITLS and NIPALS-TLS algorithms are mixed forward–backward stable.
The paper is organized as follows. Section 2 establishes the HBITLS and NIPALS-TLS algorithms and shows that they compute the same approximate TLS solution. Section 3 analyzes the propagation of roundoff error through the HBITLS algorithm. A brief conclusion is given in the last section.
2. HBITLS Algorithm and NIPALS-TLS Algorithm
It is well known that algorithms based on a sequence of orthogonal transformations with Householder matrices have very good stability properties; see Higham [4]. Building on this, this section presents the HBITLS and NIPALS-TLS algorithms and shows that they compute the same approximate TLS solutions.
2.1. HBITLS Algorithm
Let us first describe the Householder bidiagonalization process as given in [3]; here, however, the process is applied to the augmented matrix of the TLS problem. The idea is to compute orthogonal matrices and such that
and can be determined as products of Householder matrices, one per iteration. Generally, the left transformation introduces zeros in the kth column, while the right transformation zeros the appropriate entries of the kth row. This is done by the Householder procedure; for reasons of space we omit it here and refer to Algorithm 5.4.2 in [1] for details.
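As an illustration, the following compact NumPy sketch performs the Householder bidiagonalization loop in the spirit of Algorithm 5.4.2 in [1]; in the TLS setting it would be applied to the augmented matrix. The helper name house is ours, and the accumulation of the orthogonal factors is omitted for brevity.

```python
import numpy as np

def house(x):
    """Householder vector v with (I - 2 v v^T / (v^T v)) x = -+ ||x|| e_1."""
    v = x.astype(float).copy()
    sigma = np.linalg.norm(x)
    if sigma == 0.0:
        v[0] = 1.0
        return v
    v[0] += np.copysign(sigma, x[0])
    return v

def householder_bidiag(C):
    """Reduce C (m x p, m >= p) to upper bidiagonal form by alternating
    left and right Householder reflections (cf. Algorithm 5.4.2 in [1])."""
    B = C.astype(float).copy()
    m, p = B.shape
    for k in range(p):
        v = house(B[k:, k])           # zero column k below the diagonal
        B[k:, k:] -= 2.0 * np.outer(v, v @ B[k:, k:]) / (v @ v)
        if k < p - 2:
            w = house(B[k, k + 1:])   # zero row k right of the superdiagonal
            B[k:, k + 1:] -= 2.0 * np.outer(B[k:, k + 1:] @ w, w) / (w @ w)
    return B
```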
From the above process of Householder bidiagonalization, we know that V can be rewritten as
Let and be a leading principal submatrix of order of the final bidiagonal matrix . In exact arithmetic we have , . In any case, these equations hold to within machine precision. Then (4) and (5) can be rewritten as
where
After k steps of Householder bidiagonalization, the TLS problem can be projected onto the subspace generated by and . The reduced TLS problem (see also [11]) is as follows
or
where , and and are generally full. As in LSQR, we seek an approximate TLS solution
where denotes the Krylov subspace span.
Let the SVD of be , and, if , let
with
then the approximate TLS solution is given by
Note that only the last right singular vector is needed to compute . Summarizing the above process, we obtain the Householder bidiagonalization TLS (HBITLS) algorithm:
Remark 1.
A variant of Algorithm 1 can also be given in which the products of Householder transformations applied to vectors are replaced by operations that can, to a large extent, be performed concurrently. This variant gives an efficient way to exploit parallelism when matrix-vector products are computed in parallel. For this variant of Algorithm 1, we refer to [12] and omit it here.
Algorithm 1 HBITLS
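The following is a hypothetical end-to-end sketch of the procedure behind Algorithm 1. For readability, the Krylov bases are generated here by the Golub-Kahan recurrences, which in exact arithmetic yield the same bases and bidiagonal matrix as the Householder products of Algorithm 1; breakdown checks are omitted, the genericity condition is assumed, and np.linalg.svd stands in for the implicit zero-shift QR step discussed in Section 3.

```python
import numpy as np

def hbitls(A, b, k):
    """Sketch of a k-step HBITLS solve: bidiagonalize, solve the reduced
    TLS problem from the smallest right singular vector, map back."""
    m, n = A.shape
    U = np.zeros((m, k + 1))
    V = np.zeros((n, k))
    alpha = np.zeros(k)
    beta = np.zeros(k + 1)
    beta[0] = np.linalg.norm(b)
    U[:, 0] = b / beta[0]
    for j in range(k):  # Golub-Kahan recurrences (no breakdown checks)
        v = A.T @ U[:, j] - (beta[j] * V[:, j - 1] if j > 0 else 0.0)
        alpha[j] = np.linalg.norm(v)
        V[:, j] = v / alpha[j]
        u = A @ V[:, j] - alpha[j] * U[:, j]
        beta[j + 1] = np.linalg.norm(u)
        U[:, j + 1] = u / beta[j + 1]
    # (k+1) x k lower bidiagonal B_k with A V_k = U_{k+1} B_k, U_{k+1}^T b = beta_1 e_1
    B = np.zeros((k + 1, k))
    for j in range(k):
        B[j, j] = alpha[j]
        B[j + 1, j] = beta[j + 1]
    rhs = np.zeros(k + 1)
    rhs[0] = beta[0]
    # reduced TLS problem: smallest right singular vector of [B_k, beta_1 e_1]
    _, _, Wt = np.linalg.svd(np.column_stack([B, rhs]))
    w = Wt[-1, :]
    y = -w[:k] / w[k]   # projected TLS solution (assumes w[k] != 0)
    return V @ y        # approximate TLS solution x_k = V_k y_k
```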
Lemma 1.
The sets and generated by Algorithm 1 are orthonormal bases of and , respectively.
Proof.
From the facts that , , and the process of Householder bidiagonalization, for it is easy to see that for , i.e., that
Certainly . It clearly holds if . Suppose for some that the iteration has produced with orthonormal columns such that
It is easy to see from (4) that
and we have , where . If , then is orthogonal to . It follows that and
Thus, and
On the other hand, if , then . This says that is invariant for , and the induction is complete. The proof for is similar. □
The Householder matrices and need not be formed explicitly; that is, the matrices and can remain in product form in the HBITLS algorithm. In floating-point arithmetic, Householder transformations suffer very little loss of orthogonality.
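For instance, keeping only the Householder vectors, a product such as $H_1 H_2 \cdots H_k$ can be applied to a vector without ever forming the matrices; a minimal sketch (the helper name is ours):

```python
import numpy as np

def apply_householders(vs, x, transpose=False):
    """Apply P = H_1 H_2 ... H_k (or P^T) to x, each factor stored only
    as its Householder vector v, with H = I - 2 v v^T / (v^T v)."""
    y = np.asarray(x, dtype=float).copy()
    # P x applies H_k first; P^T x = H_k ... H_1 x applies H_1 first
    for v in (vs if transpose else list(reversed(vs))):
        y -= (2.0 * (v @ y) / (v @ v)) * v
    return y
```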
2.2. The NIPALS-TLS Algorithm
For the NIPALS PLS algorithm, see [3,13]. In this paper, we use it to solve TLS problems, forming the NIPALS-TLS algorithm. The HBITLS and NIPALS-TLS algorithms generate the same orthonormal basis sequences . From the uniqueness of this basis, combined with the relationship between the two algorithms, we conclude that they produce the same numerical solution.
Following [3], setting for , we can produce sequences and of the following form:
In (16), and are formed by deflating and by subtracting their orthogonal projections onto . This operation uses elementary orthogonal transformations, such that , . The deflation can also be written as
The process terminates when either or is met, as in the sketch below. Note that if , then the rank of the matrix is exactly one less than that of .
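A hypothetical sketch of this deflation loop, including the two termination tests, is given here; the normalization conventions of (14)-(18) may differ slightly, so this only illustrates the rank-one deflation pattern.

```python
import numpy as np

def nipals_pls(A, b, k, tol=0.0):
    """Rank-one NIPALS PLS deflation: at each step, subtract from the
    deflated data its orthogonal projection onto the new left vector."""
    Ak = A.astype(float).copy()
    bk = b.astype(float).copy()
    m, n = A.shape
    U = np.zeros((m, k))
    V = np.zeros((n, k))
    for j in range(k):
        w = Ak.T @ bk
        if np.linalg.norm(w) <= tol:       # termination test
            return U[:, :j], V[:, :j]
        V[:, j] = w / np.linalg.norm(w)
        t = Ak @ V[:, j]
        if np.linalg.norm(t) <= tol:       # termination test
            return U[:, :j], V[:, :j]
        U[:, j] = t / np.linalg.norm(t)
        # deflate A_k and b_k by subtracting projections onto u_j
        Ak -= np.outer(U[:, j], U[:, j] @ Ak)
        bk -= U[:, j] * (U[:, j] @ bk)
    return U, V
```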
Using exact arithmetic, the sets and generated by (14) and (15) are the unique orthogonal bases for the Krylov subspaces and , respectively. Summing (17) and (18) gives
where and . These relations hold to working accuracy and do not depend on orthogonality. The matrix is a rank-k approximation to the data matrix A. From [3], we have and . Thus, in exact arithmetic, the matrix is upper bidiagonal with its elements
and
By Paige [14], we know that must be identical to the matrix that would be obtained from the conventional QR factorization of , such that
Then we have
Let ; then the solution of the projected TLS problem (9) is , and the TLS solution is . The following theorem then comes from (12) and (21).
Theorem 1.
The HBITLS and NIPALS-TLS algorithms compute the same approximate solutions .
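As a quick numerical sanity check of the sketches above (tls_svd and hbitls are the hypothetical helpers defined earlier): for k = n, the Krylov subspaces are generically exhausted, so the projected solution should reproduce the direct SVD solution up to rounding.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((200, 10))
b = A @ rng.standard_normal(10) + 1e-3 * rng.standard_normal(200)

x_direct = tls_svd(A, b)       # SVD of the full augmented matrix [A, b]
x_krylov = hbitls(A, b, k=10)  # k = n steps of the projected method
# expected to be of the order of machine precision
print(np.linalg.norm(x_direct - x_krylov) / np.linalg.norm(x_direct))
```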
3. Roundoff Error Analysis
In this section, we analyze the propagation of roundoff error during the HBITLS algorithm and naturally obtain the mixed forward–backward stability of the HBITLS and NIPALS-TLS algorithms. The total roundoff error of the HBITLS algorithm can be divided into the following four parts:
First, the HBITLS algorithm in fact solves a perturbed version of the original TLS problem (2). The roundoff error propagation of Householder matrices in the HBITLS algorithm behaves favorably in numerical computations.
From now on, we denote by the machine precision under consideration. It is shown in [15] that the computed Householder matrix is close to the exact Householder matrix H itself:
Moreover, for a vector , the computed updates with are very close to the exact updates with H:
and, in general,
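These classical bounds (cf. [4,15]) have the following general shape; the constants $c_1, c_2, c_3$ are modest and dimension-dependent, and are stated here generically as an assumption, since their exact values depend on implementation details:

```latex
% classical shape of the Householder roundoff bounds (cf. [4,15]);
% the constants c_i are generic placeholders (assumption)
\|\widetilde{H} - H\|_2 \le c_1\,\varepsilon, \qquad
\mathrm{fl}(\widetilde{H}v) = (H + \Delta H)v, \quad \|\Delta H\|_F \le c_2\,\varepsilon,
```

and, for a product of $k$ computed Householder transformations applied to a matrix $A$,

```latex
\mathrm{fl}(\widetilde{H}_k \cdots \widetilde{H}_1 A) = H_k \cdots H_1 (A + \Delta A), \qquad
\|\Delta A\|_F \le c_3\,k\,\varepsilon\,\|A\|_F + O(\varepsilon^2).
```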
The following lemma shows that the reduced system calculated by the HBITLS algorithm is equivalent to one obtained from a perturbation of the original system. Here, , and denote the floating-point computations of the matrices , and in the HBITLS algorithm, respectively.
Lemma 2.
Let be the computed bidiagonal matrix obtained by the HBITLS algorithm. Then there exist a perturbation matrix E and two column orthogonal matrices and such that
and
where n is the number of columns of matrix A. Furthermore, the matrix is an orthonormal basis of with a perturbation vector , where
Proof.
We prove this lemma by induction. The key point is to characterize the computed matrix, which we show by induction from (3), for , as follows
For , first let , ; a Householder matrix is found such that . Set ; ref. [16] tells us that, corresponding to the matrix , we can find a Householder matrix such that
with
Next, let , where . Similarly, for , the computed result can be written as , where . Now we take the Householder matrix such that is an upper bidiagonal matrix. Since acts only on the vector , the first column of is unchanged when multiplied by and . Likewise, there is a Householder matrix such that is bidiagonal in theory, but in practice the algorithm computes a matrix [16] such that
where
Finally, we have
For the kth step, assume that the HBITLS algorithm has calculated the matrices associated with the Householder matrices . Then, after k steps, we obtain the following result:
where
We know and , where . For , the floating-point vector is , where . Likewise, there is a Householder matrix , which acts only on the vector , such that is upper bidiagonal. In practice, the algorithm computes a matrix such that
where
Then the floating-point matrix is obtained, such that
Let , be the matrices such that and , respectively; we find that the first rows of each are . Let be the i-th column of ; then we have the following
Then we get
which is an upper bidiagonal matrix, and we obtain
Since is the j-th column of the matrix , if we denote by we have
and so we can obtain
Cutting off the first column of the matrix, we can set with such that , where is an upper bidiagonal matrix. If we denote , then . In addition, letting be the matrix made up of the first j columns of , we obtain , and from the structure of it follows that . Then we can write , owing to , and so
If , the proof of the first part of the lemma is complete, because
Finally, we prove that the columns of the matrix form an orthogonal basis of a Krylov space. Let and form . We know that ; setting , we have
We prove the rest of the lemma by induction as well; that is, we prove
where, for each vector , only the first components are different from zero.
For , we have
since the last component of the vector is zero; in addition, all components of the vector except the first two are zero.
Suppose for a given i the following relation is true,
and we now show that it holds for .
From the inductive hypothesis, only the first components of are nonzero; therefore, the last component of the vector is zero. It follows that
and, hence, the lemma is proved. □
Based on Lemma 2 and Algorithm 1, for the bidiagonalization matrix obtained by the HBITLS algorithm, one can find an orthonormal matrix such that
where is just an orthogonal basis of . Based on this, we know that the first part of HBITLS, in exact arithmetic, gives the exact basis of the perturbed Krylov space . A perturbation bound for TLS solutions is given by Xie, Xiang, and Wei [17] (see Lemma 3), which involves the smallest singular value of , the TLS solution , and the residual . Let and . Then the unique solution of the perturbed TLS problem can be expressed as . Denote and , with . The perturbation bound is obtained under the genericity condition , where is the smallest singular value of A.
Lemma 3
([17]). Consider the TLS problem (2) and assume that the genericity condition holds. If is sufficiently small, then we obtain that
Suppose that and are the exact TLS solutions of and , respectively. The error introduced in this part of the HBITLS algorithm is the inherent error, so can be obtained easily from Lemmas 2 and 3; see Theorem 2.
Secondly, let us consider the error between the TLS solution of the system and the approximate solution of the system computed by the HBITLS algorithm at step k in exact arithmetic; i.e., is the exact solution of the reduced TLS problem
For convenience, define and let
where W and are orthogonal matrices of dimension m and , respectively, and is a diagonal matrix whose diagonal entries are the singular values of , sorted in non-increasing order. Let be the subspace angle between and .
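Numerically, such a $\sin\theta$ can be evaluated from the principal angles between column spaces; a small illustration using SciPy's subspace_angles (the matrices X and Y below are arbitrary stand-ins, not quantities from the paper):

```python
import numpy as np
from scipy.linalg import subspace_angles

rng = np.random.default_rng(1)
X = rng.standard_normal((50, 3))             # basis matrix for one subspace
Y = X + 1e-4 * rng.standard_normal((50, 3))  # a slightly perturbed copy
theta = subspace_angles(X, Y).max()          # largest principal angle
print(np.sin(theta))                         # sine of the subspace angle
```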
Lemma 4.
Let denote the TLS solution to the linear system (2) satisfying the genericity condition, and let be the approximate solution obtained from Algorithm 1. Then
where
Proof.
It is easy to see that
Then we have an orthonormal matrix with the partition
such that
(i.e., G “forms a complete space”). From Equation (27)
it follows that
and, therefore
From the CS theorem [18], we know that . Then
Notice that
denotes the sine of the subspace angle between and . Hence, the upper bound can be proved as follows
For the lower bound, we have
Since , this proves the lower bound. □
Thirdly, we need to consider how to solve problem (9) and to bound the error between the solution obtained by this method and the theoretical solution. Let the computed solution be
, where is the computed solution of problem (23).
In [19], Demmel and Kahan proposed the zero-shift QR iteration, which guarantees forward stability, together with an implicit version of the algorithm. An error analysis covering both the singular values and the singular vectors is also given, which is exactly what we need.
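For concreteness, here is one sweep of the implicit zero-shift QR algorithm of [19] on an upper bidiagonal matrix, transcribed into Python; the rot helper below uses plain hypot-based scaling, whereas [19] guards more carefully against overflow and underflow.

```python
import numpy as np

def rot(f, g):
    # Givens rotation: (c, s, r) with c*f + s*g = r and -s*f + c*g = 0
    if f == 0.0:
        return 0.0, 1.0, g
    r = np.hypot(f, g)
    return f / r, g / r, r

def zero_shift_qr_sweep(d, e):
    """One sweep of the implicit zero-shift QR algorithm [19] on an upper
    bidiagonal matrix with diagonal d (length n >= 2) and superdiagonal e
    (length n - 1); d and e are updated in place."""
    n = len(d)
    oldc, c, olds = 1.0, 1.0, 0.0
    for i in range(n - 1):
        c, s, r = rot(c * d[i], e[i])
        if i > 0:
            e[i - 1] = olds * r
        oldc, olds, d[i] = rot(oldc * r, d[i + 1] * s)
    h = c * d[n - 1]
    e[n - 2] = h * olds
    d[n - 1] = h * oldc
```

Repeated sweeps, combined with the usual deflation tests, drive the off-diagonal entries to zero; the absolute values of the diagonal then approximate the singular values to high relative accuracy, as quantified in Lemma 5.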
Lemma 5
([19]). Let be the matrix obtained by running the implicit zero-shift QR algorithm on a bidiagonal matrix B with . Suppose that all perturbation angles θ arising from the operations of the algorithm satisfy . Let and be the singular values of B and , respectively. If
then we have:
Moreover, let be the singular values of produced after k steps of the implicit zero-shift QR algorithm. Then if condition (30) holds, and all perturbation angles θ satisfy , we obtain
Demmel and Kahan [19] also bound the differences between the singular vectors of B and those of .
Lemma 6
([19]). Let be a singular value of an unreduced bidiagonal matrix B, with and its corresponding left and right singular vectors, respectively. Let and be the singular vectors computed by the implicit zero-shift QR algorithm. Then the errors in are bounded by
Then, combining this with the perturbation bound for TLS given in [20] and stated in Lemma 7, we can give the error estimate .
Let be a rank-k matrix approximation to , and . Let represent a perturbation of , let denote a rank-k matrix approximation to , and define ; then
Lemma 7
([20]). Let and denote the TLS solution and the perturbed TLS solution, respectively. If (the k-th singular value of A), then
where and are the smallest right singular vectors of and respectively.
In summary, letting be the final computed solution at the k-th step, the roundoff error analysis of the HBITLS algorithm for the TLS problem can be stated as follows.
Theorem 2.
Consider the HBITLS algorithm at step k. The roundoff error arising during the algorithm can be bounded as follows:
where and are defined as in (26), and is the computed smallest right singular vector of .
Proof.
The roundoff error can be decomposed into the following parts
and we analyze these errors separately.
For the first part, and are the TLS solutions of the systems and , respectively, in line with Lemma 2, so the error of this part is the inherent error. Then, combining this with Lemma 3, we have
where and are as in Lemma 3.
For the second part, the error is due to the approximate solution of obtained by the HBITLS algorithm after k steps in exact arithmetic. Lemma 4 tells us that
For the third part, notice that , where is the roundoff error stemming from the projected TLS solution. Since is a bidiagonal matrix of special form, we use the implicit zero-shift QR algorithm to perform the singular value decomposition. (31) gives an upper bound on the angle between the solution vectors, and combining this with Lemma 7, we obtain
where is the subspace angle between the subspaces spanned by the smallest right singular vector and the computed smallest right singular vector of , respectively.
For the last part, we know where is the product of k Householder matrices. So, on the basis of (22), we obtain
□
By Theorem 2, the mixed forward–backward stability of the HBITLS and NIPALS-TLS algorithms follows naturally. The backward stability generates perturbations that only marginally influence the theoretical convergence of the residual to zero.
Remark 2.
The bound introduced in Theorem 2 shows that the total roundoff error is dominated by the approximation errors . Consequently, in many practical problems, the stopping criterion required by the algorithm can be safely selected based on the theoretical properties of the original problem itself, thereby reducing the cost of pursuing an extremely accurate approximate solution.
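As an illustration only, a stopping rule of this kind could monitor stagnation of successive approximate solutions; the sketch below reuses the hypothetical hbitls helper from Section 2 and restarts from scratch at each k, whereas a real implementation would extend the bidiagonalization incrementally.

```python
import numpy as np

def hbitls_adaptive(A, b, tol=1e-8, kmax=None):
    """Grow the Krylov subspace until successive approximate TLS
    solutions stagnate, instead of fixing k in advance."""
    kmax = kmax or A.shape[1]
    x_old = hbitls(A, b, 1)
    for k in range(2, kmax + 1):
        x_new = hbitls(A, b, k)
        if np.linalg.norm(x_new - x_old) <= tol * np.linalg.norm(x_new):
            return x_new, k   # iterates have stagnated to within tol
        x_old = x_new
    return x_old, kmax
```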
4. Conclusions
For large-scale problems, algorithms with good accuracy and stability are particularly important. In this paper, the Householder bidiagonalization total least squares (HBITLS) algorithm and the nonlinear iterative partial least squares for total least squares (NIPALS-TLS) algorithm are given. HBITLS uses Householder bidiagonalization to reduce the problem to upper bidiagonal form and then runs the implicit zero-shift QR algorithm to compute the smallest right singular vector of the reduced form, from which the approximate solution is obtained. NIPALS-TLS is based on rank-reducing orthogonal projections. The two algorithms compute the same approximate TLS solutions. By analyzing the propagation of roundoff error through the HBITLS algorithm, we find that both the HBITLS and NIPALS-TLS algorithms are mixed forward–backward stable. In addition, in many practical problems, the stopping criterion can be safely selected on the basis of the rounding error analysis of the original problem. Our upper bound on the roundoff error gives a more detailed and clearer description of the computed solution.
Author Contributions
Conceptualization, Z.Y. and X.L.; methodology, Z.Y. and X.L.; formal analysis, Z.Y.; data curation, X.L.; writing—original draft preparation, X.L.; writing—review and editing, Z.Y. and X.L. All authors have read and agreed to the published version of the manuscript.
Funding
This research is supported by the NNSF of China (11571004, 11701456), the Natural Science Foundation of Qinghai Province (2018-ZJ-717), the Foundation Sciences of Qinghai Nationalities University (2020XJG11, 2019XJZ10), and the Innovation Team of Qinghai Nationalities University.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Golub, G.H.; Van Loan, C.F. Matrix Computations, 4th ed.; The Johns Hopkins University Press: Baltimore, MD, USA, 2013.
- Van Huffel, S.; Vandewalle, J. The Total Least Squares Problem: Computational Aspects and Analysis. In Frontiers in Applied Mathematics; SIAM: Philadelphia, PA, USA, 1991; Volume 9.
- Björck, Å. Stability of two direct methods for bidiagonalization and partial least squares. SIAM J. Matrix Anal. Appl. 2014, 35, 279–291.
- Higham, N.J. Accuracy and Stability of Numerical Algorithms, 2nd ed.; SIAM: Philadelphia, PA, USA, 2002.
- Paige, C.C. The Computation of Eigenvalues and Eigenvectors of Very Large Sparse Matrices. Ph.D. Thesis, University of London, London, UK, 1971.
- Paige, C.C. Computational variants of the Lanczos method for the eigenproblem. J. Inst. Math. Appl. 1972, 10, 373–381.
- Paige, C.C. Error analysis of the Lanczos algorithm for tridiagonalizing a symmetric matrix. J. Inst. Math. Appl. 1976, 18, 341–349.
- Parlett, B.N.; Scott, D.S. The Lanczos algorithm with selective orthogonalization. Math. Comput. 1979, 33, 217–238.
- Ikramov, K.D. Sparse matrices. Itogi Nauki Tekh. Mat. Anal. 1982, 20, 189–259. (In Russian)
- Parlett, B.N. The Symmetric Eigenvalue Problem; Prentice-Hall: Englewood Cliffs, NJ, USA, 1980.
- Björck, Å. Numerical Methods for Least Squares Problems; SIAM: Philadelphia, PA, USA, 1996.
- Walker, H.F. Implementation of the GMRES method using Householder transformations. SIAM J. Sci. Statist. Comput. 1988, 9, 152–163.
- Wold, H. Estimation of principal components and related models by iterative least squares. In Multivariate Analysis; Krishnaiah, P.R., Ed.; Academic Press: New York, NY, USA, 1966; pp. 391–420.
- Paige, C.C.; Saunders, M.A. LSQR: An algorithm for sparse linear equations and sparse least squares. ACM Trans. Math. Software 1982, 8, 43–71.
- Wilkinson, J.H. The Algebraic Eigenvalue Problem; Oxford University Press: London, UK, 1965.
- Lawson, C.; Hanson, R. Solving Least Squares Problems; Prentice-Hall: Englewood Cliffs, NJ, USA, 1974.
- Xie, P.; Xiang, H.; Wei, Y. A contribution to perturbation analysis for total least squares problems. Numer. Algorithms 2017, 75, 381–395.
- Paige, C.C.; Saunders, M.A. Towards a generalized singular value decomposition. SIAM J. Numer. Anal. 1981, 18, 398–405.
- Demmel, J.; Kahan, W. Accurate singular values of bidiagonal matrices. SIAM J. Sci. Statist. Comput. 1990, 11, 873–912.
- Fierro, R.D.; Bunch, J.R. Perturbation theory for orthogonal projection methods with application to least squares and total least squares. Linear Algebra Appl. 1996, 234, 71–96.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).