Abstract
Anderson($m$) extrapolation, an accelerator for fixed-point iterations, stores previous evaluations of the fixed-point map and computes a linear combination of those evaluations as the next iterate. The computational cost of Anderson($m$) acceleration grows as the parameter $m$ increases, so $m = 1$ is a common choice in practice. In this paper, with the aim of improving the computation of PageRank problems, a new method is developed by applying Anderson(1) extrapolation at periodic intervals within the Arnoldi-Inout method. The new method is called the AIOA method. Convergence analysis of the AIOA method is discussed in detail. Numerical results on several PageRank problems are presented to illustrate the effectiveness of the proposed method.
1. Introduction
As the core technology of network information retrieval, Google’s PageRank model (called the PageRank problem) uses the original hyperlink structure of the World Wide Web to determine the importance of each page and has received a great deal of attention over the last two decades. The core of the PageRank problem is to compute the dominant eigenvector (the PageRank vector) of the Google matrix by using the classical power method [1]:

$$x^{(k+1)} = A x^{(k)}, \qquad A = \alpha P + (1 - \alpha) v e^{T},$$

where $x$ is the PageRank vector, $e$ is a column vector with all elements equal to 1, $v$ is a personalization vector whose elements sum to 1, $P$ is a column-stochastic matrix (i.e., the columns corresponding to dangling nodes have been replaced by $v$), and $\alpha \in (0, 1)$ is the damping factor.
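As a point of reference, the following is a minimal Python sketch of this power iteration (the function name and defaults are purely illustrative; $P$ is assumed to be a column-stochastic NumPy array or SciPy sparse matrix, and real Web-scale computations never form the Google matrix $A$ explicitly):

```python
import numpy as np

def pagerank_power(P, v, alpha=0.85, tol=1e-8, max_iter=10000):
    """Classical power method for the Google matrix A = alpha*P + (1-alpha)*v*e^T.
    Since e^T x = 1 for a probability vector x, A x = alpha*P*x + (1-alpha)*v."""
    x = v.copy()
    for _ in range(max_iter):
        x_new = alpha * (P @ x) + (1 - alpha) * v   # one power-method step
        if np.linalg.norm(x_new - x, 1) < tol:      # 1-norm residual test
            return x_new
        x = x_new
    return x
```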
As the damping factor $\alpha$ approaches 1, the Google matrix comes closer to the original hyperlink structure. However, for $\alpha$ close to 1, the second eigenvalue of the Google matrix (whose modulus is bounded by $\alpha$ [2]) is close to the dominant eigenvalue (equal to 1), so the classical power method suffers from slow convergence. In order to accelerate the power method, many new algorithms have been proposed for computing PageRank problems. The quadratic extrapolation method proposed by Kamvar et al. [3] accelerates convergence by periodically subtracting estimates of non-dominant eigenvectors from the current iterate of the power method. It is worth mentioning that the authors of [4] provide a theoretical justification for such acceleration methods, generalizing the quadratic extrapolation and interpreting it as a Krylov subspace method. Gleich et al. [5] proposed an inner-outer iteration, wherein an inner PageRank linear system with a smaller damping factor is solved in each iteration. The inner-outer iteration shows good potential as a framework for accelerating PageRank computations, and a series of methods have been proposed based on it. For example, Gu et al. [6] constructed the power-inner-outer (PIO) method by combining the inner-outer iteration with the power method. It is also worth mentioning that different versions of the Arnoldi algorithm applied to PageRank computations were first introduced in [7]. Gu and Wang [8] proposed the Arnoldi-Inout (AIO) algorithm by knitting the inner-outer iteration together with the thick restarted Arnoldi algorithm [9]. Hu et al. [10] proposed a variant of the Power-Arnoldi (PA) algorithm [11] by using an extrapolation process based on the trace of the Google matrix [12].
Anderson($m$) acceleration [13,14] has been widely used to accelerate the convergence of fixed-point iterations. Its principle is to store previous evaluations of the fixed-point map and compute a linear combination of those evaluations to obtain a new iterate; Anderson(0) is just the given fixed-point iteration. Note that when the parameter $m$ becomes large, the computational cost of Anderson($m$) acceleration becomes expensive. Hence, in most applications, $m$ is chosen to be small, and we take the usual choice $m = 1$ in this paper. In [15], Toth and Kelley proved that Anderson(1) extrapolation is locally q-linearly convergent under suitable assumptions. Pratapa et al. [16] developed the Alternating Anderson–Jacobi (AAJ) method by periodically employing Anderson extrapolation to accelerate the classical Jacobi iterative method for sparse linear systems.
In this paper, with the aim of accelerating the Arnoldi-Inout method for computing PageRank problems, the Anderson(1) extrapolation is used as an accelerator, and a new method is presented by periodically combining the Anderson(1) extrapolation with the Arnoldi-Inout method. The proposed method is called the AIOA method; its construction and convergence behavior are analyzed in detail, and numerical experiments demonstrate the effectiveness of the new algorithm.
The remainder of this article is organized as follows: In Section 2, we briefly review the Anderson acceleration and the Arnoldi-Inout method for PageRank problems. In Section 3, the AIOA method is constructed, and its convergence behavior is discussed. In Section 4, numerical comparisons are reported. Finally, in Section 5, we give some conclusions.
2. Previous Work
2.1. Anderson Acceleration
Anderson acceleration (also known as Anderson mixing) has been widely used in electronic structure computations [17]. Walker and Ni [14] developed it for solving fixed-point problems $x = g(x)$, where $g: \mathbb{R}^n \rightarrow \mathbb{R}^n$. They showed that Anderson acceleration without truncation is essentially equivalent, in a certain sense, to the generalized minimal residual method (GMRES) for linear problems. It has also been proved that the Anderson iteration is convergent if the fixed-point map is a contraction and the coefficients in the linear combination remain bounded [15].
In this paper, we consider the Anderson(1) acceleration, which stores the two most recent evaluations $g(x_{k-1})$ and $g(x_k)$ and then computes a linear combination of them as the new iterate $x_{k+1}$. The main algorithmic steps of Anderson(1) are given as Algorithm 1.
| Algorithm 1 The Anderson(1) acceleration |
| (1) Given an initial vector $x_0$. |
| (2) Compute $x_1 = g(x_0)$, where $g(\cdot)$ is the fixed-point iteration. |
| (3) For $k = 1, 2, \ldots$, compute the residuals $f_{k-1} = g(x_{k-1}) - x_{k-1}$ and $f_k = g(x_k) - x_k$. |
| (4) Compute coefficients $(\alpha_{k-1}, \alpha_k)$ that satisfy |
| $\min_{\alpha_{k-1} + \alpha_k = 1} \left\| \alpha_{k-1} f_{k-1} + \alpha_k f_k \right\|_2$   (2) |
| (5) Compute $x_{k+1} = \alpha_{k-1}\, g(x_{k-1}) + \alpha_k\, g(x_k)$. |
The constrained linear least-squares problem (2) in step 4 of Algorithm 1 can be reformulated as an equivalent unconstrained least-squares problem:

$$\min_{\gamma} \left\| f_k - \gamma \left( f_k - f_{k-1} \right) \right\|_2,$$

after which the new iterate is $x_{k+1} = g(x_k) - \gamma \left( g(x_k) - g(x_{k-1}) \right)$. The unconstrained least-squares problem is easy to solve; for example, Pratapa et al. [16] used the generalized inverse to compute $\gamma$, and Walker and Ni [14] used a QR decomposition.
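To make the above steps concrete, here is a minimal Python sketch of Anderson(1) acceleration for a generic fixed-point map, using the unconstrained single-coefficient formulation just described (the function name `anderson1` and the default parameters are illustrative, not from the paper):

```python
import numpy as np

def anderson1(g, x0, tol=1e-8, max_iter=1000):
    """Anderson(1) acceleration for x = g(x), using the unconstrained
    least-squares form with one mixing coefficient gamma."""
    x_prev = np.asarray(x0, dtype=float)
    gx_prev = g(x_prev)
    x = gx_prev.copy()                      # step (2): one plain fixed-point step
    for _ in range(max_iter):
        gx = g(x)
        f, f_prev = gx - x, gx_prev - x_prev            # residuals f_k, f_{k-1}
        if np.linalg.norm(f) < tol:
            return x
        df = f - f_prev
        # gamma minimizes || f - gamma * (f - f_prev) ||_2
        gamma = (f @ df) / max(df @ df, np.finfo(float).eps)
        x_new = gx - gamma * (gx - gx_prev)             # Anderson(1) update
        x_prev, gx_prev, x = x, gx, x_new
    return x
```

For PageRank, $g$ would be one sweep of the inner-outer iteration described in Section 2.2, with the iterate renormalized so that its entries sum to 1.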
2.2. The Arnoldi-Inout Method for Computing PageRank
Gu and Wang [8] proposed the Arnoldi-Inout method by preconditioning the inner-outer iteration with the thick restarted Arnoldi method. Its algorithmic form is given as Algorithm 2.
| Algorithm 2 Arnoldi-Inout method [] |
| Input: an initial vector, the size of the Krylov subspace, the number of approximate eigenvectors retained from one cycle to the next, an inner tolerance, an outer tolerance, and three parameters to control the inner-outer iteration. |
| Output: PageRank vector . |
| (1). Apply the thick restarted Arnoldi algorithm [9,11] a few times (2–3 times). If the residual norm satisfies the prescribed tolerance, then stop; otherwise, continue. |
| (2). Run the inner-outer iteration with the approximate vector obtained from the thick restarted Arnoldi algorithm as the initial guess: |
| ; |
| 2.1. While |
| 2.2. |
| 2.3. |
| 2.4. |
| 2.5. While |
| 2.6. |
| 2.7. |
| 2.8. While |
| 2.9. |
| 2.10. |
| 2.11. |
| 2.12. End While |
| 2.13. |
| 2.14. |
| 2.15. End While |
| 2.16. |
| 2.17. If |
| 2.18. |
| 2.19. End If |
| 2.20. End While |
| 2.21. If the residual norm satisfies the prescribed tolerance, stop; else go to step 1. |
For Algorithm 2, it is necessary to indicate that:
- (1)
- The detailed description of the thick restarted Arnoldi algorithm in step 1 can be found in [9,11]. Here, we omit its implementation for conciseness.
- (2)
- The control parameters govern the switching between the inner-outer iteration and the thick restarted Arnoldi algorithm. The specific switching mechanism and further details can be found in [8].
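To make the inner-outer part of Algorithm 2 concrete, the following is a minimal Python sketch of the basic inner-outer iteration of Gleich et al. [5] on which Algorithm 2 builds; the switching logic of the Arnoldi-Inout method is omitted, and all names and default values are illustrative rather than taken from [8]:

```python
import numpy as np

def inner_outer_pagerank(P, v, alpha=0.99, beta=0.5, tau=1e-8, eta=1e-2, x0=None):
    """Basic inner-outer iteration: each outer step solves
    (I - beta*P) x = (alpha - beta)*P*x_old + (1 - alpha)*v
    approximately by inner Richardson sweeps (P column-stochastic)."""
    x = (v if x0 is None else x0 / np.sum(x0)).copy()
    y = P @ x
    while np.linalg.norm(alpha * y + (1 - alpha) * v - x, 1) >= tau:   # outer test
        f = (alpha - beta) * y + (1 - alpha) * v        # outer right-hand side
        x = f + beta * y                                # first inner sweep
        y = P @ x
        while np.linalg.norm(f + beta * y - x, 1) >= eta:              # inner test
            x = f + beta * y
            y = P @ x
    return alpha * y + (1 - alpha) * v                  # final update
```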
3. The AIOA Method for Computing PageRank
In this section, we combine the Arnoldi-Inout method with the Anderson(1) acceleration. The new method is called the AIOA method, which can be understood as the Arnoldi-Inout method accelerated with the Anderson(1) extrapolation. We first describe the construction of the AIOA method and then analyze its convergence behavior.
3.1. The Construction of the AIOA Method
The mechanism of the AIOA method can be described as follows: We first run the Arnoldi-Inout method with a given initial guess to obtain an approximate vector. If this approximation is unsatisfactory, we treat the inner-outer iteration as a fixed-point map and run Algorithm 1 with this vector as the starting vector to obtain another approximation. If the Anderson(1) iterate does not work better than the plain fixed-point iterate, we simply keep the fixed-point iterate. If the new approximation is still not up to the specified accuracy, we return to the Arnoldi-Inout method with it as the starting vector. This process is repeated until the required accuracy is reached. The specific algorithmic form is given as Algorithm 3, and a schematic sketch of the alternation is shown below.
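A minimal, schematic Python outline of this alternation follows; the plain residual comparison used here is only a stand-in for the switching tests of Algorithm 3, and `arnoldi_inout_phase` and the inner-outer fixed-point map `g` are assumed to be supplied by the caller:

```python
import numpy as np

def aioa_outline(arnoldi_inout_phase, g, x0, tol=1e-8, max_cycles=50):
    """Schematic AIOA alternation: an Arnoldi-Inout phase (Algorithm 2)
    followed by one Anderson(1) step on the inner-outer map g (Algorithm 1),
    keeping whichever iterate has the smaller fixed-point residual."""
    res = lambda z: np.linalg.norm(g(z) - z)
    x = x0
    for _ in range(max_cycles):
        x = arnoldi_inout_phase(x)                  # Algorithm 2 phase
        if res(x) < tol:
            break
        gx, ggx = g(x), g(g(x))                     # two fixed-point evaluations
        df = (ggx - gx) - (gx - x)
        gamma = ((ggx - gx) @ df) / max(df @ df, np.finfo(float).eps)
        x_aa = ggx - gamma * (ggx - gx)             # Anderson(1) iterate
        x = x_aa if res(x_aa) < res(ggx) else ggx   # keep the better vector
        if res(x) < tol:
            break
    return x
```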
3.2. Convergence Analysis
The convergence of the Arnoldi-Inout method and that of the Anderson acceleration can be found in [,,]. In this subsection, we analyze the convergence of the AIOA method. Specifically, the convergence analysis of Algorithm 3 focuses on the process when turning from the Anderson(1) acceleration to the Arnoldi-Inout method.
| Algorithm 3 AIOA method |
| (1). Given a unit initial guess, an inner tolerance, an outer tolerance, the size of the Krylov subspace, the number of approximate eigenvectors retained from one cycle to the next, and three parameters to control the inner-outer iteration. |
| (2). Run Algorithm 2 with the given initial vector. If the residual norm satisfies the prescribed tolerance, then stop; otherwise, continue. |
| (3). Run Algorithm 1 with the approximation vector obtained from step 2 as the starting guess: |
| 3.1. |
| 3.2. While |
| 3.3. |
| 3.4. Repeat |
| 3.5. |
| 3.6. Until |
| 3.7. ; |
| 3.8. |
| 3.9. |
| 3.10. End While |
| 3.11. Compute |
| 3.12. Compute that satisfies . |
| 3.13. Compute . |
| 3.14. If |
| 3.15. |
| 3.16. else |
| 3.17. |
| 3.18. End If |
| 3.19. If the residual norm satisfies the prescribed tolerance, stop; else go back to step 2 with the current vector as the starting vector. |
Let $\mathcal{P}_{m-1}$ denote the set of polynomials whose degree does not exceed $m-1$, and let $\Lambda(A)$ represent the set of eigenvalues of the matrix $A$. Assume the eigenvalues of $A$ are sorted in decreasing order of modulus, $|\lambda_1| \ge |\lambda_2| \ge \cdots \ge |\lambda_n|$. The following theorem, proposed by Saad [20], describes the relationship between an approximate eigenvector and the Krylov subspace $\mathcal{K}_m(A, v_1)$.
Theorem 1.
([20]). Assume that $A$ is diagonalizable and that the initial vector $v_1$ in Arnoldi’s method has the expansion $v_1 = \sum_{k=1}^{n} \alpha_k u_k$ with respect to the eigenbasis $\{u_1, u_2, \ldots, u_n\}$, in which $\|u_k\|_2 = 1$ for $k = 1, 2, \ldots, n$ and $\alpha_1 \neq 0$. Then the following inequality holds:

$$\left\| \left( I - P_m \right) u_1 \right\|_2 \le \xi_1 \epsilon^{(m)},$$

where $P_m$ is the orthogonal projector onto the subspace $\mathcal{K}_m(A, v_1)$, $\xi_1 = \sum_{k=2}^{n} \frac{|\alpha_k|}{|\alpha_1|}$, and $\epsilon^{(m)} = \min_{p \in \mathcal{P}_{m-1},\, p(\lambda_1) = 1} \max_{\lambda \in \Lambda(A) \setminus \{\lambda_1\}} |p(\lambda)|$.
For the purpose of analyzing the convergence speed of our algorithm, we recall two useful theorems about the spectral properties of the Google matrix.
Theorem 2.
([21]). Assume that the spectrum of the column-stochastic matrix $P$ is $\{1, \lambda_2, \ldots, \lambda_n\}$. Then the spectrum of the matrix $A = \alpha P + (1 - \alpha) v e^{T}$ is $\{1, \alpha\lambda_2, \ldots, \alpha\lambda_n\}$, where $0 < \alpha < 1$ and $v$ is a vector with nonnegative elements such that $e^{T} v = 1$.
Theorem 3.
([2]). Let $P$ be an $n \times n$ column-stochastic matrix. Let $\alpha$ be a real number such that $0 \le \alpha \le 1$. Let $E = v e^{T}$ be an $n \times n$ rank-one column-stochastic matrix, where $e$ is the $n$-vector whose elements are all ones and $v$ is an $n$-vector whose elements are all nonnegative and sum to 1. Let $A = \alpha P + (1 - \alpha) E$ be the resulting $n \times n$ column-stochastic matrix; then its dominant eigenvalue is $\lambda_1 = 1$, and $|\lambda_2| \le \alpha$.
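As a rough numerical illustration of why these spectral facts matter (the damping factor and tolerance below are assumed for illustration only), Theorems 2 and 3 imply that the subdominant eigenvalues of the Google matrix are damped by $\alpha$, so the asymptotic error reduction of the power method is governed by $\alpha$:

$$
|\lambda_2(A)| \le \alpha
\quad\Longrightarrow\quad
\#\text{iterations} \approx \frac{\log \tau}{\log \alpha},
\qquad
\text{e.g., } \alpha = 0.99,\ \tau = 10^{-8}:\ \frac{\log 10^{-8}}{\log 0.99} \approx 1833.
$$

This is precisely the regime in which accelerated methods such as the AIOA method are of interest.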
In the Arnoldi-Inout method, the approximate vector obtained from the previous thick restarted Arnoldi cycle is taken as the starting vector for the inner-outer iteration, which then produces a new vector by applying its iteration matrix; the derivation of this iteration matrix can be found in []. In our proposed method, we then run Algorithm 1 with this vector as the initial vector. Note that in the Anderson(1) acceleration we treat the inner-outer iteration as a fixed-point iteration, so the new vector is produced as a normalized linear combination of two successive fixed-point evaluations. If the plain fixed-point iterate works better than the Anderson(1) iterate, then, as specified in Algorithm 3, we simply keep the fixed-point iterate, which means that the Anderson(1) acceleration reduces to the inner-outer iteration and the convergence of Algorithm 3 is certainly established in this case. Hence, it remains to discuss the convergence for the other case, in which the Anderson(1) iterate works better than the fixed-point iterate.
In the next cycle of the AIOA algorithm, an $m$-step Arnoldi process is run with the Anderson(1) iterate as the starting vector, and a new Krylov subspace is constructed. We now state the theorem that describes the convergence of the AIOA method.
Theorem 4.
Suppose that the matrix is diagonalizable, and denote the orthogonal projector onto the new Krylov subspace analogously to Theorem 1. Then, under the notation of Theorem 1, we have
where,,and.
Proof of Theorem 4.
For any, there exists such that
where is the expansion of within the eigenbasis .
As shown in [] and [], we have
then
where we use.
Assume that is an eigenvalue of , and from Theorem 2, , then the matrix has eigenvalues
such that
Using the fact that and , we have and . Let
then, according to Theorem 3 and the derivation in [], we have, such that
Substituting (7) and (8) into (6), we obtain
and then
where we let satisfy , and .
Therefore, we have proved
□
Remark 1.
Comparing (4) with (5), it is easy to see that our method can improve the convergence speed by at least the factor given above when turning from the Anderson(1) acceleration to the Arnoldi-Inout method.
4. Numerical Experiments
In this section, we first discuss an appropriate choice of the control parameter and then test the effectiveness of the AIOA method. For the thick restarted Arnoldi procedure, two parameters (the subspace size and the number of retained approximate eigenvectors) need to be considered, but this procedure plays the same role in the Arnoldi-Inout [8] method and in the AIOA method. In addition, as these two parameters increase, the cost becomes expensive, so they usually take small values. As a result, we do not discuss the choice of these two parameters in detail and fix them at the same small values for all test examples.
All the numerical experiments were performed in MATLAB R2018a on a 2.10 GHz CPU with 16 GB RAM.
Table 1 lists the characteristics of the test matrices, where $n$ represents the matrix size, $nnz$ denotes the number of nonzero elements, and $den$ is the density (the proportion of nonzero entries). All the test matrices are available from https://sparse.tamu.edu/ (accessed on 14 July 2020). For fairness, the same initial guess was used for all methods. Two values of the damping factor were used in all numerical experiments. The stopping criterion was based on the 2-norm of the residual with a prescribed outer tolerance. For the inner-outer iterations, an inner residual tolerance and a smaller inner damping factor were used. The parameters controlling the flip-flop between the two procedures were fixed for all tests. We ran the thick restarted Arnoldi procedure twice in each loop of the Arnoldi-Inout [8] method and of the AIOA method. In the AIOA algorithm, we chose the QR decomposition to compute the extrapolation coefficient $\gamma$.
Table 1.
The characteristics of test matrices.
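As an aside, the following is a minimal Python sketch of how a column-stochastic matrix with dangling-node correction (and a density figure of the kind reported in Table 1) might be formed from a raw SuiteSparse link matrix; the function name, the implicit dangling-node handling, and the percentage convention for the density are illustrative assumptions, not specifications from the paper:

```python
import numpy as np
import scipy.sparse as sp

def build_column_stochastic(A_link, v=None):
    """From a raw link matrix (A_link[i, j] = 1 if page j links to page i),
    build the matvec x -> P*x + v*(d^T x), where columns are normalized by
    out-degree and dangling columns are implicitly replaced by v."""
    n = A_link.shape[0]
    v = np.full(n, 1.0 / n) if v is None else v
    out_deg = np.asarray(A_link.sum(axis=0)).ravel()          # column sums
    dangling = (out_deg == 0).astype(float)                   # dangling indicator d
    scale = np.divide(1.0, out_deg, out=np.zeros(n), where=out_deg > 0)
    P = sp.csc_matrix(A_link) @ sp.diags(scale)               # normalize columns
    matvec = lambda x: P @ x + v * (dangling @ x)             # column-stochastic action
    density = 100.0 * A_link.nnz / (n * n)                    # percentage of nonzeros
    return matvec, density
```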
4.1. The Selection of Parameter
In this subsection, we discuss the selection of the control parameter by analyzing the numerical results of the Arnoldi-Inout [8] (denoted as “AIO”) method and the AIOA method on the web-Stanford matrix, which contains 281,903 pages and 2,312,497 links. Table 2 lists the number of matrix–vector products (MV) of the AIO method and the AIOA method on the web-Stanford matrix for the two damping factor values. Figure 1 depicts the curves of computing time (CPU) of the two methods versus the parameter value.
Table 2.
The number of the matrix–vector products of the AIO method and the AIOA method on the web-Stanford matrix.
Figure 1.
The total computing (CPU) time of the Arnoldi-Inout (AIO) method and the AIOA method versus the parameter value on the web-Stanford matrix.
From Table 2, it is observed that the optimal parameter value differs for different damping factors and different methods. From Figure 1, the best value for the AIO method is 6 and the worst-performing value is 8, but for the AIOA method the best value is not 6. For fairness, we fixed a single parameter value for both methods in the following numerical experiments. In addition, Table 2 shows that in one case the MV count of the AIOA method is slightly larger than that of the AIO method, whereas the CPU time of the AIOA method is still better than that of the AIO method. This suggests that our method has some potential.
4.2. Comparisons of Numerical Results
In this subsection, we test the effectiveness of the AIOA method through numerical comparisons with the inner-outer (denoted as “Inout”) [5] method, the power-inner-outer (denoted as “PIO”) [6] method, and the Arnoldi-Inout (denoted as “AIO”) [8] method in terms of iteration counts (IT), the number of matrix–vector products (MV), and the computing time (CPU) in seconds. In all experiments in this subsection, the parameters were set as described above. Table 3, Table 4, Table 5 and Table 6 give the numerical results of the Inout method, the PIO method, the AIO method and the AIOA method on the four test matrices, and Figure 2, Figure 3, Figure 4 and Figure 5 depict the residual convergence curves of the above methods with different damping factors for all test matrices.
Table 3.
Numerical results of the four methods on the wb-cs-stanford matrix.
Table 4.
Numerical results of the four methods on the usroads-48 matrix.
Table 5.
Numerical results of the four methods on the web-Stanford matrix.
Table 6.
Numerical results of the four methods on the wiki-Talk matrix.
Figure 2.
Convergence behaviors of the four methods on the wb-cs-stanford matrix.
Figure 3.
Convergence behaviors of the four methods on the usroads-48 matrix.
Figure 4.
Convergence behaviors of the four methods on the web-Stanford matrix.
Figure 5.
Convergence behaviors of the four methods on the wiki-Talk matrix.
In order to better demonstrate the efficiency of the proposed method, we define the speedup of the AIOA method with respect to the AIO method in terms of CPU time as

$$\mathrm{speedup} = \frac{\mathrm{CPU}_{\mathrm{AIO}} - \mathrm{CPU}_{\mathrm{AIOA}}}{\mathrm{CPU}_{\mathrm{AIO}}} \times 100\%.$$
From the numerical results in Table 3, Table 4, Table 5 and Table 6, it is easy to see that the AIOA method performs better than the other three methods in terms of IT, MV and CPU time for the four matrices with different damping factors. As expected, the advantage of the AIOA method is most obvious for large damping factors; for instance, noticeable speedups over the AIO method are reported in Table 3, Table 4, Table 5 and Table 6. In addition, from Figure 2, Figure 3, Figure 4 and Figure 5, it is easy to observe that the AIOA method reaches the accuracy requirement faster than the Inout method, the PIO method and the AIO method for all test examples. Therefore, the above results verify the effectiveness of the AIOA method.
5. Conclusions
In this paper, by employing the Anderson(1) extrapolation at periodic intervals within the Arnoldi-Inout method, we have presented a new method, called the AIOA method, to accelerate the computation of PageRank problems. Its implementation and convergence theorem can be found in Section 3. The numerical results in Section 4 show that the AIOA method is efficient and converges faster than the inner-outer method, the power-inner-outer method and the Arnoldi-Inout method. However, there is still work to be done; for example, determining the best choices of the parameters remains an open issue.
Author Contributions
Methodology, C.W.; software, X.T.; writing—original draft preparation, X.T.; writing—review and editing, C.W., X.-M.G. and Z.-L.S. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Page, L.; Brin, S.; Motwani, R. The PageRank Citation Ranking: Bringing Order to the Web; Stanford InfoLab: Stanford, CA, USA, 1999. [Google Scholar]
- Haveliwala, T.; Kamvar, S. The Second Eigenvalue of the Google Matrix; Stanford InfoLab: Stanford, CA, USA, 2003. [Google Scholar]
- Kamvar, S.D.; Haveliwala, T.H.; Manning, C.D.; Golub, G.H. Extrapolation methods for accelerating PageRank computations. In Proceedings of the 12th International Conference on World Wide Web, Budapest, Hungary, 20–24 May 2003; pp. 261–270. [Google Scholar]
- Brezinski, C.; Redivo-Zaglia, M. The PageRank vector: Properties, computation, approximation, and acceleration. SIAM J. Matrix Anal. Appl. 2006, 28, 551–575. [Google Scholar] [CrossRef]
- Gleich, D.F.; Gray, A.P.; Greif, C. An inner-outer iteration for computing PageRank. SIAM J. Sci. Comput. 2010, 32, 349–371. [Google Scholar] [CrossRef]
- Gu, C.Q.; Xie, F.; Zhang, K. A two-step matrix splitting iteration for computing PageRank. J. Comput. Appl. Math. 2015, 278, 19–28. [Google Scholar] [CrossRef]
- Golub, G.H.; Greif, C. An Arnoldi-type algorithm for computing page rank. BIT 2006, 46, 759–771. [Google Scholar] [CrossRef]
- Gu, C.Q.; Wang, W. An Arnoldi-Inout algorithm for computing PageRank problems. J. Comput. Appl. Math. 2017, 309, 219–229. [Google Scholar] [CrossRef]
- Morgan, R.B.; Zeng, M. A harmonic restarted Arnoldi algorithm for calculating eigenvalues and determining multiplicity. Linear Algebra Appl. 2006, 415, 96–113. [Google Scholar] [CrossRef]
- Hu, Q.Y.; Wen, C.; Huang, T.Z.; Shen, Z.L.; Gu, X.M. A variant of the Power–Arnoldi algorithm for computing PageRank. J. Comput. Appl. Math. 2021, 381, 113034. [Google Scholar] [CrossRef]
- Wu, G.; Wei, Y. A Power—Arnoldi algorithm for computing PageRank. Numer. Linear Algebra Appl. 2007, 14, 521–546. [Google Scholar] [CrossRef]
- Tan, X. A new extrapolation method for PageRank computations. J. Comput. Appl. Math. 2017, 313, 383–392. [Google Scholar] [CrossRef]
- Anderson, D.G. Iterative procedures for nonlinear integral equations. JACM 1965, 12, 547–560. [Google Scholar] [CrossRef]
- Walker, H.F.; Ni, P. Anderson acceleration for fixed-point iterations. SIAM J. Numer. Anal. 2011, 49, 1715–1735. [Google Scholar] [CrossRef]
- Toth, A.; Kelley, C.T. Convergence analysis for Anderson acceleration. SIAM J. Numer. Anal. 2015, 53, 805–819. [Google Scholar] [CrossRef]
- Pratapa, P.P.; Suryanarayana, P.; Pask, J.E. Anderson acceleration of the Jacobi iterative method: An efficient alternative to Krylov methods for large, sparse linear systems. J. Comput. Phys. 2016, 306, 43–54. [Google Scholar] [CrossRef]
- Yang, C.; Meza, J.C.; Wang, L.W. A trust region direct constrained minimization algorithm for the Kohn–Sham equation. SIAM J. Sci. Comput. 2007, 29, 1854–1875. [Google Scholar] [CrossRef]
- Allaire, G.; Kaber, S.M.; Trabelsi, K. Numerical Linear Algebra; Springer: New York, NY, USA, 2008. [Google Scholar]
- Walker, H.F. Anderson Acceleration: Algorithms and Implementations; Report MS-6-15-50; WPI Math. Sciences Dept.: Worcester, MA, USA, 2011. [Google Scholar]
- Saad, Y. Numerical Methods for Large Eigenvalue Problems; Manchester University Press: Manchester, UK, 1992. [Google Scholar]
- Langville, A.; Meyer, C. Google’s PageRank and Beyond: The Science of the Search Engine Rankings; Princeton University Press: Princeton, NJ, USA, 2006. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).