1. Introduction
Asymmetric square matrices naturally arise in numerous scientific domains, including sociology (e.g., social mobility matrices), marketing (e.g., brand-switching data), and psychology (e.g., stimulus identification experiments) [1,2]. To facilitate the analysis of such data, Harshman [3] introduced the DEcomposition into DIrectional COMponents (DEDICOM) model, which approximates an asymmetric matrix $X \in \mathbb{R}^{n \times n}$ as
$$X = A R A^\top + E, \qquad (1)$$
where $A \in \mathbb{R}^{n \times r}$ (with $r \le n$) is typically assumed to be column-orthonormal and encodes object coordinates in a latent space, $R \in \mathbb{R}^{r \times r}$ represents the (possibly asymmetric) inter-dimensional relationships, and $E$ denotes the residual matrix. While DEDICOM offers interpretability in modeling asymmetric relationships, it lacks capabilities for effective visualization. To address this limitation, Chino [4] introduced the GIPSCAL (Generalized Inner Product SCALing) model, which not only handles asymmetric data but also provides a framework for graphical representation. However, the original GIPSCAL formulation may exhibit inadequate fit to empirical data. A generalized variant, introduced by Kiers and Takane [5], improves model flexibility via the decomposition
$$X = A\left(I_r + K\right)A^\top + E, \qquad (2)$$
where $A \in \mathbb{R}^{n \times r}$ is a weight matrix, $I_r$ is the $r \times r$ identity matrix, $K \in \mathbb{R}^{r \times r}$ is skew-symmetric, and $E$ denotes the error matrix. The symmetric part $AA^\top$ may be visualized using Classical Multidimensional Scaling (MDS) techniques [6,7], while the asymmetric component $AKA^\top$ is treated using Gower's method [7,8].
The model parameters are typically estimated by minimizing the least-squares objective, as outlined in Kiers and Takane [5]. Subsequent developments by Trendafilov and Gallo [9] and Trendafilov [10] recast the GIPSCAL framework into the following constrained optimization problem:
$$\min_{Q \in \mathcal{O}(n,r),\ D \in \mathcal{D}_+,\ K \in \mathcal{K}} \ \|X - Q(D + K)Q^\top\|_F^2. \qquad (3)$$
Here, $\|\cdot\|_F$ denotes the Frobenius norm, $\mathcal{O}(n,r) = \{Q \in \mathbb{R}^{n \times r} : Q^\top Q = I_r\}$ is the Stiefel manifold of orthonormal matrices, $\mathcal{D}_+$ is the set of $r \times r$ diagonal matrices with nonnegative entries, and $\mathcal{K}$ denotes the set of skew-symmetric $r \times r$ matrices.
This reformulation clarifies the connection between GIPSCAL and the DEDICOM model in (1), revealing that GIPSCAL constitutes a special case of DEDICOM in which the symmetric part of $R$ is constrained to lie in the nonnegative diagonal cone. Additionally, this formulation can be interpreted as a natural asymmetric extension of the INdividual Differences SCALing (INDSCAL) model [7,11]. The GIPSCAL methodology has further been generalized to handle three-way data arrays comprising $N$ asymmetric slices $X_1, \dots, X_N \in \mathbb{R}^{n \times n}$, analogous to three-way extensions of INDSCAL. In this setting, each slice is modeled as [9,10,12,13]
$$X_i = Q(D_i + K_i)Q^\top + E_i, \qquad i = 1, \dots, N, \qquad (4)$$
where $Q \in \mathcal{O}(n,r)$ is a common loading matrix shared across all slices, $D_i \in \mathcal{D}_+$ are slice-specific nonnegative diagonal matrices, and $K_i \in \mathcal{K}$ are slice-specific skew-symmetric matrices. Consequently, three-way GIPSCAL seeks to determine $(Q, D_1, \dots, D_N, K_1, \dots, K_N)$ by fitting the model to the $N$ asymmetric data matrices $X_i$ in a least-squares sense [9,10,12,13,14]:
$$\min_{Q \in \mathcal{O}(n,r),\ (D_1,\dots,D_N) \in \mathcal{D}_+^N,\ (K_1,\dots,K_N) \in \mathcal{K}^N} \ \sum_{i=1}^{N} \|X_i - Q(D_i + K_i)Q^\top\|_F^2, \qquad (5)$$
where $\mathcal{D}_+^N$ and $\mathcal{K}^N$ denote $N$ independent copies of the nonnegative diagonal cone and the skew-symmetric matrix space, respectively.
In this work, we revisit the numerical challenge of fitting the three-way GIPSCAL model (5), a problem characterized by nonlinear coupling among variables and the need for scalable, efficient algorithms. Despite its relevance in multivariate data analysis, the literature on this topic remains relatively sparse. Early contributions by Trendafilov [10,13] reformulated problem (5) as a constrained gradient dynamical system and proposed a continuous-time projected gradient flow algorithm. This method guarantees global convergence and has shown strong empirical performance across various applications in multivariate analysis [9,15,16,17,18]. Nevertheless, its scalability may be limited in large-scale settings due to computational inefficiencies. To accelerate the inherently slow convergence of alternating least squares (ALS) methods, Loisel and Takane [19] proposed a minimal polynomial extrapolation (MPE) scheme, leveraging vector-sequence fixed-point iteration. Empirical results indicate that this approach can substantially speed up convergence. However, in practical implementations, selecting an appropriate backtracking step size often relies on heuristic tuning rather than principled criteria. More recently, Trendafilov and Gallo [9] explored optimization-based reformulations of multivariate models on matrix manifolds and demonstrated the effectiveness of the Manopt toolbox [20] in addressing such problems through Riemannian optimization techniques. Motivated by these developments, this paper builds upon the fixed-point acceleration strategy of Loisel and Takane [19], extending its application to problem (5). We propose a new algorithmic framework that interprets approximate ALS iterations as a matrix-valued fixed-point iteration. Furthermore, we develop acceleration schemes based on the vector ε-algorithm (VEA), the topological ε-algorithm (TEA), and its simplified variant (STEA) [21,22], incorporating recent advances in numerical extrapolation and available algorithmic toolboxes. Extensive numerical experiments show that, compared with the original matrix sequences generated by the fixed-point iteration, matrix-sequence extrapolation significantly improves convergence. Furthermore, when compared with existing solvers for problem (5), such as the continuous-time projected gradient flow algorithm and the Riemannian solvers of the Manopt toolbox, the ε-algorithm-accelerated fixed-point iterations achieve a notable reduction in iteration time.
The remainder of the paper is organized as follows. In Section 2, we present the fixed-point iteration framework for solving the three-way GIPSCAL problem in (5). Section 3 introduces the core acceleration principles and describes the implementation of the VEA, TEA, and STEA within this context. Section 4 reports a comprehensive set of numerical experiments, benchmarking the proposed acceleration schemes against the continuous-time projected gradient flow method and several first- and second-order Riemannian optimization algorithms implemented in Manopt. Finally, Section 5 concludes the paper.
2. Fixed-Point Iteration Framework for Problem (5)
Building on the conditional minimization strategy introduced by Loisel and Takane [19], the three-way GIPSCAL problem in (5) can be reformulated as a fixed-point iteration scheme. Their original framework also incorporates minimal polynomial extrapolation (MPE) to accelerate convergence. For the sake of completeness, we revisit and extend this approach by developing an alternating least squares (ALS) framework, which naturally leads to a fixed-point iteration formulation.
Let $\mathcal{S}$ denote the space of symmetric $r \times r$ matrices. For a point $Z = (Q, D_1, \dots, D_N, K_1, \dots, K_N)$ in the product space $\mathcal{O}(n,r) \times \mathcal{D}_+^N \times \mathcal{K}^N$, we define the residual mapping
$$E_i(Z) = X_i - Q(D_i + K_i)Q^\top, \qquad i = 1, \dots, N,$$
which allows us to express the objective function of problem (5) as
$$f(Z) = \sum_{i=1}^{N} \|E_i(Z)\|_F^2.$$
Since the variables $D_i$ and $K_i$ are independent for each slice $i$, a straightforward algebraic derivation decomposes the Euclidean gradient of $f$ component-wise as follows:
$$\nabla_Q f = -2\sum_{i=1}^{N}\left[E_i(Z)\,Q(D_i + K_i)^\top + E_i(Z)^\top Q(D_i + K_i)\right], \quad \nabla_{D_i} f = -2\,\mathrm{Diag}\!\left(Q^\top E_i(Z)\,Q\right), \quad \nabla_{K_i} f = -2\,\mathrm{skew}\!\left(Q^\top E_i(Z)\,Q\right).$$
Here, $\mathrm{Diag}(A)$ retains the diagonal part of $A$, while $\mathrm{sym}(A) = (A + A^\top)/2$ and $\mathrm{skew}(A) = (A - A^\top)/2$ denote the symmetric and skew-symmetric parts of a matrix $A$, respectively.
Given a current iterate $Z^{(s)} = (Q^{(s)}, D_i^{(s)}, K_i^{(s)})$, the ALS-based iterative scheme for solving (5) is defined by the following update rules:
$$D_i^{(s+1)} = \arg\min_{D_i \in \mathcal{D}_+} f\big(Q^{(s)}, D_i, K_i^{(s)}\big), \qquad K_i^{(s+1)} = \arg\min_{K_i \in \mathcal{K}} f\big(Q^{(s)}, D_i^{(s+1)}, K_i\big), \qquad Q^{(s+1)} = \arg\min_{Q \in \mathcal{O}(n,r)} f\big(Q, D_i^{(s+1)}, K_i^{(s+1)}\big). \qquad (8)$$
The update for $D_i$ involves solving a convex constrained matrix optimization problem:
$$\min_{D_i \in \mathcal{D}_+} \ \|X_i - Q(D_i + K_i)Q^\top\|_F^2. \qquad (9)$$
By deriving the first-order optimality condition, we obtain the variational inequality
$$\left\langle \nabla_{D_i} f(D_i^\star),\ D_i - D_i^\star \right\rangle \ \ge \ 0 \qquad \text{for all } D_i \in \mathcal{D}_+. \qquad (10)$$
According to Theorem 3.1.1 of Hiriart-Urruty and Lemaréchal [23], this is equivalent to solving the implicit projection equation:
$$D_i^\star = P_{\mathcal{D}_+}\!\left(D_i^\star - \nabla_{D_i} f(D_i^\star)\right). \qquad (11)$$
Here, the projection $P_{\mathcal{D}_+}$ is defined element-wise via
$$P_{\mathcal{D}_+}(A) = \max\!\left(A \odot I_r,\ 0\right), \qquad (12)$$
where ⊙ denotes the Hadamard (element-wise) product. The closed-form solution is thus
$$D_i^{(s+1)} = \max\!\left(\mathrm{Diag}\!\left(\mathrm{sym}\!\left(Q^\top X_i Q\right)\right),\ 0\right). \qquad (13)$$
Since $\mathcal{K}$ is a linear subspace, the optimal $K_i$ satisfies the projected gradient condition
$$P_{\mathcal{K}}\!\left(\nabla_{K_i} f(K_i^\star)\right) = 0,$$
where $P_{\mathcal{K}}(A) = \mathrm{skew}(A)$. Hence, the solution is explicitly given by
$$K_i^{(s+1)} = \mathrm{skew}\!\left(Q^\top X_i Q\right). \qquad (14)$$
The update for $Q$ entails solving the following orthogonally constrained, nonconvex optimization problem:
$$\min_{Q \in \mathcal{O}(n,r)} \ \sum_{i=1}^{N} \|X_i - Q(D_i + K_i)Q^\top\|_F^2. \qquad (15)$$
The associated Lagrangian, incorporating the constraint $Q^\top Q = I_r$, is given by
$$\mathcal{L}(Q, \Lambda) = \sum_{i=1}^{N} \|X_i - Q(D_i + K_i)Q^\top\|_F^2 + \left\langle \Lambda,\ Q^\top Q - I_r \right\rangle,$$
where $\Lambda \in \mathcal{S}$ is a symmetric Lagrange multiplier matrix. The first-order optimality condition yields
$$\nabla_Q f + 2\,Q\Lambda = 0. \qquad (16)$$
Although solving (16) analytically is challenging, we follow the strategy of Loisel and Takane [19] and approximate the solution as follows: compute the thin singular value decomposition
$$\sum_{i=1}^{N}\left[X_i Q (D_i + K_i)^\top + X_i^\top Q (D_i + K_i)\right] = U \Sigma V^\top,$$
and set
$$Q^{(s+1)} = U V^\top. \qquad (18)$$
By integrating the subproblem updates (13), (14), and (18) within the ALS scheme from (8), we obtain the iterative sequence
$$Z^{(s)} = \left(Q^{(s)}, D_1^{(s)}, \dots, D_N^{(s)}, K_1^{(s)}, \dots, K_N^{(s)}\right), \qquad s = 0, 1, 2, \dots$$
This iterative process naturally defines a fixed-point iteration:
$$Z^{(s+1)} = \Phi\!\left(Z^{(s)}\right). \qquad (19)$$
Here, $\Phi$ denotes the nonlinear mapping associated with one complete ALS update.
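For concreteness, one complete ALS sweep, i.e., the map $\Phi$ in (19), can be sketched in MATLAB as follows. This is a minimal illustration, assuming that X is an n-by-n-by-N array stacking the slices $X_i$ and that D and K are r-by-r-by-N arrays; the function name phi_map and the variable layout are ours, not part of a published implementation.

function [Q, D, K] = phi_map(X, Q, D, K)
    % One ALS sweep: applies updates (13), (14), and (18).
    [n, r] = size(Q);  N = size(X, 3);
    M = zeros(n, r);                       % accumulates the matrix whose thin SVD yields (18)
    for i = 1:N
        C = Q' * X(:,:,i) * Q;             % r-by-r compression of the i-th slice
        D(:,:,i) = max(diag(diag((C + C')/2)), 0);   % update (13): nonnegative diagonal part
        K(:,:,i) = (C - C')/2;                       % update (14): skew-symmetric part
        A = D(:,:,i) + K(:,:,i);
        M = M + X(:,:,i) * Q * A' + X(:,:,i)' * Q * A;
    end
    [U, ~, V] = svd(M, 'econ');            % thin ("economy-size") SVD
    Q = U * V';                            % update (18)
end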
To further analyze the convergence behavior of problem (5) and iteration (19), we present the following theoretical results:
Theorem 1.
Problem (5) has a global optimal solution.
Proof. By the closed-form updates (13) and (14), the minimization may be restricted to the subset of the feasible set $\mathcal{O}(n,r) \times \mathcal{D}_+^N \times \mathcal{K}^N$ on which $\|D_i\|_F$ and $\|K_i\|_F$ are bounded by $\max_i \|X_i\|_F$; this subset is compact under the Frobenius norm topology. The objective function $f$ is continuous. Therefore, by the Weierstrass extreme value theorem, $f$ achieves its minimum value on this set. □
Theorem 2.
Problem (5) does not have a closed-form analytical solution. This conclusion holds even in special cases such as $N = 1$.
Proof. (i) The nonconvex feasible set induced by the orthogonality constraint, together with the quartic dependence of the objective function on $Q$, results in an indefinite Hessian matrix at the critical points.
(ii) When $D_i$ and $K_i$ are fixed, the $Q$-subproblem degenerates into a generalized orthogonal Procrustes problem,
$$\min_{Q \in \mathcal{O}(n,r)} \ \sum_{i=1}^{N} \|X_i - Q(D_i + K_i)Q^\top\|_F^2,$$
where the asymmetry of $D_i + K_i$ (due to $K_i \neq 0$) leaves this subproblem without an analytical solution (unlike the case of a symmetric target, which can be solved by eigenvalue decomposition).
(iii) The slices are coupled through the summation in the objective function.
Together, (i)–(iii) exclude the possibility of a closed-form solution, so numerical iterative methods must be used to approximate it. □
Theorem 3.
The iterative sequence $(Z^{(s)})$, where $Z^{(s)} = (Q^{(s)}, D_1^{(s)}, \dots, D_N^{(s)}, K_1^{(s)}, \dots, K_N^{(s)})$, generated by alternating least squares for problem (5), has an objective function value sequence $(f(Z^{(s)}))$ that converges to a non-negative limit $L$.
Proof. The objective function is a sum of squared Frobenius norms, so its value is always non-negative; thus, the sequence $(f(Z^{(s)}))$ is bounded below by 0. In the alternating update process:
Fixing $(Q^{(s)}, K_i^{(s)})$ and updating $D_i$, the global optimality of the convex subproblem solved by (13) ensures that $f(Q^{(s)}, D_i^{(s+1)}, K_i^{(s)}) \le f(Z^{(s)})$.
Fixing $(Q^{(s)}, D_i^{(s+1)})$ and updating $K_i$, the closed-form solution in (14) guarantees that $f(Q^{(s)}, D_i^{(s+1)}, K_i^{(s+1)}) \le f(Q^{(s)}, D_i^{(s+1)}, K_i^{(s)})$.
Fixing $(D_i^{(s+1)}, K_i^{(s+1)})$ and updating $Q$, the orthogonal Procrustes-type projection in (18) ensures that $f(Z^{(s+1)}) \le f(Q^{(s)}, D_i^{(s+1)}, K_i^{(s+1)})$.
Thus, $f(Z^{(s+1)}) \le f(Z^{(s)})$ for all $s$. By the monotone convergence theorem for bounded monotone sequences, $(f(Z^{(s)}))$ converges to a limit $L \ge 0$. □
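The monotonicity established in Theorem 3 is easy to verify numerically. The following MATLAB fragment, using the phi_map sketch above with X, Q, D, K initialized as in Section 4, checks that the objective values are nonincreasing:

fval = @(X, Q, D, K) sum(arrayfun(@(i) ...
    norm(X(:,:,i) - Q*(D(:,:,i) + K(:,:,i))*Q', 'fro')^2, 1:size(X,3)));
fs = zeros(1, 100);
for s = 1:100
    [Q, D, K] = phi_map(X, Q, D, K);
    fs(s) = fval(X, Q, D, K);
end
assert(all(diff(fs) <= 1e-12))   % nonincreasing, up to rounding errors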
3. ε-Algorithms Acceleration for the Fixed-Point Problem (19)
In numerical analysis and applied mathematics, sequences arise naturally across a broad range of computational problems. When a sequence converges slowly, acceleration techniques are often employed to improve the convergence rate. A common strategy involves transforming the original sequence into another that converges more rapidly to the same limit, assuming appropriate regularity conditions. One of the most influential transformations in this context is the Shanks transformation [24], originally derived by Schmidt [25] for iterative solutions of linear systems. It was later implemented algorithmically through the scalar ε-algorithm introduced by Wynn [26]. In a subsequent extension, Wynn [27] generalized the scalar ε-algorithm to handle vector-valued sequences. However, the algebraic structure underlying the vector version does not follow directly from the scalar case. To address this gap, Brezinski [28] proposed two distinct generalizations of the Shanks transformation and its associated algorithms to sequences in vector spaces. This led to the formulation of the topological Shanks transformation and the development of two corresponding topological ε-algorithms (TEA1 and TEA2). These algorithms operate using elements from both a vector space $E$ and its dual space $E^*$, enabling a rigorous extension to infinite-dimensional settings. Recognizing the computational overhead associated with dual-space operations, Brezinski [29] introduced the simplified topological ε-algorithms (STEA1 and STEA2). These variants avoid direct manipulation of dual-space elements by substituting scalar ε-algorithm outputs, thereby reducing memory usage and enhancing numerical stability. In parallel, the Shanks transformation has inspired a family of vector extrapolation methods, including minimal polynomial extrapolation (MPE), modified minimal polynomial extrapolation (MMPE), and reduced-rank extrapolation (RRE). These techniques, collectively referred to as vector extrapolation methods, share the advantages of a simple iterative structure and the avoidance of explicit matrix decompositions. Owing to their general applicability and efficiency, ε-type and vector extrapolation algorithms have found widespread use in the numerical solution of linear and nonlinear systems, eigenvalue computations, Padé-type approximations, matrix functions, matrix equations, and Krylov subspace methods such as Lanczos iterations [28,30,31,32,33].
In this section, we introduce the principles and implementation details of three algorithms: VEA, TEA, and STEA. We then explain how each of these methods can be systematically integrated into the fixed-point iteration scheme of the previous section to accelerate convergence when solving the three-way GIPSCAL problem.
3.1. Scalar Shanks Transformation and Scalar ε-Algorithm
Let $(S_n)$ be a sequence of scalars in the field $\mathbb{K}$, where $\mathbb{K}$ is either $\mathbb{R}$ or $\mathbb{C}$. If $\lim_{n \to \infty} S_n = S$, then, under certain conditions, the sequence $(S_n)$ can be transformed into a new sequence $(T_n)$ that converges to the same limit more efficiently, as described by
$$\lim_{n \to \infty} \frac{T_n - S}{S_n - S} = 0. \qquad (20)$$
Shanks [24] introduced a transformation technique in his 1955 study, which can be used to determine the anticipated limiting value of a sequence from a finite number of its terms. The Shanks transformation assumes that the sequence satisfies the following relation:
$$\sum_{j=0}^{k} a_j \left(S_{n+j} - S\right) = 0 \qquad \text{for all } n. \qquad (21)$$
Here, each coefficient $a_j$ is an arbitrary constant independent of $n$, and the coefficients satisfy the condition $a_0 a_k \neq 0$ with $\sum_{j=0}^{k} a_j \neq 0$. Assuming that Equation (21) holds for all $n$, we can expand and rearrange it to derive the difference form
$$\sum_{j=0}^{k} a_j \,\Delta S_{n+j} = 0, \qquad (22)$$
where the forward difference operator $\Delta$ is defined as $\Delta S_n = S_{n+1} - S_n$. To determine the $k+1$ coefficients $a_0, \dots, a_k$, we further require that $\sum_{j=0}^{k} a_j = 1$, leading to a linear system consisting of this normalization and $k$ scalar equations derived from (22):
$$\sum_{j=0}^{k} a_j = 1, \qquad \sum_{j=0}^{k} a_j \,\Delta S_{n+i+j} = 0, \quad i = 0, \dots, k-1. \qquad (23)$$
The terms $S_{n+j}$ can then be linearly combined with the coefficients $a_j$ to obtain $S$:
$$S = \sum_{j=0}^{k} a_j S_{n+j}. \qquad (24)$$
Even if the sequence $(S_n)$ does not satisfy the relation in Equation (21), the coefficients $a_j$ can still be determined by solving the linear system in Equation (23) using a similar approach. Consequently, an approximate limit of the sequence can be obtained via the linear combination outlined in Equation (24). It is important to note that these coefficients and the approximate solution now depend on the current starting index $n$ and the depth $k$ of the acceleration window. These dependencies are denoted as $a_j^{(n,k)}$ and $e_k(S_n)$, respectively. Therefore, we have
$$e_k(S_n) = \sum_{j=0}^{k} a_j^{(n,k)} S_{n+j}, \qquad (25)$$
where the $a_j^{(n,k)}$ satisfy the linear system (23) with the corresponding starting index $n$. The original sequence $(S_n)$ is transformed into a new sequence $(e_k(S_n))$, and this transformation, $e_k : (S_n) \mapsto (e_k(S_n))$, is known as the Shanks transformation. By applying Cramer's rule for solving systems of linear equations, $e_k(S_n)$ can be written as the ratio of determinants, as shown below:
$$e_k(S_n) = \frac{\begin{vmatrix} S_n & S_{n+1} & \cdots & S_{n+k} \\ \Delta S_n & \Delta S_{n+1} & \cdots & \Delta S_{n+k} \\ \vdots & \vdots & & \vdots \\ \Delta S_{n+k-1} & \Delta S_{n+k} & \cdots & \Delta S_{n+2k-1} \end{vmatrix}}{\begin{vmatrix} 1 & 1 & \cdots & 1 \\ \Delta S_n & \Delta S_{n+1} & \cdots & \Delta S_{n+k} \\ \vdots & \vdots & & \vdots \\ \Delta S_{n+k-1} & \Delta S_{n+k} & \cdots & \Delta S_{n+2k-1} \end{vmatrix}}. \qquad (26)$$
Since directly computing the determinants in Equation (26) is relatively complex, Wynn [26] proposed the scalar ε-algorithm (SEA), which implements the Shanks transformation using a straightforward recursive procedure. The computational rules of the SEA are given by
$$\varepsilon_{-1}^{(n)} = 0, \qquad \varepsilon_{0}^{(n)} = S_n, \qquad \varepsilon_{k+1}^{(n)} = \varepsilon_{k-1}^{(n+1)} + \left(\varepsilon_{k}^{(n+1)} - \varepsilon_{k}^{(n)}\right)^{-1}, \qquad k, n = 0, 1, \dots \qquad (27)$$
These elements are typically organized into a two-dimensional array, as shown in Figure 1, which is referred to as the ε-table. Rule (27) establishes a connection between the four vertices of a diamond in the ε-table, where the column index $k$ remains constant along each column and the row index $n$ remains constant along the descending diagonals. Furthermore, by applying Sylvester's and Schweins' determinant identities, Wynn [26] demonstrated the following relationship between the Shanks transformation and the ε-algorithm:
$$\varepsilon_{2k}^{(n)} = e_k(S_n), \qquad \varepsilon_{2k+1}^{(n)} = \frac{1}{e_k(\Delta S_n)}. \qquad (28)$$
3.2. Topological Shanks Transformation and Topological ε-Algorithm
Assume now that $(S_n)$ is a sequence of vectors in a vector space $E$. The Samelson inverse of a nonzero vector $y \in E$ is defined by the following formula:
$$y^{-1} = \frac{y}{\langle y, y \rangle}, \qquad (29)$$
where $\langle \cdot, \cdot \rangle$ denotes the standard inner product on the vector space $E$. Wynn [27] proposed the vector ε-algorithm (VEA) by "vectorizing" the scalar ε-algorithm: in the recursion (27), the scalar reciprocal is replaced by the Samelson inverse (29).
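In MATLAB, the Samelson inverse (29) is a one-liner, and it is the only change needed to turn the scalar recursion (27) into the VEA (the ascending-diagonal SEA routine sketched in Section 3.4 becomes the VEA by this substitution):

samelson = @(y) y / (y(:)' * y(:));    % y^{-1} = y / <y, y>, for real vectors or matrices
% In the rhombus rule (27), replace 1/(W - t) by samelson(W - t)
% to accelerate vector (or vectorized matrix) sequences.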
However, the Shanks transformation itself cannot be directly extended to vector spaces. To address this, Brezinski et al. [34] introduced the algebraic dual space $E^*$ of the vector space $E$, along with an auxiliary linear functional $y \in E^*$, and used the duality product $\langle \cdot, \cdot \rangle : E^* \times E \to \mathbb{K}$ to derive two types of topological Shanks transformations. The first topological Shanks transformation is defined by the formula
$$\hat{e}_k(S_n) = \sum_{j=0}^{k} a_j^{(n,k)} S_{n+j}, \qquad (30)$$
where the coefficients $a_j^{(n,k)}$ satisfy the following system of linear equations:
$$\sum_{j=0}^{k} a_j^{(n,k)} = 1, \qquad \sum_{j=0}^{k} a_j^{(n,k)} \left\langle y,\ \Delta S_{n+i+j} \right\rangle = 0, \quad i = 0, \dots, k-1. \qquad (31)$$
Similar to the scalar Shanks transformation, $\hat{e}_k(S_n)$ can be expressed as a ratio of determinants as follows:
$$\hat{e}_k(S_n) = \frac{\begin{vmatrix} S_n & \cdots & S_{n+k} \\ \langle y, \Delta S_n \rangle & \cdots & \langle y, \Delta S_{n+k} \rangle \\ \vdots & & \vdots \\ \langle y, \Delta S_{n+k-1} \rangle & \cdots & \langle y, \Delta S_{n+2k-1} \rangle \end{vmatrix}}{\begin{vmatrix} 1 & \cdots & 1 \\ \langle y, \Delta S_n \rangle & \cdots & \langle y, \Delta S_{n+k} \rangle \\ \vdots & & \vdots \\ \langle y, \Delta S_{n+k-1} \rangle & \cdots & \langle y, \Delta S_{n+2k-1} \rangle \end{vmatrix}}. \qquad (32)$$
Here, the determinant in the numerator is understood as a formal expansion with respect to its first row. Furthermore, the second topological Shanks transformation, $\tilde{e}_k(S_n)$, can be obtained by replacing $S_{n+j}$ with $S_{n+k+j}$ in (30).
We then introduce an ordered vector pair $(u, v) \in E \times E^*$ and define the inverse of the ordered vector pair as follows:
$$u^{-1} = \frac{v}{\langle v, u \rangle} \in E^*, \qquad v^{-1} = \frac{u}{\langle v, u \rangle} \in E. \qquad (33)$$
Based on this definition, Brezinski and Redivo-Zaglia [21] derived two distinct forms of topological ε-algorithms, denoted as TEA1 and TEA2. The recursive rule of the first topological ε-algorithm (TEA1) for computing $\hat{e}_k(S_n)$ is given by
$$\varepsilon_{-1}^{(n)} = 0, \qquad \varepsilon_{0}^{(n)} = S_n, \qquad \varepsilon_{k+1}^{(n)} = \varepsilon_{k-1}^{(n+1)} + \left(\varepsilon_{k}^{(n+1)} - \varepsilon_{k}^{(n)}\right)^{-1}, \qquad (34)$$
with the inverses taken in the sense of (33). Due to the different inversion rules for elements in $E$ and in its algebraic dual space $E^*$, as stated in Equation (33), and the dependence on the ordered vector pair, the calculation rules for the odd-indexed and even-indexed sequences in TEA1 differ from those in SEA and VEA. Specifically, for the odd-indexed sequence in TEA1, corresponding to the calculation rule (27) in SEA, we have
$$\varepsilon_{2k+1}^{(n)} = \varepsilon_{2k-1}^{(n+1)} + \frac{y}{\left\langle y,\ \varepsilon_{2k}^{(n+1)} - \varepsilon_{2k}^{(n)} \right\rangle}, \qquad (35)$$
where $\varepsilon_{2k}^{(n)} \in E$ and $\varepsilon_{2k+1}^{(n)} \in E^*$, and the inversion corresponds to the ordered vector pair $\left(\varepsilon_{2k}^{(n+1)} - \varepsilon_{2k}^{(n)},\ y\right)$. Similarly, corresponding to (27), for the even-indexed sequence in TEA1 we have
$$\varepsilon_{2k+2}^{(n)} = \varepsilon_{2k}^{(n+1)} + \frac{\varepsilon_{2k}^{(n+1)} - \varepsilon_{2k}^{(n)}}{\left\langle \varepsilon_{2k+1}^{(n+1)} - \varepsilon_{2k+1}^{(n)},\ \varepsilon_{2k}^{(n+1)} - \varepsilon_{2k}^{(n)} \right\rangle}, \qquad (36)$$
and the inversion corresponds to the ordered vector pair $\left(\varepsilon_{2k}^{(n+1)} - \varepsilon_{2k}^{(n)},\ \varepsilon_{2k+1}^{(n+1)} - \varepsilon_{2k+1}^{(n)}\right)$.
The recursive rule for computing $\tilde{e}_k(S_n)$ using the second topological ε-algorithm (TEA2) is given by
$$\varepsilon_{-1}^{(n)} = 0, \qquad \varepsilon_{0}^{(n)} = S_n, \qquad \varepsilon_{k+1}^{(n)} = \varepsilon_{k-1}^{(n+1)} + \left(\varepsilon_{k}^{(n+1)} - \varepsilon_{k}^{(n)}\right)^{-1},$$
with the inverses again taken in the sense of (33). The computation of the odd-indexed sequence in TEA2 follows the same rule (35) as in TEA1. For the even-indexed sequence, corresponding to (27), we have
$$\varepsilon_{2k+2}^{(n)} = \varepsilon_{2k}^{(n+1)} + \frac{\varepsilon_{2k}^{(n+2)} - \varepsilon_{2k}^{(n+1)}}{\left\langle \varepsilon_{2k+1}^{(n+1)} - \varepsilon_{2k+1}^{(n)},\ \varepsilon_{2k}^{(n+2)} - \varepsilon_{2k}^{(n+1)} \right\rangle},$$
and, unlike TEA1, the inversion in TEA2 for $\left(\varepsilon_{2k+1}^{(n+1)} - \varepsilon_{2k+1}^{(n)}\right)^{-1}$ corresponds to the ordered vector pair $\left(\varepsilon_{2k}^{(n+2)} - \varepsilon_{2k}^{(n+1)},\ \varepsilon_{2k+1}^{(n+1)} - \varepsilon_{2k+1}^{(n)}\right)$. Note the connection between the topological Shanks transformations and the topological ε-algorithms. For the first topological Shanks transformation $\hat{e}_k$ and TEA1, the following holds:
$$\varepsilon_{2k}^{(n)} = \hat{e}_k(S_n), \qquad \varepsilon_{2k+1}^{(n)} = \frac{y}{\left\langle y,\ \hat{e}_k(\Delta S_n) \right\rangle};$$
for the second topological Shanks transformation $\tilde{e}_k$ and TEA2, the following holds:
$$\varepsilon_{2k}^{(n)} = \tilde{e}_k(S_n), \qquad \varepsilon_{2k+1}^{(n)} = \frac{y}{\left\langle y,\ \tilde{e}_k(\Delta S_n) \right\rangle}.$$
The computational rules for the odd-indexed and even-indexed sequences in TEA1 and TEA2 are summarized in Figure 2. For odd indices, the computational rules of the two topological ε-algorithms are identical and consistent with those of the scalar ε-algorithm (SEA): the computation of $\varepsilon_{2k+1}^{(n)}$ requires only the three elements located at the vertices of the diamond structure, namely $\varepsilon_{2k-1}^{(n+1)}$, $\varepsilon_{2k}^{(n)}$, and $\varepsilon_{2k}^{(n+1)}$. However, for even-indexed sequences, both TEA1 and TEA2 require additional elements beyond these three. Moreover, during the recursion process, both topological ε-algorithms must simultaneously store information about the even-indexed sequence in $E$ and the odd-indexed sequence in the algebraic dual space $E^*$, thereby significantly increasing the storage burden in large-scale computations. Furthermore, both TEA1 and TEA2 require duality product calculations during the recursion process, which further increases the computational complexity.
3.3. Simplified Topological ε-Algorithm
To streamline the computational rules of the two topological ε-algorithms, Brezinski and Redivo-Zaglia [21] proposed simplified versions of these algorithms (STEA1 and STEA2) for TEA1 and TEA2. These algorithms combine the odd and even recursive rules of TEA into a single unified rule, thereby requiring the storage of only the even-indexed column vectors. This streamlined computational process not only reduces storage requirements and algorithmic complexity but also enhances overall efficiency, leading to improved performance in large-scale data processing.
For the scalar sequence $s_n = \langle y, S_n \rangle$, based on the relationship in Equation (28) between the scalar Shanks transformation and the SEA, the following holds:
$$\hat{\varepsilon}_{2k}^{(n)} = e_k(s_n), \qquad \hat{\varepsilon}_{2k+1}^{(n)} = \frac{1}{e_k(\Delta s_n)}, \qquad (37)$$
where $\hat{\varepsilon}_{j}^{(n)}$ denotes the scalar ε-table generated from $(s_n)$. From Equations (35) and (37), the following holds:
$$\varepsilon_{2k+1}^{(n)} = \hat{\varepsilon}_{2k+1}^{(n)}\, y.$$
Therefore, the recursive rule for the even-indexed sequence in TEA1 can be rewritten as
$$\varepsilon_{2k+2}^{(n)} = \varepsilon_{2k}^{(n+1)} + \frac{\varepsilon_{2k}^{(n+1)} - \varepsilon_{2k}^{(n)}}{\left(\hat{\varepsilon}_{2k+1}^{(n+1)} - \hat{\varepsilon}_{2k+1}^{(n)}\right)\left\langle y,\ \varepsilon_{2k}^{(n+1)} - \varepsilon_{2k}^{(n)} \right\rangle}.$$
Then, by combining the recursive rules in Equations (27) and (34), the following four equivalent forms for the even indices of the STEA1 algorithm can be derived:
$$\varepsilon_{2k+2}^{(n)} = \varepsilon_{2k}^{(n+1)} + \omega_k^{(n)}\left(\varepsilon_{2k}^{(n+1)} - \varepsilon_{2k}^{(n)}\right), \qquad \text{e.g.,} \quad \omega_k^{(n)} = \frac{\hat{\varepsilon}_{2k+2}^{(n)} - \hat{\varepsilon}_{2k}^{(n+1)}}{\hat{\varepsilon}_{2k}^{(n+1)} - \hat{\varepsilon}_{2k}^{(n)}},$$
where the three remaining expressions for the scalar factor $\omega_k^{(n)}$ follow from the rhombus rule (27) applied to the scalar table. Similarly, four mutually equivalent recursive formulas can be derived for the even-indexed sequences in STEA2, with the shifted difference $\varepsilon_{2k}^{(n+2)} - \varepsilon_{2k}^{(n+1)}$ in place of $\varepsilon_{2k}^{(n+1)} - \varepsilon_{2k}^{(n)}$.
From the above derivation, it is evident that the generation of $\varepsilon_{2k+2}^{(n)}$ in both STEA1 and STEA2 depends solely on the even-indexed sequence, with the odd-indexed sequence serving only as an auxiliary sequence. As a result, during the recursive processes of both simplified topological ε-algorithms, only the information of the even-indexed sequences needs to be stored, eliminating the need to store the odd-indexed sequences. Additionally, in STEA1 and STEA2, the number of pairwise product operations is reduced during recursion. The linear functional $y$ in $E^*$ is involved solely in the duality product operation with the initial sequence $(S_n)$ in $E$, generating a scalar sequence $s_n = \langle y, S_n \rangle$. Furthermore, the recursion for this scalar sequence can be performed using the SEA, as shown in Equation (27).
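In other words, the only dual-space computation left in STEA is forming the scalar sequence $s_n = \langle y, S_n \rangle$ that feeds the SEA. For a matrix-valued iterate with the self-dual pairing of Remark 2, this costs one weighted sum per term, e.g. in MATLAB (y taken as the all-ones matrix, an assumption consistent with Remark 2; Sseq is a cell array holding the sequence):

y = ones(size(Sseq{1}));
dual = @(Y, S) sum(sum(Y .* S));            % <Y, S> = trace(Y' * S)
s = cellfun(@(S) dual(y, S), Sseq);         % scalar sequence passed to the SEA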
3.4. Implementation of the ε-Algorithms
The core principle of the ε-algorithms is founded on the Shanks transformation, which exploits the linear difference structure of the sequence by forming specific linear combinations that eliminate the dominant error terms during convergence. The algorithms evaluate these linear combinations through explicit recursive rules, thereby accelerating the convergence of the sequence.
The implementation of the ε-algorithms (SEA, VEA, TEA, and STEA) and the construction of the associated ε-table are most directly achieved by storing all elements corresponding to each pair of indices $k$ and $n$. This process begins with a prescribed number of terms in the first two columns and proceeds recursively, computing subsequent entries column by column. As each successive column contains one fewer element than the previous one, the resulting structure forms the lower triangular part of the ε-table. However, this approach requires retaining all elements within the triangular region, which can lead to considerable memory overhead, especially in the case of vector- or matrix-valued sequences.
To address this limitation, a more memory-efficient strategy computes the terms of the original sequence incrementally and constructs the ε-table along ascending diagonals [21,22]. Specifically, after generating an initial triangular portion of the table and retaining only its last ascending diagonal, a new element of the original sequence $(S_n)$ is introduced, and the next ascending diagonal is computed iteratively. This approach requires storing only a single ascending diagonal and three auxiliary temporary variables, thereby significantly reducing the storage demands.
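The following MATLAB function makes this diagonal-by-diagonal scheme concrete for the SEA. It is an illustrative implementation of rule (27) (the function and variable names are ours): a single ascending diagonal e and two temporaries are stored, and after feeding 2k+1 terms, W holds the accelerated even-column value $\varepsilon_{2k}$ computed from the start of the sequence.

function [W, e] = sea_diagonal(S, k)
    % S: vector with at least 2k+1 terms of the sequence; e: current ascending diagonal.
    m = 2*k + 1;
    e = zeros(1, m);  e(1) = S(1);
    for i = 2:m
        W = S(i);                       % eps_0 on the new ascending diagonal
        v = 0;                          % plays the role of eps_{-1} = 0
        for j = 1:i-1
            t = e(j);                   % entry on the previous diagonal
            u = v + 1/(W - t);          % rhombus rule (27)
            v = t;  e(j) = W;  W = u;
        end
        e(i) = W;
    end
end

Replacing the reciprocal 1/(W - t) by the Samelson inverse of Section 3.2 turns the same loop into the VEA.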
Implementations of the SEA, VEA, TEA, and STEA algorithms were previously provided in the MATLAB toolbox EPSfun [22]. In this paper, we present a more intuitive and transparent implementation of these algorithms. Specifically, Algorithms 1 and 2 illustrate the procedures for SEA and VEA, respectively. Although both algorithms share the same fundamental computational structure, they differ in how the inverse operation is treated; in particular, the inverse operation in VEA is the Samelson inverse defined in Equation (29). For a fixed acceleration window of width $k$, both SEA and VEA compute new elements incrementally along the ascending diagonals until the column of index $2k$ is completed. In Algorithm 2, the inner product on $E$ is the standard Euclidean inner product, namely $\langle u, v \rangle = u^\top v$ for all $u, v \in E$.
Algorithm 1 Scalar ε-algorithm (SEA)
Require: 2k+1 elements of the scalar sequence $(S_n)$: $S_1, \dots, S_{2k+1}$, where $k$ is the acceleration window width.
Ensure: Return the values $W$ and $e$.
1: $e_1 \leftarrow S_1$
2: for $i = 2, \dots, 2k+1$ do
3:  $W \leftarrow S_i$; $v \leftarrow 0$
4:  for $j = 1, \dots, i-1$ do
5:   $t \leftarrow e_j$; $u \leftarrow v + 1/(W - t)$
6:   $v \leftarrow t$; $e_j \leftarrow W$; $W \leftarrow u$
7:  end for
8:  $e_i \leftarrow W$
9: end for
Algorithm 2 Vector ε-algorithm (VEA)
Require: 2k+1 elements of the vector sequence $(S_n)$: $S_1, \dots, S_{2k+1}$, where $k$ is the acceleration window width.
Ensure: Return the values $W$ and $e$.
1: $e_1 \leftarrow S_1$
2: for $i = 2, \dots, 2k+1$ do
3:  $W \leftarrow S_i$; $v \leftarrow 0$ (a zero vector of the same size as $S_i$)
4:  for $j = 1, \dots, i-1$ do
5:   $t \leftarrow e_j$; $u \leftarrow v + (W - t)/\langle W - t,\, W - t\rangle$
6:   $v \leftarrow t$; $e_j \leftarrow W$; $W \leftarrow u$
7:  end for
8:  $e_i \leftarrow W$
9: end for
Algorithm 3 presents the implementation of the TEA. It is important to note that TEA applies distinct computational rules to the subsequences with odd and even indices. Specifically, the computational rules for odd-indexed terms are identical to those used in SEA and VEA, whereas the computation of even-indexed terms requires the introduction of additional elements. Since the construction of the ε-table proceeds in a top-down manner, it is sufficient to store only the elements along a single ascending diagonal. In TEA2, the additional elements required for computing even-indexed terms correspond to the newly introduced elements from the initial sequence, and therefore, no extra storage is required. In contrast, in TEA1 the additional elements required for computing the even-indexed terms are not located on the current ascending diagonal; TEA1 must therefore additionally store the even-indexed elements from the previous ascending diagonal.
Algorithm 3 Topological ε-algorithm (TEA)
Require: 2k+1 elements of the sequence $(S_n)$: $S_1, \dots, S_{2k+1}$, where $k$ is the acceleration window width.
Ensure: Return the values $W$ and $e$.
1: Select the appropriate duality pairing between the linear functional $y \in E^*$ and the elements of $E$, where $E^*$ denotes the algebraic dual space of $E$.
2: $e_1 \leftarrow S_1$
3: for $i = 2, \dots, 2k+1$ do
4:  $W \leftarrow S_i$; $v \leftarrow 0$; counter $\leftarrow 1$
5:  for $j = 1, \dots, i-1$ do
6:   $t \leftarrow e_j$
7:   if mod(counter, 2) == 1 then
8:    $u \leftarrow v + y/\langle y,\, W - t\rangle$ (odd rule (35))
9:   else
10:   compute $u$ by the even rule of TEA1 or TEA2, taking the extra operands from the previous ascending diagonal (TEA1) or from the current one (TEA2)
11:  end if
12:  $v \leftarrow t$; $e_j \leftarrow W$; $W \leftarrow u$; counter $\leftarrow$ counter + 1
13: end for
14: $e_i \leftarrow W$
15: end for
Algorithm 4 presents the implementation of the STEA method. Notably, STEA consists of two components: a scalar part and a vector part. The scalar part retains the diamond-shaped structure, while the computational rules of the vector part are adapted to a triangular form. In the scalar part, each new scalar value is obtained by computing the duality product between the newly introduced element of the sequence $(S_n)$ and the functional $y$, while the entire scalar ascending diagonal is preserved during computation. In contrast, the vector part retains only the even-indexed elements along the ascending diagonal. As a result, the algorithm requires storing only $k$ vectors. The distinction between STEA1 and STEA2 is analogous to that between TEA1 and TEA2: STEA1 requires storing additional even-indexed elements from the previous ascending diagonal, while STEA2 avoids this, making it generally more memory-efficient. In the implementation provided in the literature [22], the scalar component of STEA is computed first using SEA, and the resulting scalar values are then incorporated into the STEA recursion. In this paper, we integrate these two procedures into a unified approach. Algorithm 4 provides the detailed implementation of STEA2-3, while the other three equivalent variants can be obtained by modifying the index variable $j$ within the algorithm.
Algorithm 4 The simplified topological ε-algorithms (STEA)
Require: 2k+1 elements of the sequence $(S_n)$: $S_1, \dots, S_{2k+1}$, where $k$ is the acceleration window width.
Ensure: Return the values $W$ and $e$.
1: Select the appropriate duality pairing between the linear functional $y \in E^*$ and the elements of $E$, where $E^*$ denotes the algebraic dual space of $E$, and form the scalar terms $s_i = \langle y, S_i \rangle$ as they arrive.
2: Initialize the scalar diagonal with $s_1$ and the vector storage with $e_1 \leftarrow S_1$.
3: for $i = 2, \dots, 2k+1$ do
4:  if mod(i, 2) == 0 then
5:   shift the stored even-indexed vector entries forward (this deletes the first element of the table and shifts the subsequent elements forward)
6:  end if
7:  advance the scalar ε-table by one ascending diagonal with the new term $s_i$, using the SEA rule (27)
8:  for each even-indexed vector entry of the new diagonal do
9:   update it from the entry two columns back, the corresponding vector difference, and the scalar factor of the STEA2-3 form taken from the scalar ε-table
10: end for
11: end for
Remark 1.
The ε-acceleration algorithms (SEA, VEA, TEA, and STEA) do not require matrix decomposition or subproblem solving, which gives them a significant advantage over polynomial extrapolation methods (such as MPE, MMPE, and RRE) and Anderson acceleration.
Remark 2. Common selections of the linear functional in the algebraic dual space of $E$ and of the corresponding dual product are detailed in the literature [21,22]. If the original sequence is a vector sequence $(S_n) \subset \mathbb{R}^m$ or a matrix sequence $(S_n) \subset \mathbb{R}^{m \times q}$, and considering that the vector space $\mathbb{R}^m$ and the matrix space $\mathbb{R}^{m \times q}$ are algebraically self-dual, the following selections can be made:
- $y \in \mathbb{R}^m$, typically $y = (1, \dots, 1)^\top$, i.e., the $m$-dimensional vector with all elements equal to 1, and the dual product is defined as $\langle y, u \rangle = y^\top u$;
- $Y \in \mathbb{R}^{m \times q}$, typically the all-ones matrix, and the dual product is defined as $\langle Y, U \rangle = \mathrm{trace}(Y^\top U)$;
- equivalently, identifying $\mathbb{R}^{m \times q}$ with $\mathbb{R}^{mq}$ through vectorization, the dual product is defined as $\langle Y, U \rangle = \mathrm{vec}(Y)^\top \mathrm{vec}(U)$.
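In MATLAB, the pairings listed in Remark 2 amount to one-line anonymous functions (illustrative):

dp_vec  = @(y, u) y' * u;                 % vectors:  <y, u> = y' * u,  e.g. y = ones(m, 1)
dp_mat  = @(Y, U) trace(Y' * U);          % matrices: <Y, U> = trace(Y' * U)
dp_mat2 = @(Y, U) sum(Y(:) .* U(:));      % same value via vectorization, cheaper to evaluate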
3.5. Combining ε-Algorithms with Fixed-Point Iterations to Solve Problem (5)
Given that the sequence generated by the fixed-point iteration in Equation (19) for solving the GIPSCAL problem is a matrix sequence, and that the operator $\Phi$ is a nonlinear mapping, we adopt a restart-based acceleration strategy for applying the ε-algorithms. This approach follows the vector-sequence polynomial extrapolation acceleration framework proposed in [35].
In practical implementations of the acceleration algorithm, a delayed-start strategy is commonly employed to prevent premature acceleration and improve overall performance. Specifically, the basic fixed-point iteration (19) is first executed for a fixed number of steps, or until the matrix sequence reaches a specified level of initial accuracy. Only after this preliminary phase is the acceleration algorithm applied. The detailed implementation steps of the iterative acceleration algorithms based on the ε-algorithms for solving the three-way GIPSCAL problem (5) are presented in Algorithm 5.
Algorithm 5 VEA, TEA, and STEA accelerated fixed-point iterations for solving the three-way GIPSCAL problem (5)
Require: $N$ asymmetric $n$-th-order matrices $X_i$, the initial iterate $Q^{(0)}$, and the window width parameter $k$.
1: Basic iteration: Initialize the iteration matrix to $Q^{(0)}$ and perform the delayed-start iterations through the nonlinear map $\Phi$ to obtain an intermediate iterate $Q$.
2: Perform extrapolation acceleration: Use $Q$ as the initial value of the acceleration, carry out $2k+1$ iterations to obtain the sequence of history iterates required for the extrapolation. Apply the VEA, TEA, or STEA algorithm to generate the extrapolated matrix, denoted $\bar{Q}$ after re-orthogonalization by the "economy-size" SVD decomposition.
3: Convergence judgment and iterative update: Calculate $D_i$ and $K_i$ according to Equations (13) and (14), check whether the iterate satisfies the termination condition, and if it does not, update $Q \leftarrow \bar{Q}$ and return to Step 1 to continue iterating until convergence.
4: return $Q$, $D_i$, $K_i$.
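A condensed MATLAB sketch of the restart loop in Algorithm 5 follows. Here extrapolate stands for any of the VEA/TEA/STEA routines applied to the stored Q-sequence and res39 evaluates the optimality residual (39); both names are placeholders for our implementations, and phi_map is the ALS sweep sketched in Section 2.

while true
    Qs = cell(1, 2*k + 2);  Qs{1} = Q;
    for j = 2:2*k + 2                           % 2k+1 basic steps form the history
        [Q, D, K] = phi_map(X, Q, D, K);  Qs{j} = Q;
    end
    Qe = extrapolate(Qs, k);                    % VEA, TEA, or STEA
    [U, ~, V] = svd(Qe, 'econ');  Q = U * V';   % re-orthogonalization, as in Step 2
    for i = 1:size(X, 3)                        % Step 3: updates (13) and (14)
        C = Q' * X(:,:,i) * Q;
        D(:,:,i) = max(diag(diag((C + C')/2)), 0);
        K(:,:,i) = (C - C')/2;
    end
    if res39(X, Q, D, K) < tol, break, end
end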
The termination criterion of Algorithm 5 is based on the first-order optimality conditions of problem (5) as presented in [10]. Note also that the Stiefel manifold $\mathcal{O}(n,r)$ is an embedded submanifold of the Euclidean space $\mathbb{R}^{n \times r}$, as discussed in [36], while $\mathcal{D}_+$ is a convex set and $\mathcal{K}$ is a linear subspace. The termination criterion can be formulated as follows:
$$\mathrm{Res}(Z) := \left\|P_{T_Q \mathcal{O}(n,r)}\!\left(\nabla_Q f\right)\right\|_F + \sum_{i=1}^{N}\left\|D_i - P_{\mathcal{D}_+}\!\left(D_i - \nabla_{D_i} f\right)\right\|_F + \sum_{i=1}^{N}\left\|\mathrm{skew}\!\left(\nabla_{K_i} f\right)\right\|_F \ \le \ \epsilon. \qquad (39)$$
Here, $\epsilon > 0$ is a predefined accuracy threshold, and $P_{T_Q \mathcal{O}(n,r)}$ denotes the orthogonal projection onto the tangent space $T_Q \mathcal{O}(n,r)$ at the point $Q$. As shown in [9,36], for any matrix $G \in \mathbb{R}^{n \times r}$ and any $Q \in \mathcal{O}(n,r)$, the projection is given by
$$P_{T_Q \mathcal{O}(n,r)}(G) = G - Q\,\mathrm{sym}\!\left(Q^\top G\right), \qquad (40)$$
where $\mathrm{sym}(A) = (A + A^\top)/2$ denotes the symmetric part of the matrix $A$. For the implementation of Algorithm 5, the residual $\mathrm{Res}(Z)$ defined in (39) is the quantity monitored at Step 3.
Remark 3.
The selection of the linear generalized functional and the corresponding duality product in the dual space should follow the approach outlined in Remark 2.
Remark 4.
The TEA and STEA methods presented in Algorithm 5 can be applied directly to the matrix sequence generated by the fixed-point iteration in Equation (19). However, if Algorithm 5 employs VEA for sequence acceleration, a combination of matrix vectorization (straightening) and inverse vectorization (inverse straightening) operators is needed. Specifically, the straightening operator is applied to the matrices involved in each single-step acceleration loop to obtain the corresponding vectors; VEA is then applied to these vectors to compute the accelerated vector, which is subsequently converted back to matrix form via the inverse straightening operator. An "economy-size" singular value decomposition (SVD) is performed on the resulting matrix to reorthogonalize it, producing the updated iterate. Finally, Step 3 of Algorithm 5 is executed to proceed with the iterative process.
4. Numerical Experiments
In this section, we present a comprehensive numerical evaluation of the proposed ε-algorithm-accelerated fixed-point iterations for solving the three-way GIPSCAL problem in Equation (5). We begin by comparing the original fixed-point iteration with its accelerated variants. Additionally, we benchmark these methods against the continuous-time projected gradient flow algorithm introduced by Trendafilov [10,13] as well as several state-of-the-art first- and second-order Riemannian optimization algorithms from the MATLAB toolbox Manopt [9,20]. All experiments were conducted on a standard desktop computer equipped with an Intel(R) Core(TM) i7-13620H CPU (2.40 GHz) and 16.00 GB of RAM, running MATLAB R2022b.
To enable controlled and diverse benchmarking, we generated a collection of $N$ square, asymmetric data matrices $X_i \in \mathbb{R}^{n \times n}$ using a factorial design approach inspired by Takane et al. [37], originally developed for orthogonal INDSCAL problems [9,17]. This setup includes three types of datasets: one purely random and two structured variants. In the random setting, each entry of $X_i$ is drawn independently from a standard normal distribution, i.e., $(X_i)_{st} \sim \mathcal{N}(0,1)$. Such unstructured randomness often leads to large fit errors in the objective function of problem (5). For the structured datasets, we construct each slice as
$$X_i = Q(D_i + K_i)Q^\top + E_i, \qquad i = 1, \dots, N, \qquad (41)$$
where $Q$, $D_i$, and $K_i$ are randomly generated, and $E_i$ denotes an additive noise matrix. The matrix $Q$ is populated with uniformly distributed random entries via rand(n, r) and column-orthogonalized using singular value decomposition (SVD). The diagonal entries of $D_i$ are sampled from a standard normal distribution. Each $K_i$ is obtained by drawing a matrix $M_i$ from the uniform distribution and skew-symmetrizing it as
$$K_i = \frac{M_i - M_i^\top}{2}.$$
To evaluate robustness under varying structural assumptions, we consider two structured variants: one allowing potentially negative diagonal elements in $D_i$ (indefinite case), and the other enforcing nonnegativity by taking element-wise absolute values (nonnegative definite or nnd case). The disturbance terms $E_i$ are sampled from a normal distribution with zero mean and variance $\sigma^2$, where $\sigma$ is set to 10% of the standard deviation of the structural term $Q(D_i + K_i)Q^\top$. To better reflect practical scenarios, the number of "subjects" $N$ varies between 20 and 50, while the number of "stimuli" $n$ is capped at 200. For effective visualization in multidimensional scaling (MDS), the target dimensionality $r$ is set to three representative levels, the largest being $r = 5$.
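A MATLAB sketch of this generator is given below; the interval of the pre-skew-symmetrized entries and the noise convention follow the description above, and the sizes shown are one illustrative configuration.

n = 100;  r = 5;  N = 20;
[U, ~, V] = svd(rand(n, r), 'econ');  Qt = U * V';   % column-orthonormal Q
X = zeros(n, n, N);
for i = 1:N
    Dt = diag(randn(r, 1));            % IND case; use diag(abs(randn(r, 1))) for NND
    M  = rand(r);  Kt = (M - M')/2;    % skew-symmetrized uniform matrix
    T  = Qt * (Dt + Kt) * Qt';         % structural term
    sigma = 0.1 * std(T(:));           % noise sd: 10% of the structural term's sd
    X(:,:,i) = T + sigma * randn(n);
end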
The initial iterate $Q^{(0)}$, used in both the original and accelerated fixed-point iterations, is computed using a structured initialization strategy consistent with the recommendations of [10,17]. Specifically, we perform an eigenvalue decomposition (EVD) on the symmetric component of the aggregated data,
$$\sum_{i=1}^{N} \mathrm{sym}(X_i) = V \Lambda V^\top,$$
where the eigenvalues in $\Lambda$ are sorted in descending order. The first $r$ eigenvectors, denoted $V_r$, are used to initialize $Q^{(0)} = V_r$. The corresponding initial values of $D_i^{(0)}$ and $K_i^{(0)}$, required by both the Riemannian optimization framework and the projected gradient flow method used to solve the equivalent product-manifold-constrained optimization problem in Equation (42), are given by the closed-form updates (13) and (14) evaluated at $Q^{(0)}$.
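In MATLAB, this initialization reads as follows (a sketch; the closed-form start for $D_i$ and $K_i$ is our reading of the strategy above, reusing the dimensions n, r, N and the array X from the preceding snippet):

S = zeros(n);
for i = 1:N, S = S + (X(:,:,i) + X(:,:,i)')/2; end
[V, L] = eig(S);
[~, idx] = sort(diag(L), 'descend');
Q0 = V(:, idx(1:r));                   % first r eigenvectors of the symmetrized sum
D0 = zeros(r, r, N);  K0 = zeros(r, r, N);
for i = 1:N
    C = Q0' * X(:,:,i) * Q0;
    D0(:,:,i) = max(diag(diag((C + C')/2)), 0);   % cf. (13)
    K0(:,:,i) = (C - C')/2;                       % cf. (14)
end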
Due to the matrix-valued nature of the fixed-point sequence, storage constraints require relatively small window sizes $k$ for the ε-algorithm acceleration. To evaluate the impact of the window width on acceleration performance, we experiment with several values of $k$ up to 6, selecting the value that produces the best empirical performance. To further enhance the reliability of the acceleration, particularly in cases where the base fixed-point iteration converges slowly, we adopt a delayed-start strategy: the fixed-point algorithm is executed until an intermediate solution satisfies a specified error threshold, and only then is the ε-algorithm-based acceleration activated. In our experiments, the core implementations of the various ε-algorithms (including SEA, VEA, TEA, and STEA) are provided by the MATLAB toolbox EPSfun, as described in [22]; the code can be downloaded from http://www.netlib.org/numeralgo/. The topological ε-algorithm uses the TEA2 version, while the simplified topological ε-algorithm uses the STEA2-3 version. For both the original fixed-point iteration and each of the acceleration algorithms, different termination thresholds are applied based on the data generation method for $X_i$: in the case of RAND-generated data, convergence is relatively slow, so a looser termination tolerance is used; for the NND and IND cases, which typically converge faster, a tighter tolerance is used. The computation of the Error quantity follows the definition given in Equation (39), and in all subsequent numerical results the reported Error values are consistently calculated using this formula.
4.1. Numerical Comparison of Fixed-Point Acceleration Methods
This section presents a comprehensive numerical comparison between the original fixed-point iteration (denoted FPI) and several accelerated variants. These include the ε-algorithm-based methods (FPI-VEA, FPI-TEA, and FPI-STEA), polynomial extrapolation techniques (FPI-MPE, FPI-RRE, and FPI-MMPE), and Anderson acceleration (FPI-Anderson). The experimental results, summarized in Table 1, span a range of configurations including different data generation strategies for the matrices $X_i$, system dimensions $(n, r)$, numbers of slices $N$, and acceleration window widths $k$. In the table, IT denotes the total number of iterations to reach convergence, CPU refers to the total computation time in seconds, and Error represents the final residual error. It is important to note that for all acceleration schemes, including FPI-VEA, FPI-TEA, FPI-STEA, FPI-MPE, FPI-RRE, FPI-MMPE, and FPI-Anderson, the reported iteration count (IT) includes the initial delayed-start iterations, the base iterations required to form the extrapolation sequence, and the subsequent accelerated iterations.
From Table 1, we observe that, to achieve equivalent termination accuracy, the polynomial extrapolation method FPI-MPE generally outperforms the ε-based methods in terms of iteration count and runtime. This performance gap is attributed to the number of sequence elements each method requires. Specifically, for a given window size $k$, polynomial extrapolation methods utilize $k+2$ vectors from the underlying sequence, while the ε-algorithms require $2k+1$ vectors [35]. Nonetheless, the ε-based methods (FPI-VEA, FPI-TEA, and FPI-STEA) demonstrate consistent acceleration across different system sizes and window widths, exhibiting good performance scalability.
Theorem 6.8 in Section 6 of [21] provides a theoretical analysis of the convergence rate of the STEA-accelerated sequence in terms of the window width $k$: as $k$ increases, the order of convergence after acceleration improves. The result reads as follows:
Theorem 4 ([21]).
We consider sequences of the form
$$S_n = S + \sum_{i \ge 1} a_i \lambda_i^n, \qquad a_i \in E, \quad \lambda_i \in \mathbb{K},$$
where $1 > |\lambda_1| > |\lambda_2| > \cdots > 0$ and $\langle y, a_i \rangle \neq 0$ for all $i$. Then, when $k$ is fixed and $n$ tends to infinity,
$$\varepsilon_{2k}^{(n)} - S = O\!\left(\lambda_{k+1}^{\,n}\right).$$
Table 1 demonstrates that the choice of the window width $k$ significantly influences the convergence behavior. While increasing $k$ generally improves the acceleration by leveraging more sequence information, it also raises the per-iteration computational cost. To strike a balance between efficiency and overhead, the acceleration window is fixed at a moderate value for the subsequent experiments.
Figure 3 illustrates the evolution of the error norm Error as a function of iteration time across different system configurations. The results are presented for three data-generation scenarios: nonnegative definite (NND), indefinite (IND), and fully random (RAND), with the window width held fixed throughout. For data generated via the NND and IND schemes, the underlying fixed-point sequences exhibit relatively fast convergence; in these cases, extrapolation is applied immediately, without delay. As seen in the top four subplots of Figure 3 (with the top two corresponding to NND and the next two to IND), the application of ε-acceleration after the initial $2k+1$ base iterations leads to a substantial and rapid drop in the residual error. In contrast, for RAND-generated data, where the underlying sequence converges more slowly, a delayed-start strategy is employed. The final six subplots of Figure 3 show the evolution of the error for RAND matrices across varying system sizes. These results confirm that, even under more challenging conditions, the accelerated methods remain effective and yield significant convergence improvements.
The numerical experiments collectively demonstrate that the ε-based acceleration methods achieve convergence performance comparable to polynomial extrapolation and Anderson acceleration when applied to the three-way GIPSCAL problem (5). A noteworthy advantage of the ε-algorithms is their ability to operate directly on matrix sequences without requiring vectorization ("straightening") and inverse reshaping procedures, transformations that are typically needed for the polynomial and Anderson-based methods. In addition, the ε-methods avoid costly matrix factorizations and auxiliary subproblem solves, resulting in simpler implementation and reduced computational overhead.
4.2. Comparison with Riemannian Optimization Methods in Manopt
In 2021, Trendafilov and Gallo [9] proposed a unified framework for classical multivariate data analysis models by leveraging the geometric structure of matrix manifolds. They further introduced a computational approach for the three-way GIPSCAL problem based on the Riemannian optimization toolbox Manopt [20]. In this section, we present a numerical comparison between the proposed fixed-point iteration acceleration methods (VEA, TEA, and STEA) and the existing optimization algorithms implemented in the Manopt toolbox. To enable the application of Riemannian optimization techniques, the original three-way GIPSCAL problem (5) is reformulated as a constrained optimization problem defined on a product manifold, as follows:
$$\min_{Z = (Q, D_1, \dots, D_N, K_1, \dots, K_N) \in \mathcal{M}} \ g(Z) := \sum_{i=1}^{N} \left\|X_i - Q\left(D_i^2 + K_i\right)Q^\top\right\|_F^2, \qquad \mathcal{M} := \mathcal{O}(n,r) \times \mathcal{D}^N \times \mathcal{K}^N. \qquad (42)$$
Here, $\mathcal{D}$ denotes the linear subspace of diagonal $r \times r$ matrices, and the squaring $D_i^2$ renders the nonnegativity constraint of (5) implicit. Through straightforward algebraic derivation, the Euclidean gradient of the objective function $g$ in (42) with respect to $(Q, D_i, K_i)$ can be expressed componentwise as
$$\nabla_Q g = -2\sum_{i=1}^{N}\left[\hat{E}_i Q \left(D_i^2 + K_i\right)^{\!\top} + \hat{E}_i^\top Q \left(D_i^2 + K_i\right)\right], \qquad \nabla_{D_i} g = -4\, D_i\, \mathrm{Diag}\!\left(Q^\top \hat{E}_i Q\right), \qquad \nabla_{K_i} g = -2\,\mathrm{skew}\!\left(Q^\top \hat{E}_i Q\right), \qquad (43)$$
where $\hat{E}_i = X_i - Q(D_i^2 + K_i)Q^\top$. Since $\mathcal{M}$ is an embedded submanifold of the Euclidean space, the Riemannian gradient is obtained by orthogonally projecting the Euclidean gradient onto the tangent space $T_Z \mathcal{M}$ [36]. Using the known projection operator (40) onto the tangent space of the Stiefel manifold, and recognizing that $\mathcal{D}$ and $\mathcal{K}$ are linear subspaces of $\mathbb{R}^{r \times r}$ (so that the corresponding projections are the diagonal and skew-symmetric parts, respectively), the Riemannian gradient of $g$ in (42) at $Z$ is given by
$$\mathrm{grad}\, g(Z) = \left(P_{T_Q \mathcal{O}(n,r)}\!\left(\nabla_Q g\right),\ \nabla_{D_1} g, \dots, \nabla_{D_N} g,\ \nabla_{K_1} g, \dots, \nabla_{K_N} g\right).$$
The generic update step of a Riemannian optimization algorithm over $\mathcal{M}$ is given by
$$Z^{(s+1)} = R_{Z^{(s)}}\!\left(\alpha_s \xi_s\right),$$
where $\alpha_s$ is the step size, $\xi_s \in T_{Z^{(s)}}\mathcal{M}$ is the search direction, and $R_Z$ is a retraction operator that maps tangent vectors back to the manifold. Specifically, the retraction is defined componentwise as
$$R_Z(\xi) = \left(R^{St}_Q(\xi_Q),\ D_i + \xi_{D_i},\ K_i + \xi_{K_i}\right),$$
where $R^{St}$ is a retraction on the Stiefel manifold $\mathcal{O}(n,r)$. For conjugate gradient-type methods, vector transport operations are required as well. Given two tangent vectors $\xi, \eta \in T_Z \mathcal{M}$, their transport is defined by
$$\mathcal{T}_{\eta}(\xi) = \left(\mathcal{T}^{St}_{\eta_Q}(\xi_Q),\ \xi_{D_i},\ \xi_{K_i}\right),$$
where $\mathcal{T}^{St}$ denotes a vector transport on the Stiefel manifold $\mathcal{O}(n,r)$. Our experiments use Manopt's default retraction and transport implementations. For second-order algorithms, the Riemannian Hessian is required as well; for brevity, derivations are omitted, and detailed formulas can be found in Absil et al. [38]. For further literature on applying Riemannian optimization techniques to various matrix optimization models arising in multidimensional scaling, see also [14,39,40]. Optimization terminates when the following first-order optimality condition is satisfied:
$$\left\|\mathrm{grad}\, g(Z)\right\| \le \epsilon. \qquad (44)$$
We benchmarked our accelerated fixed-point methods against the following Manopt solvers: Riemannian steepest descent (Manopt-RSD), conjugate gradient (Manopt-RCG), Barzilai-Borwein (Manopt-RBB), trust-region (Manopt-RTR), limited-memory BFGS (Manopt-RLBFGS), and adaptive regularization by cubics (Manopt-ARC). All solvers were executed using default parameter settings with a maximum iteration cap of 10,000. To ensure consistency, all methods were initialized from the same starting point, and both the Riemannian gradient and (when required) Hessian were supplied explicitly. Notably, Manopt defaults to finite-difference approximations when the Hessian is not provided. In contrast, our implementation consistently utilized exact second-order information.
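For orientation, a hedged Manopt sketch of the reformulation (42) is shown below. We place Q on the Stiefel factory, keep the diagonal parameters d(:,i) (so that $D_i = \mathrm{diag}(d(:,i))$) and the skew parameters as Euclidean variables, and let Manopt fall back to its finite-difference gradient approximation; supplying the exact gradient (43) through problem.egrad, as done in our experiments, proceeds analogously. The factory and solver names below are those of the standard Manopt distribution; the cost-function name is ours.

elems.Q = stiefelfactory(n, r);
elems.d = euclideanfactory(r, N);            % d(:, i) parametrizes D_i
elems.K = euclideanfactory(r, r*N);          % columns hold the N skew blocks
problem.M = productmanifold(elems);
problem.cost = @(Z) gipscost(Z, X, r, N);
opts.maxiter = 10000;
Zopt = trustregions(problem, [], opts);      % Manopt-RTR; conjugategradient etc. used likewise

function g = gipscost(Z, X, r, N)
    g = 0;
    for i = 1:N
        Ki = Z.K(:, (i-1)*r+1 : i*r);  Ki = (Ki - Ki')/2;   % enforce skew-symmetry
        Ai = diag(Z.d(:, i).^2) + Ki;                       % D_i^2 + K_i, cf. (42)
        g  = g + norm(X(:,:,i) - Z.Q*Ai*Z.Q', 'fro')^2;
    end
end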
Table 2 summarizes performance across multiple problem sizes and input data types: completely random (RAND) and two structured categories (IND and NND). The performance metrics follow those in Table 1, with CPU denoting total runtime, IT indicating the number of iterations, and Fvalue representing the final objective value $f$ or $g$. The accelerated fixed-point methods are applied to the original three-way GIPSCAL problem (5), while the Riemannian optimization algorithms solve the equivalent reformulation (42). The Error column records the norm associated with the respective stopping criterion: (39) for the accelerated fixed-point methods and (44) for the Riemannian solvers. Convergence trajectories are visualized in Figure 4, where the horizontal axis denotes runtime and the vertical axis plots the objective value. The results clearly demonstrate that the proposed accelerated fixed-point methods outperform several Manopt solvers in terms of both convergence rate and computational efficiency. Notably, as shown by the solid line with pink dots and the dashed line with black dots in Figure 4, both Manopt-RBB and Manopt-RSD exhibit pronounced slowdowns in convergence near stationary points. Furthermore, Table 2 highlights the elevated computational cost of Manopt-RLBFGS, which aligns with the internal implementation details of Manopt: its default memory size of 30 necessitates up to 30 projection-based vector transport operations per iteration, contributing to significant per-iteration overhead.
4.3. Comparison with the Projected Gradient Flow Method
Trendafilov [10,13] reformulated the equivalent three-way GIPSCAL problem (42) as a constrained dynamical system and proposed a continuous-time projected gradient flow algorithm. This method is globally convergent, conceptually simple, and broadly applicable to matrix optimization problems arising in multidimensional data analysis [9,15,16,17,18]. In this subsection, we conduct a numerical comparison between the proposed accelerated fixed-point algorithms (VEA, TEA, and STEA) and the projected gradient flow algorithm.
For a general constrained optimization problem of the form $\min_{x \in \mathcal{C}} \phi(x)$, the projected gradient method generates a sequence of iterates via
$$x^{(s+1)} = P_{\mathcal{C}}\!\left(x^{(s)} - \alpha_s \nabla \phi\!\left(x^{(s)}\right)\right),$$
where $\alpha_s > 0$ is the step size and $P_{\mathcal{C}}$ denotes the orthogonal projection onto the feasible set $\mathcal{C}$. The associated continuous-time version, known as the projected gradient flow, evolves along the negative projected gradient and is governed by the following differential equation:
$$\dot{x}(t) = -P_{T_{x(t)}}\!\left(\nabla \phi\!\left(x(t)\right)\right).$$
For the equivalent reformulated three-way GIPSCAL problem (42), this yields the following system of ordinary differential equations:
$$\dot{Q} = -P_{T_Q \mathcal{O}(n,r)}\!\left(\nabla_Q g\right), \qquad \dot{D}_i = -\nabla_{D_i} g, \qquad \dot{K}_i = -\nabla_{K_i} g, \qquad i = 1, \dots, N.$$
We employ MATLAB's ode15s solver [41,42] to integrate this system numerically. The ode15s routine is a variable-step, variable-order implicit solver designed for stiff differential equations and belongs to the Klopfenstein–Shampine family of solvers. We set both the absolute and relative error tolerances to stringent values to ensure highly accurate tracking of the flow dynamics. While such precision may exceed practical requirements in typical data analysis tasks, it facilitates a fair and rigorous comparison of algorithmic performance. Although looser tolerance settings could reduce runtime, their effect is marginal in this context. During integration, solution states are recorded at regular intervals of 10 time units. The integration process is terminated automatically when the relative decrease of the objective function between successive outputs falls below a prescribed threshold, indicating proximity to a local minimum. This termination criterion, adopted from Loisel and Takane [19], is significantly more lenient than the convergence threshold used in the proposed accelerated fixed-point algorithms (VEA, TEA, and STEA).
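Schematically, the integration can be set up as follows. Here pack, unpack, and projgrad are illustrative helpers (not library functions) that reshape the variables into a single state vector and evaluate the projected gradient of (42); the tolerances shown are placeholders for the stringent values described above.

z0   = pack(Q0, D0, K0);                       % initial state as one column vector
opts = odeset('RelTol', 1e-10, 'AbsTol', 1e-10);
rhs  = @(t, z) -pack(projgrad(unpack(z), X));  % right-hand side of the gradient flow
sol  = ode15s(rhs, [0, 1e4], z0, opts);
Zs   = deval(sol, 0:10:1e4);                   % states sampled every 10 time units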
Although the projected gradient flow algorithm exhibits global convergence and features a simple, intuitive structure that facilitates implementation, its efficiency tends to degrade when applied to large-scale problems. Table 3 reports a detailed numerical comparison between the accelerated fixed-point iteration methods (namely, FPI-VEA, FPI-TEA, and FPI-STEA) and the projected gradient flow approach (denoted PG-ODE), under a fixed acceleration window width. The experiments are conducted across varying coefficient dimensions and under three distinct original data generation schemes: NND, IND, and RAND. The definitions of CPU, IT, Obj, and Error are consistent with those provided in Table 1. The results in Table 3 demonstrate that, in terms of iteration efficiency, the ε-accelerated fixed-point iteration algorithms significantly outperform the projected gradient flow method.
5. Conclusions
This paper examines the Generalized Inner Product SCALing (GIPSCAL) model in multidimensional scaling from a numerical perspective, with a focus on individual differences among the observed objects. The model can be formulated as a multivariate constrained matrix optimization problem with column-orthogonality and nonnegative-diagonal constraints (problem (5)). Using the alternating least squares iterative approach, the original model is first transformed into a matrix-based fixed-point iteration problem. Furthermore, by incorporating the ε-acceleration principle from vector sequence acceleration, we design fixed-point iteration acceleration algorithms based on the vector ε-algorithm, the topological ε-algorithm, and the simplified topological ε-algorithm (denoted FPI-VEA, FPI-TEA, and FPI-STEA). Extensive numerical experiments show that, when solving the GIPSCAL problem in (5), the fixed-point iteration algorithms combined with the ε-algorithms converge considerably faster than the original fixed-point iteration. Additionally, compared with existing algorithms for solving matrix optimization models in multidimensional data analysis, such as the Riemannian optimization-based Manopt toolbox solvers and the classical projected gradient flow algorithm, the accelerated fixed-point iterations demonstrate a clear advantage in iteration time. Improving the convergence speed of scalar, vector, matrix, or tensor sequences generated by iterative methods is highly significant in fields such as scientific and engineering computing and machine learning. It is worth noting that, unlike traditional vector-sequence polynomial extrapolation acceleration methods, the implementations of FPI-TEA and FPI-STEA operate directly on matrix sequences without matrix flattening and unflattening operators (FPI-VEA requires them, as noted in Remark 4), and none of the proposed methods involves matrix decompositions for solving related subproblems.