Towards a Vector Field Based Approach to the Proper Generalized Decomposition (PGD)

Antonio Falcó; Lucía Hilario; Nicolás Montés; Marta C. Mora; Enrique Nadal

doi:10.3390/math9010034

,

and

¹

ESI International Chair@CEU-UCH, Departamento de Matemáticas, Física y Ciencias Tecnológicas, Universidad Cardenal Herrera-CEU, CEU Universities San Bartolomé 55, 46115 Alfara del Patriarca, Spain

²

Departamento de Ingeniería Mecánica y Construcción, Universitat Jaume I, Avd. Vicent Sos Baynat s/n, 12071 Castellón, Spain

³

Departamento de Ingeniería Mecánica y de Materiales, Universitat Politècnica de València Camino de Vera, s/n, 46022 Valencia, Spain

^*

Author to whom correspondence should be addressed.

Mathematics2021, 9(1), 34;https://doi.org/10.3390/math9010034

This article belongs to the Special Issue Applications of Partial Differential Equations in Engineering

Version Notes

Order Reprints

Abstract

A novel algorithm called the Proper Generalized Decomposition (PGD) is widely used by the engineering community to compute the solution of high dimensional problems. However, it is well-known that the bottleneck of its practical implementation focuses on the computation of the so-called best rank-one approximation. Motivated by this fact, we are going to discuss some of the geometrical aspects of the best rank-one approximation procedure. More precisely, our main result is to construct explicitly a vector field over a low-dimensional vector space and to prove that we can identify its stationary points with the critical points of the best rank-one optimization problem. To obtain this result, we endow the set of tensors with fixed rank-one with an explicit geometric structure.

Keywords:

proper generalised decomposition; alternating least squares; greedy rank one update algorithm; tensor numerical methods

MSC:

15A69; 15A23; 57R25; 65N30

1. Introduction

Many problems in science and engineering are hard to compute due their numerical complexity. Moreover, in the analysis of complex systems under a real time constraint, the evaluation of all possible scenarios appears as a necessity [1]. Despite the improvements in techniques used in high dimensional problems, some challenging questions remain unresolved due to the efficiency of our computers. However, a novel technique called Proper Generalized Decomposition (PGD) [2,3] has been developed to provide an answer of these difficult tasks. It was initially proposed to compute, in a separated representation framework, the variational solution of partial differential equations (PDE) defined over a tensor product space [4]. It is possible to distinguish two different benefits. The first one is the possibility of managing high dimensional problems, and the second is the possibility to include the model’s parameters as extra-coordinates. This last fact gives a powerful strategy to deal with classical problems because the PGD framework facilitates an efficient design and a real-time decision-making [5,6].

This novel technique allows for computing the whole set of solutions of a parametrized problem. The strategy is to include in an equivalent non-parametrized problem all possible parameter values as extra-coordinates. The name of this particular PGD based approach is Progressive Variational Vademecum [1], and it can be implemented offline. As a consequence, the PGD based approach opens the possibility of solving problems in industry with a different strategy not envisioned until now.

The mathematical analysis of the PGD was given by Falcó and Nouy, Ref. [4] in a Hilbert space framework and in [7] for a more general setting.

The Greedy Rank One Algorithm (GROA) [8] used to solve high-dimensional linear systems (with a full rank matrix) is the procedure of choice in the engineering community to implement the PGD. It is an iterative method made up of two steps that we can cyclically repeat until convergence. The first one consists of computing the minimal residual of the linear system over the set of tensors with bounded rank-one. In the second step, we use this optimal rank-one solution to update the residual. In the following, we return to the first step. Ammar et al. [8] propose an Alternating Least Squares (ALS) Algorithm for the practical implementation of the first step of the GROA. In the aforementioned paper, the authors justify the choice of the ALS showing that its convergence to a critical point of the optimization problem (not necessarily an optimal one) is assured under very weak conditions. In addition, the convergence has been studied by El Hamidi, Osman and Jazar [9] in the framework of Sobolev tensor spaces.

In this work, we want to study the optimization problem of the first step the GROA. To this end, even though our problem is not convex, we will take into account the relationship between the underlying convex optimization problem and the behaviour of its associated gradient flow. It is well-known that the vector field constructed by using a convex functional (for example related with a convex minimization problem defined over a finite dimensional vector space), has a gradient flow that provides a dynamical system with a unique stationary point. Moreover, it can be shown that it is a sink and its stable manifold coincides with the whole domain of the convex functional. This fact motivates the classical paradigm about the convergence of gradient-based numerical optimization algorithms.

In this work, we want to adapt the above paradigm in the framework of the GROA [8]. The main goal is to find a vector field in a low-dimensional vector space related to the gradient flow of a convex functional defined over the set of tensors of fixed rank one. The idea is to use this vector field to characterize the behaviour of the solutions of the non-convex optimization problem associated with the first step of GROA. To achieve this, we will prove that the set of critical points of the optimization problem over the set of tensors with fixed rank-one can be identified with the set of stationary points of that vector field. In order to construct it, we will proceed as follows. First, we will endow the set tensors of fixed rank-one with an explicit structure of smooth manifold. Second, by the help of this geometric structure, we will explicitly construct a vector field over a low dimensional vector space related with the first step of GROA. Finally, we will show that the set of stationary points of this vector field coincides with the set of critical points of the optimization problem associated with the PGD algorithm. In consequence, this vector field allows us to explain the dynamical behaviour around each of its stationary points. Moreover, we can get explicit information, in a neighbourhood of each of these stationary points, about the structure of its stable and unstable manifolds. In our opinion, a more precise knowledge of these invariant sets can help us develop better and more efficient PGD approaches.

The paper is organised as follows. Section 2 provides some preliminary definitions and results used along this paper. Section 3 shows a geometric approach to the PGD. In Section 4, the characterization of the smooth manifold of the set of tensors of fixed rank-one is given. After that, Section 5 shows the first order optimality conditions for the PGD which is the main result of this paper. Finally, Section 6 provides some conclusions of the work.

2. Preliminary Definitions and Results

First of all, we introduce some notation used along this paper. We denote by

R^{N \times M}

the set of

N \times M

-matrices and by

A^{T}

the transpose of a given matrix

A .

As usual, we use

⟨ x, y ⟩ = x^{T} y = y^{T} x

to denote the Euclidean inner product in

R^{N},

and its corresponding 2-norm, by

{∥ x ∥}_{2} = {⟨ x, x ⟩}^{1 / 2} .

Let

I_{N}

be the

N \times N

-identity matrix and when the dimension is clear from the context, we simply denote it by

I .

Given a sequence

{u_{j}}_{j = 0}^{\infty} \subset R^{N},

we say that a vector

u \in R^{N}

can be written as

u = \sum_{j = 0}^{\infty} u_{j}

if and only if

lim_{n \to \infty} \sum_{j = 0}^{n} u_{j} = u

in the

{∥ \cdot ∥}_{2}

-topology. Now, we recall the definition and some properties of the Kronecker product. The Kronecker product of

A \in R^{N_{1}^{'} \times N_{1}}

and

B \in R^{N_{2}^{'} \times N_{2}},

written

A \otimes B,

is the tensor algebraic operation defined as

A \otimes B = [\begin{matrix} A_{1, 1} B & A_{1, 2} B & \dots & A_{1, N_{1}^{'}} B \\ A_{2, 1} B & A_{2, 2} B & \dots & A_{2, N_{1}^{'}} B \\ ⋮ & ⋮ & ⋱ & ⋮ \\ A_{N_{1}, 1} B & A_{N_{1}, 2} B & \dots & A_{N_{1}, N_{1}^{'}} B \end{matrix}] \in R^{N_{1}^{'} N_{2}^{'} \times N_{1} N_{2}} .

To conclude, we list some of the well-known properties of the Kronecker product (see, for example, [10] or [11]).

1.: $A \otimes (B \otimes C) = (A \otimes B) \otimes C .$
2.: $(A + B) \otimes (C + D) = (A \otimes C) + (B \otimes C) + (A \otimes D) + (B \otimes D) .$
3.: $A B \otimes C D = (A \otimes C) (B \otimes D) .$
4.: ${(A \otimes B)}^{- 1} = A^{- 1} \otimes B^{- 1} .$
5.: ${(A \otimes B)}^{T} = A^{T} \otimes B^{T} .$
6.: If A and B are banded, then $A \otimes B$ is banded.
7.: If A and B are symmetric, then $A \otimes B$ is symmetric.
8.: If A and B are definite positive, then $A \otimes B$ is definite positive.

The concept of separated representation was introduced by Beylkin and Mohlenkamp in [12], and it is related to the problem of constructing the approximate solutions of some classes of problems in high-dimensional spaces by means of a separable function. In particular, for a given map

u : {[0, 1]}^{d} \subset R^{d} ⟶ R,

we say that it has a separable representation if

u (x_{1}, \dots, x_{d}) = \sum_{j = 1}^{\infty} u_{1}^{(j)} (x_{1}) \dots u_{d}^{(j)} (x_{d})

(1)

Now, consider a mesh of

[0, 1]

in the

x_{k}

-variable given by

N_{k}

-mesh points,

1 \leq k \leq d,

then we can write a discrete version of (1) by

u (x_{i_{1}}, \dots, x_{i_{d}}) = \sum_{j = 1}^{\infty} u_{1}^{(j)} (x_{i_{1}}) \dots u_{d}^{(j)} (x_{i_{d}}),

(2)

where

1 \leq i_{k} \leq N_{k}

for

1 \leq k \leq d .

Observe that, for each

1 \leq k \leq d,

if

x_{k}^{j} \in R^{N_{k}}

denotes the vector with components

u_{k}^{(j)} (x_{i_{k}})

for

1 \leq i_{k} \leq N_{k},

then (2) is equivalent to

u = \sum_{j = 1}^{\infty} x_{1}^{j} \otimes \dots \otimes x_{d}^{j} .

(3)

We point out that (3) is a useful expression to implement numerical algorithms using the Matlab and Octave function kron.

Suppose that, for given a linear Partial Differential Equation, and after a discretization by means of Finite Elements, we need to solve the linear system:

A u = f,

(4)

where A is a

(N_{1} \dots N_{d}) \times (N_{1} \dots N_{d})

-dimensional invertible matrix, for some

N_{1}, \dots, N_{d} \in N,

that is,

A \in GL (N_{1} \dots N_{d}) .

Then, from all said above, a low rank approximation

A^{- 1} f \approx u_{n} = \sum_{j = 1}^{n} x_{1}^{j} \otimes \dots \otimes x_{d}^{j}

with sufficient approximation exists, for some

n \geq 1

and where

x_{i}^{j} \in R^{N_{i}}

for

i = 1, 2, \dots, d

and

j = 1, 2, \dots, n .

Moreover, we would show that

lim_{n \to \infty} {∥A^{- 1} f - u_{n}∥}_{2} = 0,

that is,

\begin{matrix} A^{- 1} f = \sum_{j = 1}^{\infty} x_{1}^{j} \otimes \dots \otimes x_{d}^{j} . \end{matrix}

(5)

Thus, in a first approach to solve it, we would like to determine vectors

x_{1}^{j}, \dots, x_{d}^{j}

for

j = 1, 2, \dots, n

that minimizes

{∥f - A (\sum_{j = 1}^{n} x_{1}^{j} \otimes \dots \otimes x_{d}^{j})∥}_{2},

or, in short

{argmin}_{{rank}_{\otimes} u \leq n} {∥ f - A u ∥}_{2},

(6)

by using the notation introduced in [13].

The Proper Generalised Decomposition (PGD in short) appears when we consider solving the linear Equation (4) as an optimization problem as follows. For each fixed

A \in GL (N_{1} N_{2} \dots N_{d})

and

f \in R^{N_{1} N_{2} \dots N_{d}}

, we define a map

J_{A, f} : R^{N_{1} \dots N_{d}} \to R, J_{A, f} (u) = \frac{1}{2} {∥ f - A u ∥}_{2}^{2},

hence

{argmin}_{u \in R^{N_{1} N_{2} \dots N_{d}}} J_{A, f} (u) = {A^{- 1} f}

holds. The goal is to use (5) to approximate the solution of (4). To this end, for each

n \in N,

we define the set

S_{n} = {x \in R^{N_{1} \dots N_{d}} : {rank}_{\otimes} x \leq n},

introduced in [13], in the following way. Given

x \in R^{N_{1} \dots N_{d}}

, we say that

x \in S_{1} = S_{1} (N_{1}, N_{2}, \dots, N_{d})

if

x = x_{1} \otimes x_{2} \otimes \dots \otimes x_{d},

where

x_{i} \in R^{N_{i}},

for

i = 1, \dots, d .

For

n \geq 2

, we define inductively

S_{n} = S_{n} (N_{1}, N_{2}, \dots, N_{d}) = S_{n - 1} + S_{1},

that is,

S_{n} = \{x : x = \sum_{i = 1}^{k} x^{(i)}, x^{(i)} \in S_{1} for 1 \leq i \leq k \leq n\} .

Note that

S_{n} \subset S_{n + 1}

for all

n \geq 1 .

Unfortunately, from Proposition 4.1 (a) of [13], we have that the set

S_{n}

is not necessarily (or even usually) closed for each

n \geq 2 .

However, from Proposition 4.2 of [13], it follows that

S_{1}

is a closed set in any norm-topology. This fact implies (see Lemma 1 in [8]), that given

A \in GL (N_{1} N_{2} \dots N_{d})

, then, for every

f \in R^{N_{1} \dots N_{d}}

, we have that the set

C (A, f) : = {argmin}_{x \in S_{1}} J_{A, f} (x) \neq \emptyset .

(7)

This allows for considering the following iterative scheme. Let

u_{0} = y_{0} = 0,

and, for each

n \geq 1

, take

\begin{matrix} r_{n - 1} & = & f - A u_{n - 1}, \end{matrix}

(8)

\begin{matrix} u_{n} & = & u_{n - 1} + y_{n} where y_{n} \in C (A, r_{n - 1}) . \end{matrix}

(9)

Note that, given

A \in GL (N_{1} N_{2} \dots N_{d})

and

f \in R^{N_{1} \dots N_{d}}

, we can construct for each

n,

by using (8) and (9), a vector

u_{n} = \sum_{j = 1}^{n} y_{n} \in S_{n} \ S_{n - 1} .

Here, we assume that

y_{j} \neq 0

for

1 \leq j \leq n,

that is,

{rank}_{\otimes} u_{n} = n .

Since

u_{n} \approx A^{- 1} f,

we define the

{rank}_{\otimes}

for

A^{- 1} f

obtained by the Greedy Rank-One Update Algorithms (8) and (9) as

{rank}_{\otimes}^{G} (A^{- 1} f) = \{\begin{matrix} \infty & if & {j \geq 1 : y_{j} = 0} = \emptyset, \\ min {j \geq 1 : y_{j} = 0} - 1 & otherwise . \end{matrix}

The following theorem (see Theorem 1 in [8]) gives the convergence of the Greedy Rank-One Update Approximation for solving linear systems with full rank matrix.

Theorem 1.

Let

f \in R^{N_{1} N_{2} \dots N_{d}}

and

A \in GL (N_{1} N_{2} \dots N_{d}) .

Then, by using the iterative scheme (8) and (9), we obtain that the sequence

{∥ r_{n} {∥_{2}}}_{n = 0}^{{rank}_{\otimes}^{G} (A^{- 1} f)},

is strictly decreasing and

A^{- 1} f = lim_{n \to \infty} u_{n} = \sum_{j = 0}^{{rank}_{\otimes}^{G} (A^{- 1} f)} y_{j} .

(10)

Moreover, the rate of convergence is given by

\frac{∥ r_{n} ∥_{2}}{∥ r_{0} ∥_{2}} = \prod_{j = 1}^{n} sin θ_{j}

(11)

for

1 \leq n \leq {rank}_{\otimes}^{G} (A^{- 1} f)

where

θ_{j} = arccos (\frac{⟨r_{j - 1}, A y_{j}⟩}{∥ r_{j - 1} ∥_{2} {∥ A y_{j} ∥}_{2}}) \in (0, π / 2)

(12)

for

1 \leq j \leq n .

From (10), we obtain that, if

{rank}_{\otimes}^{G} (A^{- 1} f) < \infty,

then

∥ r_{n} ∥_{2} = 0

for all

n > {rank}_{\otimes}^{G} (A^{- 1} f) .

Thus, the above theorem allows for us to construct a procedure, which we give in the pseudo-code form in Algorithm 1, under the assumption that we have a numerical method in order to find a

y

solving (7) (see the step 5 in Algorithm 1) and that we introduce below.

Algorithm 1 Greedy Rank-One Update

1: procedure GROU (

f, A, ε, tol, rank_\max

)
2:

r_{0} = f

3:

u = 0

4: for

i = 0, 1, 2, \dots, rank_\max

do
5:

y = procedure ({min}_{x \in S_{1}} J_{A, r_{i}} (x))

6:

r_{i + 1} = r_{i} - A y

7:

u \leftarrow u + y

8: if

∥ r_{i + 1} ∥_{2} < ε

or

| ∥ r_{i + 1} ∥_{2} - ∥ r_{i} ∥_{2} | < tol

then goto 13
9:        end if
10:    end for
11:    return

u

and

∥ r_{rank_\max} ∥_{2} .

12: break
13: return

u

and

∥ r_{i + 1} ∥_{2}

14: end procedure

3. A Geometric Approach to the PGD

In this section, to study the procedure given in the line 5 of the Algorithm 1, we introduce a smooth manifold. To this end, introduce first the set of tensors of fixed rank-one in the tensor space

R^{N_{1} \dots N_{d}} = ⨂_{j = 1}^{d} R^{N_{j}}

defined as

M_{N_{1} \dots N_{d}} = \{u \in ⨂_{j = 1}^{d} R^{N_{j}} : u = λ ⨂_{j = 1}^{d} u_{j}, λ \in R_{*}, u_{j} \in R^{N_{j}} \ {0}, 1 \leq j \leq d\}

where

R_{*} = R \ {0} .

Observe that the set

S_{1} = {0} \cup M_{N_{1} \dots N_{d}} .

Then, our first result is the following theorem of the alternative.

Theorem 2.

Let

A \in GL (N_{1} N_{2} \dots N_{d})

and

f \in R^{N_{1} N_{2} \dots N_{d}} .

Either

C (A, f) \subset M_{N_{1} \dots N_{d}},

or

0 \in C (A, f)

but not both.

Proof.

Assume that

0 \in C (A, f)

and that there exists

u \in C (A, f) \cap M_{N_{1} \dots N_{d}} .

Since we can write

J_{A, f} (u) = \frac{1}{2} {∥ f ∥}_{2}^{2} - f^{T} A u + \frac{1}{2} {∥ A u ∥}_{2}^{2}

and

J_{A, f} (0) = J_{A, f} (u) = \frac{1}{2} {∥ f ∥}_{2}^{2} \leq J_{A, f} (u^{'})

holds for all

u^{'} \in M_{N_{1} \dots N_{d}},

we have

f^{T} A u = \frac{1}{2} {∥ A u ∥}_{2}^{2} > 0 .

Now, consider the map

f : R ⟶ R

defined as

f (λ) : = J_{A, f} (λ u) = \frac{1}{2} {∥ f ∥}_{2}^{2} - λ f^{T} A u + \frac{λ^{2}}{2} {∥ A u ∥}_{2}^{2} = \frac{1}{2} {∥ f ∥}_{2}^{2} - f^{T} A u λ + f^{T} A u λ^{2} .

Then,

f (0) = J_{A, f} (0) = J_{A, f} (u) = f (1) \leq f (λ)

for all

λ \in R

holds. Observe that

f^{'} (λ) = - f^{T} A u + 2 f^{T} A u λ

and hence the map f has a global minimum for

λ = \frac{1}{2},

a contradiction. □

The main consequence of this result is the following. It says that the output of the procedure given in step 5 in Algorithm 1 always remains in the set

M_{N_{1} \dots N_{d}}

before it gives us the final output.

Corollary 1.

Let

f \in R^{N_{1} N_{2} \dots N_{d}}

and

A \in GL (N_{1} N_{2} \dots N_{d})

such that

A^{- 1} f \notin S_{1} .

Then,

{rank}_{\otimes}^{G} (A^{- 1}) = i - 1

for some

i > 1

if and only if

C (A, r_{i}) = {0},

and hence

C (A, r_{j}) \subset M_{N_{1} \dots N_{d}}

for all

j < i - 1 .

As a consequence of the above corollary, the situation of interest is when given

A \in GL (N_{1} N_{2} \dots N_{d})

and

f \in R^{N_{1} N_{2} \dots N_{d}}

, we have that

C (A, f) \subset M_{N_{1} \dots N_{d}} .

Thus, in order to study the vectors in

C (A, f),

we need to characterize the structure of the critical points of the map

u \mapsto ∥ f - A u ∥

restricted to the set

M_{N_{1} \dots N_{d}} .

To see this in the next section, we provide to

M_{N_{1} \dots N_{d}}

of a structure of smooth manifold.

4. The Set of Tensors of Fixed Rank-One as a Smooth Manifold

Along this paper, we will consider a manifold as a pair

(M, A)

, where

M

is a subset of some finite-dimensional vector space V and

A

is an atlas representing the local coordinate system of

M .

We recall the definition of an atlas associated with a set

M .

Definition 1.

Let

M

be a set. An atlas of class

C^{p} (p \geq 0)

or analytic on

M

is a family of charts with some indexing set

A,

namely

{(U_{α}, φ_{α}) : α \in A},

having the following properties (see [14]):

AT1: ${U_{α}}_{α \in A}$ is a covering of $M,$ that is, $U_{α} \subset M$ for all $α \in A$ and $\cup_{α \in A} U_{α} = M .$
AT2: For each $α \in A, (U_{α}, φ_{α})$ stands for a bijection $φ_{α} : U_{α} \to W_{α}$ of $U_{α}$ onto an open set $W_{α}$ of a finite dimensional normed space $(X_{α}, ∥ \cdot ∥_{α}),$ and for any α and β the set $φ_{α} (U_{α} \cap U_{β})$ is open in $X_{α} .$
AT3: Finally, if we let $U_{α} \cap U_{β} = U_{α, β}$ and $φ_{α} (U_{α, β}) = U_{α, β},$ the transition mapping $φ_{β} \circ φ_{α}^{- 1} : U_{α, β} \to U_{β, α}$ is a diffeomorphism of class $C^{p} (p \geq 0)$ or analytic.

Observe that the condition of an open covering is not used, see [14]. Moreover, in AT2, we do not require that the normed spaces to be the same for all indices

α,

or even to be isomorphic. If

X_{α}

is linearly isomorphic to some finite dimensional normed space X for all

α,

we have the following definition.

Definition 2.

Let

M

be a set and X be a finite dimensional normed space. We say that

M

is a

C^{p}

(respectively, analytic) manifold modelled on X if there exists an atlas of class

C^{p}

(respectively, analytic) over

M

with

X_{α}

linearly isomorphic to X for all

α \in A .

Since different atlases can give the same manifold, we say that two atlases are compatible if each chart of one atlas is compatible with the charts of the other atlas in the sense of AT3. One verifies that the relation of compatibility between atlases is an equivalence relation.

Definition 3.

An equivalence class of atlases of class

C^{p}

on

M,

also denoted by

A,

is said to define a structure of a

C^{p}

-manifold on

M,

and hence we say that

(M, A)

is a finite dimensional manifold. In a similar way, if an equivalence class of atlases is given by analytic maps, then we say that

(M, A)

is an analytic finite dimensional manifold.

For each

u = λ ⨂_{j = 1}^{d} u_{j} \in M_{N_{1} \dots N_{d}}

, we construct a local chart as follows. Let be

span {u_{j}}^{⊥} = {v_{j} \in R^{N_{j}} : v_{j}^{T} \cdot u_{j} = 0},

the orthogonal complement of the linear space

span {u_{j}}

for

1 \leq j \leq d .

Let us consider the set

U_{u} : = span {u_{1}}^{⊥} \times \dots \times span {u_{d}}^{⊥} \times R_{*},

which is an open and dense set of the finite-dimensional vector space

X_{u} : = span {u_{1}}^{⊥} \times \dots \times span {u_{d}}^{⊥} \times R .

Observe that the vector space

X_{u}

is linearly isomorphic to the vector space

R^{N_{1} - 1} \times \dots \times R^{N_{d} - 1} \times R

for all

u \in M_{N_{1} \dots N_{d}} .

Now, we introduce the set

U (u) : = \{u^{'} \in M_{N_{1} \dots N_{d}} : u^{'} = β ⨂_{j = 1}^{d} (u_{j} + w_{j}), \begin{matrix} w_{j} \in span {u_{j}}^{⊥}, 1 \leq j \leq d \\ β \in R_{*} \end{matrix}\}

in

M_{N_{1} \dots N_{d}}

for which we can construct a natural bijection:

φ_{u} : U (u) ⟶ U_{u}, u^{'} \mapsto (w_{1}, \dots, w_{d}, β) \Leftrightarrow u^{'} = β ⨂_{j = 1}^{d} (u_{j} + w_{j}) .

Then, we can state the following result.

Theorem 3.

The set

A_{N_{1} \dots N_{d}} = {(U (u), φ_{u}) : u \in M_{N_{1} \dots N_{d}}}

is an atlas for

M_{N_{1} \dots N_{d}}

and hence

(M_{N_{1} \dots N_{d}}, A_{N_{1} \dots N_{d}})

is a

C^{\infty}

-manifold modelled on

R^{N_{1} - 1} \times \dots \times R^{N_{d} - 1} \times R ≅ R^{{log}_{2} (2^{N_{1} \dots N_{d}}) - d + 1} .

Proof.

Clearly, AT1 holds. To prove AT2 and AT3, let us consider

u, u^{'} \in M_{N_{1} \dots N_{d}}

be such that

U (u) \cap U (u^{'}) \neq \emptyset .

Without loss of generality we may assume that

u = λ ⨂_{j = 1}^{d} u_{i}

and

u^{'} = λ^{'} ⨂_{j = 1}^{d} u_{i}^{'}

, where

{∥ u_{i} ∥}_{i, 2} = {∥ u_{i}^{'} ∥}_{i, 2} = 1

for

1 \leq i \leq d .

Then, for each

z \in U (u) \cap U (u^{'})

, there exists a unique

(w_{1}, \dots, w_{d}, β) \in U_{u}

and a unique

(w_{1}^{'}, \dots, w_{d}^{'}, β^{'}) \in U_{u^{'}}

such that

z = β ⨂_{j = 1}^{d} (u_{j} + w_{j}) = β^{'} ⨂_{j = 1}^{d} (u_{j}^{'} + w_{j}^{'}) .

Since

span {u_{j} + w_{j}} = span {u_{j}^{'} + w_{j}^{'}}

holds, there exists a unique

λ_{j} \in R_{*}

such that

\begin{matrix} u_{j} + w_{j} = λ_{j} (u_{j}^{'} + w_{j}^{'}) for 1 \leq j \leq d . \end{matrix}

(13)

Thus, multiplying (13) on the left side by

u_{j}^{T}

and, by using that

u_{j}^{T} u_{j} = 1

and

u_{j}^{T} w_{j} = 0,

we obtain

λ_{j} = \frac{1}{u_{j}^{T} u_{j}^{'} + u_{j}^{T} w_{j}^{'}} for 1 \leq j \leq d .

Hence,

w_{j} = \frac{u_{j}^{'} + w_{j}^{'}}{u_{j}^{T} u_{j}^{'} + u_{j}^{T} w_{j}^{'}} - u_{j}

defines a

C^{\infty}

-function from the open set

V_{j} = {w_{j} \in span {u_{j}^{'}}^{⊥} : u_{j}^{T} u_{j}^{'} + u_{j}^{T} w_{j}^{'} \neq 0}

to

span {u_{j}}^{⊥}

for each

1 \leq j \leq d .

Moreover,

1 + ∥ w_{j} ∥_{j, 2}^{2} = {∥\frac{u_{j}^{'} + w_{j}^{'}}{u_{j}^{T} u_{j}^{'} + u_{j}^{T} w_{j}^{'}}∥}_{j, 2}^{2}

holds for

1 \leq j \leq d .

Observe that

z

can be written as

\begin{matrix} z & = β ⨂_{j = 1}^{d} (u_{j} + w_{j}) = (β \prod_{j = 1}^{d} \sqrt{1 + ∥ w_{j} ∥_{j, 2}^{2}}) ⨂_{j = 1}^{d} z_{j}, \end{matrix}

where

z_{j} : = \frac{1}{\sqrt{1 + ∥ w_{j} ∥_{j, 2}^{2}}} u_{j} + \frac{1}{\sqrt{1 + ∥ w_{j} ∥_{j, 2}^{2}}} w_{j}

has norm one for

1 \leq j \leq d .

In addition,

z = (β^{'} \prod_{j = 1}^{d} \sqrt{1 + ∥ w_{j}^{'} ∥_{j, 2}^{2}}) ⨂_{j = 1}^{d} z_{j}^{'},

where

z_{j}^{'} : = \frac{1}{\sqrt{1 + ∥ w_{j}^{'} ∥_{j, 2}^{2}}} u_{j}^{'} + \frac{1}{\sqrt{1 + ∥ w_{j}^{'} ∥_{j, 2}^{2}}} w_{j}^{'}

has norm one for

1 \leq j \leq d .

Thus,

β = β^{'} \frac{\prod_{j = 1}^{d} \sqrt{1 + ∥ w_{j}^{'} ∥_{j, 2}^{2}}}{\prod_{j = 1}^{d} \sqrt{1 + ∥ w_{j} ∥_{j, 2}^{2}}} = β^{'} \frac{\prod_{j = 1}^{d} \sqrt{1 + ∥ w_{j}^{'} ∥_{j, 2}^{2}}}{\prod_{j = 1}^{d} {∥\frac{u_{j}^{'} + w_{j}^{'}}{u_{j}^{T} u_{j}^{'} + u_{j}^{T} w_{j}^{'}}∥}_{j, 2}}

clearly defines a

C^{\infty}

-function from the open set

V_{1} \times \dots \times V_{d} \times R_{*} \subset U_{u^{'}}

to

U_{u} .

Finally, we conclude that

φ_{u^{'}} (U (u) \cap U (u^{'})) = V_{1} \times \dots \times V_{d} \times R_{*} \subset U_{u^{'}},

the map

φ_{u} \circ φ_{u^{'}}^{- 1} : φ_{u^{'}} (U (u) \cap U (u^{'})) ⟶ φ_{u} (U (u) \cap U (u^{'}))

is given by

(w_{1}^{'}, \dots, w_{d}^{'}, β^{'}) \mapsto (\frac{u_{1}^{'} + w_{1}^{'}}{u_{1}^{T} u_{1}^{'} + u_{1}^{T} w_{1}^{'}} - u_{1}, \dots, \frac{u_{d}^{'} + w_{d}^{'}}{u_{d}^{T} u_{d}^{'} + u_{d}^{T} w_{d}^{'}} - u_{d}, β^{'} \frac{\prod_{j = 1}^{d} \sqrt{1 + ∥ w_{j}^{'} ∥_{j, 2}^{2}}}{\prod_{j = 1}^{d} {∥\frac{u_{j}^{'} + w_{j}^{'}}{u_{j}^{T} u_{j}^{'} + u_{j}^{T} w_{j}^{'}}∥}_{j, 2}},)

and it is

C^{\infty} .

This follows AT2, AT3 and concludes the proof of the theorem. □

The construction of

M_{N_{1} \dots N_{d}}

as an algebraic variety is well-known (see, for example, [15]). More recently, in [16], a structure of smooth manifold is given, in the framework of Banach spaces, for the set of tensors of fixed rank-one. Following [16], it can be shown that the manifold

M_{N_{1} \dots N_{d}}

is also a principal bundle as follows. Consider the Grassmann manifold of one-dimensional, subspaces of

R^{N_{j}},

denoted by

G_{1} (R^{N_{j}}),

for

1 \leq j \leq d

and define the surjective map

π : M_{N_{1} \dots N_{d}} ⟶ G_{1} (R^{N_{1}}) \times \dots \times G_{1} (R^{N_{d}}), u = λ ⨂_{j = 1}^{d} u_{j} \mapsto {(span {u_{j}})}_{j = 1}^{d}

Then, for each

u = λ ⨂_{j = 1}^{d} u_{j} \in M_{N_{1} \dots N_{d}}

, it holds that

π^{- 1} (π (u)) = ⨂_{j = 1}^{d} span {u_{j}} \ {0} = span {u} \ {0} ≅ R_{*} .

Consequently,

M_{N_{1} \dots N_{d}}

is also a principal bundle with base space

G_{1} (R^{N_{1}}) \times \dots \times G_{1} (R^{N_{d}})

and fibre

R_{*} .

It allows for decomposing the tangent space at

u \in M_{N_{1} \dots N_{d}},

denoted

T_{u} M_{N_{1} \dots N_{d}}

into the vertical and horizontal spaces:

\begin{matrix} T_{u} M_{N_{1} \dots N_{d}} & = X_{u} = H_{u} M_{N_{1} \dots N_{d}} + V_{u} M_{N_{1} \dots N_{d}}, \end{matrix}

where

H_{u} M_{N_{1} \dots N_{d}} . : = span {u_{1}}^{⊥} \times \dots \times span {u_{d}}^{⊥}

and

V_{u} M_{N_{1} \dots N_{d}} : = R .

5. On the First Order Optimality Conditions for the PGD

The goal of this section is given

A \in GL (N_{1} N_{2} \dots N_{d})

and

f \in R^{N_{1} N_{2} \dots N_{d}}

characterize the points in the manifold

M_{N_{1} \dots N_{d}}

satisfying the first order optimality conditions of the problem

\begin{matrix} min_{z \in M_{N_{1} \dots N_{d}}} J_{A, f} (z) . \end{matrix}

(14)

Recall that the map

J_{A, f}

is defined in the whole ambient space

R^{N_{1}} \otimes \dots \otimes R^{N_{d}} .

We will denote its derivative at

u \in R^{N_{1}} \otimes \dots \otimes R^{N_{d}}

by

J_{A, f}^{'} (u) = {(f - A u)}^{T},

which is a bounded linear map from

R^{N_{1}} \otimes \dots \otimes R^{N_{d}}

to

R .

From Theorem 3, we known that

M_{N_{1} \dots N_{d}}

is a

C^{\infty}

-manifold and hence it allows us to write the constrained map

J_{A, f} |_{M_{N_{1} \dots N_{d}}}

as follows. Since

J_{A, f} : R^{N_{1}} \otimes \dots \otimes R^{N_{d}} ⟶ R,

and

M_{N_{1} \dots N_{d}} \subset R^{N_{1}} \otimes \dots \otimes R^{N_{d}},

we can take into account the standard inclusion map

i : M_{N_{1} \dots N_{d}} ⟶ R^{N_{1}} \otimes \dots \otimes R^{N_{d}}, z \mapsto z

in order to write

J_{A, f} |_{M_{N_{1} \dots N_{d}}} = (J_{A, f} \circ i) .

Definition 4.

We say that

u \in M_{N_{1} \dots N_{d}}

is a critical point for

J_{A, f}

in

M_{N_{1} \dots N_{d}}

if

T_{u} J_{A, f} (v) : = [D (J_{A, f} \circ φ_{u}^{- 1}) (φ_{u} (u))] (v) = 0

holds for all

v \in T_{u} M_{N_{1} \dots N_{d}} .

Clearly, if

u

is an extremal point for

J_{A, f}

in

M_{N_{1} \dots N_{d}}

, then it is also a critical point for

J_{A, f}

in

M_{N_{1} \dots N_{d}} .

Observe that we can write

J_{A, f} \circ φ_{u}^{- 1} = J_{A, f} \circ (i \circ φ_{u}^{- 1}),

where on the left side of the equality, we consider

J_{A, f}

over

M_{N_{1} \dots N_{d}}

, whereas, in the right one,

J_{A, f}

is considered defined over the whole space. Thus, by using the chain rule, we have

\begin{matrix} D (J_{A, f} \circ φ_{u}^{- 1}) (φ_{u} (u)) & = D (J_{A, f} \circ (i \circ φ_{u}^{- 1})) (φ_{u} (u)) \\ = J_{A, f}^{'} (u) \circ D (i \circ φ_{u}^{- 1}) (φ_{u} (u)), \end{matrix}

that is,

\begin{matrix} T_{u} J_{A, f} = J_{A, f}^{'} (u) \circ T_{u} i = {(f - A u)}^{T} T_{u} i . \end{matrix}

(15)

In order to compute (15), we consider first the standard inclusion map i that in local coordinates in a neighbourhood of

u = λ ⨂_{j = 1}^{d} u_{j} \in M_{N_{1} \dots N_{d}}

looks like

(i \circ φ_{u}^{- 1}) (w_{1}, \dots, w_{d}, β) = β ⨂_{j = 1}^{w} (u_{j} + w_{j}) .

Here,

(i \circ φ_{u}^{- 1}) : U_{u} ⟶ R^{N_{1}} \otimes \dots \otimes R^{N_{d}} .

Hence, its derivative as a morphism (a map between manifolds)

T_{u} i = D (i \circ φ_{u}^{- 1}) (0, \dots, 0, λ) : T_{u} M_{N_{1} \dots N_{d}} ⟶ R^{N_{1}} \otimes \dots \otimes R^{N_{d}},

is given by

T_{u} i (β, w_{1}, \dots, w_{d}) = \sum_{j = 1}^{d} w_{j} \otimes u_{[j]} + \frac{β}{λ} u,

where

w_{j} \otimes u_{[j]} : = λ u_{1} \otimes \dots \otimes u_{j - 1} \otimes w_{j} \otimes u_{j + 1} \otimes \dots \otimes u_{d} .

for

1 \leq j \leq d .

From (15), we have that

u^{*} = λ^{*} ⨂_{j = 1}^{d} u_{i}^{*}

is a critical point for

J_{A, f}

in

M_{N_{1} \dots N_{d}}

if and only if

\begin{matrix} {(f - A u^{*})}^{T} T_{u^{*}} i (w_{1}, \dots, w_{d}, β) = 0, holds for all (w_{1}, \dots, w_{d}, β) \in T_{u^{*}} M_{N_{1} \dots N_{d}}, \end{matrix}

(16)

that is, it is equivalent to state that

{(f - A u^{*})}^{T} (\sum_{j = 1}^{d} w_{j} \otimes u_{[j]}^{*} + \frac{β}{λ} u^{*}) = 0, holds for all (w_{1}, \dots, w_{d}, β) \in T_{u^{*}} M_{N_{1} \dots N_{d}} .

(17)

Now, we will prove the following result that characterizes the set of critical points for

J_{A, f}

in

M_{N_{1} \dots N_{d}}

as the stationary points of a vector field in

R^{N_{1}} \times \dots \times R^{N_{d}}

, and it is the main result of this paper.

Theorem 4.

Given

A \in GL (N_{1} N_{2} \dots N_{d})

and

f \in R^{N_{1} N_{2} \dots N_{d}} .

Then,

u^{*} = ⨂_{j = 1}^{d} u_{i}^{*} \in M_{N_{1} \dots N_{d}}

is a critical point for

J_{A, f}

in

M_{N_{1} \dots N_{d}}

if and only if

X_{A, f} (u_{1}^{*}, \dots, u_{d}^{*}) = 0

, where

X_{A, f}

is a vector field in

M_{N_{1} \dots N_{d}}

given by

X_{A, f} (u_{1}, \dots, u_{d}) = (y_{1}, \dots, y_{d}) .

Here,

y_{j} : = {(u_{1} \otimes \dots \otimes u_{j - 1} \otimes i d_{R^{N_{j}}} \otimes u_{j + 1} \otimes \dots \otimes u_{d})}^{T} (f - A (u_{1} \otimes \dots \otimes u_{d}))

for

1 \leq j \leq d .

Proof.

Since

u^{*} = ⨂_{j = 1}^{d} u_{i}^{*} \in M_{N_{1} \dots N_{d}}

is a critical point for

J_{A, f}

in

M_{N_{1} \dots N_{d}}

if and only if (17) holds for

(w_{1}, \dots, w_{d}, β) \in T_{u^{*}} M_{N_{1} \dots N_{d}} .

Then, (17) is equivalent to prove that

\begin{matrix} {(f - A u^{*})}^{T} u^{*} = 0, \end{matrix}

(18)

and

\begin{matrix} {(f - A u^{*})}^{T} (w_{j} \otimes u_{[j]}^{*}) = 0 holds for all w_{j} \in span {u_{j}}^{⊥} and 1 \leq j \leq d . \end{matrix}

(19)

Since

w_{j} \otimes u_{[j]}^{*} = u^{*}

when

w_{j} = u_{j}^{*}

for

1 \leq j \leq d,

we conclude that

u^{*} = ⨂_{j = 1}^{d} u_{i}^{*} \in M_{N_{1} \dots N_{d}}

is a critical point for

J_{A, f}

in

M_{N_{1} \dots N_{d}}

if and only if

\begin{matrix} {(f - A u^{*})}^{T} (w_{j} \otimes u_{[j]}^{*}) = 0 holds for all w_{j} \in R^{N_{j}} and 1 \leq j \leq d . \end{matrix}

(20)

Writing the equality in (20) as

{(f - A u_{1}^{*} \otimes \dots \otimes u_{d}^{*})}^{T} (u_{1}^{*} \otimes \dots \otimes u_{j - 1}^{*} \otimes i d_{R^{N_{j}}} \otimes u_{j + 1}^{*} \otimes \dots \otimes u_{d}^{*}) w_{j} = 0,

we conclude that it is equivalent to state that

{(f - A u_{1}^{*} \otimes \dots \otimes u_{d}^{*})}^{T} (u_{1}^{*} \otimes \dots \otimes u_{j - 1}^{*} \otimes i d_{R^{N_{j}}} \otimes u_{j + 1}^{*} \otimes \dots \otimes u_{d}^{*}) = 0^{T}

for

1 \leq j \leq d .

This concludes the proof of theorem. □

Remark 1.

From (17), we can deduce that

u^{*} = ⨂_{j = 1}^{d} u_{i}^{*}

is a critical point for

J_{A, f}

in

M_{N_{1} \dots N_{d}}

if and only if

\begin{matrix} {(f - A u^{*})}^{T} (\sum_{j = 1}^{d} w_{j} \otimes u_{[j]}^{*}) = 0 holds for all (w_{1}, \dots, w_{d}) \in R^{N_{1}} \times \dots \times R^{N_{d}} . \end{matrix}

(21)

Statement (21) was first introduced in [2,3] as a step to enrich the approximation basis in the PGD algorithm. From the proof of Theorem 4, we have that (21) is also equivalent to (18) and (19).

There are several strategies in the literature that can be used to solve

X_{A, f} (u) = 0

. The first one, closely related to the one used in [2,3], is to find a fixed point of the map

F_{A, f} (u_{1}, \dots, u_{d}) : = X_{A, f} (u_{1}, \dots, u_{d}) - (u_{1}, \dots, u_{d})

One of the most popular numerical strategies to compute an approximated value of

u^{*}

such that

X_{A, f} (u^{*}) = 0

is founded under the following argument. Since the

y_{1}

is equal to

\begin{matrix} {(i d_{R^{N_{1}}} \otimes u_{2} \otimes \dots \otimes u_{d})}^{T} (f - A u_{1} \otimes \dots \otimes u_{d}) = \\ {(i d_{R^{N_{1}}} \otimes u_{2} \otimes \dots \otimes u_{d})}^{T} f - {(i d_{R^{N_{1}}} \otimes u_{2} \otimes \dots \otimes u_{d})}^{T} A (u_{1} \otimes \dots \otimes u_{d}) = \\ {(i d_{R^{N_{1}}} \otimes u_{2} \otimes \dots \otimes u_{d})}^{T} f - {(i d_{R^{N_{1}}} \otimes u_{2} \otimes \dots \otimes u_{d})}^{T} A (i d_{R^{N_{1}}} \otimes u_{2} \dots \otimes u_{d}) u_{1}, \end{matrix}

we can choose randomly some vectors

u_{j} \in R^{N_{j}}

for

2 \leq j \leq d

and then try to solve the linear system

{(i d_{R^{N_{1}}} \otimes u_{2} \otimes \dots \otimes u_{d})}^{T} A (i d_{R^{N_{1}}} \otimes u_{2} \dots \otimes u_{d}) u_{1} = {(i d_{R^{N_{1}}} \otimes u_{2} \otimes \dots \otimes u_{d})}^{T} f .

By using Least Squares, we can compute a

u_{1}^{'},

and next we can proceed in a similar way iteratively with each of the other components

y_{2}, \dots, y_{d}

as follows. After the step

i \geq 1

, we know that

u_{1}^{'}, u_{2}^{'}, \dots, u_{i}^{'} .

Then, by choosing randomly

u_{i + 1}, \dots u_{d}

, we can solve, by using Least Squares, the linear system

\begin{matrix} {(u_{1}^{'} \otimes \dots \otimes u_{i}^{'} \otimes i d_{R^{N_{i}}} \otimes u_{i + 1} \otimes \dots \otimes u_{d})}^{T} A (i d_{R^{N_{1}}} \otimes u_{2} \dots \otimes u_{d}) u_{i} \\ = {(u_{1}^{'} \otimes \dots \otimes u_{i}^{'} \otimes i d_{R^{N_{i}}} \otimes u_{i + 1} \otimes \dots \otimes u_{d})}^{T} f, \end{matrix}

to compute

u_{i}^{'} .

We can proceed cyclically until

∥ f - A (u_{1}^{'} \otimes \dots \otimes u_{d}^{'}) ∥ < ε

holds. This strategy is known by the name of the Alternating Least Squares (ALS).

From (17), we have that

u^{*}

is a critical point of

J_{A, f}

in

M_{N_{1} \dots N_{d}}

if and only if

\begin{matrix} (f - A u^{*}) ⊥ T_{u^{*}} i (T_{u^{*}} M_{N_{1} \dots N_{d}}) \end{matrix}

(22)

holds, that is, the residual at

u^{*}

is orthogonal to linear subspace

T_{u^{*}} i (T_{u^{*}} M_{N_{1} \dots N_{d}}) .

Observe that

w_{j} \otimes u_{[j]} \in V_{j} (u),

where

V_{j} (u) : = span {u_{1}} \otimes \dots \otimes span {u_{j - 1}} \otimes span {u_{j}}^{⊥} \otimes span {u_{j + 1}} \otimes \dots \otimes span {u_{d}}

is an

N_{j} - 1

-dimensional subspace of

R^{N_{1}} \otimes \dots \otimes R^{N_{d}}

for

1 \leq j \leq d .

Then, the next result explicitly describes the linear subspace

T_{u^{*}} i (T_{u^{*}} M_{N_{1} \dots N_{d}}) .

Proposition 1.

The linear map

T_{u} i : T_{u} M_{N_{1} \dots N_{d}} ⟶ R^{N_{1}} \otimes \dots \otimes R^{N_{d}}

is injective. Moreover,

T_{u} i (T_{u} M_{N_{1} \dots N_{d}}) = ⨁_{j = 1}^{d} V_{j} (u) \oplus span {u} ≅ T_{u} M_{N_{1} \dots N_{d}} .

Proof.

Assume that

T_{u} i (β, w_{1}, \dots, w_{d}) = \sum_{j = 1}^{d} w_{j} \otimes u_{[j]} + \frac{β}{λ} u = 0 .

Since

u ⊥ (w_{j} \otimes u_{[j]})

for

1 \leq j \leq d

and

(w_{i} \otimes u_{[j]}) ⊥ (w_{j} \otimes u_{[j]})

for all

i \neq j,

then

β = 0 and w_{i} \otimes u_{[j]} = 0 for 1 \leq j \leq d .

Since

u_{[j]} \neq 0

, then

w_{j} = 0

for

1 \leq j \leq d .

This follows the first statement. To prove the second one, we remark that the inclusion

T_{u} i (T_{u} M_{N_{1} \dots N_{d}}) \subset ⨁_{j = 1}^{d} V_{j} (u) \oplus span {u}

between both subspaces is trivial. Clearly,

dim T_{u} i (T_{u} M_{N_{1} \dots N_{d}}) = dim (⨁_{j = 1}^{d} V_{j} (u) \oplus span {u})

holds and then the second statement is also proved. □

6. Conclusions

In this paper, a geometric upgrade of the PGD algorithm is proposed. To this end, we endow a set of tensors of fixed rank-one with a smooth manifold structure. This construction provides the solutions of a non-convex optimization problem by using a set of stationary points of a vector field. This new perspective allows for the characterization of the behaviour of the solutions of the Greedy Rank One Algorithm which could not be achieved with the ALS strategy used in [8].

Author Contributions

Investigation, writing and revision: A.F., L.H., N.M., M.C.M. and E.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the GVA/2019/124 grant from Generalitat Valenciana and by the RTI2018-093521-B-C32 grant from the Ministerio de Ciencia, Innovación y Universidades.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

Falcó, A.; Montés, N.; Chinesta, F.; Hilario, L.; Mora, M.C. On the existence of a Progressive Variational Vademecum based on the Proper Generalized Decomposition for a Class of Elliptic Parametrized Problems. J. Comput. Appl. Math. 2018, 330, 1093–1107. [Google Scholar] [CrossRef]
Ammar, A.; Mokdad, B.; Chinesta, F.; Keunings, R. A new family of solvers for some classes of multidimensional partial differential equations encountered in Kinetic Theory modelling Complex Fluids. J. Non-Newton. Fluid Mech. 2006, 139, 153–176. [Google Scholar] [CrossRef]
Ammar, A.; Mokdad, B.; Chinesta, F.; Keunings, R. A new family of solvers for some classes of multidimensional partial differential equations encountered in Kinetic Theory modelling Complex Fluids. Part II: Transient Simulations using Space-Time Separated Representations. J. Non–Newton. Fluid Mech. 2007, 144, 98–121. [Google Scholar] [CrossRef]
Falcó, A.; Nouy, A. A Proper Generalized Decomposition for the solution of elliptic problems in abstract form by using a functional Eckart- Young approach. J. Math. Anal. Appl. 2011, 376, 469–480. [Google Scholar] [CrossRef]
Chinesta, F.; Leygue, A.; Bordeu, F.; Aguado, J.V.; Cueto, E.; González, D.; Alfaro, I.; Ammar, A.; Huerta, A. PGD-Based computational vademecum for efficient Design, Optimization and Control. Arch. Comput. Methods Eng. 2013, 20, 31–49. [Google Scholar] [CrossRef]
Chinesta, F.; Ladevéze, P. Separated Representations and PGD-Based Model Reduction; Springer International Centre for Mechanical Sciences: Cham, Switzerland, 2014. [Google Scholar]
Falcó, A.; Nouy, A. Proper Generalized Decomposition for Nonlinear Convex Problems in Tensor Banach Spaces. Numer. Math. 2012, 121, 503–530. [Google Scholar] [CrossRef]
Ammar, A.; Chinesta, F.; Falcó, A. On the convergence of a greedy rank-one update algorithm for a class of linear systems. Arch. Comput. Methods Eng. 2010, 17, 473–486. [Google Scholar] [CrossRef]
El Hamidi, A.; Osman, H.; Jazar, M. On the convergence of alternating minimization methods in variational PGD. Comput. Optim. Appl. 2017, 68, 455–472. [Google Scholar] [CrossRef]
Graham, A. Kronecker Products and Matrix Calculus with Applications; John Wiley: Hoboken, NJ, USA, 1981. [Google Scholar]
Van Loan, C.F. The ubiquitous Kronecker product. J. Comput. Appl. Math. 2000, 123, 85–100. [Google Scholar] [CrossRef]
Beylkin, G.; Mohlenkamp, M.J. Algorithms for Numerical Analysis in High Dimensions. SIAM J. Sci. Comput. 2005, 26, 2133–2159. [Google Scholar] [CrossRef]
de Silva, V.; Lim, L.-H. Tensor Rank and Ill-posedness of the best low-rank approximation problem. SIAM J. Matrix Anal. Appl. 2008, 30, 1084–1127. [Google Scholar] [CrossRef]
Lang, S. Differential and Riemannian Manifolds; Graduate Texts in Mathematics; Springer: Berlin/Heidelberg, Germany, 1995; Volume 160. [Google Scholar]
Landsberg, J.M. Tensor: Geometry and Applications. Graduate Studies in Mathematics; American Mathematical Society: Providence, Rhode Island, 2012; Volume 128. [Google Scholar]
Falcó, A.; Hackbusch, W.; Nouy, A. On the Dirac-Frenkel Variational Principal on Tensor Banach Spaces. Found. Comput. Math. 2019, 19, 159–204. [Google Scholar] [CrossRef]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Towards a Vector Field Based Approach to the Proper Generalized Decomposition (PGD)

Abstract

1. Introduction

2. Preliminary Definitions and Results

3. A Geometric Approach to the PGD

4. The Set of Tensors of Fixed Rank-One as a Smooth Manifold

5. On the First Order Optimality Conditions for the PGD

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics