1. Introduction
Spectral theory of dynamical systems shifts the focus of investigation away from trajectories in the state space and towards spectral features of an associated infinite-dimensional linear operator. Of particular interest is the composition operator—in a measure-preserving setting called the Koopman operator [1,2,3,4,5]. Its spectral triple—eigenvalues, eigenfunctions and eigenmodes—can be used in a variety of contexts, from model reduction [5] to stability and control [6]. In practice, we only have access to finite-dimensional data from observations or outputs of numerical simulations. Thus, it is important to study approximation properties of finite-dimensional numerical algorithms devised to compute spectral objects [7]. Compactness is the property that imbues infinite-dimensional operators with quasi-finite-dimensional properties. Self-adjointness also helps in proving approximation results. However, the composition operators under study here are rarely compact or self-adjoint. In addition, in the classical, measure-preserving case, the setting is that of unitary operators (and essentially self-adjoint generators in the continuous-time setting [8]), but in the general, dissipative case, composition operators are neither.
There are three main approaches to finding spectral objects of the Koopman operator:
- 1. The first, suggested already in [9], is based on long-time weighted averages over trajectories, rooted in the ergodic theory of measure-preserving dynamical systems. An extension of that work that captures properties of the continuous spectrum was presented in [10]. This approach was named Generalized Laplace Analysis (GLA) in [11], where concepts pertaining to dissipative systems were also discussed in terms of weighted averages along trajectories. In that sense, the ideas in this context provide an extension of ergodic theory for capturing transient (off-attractor) properties of systems. For on-attractor evolution, the properties of the method acting on functions were studied in [4]. The off-attractor case was pursued in [4,12], where Fourier averages (which are Laplace averages for the case when the eigenvalue considered is on the imaginary axis) were used to compute the eigenfunction whose level sets are isochrons, and in [13], in which analysis for general eigenvalue distributions was pursued in Hardy-type spaces. This study was continued in [14] to construct dynamics-adapted Hilbert spaces. The advantage of the method is that it does not require the approximation of the operator itself, as it constructs eigenfunctions and eigenmodes directly from the data. In this sense, it is close (and in fact related) to the power method of approximating the spectrum of a matrix from data on iteration of a vector. In fact, the methodology extends the power method to the case when eigenvalues can be of magnitude 1. It requires a separate computation to first determine the spectrum of the operator, which is also done without constructing it. This can potentially be hard to do because of issues such as spectral pollution—see the remarks at the end of Section 3; also note that the general long-standing problems of spectral pollution and computing the full spectrum of Schrödinger operators on a lattice were recently solved in [15]. The recent work [16] enables computation of full spectral measures using the combination of resolvent operator techniques (used for the first time in the Koopman operator context in [17]) and ResDMD—an extension of the Dynamic Mode Decomposition (introduced next) technique that incorporates computation of residues from data snapshots (computation of residues was considered earlier in [18]).
- 2. The second approach requires construction of an approximate operator acting on a finite-dimensional function subspace, i.e., a finite section—a problem that is also of concern in the more general context of approximating infinite-dimensional operators [7,19,20]. The best known such method is the Dynamic Mode Decomposition (DMD), invented in [21] and connected to the Koopman operator in [22]. It has a number of extensions (many of which are summarized in [23]), for example, Exact DMD [24]; Bayesian/subspace DMD [25]; Optimized DMD [26,27]; Recursive DMD [28]; Variational DMD [29]; DMD with control [30,31]; sparsity-promoting DMD [32]; DMD for noisy systems [33,34,35]. The original DMD algorithm featured state observables. The Extended Dynamic Mode Decomposition [36] recognizes that nonlinear functions of state might be necessary to describe a finite-dimensional invariant subspace of the Koopman operator and provides an algorithm for finite-section approximation of the Koopman operator. A study of convergence of such approximations is provided in [37], but the convergence was established only along subsequences, and the rate of convergence was not addressed. Here, we provide the first result on the rate of convergence of the finite section approximation under assumptions on the nature of the underlying dynamics. In addition, spectral convergence along subsequences is proven in [37] under the assumption that the weak limit of eigenfunction approximations is not zero. This condition is hard to verify in practice. Instead, in Section 5.2, we prove a result that obviates the weak convergence assumption using some additional information on the underlying dynamics. It was observed already in [9] that, instead of an arbitrary set of observables forming a basis, one can use observables generated by the dynamics—namely, time delays of a single observable filling a Krylov subspace—to study spectral properties of the Koopman operator. In the DMD context, the methods developed in this direction are known under the name Hankel-DMD [38,39]. It is worth noting that the Hankel matrix approach of [38] is in fact based on the Prony approximation and requires a different sample structure than the Dynamic Mode Decomposition. Computation of residues was considered in [18] to address the problem of spectral pollution, where discretization introduces spurious eigenvalues. As mentioned before, the recent work [16] provides another method to resolve the spectral pollution problem, introducing ResDMD—an extension of Dynamic Mode Decomposition that incorporates computation of residues from data snapshots. The relationship between GLA and finite section methods was studied in [40].
- 3. The third approach is based on the kernel integral operator combined with the Krylov subspace methodology [41], enabling approximation of the continuous spectrum. While GLA and EDMD techniques have been extended to dissipative systems, the kernel integral operator technique is currently available only for measure-preserving (on-attractor) systems.
In this paper, we continue the development of ergodic-theory-rooted ideas for understanding and numerically computing the spectral triple—eigenvalues, eigenfunctions and modes—of the Koopman operator. After some preliminaries, we start in Section 3 by discussing properties of algorithms of Generalized Laplace Analysis type in Banach spaces. Such results have previously been obtained in Hardy-type spaces [13], and here, we introduce a Gel'fand-formula-based technique that allows us to expand to general Banach spaces. We continue in Section 4 by setting the finite-section approximation of the Koopman operator in the ergodic theory context. An explicit relationship of the finite section coefficients to the dual basis is established. Under assumptions on the underlying dynamics, we provide the first result on the convergence rate of the finite-section approximation as the sample size increases, and we analyze the error in the finite section approximation. In Section 5, we study finite section approximations of the Koopman operator based on Krylov sequences of time-delays of observables, and prove that, under certain conditions, the approximation error decreases as the number of samples is increased, without dependence on the dimension of the problem. Namely, the Krylov subspace (Hankel-DMD) methodology has the advantage of convergence in the number of iterates and does not require a basis exponentially large in the number of dimensions. This solves the problem of the choice of observables, since the dynamics selects the basis by itself. In Section 6, we discuss an alternative point of view on DMD approximations, which is based not on finite sections, but on samples of continuous functions on finite subsets of the state space. The concept of weak eigenfunctions is discussed, continuing the analysis in [37]. We conclude in Section 7.
2. Preliminaries
For a Lipschitz-continuous (ensuring global existence and uniqueness of solutions) dynamical system
$$\dot{x} = F(x), \qquad (1)$$
defined on a manifold $M$ (i.e., $x \in M$—where we, by slight abuse of notation, identify a point in the manifold $M$ with its vector representation $x$ in $\mathbb{R}^d$), where $x$ is a vector and $F$ is a possibly nonlinear, vector-valued smooth function of the same dimension as its argument $x$, denote by $S^t(x_0)$ the position at time $t$ of the trajectory of (1) that starts at time 0 at the point $x_0$. We call the family of functions $S^t$ the flow.
Denote by $f: M \to \mathbb{R}^k$ an arbitrary, vector-valued observable. The value $f^*(x_0, t)$ of the observable $f$ that the system trajectory starting from $x_0$ at time 0 sees at time $t$ is
$$f^*(x_0, t) = f(S^t(x_0)).$$
Note that the space of all observables $f$ is a linear vector space. The family of operators $U^t$, acting on the space of observables and parameterized by time $t$, is defined by
$$U^t f(x) = f(S^t(x)).$$
Thus, for a fixed time $\tau$, $U^\tau$ maps the vector-valued observable $f$ to $f \circ S^\tau$. We will call the family of operators $U^t$, indexed by time $t$, the Koopman operator of the continuous-time system (1). This family was defined for the first time in [1], for Hamiltonian systems. In operator theory, such operators, when defined for general dynamical systems, are often called composition operators, since $U^t$ acts on observables by composing them with the mapping $S^t$ [3]. Discretization of the evolution at times $t = n\Delta t$, $n \in \mathbb{Z}$, leads to the $\Delta t$-mapping $T = S^{\Delta t}$, with the discrete dynamics
$$x' = T(x),$$
and the associated Koopman operator $U$ defined by
$$U f(x) = f(T(x)).$$
Let $\mathcal{F}$ be a space of observables and $U$ the Koopman operator associated with a map $T$ (note this means that $Uf \in \mathcal{F}$ if $f \in \mathcal{F}$). Appropriate (dynamics-adapted) spaces are discussed in [14]. A function $\phi \in \mathcal{F}$ is an eigenfunction of $U$ associated with the eigenvalue $\lambda$ provided
$$U\phi = \lambda\phi.$$
Let $\sigma(U)$ be the spectrum of $U$. The operator $U$ is called scalar [42] on $\mathcal{F}$ provided
$$U = \int_{\sigma(U)} \lambda \, dE(\lambda),$$
where $E$ is a family of spectral projections forming a resolution of the identity, and the integral is over $\sigma(U)$. Further, the operator $U$ is called spectral provided
$$U = S + N,$$
where $S$ is scalar and $N$ is quasi-nilpotent. Examples of functional spaces in which Koopman operators are scalar and spectral are given in [14].
Let $\mathbf{f} = (f_1, \ldots, f_K)$ be a vector of observables. For a scalar operator $U$, the Koopman mode $\mathbf{s}_j$ of $\mathbf{f}$ associated with an eigenvalue $\lambda_j$ of algebraic multiplicity 1 is given componentwise by
$$s_{j,i}\,\phi_j = P_{\lambda_j} f_i, \quad i = 1, \ldots, K, \qquad (10)$$
where $\phi_j$ is the unit norm eigenfunction associated with $\lambda_j$, and $P_{\lambda_j} = E(\{\lambda_j\})$ is the associated spectral projection. Note that, denoting by $P_{\lambda_j}$ the projection on the eigenspace associated with the eigenvalue $\lambda_j$, we have
$$U P_{\lambda_j} f_i = \lambda_j P_{\lambda_j} f_i,$$
since $U P_{\lambda_j} = P_{\lambda_j} U$ and $U$ restricted to the range of $P_{\lambda_j}$ acts as multiplication by $\lambda_j$, by one of the key properties of the spectral resolution [42]. Now,
$$P_{\lambda_j} f_i = c\,\phi_j$$
for some constant $c$, proving that $\mathbf{s}_j$ is well-defined once the unit norm eigenfunction $\phi_j$ is fixed.
Remark 1. Note that in the more general case, with algebraic multiplicities of eigenvalues larger than 1, an analogous definition of the Koopman mode can be obtained. For example, if the algebraic and geometric multiplicity are both 2 and there are two linearly independent eigenfunctions $\phi_j^1$ and $\phi_j^2$ associated with the eigenvalue λ of multiplicity 2, and we are computing the Koopman mode, then (10) contains an additional term on the RHS, associated with the second eigenfunction, and similarly for the second projection, forming 2 equations. In the case of spectral operators, one works similarly, but the added complexity is in the use of generalized eigenfunctions [14].

We assume that the dynamical system $T$ has a Milnor attractor $A$, such that for every continuous function $g$, for almost every $x \in M$ with respect to an a priori measure $\mu_0$ on $M$ (without loss of generality, as we can replace $M$ with the basin of attraction of $A$), the limit
$$g^*(x) = \lim_{n\to\infty}\frac{1}{n}\sum_{k=0}^{n-1} g(T^k x)$$
exists. This is the case, e.g., for smooth systems on subsets of $\mathbb{R}^d$ with Sinai–Bowen–Ruelle measures, where $\mu_0$ is the Lebesgue measure [43]. For such systems, Hilbert spaces on which the Koopman operator is spectral have been constructed in [14].
3. Generalized Laplace Analysis
An example of what we call Generalized Laplace Analysis (GLA) is the computation of the eigenspace at 0 (namely, invariants) of dynamical systems using time averages: recall that
$$h^*(x) = \lim_{n\to\infty}\frac{1}{n}\sum_{k=0}^{n-1} h(T^k x)$$
is the time average at initial condition $x$ of the function $h$ under the dynamics of $T$. For fixed point attractors, $h^*$ is constant on the basin of attraction of each fixed point, equal to the value of $h$ at that fixed point. As shown previously, this is valid in a much larger context: limit cycle attractors, toroidal attractors, Milnor attractors, and measure-preserving systems.
We generalize the idea that averages along trajectories produce eigenfunctions by introducing weights:
$$f_g^*(x) = \lim_{n\to\infty}\frac{1}{n}\sum_{k=0}^{n-1} g(k\Delta t)\, f(T^k x), \qquad (20)$$
where $g$ is a function of time—typically a (possibly complex) exponential—and $\Delta t$ is a sampling time interval. If we have a vectorized set of initial conditions $\mathbf{x} = (x_1, \ldots, x_m)$, then we can generate a data matrix
$$H = \begin{pmatrix} f(x_1) & \cdots & f(x_m)\\ f(Tx_1) & \cdots & f(Tx_m)\\ \vdots & & \vdots\\ f(T^{n-1}x_1) & \cdots & f(T^{n-1}x_m) \end{pmatrix}.$$
Vectorizing $g$ as $\mathbf{g} = (g(0), g(\Delta t), \ldots, g((n-1)\Delta t))$, we get
$$f_g^*(\mathbf{x}) = \lim_{n\to\infty}\frac{1}{n}\,\mathbf{g} H;$$
$H$ is the $n \times m$ data matrix. For $g \equiv 1$, we get $f_1^*(\mathbf{x}) = \lim_{n\to\infty}\frac{1}{n}\mathbf{1}H$, where $\mathbf{1}$ is a vector of 1's with $n$ components. To obtain eigenfunctions using Fourier averages, as developed in [12], we set $g(t) = e^{-i2\pi\omega t}$, to obtain
$$f_\omega^*(\mathbf{x}) = \lim_{n\to\infty}\frac{1}{n}\sum_{k=0}^{n-1} e^{-i2\pi\omega k\Delta t}\, f(T^k\mathbf{x}).$$
Both of the above examples were for the weights $g \equiv 1$ and $g(t) = e^{-i2\pi\omega t}$, corresponding to eigenvalues $0$ and $i2\pi\omega$ of the generator, both on the imaginary axis. In the next subsection, we provide a general theorem that deals with eigenvalues distributed arbitrarily in the complex plane.
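As a concrete illustration of the Fourier average above, the following minimal Python sketch (the rotation map, the observable, and all names are our own toy choices, not from the paper) recovers the component of an observable along a known Koopman eigenfunction of a circle rotation:

```python
import numpy as np

# Minimal sketch (toy example): approximate the Fourier average
# f*_omega(x0) = lim (1/n) sum_k exp(-2*pi*1j*omega*k) f(T^k x0)
# for the irrational rotation T(x) = x + alpha (mod 1).
def fourier_average(f, T, x0, omega, n):
    x, total = x0, 0.0 + 0.0j
    for k in range(n):
        total += np.exp(-2j * np.pi * omega * k) * f(x)
        x = T(x)
    return total / n

alpha = np.sqrt(2) - 1
T = lambda x: (x + alpha) % 1.0
# Observable with components along two Koopman eigenfunctions exp(2*pi*1j*k*x):
f = lambda x: 2.0 * np.exp(2j * np.pi * x) + 0.5 * np.exp(4j * np.pi * x)

x0, n = 0.3, 100_000
# Averaging at omega = alpha isolates the component along exp(2*pi*1j*x):
print(fourier_average(f, T, x0, alpha, n))   # ~ 2.0 * exp(2j*pi*x0)
print(2.0 * np.exp(2j * np.pi * x0))
```

For the rotation by α, the average at ω = α isolates the coefficient multiplying the eigenfunction $e^{2\pi i x}$, while averaging at a frequency not in the spectrum tends to 0.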
GLA for Fields of Observables
Many of the problems of interest in applications feature a distributed field of observables. For example, the time evolution of temperature in a linear rod, described by the coordinate $z$ along the rod, is $u(z, t)$, where $u(z, 0) = u_0(z)$ is the initial condition that belongs to the state space of all possible temperature distributions satisfying the boundary conditions, and $t$ is time. We will set our analysis up having this example in mind—namely, we consider a field of observables $f(x, a)$, where $x$ is in the state space, and $a$ is an element of an indexing set $A$—and consider the time evolution of such observables starting from an initial condition $x_0$.
Let $f(x, a)$ be a bounded field of observables, continuous in $x$, where the observables are indexed over elements $a$ of a set $A$, and $M$ is a compact metric space. We will occasionally drop the dependence on the state-space variable and denote $f_a = f(\cdot, a)$, and the iterates of $f$ by $f^n = U^n f$. Let $U$ be the Koopman operator associated with a map $T: M \to M$. We assume that $U$ is bounded and acts in a closed manner on a Banach space of continuous functions $C$ (this does not have to be the space of all continuous functions on $M$; see the remark after the theorem).
Theorem 1 (Generalized Laplace Analysis). Let $\lambda_1, \ldots, \lambda_K$ be simple eigenvalues of $U$ such that $|\lambda_1| \geq |\lambda_2| \geq \cdots \geq |\lambda_K|$, and there are no other points λ in the spectrum of $U$ with $|\lambda| \geq |\lambda_K|$. Let $\phi_k$ be the eigenfunction of $U$ associated with $\lambda_k$. Then, the Koopman mode $s_k$ associated with $\lambda_k$ is obtained by computing
$$\phi_k s_k = \lim_{n\to\infty}\frac{1}{n}\sum_{j=0}^{n-1}\lambda_k^{-j}\left(U^j f - \sum_{i=1}^{k-1}\lambda_i^j\,\phi_i s_i\right),$$
where $k \leq K$, $\phi_i$ is an eigenfunction of $U$ with $\|\phi_i\| = 1$, and $s_i$ is the $i$-th Koopman mode.

Proof. We introduce the operator
$$\tilde{U} = \lambda_1^{-1} U.$$
Write $f = g + c\,\phi_1$, where $g$ belongs to the spectral subspace $C_1$ complementary to the eigenspace of $\lambda_1$ (on $C_1$, the spectral radius of $\tilde{U}$ restricted to the complement of the remaining eigenspaces is strictly less than 1, and the Cesàro averages of the components along any other eigenvalue of modulus $|\lambda_1|$ vanish, since $\lambda_i \neq \lambda_1$). Then, consider
$$\frac{1}{n}\sum_{j=0}^{n-1}\lambda_1^{-j} U^j f = \frac{1}{n}\sum_{j=0}^{n-1}\tilde{U}^j g + c\,\phi_1, \qquad (22)$$
where the last line is obtained by linearity and boundedness of $g$. Due to the boundedness of $U$ and continuity of $g$, we have
$$\lim_{n\to\infty}\frac{1}{n}\sum_{j=0}^{n-1}\tilde{U}^j g = 0. \qquad (23)$$
This is obtained as the consequence of the so-called Gel'fand formula, which states that for a bounded operator $V$ on a Banach space $X$,
$$\rho(V) = \lim_{n\to\infty}\|V^n\|^{1/n},$$
where $\rho(V)$ is the spectral radius of $V$ [44] (note that in our case $\rho(\tilde{U}|_{C_1}) < 1$ away from the unit-modulus eigenspaces). Thus, the first term in (22) vanishes in the limit. Denoting
$$\phi = \lim_{n\to\infty}\frac{1}{n}\sum_{j=0}^{n-1}\lambda_1^{-j} U^j f,$$
where the convergence is again obtained from the Gel'fand formula, utilizing the assumption on convergence of time averages and (23), we obtain
$$U\phi = \lim_{n\to\infty}\frac{1}{n}\sum_{j=0}^{n-1}\lambda_1^{-j} U^{j+1} f = \lambda_1\lim_{n\to\infty}\frac{1}{n}\sum_{j=1}^{n}\lambda_1^{-j} U^{j} f = \lambda_1\phi,$$
and, thus, $\phi$ is an eigenfunction of $U$ at the eigenvalue $\lambda_1$ (note that the limit exists by the fact that the partial sums form a Cauchy sequence). If we have a field of observables $f(x, a)$, parameterized by $a$, we get
$$\lim_{n\to\infty}\frac{1}{n}\sum_{j=0}^{n-1}\lambda_1^{-j} U^j f(x, a) = \phi_1(x)\,s_1(a),$$
since the limit is, for each $a$, an eigenfunction of $U$ at the eigenvalue $\lambda_1$, and so for every $a$, it is just a constant (depending on $a$) multiple of the eigenfunction $\phi_1$ of norm 1. If we denote by $P_1$ the resulting map $f \mapsto \phi_1 s_1$ (note $P_1$ is a bounded projection operator), we can split the space of functions $C$ into the direct sum $P_1 C \oplus (I - P_1)C$.
Now, let $k > 1$. Consider the space of observables $C_k = (I - P_1 - \cdots - P_{k-1})C$, complementary to the subspace $\mathrm{span}\{\phi_1, \ldots, \phi_{k-1}\}$ spanned by the leading eigenfunctions. The operator $U_k$, the restriction of $U$ to $C_k$, has eigenvalues $\lambda_k, \ldots, \lambda_K$. Since
$$\tilde{f} = f - \sum_{i=1}^{k-1}\phi_i s_i$$
does not have a component in $\mathrm{span}\{\phi_1, \ldots, \phi_{k-1}\}$, we can reduce the space of observables to $C_k$, on which $U_k$ satisfies the assumptions of the theorem, and obtain
$$\phi_k s_k = \lim_{n\to\infty}\frac{1}{n}\sum_{j=0}^{n-1}\lambda_k^{-j} U^j\left(f - \sum_{i=1}^{k-1}\phi_i s_i\right).$$
If we have a field of observables $f(x, a)$, then
$$U^j\phi_i s_i = \lambda_i^j\,\phi_i s_i,$$
and, thus,
$$\phi_k(x)\,s_k(a) = \lim_{n\to\infty}\frac{1}{n}\sum_{j=0}^{n-1}\lambda_k^{-j}\left(U^j f(x, a) - \sum_{i=1}^{k-1}\lambda_i^j\,\phi_i(x)\,s_i(a)\right). \qquad \square$$
In other words, $\phi_k s_k$ is the skew-projection of the field of observables $f$ on the eigenspace of the Koopman operator associated with the eigenvalue $\lambda_k$.
Remark 2. The assumptions on eigenvalues in the above theorem would not be satisfied for dynamical systems whose eigenvalues are dense on the unit circle (e.g., a map whose trajectories, as time goes to infinity, approach a unit circle in the complex plane on which the dynamics is rotation by an angle ω, where ω is irrational w.r.t. π). However, in such a case, the space of functions can be restricted to the span of finitely many eigenfunctions, and the requirements of the theorem would be satisfied. This amounts to restricting the observables to a set with finite resolution, which is standard in data analysis.
Remark 3. Function spaces in which Koopman operators are spectral are typically special tensor products of on-attractor Hilbert spaces—for example, $L^2(\mu)$, where μ is the physical invariant measure—and off-attractor spaces of functions that are continuous or possess additional smoothness [14]. Provided we do not restrict the on-attractor part to a finite-dimensional subset like we did in the previous remark, the above theorem would apply to the off-attractor subset (which is an ideal set of functions that vanish a.e. on the attractor). However, the on-attractor Koopman modes can be obtained a.e. using the same procedure as above, and results relying on the Birkhoff Ergodic Theorem, valid in $L^2(\mu)$, as in [4,5,45,46].

In principle, one can find the full spectrum of the Koopman operator by performing Generalized Laplace Analysis, where Theorem 1 is used on some function $f$, starting from the unit circle and successively subtracting parts of the signal corresponding to eigenvalues with decreasing modulus. In practice, such computation can be unstable, since at large $t$, it involves a multiplication of a very large with a very small number. In addition, the eigenvalues are typically not known a priori. A large class of dynamical systems have eigenvalues on and inside the unit circle (or in the left half of the complex plane, inclusive of the imaginary axis, in the continuous time case) [14]. The eigenvalues on the unit circle can be found using the Fast Fourier Transform (FFT). Once the contributions to the dynamics from those eigenvalues are subtracted, the next largest set of eigenvalues has some magnitude $r < 1$. Thus, the power method would enable finding the magnitude $r$ of the resulting eigenvalues. Scaling the operator (restricted to the space of functions not containing components from eigenspaces corresponding to eigenvalues of magnitude 1) by that magnitude, the FFT can be performed again to identify the arguments of the eigenvalues of magnitude $r$. Alternatively, one can use the finite section method, described in the next section, in which the operator is represented in a basis, and a finite-dimensional truncation of the resulting infinite matrix—a finite section—is used to approximate its spectral properties. Under some conditions [37], increasing the dimension of the finite section and the number of sample points, eigenvalues of the operator can be obtained.
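The following Python sketch illustrates the FFT/power-method peeling procedure just described on a synthetic scalar signal (the signal, its eigenvalues, and all parameter choices are our own toy assumptions): the FFT locates the unit-modulus eigenvalue, a Fourier average extracts its coefficient, a power-method ratio estimates the next magnitude, and a rescaled FFT recovers its argument.

```python
import numpy as np

# Toy signal y_k = c1*lam1^k + c2*lam2^k with |lam1| = 1 > |lam2|.
n = 4096
k = np.arange(n)
lam1 = np.exp(2j * np.pi * 0.125)          # eigenvalue on the unit circle
lam2 = 0.95 * np.exp(2j * np.pi * 0.25)    # eigenvalue inside the unit circle
y = 2.0 * lam1**k + 1.5 * lam2**k

# 1) FFT finds the argument of the unit-modulus eigenvalue.
freqs = np.fft.fftfreq(n)
theta1 = freqs[np.argmax(np.abs(np.fft.fft(y)))]

# 2) Fourier average gives its coefficient; subtract that contribution.
c1 = np.mean(y * np.exp(-2j * np.pi * theta1 * k))
r = y - c1 * np.exp(2j * np.pi * theta1 * k)

# 3) Power-method ratio estimates the magnitude of the next eigenvalue.
m = 20
rho = np.abs(r[m + 1]) / np.abs(r[m])

# 4) Rescale by rho and run the FFT again (on an early window, before the
#    error of the imperfect subtraction is amplified) to get its argument.
K = 64
theta2 = np.fft.fftfreq(K)[np.argmax(np.abs(np.fft.fft(r[:K] / rho**k[:K])))]
print(theta1, round(abs(c1), 3), round(rho, 3), theta2)  # 0.125, ~2, ~0.95, 0.25
```

The early-window restriction in the last step reflects the instability noted above: rescaling by $\rho^{-k}$ amplifies the error of the imperfect subtraction at large $k$.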
4. The Finite Section Method
The GLA method for approximating eigenfunctions (and thus modes) of the Koopman operator, analyzed in the previous section, was proposed initially in [4,5,9] in the context of on-attractor (measure-preserving) dynamics, and extended to off-attractor dynamics in [11,12,13,39,47]. It is predicated on the knowledge of (approximate) eigenvalues—since the eigenvalues need to be known a priori to be able to perform the weighted trajectory sums in (20). There is always the eigenvalue 1, which is known, and the trajectory sums in that case lead to invariants of the dynamics [45,46]. Other eigenvalues with modulus 1 can be approximated using signal processing methods (see, e.g., [39]). Importantly, the GLA does not require knowledge of an approximation to the Koopman operator and is in effect a sampling method that avoids the curse of dimensionality. In contrast, DMD-type methods, invented initially in [21] without the Koopman operator background, and connected to the Koopman operator setting in [22], produce a matrix approximation to the Koopman operator. There are many forms of the DMD methodology, but all of them require a choice of a finite set of observables that span a subspace. In this section, we analyze such methods in the context of the finite section of the operator and explore connections to the dual basis.
4.1. Finite Section and the Dual Basis
Consider the Koopman operator acting on an observable space $\mathcal{H}$ of functions on the state space $M$, equipped with the complex inner product $\langle\cdot,\cdot\rangle$ (note that we are using the complex inner product linear in the first argument here; the physics literature typically employs the so-called Dirac notation, where the inner product is linear in its second argument), and let $\{e_j\}_{j\in\mathbb{N}}$ be an orthonormal basis of $\mathcal{H}$, such that, for any function $g \in \mathcal{H}$, we have
$$g = \sum_{j=1}^{\infty}\langle g, e_j\rangle\, e_j.$$
Consider a (not necessarily orthogonal) unconditional basis $\{f_j\}_{j\in\mathbb{N}}$. The action of $U$ on an individual basis function $f_k$ is given by
$$U f_k = \sum_{j=1}^{\infty} U_{jk} f_j,$$
where the $U_{jk}$ are now just the coefficients of $U f_k$ in the basis. We obtain, for $g = \sum_k c_k f_k$,
$$U g = \sum_k c_k\, U f_k = \sum_j\left(\sum_k U_{jk} c_k\right) f_j,$$
and we again have a representation of $U$ by an infinite matrix. As in the previous section, associated with any closed linear subspace $S$ of $\mathcal{H}$ and a complementary subspace $S_c$, there is a projection onto $S$, denoted $P_S$, that we can think of as the projection "along" the space $S_c$, since, for any $g = s + h$, with $s \in S$ and $h \in S_c$, we have
$$P_S\, g = s,$$
and, thus, any element of $S_c$ has projection 0. We denote by $\mathbf{U}$ the infinite-dimensional matrix with elements $U_{jk}$. Thus, the finite-dimensional section of the matrix $\mathbf{U}$ is the so-called compression of $U$ that satisfies
$$\hat{U}_{jk} = (P_n U P_n)_{jk}, \quad 1 \leq j, k \leq n, \qquad (34)$$
where $P_n$ is the projection "along" the closed span of $\{f_j\}_{j > n}$ to the span $F_n$ of the first $n$ basis functions, $f_1, \ldots, f_n$.
The key question now is: how are the eigenvalues of the finite section $\hat{\mathbf{U}}$ related to the spectrum of the infinite-dimensional operator $U$? This was first addressed in [37].
Example 1. Consider the translation $T$ on the circle given by
$$Tx = x + \omega \ (\mathrm{mod}\ 1).$$
Let
$$f_k(x) = e^{i2\pi kx}, \quad k \in \mathbb{Z}.$$
Then,
$$U f_k(x) = f_k(Tx) = e^{i2\pi k\omega} f_k(x).$$
Thus, from (34), $U_{jk} = e^{i2\pi k\omega}\delta_{jk}$, where $\delta_{jk} = 1$ for $j = k$ and zero otherwise (the Kronecker delta), and the finite section $\hat{\mathbf{U}}_n$ is a diagonal matrix. In this case, the finite section method provides us with a subset of the exact eigenvalues of the Koopman operator.
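A quick numerical confirmation of Example 1 in Python (the grid quadrature, truncation, and rotation angle are our own choices): the finite section in the Fourier basis comes out diagonal, with the exact unit-modulus eigenvalues $e^{i2\pi k\omega}$.

```python
import numpy as np

# Finite section of the rotation T(x) = x + omega (mod 1) in the Fourier basis
# f_k(x) = exp(2*pi*1j*k*x), k = -K..K, computed by quadrature on a uniform grid.
omega, K, M = 0.1234, 5, 2000
x = np.arange(M) / M                        # uniform grid on [0, 1)
ks = np.arange(-K, K + 1)
F = np.exp(2j * np.pi * np.outer(ks, x))                   # rows: f_k on grid
UF = np.exp(2j * np.pi * np.outer(ks, (x + omega) % 1.0))  # rows: U f_k = f_k o T
# hatU[j, k] = <U f_k, f_j>, the integral replaced by the grid average (exact
# here, since the integrands are low-order trigonometric polynomials).
hatU = (UF @ F.conj().T).T / M
print(np.round(np.diag(hatU), 4))                    # exp(2j*pi*k*omega)
print(np.max(np.abs(hatU - np.diag(np.diag(hatU)))))  # off-diagonal ~ 0
```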
The following example shows how careful we need to be with the finite-section method when the underlying dynamical system has chaotic behavior:

Example 2. Consider the map $T$ on the circle given by
$$Tx = 2x \ (\mathrm{mod}\ 1).$$
This is a mixing map whose Koopman operator does not have any eigenvalues on $L^2$ except for the (trivial) 1, while its spectrum is the whole unit circle [48]. Let
$$f_k(x) = e^{i2\pi kx}, \quad k \neq 0.$$
Then,
$$U f_k(x) = f_k(2x) = f_{2k}(x).$$
Thus, $\hat{\mathbf{U}}_n$ is given by
$$\hat{U}_{jk} = \delta_{j,2k},$$
provided $|2k| \leq n$, and $\hat{U}_{jk} = 0$ otherwise. In this case, the finite section method fails, as $\hat{\mathbf{U}}_n$ has the eigenvalue 0 of multiplicity $n$. This example illustrates the necessity of the condition in [37] that the weak convergence of a subsequence of eigenfunctions of $\hat{\mathbf{U}}_n$ to a function ϕ be accompanied by the requirement $\phi \neq 0$ in order for the limit of the associated subsequence of eigenvalues to converge to a true eigenvalue of the Koopman operator. In particular, no subsequence of eigenvalues in this case converges to a true eigenvalue of the Koopman operator, since the map is measure preserving, and thus, its eigenvalues are on the unit circle. The example shows the peril of applying the finite section method to find eigenvalues of the Koopman operator when the underlying dynamical system has a continuous part of the spectrum [5] (in this case, Lebesgue spectrum [48]). Continuous spectrum is effectively dealt with in [10,49] using harmonic analysis and periodic approximation methods, respectively.

To apply the finite-section methodology of approximation of the Koopman operator, we need to estimate the coefficients $U_{jk}$ from data. If we have access to measurements of $N$ orthogonal functions $f_1, \ldots, f_N$ on $m$ points in the state space, as indicated in [37], assuming ergodicity, this becomes possible:
Theorem 2. Let $f_1, \ldots, f_N$ be an orthonormal set of functions in $L^2(\mu)$, and let $T$ be ergodic on $M$ with respect to an invariant measure μ. Let $x_i = T^i x_0$, $i = 0, 1, \ldots$, be a trajectory on $M$. Then, for almost any $x_0$,
$$U_{jk} = \langle U f_k, f_j\rangle = \lim_{m\to\infty}\frac{1}{m}\sum_{i=0}^{m-1} f_k(Tx_i)\,\overline{f_j(x_i)}.$$

Proof. This is a simple consequence of the Birkhoff ergodic theorem ([50]). Recall that
$$\langle U f_k, f_j\rangle = \int_M f_k(Tx)\,\overline{f_j(x)}\, d\mu(x),$$
and the last expression is equal to
$$\lim_{m\to\infty}\frac{1}{m}\sum_{i=0}^{m-1} f_k(Tx_i)\,\overline{f_j(x_i)}$$
by the Birkhoff Ergodic Theorem applied to the function $(f_k\circ T)\,\bar{f}_j$. □
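A minimal Python sketch of Theorem 2, under our own choice of test system (an ergodic irrational rotation) and a Fourier dictionary: the entries of the finite section are estimated as Birkhoff averages along a single trajectory.

```python
import numpy as np

# Estimate U_jk = <U f_k, f_j> by a Birkhoff average along one trajectory
# of the ergodic rotation T(x) = x + alpha (mod 1).
alpha, K, m = np.sqrt(2) - 1, 3, 200_000
ks = np.arange(-K, K + 1)
traj = (alpha * np.arange(m + 1)) % 1.0              # x_0, x_1 = T x_0, ...

Fx = np.exp(2j * np.pi * np.outer(ks, traj[:-1]))    # f_j sampled at x_i
FTx = np.exp(2j * np.pi * np.outer(ks, traj[1:]))    # f_k sampled at T x_i
# hatU[j, k] ~ (1/m) * sum_i f_k(T x_i) * conj(f_j(x_i))
hatU = (Fx.conj() @ FTx.T) / m
print(np.round(np.diag(hatU), 3))                    # -> exp(2j*pi*k*alpha)
```

The diagonal entries here converge to $e^{i2\pi k\alpha}$, in agreement with Example 1, while the off-diagonal averages vanish as the trajectory equidistributes.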
In the case of a non-orthonormal Riesz basis, denote by $\{\tilde{f}_j\}$ the dual basis vectors, such that
$$g = \sum_j \langle g, \tilde{f}_j\rangle f_j,$$
where $\langle f_j, \tilde{f}_j\rangle = 1$ for any $j$, and $\langle f_k, \tilde{f}_j\rangle = 0$ if $k \neq j$. For the infinite-dimensional Koopman matrix coefficients, we get
$$U_{jk} = \langle U f_k, \tilde{f}_j\rangle.$$
Let us consider the finite set of independent functions $f_1, \ldots, f_N$ and the associated dual set $\tilde{f}_1^N, \ldots, \tilde{f}_N^N$ in the span $F_N$ of $f_1, \ldots, f_N$, that satisfy
$$\langle f_k, \tilde{f}_j^N\rangle = \delta_{jk}.$$
Note that the functions $\tilde{f}_j^N$ are unique, since each of them is orthogonal to $N-1$ of the vectors $f_k$ in $F_N$ and normalized against the remaining one. Let $P_N$ be the orthogonal projection on $F_N$ (this in effect assumes all the remaining basis functions are orthogonal to $F_N$). Then,
$$\hat{U}_{jk} = \langle P_N U f_k, \tilde{f}_j^N\rangle = \langle U f_k, \tilde{f}_j^N\rangle,$$
since, by self-adjointness of orthogonal projections,
$$\langle P_N U f_k, \tilde{f}_j^N\rangle = \langle U f_k, P_N\tilde{f}_j^N\rangle,$$
and $P_N\tilde{f}_j^N = \tilde{f}_j^N$. Now, we have
$$P_N U f_k = \sum_j \langle U f_k, \tilde{f}_j^N\rangle f_j,$$
and thus, since $P_N U P_N f_k = P_N U f_k$, the coefficients $\hat{U}_{jk}$ are the elements of the finite section of $\mathbf{U}$ in the basis $f_1, \ldots, f_N$. It is again possible to obtain $\hat{U}_{jk}$ from data:

Theorem 3. Let $f_1, \ldots, f_N$ be a non-orthogonal set of independent functions in $L^2(\mu)$, and let $T$ be ergodic on $M$ with respect to an invariant measure μ. Let $x_i = T^i x_0$, $i = 0, 1, \ldots$, be a trajectory on $M$. Then, for almost any $x_0$,
$$\hat{U}_{jk} = \lim_{m\to\infty}\frac{1}{m}\sum_{i=0}^{m-1} f_k(Tx_i)\,\overline{\tilde{f}_j^{N,m}(x_i)},$$
where, for any finite $m$, the samples of the dual functions $\tilde{f}_j^{N,m}$ on the trajectory are obtained as rows of the matrix $(F^*F)^{-1}F^*$, where $F^*$ is the conjugate (Hermitian) transpose of $F$, and the $k$-th column of the data matrix $F$ is the column vector $\mathbf{f}_k = (f_k(x_0), \ldots, f_k(x_{m-1}))^T$.

Proof. The fact that the sampled dual functions are obtained as rows of the matrix $(F^*F)^{-1}F^*$ follows from
$$(F^*F)^{-1}F^*F = I,$$
where $I$ is the $N \times N$ identity matrix. The rest of the proof is analogous to the proof of Theorem 2. □
Remark 4. The key idea in the above results—Theorems 2 and 3—is that we sample the functions and the dual basis on $m$ points in the state space, and then take the limit $m\to\infty$. Thus, besides approximating the action of $U$ using the finite section $\hat{\mathbf{U}}$, we also approximate individual functions by their samples on $m$ points. The corollary of the theorems is that the finite sample approximation $\hat{\mathbf{U}}^m$, obtained by setting the coefficients
$$\hat{U}^m_{jk} = \frac{1}{m}\sum_{i=0}^{m-1} f_k(Tx_i)\,\overline{\tilde{f}_j^{N,m}(x_i)}, \qquad (59)$$
converges to $\hat{\mathbf{U}}$ as $m\to\infty$. This result has been obtained in [51], without the use of the dual basis, relying on the Moore–Penrose pseudoinverse, the connection to which we discuss next.
We call $F$ the $m \times N$ data matrix. Note that the matrix $(F^*F)^{-1}F^*$ is the so-called Moore–Penrose pseudoinverse $F^+$ of $F$. Using matrix notation, from (59), the approximation of the finite section can be written as
$$\hat{\mathbf{U}}^m = (F^*F)^{-1}F^*F' = F^+F', \qquad (62)$$
where $F = (\mathbf{f}_1, \ldots, \mathbf{f}_N)$, $F' = (\mathbf{f}'_1, \ldots, \mathbf{f}'_N)$, and $\mathbf{f}'_k$ is the column vector
$$\mathbf{f}'_k = (f_k(Tx_0), \ldots, f_k(Tx_{m-1}))^T.$$
If we now assume that there is an eigenfunction-eigenvalue pair $(\phi, \lambda)$ of $U$ such that $\phi = \sum_{k=1}^N v_k f_k \in F_N$, then
$$\hat{\mathbf{U}}\mathbf{v} = \lambda\mathbf{v}.$$
Thus, the eigenvalue will be in the spectrum of $\hat{\mathbf{U}}$. More generally, it is known that an operator $U$ and a projection $P_N$ commute if and only if the range $F_N$ of $P_N$ is an invariant subspace of $U$. Thus, the spectrum of the finite-section operator is a subset of the spectrum of $U$ in the case when $F_N$ is an invariant subspace.
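In Python, the pseudoinverse formula above is one line; the sketch below (the test map, dictionary, and sample size are our own assumptions, and index conventions may be transposed relative to the text) builds the finite-sample finite section for an ergodic rotation and recovers unit-modulus eigenvalues.

```python
import numpy as np

# EDMD-style sketch of hatU^m = (F*F)^{-1} F* F'. Dictionary: Fourier modes
# on the circle; dynamics: rotation by an irrational angle.
alpha, K, m = np.sqrt(2) - 1, 3, 50_000
ks = np.arange(-K, K + 1)
xs = (alpha * np.arange(m)) % 1.0

F = np.exp(2j * np.pi * np.outer(xs, ks))                   # F[i, k] = f_k(x_i)
Fp = np.exp(2j * np.pi * np.outer((xs + alpha) % 1.0, ks))  # f_k(T x_i)
hatU = np.linalg.pinv(F) @ Fp                # finite-sample finite section
eig = np.linalg.eigvals(hatU)
print(np.round(np.abs(eig), 3))              # magnitudes ~ 1
print(np.sort(np.angle(eig)) / (2 * np.pi))  # arguments ~ k*alpha (mod 1)
```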
If an eigenfunction $\phi$ of $U$ is in $F_N$, it can be obtained from an eigenvector $\mathbf{v}$ of the finite section $\hat{\mathbf{U}}$ as
$$\phi = \mathbf{v}\cdot\mathbf{f} = \sum_{k=1}^N v_k f_k, \qquad (68)$$
where $\mathbf{v}$ satisfies $\hat{\mathbf{U}}\mathbf{v} = \lambda\mathbf{v}$, since, for such $\mathbf{v}$,
$$U\phi = \sum_k v_k\, U f_k = \sum_j\left(\sum_k \hat{U}_{jk} v_k\right) f_j = \lambda\sum_j v_j f_j = \lambda\phi.$$
We have introduced above the dot notation, which produces a function in $F_N$ from an $N$-vector $\mathbf{v}$ and a set of functions $\mathbf{f} = (f_1, \ldots, f_N)$.
Remark 5. Theorems 2 and 3 are convenient in their use of sampling along a trajectory and an invariant measure, thus enabling construction of finite section representations of the Koopman operator from a single trajectory. However, the associated space of functions is restricted, since the resulting spectrum is on the unit circle. Choosing a more general measure ν that has support in the basin of attraction is possible. Namely, when we construct the finite section, we then use a sequence of points $\{x_i\}$ that weakly converges to the measure ν, and their images under $T$, $\{Tx_i\}$. This is the approach in [51]. The potential issue with this approach is the choice of space—typically, $U$ will have a very large spectrum, for example, filling the entire unit disk of the complex plane [52]. In contrast, Hilbert spaces adapted to the dynamics of dissipative systems can be constructed [14], starting from the ideal set of continuous functions that vanish on the attractor, enabling a natural setting for computation of spectral objects for dissipative systems.

The Koopman mode is the projection of a field of observables on an eigenfunction of $U$. Approximations of Koopman modes can also be obtained using a finite section. Let $\hat{\mathbf{U}}$ be a finite section of $\mathbf{U}$. Let $\mathbf{g} = (g_1, \ldots, g_K)$ be a vector observable (thus, a field of observables indexed over a discrete set). Then, the Koopman mode $s_j(\mathbf{g})$ associated with the eigenvalue $\lambda_j$ of $U$ is obtained as
$$s_j(\mathbf{g}) = \left(\langle g_1, \tilde{\phi}_j\rangle, \ldots, \langle g_K, \tilde{\phi}_j\rangle\right),$$
where $\phi_j, \tilde{\phi}_j$ are the eigenfunction and the dual eigenfunction associated with the eigenvalue $\lambda_j$. Let $\mathbf{v}_j$ be the eigenvectors of $\hat{\mathbf{U}}$; thus, the associated eigenfunctions of the finite section are
$$\hat{\phi}_j = \mathbf{v}_j\cdot\mathbf{f},$$
where $\mathbf{f} = (f_1, \ldots, f_N)$. Then, we get the dual eigenfunctions
$$\tilde{\hat{\phi}}_j = \mathbf{w}_j\cdot\tilde{\mathbf{f}},$$
where $\mathbf{w}_j$ is the complex conjugate of the $j$-th row of $V^{-1}$, $V$ being the matrix whose columns are the eigenvectors $\mathbf{v}_j$, and $\tilde{\mathbf{f}} = (\tilde{f}_1^N, \ldots, \tilde{f}_N^N)$. This is easily checked by expanding:
$$\langle\hat{\phi}_i, \tilde{\hat{\phi}}_j\rangle = \sum_{k,l} V_{ki}\,\overline{w_{j,l}}\,\langle f_k, \tilde{f}_l^N\rangle = \sum_k (V^{-1})_{jk} V_{ki} = \delta_{ij}.$$
Thus, the approximation $\hat{s}_j(\mathbf{g})$ to the Koopman mode associated with the eigenvalue $\lambda_j$ of the finite section reads
$$\hat{s}_j(\mathbf{g}) = \left(\langle g_1, \tilde{\hat{\phi}}_j\rangle, \ldots, \langle g_K, \tilde{\hat{\phi}}_j\rangle\right).$$
Now, assume that $\mathbf{g} = \mathbf{f}$, the vector of basis functions. Then,
$$\langle f_k, \tilde{\hat{\phi}}_j\rangle = \sum_l (V^{-1})_{jl}\,\langle f_k, \tilde{f}_l^N\rangle = (V^{-1})_{jk}.$$
Thus, the Koopman modes associated with the data vector of observables are obtained as the left eigenvectors of the finite section of the Koopman operator $\hat{\mathbf{U}}$ (the rows of $V^{-1}$).
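The left-eigenvector characterization is easy to check numerically. The sketch below is our own toy example: a linear map with state observables, for which the eigenvalues of the system matrix are Koopman eigenvalues; the data are reconstructed from eigenvalues, eigenfunction values at the initial condition, and modes read off as the rows of $V^{-1}$.

```python
import numpy as np

# Linear map x' = A x with state observables: phi_j(x) = (x^T V)_j are Koopman
# eigenfunctions and the rows of inv(V) are the Koopman modes.
A = np.array([[0.9, -0.4], [0.4, 0.9]])
m = 30
X = np.empty((m + 1, 2))
X[0] = [1.0, 0.5]
for i in range(m):
    X[i + 1] = A @ X[i]

hatU = np.linalg.pinv(X[:-1]) @ X[1:]   # finite section on span{x1, x2} (= A^T)
lam, V = np.linalg.eig(hatU)
modes = np.linalg.inv(V)                # rows: Koopman modes (left eigenvectors)
phi0 = X[0] @ V                         # eigenfunctions evaluated at x_0
recon = np.real(sum(lam[j] ** np.arange(m + 1)[:, None] * phi0[j] * modes[j][None, :]
                    for j in range(2)))
print(np.max(np.abs(recon - X)))        # ~ 0: data reconstructed exactly
```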
Assuming that the approximation of the finite section, the $N \times N$ matrix $\hat{\mathbf{U}}^m$, has distinct eigenvalues $\lambda_1, \ldots, \lambda_N$, we write the spectral decomposition
$$\hat{\mathbf{U}}^m = V\Lambda V^{-1},$$
where $\Lambda$ is the diagonal eigenvalue matrix and $V$ is the column eigenvector matrix. From
$$F\hat{\mathbf{U}}^m = FF^+F' = PF',$$
where $P$ is the orthogonal projection onto the span of the columns of $F$, we get that the data can be reconstructed by first observing
$$F' \approx FV\Lambda V^{-1}. \qquad (79)$$
This represents a set of equations for the columns of $F'$. If the rank of $F$ is smaller than $N$, it is an underdetermined set of equations that can have many solutions; then, $F\hat{\mathbf{U}}^m$ is the projection of all these solutions on the subspace spanned by the columns of $F$. If $F$ has full column rank $N$, (79) is overdetermined, and the solution is the closest—in least squares sense—to $F'$ in the span of the columns of $F$.

Note that $V^{-1}$ is the matrix in which rows are the Koopman modes $\hat{s}_j$:
$$V^{-1} = \begin{pmatrix}\hat{s}_1\\ \vdots\\ \hat{s}_N\end{pmatrix}, \qquad (80)$$
and, thus,
$$F' \approx FV\Lambda V^{-1} = \sum_{j=1}^N \lambda_j\,(F\mathbf{v}_j)\,\hat{s}_j.$$
Using (68), we get
$$F\mathbf{v}_j = \left(\hat{\phi}_j(x_0), \ldots, \hat{\phi}_j(x_{m-1})\right)^T,$$
where $\hat{\phi}_j$ is an eigenfunction of the finite section, sampled along the trajectory. Using (80), we get
$$f_k(Tx_i) \approx \sum_{j=1}^N \lambda_j\,\hat{\phi}_j(x_i)\,\hat{s}_{j,k},$$
i.e., the expansion of the data in terms of eigenvalues, sampled eigenfunctions, and Koopman modes.
Remark 6. The novelty in this section is the explicit treatment of the finite section approximation in terms of the dual basis that enables error estimates in the next subsection. The finite section is also known under the name Galerkin projection [36]. The relationship between GLA and finite section methods was studied in [40].

4.2. Convergence of the Finite Sample Approximation to the Finite Section
The time averages in (59) converge due to the Birkhoff Ergodic Theorem [50]. In the case when a dynamical system is globally stable to an attractor with a physical invariant measure, the rate of convergence depends on the type of asymptotic dynamics that the system is exhibiting. Namely, the Koopman operator $U$, when restricted to measure-preserving, on-attractor dynamics, is unitary. Its spectrum can in that case be written as $\sigma(U) = \sigma_p(U)\cup\sigma_c(U)$, where $\sigma_p(U)$ denotes the point spectrum corresponding to eigenvalues of $U$, and $\sigma_c(U)$ the continuous spectrum [53]. The next theorem describes convergence of the finite sample approximation to $\hat{\mathbf{U}}$ when the asymptotic dynamics has only the point spectrum—e.g., when the attractor dynamics is that of a fixed point, a limit cycle, or an ergodic rotation on a higher dimensional torus:
Theorem 4. Let $T$ be a dynamical system with an attractor $A$ and an invariant measure μ supported on the attractor. Let $U$ be the Koopman operator on $L^2(\mu)$, with a pure point spectrum that is either a non-dense set on the unit circle, or generated by a set of eigenvalues whose imaginary parts satisfy Diophantine conditions. Let $f_k$ be $C^\infty$ for all $k$. Note that the coefficients in the finite section matrices depend on the initial condition $x_0$ of the trajectory that was used to generate the finite section, which we indicate with the notation $\hat{\mathbf{U}}^m(x_0)$. Then, for almost all initial conditions $x_0$,
$$\left\|\hat{\mathbf{U}}^m(x_0) - \hat{\mathbf{U}}\right\|_F = O(1/m),$$
where $\|\cdot\|_F$ is the Frobenius norm.
Proof. We suppress the dependence on $x_0$ in the notation. The entries $\hat{U}^m_{jk}$ of $\hat{\mathbf{U}}^m$ (see (62)) converge a.e. w.r.t. μ. Since $T$ is conjugate to a rotation on an Abelian group [54], which is either discrete or the dynamics is uniformly ergodic (in which case, by assumption, the Diophantine condition is satisfied), for sufficiently smooth $T$ and $f_k$ [55,56,57], we have
$$\left|\hat{U}^m_{jk} - \hat{U}_{jk}\right| = O(1/m),$$
and the statement follows by summing the estimates over the finitely many entries. □
Remark 7. The smoothness of the observables and the Diophantine condition are required in order for the solution of the homological equation to exist [55]. Only finite smoothness is required [55], but we have assumed $C^\infty$ for simplicity here.

The above means that $\hat{\mathbf{U}}^m$ converges to $\hat{\mathbf{U}}$ spectrally:
Corollary 1. Let $\hat{\lambda}$ be an eigenvalue of $\hat{\mathbf{U}}$ with multiplicity $h$. Then, for arbitrary $\epsilon > 0$ and sufficiently large $m$, there is a set of eigenvalues λ of $\hat{\mathbf{U}}^m$, whose multiplicities sum to $h$, such that $|\lambda - \hat{\lambda}| < \epsilon$.
Proof. This follows from the continuity of eigenvalues [58] under continuous perturbations (established by Theorem 4). □
Remark 8. If the samples $x_i$ are independent, the convergence estimate above deteriorates to $O(1/\sqrt{m})$. Presence of continuous spectrum without the strong mixing property can lead to convergence estimates with slower rates [56].

Remark 9. Spectral convergence in the infinite-dimensional setting is a more difficult question (see [37], in which only convergence along subsequences was established under certain assumptions). Even if the result could be obtained, the practical question is the convergence in $m$ and $N$. To address it further, we start with the formula for the error in the finite section.

4.3. The Error in the Finite Section
It is of interest to find out how large an error we are making in the finite section approximations discussed above. We have the following result.

Proposition 1. Let $\hat{\phi} = \mathbf{v}\cdot\mathbf{f}$ be an eigenfunction of the finite section $\hat{\mathbf{U}}$ associated with the eigenvalue $\hat{\lambda}$ and eigenvector $\mathbf{v}$. Then,
$$U\hat{\phi} = \hat{\lambda}\hat{\phi} + (I - P_N)U\hat{\phi}. \qquad (87)$$

Proof. The first term on the right side of (87) follows from the definition of $\hat{\mathbf{U}}$. We then need to show
$$P_N U\hat{\phi} = \hat{\lambda}\hat{\phi}.$$
However, the left side is just $P_N U P_N\hat{\phi}$, and since $\hat{\mathbf{U}}$ represents the compression $P_N U P_N$ on $F_N$, we have $P_N U P_N\hat{\phi} = \hat{\lambda}\hat{\phi}$, which proves the claim. □
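Proposition 1 suggests a practical diagnostic: estimate the size of the residual $U\hat{\phi} - \hat{\lambda}\hat{\phi}$ on data for each computed eigenpair. The Python sketch below is our own sampled variant, in the spirit of the residual computations of [16,18]; the maps and dictionary are toy choices. It shows near-zero residuals for a system with pure point spectrum and order-one residuals for the spurious eigenvalues of the doubling map from Example 2.

```python
import numpy as np

# Sampled residual ||U phi - lambda phi|| / ||phi|| for each finite-section
# eigenpair, estimated from snapshot data.
def residuals(F, Fp):
    hatU = np.linalg.pinv(F) @ Fp
    lam, V = np.linalg.eig(hatU)
    res = np.array([np.linalg.norm(Fp @ V[:, i] - lam[i] * (F @ V[:, i])) /
                    np.linalg.norm(F @ V[:, i]) for i in range(len(lam))])
    return lam, res

m, K = 100_000, 3
ks = np.arange(-K, K + 1)
basis = lambda x: np.exp(2j * np.pi * np.outer(x, ks))

# Rotation (pure point spectrum): all residuals ~ 0.
alpha = np.sqrt(2) - 1
x = (alpha * np.arange(m)) % 1.0
print(residuals(basis(x), basis((x + alpha) % 1.0))[1].max())

# Doubling map (Lebesgue spectrum, cf. Example 2): the spurious zero
# eigenvalues carry order-one residuals; only the trivial eigenvalue 1 passes.
x = np.random.rand(m)
lam, res = residuals(basis(x), basis((2 * x) % 1.0))
print(np.round(lam, 2), np.round(res, 2))
```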
6. Weak Eigenfunctions from Data
In the sections above, we presented finite section approximations of the Koopman operator, starting from the idea that bounded infinite-dimensional operators are, given a basis, represented by infinite matrices, and then truncated those. In this section, we will present an alternative point of view that provides additional insights into the relationship between the finite-dimensional approximation and the operator. As a consequence of this approach, we show how the concept of a weak eigenfunction, first discussed in [37], arises.

We start again with a vector of observables, $\mathbf{f} = (f_1, \ldots, f_N)$. Except when we can consider this problem analytically, we know the values of observables only on a finite set of points in the state space, $\{x_1, \ldots, x_m\}$. Assume also that we know the value of $\mathbf{f}$ at $\{Tx_1, \ldots, Tx_m\}$. We can think of the resulting array of values as a sample of the observables on $\{x_1, \ldots, x_m\}$ and its image.
Consider the case $N = 1$, with the sample points lying on a single trajectory, $x_{j+1} = Tx_j$, so that the sample of $f$ is $(f(x_1), \ldots, f(x_m))$ and the sample of $Uf$ is $(f(x_2), \ldots, f(x_m), f(Tx_m))$. There are many $m \times m$ matrices $A$ such that
$$(f(x_2), \ldots, f(x_m), f(Tx_m))^T = A\,(f(x_1), \ldots, f(x_m))^T.$$
One of them is the transpose of the companion matrix (91),
$$A = \begin{pmatrix} 0 & 1 & 0 & \cdots & 0\\ 0 & 0 & 1 & \cdots & 0\\ \vdots & & & \ddots & \vdots\\ 0 & 0 & 0 & \cdots & 1\\ c_1 & c_2 & c_3 & \cdots & c_m \end{pmatrix},$$
but there are many values that the coefficients $c_1, \ldots, c_m$ can assume, since the only requirement on them is
$$f(Tx_m) = \sum_{i=1}^m c_i f(x_i),$$
and there are $m$ unknowns and 1 equation that determines them. However, the $c_i$ need not depend on $j$, since the operator that maps the sample of a function to the sample of its image is not dependent on $j$. Clearly, if there are $m$ independent observables $f_1, \ldots, f_m$, then we get $m$ equations,
$$f_k(Tx_m) = \sum_{i=1}^m c_i f_k(x_i), \quad k = 1, \ldots, m,$$
and, thus, we can determine the coefficients $c_i$ uniquely.
If the number of observables $N$ is larger than $m$, then the samples $f_k(x_j)$ are elements of an $N \times m$ matrix $F$ (note that this data matrix is precisely the transpose of the one we have used before, in (122)), and, thus, there are not enough components in $\mathbf{c}$ to solve
$$\mathbf{y} = F\mathbf{c}, \quad \text{where } \mathbf{y} = (f_1(Tx_m), \ldots, f_N(Tx_m))^T.$$
This system is overdetermined, and so, in general, it does not have a solution. The Dynamic Mode Decomposition method then solves for $\mathbf{c}$ using the following procedure: let $P$ be the orthogonal projection onto the span of the columns of $F$. Then,
$$P\mathbf{y} = F\mathbf{c}$$
has a solution, provided $F$ has rank $m$: $P\mathbf{y}$ is an $N$-dimensional vector in the span of the columns of $F$ and thus can be written as a linear combination of those vectors. In fact, we can write
$$\mathbf{c} = (F^*F)^{-1}F^*\mathbf{y}. \qquad (147)$$
We now discuss the nature of the approximation of the Koopman operator $U$ by the companion matrix (91), with the coefficients $\mathbf{c}$ obtained from Equation (147).
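A compact Python sketch of this companion-matrix construction (all data are our own synthetic example: snapshots generated by three known Koopman eigenvalues): the least-squares coefficients of Equation (147) fill the last row, and the companion spectrum contains the true eigenvalues among its Ritz values.

```python
import numpy as np

# Snapshots generated by Koopman eigenvalues 1, lam, conj(lam).
rng = np.random.default_rng(0)
N, m = 40, 6
lam = 0.9 * np.exp(2j * np.pi * 0.1)
m1 = rng.standard_normal(N)
m2 = rng.standard_normal(N) + 1j * rng.standard_normal(N)
Y = np.array([m1 + 2 * np.real(lam ** k * m2) for k in range(m + 1)])

F = Y[:-1].T                                   # N x m data matrix
c, *_ = np.linalg.lstsq(F, Y[-1], rcond=None)  # y_m ~ sum_j c_j y_j, cf. (147)
C = np.zeros((m, m))                           # transpose-companion matrix:
C[:-1, 1:] = np.eye(m - 1)                     # shift rows, plus last row c
C[-1, :] = c
print(np.round(np.linalg.eigvals(C), 3))  # contains ~1, ~0.9*exp(+/-2j*pi*0.1)
```

With exact linear snapshot data, any valid coefficient vector $\mathbf{c}$ yields a companion polynomial divisible by the minimal polynomial of the dynamics, so the three true eigenvalues appear among the six Ritz values; the remaining three are spurious.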
Let $\{x_1, \ldots, x_m\} \subset M$ be an invariant set for $T$, where $M$ is a measure space with measure μ. Consider the space $C(\{x_1, \ldots, x_m\})$ of continuous functions on $M$ restricted to $\{x_1, \ldots, x_m\}$. This is an $m$-dimensional vector space. The restriction of the Koopman operator to $C(\{x_1, \ldots, x_m\})$ is then a finite-dimensional linear operator that can be represented in a basis by an $m \times m$ matrix. An explicit example is given when $x_1, \ldots, x_m$ represent successive points on a periodic trajectory; the resulting matrix representation in the standard basis is the $m \times m$ cyclic permutation matrix
$$\begin{pmatrix} 0 & 1 & 0 & \cdots & 0\\ 0 & 0 & 1 & \cdots & 0\\ \vdots & & & \ddots & \vdots\\ 0 & 0 & 0 & \cdots & 1\\ 1 & 0 & 0 & \cdots & 0 \end{pmatrix}.$$
If $\{x_1, \ldots, x_m\}$ is not an invariant set, an approximation of the reduced Koopman operator can still be provided. Namely, if we know the restrictions of $m$ independent functions $f_1, \ldots, f_m$ to $\{x_1, \ldots, x_m\}$, and we also know $f_k(Tx_j)$, $j, k = 1, \ldots, m$, we can provide a matrix representation of the reduced operator. However, while in the case where $\{x_1, \ldots, x_m\}$ is an invariant set, the iterate of any function in $C(\{x_1, \ldots, x_m\})$ can be obtained in terms of the iterates of $m$ independent functions, for the case when $\{x_1, \ldots, x_m\}$ is not invariant, this is not necessarily so. Namely, the fact that $\{x_1, \ldots, x_m\}$ is not invariant means that functions in $C(\{x_1, \ldots, x_m\})$ do not necessarily experience linear dynamics under $T$. However, one can take $N$ observables $f_k$, $k = 1, \ldots, N$, where $N > m$, and approximate the nonlinear dynamics using linear regression on
$$f_k(Tx_j) \approx \sum_{i=1}^m C_{ij} f_k(x_i), \quad j = 1, \ldots, m,$$
where $F_{kj} = f_k(x_j)$ and $F'_{kj} = f_k(Tx_j)$—i.e., by finding an $m \times m$ matrix $C$ that gives the best approximation of the data in the Frobenius norm,
$$\min_C \left\|F' - FC\right\|_F,$$
as sketched in the code example below.
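A minimal Python sketch of this regression (the conventions, map, and dictionary are our own choices): for an invariant sample set (a periodic orbit), the fit is exact, while for a non-invariant sample set, a genuine least-squares residual remains.

```python
import numpy as np

# N = 2m + 1 > m Fourier observables restricted to m sample points; C is the
# m x m matrix minimizing ||F' - F C||_F, computed column-wise by least squares.
m = 16
ks = np.arange(-m, m + 1)
sample = lambda x: np.exp(2j * np.pi * np.outer(ks, x))   # F[k, j] = f_k(x_j)

def fit_residual(x, T):
    F, Fp = sample(x), sample(T(x))
    C, *_ = np.linalg.lstsq(F, Fp, rcond=None)    # argmin_C ||F C - Fp||_F
    return np.linalg.norm(F @ C - Fp) / np.linalg.norm(Fp)

T = lambda x: (x + 1.0 / m) % 1.0
# Invariant sample set (a period-m orbit): the representation is exact.
print(fit_residual(np.arange(m) / m, T))                    # ~ 0
# Non-invariant sample set (random points): a genuine residual remains.
print(fit_residual(np.random.default_rng(2).random(m), T))  # order one
```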
We have the following:
Theorem 7. Let $T$ be a measure μ-preserving transformation on a metric space $M$, and let $x_{j+1} = Tx_j$, $j = 1, 2, \ldots$, be a trajectory that, as $m \to \infty$, becomes dense in a compact invariant set $A_c$. Then, for any $N$-vector of continuous observables $\mathbf{f} = (f_1, \ldots, f_N)$, we have
$$\lim_{m\to\infty}\min_C\left\|F' - FC\right\|_F = 0.$$

Proof. By density of the trajectory in $A_c$, for sufficiently large $m$, there is a $j \leq m$ such that $d(Tx_m, x_j) < \delta$ for small $\delta$. By continuity of the observables,
$$|f_k(Tx_m) - f_k(x_j)| < D(\delta)$$
for some function $D(\delta) \to 0$ as $\delta \to 0$, uniformly on the compact set $A_c$. Since all the other columns of $F'$ are columns of $F$, the choice of $C$ that shifts the columns and replaces the last one by the $j$-th yields $\|F' - FC\|_F \leq \sqrt{N}\,D(\delta)$. Taking $m$ sufficiently large makes this arbitrarily small. □
Consider an $m$-dimensional eigenvector $\mathbf{v}$ of the companion matrix approximation $A$, associated with the eigenvalue λ. Since the eigenvector satisfies
$$A\mathbf{v} = \lambda\mathbf{v},$$
we have, from the shift structure of $A$,
$$v_{j+1} = \lambda v_j, \quad j = 1, \ldots, m - 1, \quad \text{i.e.,} \quad v_j = \lambda^{j-1} v_1.$$
Thus, $\mathbf{v}$ can be considered as an eigenfunction on the finite set $\{x_1, \ldots, x_m\}$. On the last point of the sample, $x_m$, we have
$$\lambda v_m = \sum_{i=1}^m c_i v_i,$$
which is an exact eigenvector relation, but only an approximate eigenfunction relation, since $Tx_m$ is generally not a point of the sample.
Let us now consider the concept of the weak eigenfunction, or eigendistribution. Let $\mu_0$ be some prior measure of interest on $M$. Let $\phi$ be a bounded function that is integrable with respect to $\mu_0$. We construct the functional $L$ on the space of continuous functions by defining
$$L(h) = \int_M h\,\phi\, d\mu_0.$$
Setting $UL(h) = L(Uh)$, we get the eigenvalue equation
$$UL = \lambda L. \qquad (157)$$
Clearly, this is satisfied if $\phi$ is induced by a continuous eigenfunction of $U$ at the eigenvalue λ. However, Equation (157) is applicable in cases with much less regularity. Namely, if μ is a measure and $L$ the associated linear functional, then we can define the action of $U$ on $L$ by
$$UL(h) = L(Uh).$$
Consider, for example, a set of points $x_k$, $k = 0, 1, \ldots$, and assume that for every continuous $h$ there exists the limit
$$L(h) = \lim_{m\to\infty}\frac{1}{m}\sum_{k=0}^{m-1}\lambda^{-k} h(x_k).$$
Then, by the Riesz representation theorem, there is a measure μ such that
$$L(h) = \int_M h\, d\mu.$$
Definition 1. Let a measure μ be such that the associated linear functional $L$ satisfies $UL = \lambda L$ for some $\lambda \in \mathbb{C}$. Then, μ is called a weak eigenfunction of $U$.
Now, we have
$$L(Uh) = \lim_{m\to\infty}\frac{1}{m}\sum_{k=0}^{m-1}\lambda^{-k} h(Tx_k) = \lambda\lim_{m\to\infty}\frac{1}{m}\sum_{k=0}^{m-1}\lambda^{-(k+1)} h(x_{k+1}) = \lambda L(h),$$
proving the following theorem:

Theorem 8. Consider a set of points $x_k = T^k x_0$, $k = 0, 1, \ldots$, on a trajectory of $T$, and assume that for every continuous $h$, there exists the limit
$$L(h) = \lim_{m\to\infty}\frac{1}{m}\sum_{k=0}^{m-1}\lambda^{-k} h(x_k).$$
Then, the μ associated with $L$ by $L(h) = \int_M h\, d\mu$ is a weak eigenfunction of $U$ associated with the eigenvalue λ.
From the above, it follows that the left eigenvectors of the companion matrix approximation $A$ are approximations of the associated (possibly weak) Koopman modes: assume that $\boldsymbol{\ell}$ is such an eigenvector,
$$\boldsymbol{\ell}^T A = \lambda\,\boldsymbol{\ell}^T.$$
Then, $\boldsymbol{\ell}^T$ applied to the data gives the projection of the observables on the eigenspace spanned by the corresponding right eigenvector $\mathbf{v}$. Moreover, since the left eigenvector components play the role of the weights $\lambda^{-k}$ in the trajectory averages above, the statement can be obtained in the limit $m \to \infty$ by the so-called Generalized Laplace Analysis (GLA) that we described in Section 3.
Remark 12. The standard interpretation of the Dynamic Mode Decomposition (e.g., on Wikipedia) was in some way a transpose of the one presented here: the observables (interpreted as column vectors) were assumed to be related by a matrix acting on snapshots. Instead, in the nonlinear, Koopman operator interpretation, each row of the data matrix is mapped into its image, and this allows an interpretation on the space of observables. This is particularly important in the context of evolution equations, for example, fluid flows, where the evolution of the observables' field—the field of velocity vectors at different spatial points—is not linear.