Reliable Approximation of Long Relaxation Timescales in Molecular Dynamics

Zhang, Wei; Schütte, Christof

doi:10.3390/e19070367

Open AccessFeature PaperArticle

Reliable Approximation of Long Relaxation Timescales in Molecular Dynamics

by

Wei Zhang

¹ and

Christof Schütte

^1,2,*

¹

Institute of Mathematics, Freie Universität Berlin, Arnimallee 6, 14195 Berlin, Germany

²

Zuse Institute Berlin, Takustrasse 7, 14195 Berlin, Germany

^*

Author to whom correspondence should be addressed.

Entropy 2017, 19(7), 367; https://doi.org/10.3390/e19070367

Submission received: 25 April 2017 / Revised: 13 July 2017 / Accepted: 13 July 2017 / Published: 18 July 2017

(This article belongs to the Special Issue Understanding Molecular Dynamics via Stochastic Processes)

Download

Browse Figures

Versions Notes

Abstract

:

Many interesting rare events in molecular systems, like ligand association, protein folding or conformational changes, occur on timescales that often are not accessible by direct numerical simulation. Therefore, rare event approximation approaches like interface sampling, Markov state model building, or advanced reaction coordinate-based free energy estimation have attracted huge attention recently. In this article we analyze the reliability of such approaches. How precise is an estimate of long relaxation timescales of molecular systems resulting from various forms of rare event approximation methods? Our results give a theoretical answer to this question by relating it with the transfer operator approach to molecular dynamics. By doing so we also allow for understanding deep connections between the different approaches.

Keywords:

molecular dynamics; eigenproblem; effective dynamics; Galerkin method; variational approach; Markov state model; reaction coordinate

1. Introduction

The problem of accurate estimation of long relaxation timescales associated with rare events in molecular dynamics like ligand association, protein folding, or conformational changes has attracted a lot of attention recently. Often, these timescales are not accessible by direct numerical simulation. Therefore, different discrete coarse graining approaches for their approximation, like Markov state model (MSM) building [1,2] or time-lagged independent component analysis (TiCA) [3,4] have been introduced and successfully applied to various molecular systems [5,6]. These approaches are based on finite-dimensional Galerkin discretization [1] or variational approximation [7,8] of the transfer operator of the molecular dynamics process [9]. In several theoretical studies the approximation error of these numerical techniques regarding the longest relaxation timescales has been analyzed resulting in error estimates in terms of the dominant eigenvalues of the transfer operator [3,9]. In this article we first show how to obtain similar error estimates when replacing the transfer operator by the infinitesimal generator [10] associated with it. Furthermore, the analysis exhibits that the different approaches are deeply connected, that is, in the end they lead to an identical numerical problem. In addition to the different discrete coarse graining approaches, the literature contains various alternative reaction coordinate sampling approaches aiming at approximation of very long relaxation processes. In these sampling approaches, one assumes that the effective dynamical behavior of the systems on long timescales can be described by a relatively low dimensional object given by some reaction coordinates. Various advanced methods such as umbrella sampling [11,12], metadynamics [13,14], blue moon sampling [15], the adaptive biasing force method [16], or temperature-accelerated molecular dynamics (TAMD) [17], as well as trajectory-based techniques like milestoning [18], transition interface sampling [19], or forward flux sampling [20] may serve as some examples. These methods result in free energy barriers, transition rates, or first mean passage times for the rare events of interest; they are complemented by several approaches to the effective dynamics of the reaction coordinate space [21,22,23] that allow for significantly faster simulation of these rare events [24,25,26] including details of the underlying molecular mechanisms. Surprisingly, our analytic tools, originally developed for discrete coarse graining approaches, can also be utilized for evaluating the approximation quality of reaction coordinate sampling approaches to the effective dynamics. We derive an explicit error estimate for the longest timescale resulting from the choice of specific reaction coordinates.

However, estimating the approximation quality is not the only way of utilizing the analytical insights presented in this article. We also demonstrate how the new techniques for simulation of the effective dynamics can be used for efficient MSM building or TiCA applications.

Mathematically, the article is based on the analysis of the dominant timescales of reversible and ergodic diffusion processes in energy landscapes. The leading eigenvalues of the transfer operator (or, equivalently, the infinitesimal generator) and the corresponding eigenfunctions characterize the dynamical behavior of the process on long timescales [9,27]. Firstly, in several articles the approximation error with respect to these leading eigenvalues under discretization of the transfer operator has been discussed, cf. [3,7,8,28,29,30]. Following this work, we characterize the approximation quality for the (low-lying) eigenvalues of the infinitesimal generator. This permits us to study the connection between the effective dynamics considered in [23] and Galerkin discretization schemes for the transfer operator. Secondly, following the work [7,8], we study the variational approach for the infinitesimal generator. In fact, we will see that this approach leads to the same generalized matrix eigenproblem as the one resulting from Galerkin discretization. Thirdly, numerical issues related to the estimation of the coefficient matrices by means of the effective dynamics are discussed.

The paper is organized as follows. In Section 2, we introduce the various operators associated to the reversible diffusion processes and discuss the relation between eigenvalues and relaxation timescales. Next, in Section 3, we study the Galerkin discretization of generators/transfer operators for solving the eigenproblem and show that previous results can be extended to reaction coordinate subspaces. In Section 4, the variational approach to the approximation of the eigenproblem is considered and its relations to the Galerkin approach are worked out in detail. Then, in Section 5, we discuss numerical issues related to estimating the discretization matrices by means of simulating the effective dynamics for given reaction coordinates; the performance of this approach is studied numerically in Section 6. Finally, conclusions and some further remarks are given in Section 7. After being familiar with the facts in Section 2, readers who are more interested in numerical algorithmic aspects rather than detailed mathematical analysis can skip Section 3 and Section 4 and refer to Section 5 and Section 6 on first reading.

2. Diffusion Process and the Associated Operators

We consider a diffusion process given by the stochastic differential equation (SDE)

\begin{matrix} d x_{s} & = - \nabla V (x_{s}) d s + \sqrt{2 β^{- 1}} d w_{s}, s \geq 0, \\ x_{0} & = x, \end{matrix}

(1)

where

x_{s} \in R^{n}

, parameter

β > 0

is related to the inverse of system’s temperature, and

w_{s}

is an n-dimensional Brownian motion.

V : R^{n} \to R

is a potential function which is assumed to be smooth and bounded from below. The results presented subsequently can be extended to more general reversible diffusion processes with a state-dependent noise intensity matrix, cf. [23]. However, for the sake of simplicity of presentation we restrict our considerations to the specific case (1) typically studied in molecular dynamics.

The infinitesimal generator of the dynamics (1) is given by,

\begin{matrix} L = - \nabla V \cdot \nabla + \frac{1}{β} Δ . \end{matrix}

(2)

It is known that, under mild conditions on V, the solution process

{(x_{s})}_{s \geq 0}

of (1) is ergodic [31], and its unique invariant measure

π

is given by

π (d x) = ρ (x) d x

where,

\begin{matrix} ρ (x) = \frac{1}{Z} e^{- β V (x)}, with Z = \int_{R^{n}} e^{- β V (x)} d x . \end{matrix}

(3)

We introduce the Hilbert space

H = L^{2} (R^{n}, π)

, which is endowed with the inner product,

\begin{matrix} {〈 f, g 〉}_{π} = \int_{R^{n}} f (x) g (x) ρ (x) d x, \forall f, g \in H, \end{matrix}

(4)

and the norm

{| f |}_{π} = \sqrt{{〈 f, f 〉}_{π}}

,

\forall f \in H

. The domain of the operator

L

will be denoted as

D (L) \subset H

.

It is also known that the process

{(x_{s})}_{s \geq 0}

is a reversible process and that

L

is a self-adjoint operator with respect to the inner product (4). Whenever the potential V grows to infinity fast enough at infinity, its spectrum is discrete [9]. Let

λ_{i} \in C

and

φ_{i} \in D (L)

be the eigenvalues and the corresponding (normalized) eigenfunctions of

- L

, that is, the solutions of the eigenproblem,

\begin{matrix} - L f = λ f \end{matrix}

(5)

in H, or in weak form,

\begin{matrix} - {〈 L f, g 〉}_{π} = λ {〈 f, g 〉}_{π}, \forall g \in H . \end{matrix}

(6)

Due to the self-adjointness of

L

and the fact that,

\begin{matrix} {〈 - L f, f 〉}_{π} = \frac{1}{β} \int_{R^{n}} {| \nabla f (x) |}^{2} ρ (x) d x \geq 0, \forall f \in H, \end{matrix}

(7)

we can assume that

λ_{i} \in R

with,

\begin{matrix} 0 = λ_{0} < λ_{1} \leq \dots \leq λ_{k} \leq \dots, \end{matrix}

(8)

with

φ_{0} \equiv 1

.

Given

s \geq 0

, we define the operator

T_{s} : H \to H

by,

\begin{matrix} (T_{s} f) (x) = E (f (x_{s}) | x_{0} = x), f \in H, \end{matrix}

(9)

where

E

denotes the expectation taken with respect to the paths of (1) under the initial condition that

x_{0} = x

. It is well-known that

u (s, x) = T_{s} f (x)

is the solution of the Kolmogorov backward equation

\frac{d}{d s} u (s, \cdot) = L u (s, \cdot), u (0, \cdot) = f,

(10)

that is, the operators

T_{s}

,

s \geq 0

form a one-parameter semigroup whose infinitesimal generator is

L

, and therefore they are self-adjoint in H as well. Because of Equation (10), the formal expression

T_{s} = e^{s L}

is often used in the literature. Similarly to (8), we also know that the eigenvalues of

T_{s}

are given by,

\begin{matrix} 1 = e^{- λ_{0} s} > e^{- λ_{1} s} \geq \dots > 0, \end{matrix}

(11)

with the same eigenfunctions

φ_{i}

,

i = 0, 1, \dots

.

In the following we introduce another operator called the transfer operator, which has been extensively considered in the literature, to investigate the metastability of molecular systems and to build Markov state models (MSM) [1,6,9]. A lag time

τ > 0

is fixed, with

p (x, \cdot; τ)

being the transition density function of the process (1) starting from

x \in R^{n}

, i.e.,

p (x, y; τ)

describes the probability density of starting from state x at time

s = 0

and arriving at

y \in R^{n}

after time

τ

. For a bounded and continuous function

u \in H

, the transfer operator

T_{τ} : H \to H

is defined by [1,27,32],

\begin{matrix} (T_{τ} u) (y) = \frac{1}{ρ (y)} \int_{R^{n}} p (x, y; τ) u (x) ρ (x) d x, y \in R^{n} . \end{matrix}

(12)

From (12), it follows immediately that,

\begin{matrix} {〈 T_{τ} u, f 〉}_{π} = & \int_{R^{n}} \int_{R^{n}} p (x, y; τ) f (y) d y u (x) ρ (x) d x \\ = & \int_{R^{n}} [E (f (x_{τ}) | x_{0} = x)] u (x) ρ (x) d x \\ = & {〈 u, T_{τ} f 〉}_{π} = {〈 T_{τ} u, f 〉}_{π}, \forall f \in H, \end{matrix}

which then implies

T_{τ} = T_{τ}

, i.e., the transfer operator

T_{τ}

coincides with the operator

T_{τ}

, a member within the semigroup

{(T_{s})}_{s \geq 0}

. Denote the eigenvalues of

T_{τ}

as

μ_{i}

,

i \geq 0

, such that,

\begin{matrix} 1 = μ_{0} > μ_{1} \geq \dots > 0 . \end{matrix}

(13)

Then from the discussions above and the eigenvalues of

T_{s}

in (11), we can conclude that

μ_{i} = e^{- λ_{i} τ}

and the corresponding eigenfunctions are the same as the eigenfunctions

φ_{i}

of the infinitesimal generator

L

. These eigenvalues and eigenfunctions encode crucial timescale information of the dynamical system. Specifically, the relaxation timescales

t_{i}

of the dynamics (1) are given by [10],

t_{i} = λ_{i}^{- 1}, i = 1, 2, \dots .

This means that the dominant relaxation timescales of the dynamics (1) can be obtained by computing the dominant eigenvalues of

T_{τ}

(or, equivalently,

T_{τ}

,

L

), cf. [10,27].

3. Galerkin Approximation of the Eigenvalues of the Generator

In this section, we study the Galerkin method for computing the eigenvalues of the infinitesimal generator

L

. While Galerkin discretization of the transfer operator has been studied to some extent [9], results on the associated infinitesimal generator are rather sparse.

3.1. Some General Results

To introduce the Galerkin method, let

H_{0}

be a Hilbert subspace of H containing the constant function, and let

P

denote the orthogonal projection operator from H to

H_{0}

, which satisfies

P^{2} = P

and,

\begin{matrix} {〈 P f, g 〉}_{π} = {〈 f, g 〉}_{π}, \forall f \in H, g \in H_{0} . \end{matrix}

(14)

The Galerkin method aims at approximating the solution of (6) in the subspace

H_{0}

. Specifically, we want to find

f \in H_{0}

, such that,

\begin{matrix} - {〈 L f, g 〉}_{π} = κ {〈 f, g 〉}_{π}, \forall g \in H_{0}, \end{matrix}

(15)

for some constant

κ \geq 0

. Using the property (14), we know that problem (15) is equivalent to the eigenproblem for the operator

- P L

on the subspace

H_{0}

, i.e.,

\begin{matrix} - P L f = κ f . \end{matrix}

(16)

It is straightforward to verify that

- P L

is a self-adjoint operator on

H_{0}

. Similarly to (8), let

ζ_{i} \in H_{0}

be the orthonormal eigenfunctions of the operator

- P L

corresponding to eigenvalues

κ_{i}

, where,

\begin{matrix} 0 = κ_{0} < κ_{1} \leq κ_{2} \leq \dots, \end{matrix}

(17)

and

ζ_{0} \equiv 1

. When

H_{0}

is an infinite dimensional subspace, we assume

κ_{i} \to + \infty

as

i \to + \infty

.

In the following, we want to study the condition under which the eigenvalues of the projected generator

P L

are reliable approximations of the eigenvalues of the full generator

L

. The following approximation result was obtained in [23] and we include its proof for completeness:

Theorem 1.

For

i \geq 0

, let

φ_{i}

and

ζ_{i}

be the orthonormal eigenfunctions of the operators

- L

and

- P L

corresponding to the eigenvalues

λ_{i}

and

κ_{i}

, respectively. We have,

\begin{matrix} λ_{i} \leq κ_{i} \leq λ_{i} + \frac{1}{β} \int_{R^{n}} | \nabla (φ_{i} - ζ_{i}) (x) |^{2} ρ (x) d x . \end{matrix}

(18)

Proof.

From (15), we have

κ_{i} = - {〈 L ζ_{i}, ζ_{i} 〉}_{π}

. Define the subspace

E_{i + 1} = s p a n {ζ_{0}, \dots, ζ_{i}}

for

i \geq 0

. It follows from the orthogonality of the functions

ζ_{i}

that

E_{i + 1}

is an

(i + 1)

-dimensional subspace of H. Using (17) it is direct to verify that,

\begin{matrix} κ_{i} = \max_{f \in E_{i + 1}, {| f |}_{π} = 1} {〈 - L f, f 〉}_{π} . \end{matrix}

(19)

Applying the min–max theorem to the eigenvalues of the operator

- L

, we conclude,

\begin{matrix} κ_{i} = \max_{f \in E_{i + 1}, {| f |}_{π} = 1} {〈 - L f, f 〉}_{π} \geq \min_{E_{i + 1}^{'}} \max_{f \in E_{i + 1}^{'}, {| f |}_{π} = 1} {〈 - L f, f 〉}_{π} = λ_{i}, \end{matrix}

(20)

where

E_{i + 1}^{'}

goes over all

(i + 1)

-dimensional subspaces of H. For the upper bound, we can compute that,

\begin{matrix} {〈 - L (φ_{i} - ζ_{i}), (φ_{i} - ζ_{i}) 〉}_{π} \\ = {〈 - L φ_{i}, φ_{i} 〉}_{π} + 2 {〈 L φ_{i}, ζ_{i} 〉}_{π} + {〈 - L ζ_{i}, ζ_{i} 〉}_{π} \\ = λ_{i} - 2 λ_{i} {〈 φ_{i}, ζ_{i} 〉}_{π} + κ_{i} \\ = κ_{i} - λ_{i} + 2 λ_{i} (1 - {〈 φ_{i}, ζ_{i} 〉}_{π}) \geq κ_{i} - λ_{i}, \end{matrix}

where we have used the fact that

{〈 φ_{i}, ζ_{i} 〉}_{π} \leq | φ_{i} |_{π} {| ζ_{i} |}_{π} = 1

. The conclusion follows from (7). ☐

Previous studies on the Galerkin approximation of the dominant eigenvalues of the transfer operator have shown that the approximation error of eigenvalues can be reliably bounded by means of the projection errors of the corresponding eigenfunctions [28,29,30]. Next we will derive a similar result for the generator

L

. To this end, we introduce the orthogonal projection

P^{⊥}

from H to the complement subspace

H_{0}^{⊥}

of

H_{0}

, that is,

P^{⊥} = I - P

. We have

Theorem 2.

Let φ be a normalized eigenfunction of the operator

- L

corresponding to the eigenvalue λ. Define constants,

\begin{matrix} δ_{1} = | L P^{⊥} {φ |}_{π}, δ_{2} = {| P^{⊥} φ |}_{π} \leq 1, \end{matrix}

(21)

and suppose that

0 < δ_{2} < 1

. Then there is an eigenvalue

κ_{i}

of the operator

- P L

, such that,

\begin{matrix} | κ_{i} - λ | \leq \frac{δ_{1}}{{(1 - δ_{2}^{2})}^{\frac{1}{2}}} . \end{matrix}

(22)

Proof.

Since

δ_{2} = {| P^{⊥} φ |}_{π} = {(1 - {| P φ |}_{π}^{2})}^{\frac{1}{2}} < 1

, we have

| P φ | > 0

. Let

P φ = \sum_{i = 0}^{+ \infty} ω_{i} ζ_{i}

, where

ω_{i} = {〈 φ, ζ_{i} 〉}_{π}

, and the summation consists of finite terms when

H_{0}

is a finite dimensional subspace. For all

g \in H_{0}

, we can compute,

\begin{matrix} {〈 P L P^{⊥} φ, g 〉}_{π} = {〈 P L (φ - P φ), g 〉}_{π} \\ = {〈 P L φ, g 〉}_{π} - 〈 P L (\sum_{i = 0}^{+ \infty} ω_{i} ζ_{i}), g 〉_{π} \\ = - {〈 λ P φ, g 〉}_{π} + {〈 \sum_{i = 0}^{+ \infty} ω_{i} κ_{i} ζ_{i}, g 〉}_{π} = 〈 \sum_{i = 0}^{+ \infty} ω_{i} (κ_{i} - λ) ζ_{i}, g 〉_{π}, \end{matrix}

which implies

P L P^{⊥} φ = \sum_{i = 0}^{+ \infty} ω_{i} (κ_{i} - λ) ζ_{i}

, and,

\begin{matrix} | P L P^{⊥} {φ |}_{π}^{2} = \sum_{i = 0}^{+ \infty} ω_{i}^{2} | κ_{i} {- λ |}^{2} \geq (\min_{i} {| κ_{i} - λ |}^{2}) \sum_{i = 0}^{+ \infty} ω_{i}^{2} = \min_{i} | κ_{i} {- λ |}^{2} {| P φ |}_{π}^{2} . \end{matrix}

Therefore we have,

\begin{matrix} \min_{i} | κ_{i} - λ | \leq \frac{| P L P^{⊥} {φ |}_{π}}{{| P φ |}_{π}} \leq \frac{| L P^{⊥} {φ |}_{π}}{{(1 - | P^{⊥} {φ |}_{π}^{2})}^{\frac{1}{2}}} = \frac{δ_{1}}{{(1 - δ_{2}^{2})}^{\frac{1}{2}}} . \end{matrix}

(23)

☐

Remark 1.

Notice that our error bound above relies on both constants

δ_{1}

and

δ_{2}

, while the error bound in [30] for the transfer operator only depends on one constant, the projection error

δ_{2}

. This difference is due to the fact that the generator

L

is an unbounded operator while the transfer operator is bounded.

3.2. Finite Dimensional Subspaces

In applications, it is often assumed that

H_{0}

is spanned by finitely many basis functions. In particular, this is the situation when constructing MSMs based on indicator functions of partition sets [30] or based on core sets [10].

Let

H_{0}

be the finite dimensional space

H_{0} = s p a n {ψ_{1}, ψ_{2}, \dots, ψ_{N}}

, where

ψ_{i} \in H

are the basis functions, and consider the eigenproblem (15). As a direct application of Theorem 1 and Theorem 2, we have,

Corollary 1.

For Galerkin approximation of the eigenproblem (15) using the finite-dimensional ansatz space

H_{0}

, the following three statements are valid:

1.: Write $f = \sum_{i = 1}^{N} ω_{i} ψ_{i} \in H_{0}$ and let $X = {(ω_{1}, ω_{2}, \dots, ω_{N})}^{T} \in R^{N}$ . Then problem (15) is equivalent to the generalized matrix eigenproblem,

$\begin{matrix} C X = λ S X, \end{matrix}$

(24)

where $C, S$ are $N \times N$ matrices whose entries are given by,

$\begin{matrix} C_{l l^{'}} = {〈 - L ψ_{l}, ψ_{l^{'}} 〉}_{π}, S_{l l^{'}} = {〈 ψ_{l}, ψ_{l^{'}} 〉}_{π}, 1 \leq l, l^{'} \leq N, \end{matrix}$

(25)
2.: Let $0 = κ_{0} \leq κ_{1} \leq \dots \leq κ_{k}$ be the $(k + 1)$ smallest eigenvalues of problem (24) and,

$\begin{matrix} X_{i} = {(X_{i 1}, X_{i 2}, \dots, X_{i N})}^{T}, 0 \leq i \leq k, \end{matrix}$

be the orthonormal eigenvector corresponding to $κ_{i}$ such that $X_{i}^{T} S X_{i} = 1$ . Define $ζ_{i} = \sum_{l = 1}^{N} X_{i l} ψ_{l}$ , then we have,

$\begin{matrix} λ_{i} \leq κ_{i} \leq λ_{i} + \frac{1}{β} \int_{R^{n}} {| \nabla (φ_{i} - ζ_{i}) (x) |}^{2} ρ (x) d x, 0 \leq i \leq k, \end{matrix}$

(26)

where $λ_{i}$ , $φ_{i}$ are the eigenvalues and the eigenfunctions of the operator $- L$ , respectively.
3.: Let $P$ be the orthogonal projection operator from H to $H_{0}$ , and φ be an eigenfunction of the operator $- L$ corresponding to the eigenvalue λ. Define constants,

$\begin{matrix} δ_{1} = | L P^{⊥} {φ |}_{π}, δ_{2} = {| P^{⊥} φ |}_{π}, \end{matrix}$

and suppose that $δ_{2} < 1$ . Then there is an eigenvalue $κ_{i}$ of problem (24) such that,

$\begin{matrix} | κ_{i} - λ | \leq \frac{δ_{1}}{{(1 - δ_{2}^{2})}^{\frac{1}{2}}} . \end{matrix}$

(27)

3.3. Infinite Dimensional Subspace: Effective Dynamics

In this subsection, we discuss Galerkin approximations based on infinite-dimensional ansatz spaces; these cases appear when studying the effective dynamics given by a so-called reaction coordinate, cf. [23]. In order to explain the relation between Galerkin approximation and effective dynamics, let us first recall some definitions and results regarding the effective dynamics. For more details, readers are referred to [21,23,33] for related work.

Let

ξ : R^{n} \to R^{m}

be a reaction coordinate function,

m \geq 1

. For any function

f \in H

and

x \in R^{n}

, we define,

\begin{matrix} \begin{matrix} P f (x) = & \frac{1}{Q (z)} \int_{R^{n}} ρ (x^{'}) f (x^{'}) δ (ξ (x^{'}) - z) d x^{'}, \end{matrix} \end{matrix}

(28)

where

z = ξ (x) \in R^{m}

,

δ (\cdot)

denotes the delta function, and

Q (z) = \int_{R^{n}} ρ (x^{'}) δ (ξ (x^{'}) - z) d x^{'}

is a normalization factor satisfying

\int_{R^{m}} Q (z) d z = 1

. Define the probability measure

ν

on

R^{m}

given by

ν (d z) = Q (z) d z

for

z \in R^{m}

and consider the Hilbert space

\tilde{H} = L^{2} (R^{m}, ν)

.

\tilde{H}

induces a (infinite dimensional) linear subspace of H, namely,

\begin{matrix} H_{0} = \{f | f \in H, f = \tilde{f} \circ ξ, for some \tilde{f} \in \tilde{H}\} \subset H, \end{matrix}

(29)

and (28) clearly implies that

P f \in H_{0}

.

Let

\tilde{f} \in \tilde{H}

satisfy

P f = \tilde{f} \circ ξ

. Then, using (28), we can verify that

P^{2} = P

and,

\begin{matrix} {〈 f, h 〉}_{π} = {〈 P f, h 〉}_{π} = {〈 \tilde{f}, \tilde{h} 〉}_{ν}, \forall h = \tilde{h} \circ ξ \in H_{0} . \end{matrix}

(30)

Therefore, the mapping

P : H \to H_{0}

actually is the orthogonal projection operator from H to the subspace

H_{0}

. For

f \in H

,

z \in R^{m}

, in the following we will also write

P f (z)

instead of

\tilde{f} (z)

, where

\tilde{f} \in \tilde{H}

such that

P f = \tilde{f} \circ ξ

. The effective dynamics of the dynamics (1) for the reaction coordinate

ξ

is defined on

R^{m}

and satisfies the SDE,

\begin{matrix} d z_{s} = \tilde{b} (z_{s}) d s + \sqrt{2 β^{- 1}} \tilde{σ} (z_{s}) d w_{s}, \end{matrix}

(31)

where

z_{s} \in R^{m}

,

w_{s}

is a Brownian motion on

R^{m}

, and the coefficients

\tilde{b} : R^{m} \to R^{m}

,

\tilde{σ} : R^{m} \to R^{m \times m}

are given by,

\begin{matrix} \begin{matrix} {\tilde{b}}_{l} (z) = & P (L ξ_{l}) (z) = P (- \nabla V \cdot \nabla ξ_{l} + \frac{1}{β} Δ ξ_{l}) (z), \\ {\tilde{a}}_{l l^{'}} (z) = & {(\tilde{σ} {\tilde{σ}}^{T})}_{l l^{'}} (z) = P (\sum_{i = 1}^{n} \frac{\partial ξ_{l}}{\partial x_{i}} \frac{\partial ξ_{l^{'}}}{\partial x_{i}}) (z), \end{matrix} \end{matrix}

(32)

for

\forall z \in R^{m}

,

1 \leq l, l^{'} \leq m

. The infinitesimal generator of the process governed by (31) is given by,

\begin{matrix} \tilde{L} = & \sum_{l = 1}^{m} {\tilde{b}}_{l} \frac{\partial}{\partial z_{l}} + \frac{1}{β} \sum_{l, l^{'} = 1}^{m} {\tilde{a}}_{l l^{'}} \frac{\partial^{2}}{\partial z_{l} \partial z_{l^{'}}}, \end{matrix}

(33)

which is a self-adjoint operator on space

\tilde{H}

with discrete spectrum under appropriate conditions on

ξ

. We consider the eigenproblem,

\begin{matrix} - \tilde{L} \tilde{f} = \tilde{λ} \tilde{f}, \tilde{f} \in \tilde{H}, \end{matrix}

(34)

and let

{\tilde{φ}}_{i} \in \tilde{H}

be the orthonormal eigenfunctions of the operator

- \tilde{L}

corresponding to the eigenvalues

{\tilde{λ}}_{i}

, where,

\begin{matrix} 0 = {\tilde{λ}}_{0} < {\tilde{λ}}_{1} \leq {\tilde{λ}}_{2} \leq \dots . \end{matrix}

(35)

Applying Theorems 1 and 2, we have the following result.

Corollary 2.

For the eigenproblem (34) associated with the effective dynamics, the following three statements are valid:

1.: For $f = \tilde{f} \circ ξ \in H_{0}$ where $\tilde{f} \in \tilde{H}$ , we have,

$\begin{matrix} P L f = (\tilde{L} \tilde{f}) \circ ξ . \end{matrix}$

(36)
2.: Let $φ_{i}$ and ${\tilde{φ}}_{i}$ be the normalized eigenfunctions of the operators $- L$ and $- \tilde{L}$ corresponding to eigenvalues $λ_{i}$ and ${\tilde{λ}}_{i}$ , respectively. We have,

$\begin{matrix} λ_{i} \leq {\tilde{λ}}_{i} \leq λ_{i} + \frac{1}{β} \int_{R^{n}} | \nabla (φ_{i} - {\tilde{φ}}_{i} \circ ξ) (x) |^{2} ρ (x) d x . \end{matrix}$

(37)
3.: Let φ be the normalized eigenfunction of the operator $- L$ corresponding to the eigenvalue λ. Define constants,

$\begin{matrix} δ_{1} = | L P^{⊥} {φ |}_{π}, δ_{2} = {| P^{⊥} φ |}_{π}, \end{matrix}$

and suppose $δ_{2} < 1$ . Then there is an eigenvalue ${\tilde{λ}}_{i}$ of the problem (34), such that,

$\begin{matrix} | {\tilde{λ}}_{i} - λ | \leq \frac{δ_{1}}{{(1 - δ_{2}^{2})}^{\frac{1}{2}}} . \end{matrix}$

(38)

Proof.

The proof of the first assertion can be found in [23]. Using (30) and (36), we can derive,

\begin{matrix} - {〈 P L (\tilde{φ_{i}} \circ ξ), f 〉}_{π} = - {〈 (\tilde{L} \tilde{φ_{i}}) \circ ξ, f 〉}_{π} = - {〈 \tilde{L} \tilde{φ_{i}}, \tilde{f} 〉}_{ν} = {\tilde{λ}}_{i} {〈 \tilde{φ_{i}} \circ ξ, f 〉}_{π}, \forall f = \tilde{f} \circ ξ \in H_{0}, \end{matrix}

(39)

i.e.,

{\tilde{λ}}_{i}

and

{\tilde{φ}}_{i} \circ ξ

are the eigenvalues and eigenfunctions of the projected operator

- P L

on the subspace

H_{0}

, respectively. Furthermore,

{〈 {\tilde{φ}}_{i} \circ ξ, {\tilde{φ}}_{i} \circ ξ 〉}_{π} = {〈 {\tilde{φ}}_{i}, {\tilde{φ}}_{i} 〉}_{ν} = 1

, i.e.,

{\tilde{φ}}_{i} \circ ξ

is normalized. Therefore, the second assertion is implied by Theorem 1. The third assertion follows from Theorem 2 in the same way. ☐

Remark 2.

As an interesting conclusion of the first assertion, we can conclude that, on the infinitesimal subspace

H_{0}

defined in (29), the projected operator

- P L

is essentially described by another differential operator

\tilde{L}

, which is defined in the Hilbert space

\tilde{H}

and coincides with the infinitesimal generator of the effective dynamics on

R^{m}

.

4. Variational Approach to Generator Eigenproblem

In this section, we study the variational approach to approximate the eigenvalues and eigenfunctions of the operator

- L

. This approach has been considered in [4,7,8] to study the related eigenproblem of the transfer operator. Its main idea is to approximate the dominant eigenvalues of a self-adjoint transfer operator via an appropriate form of the Rayleigh variational principle instead via Galerkin discretization [7]. Herein, we present a similar approach to the low-lying generator eigenvalues.

4.1. Variational Principle

The main object of the variational approach is the following functional

F : D {(L)}^{\oplus (k + 1)} \to R

, that acts on

k + 1

functions from

D (L)

.

Given arbitrary constants

ω_{i} > 0

,

0 \leq i \leq k

, we define the functional,

\begin{matrix} F (f_{0}, f_{1}, \dots, f_{k}) = \sum_{i = 0}^{k} ω_{i} {〈 - L f_{i}, f_{i} 〉}_{π}, f_{i} \in D (L) . \end{matrix}

(40)

Clearly, for the (normalized) leading eigenfunctions

φ_{i}

of

L

, we have,

F (φ_{0}, φ_{1}, \dots, φ_{k}) = \sum_{i = 0}^{k} ω_{i} λ_{i},

where

λ_{i}

are the corresponding eigenvalues. The main workhorse of the variational principle is the following lower and upper bound:

Theorem 3 (Variational principle).

Let

ω_{i}

,

i = 0, 1, \dots, k

be a decreasing sequence of positive real numbers, i.e.,

ω_{0} > ω_{1} > \dots > ω_{k} > 0

. For any orthonormal family of functions

f_{i} \in D (L)

,

i = 0, 1, \dots, k

, we have,

\begin{matrix} F (φ_{0}, φ_{1}, \dots, φ_{k}) \leq F (f_{0}, f_{1}, \dots, f_{k}) \leq F (φ_{0}, φ_{1}, \dots, φ_{k}) + F (f_{0} - φ_{0}, f_{1} - φ_{1}, \dots, f_{k} - φ_{k}), \end{matrix}

(41)

or more explicitly,

\begin{matrix} \sum_{i = 0}^{k} ω_{i} λ_{i} \leq \sum_{i = 0}^{k} ω_{i} {〈 - L f_{i}, f_{i} 〉}_{π} \leq \sum_{i = 0}^{k} ω_{i} λ_{i} + \sum_{i = 0}^{k} ω_{i} {〈 - L (f_{i} - φ_{i}), (f_{i} - φ_{i}) 〉}_{π} . \end{matrix}

(42)

In order to prove this variational principle we need the following simple lemma:

Lemma 1.

Suppose

k > 0

, and let

{(α_{i})}_{i = 0, 1, \dots, k}

and

{(ω_{i})}_{i = 0, 1, \dots, k}

be two ordered sequences of real numbers such that,

\begin{matrix} α_{0} \leq α_{1} \leq \dots \leq α_{k}, ω_{0} \geq ω_{1} \geq \dots \geq ω_{k} . \end{matrix}

Then, for any permutation

{(ω_{i}^{'})}_{i = 0, 1, \dots, k}

of the sequence

{(ω_{i})}_{i = 0, 1, \dots, k}

, we have,

\begin{matrix} \sum_{i = 0}^{k} α_{i} ω_{i}^{'} \geq \sum_{i = 0}^{k} α_{i} ω_{i} . \end{matrix}

(43)

Proof.

The proof of Theorem 3 is given in two steps:

For the lower bound, we consider the optimization problem,

$\begin{matrix} \begin{matrix} \min_{f_{i}} F (f_{0}, f_{1}, \dots, f_{k}) = \min_{f_{i}} \sum_{i = 0}^{k} ω_{i} {〈 - L f_{i}, f_{i} 〉}_{π}, \\ subject to {〈 f_{i}, f_{j} 〉}_{π} = δ_{i j}, 0 \leq i, j \leq k . \end{matrix} \end{matrix}$

(44)

Next, we introduce the Lagrange multipliers $λ_{i j}$ for $0 \leq i \leq j \leq k$ , and consider the auxiliary functional,

$\begin{matrix} \sum_{i = 0}^{k} ω_{i} {〈 - L f_{i}, f_{i} 〉}_{π} - \sum_{i = 0}^{k} \sum_{j = i}^{k} λ_{i j} ({〈 f_{i}, f_{j} 〉}_{π} - δ_{i j}) . \end{matrix}$

(45)

Applying calculus of variation, we conclude that the minimizer of (44) satisfies,

$\begin{matrix} \begin{matrix} - 2 ω_{i} L f_{i} - \sum_{j = i}^{k} λ_{i j} f_{j} - \sum_{j = 0}^{i} λ_{j i} f_{j} = 0, \forall 0 \leq i \leq k, \\ {〈 f_{i}, f_{j} 〉}_{π} = δ_{i j}, 0 \leq i, j \leq k . \end{matrix} \end{matrix}$

(46)

Multiplying $f_{j}$ for some $i < j \leq k$ in the first equation of (46) and integrating, we obtain $λ_{i j} = - 2 ω_{i} {〈 L f_{i}, f_{j} 〉}_{π}$ . In the same way we could also obtain $λ_{i j} = - 2 ω_{j} {〈 L f_{j}, f_{i} 〉}_{π}$ . Using the fact that $L$ is self-adjoint and $ω_{i} > ω_{j}$ for $i < j$ , we conclude that,

$\begin{matrix} λ_{i j} = {〈 L f_{i}, f_{j} 〉}_{π} = 0, \forall 0 \leq i < j \leq k, \end{matrix}$

(47)

and (46) reduces to an eigenproblem,

$\begin{matrix} - L f_{i} = \frac{λ_{i i}}{ω_{i}} f_{i}, 0 \leq i \leq k . \end{matrix}$

(48)

Therefore, the minimizer of (44) is given by the orthonormal eigenfunctions. Applying Lemma 1, we can further conclude that the lower bound is obtained when $f_{i} = φ_{i}$ , with value,

$\begin{matrix} \sum_{i = 0}^{k} ω_{i} {〈 - L φ_{i}, φ_{i} 〉}_{π} = \sum_{i = 0}^{k} ω_{i} λ_{i} . \end{matrix}$

(49)
For the upper bound, similarly to the proof of Theorem 1, direct computation gives,

$\begin{matrix} \sum_{i = 0}^{k} ω_{i} {〈 - L (f_{i} - φ_{i}), (f_{i} - φ_{i}) 〉}_{π} \\ = \sum_{i = 0}^{k} ω_{i} {〈 - L f_{i}, f_{i} 〉}_{π} - \sum_{i = 0}^{k} ω_{i} λ_{i} + 2 \sum_{i = 0}^{k} ω_{i} λ_{i} (1 - {〈 f_{i}, φ_{i} 〉}_{π}) \\ \geq \sum_{i = 0}^{k} ω_{i} {〈 - L f_{i}, f_{i} 〉}_{π} - \sum_{i = 0}^{k} ω_{i} λ_{i}, \end{matrix}$

where we have used the fact that $- L φ_{i} = λ_{i} φ_{i}$ and ${〈 f_{i}, φ_{i} 〉}_{π} \leq | f_{i} |_{π} {| φ_{i} |}_{π} = 1$ , since both $f_{i}, φ_{i}$ are normalized functions.

☐

4.2. Optimization Problem

The variational principle of Theorem 3 allows for approximation of the low-lying eigenvalues of the generator. In order to turn it into an algorithm, we again introduce N basis functions

ψ_{1}

, ⋯,

ψ_{N} \in D (L)

. We want to approximate the first

k + 1

eigenvalues

λ_{i}

, as well as the eigenfunctions

φ_{i}

,

0 \leq i \leq k

by approximating the eigenfunctions using linear combinations of the basis functions. That is, we consider the functions,

f_{i} = \sum_{l = 1}^{N} x_{i l} ψ_{l},

(50)

where

x_{i l}

are real-valued coefficients to be determined,

0 \leq i \leq k

,

1 \leq l \leq N

. Inspired by Theorem 3, we wish to determine the coefficients

x_{i l}

by solving the optimization problem,

\begin{matrix} \begin{matrix} \min_{{x_{i l}}} F (f_{0}, f_{1}, \dots, f_{k}) = \min_{{x_{i l}}} \sum_{i = 0}^{k} ω_{i} {〈 - L f_{i}, f_{i} 〉}_{π}, f_{i} = \sum_{l = 1}^{N} x_{i l} ψ_{l}, \\ subject to {〈 f_{i}, f_{j} 〉}_{π} = δ_{i j}, 0 \leq i, j \leq k . \end{matrix} \end{matrix}

(51)

Recalling the matrices

C, S

defined in (25) and defining the vectors

X_{i} = {(x_{i 1}, \dots, x_{i N})}^{T} \in R^{N}

,

0 \leq i \leq k

, the optimization problem (51) can be reformulated as,

\begin{matrix} \begin{matrix} \min_{{x_{i l}}} \sum_{i = 0}^{k} ω_{i} (\sum_{1 \leq l, l^{'} \leq N} x_{i l} x_{i l^{'}} C_{l l^{'}}), \\ subject to \sum_{1 \leq l, l^{'} \leq N} x_{i l} S_{l l^{'}} x_{j l^{'}} = δ_{i j}, 0 \leq i, j \leq k, \end{matrix} \end{matrix}

(52)

or, equivalently, in matrix form,

\begin{matrix} \begin{matrix} \min_{X_{0}, X_{1}, \dots, X_{k}} \sum_{i = 0}^{k} ω_{i} X_{i}^{T} C X_{i}, \\ subject to X_{i}^{T} S X_{j} = δ_{i j}, 0 \leq i, j \leq k . \end{matrix} \end{matrix}

(53)

Using a similar argument as in the proof of Theorem 3, we can obtain,

Theorem 4.

The minimum of the optimization problem (51) is achieved by the functions

f_{i}

as of (50) with the coefficients from the first

k + 1

eigenvectors

X_{i}

of the generalized matrix eigenproblem,

\begin{matrix} C X = λ S X . \end{matrix}

(54)

It is supposed that the eigenvectors

X_{i}

of (54) are chosen such that

X_{i}^{T} S X_{j} = δ_{i j}

and the corresponding eigenvalues are

κ_{i}

for

0 \leq i \leq k

, where

κ_{0} \leq κ_{1} \leq \dots \leq κ_{k}

. Then, the minimum of (51) is,

\begin{matrix} \sum_{i = 0}^{k} ω_{i} X_{i}^{T} C X_{i} = \sum_{i = 0}^{k} ω_{i} κ_{i} . \end{matrix}

(55)

Remark 3.

Combining the above result with SubSection 3.2, we see that both the Galerkin method and the variational approach lead to the same generalized matrix eigenproblem with an identical estimate for the eigenvalue error.

5. Numerical Algorithms

In this section, we consider how the matrices

C, S

defined in (25), that is,

\begin{matrix} C_{l l^{'}} = {〈 - L ψ_{l}, ψ_{l^{'}} 〉}_{π}, S_{l l^{'}} = {〈 ψ_{l}, ψ_{l^{'}} 〉}_{π}, 1 \leq l, l^{'} \leq N \end{matrix}

(56)

can be approximated from trajectories of the diffusion process. For the transfer operator this problem has been studied in [4,7,8] using trajectories of the original diffusion process given by (1). In contrast, we herein will consider trajectories of the effective dynamics (31) instead of the original diffusion process.

5.1. Computing Coefficient Matrices Using Effective Dynamics

Similar to the setup in SubSection 3.3, we assume that a reaction coordinate function

ξ : R^{n} \to R^{m}

, as well as N basis functions

ψ_{l}

,

1 \leq l \leq N

, are given. Furthermore, we suppose that the basis functions

ψ_{l}

can be written as

ψ_{l} = {\tilde{ψ}}_{l} \circ ξ

for some functions

{\tilde{ψ}}_{l} \in \tilde{H}

, i.e.,

ψ_{l} \in H_{0}

. In this case, it follows from the first assertion of Corollary 2 and the relation (30) that,

\begin{matrix} \begin{matrix} S_{l l^{'}} = & {〈 ψ_{l}, ψ_{l^{'}} 〉}_{π} = {〈 {\tilde{ψ}}_{l}, {\tilde{ψ}}_{l^{'}} 〉}_{ν}, \\ C_{l l^{'}} = & {〈 - L ψ_{l}, ψ_{l^{'}} 〉}_{π} = {〈 - L ψ_{l}, P ψ_{l^{'}} 〉}_{π} = {〈 - P L ψ_{l}, ψ_{l^{'}} 〉}_{π} \\ = & {〈 - (\tilde{L} \tilde{ψ_{l}}) \circ ξ, ψ_{l^{'}} 〉}_{π} = {〈 - \tilde{L} \tilde{ψ_{l}}, {\tilde{ψ}}_{l^{'}} 〉}_{ν} . \end{matrix} \end{matrix}

(57)

These equalities, though simple, are quite interesting, because they relate the entries of the coefficient matrices

C, S

to the infinitesimal generator

\tilde{L}

of the effective dynamics in (33). Since ν is the unique invariant measure of the effective dynamics [23], we can apply the ergodic theorem and get,

\begin{matrix} S_{l l^{'}} = \lim_{T \to + \infty} \frac{1}{T} \int_{0}^{T} {\tilde{ψ}}_{l} (z_{s}) {\tilde{ψ}}_{l^{'}} (z_{s}) d s \approx \frac{1}{M - M_{0}} \sum_{i = M_{0} + 1}^{M} {\tilde{ψ}}_{l} (z_{i Δ t}) {\tilde{ψ}}_{l^{'}} (z_{i Δ t}), \end{matrix}

(58)

where

z_{s}

denotes a realization of the effective dynamics (31),

Δ t > 0

is the step size,

M \in N

is a large integer, and only the parts of trajectories after time

M_{0} Δ t

are used for estimation.

For the matrix C, using (57), the definition of the infinitesimal generator

\tilde{L}

, as well as the ergodic theorem, we can derive,

\begin{matrix} \begin{matrix} C_{l l^{'}} = & {〈 - \tilde{L} \tilde{ψ_{l}}, {\tilde{ψ}}_{l^{'}} 〉}_{ν} \\ = & - \int_{R^{m}} \lim_{s \to 0} \frac{E ({\tilde{ψ}}_{l} (z_{s}) | z_{0} = z) - {\tilde{ψ}}_{l} (z)}{s} {\tilde{ψ}}_{l^{'}} (z) d ν (z) \\ = & - \lim_{s \to 0} \int_{R^{m}} \frac{E ({\tilde{ψ}}_{l} (z_{s}) | z_{0} = z) - {\tilde{ψ}}_{l} (z)}{s} {\tilde{ψ}}_{l^{'}} (z) d ν (z) \\ = & - \lim_{s \to 0} E [\frac{{\tilde{ψ}}_{l} (z_{s}) - {\tilde{ψ}}_{l} (z_{0})}{s} {\tilde{ψ}}_{l^{'}} (z_{0}) | z_{0} \sim ν] \\ = & - \lim_{s \to 0} \lim_{T \to + \infty} \frac{1}{T} \int_{0}^{T} \frac{{\tilde{ψ}}_{l} (z_{t + s}) - {\tilde{ψ}}_{l} (z_{t})}{s} {\tilde{ψ}}_{l^{'}} (z_{t}) d t \\ = & - \lim_{s \to 0} \lim_{T \to + \infty} \frac{1}{T} \int_{0}^{T} \frac{{\tilde{ψ}}_{l^{'}} (z_{t + s}) - {\tilde{ψ}}_{l^{'}} (z_{t})}{s} {\tilde{ψ}}_{l} (z_{t}) d t . \end{matrix} \end{matrix}

(59)

In the above,

E

denotes the mathematical expectation with respect to the effective dynamics

z_{s}

, and the last equality follows from the symmetry of the matrix C.

To compute

C_{l l^{'}}

numerically, we further introduce a parameter

τ ≪ 1

, and approximate (59) by,

\begin{matrix} \begin{matrix} C_{l l^{'}} & \approx & - \frac{1}{2 (M - M_{0})} [\sum_{i = M_{0} + 1}^{M} \frac{{\tilde{ψ}}_{l} (z_{i Δ t + τ}) - {\tilde{ψ}}_{l} (z_{i Δ t})}{τ} {\tilde{ψ}}_{l^{'}} (z_{i Δ t}) \\ + \sum_{i = M_{0} + 1}^{M} \frac{{\tilde{ψ}}_{l^{'}} (z_{i Δ t + τ}) - {\tilde{ψ}}_{l^{'}} (z_{i Δ t})}{τ} {\tilde{ψ}}_{l} (z_{i Δ t})] \\ = & - \frac{1}{2 (M - M_{0})} \sum_{i = M_{0} + 1}^{M} \frac{{\tilde{ψ}}_{l} (z_{i Δ t + τ}) {\tilde{ψ}}_{l^{'}} (z_{i Δ t}) + {\tilde{ψ}}_{l} (z_{i Δ t}) {\tilde{ψ}}_{l^{'}} (z_{i Δ t + τ}) - 2 {\tilde{ψ}}_{l} (z_{i Δ t}) {\tilde{ψ}}_{l^{'}} (z_{i Δ t})}{τ} . \end{matrix} \end{matrix}

(60)

Formulas (58) and (60) can be used to estimate the coefficient matrices

C, S

, provided that we can obtain a long trajectory of the effective dynamics (31).

Remark 4.

From the discussions in Section 2, we know that the eigenvalues of the transfer operator

T_{τ}

and those of the operator

- L

satisfy the relation

μ_{i} = e^{- λ_{i} τ}

,

i \geq 0

. When the lag time τ is small, the approximation

μ_{i} \approx 1 - λ_{i} τ

holds for the leading eigenvalues since

λ_{i}

is small. In fact, estimating the matrix C using the last expression in (60), we will have

C = \frac{S - \bar{C}}{τ}

, where the matrix

\bar{C}

is given by,

\begin{matrix} {\bar{C}}_{l l^{'}} = & \frac{1}{2 (M - M_{0})} \sum_{i = M_{0} + 1}^{M} [{\tilde{ψ}}_{l} (z_{i Δ t + τ}) {\tilde{ψ}}_{l^{'}} (z_{i Δ t}) + {\tilde{ψ}}_{l} (z_{i Δ t}) {\tilde{ψ}}_{l^{'}} (z_{i Δ t + τ})] . \end{matrix}

(61)

It is easy to observe that the eigenvalue estimations resulting from problem (54) are related to those of the problem

\bar{C} X = μ S X

by

μ = 1 - λ τ

. Note that (61) is very similar to the estimator derived in [3] except for the fact that here we use trajectories of the effective dynamics instead of the original dynamics. To summarize, when the lag time τ is small, the above discussion implies that after solving the problem (54) we can approximate the leading eigenvalues of the transfer operator by

μ_{i} = 1 - λ_{i} τ

.

5.2. Algorithms for Simulating the Effective Dynamics

In order to utilize the above results we have to be able to efficiently compute (long) realizations of the effective dynamics (31). In this subsection, we discuss two numerical algorithms for realizing this.

5.2.1. Algorithm 1

The first algorithm is based on the following formula for the coefficients

\tilde{b}, \tilde{a}

given in (32):

\begin{matrix} \begin{matrix} {\tilde{b}}_{l} (z) = & \lim_{s \to 0 +} E (\frac{ξ_{l} (x_{s}) - z_{l}}{s} | x_{0} \sim μ_{z}), 1 \leq l \leq m, \\ {\tilde{a}}_{l l^{'}} (z) = & \frac{β}{2} \lim_{s \to 0 +} E (\frac{(ξ_{l} (x_{s}) - z_{l}) (ξ_{l^{'}} (x_{s}) - z_{l^{'}})}{s} | x_{0} \sim μ_{z}), 1 \leq l, l^{'} \leq m, \end{matrix} \end{matrix}

(62)

where

x_{s}

is a realization of the original diffusive dynamics (1) and

μ_{z}

is the restriction of the invariant measure π to the submanifold

ξ^{- 1} (z) = \{x \in R^{n} | ξ (x) = z\}

. We refer readers to [23] for more details.

In order to utilize this for simulation, we fix two parameters

0 < Δ s ≪ Δ t

and proceed as follows:

At step $k \geq 0$ , starting from $x_{0} \sim μ_{z}$ , generate N trajectories $x_{Δ s}^{(i)}$ of length $Δ s$ of the (unconstrained) full dynamics $x_{s}$ by discretizing (1). Compute the coefficients $\tilde{b}, \tilde{a}$ by,

$\begin{matrix} \begin{matrix} {\tilde{b}}_{l} = & \frac{1}{N} \sum_{i = 1}^{N} \frac{ξ_{l} (x_{Δ s}^{(i)}) - z_{k Δ t, l}}{Δ s}, \\ {\tilde{a}}_{l l^{'}} = & \frac{β}{2} [\frac{1}{N} \sum_{i = 1}^{N} \frac{(ξ_{l} (x_{Δ s}^{(i)}) - z_{k Δ t, l}) (ξ_{l^{'}} (x_{Δ s}^{(i)}) - z_{k Δ t, l^{'}})}{Δ s} - {\tilde{b}}_{l} {\tilde{b}}_{l^{'}} Δ s], \end{matrix} \end{matrix}$

(63)

where $1 \leq l, l^{'} \leq m$ .
Compute $\tilde{σ}$ from $\tilde{a} = \tilde{σ} {\tilde{σ}}^{T}$ by matrix decomposition. Update $z_{(k + 1) Δ t}$ by,

$\begin{matrix} z_{(k + 1) Δ t, l} = z_{k Δ t, l} + {\tilde{b}}_{l} Δ t + \sqrt{\frac{2 Δ t}{β}} \sum_{i = 1}^{m} {\tilde{σ}}_{l i} η_{i}^{(k)}, 1 \leq l \leq m, \end{matrix}$

(64)

where $η_{i}^{(k)}$ are independent standard Gaussian variables, $1 \leq i \leq m$ .

In the above,

z_{k Δ t, l}

denotes the lth components of

z_{k Δ t} \in R^{m}

. The initial states

x_{0}

are sampled from the probability measure

μ_{z}

; this can be achieved by using the numerical schemes proposed in [15,34,35], which simulate the original dynamics (1) and then project the state onto the submanifold

ξ^{- 1} (z)

.

5.2.2. Algorithm 2

The second algorithm is inspired by the TAMD method proposed in [17]. In the following we provide a slightly different argument which motivates the method. The main idea is to consider the extended dynamics,

\begin{matrix} d x_{s, i} = & - \frac{\partial V}{\partial x_{i}} (x_{s}) d s - κ \sum_{j = 1}^{m} (ξ_{j} (x_{s}) - z_{s, j}) \frac{\partial ξ_{j}}{\partial x_{i}} (x_{s}) d s + \sqrt{2 β^{- 1}} d w_{s, i}, 1 \leq i \leq n, \\ d z_{s, l} = & κ \sum_{k = 1}^{m} (ξ_{k} (x_{s}) - z_{s, k}) \sum_{j = 1}^{n} \frac{\partial ξ_{k}}{\partial x_{j}} (x_{s}) \frac{\partial ξ_{l}}{\partial x_{j}} (x_{s}) d s + \sqrt{2 β^{- 1}} \sum_{j = 1}^{n} \frac{\partial ξ_{l}}{\partial x_{j}} (x_{s}) d {\bar{w}}_{s, j}, 1 \leq l \leq m, \end{matrix}

(65)

where κ is a large constant,

w_{s}

,

{\bar{w}}_{s}

are independent Brownian motions on

R^{n}

, and

x_{s, i}

denotes the ith component of the state

x_{s}

(similar notations for

z_{s}, w_{s}, {\bar{w}}_{s}

). Note that the invariant measure of the dynamics (65) has a probability density,

\begin{matrix} ρ_{κ} (x, z) \propto e^{- β (V (x) + \frac{κ}{2} {| ξ (x) - z |}^{2})}, (x, z) \in R^{n + m}, \end{matrix}

(66)

with respect to the Lebesgue measure on the extended space

R^{n + m}

. If we choose

(x, z) \to z

as the reaction coordinate function of (65) and derive the effective dynamics following [21,23], we can obtain,

\begin{matrix} d z_{s} = & {\tilde{b}}^{(κ)} (z_{s}) d s + \sqrt{2 β^{- 1}} {\tilde{σ}}^{(κ)} (z_{s}) d w_{s}, \end{matrix}

(67)

where

w_{s}

is a Brownian motion on

R^{m}

, and,

\begin{matrix} \begin{matrix} {\tilde{b}}_{l}^{(κ)} (z) = & κ \int_{R^{n}} \sum_{k = 1}^{m} (ξ_{k} (x) - z_{k}) \sum_{i = 1}^{n} \frac{\partial ξ_{k}}{\partial x_{i}} (x) \frac{\partial ξ_{l}}{\partial x_{i}} (x) ρ_{κ} (x, z) d x = \int_{R^{n}} L ξ_{l} (x) ρ_{κ} (x, z) d x \\ {\tilde{a}}_{l l^{'}}^{(κ)} (z) = & {({\tilde{σ}}^{(κ)} {({\tilde{σ}}^{(κ)})}^{T})}_{l l^{'}} (z) = \int_{R^{n}} \sum_{i = 1}^{n} \frac{\partial ξ_{l}}{\partial x_{i}} (x) \frac{\partial ξ_{l^{'}}}{\partial x_{i}} (x) ρ_{κ} (x, z) d x, \end{matrix} \end{matrix}

(68)

for

z \in R^{m}

,

1 \leq l, l^{'} \leq m

. Note that in (68),

L

is the generator given in (2) and integration by parts has been used to derive the second expression for

{\tilde{b}}^{(κ)}

. It is not difficult to show that

{\tilde{b}}^{(κ)} \to \tilde{b}

and

{\tilde{a}}^{(κ)} \to \tilde{a}

, when

κ \to + \infty

. Therefore (67) is an approximation of the effective dynamics (31) when

κ ≫ 1

. For numerical simulations, we can express (68) as time averages,

\begin{matrix} \begin{matrix} {\tilde{b}}_{l}^{(κ)} (z) = & \lim_{T \to \infty} \frac{κ}{T} \int_{0}^{T} \sum_{k = 1}^{m} (ξ_{k} (x_{s}) - z_{k}) \sum_{i = 1}^{n} \frac{\partial ξ_{l}}{\partial x_{i}} (x_{s}) \frac{\partial ξ_{k}}{\partial x_{i}} (x_{s}) d s, 1 \leq l \leq m, \\ {\tilde{a}}_{l l^{'}}^{(κ)} (z) = & \lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} \sum_{i = 1}^{n} \frac{\partial ξ_{l}}{\partial x_{i}} (x_{s}) \frac{\partial ξ_{l^{'}}}{\partial x_{i}} (x_{s}) d s, 1 \leq l, l^{'} \leq m, \end{matrix} \end{matrix}

(69)

where

x_{s}

satisfies the SDE (65) with fixed

z_{s} = z

, i.e.,

\begin{matrix} d x_{s, i} = & - \frac{\partial V}{\partial x_{i}} (x_{s}) d s - κ \sum_{j = 1}^{m} (ξ_{j} (x_{s}) - z_{j}) \frac{\partial ξ_{j}}{\partial x_{i}} (x_{s}) d s + \sqrt{2 β^{- 1}} d w_{s, i}, 1 \leq i \leq n . \end{matrix}

(70)

The main steps of the algorithm can be summarized as follows:

Denote $z = z_{k Δ t}$ at step $k \geq 0$ . Simulate dynamics (70) for M steps with time step size $Δ s$ . Compute the coefficients,

$\begin{matrix} \begin{matrix} {\tilde{b}}_{l} = & \frac{κ}{M - M_{0}} \sum_{j = M_{0} + 1}^{M} \sum_{l^{'} = 1}^{m} (ξ_{l^{'}} (x_{j Δ s}) - z_{l^{'}}) \sum_{i = 1}^{n} \frac{\partial ξ_{l}}{\partial x_{i}} (x_{j Δ s}) \frac{\partial ξ_{l^{'}}}{\partial x_{i}} (x_{j Δ s}), 1 \leq l \leq m, \\ {\tilde{a}}_{l l^{'}} = & \frac{1}{M - M_{0}} \sum_{j = M_{0} + 1}^{M} \sum_{i = 1}^{n} \frac{\partial ξ_{l}}{\partial x_{i}} (x_{j Δ s}) \frac{\partial ξ_{l^{'}}}{\partial x_{i}} (x_{j Δ s}), 1 \leq l, l^{'} \leq m . \end{matrix} \end{matrix}$

(71)
Compute $\tilde{σ}$ from $\tilde{a} = \tilde{σ} {\tilde{σ}}^{T}$ by matrix decomposition. Update the state $z_{(k + 1) Δ t}$ according to,

$\begin{matrix} z_{(k + 1) Δ t, l} = z_{k Δ t, l} + {\tilde{b}}_{l} Δ t + \sqrt{\frac{2 Δ t}{β}} \sum_{i = 1}^{m} {\tilde{σ}}_{l i} η_{i}^{(k)}, 1 \leq l \leq m, \end{matrix}$

(72)

where $η_{i}^{(k)}$ are independent standard Gaussian variables, $1 \leq i \leq m$ .

6. Illustrative Example

In order to illustrate the analysis and the performance of the numerical methods presented in the previous sections, we study simple two-dimensional dynamics:

\begin{matrix} \begin{matrix} d x_{s, 1} & = - \frac{\partial V (x_{s})}{\partial x_{1}} d s + \sqrt{2 β^{- 1}} d w_{s, 1}, \\ d x_{s, 2} & = - \frac{\partial V (x_{s})}{\partial x_{2}} d s + \sqrt{2 β^{- 1}} d w_{s, 2}, \end{matrix} \end{matrix}

(73)

where

β > 0

,

x_{s} = (x_{s, 1}, x_{s, 2}) \in R^{2}

and

w_{s, 1}, w_{s, 2}

are two independent one-dimensional Brownian motions.

The potential V in dynamics (73) is defined as,

\begin{matrix} V (x) = V_{1} (θ) + \frac{1}{ϵ} V_{2} (r, θ), \end{matrix}

(74)

where

ϵ > 0

,

\begin{matrix} V_{1} (θ) = \{\begin{matrix} {[1 - \frac{9}{π^{2}} {(θ - \frac{π}{3})}^{2}]}^{2} & θ > \frac{π}{3}, \\ \frac{3}{5} - \frac{2}{5} \cos 3 θ & - \frac{π}{3} \leq θ \leq \frac{π}{3}, \\ {[1 - \frac{9}{π^{2}} {(θ + \frac{π}{3})}^{2}]}^{2} & θ < - \frac{π}{3}, \end{matrix} V_{2} (r, θ) = {(r^{2} - 1 - \frac{1}{1 + 4 r θ^{2}})}^{2}, \end{matrix}

and

(r, θ)

is the polar coordinate of the state

x = (x_{1}, x_{2})

satisfying,

\begin{matrix} \begin{matrix} x_{1} = r \cos θ, x_{2} = r \sin θ, \\ θ \in [- π, π], r \geq 0 . \end{matrix} \end{matrix}

(75)

Under the polar coordinate, it is easy to see that the potential V contains three local minima at linebreak

θ = 0, \pm \frac{2 π}{3}

where the radius is determined by the relation

r^{2} = 1 + \frac{1}{1 + 4 r θ^{2}}

. Furthermore, when parameter ϵ is small, one can expect that the dynamics (73) will be mainly confined in the neighbourhood of the curve defined by the relation

r^{2} = 1 + \frac{1}{1 + 4 r θ^{2}}

, where the potential is relatively flat. Profiles of the potentials

V_{1}

and V are displayed in Figure 1.

The main purpose of this numerical experiment is to demonstrate that the leading eigenvalues of the operator

- L

corresponding to dynamics (73) can be approximated with the help of its effective dynamics, provided that the reaction coordinate function as well as the basis functions are chosen appropriately.

We choose parameters

β = 4.0

and

ϵ = 0.05

in the following numerical experiment. In fact, for this two-dimensional problem, it is possible to directly solve the eigenproblem (5) by discretizing the operator

L

. First of all, we note that the generator can be written as

L = \frac{e^{β V}}{β} \nabla (e^{- β V} \nabla)

. Defining the operator

D

such that

D f = e^{- \frac{β}{2} V} f

for a function f, it is straightforward to see that the operator

- L_{D} = - D L D^{- 1}

has the same eigenvalues

λ_{i}

as

- L

and the corresponding eigenfunctions are given by

φ_{i}^{D} = D φ_{i} = e^{- \frac{β}{2} V} φ_{i}

, where

φ_{i}

are the eigenfunctions of

- L

. Furthermore,

L_{D}

is a self-adjoint operator under the standard

L^{2}

inner product. Instead of

- L

, we will work with

- L_{D}

and solve the eigenproblem

- L_{D} f = λ f

because the discretized matrix will be symmetric and the corresponding eigenfunctions

φ_{i}^{D}

decay rapidly.

Taking into account the profile of the potential V in Figure 1b, we truncate the whole space

R^{2}

into a finite domain

[- 2, 2] \times [- 2, 2]

, which is then discretized using a

500 \times 500

uniform mesh, leading to the cell resolution

Δ x_{1} = Δ x_{2} = \frac{4}{500} = 0.008

. For

1 \leq i, j \leq 500

, let

f_{i, j}, V_{i, j}

denote the values of the functions f, V evaluated at state

(- 2.0 + (i - \frac{1}{2}) Δ x_{1}, - 2.0 + (j - \frac{1}{2}) Δ x_{2})

, respectively. Other notations such as

V_{i \pm \frac{1}{2}, j}

are defined in a similar way. Approximating

- L_{D} f = - \frac{1}{β} e^{\frac{β}{2} V} \nabla (e^{- β V} \nabla (e^{\frac{β}{2} V} f))

by the centered finite difference scheme, we obtain,

\begin{matrix} \begin{matrix} - {(L_{D} f)}_{i, j} \approx & \frac{e^{\frac{β}{2} V_{i, j}}}{β} [\frac{e^{- β V_{i - \frac{1}{2}, j}}}{Δ x_{1}} \frac{e^{\frac{β}{2} V_{i, j}} f_{i, j} - e^{\frac{β}{2} V_{i - 1, j}} f_{i - 1, j}}{Δ x_{1}} - \frac{e^{- β V_{i + \frac{1}{2}, j}}}{Δ x_{1}} \frac{e^{\frac{β}{2} V_{i + 1, j}} f_{i + 1, j} - e^{\frac{β}{2} V_{i, j}} f_{i, j}}{Δ x_{1}} \\ + \frac{e^{- β V_{i, j - \frac{1}{2}}}}{Δ x_{2}} \frac{e^{\frac{β}{2} V_{i, j}} f_{i, j} - e^{\frac{β}{2} V_{i, j - 1}} f_{i, j - 1}}{Δ x_{2}} - \frac{e^{- β V_{i, j + \frac{1}{2}}}}{Δ x_{2}} \frac{e^{\frac{β}{2} V_{i, j + 1}} f_{i, j + 1} - e^{\frac{β}{2} V_{i, j}} f_{i, j}}{Δ x_{2}}], \end{matrix} \end{matrix}

(76)

for

1 < i, j < 500

. For boundary cells, the Neumann condition is applied when the neighboring cells are lying outside of the truncated domain. From (76), it can be observed that the resulting discretization matrix is both symmetric and sparse. Solving the eigenvalues of this matrix (of order 250,000 ) using the Krylov–Schur method through the numerical package SLEPc [36], we obtain the first four eigenvalues,

\begin{matrix} λ_{0} = 0.000, λ_{1} = 0.010, λ_{2} = 0.044, λ_{3} = 1.458, \end{matrix}

(77)

with relative residual errors smaller than

1.1 \times 10^{- 6}

. The corresponding eigenvectors are shown in Figure 2.

With the above reference result at hand, we continue to study the approximation quality of the effective dynamics with respect to the leading eigenvalues. For this purpose, we choose the reaction coordinate function as

ξ (x) = θ (x) \in [- π, π]

, i.e., our reaction coordinate is the angle of the polar coordinate representation. Direct calculation shows that the coefficients

\tilde{b}, \tilde{σ}

in (32) reduces to,

\begin{matrix} \begin{matrix} \tilde{b} (z) = P (- \nabla V \cdot \nabla θ) (z), \tilde{a} (z) = (\tilde{σ} {\tilde{σ}}^{T}) (z) = P (\frac{1}{r^{2}}) (z), z \in [- π, π] . \end{matrix} \end{matrix}

(78)

Discretizing the interval

[- π, π]

into 1000 subintervals and applying the projection scheme proposed in [34] for each fixed

z = - π + \frac{2 π j}{1000}

,

0 \leq j \leq 1000

, we can compute the coefficients of the effective dynamics; the resulting profiles are shown in Figure 3a,b. After these preparations, we can generate trajectories of the effective dynamics by simulating the SDE (31) using standard time stepping schemes. As shown in Figure 3c, the effective dynamics spend long times around values

- \frac{2 π}{3}, 0

and

\frac{2 π}{3}

, which is accordance with the behavior of dynamics (73) as well as with the profile of the potential V in Figure 1b. Since the effective dynamics is one-dimensional, we can also discretize its infinitesimal generator

\tilde{L}

in (33) and compute the eigenvalues of

- \tilde{L}

which gives,

\begin{matrix} {\tilde{λ}}_{0} = 0.000, {\tilde{λ}}_{1} = 0.012, {\tilde{λ}}_{2} = 0.044, {\tilde{λ}}_{3} = 2.068 . \end{matrix}

Comparing to (77), we conclude that the eigenvalues

λ_{0}, λ_{1}, λ_{2}

of the original dynamics (73) are quite well approximated by those of the effective dynamics.

As the final step of our experiment, we test the trajectory-based method proposed in SubSection 5.1. First of all, we define basis functions

{\tilde{ψ}}_{1} (z) \equiv 1.0

and

{\tilde{ψ}}_{i} (z) = \exp (- \frac{{(z - c_{i})}^{2}}{2 γ_{i}^{2}})

,

2 \leq i \leq 7

, where,

\begin{matrix} c_{i} = \{- \frac{2 π}{3}, - \frac{2 π}{3}, 0, 0, \frac{2 π}{3}, \frac{2 π}{3}\}, r_{i} = \{0.4, 0.7, 0.4, 0.7, 0.4, 0.7\} . \end{matrix}

That is, we have located two Gaussian-like basis functions with different radiuses (

0.4

and

0.7

) at each of the three local minima

θ = 0, \pm \frac{2 π}{3}

. The matrices S and C are then estimated according to (58) and (60) by generating four long trajectories of the effective dynamics with time step size

Δ t = 5 \times 10^{- 4}

, and parameters

τ = 20 Δ t

,

M_{0} = 1000

,

M = 2 \times 10^{7}

are used for each trajectories. Solving the generalized matrix eigenproblem

C X = λ S X

, we obtain the leading eigenvalues,

\begin{matrix} {\tilde{λ}}_{0} = 0.000, {\tilde{λ}}_{1} = 0.013, {\tilde{λ}}_{2} = 0.045, {\tilde{λ}}_{3} = 3.776 . \end{matrix}

As before, we conclude that the eigenvalues

λ_{0}

,

λ_{1}

,

λ_{2}

of the original dynamics are relatively well approximated.

7. Conclusions

In this work we have studied the approximation of eigenvalues and eigenfunctions of the infinitesimal generator associated with the longest relaxation processes of diffusive processes in energy landscapes. Following the previous studies on transfer operators, we consider the Galerkin discretization method, the variational approach and the effective dynamics given by a low-dimensional reaction coordinate for solving the eigenvalue problem in application to the generator. It turns out that: (1) there are rather similar results for the approximation error of the three methods; and (2) the first two methods lead to the same generalized matrix eigenproblem while the third can be used for efficient estimation of the associated coefficient matrices.

Before we conclude, it is worth mentioning several issues which go beyond the scope of our current work. Firstly, while we have assumed that the dynamics are driven by the gradient of a potential function, we emphasize that the analysis in the current work can be directly applied to more general reversible processes (see [23] for details). Secondly, for non-reversible dynamics, as, for example, for Langevin dynamics, it is not immediately clear how the results in the current work can be applied. However, the approach in [9] (Section 5.3), shows that the extended reversibility of Langevin dynamics may well allow for a generalization of our results. Thirdly, for the numerical algorithms which are briefly outlined in Section 5, both the numerical analysis and their applications to more complicated systems need to be further investigated. Lastly, both the analysis and the algorithms in our current work depend on the choice of the reaction coordinate function. Different choices will have different approximation qualities of the eigenvalues/eigenfunctions of the system [21,23,37]. Algorithmic identification of reaction coordinate functions for high-dimensional systems is a challenging problem and has attracted considerable attention; most approaches utilize machine learning approaches [38], while the relation between identification and effective dynamics has only been explored recently [39]. All of these issues are topics of ongoing research.

Acknowledgments

This research has been funded by Deutsche Forschungsgemeinschaft (DFG) through grant CRC 1114.

Author Contributions

Christof Schütte conceived and designed research; Wei Zhang and Christof Schütte developed the basic theory; Wei Zhang performed numerical experiment; Wei Zhang and Christof Schütte wrote the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Schütte, C.; Fischer, A.; Huisinga, W.; Deuflhard, P. A direct approach to conformational dynamics based on hybrid Monte Carlo. J. Comput. Phys. 1999, 151, 146–168. [Google Scholar]
Pande, V.S.; Beauchamp, K.; Bowman, G.R. Everything you wanted to know about Markov state models but were afraid to ask. Methods 2010, 52, 99–105. [Google Scholar]
Pérez-Hernández, G.; Paul, F.; Giorgino, T.; De Fabritiis, G.; Noé, F. Identification of slow molecular order parameters for Markov model construction. J. Chem. Phys. 2013, 139, 015102. [Google Scholar]
Nüske, F.; Schneider, R.; Vitalini, F.; Noé, F. Variational tensor approach for approximating the rare-event kinetics of macromolecular systems. J. Chem. Phys. 2016, 144, 054105. [Google Scholar]
Noé, F.; Schütte, C.; Vanden-Eijnden, E.; Reich, L.; Weikl, T.R. Constructing the full ensemble of folding pathways from short off-equilibrium simulations. Proc. Natl. Acad. Sci. USA 2009, 106, 19011–19016. [Google Scholar]
Bowman, G.R.; Pande, V.S.; Noé, F. (Eds.) Advances in Experimental Medicine and Biology. In An Introduction to Markov State Models and Their Application to Long Timescale Molecular Simulation; Springer: Dordrecht, The Netherlands, 2014; Volume 797. [Google Scholar]
Noé, F.; Nüske, F. A variational approach to modeling slow processes in stochastic dynamical systems. Multiscale Model. Simul. 2013, 11, 635–655. [Google Scholar]
Nüske, F.; Keller, B.G.; Pérez-Hernández, G.; Mey, A.; Noé, F. Variational approach to molecular kinetics. J. Chem. Theory Comput. 2014, 10, 1739–1752. [Google Scholar]
Schütte, C.; Sarich, M. Metastability and Markov State Models in Molecular Dynamics: Modeling, Analysis, Algorithmic Approaches; Courant Lecture Notes; American Mathematical Society/Courant Institute of Mathematical Science: New York, NY, USA, 2014. [Google Scholar]
Schütte, C.; Noé, F.; Lu, J.; Sarich, M.; Vanden-Eijnden, E. Markov state models based on milestoning. J. Chem. Phys. 2011, 134, 204105. [Google Scholar]
Torrie, G.M.; Valleau, J.P. Nonphysical sampling distributions in Monte Carlo free-energy estimation: Umbrella sampling. J. Comput. Phys. 1977, 23, 187–199. [Google Scholar]
Kumar, S.; Rosenberg, J.M.; Bouzida, D.; Swendsen, R.H.; Kollman, P.A. THE weighted histogram analysis method for free-energy calculations on biomolecules. I. The method. J. Comput. Chem. 1992, 13, 1011–1021. [Google Scholar]
Laio, A.; Parrinello, M. Escaping free-energy minima. Proc. Natl. Acad. Sci. USA 2002, 99, 12562–12566. [Google Scholar]
Laio, A.; Gervasio, F.L. Metadynamics: A method to simulate rare events and reconstruct the free energy in biophysics, chemistry and material science. Rep. Prog. Phys. 2008, 71, 126601. [Google Scholar]
Ciccotti, G.; Kapral, R.; Vanden-Eijnden, E. Blue moon sampling, vectorial eeaction coordinates, and unbiased constrained dynamics. ChemPhysChem 2005, 6, 1809–1814. [Google Scholar]
Darve, E.; Rodríguez-Gömez, D.; Pohorille, A. Adaptive biasing force method for scalar and vector free energy calculations. J. Chem. Phys. 2008, 128, 144120. [Google Scholar]
Maragliano, L.; Vanden-Eijnden, E. A temperature accelerated method for sampling free energy and determining reaction pathways in rare events simulations. Chem. Phys. Lett. 2006, 426, 168–175. [Google Scholar]
Faradjian, A.K.; Elber, R. Computing time scales from reaction coordinates by milestoning. J. Chem. Phys. 2004, 120, 10880–10889. [Google Scholar]
Moroni, D.; van Erp, T.; Bolhuis, P. Investigating rare events by transition interface sampling. Physica A 2004, 340, 395–401. [Google Scholar]
Becker, N.B.; Allen, R.J.; ten Wolde, P.R. Non-stationary forward flux sampling. J. Chem. Phys. 2012, 136, 174118. [Google Scholar]
Legoll, F.; Lelièvre, T. Effective dynamics using conditional expectations. Nonlinearity 2010, 23, 2131–2163. [Google Scholar]
Froyland, G.; Gottwald, G.A.; Hammerlindl, A. A computational method to extract macroscopic variables and their dynamics in multiscale systems. SIAM J. Appl. Dyn. Syst. 2014, 13, 1816–1846. [Google Scholar]
Zhang, W.; Hartmann, C.; Schutte, C. Effective dynamics along given reaction coordinates, and reaction rate theory. Faraday Discuss. 2016, 195, 365–394. [Google Scholar]
Kevrekidis, I.G.; Gear, C.W.; Hummer, G. Equation-free: The computer-aided analysis of complex multiscale systems. AIChE J. 2004, 50, 1346–1355. [Google Scholar]
Kevrekidis, I.G.; Samaey, G. Equation-free multiscale computation: Algorithms and applications. Annu. Rev. Phys. Chem. 2009, 60, 321–344. [Google Scholar]
Kevrekidis, I.G.; Gear, C.W.; Hyman, J.M.; Kevrekidid, P.G.; Runborg, O.; Theodoropoulos, C. Equation-free, coarse-grained multiscale computation: Enabling mocroscopic simulators to perform system-level analysis. Commun. Math. Sci. 2003, 1, 715–762. [Google Scholar]
Prinz, J.H.; Wu, H.; Sarich, M.; Keller, B.; Senne, M.; Held, M.; Chodera, J.D.; Schütte, C.; Noé, F. Markov models of molecular kinetics: Generation and validation. J. Chem. Phys. 2011, 134, 174105. [Google Scholar]
Djurdjevac, N.; Sarich, M.; Schütte, C. Estimating the eigenvalue error of Markov state models. Multiscale Model. Simul. 2012, 10, 61–81. [Google Scholar]
Sarich, M.; Noé, F.; Schütte, C. On the approximation quality of Markov state models. Multiscale Model. Simul. 2010, 8, 1154–1177. [Google Scholar]
Sarich, M.; Schütte, C. Approximating selected non-dominant timescales by Markov state models. Comm. Math. Sci. 2012, 10, 1001–1013. [Google Scholar]
Mattingly, J.C.; Stuart, A.M.; Higham, D.J. Ergodicity for SDEs and approximations: locally Lipschitz vector fields and degenerate noise. Stoch. Proc. Appl. 2002, 101, 185–232. [Google Scholar] [Green Version]
Schütte, C.; Huisinga, W.; Deuflhard, P. Transfer operator approach to conformational dynamics in biomolecular systems. In Ergodic Theory, Analysis, and Efficient Simulation of Dynamical Systems; Fiedler, B., Ed.; Springer: Berlin/Heidelberg, Germany, 2001; pp. 191–223. [Google Scholar]
Gyöngy, I. Mimicking the one-dimensional marginal distributions of processes having an Ito differential. Probab. Theory Relat. Fields 1986, 71, 501–516. [Google Scholar]
Ciccotti, G.; Lelièvre, T.; Vanden-Eijnden, E. Projection of diffusions on submanifolds: Application to mean force computation. Commun. Pure Appl. Math. 2008, 61, 371–408. [Google Scholar]
Lelièvre, T.; Rousset, M.; Stoltz, G. Langevin dynamics with constraints and computation of free energy differences. Math. Comput. 2012, 81, 2071–2125. [Google Scholar]
Hernandez, V.; Roman, J.E.; Vidal, V. SLEPc: A scalable and flexible toolkit for the solution of eigenvalue problems. ACM Trans. Math. Softw. 2005, 31, 351–362. [Google Scholar]
Hartmann, C.; Schütte, C.; Zhang, W. Model reduction algorithms for optimal control and importance sampling of diffusions. Nonlinearity 2016, 29, 2298–2326. [Google Scholar]
Rohrdanz, M.A.; Zheng, W.; Maggioni, M.; Clementi, C. Determination of reaction coordinates via locally scaled diffusion map. J. Chem. Phys. 2011, 134, 124116. [Google Scholar]
Bittracher, A.; Koltai, P.; Klus, S.; Banisch, R.; Dellnitz, M.; Schütte, C. Transition manifolds of complex metastable systems: Theory and data-driven computation of effective dynamics. J. Nonlinear Sci. 2017, submitted. [Google Scholar]

Figure 1. (a) Function

V_{1}

as a function of angle θ; (b) Potential V defined in (74) with parameter

ϵ = 0.05

.

Figure 1. (a) Function

V_{1}

as a function of angle θ; (b) Potential V defined in (74) with parameter

ϵ = 0.05

.

Figure 2. Eigenfunctions

φ_{i}^{D}

of operator

- L_{D}

corresponding to the first four eigenvalues in (77).

Figure 2. Eigenfunctions

φ_{i}^{D}

of operator

- L_{D}

corresponding to the first four eigenvalues in (77).

Figure 3. (a,b) Coefficients

\tilde{b}

and

\tilde{σ}

as given in (78). For each

z = - π + \frac{2 π j}{1000}

,

0 \leq j \leq 1000

, the coefficients

\tilde{b} (z)

,

\tilde{σ} (z)

are estimated by generating a trajectory of the constrained version of dynamics (73) using the projection scheme proposed in [34] with the time step size

2 \times 10^{- 5}

, and

3 \times 10^{6}

steps are simulated; (c) A typical sample trajectory of the effective dynamics for dynamics (73) with reaction coordinate function

ξ (x) = θ (x)

.

Figure 3. (a,b) Coefficients

\tilde{b}

and

\tilde{σ}

as given in (78). For each

z = - π + \frac{2 π j}{1000}

,

0 \leq j \leq 1000

, the coefficients

\tilde{b} (z)

,

\tilde{σ} (z)

are estimated by generating a trajectory of the constrained version of dynamics (73) using the projection scheme proposed in [34] with the time step size

2 \times 10^{- 5}

, and

3 \times 10^{6}

steps are simulated; (c) A typical sample trajectory of the effective dynamics for dynamics (73) with reaction coordinate function

ξ (x) = θ (x)

.

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, W.; Schütte, C. Reliable Approximation of Long Relaxation Timescales in Molecular Dynamics. Entropy 2017, 19, 367. https://doi.org/10.3390/e19070367

AMA Style

Zhang W, Schütte C. Reliable Approximation of Long Relaxation Timescales in Molecular Dynamics. Entropy. 2017; 19(7):367. https://doi.org/10.3390/e19070367

Chicago/Turabian Style

Zhang, Wei, and Christof Schütte. 2017. "Reliable Approximation of Long Relaxation Timescales in Molecular Dynamics" Entropy 19, no. 7: 367. https://doi.org/10.3390/e19070367

APA Style

Zhang, W., & Schütte, C. (2017). Reliable Approximation of Long Relaxation Timescales in Molecular Dynamics. Entropy, 19(7), 367. https://doi.org/10.3390/e19070367

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Reliable Approximation of Long Relaxation Timescales in Molecular Dynamics

Abstract

1. Introduction

2. Diffusion Process and the Associated Operators

3. Galerkin Approximation of the Eigenvalues of the Generator

3.1. Some General Results

3.2. Finite Dimensional Subspaces

3.3. Infinite Dimensional Subspace: Effective Dynamics

4. Variational Approach to Generator Eigenproblem

4.1. Variational Principle

4.2. Optimization Problem

5. Numerical Algorithms

5.1. Computing Coefficient Matrices Using Effective Dynamics

5.2. Algorithms for Simulating the Effective Dynamics

5.2.1. Algorithm 1

5.2.2. Algorithm 2

6. Illustrative Example

7. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI