Holonomic Constraints: A Case for Statistical Mechanics of Non-Hamiltonian Systems

Ciccotti, Giovanni; Ferrario, Mauro

doi:10.3390/computation6010011

Open AccessFeature PaperReview

Holonomic Constraints: A Case for Statistical Mechanics of Non-Hamiltonian Systems

by

Giovanni Ciccotti

^1,2,3

and

Mauro Ferrario

^4,*

¹

Institute for Applied Mathematics “Mauro Picone” (IAC), CNR, Via dei Taurini 19, 00185 Rome, Italy

²

University of Rome “La Sapienza”, P.le Aldo Moro 5, 00185 Rome, Italy

³

University College Dublin (UCD), Belfield, Dublin 4, Ireland

⁴

Dipartimento di Scienze Fisiche, Informatiche e Matematiche, University of Modena and Reggio Emilia, Via Campi 213/A , 41125 Modena, Italy

^*

Author to whom correspondence should be addressed.

Computation 2018, 6(1), 11; https://doi.org/10.3390/computation6010011

Submission received: 8 January 2018 / Revised: 25 January 2018 / Accepted: 25 January 2018 / Published: 1 February 2018

(This article belongs to the Special Issue Computation in Molecular Modeling)

Download Versions Notes

Abstract

:

A dynamical system submitted to holonomic constraints is Hamiltonian only if considered in the reduced phase space of its generalized coordinates and momenta, which need to be defined ad hoc in each particular case. However, specially in molecular simulations, where the number of degrees of freedom is exceedingly high, the representation in generalized coordinates is completely unsuitable, although conceptually unavoidable, to provide a rigorous description of its evolution and statistical properties. In this paper, we first review the state of the art of the numerical approach that defines the way to conserve exactly the constraint conditions (by an algorithm universally known as SHAKE) and permits integrating the equations of motion directly in the phase space of the natural Cartesian coordinates and momenta of the system. We then discuss in detail SHAKE numerical implementations in the notable cases of Verlet and velocity-Verlet algorithms. After discussing in the same framework how constraints modify the properties of the equilibrium ensemble, we show how, at the price of moving to a dynamical system no more (directly) Hamiltonian, it is possible to provide a direct interpretation of the dynamical system and so derive its Statistical Mechanics both at equilibrium and in non-equilibrium conditions. To achieve that, we generalize the statistical treatment to systems no longer conserving the phase space volume (equivalently, we introduce a non-Euclidean invariant measure in phase space) and derive a generalized Liouville equation describing the ensemble even out of equilibrium. As a result, we can extend the response theory of Kubo (linear and nonlinear) to systems subjected to constraints.

Keywords:

holonomic constraints; non-Hamiltonian dynamics; SHAKE

1. Introduction

The dynamical and statistical behavior of a mechanical system of many degrees of freedom subjected to holonomic constraints presents specific features that seem worth presenting and discussing in a unified framework. A mechanical Hamiltonian system is a system whose evolution is derivable from a standard Hamiltonian

\begin{matrix} H (r, p) = K (p) + V (r), \end{matrix}

(1)

where

\begin{matrix} K (p) = \sum_{i = 1}^{N} p_{i}^{2} / 2 m_{i} \end{matrix}

(2)

is the kinetic energy expressed in Cartesian coordinates as a Euclidean quadratic form of the momenta

\begin{matrix} p : = {p_{i} = m_{i} {\dot{r}}_{i}, i = 1, \dots, N}, \end{matrix}

(3)

and

V = V (r)

is a function of the

3 N

Cartesian coordinates

\begin{matrix} r : = {r_{i}, i = 1, \dots, N} . \end{matrix}

(4)

N is the number of point particles in the system, and we have put ourselves in dimension 3. The space of the coordinates is called configuration space, while the phase space

{r, p}

gives the space of the mechanical states of the system. To say that the system is subjected to f holonomic constraints is equivalent to saying that the motion has to evolve on a

(3 N - f)

-dimensional configuration space, which results from imposing f geometrical conditions

σ_{α} (r) = 0, α = 1, \dots, f

at all times. These constraints can connect all the coordinates of the configuration space, in which case we call them global (Blue Moon [1,2]), or connect disjoint subgroups of the coordinates, which is, for example, the way in which molecular systems can sometimes be described [3], or else can be the conditions for orthonormality of single electron orbitals, as in the Car–Parrinello approach to ab initio molecular dynamics [4].

Global constraints can be used to bring the system in situations normally difficult to visit. In these cases, the constraints can act as a kind of Maxwell daemon. In the solution of the classical Statistical Mechanics of dynamical systems, the constraints confront two major problems. As we have seen before, the constraints are an essential ingredient in the definition of the dynamical system, therefore any acceptable algorithm introduced to solve the dynamics of such a system cannot propagate any error, as otherwise the statistical behavior of the ensemble in the presence of the constraints cannot be properly formulated. This last problem in principle is automatically solved for Hamiltonian systems by using generalized coordinates. However, especially for systems with many degrees of freedom, generalized coordinates are completely intractable, and one should be able to formulate properly the problem by using Cartesian coordinates in a standard way. As we will see, a family of algorithms avoiding the propagation of the errors have been introduced [3,5], while the proper formulation of the statistical ensemble is straightforward for the equilibrium case [6] but requires some more work for non-equilibrium, where the missing ingredient is the correct Liouville equation to use [7]. To get the proper Liouville equation one has to abandon the traditional Hamiltonian description, in which one had the constraint forces by using a Lagrange multiplier, and go straight to the non-Hamiltonian behavior of the equations of motion of the system in which the constraint forces are explicitly (analytically) solved. For this non-Hamiltonian equations the Euclidean nature of phase space is no more an invariant, therefore one has to find an invariant, non-Euclidean, measure to be associated with the phase space so that the statistical behavior of the system can be properly described by generalizing the Liouville equation [7]. As we will see, the non-equilibrium response of our constrained system to external perturbations can be derived directly [8], while the best known results of linear response theory, the so-called fluctuation-dissipation theorem, can be derived but requires some extra work [9]. The advised reader should be warned that in this review we have excluded the treatment of non-holonomic constraints, a very large family difficult to unify and in any event requiring special treatments [10].

In Section 2, we summarize the formalism needed to describe a system with constraints. In Section 3, we write down the general (SHAKE) equations to be solved to derive whatever family of numerical algorithms and briefly describe the two best known formulations: the one, adopted with the Verlet algorithm, usually referred to as Shake [11], not to be confused with the reference to the general equation to be solved, and the one modified to work with the velocity-Verlet algorithm, usually referred to as Rattle [12]. To these two, we will briefly add a more recent alternative devised to give a parallel implementation of SHAKE [13]. In Section 4, we derive the equilibrium ensemble of Hamiltonian systems subjected to holonomic constraints [6]. Section 5 presents an effective approach to compute conditional averages by the use of holonomic constraints (Blue Moon) [1,2,14,15]. The possibility to compute conditional averages can be used in conjunction with non-equilibrium molecular dynamics techniques [16,17,18,19,20] to compute rate constants, hydrodynamical phenomena and, in general, relaxations from large fluctuations statistically produced by introducing suitable constraints. In Section 6, we formulate the non-Hamiltonian equations of motion for a constrained system, we derive from them an invariant measure for the phase space, the correct generalized Liouville equation and, again, as a way to see how all that works, the expressions for the equilibrium ensembles. Then, we start from the generalized Liouville equation to give a rigorous expression for the response to external perturbations of a constrained system [8] and we prove with some rigorous arguments that also the classical results of linear response theory can be shown to hold [9]. The paper is concluded by a short outlook in which we try to assess the state of the art in the treatment of the computational classical Statistical Mechanics for systems subjected to holonomic constraints.

2. Dynamics with Holonomic Constraints

Given the Lagrangian

L (r, \dot{r}) = K (\dot{r}) - V (r)

of a dynamical system with N particles in dimension 3, subjected to f holonomic constraints

\begin{matrix} σ_{α} (r) = 0, α = 1, \dots, f, \end{matrix}

(5)

the equations of motion (Lagrange equation of I type) are

\begin{matrix} \frac{d}{d t} \frac{\partial L}{\partial {\dot{r}}_{i}} - \frac{\partial L}{\partial r_{i}} = m_{i} {\ddot{r}}_{i} + \frac{\partial V (r)}{\partial r_{i}} = - \sum_{α = 1}^{f} λ_{α} (t) \frac{\partial σ_{α} (r)}{\partial r_{i}}, i = 1, \dots, N, \end{matrix}

(6)

where the

λ_{α} (t), α = 1, \dots, f

are the unknown Lagrangian multipliers to be determined by imposing that

\begin{matrix} σ_{α} (r (t)) = 0, \forall t, α = 1, \dots, f . \end{matrix}

(7)

Note that the multipliers

λ_{α} (t)

together with all their derivatives of any order can be determined by taking successive time derivatives of the expressions

σ_{α} (r (t)) = 0

. In particular,

\begin{matrix} {\dot{σ}}_{α} (t) = {\dot{σ}}_{α} (r (t), \dot{r} (t)) = \sum_{i = 1}^{N} ({\dot{r}}_{i} \cdot \frac{\partial}{\partial r_{i}}) σ_{α} (r) = 0, α = 1, \dots, f \end{matrix}

(8)

are evident conditions to be satisfied since holonomic constraints do not perform any mechanical work and therefore permitted velocities and constrained forces have to be orthogonal. Moreover,

\begin{matrix} {\ddot{σ}}_{α} (t) = [\sum_{i = 1}^{N} ({\ddot{r}}_{i} \cdot \frac{\partial}{\partial r_{i}}) + \sum_{i = 1}^{N} \sum_{j = 1}^{N} ({\dot{r}}_{i} \cdot \frac{\partial}{\partial r_{i}}) ({\dot{r}}_{j} \cdot \frac{\partial}{\partial r_{j}})] σ_{α} (r) = 0, α = 1, \dots, f . \end{matrix}

(9)

Substituting in Equation (9) the equations of motion, Equation (6), and solving the resulting linear system for the

λ

s, we get

\begin{matrix} λ_{α} = \sum_{β = 1}^{f} [- \frac{1}{m_{i}} \frac{\partial V}{\partial r_{i}} \cdot (\frac{\partial}{\partial r_{i}}) σ_{β} + \sum_{i = 1}^{N} \sum_{j = 1}^{N} ({\dot{r}}_{i} \cdot \frac{\partial}{\partial r_{i}}) ({\dot{r}}_{j} \cdot \frac{\partial}{\partial r_{j}}) σ_{β}] Z_{β, α}^{- 1}, α = 1, \dots, f, \end{matrix}

(10)

where

\begin{matrix} Z_{α, β} (r) = \sum_{i = 1}^{N} \frac{1}{m_{i}} (\frac{\partial σ_{α}}{\partial r_{i}}) \cdot (\frac{\partial σ_{β}}{\partial r_{i}}), α = 1, \dots, f, β = 1, \dots, f . \end{matrix}

(11)

The expressions resulting from introducing Equation (10) in Equation (6) could possibly provide fully explicit, no longer Hamiltonian, dynamics. For the moment, we will not be interested in such formulation because any approximate algorithm that make use of Equation (10) will necessarily propagate the errors in the constraint relations, Equation (5), with dramatic consequences on the stability of the model. Further derivatives with respect to time of the

σ

s will provide linear relationships for the higher order derivatives of the

λ

s, which could be needed in higher order algorithms.

By taking the standard Legendre transform on the Lagrangian

L

,

\begin{matrix} H = \sum_{i = 1}^{N} p_{i} \cdot {\dot{r}}_{i} - L, \end{matrix}

(12)

the same dynamics can be straightforwardly formulated in Hamiltonian terms,

\begin{matrix} {\dot{r}}_{i} & = \frac{p_{i}}{m_{i}}, \end{matrix}

(13)

\begin{matrix} {\dot{p}}_{i} & = - \frac{\partial V (r)}{\partial r_{i}} - \sum_{α = 1}^{f} λ_{α} (t) \frac{\partial σ_{α} (r)}{\partial r_{i}}, i = 1, \dots, N, \end{matrix}

(14)

involving the same treatment for the constraint forces. In the following, it will be useful for theoretical purposes to consider an equivalent representation of the Hamiltonian expressed in terms of

(i): the f constraint relationships $σ : = {σ_{α} (r), α = 1, \dots, f}$ , and
(ii): the remaining $(3 N - f)$ generalized coordinates $q : = {q_{ν} (r), ν = 1, \dots, 3 N - f}$ .

This change of coordinates is a point transformation of the configuration space and therefore generates a canonical transformation [10]. With this change of variables, the Lagrangian

L

of our system generates the Lagrangian in the new coordinates given by

\begin{matrix} L^{*} (q, \dot{q}, σ, \dot{σ}) = L (r (q, σ), \dot{r} (q, σ, \dot{q}, \dot{σ})) . \end{matrix}

(15)

Sometimes, it is useful to call collectively the variable

{q, σ} = {u}

. From the Lagrangian

L^{*}

, we get

\begin{matrix} p_{u} = \frac{\partial L^{*}}{\partial \dot{u}} (p_{q} = \frac{\partial L^{*}}{\partial \dot{q}}, p_{σ} = \frac{\partial L^{*}}{\partial \dot{σ}}) \end{matrix}

(16)

and

\begin{matrix} H^{*} (u, p_{u}) = K (p (u, p_{u})) + V (r (u)) = K^{*} (u, p_{u}) + V^{*} (u), \end{matrix}

(17)

where the kinetic term is now

\begin{matrix} K^{*} = \frac{1}{2} p_{u}^{T} \cdot M^{- 1} (u) \cdot p_{u} \end{matrix}

(18)

and

\begin{matrix} {(M (u))}_{α, β} = \sum_{i = 1}^{N} m_{i} \frac{\partial r_{i}}{\partial u_{α}} \cdot \frac{\partial r_{i}}{\partial u_{β}}, α = 1, \dots, 3 N, β = 1, \dots, 3 N \end{matrix}

(19)

is the metric matrix associated with the new variables. Note that it is almost immediate to find for the inverse matrix

M^{- 1}

the explicit expression

\begin{matrix} {(M^{- 1} (r))}_{α, β} = \sum_{i = 1}^{N} \frac{1}{m_{i}} \frac{\partial u_{α}}{\partial r_{i}} \cdot \frac{\partial u_{β}}{\partial r_{i}}, α = 1, \dots, 3 N, β = 1, \dots, 3 N . \end{matrix}

(20)

There is an intimate connection between the matrix

M

, the Jacobian matrix

J = \frac{\partial r}{\partial u}

and the Jacobian determinant

J = \frac{\partial (u)}{\partial (r)} = | J |

of the

{r} ⟶ {u}

point transformation. Introducing the mass tensor

{(μ)}_{i, k} = m_{i} δ_{i, k}

for

i, k = 1, \dots, 3 N

by means of the Kronecker delta, one can rewrite

M

in Equation (19) and, of course, its determinant

| M |

, as

\begin{matrix} M = {(\frac{\partial r}{\partial u})}^{T} \cdot μ \cdot (\frac{\partial r}{\partial u}) = J^{T} \cdot μ \cdot J ⟹ |M| = | J | | μ | | J | = | μ | J^{2} . \end{matrix}

(21)

In this representation, the constrained motion is generated by the Lagrangian

L^{*} (q, \dot{q}, σ = 0, \dot{σ} = 0) = L_{c} (q, \dot{q})

in the

(3 N - f)

-dimensional space

\begin{matrix} \frac{\partial L_{c}}{\partial q_{ν}} - \frac{d}{d t} \frac{\partial L_{c}}{\partial {\dot{q}}_{ν}} = 0, ν = 1, \dots, 3 N - f . \end{matrix}

(22)

Note, however, that these equations are no longer in normal form.

The Hamiltonian formulation helps us to get back to an evolution expressed in normal form. Given that

\begin{matrix} p_{u} = \frac{\partial L^{*}}{\partial \dot{u}} = M (u) \cdot \dot{u} i . e ., \dot{u} = M^{- 1} (u) \cdot p_{u}, \end{matrix}

(23)

we have

\begin{matrix} \dot{u} & = \frac{\partial H^{*}}{\partial p_{u}} = M^{- 1} (u) \cdot p_{u}, \end{matrix}

(24)

\begin{matrix} {\dot{p}}_{u} & = - \frac{\partial H^{*}}{\partial u} . \end{matrix}

(25)

To proceed, it is useful to write the matrices

M

and

M^{- 1}

in block form

\begin{matrix} M = (\begin{matrix} A & B \\ B^{T} & Γ \end{matrix}), M^{- 1} = (\begin{matrix} Δ & E \\ E^{T} & Z \end{matrix}), \end{matrix}

(26)

where

\begin{matrix} A_{ν, η} (q, σ) & = \sum_{i = 1}^{N} m_{i} \frac{\partial r_{i}}{\partial q_{ν}} \cdot \frac{\partial r_{i}}{\partial q_{η}}, ν = 1, \dots, 3 N - f, η = 1, \dots, 3 N - f, \end{matrix}

(27)

\begin{matrix} B_{ν, α} (q, σ) & = \sum_{i = 1}^{N} m_{i} \frac{\partial r_{i}}{\partial q_{ν}} \cdot \frac{\partial r_{i}}{\partial σ_{α}}, ν = 1, \dots, 3 N - f, α = 1, \dots, f, \end{matrix}

(28)

\begin{matrix} Γ_{α, β} (q, σ) & = \sum_{i = 1}^{N} m_{i} \frac{\partial r_{i}}{\partial σ_{α}} \cdot \frac{\partial r_{i}}{\partial σ_{β}}, α = 1, \dots, f, β = 1, \dots, f, \end{matrix}

(29)

\begin{matrix} Δ_{ν, η} (r) & = \sum_{i = 1}^{N} \frac{1}{m_{i}} \frac{\partial q_{ν}}{\partial r_{i}} \cdot \frac{\partial q_{η}}{\partial r_{i}}, ν = 1, \dots, 3 N - f, η = 1, \dots, 3 N - f, \end{matrix}

(30)

\begin{matrix} E_{ν, α} (r) & = \sum_{i = 1}^{N} \frac{1}{m_{i}} \frac{\partial q_{ν}}{\partial r_{i}} \cdot \frac{\partial σ_{α}}{\partial r_{i}}, ν = 1, \dots, 3 N - f, α = 1, \dots, f, \end{matrix}

(31)

with

Z

already defined in Equation (11), to derive a number of results, which we will use in the following. In particular, note that the block matrices defined above are not independent from each other. A first set of useful relations can be derived by expanding the expressions for the identity

1 = M^{- 1} M = M M^{- 1}

\begin{matrix} M^{- 1} M = (\begin{matrix} Δ A + E B^{T} = 1 & Δ B + E Γ = 0 \\ E^{T} A + Z B^{T} = 0 & E^{T} B + Z Γ = 1 \end{matrix}) = (\begin{matrix} A Δ + B E^{T} = 1 & A E + B Z = 0 \\ B^{T} Δ + Γ E^{T} = 0 & B^{T} E + Γ Z = 1 \end{matrix}) = M M^{- 1} . \end{matrix}

(32)

Another useful relation is the one that, in a different language, is known as Fixman’s Theorem [21]. It relates the determinants of the matrices

A

and

Z

\begin{matrix} | A | = | M | | Z | . \end{matrix}

(33)

Equation (33) can be derived directly observing that (use Equation (32) )

\begin{matrix} (i) (\begin{matrix} A & 0 \\ B^{T} & 1 \end{matrix}) = M \cdot [M^{- 1} \cdot (\begin{matrix} A & 0 \\ B^{T} & 1 \end{matrix})] = M \cdot (\begin{matrix} 1 & E \\ 0 & Z \end{matrix}), \end{matrix}

(34)

and that

\begin{matrix} (i i) |(\begin{matrix} A & 0 \\ B^{T} & 1 \end{matrix})| = | A |, |(\begin{matrix} 1 & E \\ 0 & Z \end{matrix})| = | Z | . \end{matrix}

(35)

By putting Equation (33) together with Equation (21), we obtain for the determinant of the block matrix

A

, the interesting expression

\begin{matrix} {| A |}^{\frac{1}{2}} = {(| M | | Z |)}^{\frac{1}{2}} = J {| μ |}^{\frac{1}{2}} {| Z |}^{\frac{1}{2}} \end{matrix}

(36)

that will be useful later on.

Going back to the constraint relations, the conditions

\dot{σ} = 0

can now be written as

\begin{matrix} \dot{σ} = E^{T} p_{q} + Z p_{σ} = 0, \end{matrix}

(37)

giving the non-zero values of the conjugated momenta

{\tilde{p}}_{σ}

when the constraints are imposed,

\begin{matrix} {\tilde{p}}_{σ} = - Z^{- 1} (q, σ = 0) E^{T} (q, σ = 0) p_{q} = - {\tilde{Z}}^{- 1} {\tilde{E}}^{T} p_{q}, \end{matrix}

(38)

where

\tilde{Z}

and

\tilde{E}

are implicitly defined in Equation (38). In these conditions, the Hamiltonian of the constrained motion can be evaluated explicitly on the hypersurface

σ = 0, p_{σ} = {\tilde{p}}_{σ}

to obtain

\begin{matrix} H^{*} (q, σ = 0, p_{q}, p_{σ} = {\tilde{p}}_{σ}) & = K^{*} (q, σ = 0, p_{q}, p_{σ} = {\tilde{p}}_{σ}) + V^{*} (q, σ = 0) \end{matrix}

(39)

\begin{matrix} = \frac{1}{2} (p_{q}^{T} {\tilde{p}}_{σ}^{T}) (\begin{matrix} \tilde{Δ} & \tilde{E} \\ {\tilde{E}}^{T} & \tilde{Z} \end{matrix}) (\begin{matrix} p_{q} \\ {\tilde{p}}_{σ} \end{matrix}) + V_{c} (q) \end{matrix}

(40)

\begin{matrix} = \frac{1}{2} p_{q}^{T} (\tilde{Δ} - \tilde{E} {\tilde{Z}}^{- 1} {\tilde{E}}^{T}) p_{q} + V_{c} (q) \end{matrix}

(41)

\begin{matrix} = \frac{1}{2} p_{q}^{T} {\tilde{A}}^{- 1} (q) p_{q} + V_{c} (q) \equiv H_{c} (q, p_{q}), \end{matrix}

(42)

where we first used

{\tilde{p}}_{σ} = - {\tilde{Z}}^{- 1} {\tilde{E}}^{T} p_{q}

to go from Equation (40) to Equation (41) and, then, from Equation (41) to Equation (42), we have used the relations from Equation (32),

A Δ + B E^{T} = 1

and

A E + B Z = 0

. The Hamiltonian

H_{c}

generates the equations of motion in the

(3 N - f) -

dimensional space in normal form

\begin{matrix} \dot{q} & = \frac{\partial H_{c}}{\partial p_{q}}, \end{matrix}

(43)

\begin{matrix} {\dot{p}}_{q} & = - \frac{\partial H_{c}}{\partial q} . \end{matrix}

(44)

3. SHAKE, Integrating the Equations of Motion

The numerical integration of the equations of motion (6) requires discretizing the time and to provide a suitable algorithm of given precision. Generally, for evident reasons, one avoids the use of algorithms requiring more than the computation at each step of the forces

F_{i} = - \frac{\partial V}{\partial r_{i}}

avoiding successive derivatives, e.g.,

\dot{F_{i}} = - (\sum_{j} \dot{r_{j}} \cdot \frac{\partial}{\partial r_{j}}) (\frac{\partial V}{\partial r_{i}})

, etc. For illustrative purposes, we will limit ourselves to write down the integration of the Lagrangian equations of motion using the Verlet algorithm and of the Hamiltonian equations using the velocity-Verlet algorithm.

3.1. Verlet Algorithm

The celebrated (1967) Verlet algorithm is easily obtained by writing down and summing up the forward and backward Taylor expansions of each coordinate truncated to the fourth order. Calling x a generic variable in the configuration space set

r

(and

\partial_{x} = \frac{\partial}{\partial x}

), it reads

\begin{matrix} x (t + h) = - x (t - h) + 2 x (t) + h^{2} \ddot{x} (t) + O (h^{4}), \end{matrix}

(45)

where t is the running time and h is the integration step resulting from time discretization. The velocity with this algorithm is computed by subtracting the same forward and backward Taylor expansions. We get, with one timestep of delay,

\begin{matrix} \dot{x} (t) = \frac{x (t + h) - x (t - h)}{2 h} - h^{2} \frac{\overset{⃛}{x} (t)}{3} + O (h^{4}), \end{matrix}

(46)

where we have written explicitly the error of order

O (h^{2})

for further use. Notice that the velocities, which carry a larger error, do not enter in the computation of the trajectory, which remains precise to the order three, and, as it has been shown, has many other remarkable features that can be summarized by saying that this algorithm is simplectic [22,23]. In presence of holonomic constraints, the acceleration of any coordinate x can be decomposed in the two contributions,

F = - \partial_{x} V

coming from the interaction potential of the model and

G = - \sum_{α = 1}^{f} Λ_{α} (\partial_{x} σ_{α}) \equiv - Λ \cdot (\partial_{x} σ)

the constraint force, where

Λ

is a set of parameters to be determined, so that

\begin{matrix} \ddot{x} (t) = \frac{1}{m} [F (t) + G (t)] . \end{matrix}

(47)

Substituting Equation (47) in the algorithm Equation (45), we have

\begin{matrix} x (t + h) = \bar{x} (t + h) + \frac{h^{2}}{m} G (t) + O (h^{4}), \end{matrix}

(48)

where

\bar{x} (t + h)

, the provisional value of the coordinate at time

t + h

, is the position the coordinate would take in the absence of constraints. Now, and this is the essential conceptual content of the whole family of SHAKE algorithms, we determine the set of the

Λ

parameters by imposing and solving the set of algebraic equations

\begin{matrix} σ_{α} ({\bar{x} (t + h) - \frac{h^{2}}{m} Λ \cdot (\partial_{x} σ)}) = 0, α = 1, \dots, f . \end{matrix}

(49)

Since we have f

Λ

values and f constraint relationships, the system of algebraic, generally not linear, equations is well posed. The values of the

Λ

s solving these equations are in general different from the values of the

λ (t)

obtainable from Equations (10). However, the difference cannot be of greater order than the one involved in the algorithmic error. Therefore, the values of the coordinates at time

t + h

will entail an error equivalent to the one produced by the blind application of the Verlet algorithm. However, now, the constraint relationships will be satisfied exactly at every timestep and the dynamics of the system will not disrupt the model.

Many different ways have been proposed to solve the system of Equations (46), see e.g., [13,24,25,26,27]. The original, and still commonly used, goes back to Berendsen [3], who called it, again, Shake. It proceeds by satisfying one constraint at a time, iterating constraint relationship by constraint relationship until convergence. The technical details have been worked out, apart from the original paper, more pedagogically in [11]. Leimkuhler [28] has demonstrated that the resulting numerical procedure maintains the time reversal invariance and the simplectic character of the algorithm. In the referred to, original, implementation, the algorithm is inherently serial and cannot be easily parallelized. Practical parallelizations are either approximate or the algorithms are specifically tailored to the problems at hand. An interesting general parallel solution has been worked out by Weinbach and Elber [13]. They take advantage of the fact that the essential step in solving the SHAKE equation

\begin{matrix} σ_{α} (\bar{x} (t + h), Λ) = 0, α = 1, \dots, f \end{matrix}

(50)

can be recast as the solution of a sparse linear problem of the type

A y = b

with

y

the vector of unknowns. Constructing a suitable positive definite matrix, they solve the SHAKE equation using (parallel) conjugate gradient minimization of the quadratic form

\frac{1}{2} Λ^{T} A Λ - Λ^{T} σ

in place of the standard iterative process (inherently serial).

3.2. Velocity-Verlet Algorithm

The Verlet algorithm can be easily recast in an algebraically equivalent form that, when applied in the correct order, produces both the positions and the velocities at the same time. Using the same symbols, let us first rewrite Equation (45) by replacing

- x (t - h)

with its value extracted from Equation (46)

\begin{matrix} - x (t - h) & = 2 h \dot{x} (t) - x (t + h) + \frac{2 h^{3}}{3} \overset{⃛}{x} (t + h) + O (h^{5}), \end{matrix}

(51)

where we have written explicitly for further use the expression for the error

O (h^{3})

, to be normally rejected, in order to obtain the first equation of the velocity-Verlet algorithm, which expresses the position x at time

(t + h)

\begin{matrix} x (t + h) & = x (t) + h \dot{x} (t) + \frac{h^{2}}{2} \ddot{x} (t) + h^{3} \frac{\overset{⃛}{x} (t)}{3} + O (h^{5}) \end{matrix}

(52)

with an error of order

O (h^{3})

, here retained in its explicit form. Next, we write the velocity

\dot{x} (t + h)

from Verlet (46) taken at time

(t + h)

and eliminate the position

x (t + 2 h)

using, again, Verlet (45) taken to go from time

(t + h)

to time

(t + 2 h)

,

\begin{matrix} \dot{x} (t + h) & = [x (t + 2 h) - x (t)] /(2 h) - \frac{h^{2}}{3} \overset{⃛}{x} (t + h) + O (h^{4}) \end{matrix}

(53)

\begin{matrix} = [2 x (t + h) - x (t) + h^{2} \ddot{x} (t + h) + O (h^{4}) - x (t)] /(2 h) - \frac{h^{2}}{3} \overset{⃛}{x} (t + h) + O (h^{4}) \\ = \frac{x (t + h) + (2 x (t) - x (t - h) + h^{2} \ddot{x} (t) + O (h^{4})) - x (t) + h^{2} \ddot{x} (t + h) + O (h^{4}) - x (t)}{2 h} \end{matrix}

(54)

\begin{matrix} - \frac{h^{2}}{3} \overset{⃛}{x} (t + h) + O (h^{4}), \end{matrix}

(55)

where we made again use of Verlet (45) to expand one of the two

x (t + h)

contributions. Regrouping terms and simplifying, we can write

\begin{matrix} \dot{x} (t + h) & = \frac{x (t + h) - x (t - h)}{2 h} + \frac{h}{2} [\ddot{x} (t) + \ddot{x} (t + h)] + O (h^{3}) - \frac{h^{2}}{3} \overset{⃛}{x} (t + h), \end{matrix}

(56)

\begin{matrix} = \dot{x} (t) + \frac{h}{2} [\ddot{x} (t) + \ddot{x} (t + h)] + \frac{h^{2}}{3} [\overset{⃛}{x} (t) - \overset{⃛}{x} (t + h)] + O (h^{3}), \end{matrix}

(57)

where we got rid of the fraction

\begin{matrix} \frac{x (t + h) - x (t - h)}{2 h} = \dot{x} (t) + h^{2} \frac{\overset{⃛}{x} (t)}{3} + O (h^{4}), \end{matrix}

(58)

using Verlet (46). Finally, by expanding with Taylor the third derivative term at time

(t + h)

,

\overset{⃛}{x} (t + h) = \overset{⃛}{x} (t) + h \overset{⃜}{x} (t) + O (h^{2})

, we observe that the two terms

\propto h^{2}

cancel each other leaving a term in

h^{3}

. Finally, we arrive at the second equation of the velocity-Verlet algorithm

\begin{matrix} \dot{x} (t + h) & = \dot{x} (t) + \frac{h}{2} [\ddot{x} (t) + \ddot{x} (t + h)] + O (h^{3}), \end{matrix}

(59)

which expresses the velocity

\dot{x}

at time

(t + h)

, again with an error of order

O (h^{3})

.

Substituting Equation (47) into Equation (52), we have, analogously to the previous section,

\begin{matrix} x (t + h) = \tilde{x} (t + h) + \frac{h^{2}}{2 m} G^{'} (t) + O (h^{3}), \end{matrix}

(60)

where

\tilde{x} (t + h)

, the provisional value of the coordinate at time

t + h

, is the position that the coordinate would take in the absence of constraints using Equation (52), and

G^{'} = - Λ^{'} \cdot (\partial_{x} σ)

is the constraint force and the parameters

Λ^{'}

are determined by imposing and solving the set of equations

\begin{matrix} σ_{α} ({\tilde{x} (t + h) - \frac{h^{2}}{2 m} Λ^{'} \cdot (\partial_{x} σ)}) = 0, α = 1, \dots, f . \end{matrix}

(61)

Again, we have an algebraic system with f constraint relationships

σ

and f unknown values

Λ^{'}

. The problem is well posed and the solution can be retrieved exactly along the same lines as before, by using the iterative Shake algorithm. Of course, we will have a different set of parameters,

Λ^{'} \neq λ

, but the new positions

x (t + h)

will satisfy the constraints (5) exactly at the time

t + h

. To calculate the new velocities, the above procedure must be repeated by substituting Equation (47) in Equation (59), now at time

t + h

,

\begin{matrix} \dot{x} (t + h) = \tilde{\dot{x}} (t + h) + \frac{h}{2 m} G^{″} (t + h) + O (h^{3}), \end{matrix}

(62)

where

G^{″} = - Λ^{″} \cdot (\partial_{x} σ)

and

\begin{matrix} \tilde{\dot{x}} (t + h) = \dot{x} (t) + \frac{h}{2} [F (t) + F (t + h)] + \frac{h}{2 m} G^{'} (t), \end{matrix}

(63)

the provisional velocity at time

t + h

, i.e., the value the velocity would take in the absence of constraints at time

(t + h)

. Note that, at this stage, the

Λ^{'}

and, therefore, the constraint force

G^{'} (t)

at time t are already computed and therefore included in

\tilde{\dot{x}} (t + h)

, while one needs to determine the yet unknown parameters

Λ^{″}

by imposing and solving the set of equations

\begin{matrix} {\dot{σ}}_{α} ({x (t + h), \tilde{\dot{x}} (t + h) - \frac{h}{2 m} Λ^{″} \cdot (\partial_{x} σ) (t + h)}) = 0, α = 1, \dots, f . \end{matrix}

(64)

Once more, we have an algebraic system with f constraint relationships,

\dot{σ}

, and f unknown values,

Λ^{″}

. The problem is well posed and the solution can be retrieved by an iterative Shake-like procedure, i.e., proceeding by satisfying one constraint relation at a time. The whole procedure of imposing constraints within the velocity-Verlet scheme is known by a different name, the Rattle algorithm [12], although it is indeed nothing else than the same SHAKE procedure applied twice, once for positions and once for velocities, to two different sets of equations. The main difference with the original Shake algorithm [3] lies in the fact that the velocities calculated using Equation (62) are at each time exactly tangent to the constraint hypersurface

σ = 0

, while the velocities calculated, usually, simply using Equation (46) in the original Shake algorithm are tangent only within the algorithm accuracy (

O (h^{2})

). This extra precision does not come for free, but at the cost of doubling the effort in calculating the unknown

Λ

parameters. As long as the velocities in the Verlet algorithm do not enter directly into the numerical integration of the positions, such difference can be safely ignored; however, if desired, nothing would impede applying Shake in the same spirit, and with similar costs, to correct the velocities from Equation (46).

4. Equilibrium Statistical Mechanics in the Hamiltonian Formulation

The expression of the statistical equilibrium ensemble in Cartesian coordinates of a system subjected to holonomic constraints is not smooth but singular since the probability density defined in a

6 N

-dimensional phase space is associated with a mechanical system whose motion takes place in a

(6 N - 2 f)

-dimensional subspace, i.e., the intersection of the

2 f

hypersurfaces

σ (r) = 0

and

\dot{σ} (r, p) = 0

.

On the contrary, it is immediate to write down the (microcanonical) probability density in the reduced phase space of the

2 (3 N - f)

generalized coordinates

q, p_{q}

using the Hamiltonian

H_{c}

in Equation (42).

Let

\hat{O} (r, p)

be a dynamical variable defined using Cartesian coordinates and

{\hat{O}}_{c} (q, p_{q}) = \hat{O} (r (q, σ = 0), p (q, σ = 0, p_{q}, p_{σ} = {\tilde{p}}_{σ}))

the equivalent variable expressed using generalized coordinates, restricted to the constrained hypersurface. The familiar microcanonical average in generalized coordinates reads

\begin{matrix} {〈 \hat{O} 〉}_{N V E} = \frac{1}{N! Ω (N, V, E)} \int d q d p_{q} {\hat{O}}_{c} (q, p_{q}) δ (H_{c} (q, p_{q}) - E), \end{matrix}

(65)

where

\begin{matrix} Ω (N, V, E) = \frac{1}{N!} \int d q d p_{q} δ (H_{c} (q, p_{q}) - E) . \end{matrix}

(66)

We will now transform it into the equivalent integral in Cartesian coordinates by making use of the canonical transformation that connect the “generalized” phase space variables

(u, p_{u})

introduced in Section 2 to the “Cartesian” phase space variables

(r, p)

. We first remark that, on the

(6 N - 2 f)

phase space hypersurface, one has

\begin{matrix} d q d p_{q} = d q d σ δ (σ) d p_{q} d p_{σ} δ (p_{σ} - {\tilde{p}}_{σ}) = d u d p_{u} δ (σ) δ (p_{σ} - {\tilde{p}}_{σ}), \end{matrix}

(67)

where, for the product of the first f delta functions, we have used the shortcut notation

δ (σ) = \prod_{α} δ (σ_{α})

, and, equivalently, for the last term

δ (p_{σ} - {\tilde{p}}_{σ}) = \prod_{α} δ ({(p_{σ} - {\tilde{p}}_{σ})}_{α})

.

A more convenient expression for this product of delta functions can be derived by nothing that by multiplying Equation (37) by

Z^{- 1}

, one obtains:

\begin{matrix} Z^{- 1} \dot{σ} = p_{σ} + Z^{- 1} E^{T} p_{q} = p_{σ} - {\tilde{p}}_{σ} . \end{matrix}

(68)

Finally, using the facts that the Jacobian associated with a canonical transformation generated by a point transformation in the coordinates preserves the phase space volume, i.e.,

\frac{\partial (u, p_{u})}{\partial (r, p)} = 1

, and Equation (42), we can write for the microcanonical average (65)

\begin{matrix} {〈 \hat{O} 〉}_{N V E} & = \frac{1}{N! Ω (N, V, E)} \int d u d p_{u} {\hat{O}}^{*} (u, p_{u}) δ [H^{*} (u, p_{u}) - E] δ (σ) δ (p_{σ} - {\tilde{p}}_{σ}) \end{matrix}

(69)

\begin{matrix} = \frac{1}{N! Ω (N, V, E)} \int d r d p \hat{O} (r, p) δ [H (r, p) - E] δ (σ (r)) δ (Z^{- 1} \dot{σ} (r, p)), \end{matrix}

(70)

where now,

\begin{matrix} Ω (N, V, E) & = \frac{1}{N!} \int d r d p δ [H (r, p) - E] δ (σ (r)) δ (Z^{- 1} \dot{σ} (r, p)) \\ = \frac{1}{N!} \int d r d p | Z (r) | δ (\dot{σ} (r, p)) δ (σ (r)) δ [H (r, p) - E], \end{matrix}

(71)

with

| Z |

the modulus of the determinant of the matrix

Z

. From Equation (69), it follows directly the expression for the probability density in the microcanonical equilibrium ensemble in Cartesian coordinates

\begin{matrix} P^{(m i c r o)} (r, p) = \frac{δ [H (r, p) - E] | Z (r) | δ (\dot{σ} (r, p)) δ (σ (r))}{N! Ω (N, V, E)} \end{matrix}

(72)

and similar for other equilibrium ensembles. In particular for the Canonical ensemble, where classically momenta and coordinates are explicitly independent, the probability density will result in being

\begin{matrix} P (r, p) = \frac{\exp \{- β H (r, p)\} | Z (r) | δ (\dot{σ} (r, p)) δ (σ (r))}{\int d r d p \exp \{- β H (r, p)\} | Z (r) | δ (\dot{σ} (r, p)) δ (σ (r))} . \end{matrix}

(73)

For theoretical purposes, as we will see in the following, it is very useful to write the ensemble in terms of the marginal configurational probability density

P_{M} (r)

and the conditional probability density of the momenta

P_{C} (p | r)

:

\begin{matrix} P (r, p) = P_{M} (r) P_{C} (p | r), \end{matrix}

(74)

where

\begin{matrix} P_{M} (r) d r & = (\int d p P (r, p)) d r = \frac{\int d p e^{- β H (r, p)} δ (σ (r)) δ (Z^{- 1} \dot{σ} (r, p))}{\int d r d p e^{- β H (r, p)} δ (σ (r)) δ (Z^{- 1} \dot{σ} (r, p))} d r . \end{matrix}

(75)

Explicitly,

P_{M}

can be computed by first integrating out the delta functions for

\dot{σ}

in Equation (75), which amounts, after a change of variables, to the substitutions

p_{σ} = {\tilde{p}}_{σ}

, and then by executing the Gaussian integrals involved in the canonical ensemble. Assuming that the standard result of the integration of a unidimensional Gaussian integral is known,

\begin{matrix} \int_{- \infty}^{+ \infty} d x e^{- \frac{1}{2} x^{2} / a} = \sqrt{2 π a}, \end{matrix}

(76)

and, for a multidimensional Gaussian integral, diagonalizing the quadratic form and using the invariance of the determinant under unitary transformations, one derives immediately for the integral that, in n dimensions,

\begin{matrix} \int \dots \int d x e^{- \frac{1}{2} (x^{T} \cdot A^{- 1} \cdot x)} = {(2 π)}^{\frac{n}{2}} {|A|}^{\frac{1}{2}} . \end{matrix}

(77)

Focusing on the numerator in Equation (75), making a change of variables in the integral by using

d p = J^{- 1} d p_{u} = J^{- 1} d p_{q} d p_{σ}

, one has

\begin{matrix} e^{- β V} δ (σ) d r J^{- 1} \int d p_{q} d p_{σ} e^{- \frac{β}{2} [p_{u}^{T} \cdot M^{- 1} \cdot p_{u}]} δ (p_{σ} - {\tilde{p}}_{σ}) & = e^{- β V} δ (σ) d r J^{- 1} \int d p_{q} e^{- \frac{β}{2} [p_{q}^{T} \cdot A^{- 1} \cdot p_{q}]} \end{matrix}

(78)

\begin{matrix} = e^{- β V} δ (σ) d r {(\frac{2 π}{β})}^{\frac{3 N - f}{2}} J^{- 1} {| A |}^{\frac{1}{2}} & = {(\frac{2 π}{β})}^{\frac{3 N - f}{2}} {| μ |}^{\frac{1}{2}} e^{- β V} δ (σ) {| Z |}^{\frac{1}{2}} d r . \end{matrix}

(79)

To obtain Equation (79), we have proceeded as follows. First, we substitute the kinetic Hamiltonian term in the exponential with its “unconstrained” expression in Equation (40) and integrate over

d p_{σ}

using Equation (42). Now, we perform the remaining multidimensional Gaussian integral over the remaining momenta

p_{q}

by using Equation (77) and use Equation (36) to arrive at the result in Equation (79). Following the same procedure for the denominator and simplifying the constants in Equation (79), we finally gets for the normalized marginal probability density

\begin{matrix} P_{M} (r) & = \frac{e^{- β V (r)} δ (σ) {|Z|}^{\frac{1}{2}}}{\int d r e^{- β V (r)} δ (σ) {|Z|}^{\frac{1}{2}}} . \end{matrix}

(80)

Equation (80) tells us that the marginal probability density in configuration space, in the presence of constraints, is not simply

\propto \exp (- β V) δ (σ)

but contains the biasing term

{| Z (r) |}^{\frac{1}{2}}

coming from the limitations in momentum space induced by the constraints.

The conditional probability density in momentum space is given by

\begin{matrix} P_{C} (p | r) & = \frac{P (r, p)}{P_{M} (r)} = \frac{e^{- \frac{β}{2} [p^{T} \cdot μ^{- 1} \cdot p] - β V} δ (σ) δ (Z^{- 1} \dot{σ} (r, p))}{\int d p e^{- \frac{β}{2} [p^{T} \cdot μ^{- 1} \cdot p] - β V} δ (σ) δ (Z^{- 1} \dot{σ} (r, p))} \end{matrix}

(81)

\begin{matrix} = \frac{e^{- \frac{β}{2} [p^{T} \cdot μ^{- 1} \cdot p]} δ (Z^{- 1} \dot{σ} (r, p))}{{(2 π /β)}^{\frac{3 N - f}{2}} {| μ |}^{\frac{1}{2}} {| Z |}^{\frac{1}{2}}} \end{matrix}

(82)

\begin{matrix} = {(2 π /β)}^{- \frac{3 N - f}{2}} {| μ |}^{- \frac{1}{2}} e^{- \frac{β}{2} [p^{T} \cdot μ^{- 1} \cdot p]} {| Z |}^{\frac{1}{2}} δ (\dot{σ} (r, p)), \end{matrix}

(83)

where to get the first equality we referred to Equation (80); for the next step, the result implicit in Equation (79); and, finally, then Equation (83). The configuration dependent factor

{| Z (r) |}^{\frac{1}{2}}

in Equation (83) indicates that, when there are constraints, positions and momenta are no longer independent. In particular, the distribution of momenta becomes no more simply Maxwellian.

5. Rare Events and Blue Moon Ensemble

In the statistical mechanical treatment of macroscopic phenomena, one is interested in computing the properties of interest by identifying suitable observables, i.e., function of phase space,

\hat{ξ} (r, p)

, although here, and for a while, we will focus on observables depending only on the configuration space and obtaining their macroscopic counterpart by taking an ensemble average (to be definite, let us choose to work with the canonical ensemble) of it,

\begin{matrix} ξ = 〈 \hat{ξ} 〉 = \frac{1}{Q} \int d r d p e^{- β H (r, p)} \hat{ξ} (r), Q = \int d r d p e^{- β H (r, p)} . \end{matrix}

(84)

More generally, given one observable, it can be instructive to compute the marginal probability density associated with it in the ensemble

\begin{matrix} P_{ξ} (ξ^{'}) = \frac{1}{Q} \int d r d p e^{- β H (r, p)} δ (\hat{ξ} (r) - ξ^{'}) \equiv 〈δ (\hat{ξ} - ξ^{'})〉 . \end{matrix}

(85)

Macroscopically speaking, this probability density has a profound meaning since it can be associated, via the definition of the (Landau) free energy

\begin{matrix} W_{ξ} (ξ^{'}) = - k_{B} T \ln P_{ξ} (ξ^{'}) \end{matrix}

(86)

to the reversible work needed to bring the physical system from a reference state to the value

ξ = ξ^{'}

. This fact can be easily seen by taking the derivative with respect to

ξ^{'}

of Equation (86)

\begin{matrix} \frac{d W_{ξ} (ξ^{'})}{d ξ^{'}} & = - k_{B} T \frac{d}{d ξ^{'}} \ln P_{ξ} (ξ^{'}) - \frac{k_{B} T}{P_{ξ} (ξ^{'})} \frac{d}{d ξ^{'}} P_{ξ} (ξ^{'}) \end{matrix}

(87)

\begin{matrix} = - \frac{k_{B} T}{P_{ξ} (ξ^{'})} \int d r d p \frac{e^{- β H (r, p)}}{Q} \frac{d}{d ξ^{'}} δ (ξ (r) - ξ^{'}) . \end{matrix}

(88)

In the same spirit of Section 2, we introduce the canonical transformation from the Cartesian coordinates

{r, p}

to the generalized coordinates

{u, p_{u}}

, where

u = (q, ξ)

with the set

q

suitably chosen. Now, using that

\frac{d}{d ξ^{'}} = - (\frac{\partial}{\partial ξ})

, integrating by parts, we arrive at

\begin{matrix} \frac{d W_{ξ} (ξ^{'})}{d ξ^{'}} & = \frac{\int d u d p_{u} (\frac{\partial H^{*}}{\partial ξ}) e^{- β H^{*}} δ (ξ - ξ^{'})}{\int d u d p_{u} e^{- β H^{*}} δ (ξ - ξ^{'})} = \frac{〈 (\frac{\partial H^{*}}{\partial ξ}) δ (\hat{ξ} - ξ^{'}) 〉}{〈 δ (\hat{ξ} - ξ^{'}) 〉}, \end{matrix}

(89)

where

H^{*}

is the Hamiltonian expressed in the generalized

u

coordinates, see Equation (17). In generalized coordinates, the kinetic term

K^{*}

in the Hamiltonian (see Equation (18)) gives a non-zero contribution to the derivative, which is nothing but a geometrical correction that ultimately involves (see Appendix A) the Jacobian

J = \frac{\partial (u)}{\partial (r)}

of the coordinate transformation,

\begin{matrix} \frac{d W_{ξ} (ξ^{'})}{d ξ^{'}} & = 〈[(\frac{\partial V^{*}}{\partial ξ}) - k_{B} T (\frac{\partial \ln J}{\partial ξ})] δ (\hat{ξ} - ξ^{'})〉/ 〈δ (\hat{ξ} - ξ^{'})〉 . \end{matrix}

(90)

From Equation (90), we see that the derivative of

W_{ξ}

is a conditional average, at a given value of the observable, of the generalized force acting on the system, i.e., typically a thermodynamic force. The evaluation of this expression as given in Equation (90) requires constructing explicitly the set of generalized coordinate

u

, something usually very cumbersome, and needs an ad hoc derivation in each particular case. As a matter of fact, it is possible to circumvent this technical difficulty [15] and derive expressions directly in terms of the function

ξ (r)

and its derivatives with respect to the Cartesian coordinates

r

. The “work” associated with this force is what we identify with reversible work. By thermodynamic integration, we can get the reversible work relative to a reference state and by exponentiation the probability density associated with the random variable

\hat{ξ}

.

In standard conditions, when

\hat{ξ}

is a unimodal random variable, the sampling of its probability density is an easy matter that can be computed directly in any straightforward Monte Carlo or Molecular Dynamics simulation by simply recording the histogram of visited

ξ^{'}

values. Things become less evident when the probability distribution of the random variable is not only multimodal, but the regions in between the maxima are characterized by very low probabilities so that whatever simulation gets stuck in one of the highly probable regions and, physically speaking, we are in the presence of a metastability. When this is the case, a brute force sampling of the histogram is no longer possible and one has to find, by cunning, alternative ways to proceed. As we will see in the following, the concept of conditioned or constrained probability will take us out of the difficulty.

To see that, we consider the condition

ξ (r) = ξ^{'}

as the constraint

\begin{matrix} σ (r) = \hat{ξ} (r) - ξ^{'} = 0, \end{matrix}

(91)

and we compare the conditional probability

\begin{matrix} P (r | ξ^{'}) = \frac{P (r, ξ^{'})}{P (ξ^{'})} = \frac{e^{- β V} δ (\hat{ξ} - ξ^{'})}{\int d r e^{- β V} δ (\hat{ξ} - ξ^{'})} \end{matrix}

(92)

with Equation (80) for the marginal probability density in the presence of the constraint

σ

. We find

\begin{matrix} P (r | ξ^{'}) = \frac{{| Z |}^{- \frac{1}{2}} P_{M}^{c o n s t r} (r)}{\int d r {| Z |}^{- \frac{1}{2}} P_{M}^{c o n s t r} (r)} = \frac{{| Z |}^{- \frac{1}{2}}}{{〈 {| Z |}^{- \frac{1}{2}} 〉}_{σ}^{c o n s t r}} \frac{P_{M}^{c o n s t r} (r)}{\int d r P_{M}^{c o n s t r} (r)}, \end{matrix}

(93)

i.e., a way to sample the conditional probability of

r

given

ξ^{'}

by unbiasing a constrained probability density. Now, even regions of the configurational space associated with very low probabilities can be efficiently sampled and the metastability problem is taken out. In particular as a kind of corollary to Equation (93), we have for any configurational observable

\hat{O} (r)

,

\begin{matrix} {〈 \hat{O} (r) 〉}_{ξ^{'}}^{c o n d} = = \frac{{〈 {| Z |}^{- \frac{1}{2}} \hat{O} (r) 〉}_{σ}^{c o n s t r}}{{〈 {| Z |}^{- \frac{1}{2}} 〉}_{σ}^{c o n s t r}}, \end{matrix}

(94)

with the rhs that, at variance with its left counterpart, can be efficiently sampled even for values of

ξ^{'}

corresponding to metastabilities.

The next problem arises when we need to take conditional averages for observables depending on the whole phase space, i.e.,

r

and

p

. This case, apparently not so common, is instead general if one considers conditional dynamic properties (time correlation functions) even of configurational properties. Indeed, as it is immediately evident,

\hat{O} (r (t))

is nothing else than a function of the initial condition

(r, p)

parametrically dependent on the time t. Therefore, to be able to sample an unbiased conditional ensemble, with

σ = 0

, we need to have unbiased

P (r, p | ξ^{'})

. We know that the momenta in the constrained ensemble, Equation (83), are irreversibly biased and thus unusable. However, we can unbias the configurations taken along a constrained trajectory and associate with them momenta sampled from an unbiased probability distribution. Knowing that, in the original ensemble, positions and momenta are independent and moreover the distribution of momenta

P_{p} (p)

is just a product of Maxwellians, we can easily get such a sample

\begin{matrix} P (r, p | ξ^{'}) = P_{p} (p) P_{r} (r | ξ^{'}) \end{matrix}

(95)

from which directly a computable expression for a time correlation function at given

ξ^{'}

\begin{matrix} 〈 \hat{O} (r (t)) \hat{O} (r) | ξ^{'} 〉 = \frac{{〈 {| Z |}^{- \frac{1}{2}} \hat{O} (r (t)) \hat{O} (r) | ξ^{'} 〉}_{σ}^{c o n s t r}}{{〈 {| Z |}^{- \frac{1}{2}} 〉}_{σ}^{c o n s t r}}, \end{matrix}

(96)

where the time evolution now has to be intended to be fully unconstrained. The ensemble so constructed is the Blue Moon Ensemble and the problem of this particular metastability is now solved. In particular, if we are interested in the calculation of an unconditioned time correlation function in a system where a brute force calculation (due to metastability) is not possible, we can compute it by thermodynamic integration using the predetermined marginal probability of

ξ

,

P_{ξ} (ξ^{'})

. We get

\begin{matrix} 〈 \hat{O} (r (t)) \hat{O} (r) 〉 = \int d ξ^{'} P_{ξ} (ξ^{'}) 〈 \hat{O} (r (t)) \hat{O} (r) | ξ^{'} 〉 . \end{matrix}

(97)

To simplify the algebra and the notation, we have developed our argument only in the case of one scalar condition,

ξ

. The generalization to a vectorial condition is straightforward but cumbersome, and it can also be found explicitly derived in the literature [29]. The case in which the mechanical system contains constraints other than the ones representing physical conditions, typically molecular constraints, is formally more involved but conceptually identical. The interested reader can find all needed details in Ciccotti et al. [2].

6. Liouville Equation in the Presence of Constraints

The careful reader will have noticed at this point that we have properly solved the dynamics of a Hamiltonian system subjected to holonomic constraints and also formulated, in the Cartesian space, its statistical behavior at equilibrium. Instead, we have been unable to formulate in general the Statistical Mechanics of the system, including the evolution of the non equilibrium ensemble. The reason is that we miss the Liouville equation for this family of dynamical systems. We will see in the following that a generalized Liouville equation, always in the Cartesian reference description, can be derived at the price of abandoning the formulation of the dynamical evolution of the system by Lagrange multipliers and deriving, instead, the statistical behavior of a many-body, non-Hamiltonian system still satisfying the (assumed) conditions needed to justify a statistical treatment (e.g., chaotic behavior of the constituents, etc.). In these conditions, we will be able

(i): to get a correct generalized Liouville equation;
(ii): to find the results already obtained for the equilibrium ensemble;
(iii): to generalize to these systems the theory of the response to external perturbations (in particular Kubo linear response theory [30]) well known and of widespread use for Hamiltonian systems, not only theoretically but also in molecular dynamics simulations [19,31,32,33].

A general, non-Hamiltonian, dynamical autonomous system is defined, in the set of variables

{x} (\equiv {r, p = m \dot{r}})

, by

\begin{matrix} \dot{x} = G (x) \end{matrix}

(98)

with the single component

G_{i} (x)

not derivable as

\pm \frac{\partial H}{\partial x_{i^{'}}}

. The first and most important difference with a Hamiltonian system, especially in view of the derivation of the statistical properties of such a system, is that the phase space volume can be no more an invariant of the motion. If that happens, the standard approach of Statistical Mechanics is doomed to fail. However, it is easy to see that an invariant measure for the systems given in Equation (98) is easily found. Indeed, now

\begin{matrix} d x_{t} = J (x_{t}, x_{0}) d x_{0}, \end{matrix}

(99)

where

x_{t} = x_{t} (t; x_{0})

is the solution of Equation (98),

x_{0}

the initial condition and

J_{t} = J (x_{t}, x_{0})

the determinant of the Jacobian matrix of the time-generated change of variables. In Appendix B, it is shown that

\begin{matrix} \frac{d}{d t} J (x_{t}, x_{0}) = (\nabla_{x_{t}}^{T} {\dot{x}}_{t}) J (x_{t}, x_{0}) = κ (x_{t}) J (x_{t}, x_{0}), \end{matrix}

(100)

where

κ

, the divergence of the flow in phase space

κ (x_{t}) = \nabla_{x_{t}}^{T} {\dot{x}}_{t}

, is known as the phase space compressibility of the dynamical system, and we have introduced the shorthand notation

\nabla_{x_{t}}

for the gradient

\nabla_{x} = \frac{\partial}{\partial x}

with respect to coordinates

x_{t}

at time t. The solution of Equation (100) with the initial condition

J (x_{t = 0}, x_{0}) = 1

is

\begin{matrix} J (x_{t}, x_{0}) = \exp \{\int_{0}^{t} d τ κ (x_{τ}) d τ\} = \exp {w (x_{t}, t) - w (x_{0}, 0)}, \end{matrix}

(101)

where

w (x_{t}, t)

is the primitive function associated with the indefinite time integral of

κ (x_{t}),

which exists with certainty given that

κ (x_{t}) = \frac{d \ln J_{t}}{d t}

. Substituting this results in Equation (99), we find

\begin{matrix} \exp {- w (x_{t}, t)} d x_{t} = \exp {- w (x_{0}, 0)} d x_{0} \end{matrix}

(102)

i.e., the conservation in time of the measure

e^{- w (x_{t}, t)} d x_{t}

. The factor

\exp {- w (x_{t}, t)}

, let us call it

γ (x_{t}, t)

, is the metric factor associated with the coordinate transformation

x_{0} \to x_{t},

. It tells us that the statistical space of the variables

x

is no more Euclidean but has a non trivial metric structure. Remembering that the statistical ensemble is described by a probability density

P (x, t),

we have for the normalization condition,

\begin{matrix} \int d x γ (x) P (x, t) = 1, \forall t, \end{matrix}

(103)

from which we get

\frac{d}{d t} (\int d x γ P) = 0

and, therefore, the continuity equation

\begin{matrix} \frac{\partial}{\partial t} (γ (x) P (x, t)) + \nabla_{x}^{T} (\dot{x} γ (x) P (x, t)) = 0, \end{matrix}

(104)

which represents the new, valid form for the Liouville equation for our more general dynamical systems. The solutions of Equation (104) will give us the evolution in the time of the ensemble associated with our non-Hamiltonian systems in non-equilibrium conditions while their stationary, asymptotic solutions can represent the equilibrium ensemble. We will now proceed in two steps, in order to derive the consequences of this more general approach to Statistical Mechanics. First, we will discuss how to obtain, in these conditions, not just a stationary solution but the correct equilibrium ensemble corresponding to the microcanonical ensemble of the standard case and show, just for illustration, how by this procedure we can re-derive the equilibrium ensemble for systems subjected to holonomic constraints. Then, second, using the general form of the solution of Equation (104), we will show the validity of the response approach developed by Onsager and Kubo (at least) for the Hamiltonian case.

6.1. Generalized Distribution Function

Assuming that the system (98) possesses

n_{c}

conserved quantities

{\hat{C}}_{k} (x), k = 1, \dots, n_{c}

,

\begin{matrix} \frac{d {\hat{C}}_{k} (x)}{d t} = 0, k = 1, \dots, n_{c}, \end{matrix}

(105)

the space sampled by its trajectories will be the subspace intersection of the hypersurfaces

{\hat{C}}_{k} (x) = c_{k}

, where the values

c_{k}

are determined by the initial conditions. The “microcanonical” distribution function generated in these conditions is

\begin{matrix} P^{(m i c r o)} (x) \propto \prod_{k = 1}^{n_{c}} δ ({\hat{C}}_{k} (x) - c_{k}) . \end{matrix}

(106)

The solution (106) satisfies Equation (104) since its total time derivative is evidently zero and, moreover, is microcanonical since all accessible configurations are equiprobable. Other solutions exist, for example products of delta functions for subsets of the full set of conservation laws, but they do not correspond to physical ensembles since they will represent hypersurfaces containing states that will never be visited. In other words, physical ensembles cannot be obtained by using only the solutions of the Liouville equation (104). To satisfy the stationary Liouville equation is a necessary but not sufficient condition. From the previous observations, it is possible to derive the rules to be followed to construct the proper equilibrium ensemble and the correct invariant measure.

The ensemble:

Construct the distribution function by Equation (106) using all the independent conservation laws implicit in the equations of motion;
Eliminate from the statistical space all variables that result uncoupled to the bulk of the system or driven by it. By driven, we mean variables
(i)
whose evolution follows that of the other variables without influencing those ones and
(ii)
that do not appear in the phase space expression of any of the $n_{c}$ conserved quantities ${\hat{C}}_{k}$ .
A (not so) typical example could be that of particles of zero mass interacting with the system only via the holonomic constraints defining their own values (see Appendix C).

The measure:

3.: Once the essential, reduced, set of variables, let us call them $x^{'}$ , has been selected, calculate the phase space compressibility $κ (x^{'}) = \nabla_{x^{'}}^{T} {\dot{x}}^{'}$ of the reduced dynamical system

$\begin{matrix} {\dot{x}}^{'} = \tilde{G} (x^{'}) \end{matrix}$

(107)

and use $κ (x^{'})$ to determine $γ (x^{'})$ .

The results are

\begin{matrix} P^{(m i c r o)} (x^{'}) = \frac{\prod_{k = 1}^{n_{c}} δ ({\hat{C}}_{k} (x^{'}) - c_{k})}{\int d x^{'} γ (x^{'}) \prod_{k = 1}^{n_{c}} δ ({\hat{C}}_{k} (x^{'}) - c_{k}),} \end{matrix}

(108)

where, via the normalization factor, is implicitly defined the new partition function

\begin{matrix} Ω (c_{1}, \dots, c_{n_{c}}) = \int d x^{'} γ (x^{'}) \prod_{k = 1}^{n_{c}} δ ({\hat{C}}_{k} (x^{'}) - c_{k}) . \end{matrix}

(109)

We now turn, for illustrative purposes, to apply the formalism just developed to an originally Hamiltonian dynamical system subjected to holonomic constraints. As we have seen before, the non-Hamiltonian equations of motion are obtained inserting directly Equation (10) into Equation (14). The result is

\begin{matrix} {\dot{r}}_{i} = & \frac{p_{i}}{m_{i}}, \\ \dot{p_{i}} = & - \frac{\partial V}{\partial r_{i}} - \sum_{α = 1}^{f} \{\sum_{θ = 1}^{f} [- \frac{1}{m_{i}} \frac{\partial V}{\partial r_{i}} \cdot (\frac{\partial}{\partial r_{i}}) σ_{θ} \end{matrix}

(110)

\begin{matrix} + \sum_{k = 1}^{N} \sum_{j = 1}^{N} (\frac{p_{k}}{m_{k}} \cdot \frac{\partial}{\partial r_{k}}) (\frac{p_{j}}{m_{j}} \cdot \frac{\partial}{\partial r_{j}}) σ_{θ}] {(Z^{- 1})}_{θ, α}\} \frac{\partial σ_{α} (r)}{\partial r_{i}}, i = 1, \dots, N . \end{matrix}

(111)

Now,

σ_{α} = {\dot{σ}}_{α} = 0, α = 1, \dots, f

are

2 f

conservation laws to be added to the Hamiltonian. The compressibility factor

κ

is (easily) computed as

\begin{matrix} κ & = \nabla_{x}^{T} \dot{x} = \sum_{i = 1}^{N} (\frac{\partial}{\partial p_{i}} \cdot {\dot{p}}_{i}) = - \sum_{α = 1}^{f} \sum_{i = 1}^{N} (\frac{\partial λ_{α}}{\partial p_{i}}) \cdot \frac{\partial σ_{α}}{\partial r_{i}} \end{matrix}

(112)

\begin{matrix} = - 2 \sum_{α = 1}^{f} \sum_{θ = 1}^{f} \sum_{i = 1}^{N} [(\frac{1}{m_{i}} \frac{\partial}{\partial r_{i}}) \cdot (\sum_{j = 1}^{N} \frac{p_{j}}{m_{j}} \cdot \frac{\partial σ_{θ}}{\partial r_{j}})] {(Z^{- 1})}_{θ, α} \frac{\partial σ_{α}}{\partial r_{i}} \end{matrix}

(113)

\begin{matrix} = - \sum_{α = 1}^{f} \sum_{θ = 1}^{f} [2 \sum_{i = 1}^{N} \frac{1}{m_{i}} \frac{\partial σ_{α}}{\partial r_{i}} \cdot \frac{\partial {\dot{σ}}_{θ}}{\partial r_{i}}] {(Z^{- 1})}_{θ, α} = - \sum_{α = 1}^{f} \sum_{θ = 1}^{f} {\dot{Z}}_{α, θ} {(Z^{- 1})}_{θ, α} = - \frac{d}{d t} \ln | Z |, \end{matrix}

(114)

giving

\begin{matrix} γ = e^{- w} = | Z |, \end{matrix}

(115)

from which we recover the ensemble already derived, Equation (73).

6.2. Response Theory

We address now the central question of dynamical non-equilibrium Statistical Mechanics for systems subjected to holonomic constraints: how to get statistical averages when the evolution of the system is no more stationary be it due to time-dependent perturbations or to the study of relaxation processes [8]. These problems are already solved in the Hamiltonian case (even with non-Hamiltonian perturbations but conserving the phase space volume [19,31,32,33,34,35]); here, we extend that solution to our present case. Let us start from the simpler case of the study of relaxation. Here, we have the system prepared in a non-equilibrium condition and we intend to study the macroscopic relaxation of an observable:

\begin{matrix} {〈 \hat{O} 〉}_{t} = \int d ω (x) \hat{O} (x) P (x, t), \end{matrix}

(116)

where

d ω (x) = γ (x) d x

is the invariant measure already derived and

P (x, t)

is the ensemble at time t obtained by evolving with the generalized Liouville Equation (104) the initial non-stationary ensemble

P (x, 0) \equiv P (x)

. Let us define the Liouville operator

\begin{matrix} ı {\hat{L}}_{0} = {\dot{x}}^{T} \nabla_{x} = μ^{- 1} p^{T} \nabla_{r} - {(\nabla_{x} V)}^{T} \nabla_{p} - \sum_{α = 1}^{f} λ_{α} (x) {(\nabla_{r} σ_{α})}^{T} \nabla_{p} . \end{matrix}

(117)

It follows immediately that

\begin{matrix} \frac{d \hat{O} (x (t))}{d t} = ı {\hat{L}}_{0} \hat{O} (x (t)) \end{matrix}

(118)

with formal solution

\begin{matrix} \hat{O} (x (t)) = e^{ı {\hat{L}}_{0} t} \hat{O} (x) . \end{matrix}

(119)

As for the evolution of the ensemble, we can start from the Liouville equation (104)

\begin{matrix} \frac{\partial γ P}{\partial t} = - \nabla_{x}^{T} (\dot{x} γ P) = - (ı {\hat{L}}_{0} + κ) (γ P), \end{matrix}

(120)

from which we can (easily) find, by using the fact that

| Z | = | Z (r) |

and the identity

| Z | = e^{Tr \ln Z}

, that

\begin{matrix} ı {\hat{L}}_{0} γ & = \sum_{i = 1}^{N} \frac{p_{i}}{m_{i}} \cdot \frac{\partial | Z |}{\partial r_{i}} = \sum_{α = 1}^{f} \sum_{θ = 1}^{f} [\sum_{i = 1}^{N} \frac{p_{i}}{m_{i}} \cdot \frac{\partial Z_{α θ}}{\partial r_{i}}] {(Z^{- 1})}_{θ α} | Z | \end{matrix}

(121)

\begin{matrix} = [\sum_{α = 1}^{f} \sum_{θ = 1}^{f} {\dot{Z}}_{α θ} {(Z^{- 1})}_{θ α}] | Z | = - κ γ, \end{matrix}

(122)

i.e.,

\begin{matrix} \frac{\partial P}{\partial t} = (- ı {\hat{L}}_{0}) P, \end{matrix}

(123)

or else, once again,

\begin{matrix} \frac{d P}{d t} = \frac{\partial P}{\partial t} + ı {\hat{L}}_{0} P = 0 . \end{matrix}

(124)

Moreover, as in the standard case

\begin{matrix} P (x, t) = e^{- ı {\hat{L}}_{0} t} P (x) . \end{matrix}

(125)

By remembering that we are working with an invariant measure, we find, again,

\begin{matrix} {〈 \hat{O} 〉}_{t} = \int d ω \hat{O} (x) P (x, t) = \int d ω [e^{ı {\hat{L}}_{0} t} \hat{O} (x)] P (x) = \int d ω \hat{O} (x (t)) P (x), \end{matrix}

(126)

a relation easy to implement in molecular dynamics simulation of a relaxation process if we can prepare a sample of the non-equilibrium initial ensemble

P (x)

.

Let us now move to the case in which we are interested to compute the response of the system to an external time-independent field. The equations of motion become

\begin{matrix} \dot{x} = G (x) + D (x) F (t), \end{matrix}

(127)

with

D (x)

derivable (

D^{(r)} = \frac{\partial H_{p}}{\partial p}; D^{(p)} = - \frac{\partial H_{p}}{\partial r}

) or not from a Hamiltonian perturbation term (

H_{p}

). In any event, to simplify the formalism (and on the basis of what is usually done in transport studies [36]), let us assume that

\nabla_{x}^{T} D (x) = 0

, i.e., that the perturbation satisfies the incompressibility condition. This condition guarantees that, even in the presence of the perturbation, the non-zero compressibility arises only from the constraints and it is given by Equation (114). The Liouville operator is now time-dependent

\begin{matrix} ı \hat{L} (t) = p^{T} \nabla_{r} - (\nabla_{r}^{T} V + \sum_{α} λ_{α} \nabla_{r}^{T} σ_{α}) \nabla_{p} + F (t) D^{T} \nabla_{x} \end{matrix}

(128)

and

\begin{matrix} \hat{O} (x (t)) = T \exp \{ı \int_{0}^{t} d τ \hat{L} (τ)\} \hat{O} (x), \end{matrix}

(129)

where

T

is the time-ordering operator. However, in spite of this more daring complexity, again the probability density evolves with the operator

\begin{matrix} S^{†} (t, 0) = T \exp \{- ı \int_{0}^{t} d τ \hat{L} (τ)\} \end{matrix}

(130)

so that, again,

\begin{matrix} {〈 \hat{O} 〉}_{t} & = \int d ω \hat{O} (x) S^{†} (t, 0) P (x) \end{matrix}

(131)

\begin{matrix} = \int d ω \hat{O} (x (t)) P (x) \end{matrix}

(132)

is the initial condition an equilibrium distribution or a general one, in any event a relation easy to implement in molecular dynamics simulations. Equation (132) is valid in general both in the linear response regime and beyond it. In the case of small perturbations, it is possible to show, after some algebra, that, in the presence of constraints, the classical linear response result of Green [37] and Kubo [30] is recovered and holds without any alteration from the uncostrained case [9].

7. Conclusions

The dynamics and Statistical Mechanics of a many-body system subjected to holonomic constraints have been discussed both following, as for the equilibrium case, the classical historical Lagrange (Hamilton) approach, using Lagrangian multipliers, and, more generally, from the newer perspective, encompassing also non-equilibrium, of non-Hamiltonian flows in phase space. One section has been dedicated to review in depth the most relevant numerical implementations, while, for the sake of readability, the reader has been addressed to the relevant literature for the technically most involved cases. A quite peculiar application, the zero-mass particle case, has been discussed to show how constraints can be creatively used to extend the description of the system opening a different, efficient, way of incorporating, for example, new features in force field models. Let us, finally, remark that developing the statistical theory of dynamical systems subjected to holonomic constraints permits to cover both equilibrium and non-equilibrium simulations of molecular systems but also to explore the domain of rare events, including the computing of complex free energy landscapes, the probing of the dynamics of rare events and even performing non-equilibrium hydrodynamical simulations by properly sampling initial conditions assigning the proper weight to the ensemble of non-equilibrium trajectories that gives the correct (linear and nonlinear) response.

Acknowledgments

It gives us great pleasure to acknowledge the importance of the lifelong collaboration on these themes with Jean-Paul Ryckaert, Ray Kapral, Michiel Sprik, Glenn Martyna, Mark Tuckerman, Eric Vanden-Eijnden and Carsten Hartmann. This work was partially supported by grant MIUR PRIN-2012NNRKAF_004.

Author Contributions

The authors contributed equally to this work.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

To evaluate the thermodynamic force in Equation (90), we need to separately address the contributions from the kinetic and the potential terms in the Hamiltonian

H^{*} (u, p_{u})

,

\begin{matrix} \frac{d W_{ξ} (ξ^{'})}{d ξ^{'}} & = \frac{\int d u d p_{u} [(\frac{\partial K^{*}}{\partial ξ}) + (\frac{\partial V^{*}}{\partial ξ})] e^{- β H^{*}} δ (ξ - ξ^{'})}{\int d u d p_{u} e^{- β H^{*}} δ (ξ - ξ^{'}) .} \end{matrix}

(A1)

The potential term does not depend on the momenta

p_{u}

and its contribution is immediately addressed. The kinetic term

K^{*} = \frac{1}{2} p_{u}^{T} M^{- 1} p_{u}

can be rearranged in a more significant form by performing the integral over the momenta,

\begin{matrix} \frac{\int d p_{u} [\frac{1}{2} p_{u}^{T} \frac{\partial M^{- 1}}{\partial ξ} p_{u}] \exp {- \frac{β}{2} p_{u}^{T} M^{- 1} p_{u}}}{\int d p_{u} \exp {- \frac{β}{2} p_{u}^{T} M^{- 1} p_{u}}} \end{matrix}

(A2)

\begin{matrix} = - k_{B} T \frac{\frac{\partial}{\partial ξ} \int d p_{u} \exp {- \frac{β}{2} p_{u}^{T} M^{- 1} p_{u}}}{\int d p_{u} \exp {- \frac{β}{2} p_{u}^{T} M^{- 1} p_{u}}} \end{matrix}

(A3)

\begin{matrix} = - k_{B} T \frac{\frac{\partial}{\partial ξ} {(\frac{2 π}{β})}^{3 N / 2} {| M |}^{1 / 2}}{{(\frac{2 π}{β})}^{3 N / 2} {| M |}^{1 / 2}} \end{matrix}

(A4)

\begin{matrix} = - \frac{k_{B} T}{2} (\frac{\partial \ln | M |}{\partial ξ}) = - k_{B} T (\frac{\partial \ln J}{\partial ξ}), \end{matrix}

(A5)

where we have used Equation (77) and Equation (21).

Appendix B

We enclose for self-consistency of our text a straightforward demonstration of Equation (100). If the coordinate system is changed from coordinate

ξ = x (0)

to the coordinates

x = x (t | ξ)

from the time evolution transformation, the volume element in the n-dimensional space changes accordingly to the formula

\begin{matrix} d x_{1} d x_{2} \dots d x_{n} = d x = \frac{\partial (x_{1}, x_{2}, \dots, x_{n})}{\partial (ξ_{1}, ξ_{2}, \dots, ξ_{n})} d ξ_{1} d ξ_{2} \dots d ξ_{n} = J d ξ, \end{matrix}

(A6)

where

J = | J |

is the Jacobian, i.e., the determinant of the Jacobian matrix

\begin{matrix} {(J)}_{i, k} = \frac{\partial x_{i}}{\partial ξ_{k}} . \end{matrix}

(A7)

A simple linear algebra result, based on the invariance of determinants and traces with respect to unitary transformations, such as diagonalization, states that, for a given square matrix

A

, with

\tilde{A} = U^{- 1} A U

its diagonal form, the determinant

| A | = | \tilde{A} | = \prod_{i} {(\tilde{A})}_{i, i}

can be expressed using

Tr \ln A

, the trace of the logarithm of the matrix itself:

\begin{matrix} e^{Tr \ln A} = e^{Tr \ln \tilde{A}} = e^{\sum_{i} \ln {(\tilde{A})}_{i, i}} = \prod_{i} {(\tilde{A})}_{i, i} = | A | . \end{matrix}

(A8)

Applying Equation (A8) to the matrix

J

, one obtains for the Jacobian the formula

\begin{matrix} J (x, ξ) = | J | = e^{Tr \ln J} . \end{matrix}

(A9)

Equation (A9) can be derived with respect to time t to obtain for J the equation of motion

\begin{matrix} \frac{d J}{d t} = \frac{d e^{Tr \ln J}}{d t} = Tr (\frac{d J}{d t} J^{- 1}) e^{Tr \ln J} = (\sum_{i = 1}^{n} \sum_{k = 1}^{n} {(\frac{d J}{d t})}_{i, k} {(J^{- 1})}_{k, i}) J . \end{matrix}

(A10)

The derivatives of the elements of the Jacobian matrix can be expressed in terms of the velocity field

\dot{x} = \frac{d x}{d t} = G (x)

, where we have used Equation (98) to remind readers that the “velocities”

{\dot{x}}_{i}, i = 1, \dots, n

can be expressed as functions of the coordinates

x (ξ)

and, therefore, by exchanging the order of derivation,

\begin{matrix} {(\frac{d J}{d t})}_{i, k} = \frac{d}{d t} \frac{\partial x_{i}}{\partial ξ_{k}} = \frac{\partial {\dot{x}}_{i}}{\partial ξ_{k}} . \end{matrix}

(A11)

Substituting Equation (A11) in Equation (A10), and reminding readers that

{(J^{- 1})}_{k, i} = \frac{\partial ξ_{k}}{\partial x_{i}},

we finally obtain

\begin{matrix} \frac{d J}{d t} = (\sum_{i = 1}^{n} [\sum_{k = 1}^{n} \frac{\partial {\dot{x}}_{i}}{\partial ξ_{k}} \frac{\partial ξ_{k}}{\partial x_{i}}]) J = (\sum_{i = 1}^{n} \frac{\partial {\dot{x}}_{i}}{\partial x_{i}}) J = κ J, \end{matrix}

(A12)

i.e., Equation (100).

Appendix C

The axis of a diatomic molecule cannot provide a reference frame (a comoving frame) attached to the molecule. However, adding to the molecule a third point of mass zero not collinear with the physical molecule, we can get the comoving frame we were looking for. We show below that this extra variable is driven and doesn’t alter the dynamics and Statistical Mechanics of our system [38]. In order to create a rigid triatomic molecule, i.e., a rigid triangle, one needs to specify three so-called “bond” constraints for the square distances between each pair of atoms

\begin{matrix} σ_{i j} = \frac{1}{2} [{(r_{j} - r_{i})}^{2} - d_{i j}^{2}], i j = 12, 23, 31, \end{matrix}

(A13)

where

r_{i}

is the three-dimensional atomic coordinate of atom i with

i = 1, 2, 3

an

d_{i j}

is the (rigid) distance between atoms i and j. Bond constraints are easy to deal with since, as

\frac{\partial σ_{i j}}{\partial r_{i}} = - (r_{j} - r_{i})

, the constraint force is parallel to the bond, and the equations of motion are

\begin{matrix} m_{1} {\ddot{r}}_{1} = F_{1} + λ_{12} (r_{2} - r_{1}) + λ_{31} (r_{3} - r_{1}), \end{matrix}

(A14)

\begin{matrix} m_{2} {\ddot{r}}_{2} = F_{2} + λ_{23} (r_{3} - r_{2}) + λ_{12} (r_{1} - r_{2}), \end{matrix}

(A15)

\begin{matrix} m_{3} {\ddot{r}}_{3} = F_{3} + λ_{31} (r_{1} - r_{3}) + λ_{23} (r_{2} - r_{3}), \end{matrix}

(A16)

where

F_{i}

is the total force acting on atom i. Assuming

i = 3

is the index of the virtual particle and

F_{3} = 0

, one immediately has that

\begin{matrix} m_{3} {\ddot{r}}_{3} = λ_{31} (r_{1} - r_{3}) + λ_{23} (r_{2} - r_{3}) = 0, \end{matrix}

(A17)

implying

λ_{31} = 0

and

λ_{23} = 0

, as the two non-zero bond vectors

r_{1} - r_{3}

and

r_{2} - r_{3}

are, by definition, not collinear. By taking the limit for

m_{3} \to 0

of Equation (A16) after dividing it by

m_{3}

one has that, although the total force acting of

r_{3}

is zero, the acceleration

\begin{matrix} {\ddot{r}}_{3} = ζ_{1} (r_{1} - r_{3}) + ζ_{2} (r_{2} - r_{3}), (ζ_{1} = \lim_{m_{3} \to 0} \frac{λ_{31}}{m_{3}}, ζ_{2} = \lim_{m_{3} \to 0} \frac{λ_{23}}{m_{3}}) \end{matrix}

(A18)

does not need to vanish, and the equations of motion can be rewritten as

\begin{matrix} \{\begin{matrix} m_{1} {\ddot{r}}_{1} = F_{1} + λ_{12} (r_{2} - r_{1}), \\ m_{2} {\ddot{r}}_{2} = F_{2} + λ_{12} (r_{1} - r_{2}), \\ {\ddot{r}}_{3} = ζ_{1} (r_{1} - r_{3}) + ζ_{2} (r_{2} - r_{3}) . \end{matrix} \end{matrix}

(A19)

One can notice that, as expected, the motion of the two “real” atoms is not affected by the addition of the virtual one and the dynamics of the third virtual atom simply follows the motion of the first two, “driven” by the constraints. Noting, moreover, that the new variable doesn’t enter in any of the conservation laws of the system, we can conclude safely that also the statistical behavior of the system is not altered by the presence of the extra particle. Equations (A19) can be integrated numerically using SHAKE with the Verlet algorithm described in Section 3.

Another, possibly more interesting, case can arise with a force-field model containing extra-centers of force, whose positions do not coincide with the atomic positions but follow adiabatically the motion of the atoms, taking positions that satisfy the condition of zero force on them. The dynamics of these extra zero-mass points are again inherently driven by that of the material points and so does not intervene in the statistical behavior of the material system.

References

Carter, E.; Ciccotti, G.; Hynes, J.T.; Kapral, R. Constrained reaction coordinate dynamics for the simulation of rare events. Chem. Phys. Lett. 1989, 156, 472–477. [Google Scholar] [CrossRef]
Ciccotti, G.; Kapral, R.; Vanden-Eijnden, E. Blue Moon sampling, vectorial reaction coordinates, and unbiased constrained dynamics. ChemPhysChem 2005, 6, 1809–1814. [Google Scholar] [CrossRef] [PubMed]
Ryckaert, J.P.; Ciccotti, G.; Berendsen, H.J. Numerical integration of the Cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J. Comput. Phys. 1977, 23, 327–341. [Google Scholar] [CrossRef]
Car, R.; Parrinello, M. Unified approach for molecular dynamics and density-functional theory. Phys. Rev. Lett. 1985, 55, 2471–2474. [Google Scholar] [CrossRef] [PubMed]
Ciccotti, G.; Ferrario, M. Constrained and nonequilibrium molecular dynamics. In Classical and Quantum Dynamics in Condensed Phase Simulations; World Scientific: Singapore, 1998; pp. 157–177. [Google Scholar]
Ryckaert, J.P.; Ciccotti, G. Introduction of Andersen’s demon in the molecular dynamics of systems with constraints. J. Chem. Phys. 1983, 78, 7368–7374. [Google Scholar] [CrossRef]
Tuckerman, M.E.; Liu, Y.; Ciccotti, G.; Martyna, G.J. Non-Hamiltonian molecular dynamics: Generalizing Hamiltonian phase space principles to non-Hamiltonian systems. J. Chem. Phys. 2001, 115, 1678–1702. [Google Scholar] [CrossRef]
Ciccotti, G.; Kapral, R.; Sergi, A. Non-equilibrium molecular dynamics. In Handbook of Materials Modeling; Yip, S., Ed.; Springer: Berlin, Germany, 2005; pp. 745–761. [Google Scholar]
Hartmann, C.; Schütte, C.; Ciccotti, G. Communications: On the linear response of mechanical systems with constraints. J. Chem. Phys. 2010, 132, 111103. [Google Scholar] [CrossRef] [PubMed]
Goldstein, H.; Poole, C.P.; Safko, J.L. Classical Mechanics, 3rd Edition ed; Addison-Wesley: Boston, MA, USA, 2000. [Google Scholar]
Ciccotti, G.; Ryckaert, J. Molecular dynamics simulation of rigid molecules. Comput. Phys. Rep. 1986, 4, 346–392. [Google Scholar]
Andersen, H.C. Rattle: A “velocity” version of the shake algorithm for molecular dynamics calculations. J. Comput. Phys. 1983, 52, 24–34. [Google Scholar] [CrossRef]
Weinbach, Y.; Elber, R. Revisiting and parallelizing SHAKE. J. Comput. Phys. 2005, 209, 193–206. [Google Scholar] [CrossRef]
Ciccotti, G.; Ferrario, M.; Hynes, J.T.; Kapral, R. Molecular dynamics simulation of ion association reactions in a polar solvent. J. Chim. Phys. 1988, 85, 925–929. [Google Scholar] [CrossRef]
Sprik, M.; Ciccotti, G. Free energy from constrained molecular dynamics. J. Chem. Phys. 1998, 109, 7737–7744. [Google Scholar] [CrossRef]
Orlandini, S.; Meloni, S.; Ciccotti, G. Hydrodynamics from Statistical Mechanics: Combined dynamical-NEMD and conditional sampling to relax an interface between two immiscible liquids. Phys. Chem. Chem. Phys. 2011, 13, 13177–13181. [Google Scholar] [CrossRef] [PubMed]
Cottone, G.; Lattanzi, G.; Ciccotti, G.; Elber, R. Multiphoton absorption of myoglobin–nitric oxide complex: Relaxation by D-NEMD of a stationary state. J. Phys. Chem. B 2012, 116, 3397–3410. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Pourali, M.; Meloni, S.; Magaletti, F.; Maghari, A.; Casciola, C.M.; Ciccotti, G. Relaxation of a steep density gradient in a simple fluid: Comparison between atomistic and continuum modeling. J. Chem. Phys. 2014, 141, 154107. [Google Scholar] [CrossRef] [PubMed]
Ciccotti, G.; Ferrario, M. Dynamical non-equilibrium molecular dynamics. Entropy 2014, 16, 233–257. [Google Scholar] [CrossRef] [Green Version]
Ciccotti, G.; Bonella, S.; Ferrario, M.; Pierleoni, C. Probabilistic derivation of spatiotemporal correlation functions in the hydrodynamic limit. J. Phys. Chem. B 2016, 120, 1996–2000. [Google Scholar] [CrossRef] [PubMed]
Fixman, M. Classical Statistical Mechanics of constraints: A theorem and application to polymers. Proc. Nat. Acad. Sci. USA 1974, 71, 3050–3053. [Google Scholar] [CrossRef] [PubMed]
Hairer, E.; Lubich, C.; Wanner, G. Geometric numerical integration illustrated by the Störmer–Verlet method. Acta Numer. 2003, 12, 399–450. [Google Scholar] [CrossRef]
Hairer, E.; Wanner, G.; Lubich, C. Geometric Numerical Integration. Structure-Preserving Algorithms for Ordinary Differential Equations; Springer: Berlin, Germany, 2006. [Google Scholar]
Hess, B.; Bekker, H.; Berendsen, H.J.C.; Fraaije, J.G.E.M. LINCS: A linear constraint solver for molecular simulations. J. Comput. Chem. 1997, 18, 1463–1472. [Google Scholar] [CrossRef]
Kräutler, V.; van Gunsteren, W.F.; Hünenberger, P.H. A fast SHAKE algorithm to solve distance constraint equations for small molecules in molecular dynamics simulations. J. Comput. Chem. 2001, 22, 501–508. [Google Scholar] [CrossRef]
Gonnet, P. P-SHAKE: A quadratically convergent SHAKE in O(n2). J. Comput. Phys. 2007, 220, 740–750. [Google Scholar] [CrossRef]
Gonnet, P.; Walther, J.H.; Koumoutsakos, P. θ-SHAKE: An extension to SHAKE for the explicit treatment of angular constraints. Comput. Phys. Commun. 2009, 180, 360–364. [Google Scholar] [CrossRef]
Leimkuhler, B.; Reich, S. Symplectic integration of constrained Hamiltonian systems. Math. Comput. 1994, 63, 589–605. [Google Scholar] [CrossRef]
Sergi, A.; Ciccotti, G.; Falconi, M.; Desideri, A.; Ferrario, M. Effective binding force calculation in a dimeric protein by molecular dynamics simulation. J. Chem. Phys. 2002, 116, 6329–6338. [Google Scholar] [CrossRef]
Kubo, R. Statistical-Mechanical theory of irreversible processes. I. General theory and simple applications to magnetic and conduction problems. J. Phys. Soc. Japan 1957, 12, 570–586. [Google Scholar] [CrossRef]
Ciccotti, G.; Jacucci, G. Direct computation of dynamical response by molecular dynamics: The mobility of a charged Lennard-Jones particle. Phys. Rev. Lett. 1975, 35, 789–792. [Google Scholar] [CrossRef]
Ciccotti, G.; Jacucci, G.; McDonald, I.R. “Thought-experiments” by molecular dynamics. J. Stat. Phys. 1979, 21, 1–22. [Google Scholar] [CrossRef]
Ciccotti, G.; Ferrario, M. Non-equilibrium by molecular dynamics: A dynamical approach. Mol. Simul. 2016, 42, 1385–1400. [Google Scholar] [CrossRef]
Ferrario, M.; Bonella, S.; Ciccotti, G. On the establishment of thermal diffusion in binary Lennard-Jones liquids. Eur. Phys. J. Spec. Top. 2016, 225, 1629–1642. [Google Scholar] [CrossRef]
Bonella, S.; Ferrario, M.; Ciccotti, G. Thermal diffusion in binary mixtures: Transient behavior and transport coefficients from equilibrium and nonequilibrium molecular dynamics. Langmuir 2017, 33, 11281–11290. [Google Scholar] [CrossRef] [PubMed]
Evans, D.J.; Morriss, G. Statistical Mechanics of Nonequilibrium Liquids; Cambridge University Press: Cambridge, UK, 2008. [Google Scholar]
Green, M.S. Markoff random processes and the Statistical Mechanics of time-dependent phenomena. J. Chem. Phys. 1952, 20, 1281–1295. [Google Scholar] [CrossRef]
Ryckaert, J.P.; Bellemans, A.; Ciccotti, G. The rotation-translation coupling in diatomic molecules. Mol. Phys. 1981, 44, 979–996. [Google Scholar] [CrossRef]

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ciccotti, G.; Ferrario, M. Holonomic Constraints: A Case for Statistical Mechanics of Non-Hamiltonian Systems. Computation 2018, 6, 11. https://doi.org/10.3390/computation6010011

AMA Style

Ciccotti G, Ferrario M. Holonomic Constraints: A Case for Statistical Mechanics of Non-Hamiltonian Systems. Computation. 2018; 6(1):11. https://doi.org/10.3390/computation6010011

Chicago/Turabian Style

Ciccotti, Giovanni, and Mauro Ferrario. 2018. "Holonomic Constraints: A Case for Statistical Mechanics of Non-Hamiltonian Systems" Computation 6, no. 1: 11. https://doi.org/10.3390/computation6010011

APA Style

Ciccotti, G., & Ferrario, M. (2018). Holonomic Constraints: A Case for Statistical Mechanics of Non-Hamiltonian Systems. Computation, 6(1), 11. https://doi.org/10.3390/computation6010011

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Holonomic Constraints: A Case for Statistical Mechanics of Non-Hamiltonian Systems

Abstract

1. Introduction

2. Dynamics with Holonomic Constraints

3. SHAKE, Integrating the Equations of Motion

3.1. Verlet Algorithm

3.2. Velocity-Verlet Algorithm

4. Equilibrium Statistical Mechanics in the Hamiltonian Formulation

5. Rare Events and Blue Moon Ensemble

6. Liouville Equation in the Presence of Constraints

6.1. Generalized Distribution Function

6.2. Response Theory

7. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

Appendix A

Appendix B

Appendix C

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI