From Tools in Symplectic and Poisson Geometry to J.-M. Souriau’s Theories of Statistical Mechanics and Thermodynamics

Marle, Charles-Michel

doi:10.3390/e18100370

Open AccessFeature PaperArticle

From Tools in Symplectic and Poisson Geometry to J.-M. Souriau’s Theories of Statistical Mechanics and Thermodynamics^†

by

Charles-Michel Marle

Institut de Mathématiques de Jussieu, Université Pierre et Marie Curie, 4, Place Jussieu, 75252 Paris Cedex 05, France

^†

In memory of Jean-Marie Souriau (1922–2012).

Entropy 2016, 18(10), 370; https://doi.org/10.3390/e18100370

Submission received: 28 July 2016 / Revised: 30 September 2016 / Accepted: 5 October 2016 / Published: 19 October 2016

(This article belongs to the Special Issue Differential Geometrical Theory of Statistics)

Download Versions Notes

Abstract

:

I present in this paper some tools in symplectic and Poisson geometry in view of their applications in geometric mechanics and mathematical physics. After a short discussion of the Lagrangian an Hamiltonian formalisms, including the use of symmetry groups, and a presentation of the Tulczyjew’s isomorphisms (which explain some aspects of the relations between these formalisms), I explain the concept of manifold of motions of a mechanical system and its use, due to J.-M. Souriau, in statistical mechanics and thermodynamics. The generalization of the notion of thermodynamic equilibrium in which the one-dimensional group of time translations is replaced by a multi-dimensional, maybe non-commutative Lie group, is fully discussed and examples of applications in physics are given.

Keywords:

Lagrangian formalism; Hamiltonian formalism; symplectic manifolds; Poisson structures; symmetry groups; momentum maps; thermodynamic equilibria; generalized Gibbs states

1. Introduction

1.1. Contents of the Paper, Sources and Further Reading

This paper presents tools in symplectic and Poisson geometry in view of their application in geometric mechanics and mathematical physics. The Lagrangian formalism and symmetries of Lagrangian systems are discussed in Section 2 and Section 3, the Hamiltonian formalism and symmetries of Hamiltonian systems in Section 4 and Section 5. Section 6 introduces the concepts of Gibbs state and of thermodynamic equilibrium of a mechanical system, and presents several examples. For a monoatomic classical ideal gas, eventually in a gravity field, or a monoatomic relativistic gas the Maxwell–Boltzmann and Maxwell–Jüttner probability distributions are derived. The Dulong and Petit law which governs the specific heat of solids is obtained. Finally Section 7 presents the generalization of the concept of Gibbs state, due to Jean-Marie Souriau, in which the group of time translations is replaced by a (multi-dimensional and eventually non-Abelian) Lie group.

Several books [1,2,3,4,5,6,7,8,9,10,11] discuss, much more fully than in the present paper, the contents of Section 2, Section 3, Section 4 and Section 5. The interested reader is referred to these books for detailed proofs of results whose proofs are only briefly sketched here. The recent paper [12] contains detailed proofs of most results presented here in Section 4 and Section 5.

The main sources used for Section 6 and Section 7 are the book and papers by Jean-Marie Souriau [13,14,15,16,17] and the beautiful small book by Mackey [18].

The Euler–Poincaré equation, which is presented with Lagrangian symmetries at the end of Section 3, is not really related to symmetries of a Lagrangian system, since the Lie algebra which acts on the configuration space of the system is not a Lie algebra of symmetries of the Lagrangian. Moreover in its intrinsic form that equation uses the concept of Hamiltonian momentum map presented later, in Section 5. Since the Euler–Poincaré equation is not used in the following sections, the reader can skip the corresponding subsection at his or her first reading.

1.2. Notations

The notations used are more or less those generally used now in differential geometry. The tangent and cotangent bundles to a smooth manifold M are denoted by

T M

and

T^{*} M

, respectively, and their canonical projections by

τ_{M} : T M \to M

and

π_{M} : T^{*} M \to M

. The vector spaces of k-multivectors and k-forms on M are denoted by

A^{k} (M)

and

Ω^{k} (M)

, respectively, with

k \in Z

and, of course,

A^{k} (M) = {0}

and

Ω^{k} (M) = {0}

if

k < 0

and if

k > dim M

, k-multivectors and k-forms being skew-symmetric. The exterior algebras of multivectors and forms of all degrees are denoted by

A (M) = \oplus_{k} A^{k} (M)

and

Ω (M) = \oplus_{k} Ω^{k} (M)

, respectively. The exterior differentiation operator of differential forms on a smooth manifold M is denoted by

d : Ω (M) \to Ω (M)

. The interior product of a differential form

η \in Ω (M)

by a vector field

X \in A^{1} (M)

is denoted by

i (X) η

.

Let

f : M \to N

be a smooth map defined on a smooth manifold M, with values in another smooth manifold N. The pull-back of a form

η \in Ω (N)

by a smooth map

f : M \to N

is denoted by

f^{*} η \in Ω (M)

.

A smooth, time-dependent vector field on the smooth manifold M is a smooth map

X : R \times M \to T M

such that, for each

t \in R

and

x \in M

,

X (t, x) \in T_{x} M

, the vector space tangent to M at x. When, for any

x \in M

,

X (t, x)

does not depend on

t \in R

, X is a smooth vector field in the usual sense, i.e., an element in

A^{1} (M)

. Of course a time-dependent vector field can be defined on an open subset of

R \times M

instead than on the whole

R \times M

. It defines a differential equation

\frac{d φ (t)}{d t} = X (t, φ (t)),

(1)

said to be associated to X. The (full) flow of X is the map

Ψ^{X}

, defined on an open subset of

R \times R \times M

, taking its values in M, such that for each

t_{0} \in R

and

x_{0} \in M

the parametrized curve

t \mapsto Ψ^{X} (t, t_{0}, x_{0})

is the maximal integral curve of Equation (1) satisfying

Ψ (t_{0}, t_{0}, x_{0}) = x_{0}

. When

t_{0}

and

t \in R

are fixed, the map

x_{0} \mapsto Ψ^{X} (t, t_{0}, x_{0})

is a diffeomorphism, defined on an open subset of M (which may be empty) and taking its values in another open subset of M, denoted by

Ψ_{(t, t_{0})}^{X}

. When X is in fact a vector field in the usual sense (not dependent on time),

Ψ_{(t, t_{0})}^{X}

only depends on

t - t_{0}

. Instead of the full flow of X we can use its reduced flow

Φ^{X}

, defined on an open subset of

R \times M

and taking its values in M, related to the full flow

Ψ^{X}

by

Φ^{X} (t, x_{0}) = Ψ^{X} (t, 0, x_{0}), Ψ^{X} (t, t_{0}, x_{0}) = Φ^{X} (t - t_{0}, x_{0}) .

For each

t \in R

, the map

x_{0} \mapsto Φ^{X} (t, x_{0}) = Ψ^{X} (t, 0, x_{0})

is a diffeomorphism, denoted by

Φ_{t}^{X}

, defined on an open subset of M (which may be empty) onto another open subset of M.

When

f : M \to N

is a smooth map defined on a smooth manifold M, with values in another smooth manifold N, there exists a smooth map

T f : T M \to T N

called the prolongation of f to vectors, which for each fixed

x \in M

linearly maps

T_{x} M

into

T_{f (x)} N

. When f is a diffeomorphism of M onto N,

T f

is an isomorphism of

T M

onto

T N

. That property allows us to define the canonical lifts of a vector field X in

A^{1} (M)

to the tangent bundle

T M

and to the cotangent bundle

T^{*} M

. Indeed, for each

t \in R

,

Φ_{t}^{X}

is a diffeomorphism of an open subset of M onto another open subset of M. Therefore

T Φ_{t}^{X}

is a diffeomorphism of an open subset of

T M

onto another open subset of

T M

. It turns out that when t takes all possible values in

R

the set of all diffeomorphisms

T Φ_{t}^{X}

is the reduced flow of a vector field

\bar{X}

on

T M

, which is the canonical lift of X to the tangent bundle

T M

.

Similarly, the transpose

{(T Φ_{- t}^{X})}^{T}

of

T Φ_{- t}^{X}

is a diffeomorphism of an open subset of the cotangent bundle

T^{*} M

onto another open subset of

T^{*} M

, and when t takes all possible values in

R

the set of all diffeomorphisms

{(T Φ_{- t}^{X})}^{T}

is the reduced flow of a vector field

\hat{X}

on

T^{*} M

, which is the canonical lift of X to the cotangent bundle

T^{*} M

.

The canonical lifts of a vector field to the tangent and cotangent bundles are used in Section 3 and Section 5. They can be defined too for time-dependent vector fields.

2. The Lagrangian Formalism

2.1. The Configuration Space and the Space of Kinematic States

The principles of mechanics were stated by the great English mathematician Isaac Newton (1642–1727) in his book Philosophia Naturalis Principia Mathematica published in 1687 [19]. On this basis, a little more than a century later, Joseph Louis Lagrange (1736–1813) in his book Mécanique analytique [20] derived the equations (today known as the Euler–Lagrange equations) which govern the motion of a mechanical system made of any number of material points or rigid material bodies interacting between them by very general forces, and eventually submitted to external forces.

In modern mathematical language, these equations are written on the configuration space and on the space of kinematic states of the considered mechanical system. The configuration space is a smooth n-dimensional manifold N whose elements are all the possible configurations of the system (a configuration being the position in space of all parts of the system). The space of kinematic states is the tangent bundle

T N

to the configuration space, which is

2 n

-dimensional. Each element of the space of kinematic states is a vector tangent to the configuration space at one of its elements, i.e., at a configuration of the mechanical system, which describes the velocity at which this configuration changes with time. In local coordinates a configuration of the system is determined by the n coordinates

x^{1}, \dots, x^{n}

of a point in N, and a kinematic state by the

2 n

coordinates

x^{1}, \dots, x^{n}, v^{1}, \dots v^{n}

of a vector tangent to N at some element in N.

2.2. The Euler–Lagrange Equations

When the mechanical system is conservative, the Euler–Lagrange equations involve a single real valued function L called the Lagrangian of the system, defined on the product of the real line

R

(spanned by the variable t representing the time) with the manifold

T N

of kinematic states of the system. In local coordinates, the Lagrangian L is expressed as a function of the

2 n + 1

variables,

t, x^{1}, \dots, x^{n}, v^{1}, \dots, v^{n}

and the Euler–Lagrange equations have the remarkably simple form

\frac{d}{d t} (\frac{\partial L}{\partial v^{i}} (t, x (t), v (t))) - \frac{\partial L}{\partial x^{i}} (t, x (t), v (t)) = 0, 1 \leq i \leq n,

where

x (t)

stands for

x^{1} (t), \dots, x^{n} (t)

and

v (t)

for

v^{1} (t), \dots, v^{n} (t)

with, of course,

v^{i} (t) = \frac{d x^{i} (t)}{d t}, 1 \leq i \leq n .

2.3. Hamilton’s Principle of Stationary Action

The great Irish mathematician William Rowan Hamilton (1805–1865) observed [21,22] that the Euler–Lagrange equations can be obtained by applying the standard techniques of Calculus of Variations, due to Leonhard Euler (1707–1783) and Joseph Louis Lagrange, to the action integral (Lagrange observed that fact before Hamilton, but in the last edition of his book he chose to derive the Euler–Lagrange equations by application of the principle of virtual works, using a very clever evaluation of the virtual work of inertial forces for a smooth infinitesimal variation of the motion).

I_{L} (γ) = \int_{t_{0}}^{t_{1}} L (t, x (t), v (t)) d t, with v (t) = \frac{d x (t)}{d t},

where

γ : [t_{0}, t_{1}] \to N

is a smooth curve in N parametrized by the time t. These equations express the fact that the action integral

I_{L} (γ)

is stationary with respect to any smooth infinitesimal variation of γ with fixed end-points

(t_{0}, γ (t_{0}))

and

(t_{1}, γ (t_{1}))

. This fact is today called Hamilton’s principle of stationary action. The reader interested in Calculus of Variations and its applications in mechanics and physics is referred to the books [23,24,25].

2.4. The Euler-Cartan Theorem

The Lagrangian formalism is the use of Hamilton’s principle of stationary action for the derivation of the equations of motion of a system. It is widely used in mathematical physics, often with more general Lagrangians involving more than one independent variable and higher order partial derivatives of dependent variables. For simplicity I will consider here only the Lagrangians of (maybe time-dependent) conservative mechanical systems.

An intrinsic geometric expression of the Euler–Lagrange equations, wich does not use local coordinates, was obtained by the great French mathematician Élie Cartan (1869–1951). Let us introduce the concepts used by the statement of this theorem.

Definition 1.

Let N be the configuration space of a mechanical system and let its tangent bundle

T N

be the space of kinematic states of that system. We assume that the evolution with time of the state of the system is governed by the Euler–Lagrange equations for a smooth, maybe time-dependent Lagrangian

L : R \times T N \to R

.

1.: The cotangent bundle $T^{*} N$ is called the phase space of the system.
2.: The map $L_{L} : R \times T N \to T^{*} N$

$L_{L} (t, v) = d_{vert} L (t, v), t \in R, v \in T N,$

where $d_{vert} L (t, v)$ is the vertical differential of L at $(t, v)$ , i.e., the differential at v of the the map $w \mapsto L (t, w)$ , with $w \in τ_{N}^{- 1} (τ_{N} (v))$ , is called the Legendre map associated to L.
3.: The map $E_{L} : R \times T N \to R$ given by

$E_{L} (t, v) = 〈 L_{L} (t, v), v 〉 - L (t, v), t \in R, v \in T N,$

is called the the energy function associated to L.
4.: The 1-form on $R \times T N$

${\hat{ϖ}}_{L} = L_{L}^{*} θ_{N} - E_{L} (t, v) d t,$

where $θ_{N}$ is the Liouville 1-form on $T^{*} N$ , is called the Euler–Poincaré 1-form.

Theorem 1 (Euler-Cartan Theorem).

A smooth curve

γ : [t_{0}, t_{1}] \to N

parametrized by the time

t \in [t_{0}, t_{1}]

is a solution of the Euler–Lagrange equations if and only if, for each

t \in [t_{0}, t_{1}]

the derivative with respect to t of the map

t \mapsto (t, \frac{d γ (t)}{d t})

belongs to the kernel of the 2-form

d {\hat{ϖ}}_{L}

, in other words if and only if

i (\frac{d}{d t} (t, \frac{d γ (t)}{d t})) d {\hat{ϖ}}_{L} (t, \frac{d γ (t)}{d t}) = 0 .

The interested reader will find the proof of that theorem in [26], (Theorem 2.2, Chapter IV, p. 262) or, for hyper-regular Lagrangians (an additional assumption which in fact, is not necessary) in [27], Chapter IV, Theorem 2.1, p. 167.

Remark 1.

In his book [14], Jean-Marie Souriau uses a slightly different terminology: for him the odd-dimensional space

R \times T N

is the evolution space of the system, and the exact 2-form

d {\hat{ϖ}}_{L}

on that space is the Lagrange form. He defines that 2-form in a setting more general than that of the Lagrangian formalism.

3. Lagrangian Symmetries

3.1. Assumptions and Notations

In this section N is the configuration space of a conservative Lagrangian mechanical system with a smooth, maybe time dependent Lagrangian

L : R \times T N \to R

. Let

{\hat{ϖ}}_{L}

be the Poincaré-Cartan 1-form on the evolution space

R \times T N

.

Several kinds of symmetries can be defined for such a system. Very often, they are special cases of infinitesimal symmetries of the Poincaré-Cartan form, which play an important part in the famous Noether theorem.

Definition 2.

An infinitesimal symmetry of the Poincaré-Cartan form

{\hat{ϖ}}_{L}

is a vector field Z on

R \times T N

such that

L (Z) {\hat{ϖ}}_{L} = 0,

L (Z)

denoting the Lie derivative of differential forms with respect to Z.

Example 1.

1.: Let us assume that the Lagrangian L does not depend on the time $t \in R$ , i.e., is a smooth function on $T N$ . The vector field on $R \times T N$ denoted by $\frac{\partial}{\partial t}$ , whose projection on $R$ is equal to 1 and whose projection on $T N$ is 0, is an infinitesimal symmetry of ${\hat{ϖ}}_{L}$ .
2.: Let X be a smooth vector field on N and $\bar{X}$ be its canonical lift to the tangent bundle $T N$ . We still assume that L does not depend on the time t. Moreover we assume that $\bar{X}$ is an infinitesimal symmetry of the Lagrangian L, i.e., that $L (\bar{X}) L =$ 0. Considered as a vector field on $R \times T N$ whose projection on the factor $R$ is 0, $\bar{X}$ is an infinitesimal symmetry of ${\hat{ϖ}}_{L}$ .

3.2. The Noether Theorem in Lagrangian Formalism

Theorem 2 (E. Noether’s Theorem in Lagrangian Formalism).

Let Z be an infinitesimal symmetry of the Poincaré-Cartan form

{\hat{ϖ}}_{L}

. For each possible motion

γ : [t_{0}, t_{1}] \to N

of the Lagrangian system, the function

i (Z) {\hat{ϖ}}_{L}

, defined on

R \times T N

, keeps a constant value along the parametrized curve

t \mapsto (t, \frac{d γ (t)}{d t})

.

Proof.

Let

γ : [t_{0}, t_{1}] \to N

be a motion of the Lagrangian system, i.e., a solution of the Euler–Lagrange equations. The Euler-Cartan Theorem 1 proves that, for any

t \in [t_{0}, t_{1}]

,

i (\frac{d}{d t} (t, \frac{d γ (t)}{d t})) d {\hat{ϖ}}_{L} (t, \frac{d γ (t)}{d t}) = 0 .

Since Z is an infinitesimal symmetry of

{\hat{ϖ}}_{L}

,

L (Z) {\hat{ϖ}}_{L} = 0 .

Using the well known formula relating the Lie derivative, the interior product and the exterior derivative

L (Z) = i (Z) \circ d + d \circ i (Z)

we can write

\begin{matrix} \frac{d}{d t} (i (Z) {\tilde{ϖ}}_{L} (t, \frac{d γ (t)}{d t})) & = 〈d i (Z) {\hat{ϖ}}_{L}, \frac{d}{d t} (t, \frac{d γ (t)}{d t})〉 \\ = - 〈i (Z) d {\hat{ϖ}}_{L}, \frac{d}{d t} (t, \frac{d γ (t)}{d t})〉 \\ = 0 . \end{matrix}

☐

Example 2.

When the Lagrangian L does not depend on time, application of Emmy Noether’s theorem to the vector field

\frac{\partial}{\partial t}

shows that the energy

E_{L}

remains constant during any possible motion of the system, since

i (\frac{\partial}{\partial t}) {\hat{ϖ}}_{L} = - E_{L}

.

Remark 2.

1.: Theorem 2 is due to the German mathematician Emmy Noether (1882–1935), who proved it under much more general assumptions than those used here. For a very nice presentation of Emmy Noether’s theorems in a much more general setting and their applications in mathematical physics, interested readers are referred to the very nice book by Yvette Kosmann-Schwarzbach [28].
2.: Several generalizations of the Noether theorem exist. For example, if instead of being an infinitesimal symmetry of ${\hat{ϖ}}_{L}$ , i.e., instead of satisfying $L (Z) {\hat{ϖ}}_{L} =$ 0 the vector field Z satisfies

$L (Z) {\hat{ϖ}}_{L} = d f,$

where $f : R \times T M \to R$ is a smooth function, which implies of course $L (Z) (d {\hat{ϖ}}_{L}) =$ 0, the function

$i (Z) {\hat{ϖ}}_{L} - f$

keeps a constant value along $t \mapsto (t, \frac{d γ (t)}{d t})$ .

3.3. The Lagrangian Momentum Map

The Lie bracket of two infinitesimal symmetries of

{\hat{ϖ}}_{L}

is too an infinitesimal symmetry of

{\hat{ϖ}}_{L}

. Let us therefore assume that there exists a finite-dimensional Lie algebra of vector fields on

R \times T N

whose elements are infinitesimal symmetries of

{\hat{ϖ}}_{L}

.

Definition 3.

Let

ψ : G \to A^{1} (R \times T N)

be a Lie algebras homomorphism of a finite-dimensional real Lie algebra

G

into the Lie algebra of smooth vector fields on

R \times T N

such that, for each

X \in G

,

ψ (X)

is an infinitesimal symmetry of

{\hat{ϖ}}_{L}

. The Lie algebras homomorphism ψ is said to be a Lie algebra action on

R \times T N

by infinitesimal symmetries of

{\hat{ϖ}}_{L}

. The map

K_{L} : R \times T N \to G^{*}

, which takes its values in the dual

G^{*}

of the Lie algebra

G

, defined by

〈K_{L} (t, v), X〉 = i (ψ (X)) {\hat{ϖ}}_{L} (t, v), X \in G, (t, v) \in R \times T N,

is called the Lagrangian momentum of the Lie algebra action ψ.

Corollary 1 (of E. Noether’s Theorem).

Let

ψ : G \to A^{1} (R \times T M)

be an action of a finite-dimensional real Lie algebra

G

on the evolution space

R \times T N

of a conservative Lagrangian system, by infinitesimal symmetries of the Poincaré-Cartan form

{\hat{ϖ}}_{L}

. For each possible motion

γ : [t_{0}, t_{1}] \to N

of that system, the Lagrangian momentum map

K_{L}

keeps a constant value along the parametrized curve

t \mapsto (t, \frac{d γ (t)}{d t})

.

Proof.

Since for each

X \in G

the function

(t, v) \mapsto 〈K_{L} (t, v), X〉

keeps a constant value along the parametrized curve

t \mapsto (t, \frac{d γ (t)}{d t})

, the map

K_{L}

itself keeps a constant value along that parametrized curve. ☐

Example 3.

Let us assume that the Lagrangian L does not depend explicitly on the time t and is invariant by the canonical lift to the tangent bundle of the action on N of the six-dimensional group of Euclidean diplacements (rotations and translations) of the physical space. The corresponding infinitesimal action of the Lie algebra of infinitesimal Euclidean displacements (considered as an action on

R \times T N

, the action on the factor

R

being trivial) is an action by infinitesimal symmetries of

{\hat{ϖ}}_{L}

. The six components of the Lagrangian momentum map are the three components of the total linear momentum and the three components of the total angular momentum.

Remark 3.

These results are valid without any assumption of hyper-regularity of the Lagrangian.

3.4. The Euler–Poincaré Equation

In a short Note [29] published in 1901, the great french mathematician Henri Poincaré (1854–1912) proposed a new formulation of the equations of mechanics.

Let N be the configuration manifold of a conservative Lagrangian system, with a smooth Lagrangian

L : T N \to R

which does not depend explicitly on time. Poincaré assumes that there exists an homomorphism ψ of a finite-dimensional real Lie algebra

G

into the Lie algebra

A^{1} (N)

of smooth vector fields on N, such that for each

x \in N

, the values at x of the vector fields

ψ (X)

, when X varies in

G

, completely fill the tangent space

T_{x} N

. The action ψ is then said to be locally transitive.

Of course these assumptions imply

dim G \geq dim N

.

Under these assumptions, Henri Poincaré proved that the equations of motion of the Lagrangian system could be written on

N \times G

or on

N \times G^{*}

, where

G^{*}

is the dual of the Lie algebra

G

, instead of on the tangent bundle

T N

. When

dim G = dim N

(which can occur only when the tangent bundle

T N

is trivial) the obtained equation, called the Euler–Poincaré equation, is perfectly equivalent to the Euler–Lagrange equations and may, in certain cases, be easier to use. But when

dim G > dim N

, the system made by the Euler–Poincaré equation is underdetermined.

Let

γ : [t_{0}, t_{1}] \to N

be a smooth parametrized curve in N. Poincaré proves that there exists a smooth curve

V : [t_{0}, t_{1}] \to G

in the Lie algebra

G

such that, for each

t \in [t_{0}, t_{1}]

,

ψ (V (t)) (γ (t)) = \frac{d γ (t)}{d t} .

(2)

When

dim G > dim N

the smooth curve V in

G

is not uniquely determined by the smooth curve γ in N. However, instead of writing the second-order Euler–Lagrange differential equations on

T N

satisfied by γ when this curve is a possible motion of the Lagrangian system, Poincaré derives a first order differential equation for the curve V and proves that it is satisfied, together with Equation (2), if and only if γ is a possible motion of the Lagrangian system.

Let

φ : N \times G \to T N

and

\bar{L} : N \times G \to R

be the maps

φ (x, X) = ψ (X) (x), \bar{L} (x, X) = L \circ φ (x, X) .

We denote by

d_{1} \bar{L} : N \times G \to T^{*} N

and by

d_{2} \bar{L} : N \times G \to G^{*}

the partial differentials of

\bar{L} : N \times G \to R

with respect to its first variable

x \in N

and with respect to its second variable

X \in G

.

The map

φ : N \times G \to T N

is a surjective vector bundles morphism of the trivial vector bundle

N \times G

into the tangent bundle

T N

. Its transpose

φ^{T} : T^{*} N \to N \times G^{*}

is therefore an injective vector bundles morphism, which can be written

φ^{T} (ξ) = (π_{N} (ξ), J (ξ)),

where

π_{N} : T^{*} N \to N

is the canonical projection of the cotangent bundle and

J : T^{*} N \to G^{*}

is a smooth map whose restriction to each fibre

T_{x}^{*} N

of the cotangent bundle is linear, and is the transpose of the map

X \mapsto φ (x, X) = ψ (X) (x)

.

Remark 4.

The homomorphism ψ of the Lie algebra

G

into the Lie algebra

A^{1} (N)

of smooth vector fields on N is an action of that Lie algebra, in the sense defined below Definition 11. That action can be canonically lifted into a Hamiltonian action of

G

on

T^{*} N

, endowed with its canonical symplectic form

d θ_{N}

Definition 13. The map J is in fact a Hamiltonian momentum map for that Hamiltonian action Proposition 5.

Let

L_{L} = d_{vert} L : T N \to T^{*} N

be the Legendre map defined in Definition 1.

Theorem 3 (Euler–Poincaré Equation).

With the above defined notations, let

γ : [t_{0}, t_{1}] \to N

be a smooth parametrized curve in N and

V : [t_{0}, t_{1}] \to G

be a smooth parametrized curve such that, for each

t \in [t_{0}, t_{1}]

,

ψ (V (t)) (γ (t)) = \frac{d γ (t)}{d t} .

(3)

The curve γ is a possible motion of the Lagrangian system if and only if V satisfies the equation

(\frac{d}{d t} - {ad}_{V (t)}^{*}) (J \circ L_{L} \circ φ (γ (t), V (t))) - J \circ d_{1} \bar{L} (γ (t), V (t)) = 0 .

(4)

The interested reader will find a proof of that theorem in local coordinates in the original Note by Poincaré [29]. More intrinsic proofs can be found in [12,30]. Another proof is possible, in which that theorem is deduced from the Euler-Cartan Theorem 1.

Remark 5.

Equation (3) is called the compatibility condition and Equation (4) is the Euler–Poincaré equation. It can be written under the equivalent form

(\frac{d}{d t} - {ad}_{V (t)}^{*}) (d_{2} \bar{L} (γ (t), V (t))) - J \circ d_{1} \bar{L} (γ (t), V (t)) = 0 .

(5)

Examples of applications of the Euler–Poincaré equation can be found in [5,6,12,30] and, for an application in thermodynamics, [31].

4. The Hamiltonian Formalism

The Lagrangian formalism can be applied to any smooth Lagrangian. Its application yields second order differential equations on

R \times N

(in local coordinates, the Euler–Lagrange equations) which in general are not solved with respect to the second order derivatives of the unknown functions with respect to time. The classical existence and unicity theorems for the solutions of differential equations (such as the Cauchy-Lipschitz theorem) therefore cannot be applied to these equations.

Under the additional assumption that the Lagrangian is hyper-regular, a very clever change of variables discovered by William Rowan Hamilton (Lagrange obtained however Hamilton’s equations before Hamilton, but only in a special case, for the slow “variations of constants” such as the orbital parameters of planets in the solar system [32,33]). Hamilton [21,22] allows a new formulation of these equations in the framework of symplectic geometry. The Hamiltonian formalism discussed below is the use of these new equations. It was later generalized independently of the Lagrangian formalism.

4.1. Hyper-Regular Lagrangians

Assumptions Made in this Section

We consider in this section a smooth, maybe time-dependent Lagrangian

L : R \times T N \to R

, which is such that the Legendre map Definition 1

L_{L} : R \times T N \to T^{*} N

satisfies the following property: for each fixed value of the time

t \in R

, the map

v \mapsto L_{L} (t, v)

is a smooth diffeomorphism of the tangent bundle

T N

onto the cotangent bundle

T^{*} N

. An equivalent assumption is the following: the map

({id}_{R}, L_{L}) : (t, v) \mapsto (t, L_{L} (t, v))

is a smooth diffeomorphism of

R \times T N

onto

R \times T^{*} N

. The Lagrangian L is then said to be hyper-regular. The equations of motion can be written on

R \times T^{*} N

instead of

R \times T N

.

Definition 4.

Under the assumption Section 4.1, the function

H_{L} : R \times T^{*} N \to R

given by

H_{L} (t, p) = E_{L} \circ {({id}_{R}, L_{L})}^{- 1} (t, p), t \in R, p \in T^{*} N,

(

E_{L} : R \times T N \to R

being the energy function defined in Definition 1) is called the Hamiltonian associated to the hyper-regular Lagrangian L.

The 1-form defined on

R \times T^{*} N

{\hat{ϖ}}_{H_{L}} = θ_{N} - H_{L} d t,

where

θ_{N}

is the Liouville 1-form on

T^{*} N

, is called the Poincaré-Cartan 1-form in the Hamiltonian formalism.

Remark 6.

The Poincaré-Cartan 1-form

{\hat{ϖ}}_{L}

on

R \times T N

, defined in Definition 1, is the pull-back, by the diffeomorphism

({id}_{R}, L_{L}) : R \times T N \to R \times T^{*} N

, of the Poincaré-Cartan 1-form

{\hat{ϖ}}_{H_{L}}

in the Hamiltonian formalism on

R \times T^{*} N

defined above.

4.2. Presymplectic Manifolds

Definition 5.

A presymplectic form on a smooth manifold M is a 2-form ω on M which is closed, i.e., such that

d ω = 0

. A manifold M equipped with a presymplectic form ω is called a presymplectic manifold and denoted by

(M, ω)

. The kernel

ker ω

of a presymplectic form ω defined on a smooth manifold M is the set of vectors

v \in T M

such that

i (v) ω = 0

.

Remark 7.

A symplectic form ω on a manifold M is a presymplectic form which, moreover, is non-degenerate, i.e., such that for each

x \in M

and each non-zero vector

v \in T_{x} M

, there exists another vector

w \in T_{x} M

such that

ω (x) (v, w) \neq 0

. Or in other words, a presymplectic form ω whose kernel is the set of null vectors.

The kernel of a presymplectic form ω on a smooth manifold M is a vector sub-bundle of

T M

if and only if for each

x \in M

, the vector subspace

T_{x} M

of vectors

v \in T_{x} M

which satisfy

i (v) ω = 0

is of a fixed dimension, the same for all points

x \in M

. A presymplectic form which satisfies that condition is said to be of constant rank.

Proposition 1.

Let ω be a presymplectic form of constant rank Remark 7 on a smooth manifold M. The kernel

ker ω

of ω is a completely integrable vector sub-bundle of

T M

, which defines a foliation

F_{ω}

of M into connected immersed submanifolds which, at each point of M, have the fibre of

ker ω

at that point as tangent vector space.

We now assume in addition that this foliation is simple, i.e., such that the set of leaves of

F_{ω}

, denoted by

M / ker ω

, has a smooth manifold structure for which the canonical projection

p : M \to M / ker ω

(which associates to each point

x \in M

the leaf which contains x) is a smooth submersion. There exists on

M / ker ω

a unique symplectic form

ω_{r}

such that

ω = p^{*} ω_{r} .

Proof.

Since

d ω = 0

, the fact that

ker ω

is completely integrable is an immediate consequence of the Frobenius’ theorem ([27], Chapter III, Theorem 5.1, p. 132). The existence and unicity of a symplectic form

ω_{r}

on

M / ker ω

such that

ω = p^{*} ω_{r}

results from the fact that

M / ker ω

is built by quotienting M by the kernel of ω. ☐

Presymplectic Manifolds in Mechanics

Let us go back to the assumptions and notations of Section 4.1. We have seen in Remark 6 that the Poincaré-Cartan 1-form in Hamiltonian formalism

{\hat{ϖ}}_{H_{L}}

on

R \times T^{*} N

and the Poincaré-Cartan 1-form in Lagrangian formalism

{\hat{ϖ}}_{L}

on

R \times T N

are related by

{\hat{ϖ}}_{L} = {({id}_{R}, L_{L})}^{*} {\hat{ϖ}}_{H_{L}} .

Their exterior differentials

d {\hat{ϖ}}_{L}

and

d {\hat{ϖ}}_{H_{L}}

both are presymplectic 2-forms on the odd-dimensional manifolds

R \times T N

and

R \times T^{*} N

, respectively. At any point of these manifolds, the kernels of these closed 2-forms are one-dimensional. They therefore Proposition 1 determine foliations into smooth curves of these manifolds. The Euler-Cartan Theorem 1 shows that each of these curves is a possible motion of the system, described either in the Lagrangian formalism, or in the Hamiltonian formalism, respectively.

The set of all possible motions of the system, called by Jean-Marie Souriau the manifold of motions of the system, is described by the quotient

(R \times T N) / ker d {\hat{ϖ}}_{L}

in the Lagrangian formalism, and by the quotient

(R \times T^{*} N) / ker d {\hat{ϖ}}_{H_{L}}

in the Hamiltonian formalism. Both are (maybe non-Hausdorff) symplectic manifolds, the projections on these quotient manifolds of the presymplectic forms

d {\hat{ϖ}}_{L}

and

d {\hat{ϖ}}_{H_{L}}

both being symplectic forms. Of course the diffeomorphism

({id}_{R}, L_{L}) : R \times T N \to R \times T^{*} N

projects onto a symplectomorphism between the Lagrangian and Hamiltonian descriptions of the manifold of motions of the system.

4.3. The Hamilton Equation

Proposition 2.

Let N be the configuration manifold of a Lagrangian system whose Lagrangian

L : R \times T N \to R

, maybe time-dependent, is smooth and hyper-regular, and

H_{L} : R \times T^{*} N \to R

be the associated Hamiltonian Definition 4. Let

φ : [t_{0}, t_{1}] \to N

be a smooth curve parametrized by the time

t \in [t_{0}, t_{1}]

, and let

ψ : [t_{0}, t_{1}] \to T^{*} N

be the parametrized curve in

T^{*} N

ψ (t) = L_{L} (t, \frac{d γ (t)}{d t}), t \in [t_{0}, t_{1}],

where

L_{L} : R \times T N \to T^{*} N

is the Legendre map Definition 1.

The parametrized curve

t \mapsto γ (t)

is a motion of the system if and only if the parametrized curve

t \mapsto ψ (t)

satisfies the equatin, called the Hamilton equation,

i (\frac{d ψ (t)}{d t}) d θ_{N} = - d H_{L t},

where

d H_{L t} = d H_{L} - \frac{\partial H_{L}}{\partial t} d t

is the differential of the function

H_{L t} : T^{*} N \to R

in which the time t is considered as a parameter with respect to which there is no differentiation.

When the parametrized curve ψ satisfies the Hamilton equation stated above, it satisfies too the equation, called the energy equation

\frac{d}{d t} (H_{L} (t, ψ (t))) = \frac{\partial H_{L}}{\partial t} (t, ψ (t)) .

Proof.

These results directly follow from the Euler-Cartan Theorem 1. ☐

Remark 8.

The 2-form

d θ_{N}

is a symplectic form on the cotangent bundle

T^{*} N

, called its canonical symplectic form. We have shown that when the Lagrangian L is hyper-regular, the equations of motion can be written in three equivalent manners:

1.: as the Euler–Lagrange equations on $R \times T M$ ,
2.: as the equations given by the kernels of the presymplectic forms $d {\hat{ϖ}}_{L}$ or $d {\hat{ϖ}}_{H_{L}}$ which determine the foliations into curves of the evolution spaces $R \times T M$ in the Lagrangian formalism, or $R \times T^{*} M$ in the Hamiltonian formalism,
3.: as the Hamilton equation associated to the Hamiltonian $H_{L}$ on the symplectic manifold $(T^{*} N, d θ_{N})$ , often called the phase space of the system.

4.3.1. The Tulczyjew Isomorphisms

Around 1974, Tulczyjew [34,35] discovered (

β_{N}

was probably known long before 1974, but I believe that

α_{N}

, much more hidden, was noticed by Tulczyjew for the first time) two remarkable vector bundles isomorphisms

α_{N} : T T^{*} N \to T^{*} T N

and

β_{N} : T T^{*} N \to T^{*} T^{*} N

.

The first one

α_{N}

is an isomorphism of the bundle

(T T^{*} N, T π_{N}, T N)

onto the bundle

(T^{*} T N, π_{T N}, T N)

, while the second

β_{N}

is an isomorphism of the bundle

(T T^{*} N, τ_{T^{*} N}, T^{*} N)

onto the bundle

(T^{*} T^{*} N, π_{T^{*} N}, T^{*} N)

. The diagram below is commutative.

Since they are the total spaces of cotangent bundles, the manifolds

T^{*} T N

and

T^{*} T^{*} N

are endowed with the Liouville 1-forms

θ_{T N}

and

θ_{T^{*} N}

, and with the canonical symplectic forms

d θ_{T N}

and

d θ_{T^{*} N}

, respectively. Using the isomorphisms

α_{N}

and

β_{N}

, we can therefore define on

T T^{*} N

two 1-forms

α_{N}^{*} θ_{T N}

and

β_{N}^{*} θ_{T^{*} N}

, and two symplectic 2-forms

α_{N}^{*} (d θ_{T N})

and

β_{N}^{*} (d θ_{T^{*} N})

. The very remarkable property of the isomorphisms

α_{N}

and

β_{N}

is that the two symplectic forms so obtained on

T T^{*} N

are equal:

α_{N}^{*} (d θ_{T N}) = β_{N}^{*} (d θ_{T^{*} N}) .

The 1-forms

α_{N}^{*} θ_{T N}

and

β_{N}^{*} θ_{T^{*} N}

are not equal, their difference is the differential of a smooth function.

4.3.2. Lagrangian Submanifolds

In view of applications to implicit Hamiltonian systems, let us recall here that a Lagrangian submanifold of a symplectic manifold

(M, ω)

is a submanifold N whose dimension is half the dimension of M, on which the form induced by the symplectic form ω is 0.

Let

L : T N \to R

and

H : T^{*} N \to R

be two smooth real valued functions, defined on

T N

and on

T^{*} N

, respectively. The graphs

d L (T N)

and

d H (T^{*} N)

of their differentials are Lagrangian submanifolds of the symplectic manifolds

(T^{*} T N, d θ_{T N})

and

(T^{*} T^{*} N, d θ_{T^{*} N})

. Their pull-backs

α_{N}^{- 1} (d L (T N))

and

β_{N}^{- 1} (d H (T^{*} N))

by the symplectomorphisms

α_{N}

and

β_{N}

are therefore two Lagrangian submanifolds of the manifold

T T^{*} N

endowed with the symplectic form

α_{N}^{*} (d θ_{T N})

, which is equal to the symplectic form

β_{N}^{*} (d θ_{T^{*} N})

.

The following theorem enlightens some aspects of the relationships between the Hamiltonian and the Lagrangian formalisms.

Theorem 4 (W. M. Tulczyjew).

With the notations specified above Section 4.3.2, let

X_{H} : T^{*} N \to T T^{*} N

be the Hamiltonian vector field on the symplectic manifold

(T^{*} N, d θ_{N})

associated to the Hamiltonian

H : T^{*} N \to R

, defined by

i (X_{H}) d θ_{N} = - d H

. Then

X_{H} (T^{*} N) = β_{N}^{- 1} (d H (T^{*} N)) .

Moreover, the equality

α_{N}^{- 1} (d L (T N)) = β_{N}^{- 1} (d H (T^{*} N))

holds if and only if the Lagrangian L is hyper-regular and such that

d H = d (E_{L} \circ L_{L}^{- 1}),

where

L_{L} : T N \to T^{*} N

is the Legendre map and

E_{L} : T N \to R

the energy associated to the Lagrangian L.

The interested reader will find the proof of that theorem in the works of Tulczyjew ([34,35]).

When L is not hyper-regular,

α_{N}^{- 1} (d L (T N))

still is a Lagrangian submanifold of the symplectic manifold

(T T^{*} N, α_{N}^{*} (d θ_{T N}))

, but it is no more the graph of a smooth vector field

X_{H}

defined on

T^{*} N

. Tulczyjew proposes to consider this Lagrangian submanifold as an implicit Hamilton equation on

T^{*} N

.

These results can be extended to Lagrangians and Hamiltonians which may depend on time.

4.4. The Hamiltonian Formalism on Symplectic and Poisson Manifolds

4.4.1. The Hamilton Formalism on Symplectic Manifolds

In pure mathematics as well as in applications of mathematics to mechanics and physics, symplectic manifolds other than cotangent bundles are encountered. A theorem due to the french mathematician Gaston Darboux (1842–1917) asserts that any symplectic manifold

(M, ω)

is of even dimension

2 n

and is locally isomorphic to the cotangent bundle to a n-dimensional manifold: in a neighbourhood of each of its point there exist local coordinates

(x^{1}, \dots, x^{n}, p_{1}, \dots, p_{n})

, called Darboux coordinates with which the symplectic form ω is expressed exactly as the canonical symplectic form of a cotangent bundle:

ω = \sum_{i = 1}^{n} d p_{i} \land d x^{i} .

Let

(M, ω)

be a symplectic manifold and

H : R \times M \to R

a smooth function, said to be a time-dependent Hamiltonian. It determines a time-dependent Hamiltonian vector field

X_{H}

on M, such that

i (X_{H}) ω = - d H_{t},

H_{t} : M \to R

being the function H in which the variable t is considered as a parameter with respect to which no differentiation is made.

The Hamilton equation determined by H is the differential equation

\frac{d ψ (t)}{d t} = X_{H} (t, ψ (t)) .

The Hamiltonian formalism can therefore be applied to any smooth, maybe time dependent Hamiltonian on M, even when there is no associated Lagrangian.

The Hamiltonian formalism is not limited to symplectic manifolds: it can be applied, for example, to Poisson manifolds [36], contact manifolds and Jacobi manifolds [37]. For simplicity I will consider only Poisson manifolds. Readers interested in Jacobi manifolds and their generalizations are referred to the papers by Lichnerowicz quoted above and to the very important paper by Kirillov [38].

Definition 6.

A Poisson manifold is a smooth manifold P whose algebra of smooth functions

C^{\infty} (P, R)

is endowed with a bilinear composition law, called the Poisson bracket, which associates to any pair

(f, g)

of smooth functions on P another smooth function denoted by

{f, g}

, that composition satisfying the three properties

1.: it is skew-symmetric,

${g, f} = - {f, g},$
2.: it satisfies the Jacobi identity

$\{f, {g, h}\} + \{g, {h, f}\} + \{h, {f, g}\} = 0,$
3.: it satisfies the Leibniz identity

${f, g h} = {f, g} h + g {f, h} .$

Example 4.

1.: On the vector space of smooth functions defined on a symplectic manifold $(M, ω)$ , there exists a composition law, called the Poisson bracket, which satisfies the properties stated in Definition 6. Let us recall briefly its definition. The symplectic form ω allows us to associate, to any smooth function $f \in C^{\infty} (M, R)$ , a smooth vector field $X_{f} \in A^{1} (M, R)$ , called the Hamiltonian vector field associated to f, defined by

$i (X_{f}) ω = - d f .$

The Poisson bracket ${f, g}$ of two smooth functions f and $g \in C^{\infty} (M, R)$ is defined by the three equivalent equalities

${f, g} = i (X_{f}) d g = - i (X_{g}) d f = ω (X_{f}, X_{g}) .$

Any symplectic manifold is therefore a Poisson manifold.
The Poisson bracket of smooth functions defined on a symplectic manifold (when that symplectic manifold is a cotangent bundle) was discovered by Siméon Denis Poisson (1781–1840) [39].
2.: Let $G$ be a finite-dimensional real Lie algebra, and let $G^{*}$ be its dual vector space. For each smooth function $f \in C^{\infty} (G^{*}, R)$ and each $ζ \in G^{*}$ , the differential $d f (ζ)$ is a linear form on $G^{*}$ , in other words an element of the dual vector space of $G^{*}$ . Identifying with $G$ the dual vector space of $G^{*}$ , we can therefore consider $d f (ζ)$ as an element in $G$ . With this identification, we can define the Poisson bracket of two smooth functions f and $g \in C^{\infty} (G^{*}, R)$ by

${f, g} (ζ) = [d f (ζ), d g (ζ)], ζ \in G^{*},$

the bracket in the right hand side being the bracket in the Lie algebra $G$ . The Poisson bracket of functions in $C^{\infty} (G^{*}, R)$ so defined satifies the properties stated in Definition 6. The dual vector space of any finite-dimensional real Lie algebra is therefore endowed with a Poisson structure, called its canonical Lie-Poisson structure or its Kirillov-Kostant-Souriau Poisson structure. Discovered by Sophus Lie, this structure was indeed rediscovered independently by Alexander Kirillov, Bertram Kostant and Jean-Marie Souriau.
3.: A symplectic cocycle of a finite-dimensional, real Lie algebra $G$ is a skew-symmetric bilinear map $Θ : G \times G \to G^{*}$ which satisfies, for all X, Y and $Z \in G$ ,

$Θ ([X, Y], Z) + Θ ([Y, Z], X) + Θ ([Z, X], Y) = 0 .$

The canonical Lie-Poisson bracket of two smooth functions f and $g \in C^{\infty} (G^{*}, R)$ can be modified by means of the symplectic cocycle Θ, by setting

${f, g}_{Θ} (ζ) = [d f (ζ), d g (ζ)] - Θ (d f (ζ), d g (ζ)), ζ \in G^{*} .$

This bracket still satifies the properties stated in Definition 6, therefore defines on $G^{*}$ a Poisson structure called its canonical Lie-Poisson structure modified by Θ.

4.4.2. Properties of Poisson Manifolds

The interested reader will find the proofs of the properties recalled here in [8,9,10,11].

1.: On a Poisson manifold P, the Poisson bracket ${f, g}$ of two smooth functions f and g can be expressed by means of a smooth field of bivectors Λ:

${f, g} = Λ (d f, d g), f and g \in C^{\infty} (P, R),$

called the Poisson bivector field of P. The considered Poisson manifold is often denoted by $(P, Λ)$ . The Poisson bivector field Λ identically satisfies

$[Λ, Λ] = 0,$

the bracket $[,]$ in the left hand side being the Schouten-Nijenhuis bracket. That bivector field determines a vector bundle morphism $Λ^{♯} : T^{*} P \to T P$ , defined by

$Λ (η, ζ) = 〈ζ, Λ^{♯} (η)〉,$

where η and $ζ \in T^{*} P$ are two covectors attached to the same point in P.
Readers interested in the Schouten-Nijenhuis bracket will find thorough presentations of its properties in [40,41].
2.: Let $(P, Λ)$ be a Poisson manifold. A (maybe time-dependent) vector field on P can be associated to each (maybe time-dependent) smooth function $H : R \times P \to R$ . It is called the Hamiltonian vector field associated to the Hamiltonian H, and denoted by $X_{H}$ . Its expression is

$X_{H} (t, x) = Λ^{♯} (x) (d H_{t} (x)),$

where $d H_{t} (x) = d H (t, x) - \frac{\partial H (t, x)}{\partial t} d t$ is the differential of the function deduced from H by considering t as a parameter with respect to which no differentiation is made.
The Hamilton equation determined by the (maybe time-dependent) Hamiltonian H is

$\frac{d φ (t)}{d t} = X_{H} ((t, φ (t)) = Λ^{♯} (d H_{t}) (φ (t)) .$
3.: Any Poisson manifold is foliated, by a generalized foliation whose leaves may not be all of the same dimension, into immersed connected symplectic manifolds called the symplectic leaves of the Poisson manifold. The value, at any point of a Poisson manifold, of the Poisson bracket of two smooth functions only depends on the restrictions of these functions to the symplectic leaf through the considered point, and can be calculated as the Poisson bracket of functions defined on that leaf, with the Poisson structure associated to the symplectic structure of that leaf. This property was discovered by Alan Weinstein, in his very thorough study of the local structure of Poisson manifolds [42].

5. Hamiltonian Symmetries

5.1. Presymplectic, Symplectic and Poisson Maps and Vector Fields

Let M be a manifold endowed with some structure, which can be either

a presymplectic structure, determined by a presymplectic form, i.e., a 2-form ω which is closed ( $d ω = 0$ ),
a symplectic structure, determined by a symplectic form ω, i.e., a 2-form ω which is both closed ( $d ω = 0$ ) and nondegenerate ( $ker ω = {0}$ ),
a Poisson structure, determined by a smooth Poisson bivector field Λ satisfying $[Λ, Λ] = 0$ .

Definition 7.

A presymplectic (resp. symplectic, resp. Poisson) diffeomorphism of a presymplectic (resp., symplectic, resp. Poisson) manifold

(M, ω)

(resp.

(M, Λ)

) is a smooth diffeomorphism

f : M \to M

such that

f^{*} ω = ω

(resp.

f^{*} Λ = Λ

).

Definition 8.

A smooth vector field X on a presymplectic (resp. symplectic, resp. Poisson) manifold

(M, ω)

(resp.

(M, Λ)

) is said to be a presysmplectic (resp. symplectic, resp. Poisson) vector field if

L (X) ω = 0

(resp. if

L (X) Λ = 0

), where

L (X)

denotes the Lie derivative of forms or mutivector fields with respect to X.

Definition 9.

Let

(M, ω)

be a presymplectic or symplectic manifold. A smooth vector field X on M is said to be Hamiltonian if there exists a smooth function

H : M \to R

, called a Hamiltonian for X, such that

i (X) ω = - d H .

Not any smooth function on a presymplectic manifold can be a Hamiltonian.

Definition 10.

Let

(M, Λ)

be a Poisson manifold. A smooth vector field X on M is said to be Hamiltonian if there exists a smooth function

H \in C^{\infty} (M, R)

, called a Hamiltonian for X, such that

X = Λ^{♯} (d H)

. An equivalent definition is that

i (X) d g = {H, g} f o r a n y g \in C^{\infty} (M, R),

where

{H, g} = Λ (d H, d g)

denotes the Poisson bracket of the functions H and g.

On a symplectic or a Poisson manifold, any smooth function can be a Hamiltonian.

Proposition 3.

A Hamiltonian vector field on a presymplectic (resp. symplectic, resp. Poisson) manifold automatically is a presymplectic (resp. symplectic, resp. Poisson) vector field.

The proof of this result, which is easy, can be found in any book on symplectic and Poisson geoemetry, for example [8,9,10].

5.2. Lie Algebras and Lie Groups Actions

Definition 11.

An action on the left (resp. an action on the right) of a Lie group G on a smooth manifold M is a smooth map

Φ : G \times M \to M

(resp. a smooth map

Ψ : M \times G \to M

) such that

for each fixed $g \in G$ , the map $Φ_{g} : M \to M$ defined by $Φ_{g} (x) = Φ (g, x)$ (resp. the map $Ψ_{g} : M \to M$ defined by $Ψ_{g} (x) = Ψ (x, g)$ ) is a smooth diffeomorphism of M,
$Φ_{e} = {id}_{M}$ (resp. $Ψ_{e} = {id}_{M}$ ), e being the neutral element of G,
for each pair $(g_{1}, g_{2}) \in G \times G$ , $Φ_{g_{1}} \circ Φ_{g_{2}} = Φ_{g_{1} g_{2}}$ (resp. $Ψ_{g_{1}} \circ Ψ_{g_{2}} = Ψ_{g_{2} g_{1}}$ ).

An action of a Lie algebra

G

on a smooth manifold M is a Lie algebras morphism of

G

into the Lie algebra

A^{1} (M)

of smooth vector fields on M, i.e., a linear map

ψ : G \to A^{1} (M)

which associates to each

X \in G

a smooth vector field

ψ (X)

on M such that for each pair

(X, Y) \in G \times G

,

ψ ([X, Y]) = [ψ (X), ψ (Y)]

.

Proposition 4.

An action Ψ, either on the left or on the right, of a Lie group G on a smooth manifold M, automatically determines an action ψ of its Lie algebra

G

on that manifold, which associates to each

X \in G

the vector field

ψ (X)

on M, often denoted by

X_{M}

and called the fundamental vector field on M associated to X. It is defined by

ψ (X) (x) = X_{M} (x) = \frac{d}{d s} (Ψ_{exp (s X)} (x)) |_{s = 0}, x \in M,

with the following convention: ψ is a Lie algebras homomorphism when we take for Lie algebra

G

of the Lie group G the Lie algebra or right invariant vector fields on G if Ψ is an action on the left, and the Lie algebra of left invariant vector fields on G if Ψ is an action on the right.

Proof.

If Ψ is an action of G on M on the left (respectively, on the right), the vector field on G which is right invariant (respectively, left invariant) and whose value at e is X, and the associated fundamental vector field

X_{M}

on M, are compatible by the map

g \mapsto Ψ_{g} (x)

. Therefore the map

ψ : G \to A^{1} (M)

is a Lie algebras homomorphism, if we take for definition of the bracket on

G

the bracket of right invariant (respectively, left invariant) vector fields on G. ☐

Definition 12.

When M is a presymplectic (or a symplectic, or a Poisson) manifold, an action Ψ of a Lie group G (respectively, an action ψ of a Lie algebra

G

) on the manifold M is called a presymplectic (or a symplectic, or a Poisson) action if for each

g \in G

,

Ψ_{g}

is a presymplectic, or a symplectic, or a Poisson diffeomorphism of M (respectively, if for each

X \in G

,

ψ (X)

is a presymplectic, or a symplectic, or a Poisson vector field on M.

Definition 13.

An action ψ of a Lie algeba

G

on a presymplectic or symplectic manifold

(M, ω)

, or on a Poisson manifold

(M, Λ)

, is said to be Hamiltonian if for each

X \in G

, the vector field

ψ (X)

on M is Hamiltonian.

An action Ψ (either on the left or on the right) of a Lie group G on a presymplectic or symplectic manifold

(M, ω)

, or on a Poisson manifold

(M, Λ)

, is said to be Hamiltonian if that action is presymplectic, or symplectic, or Poisson (according to the structure of M), and if in addition the associated action of the Lie algebra

G

of G is Hamiltonian.

Remark 9.

A Hamiltonian action of a Lie group, or of a Lie algebra, on a presymplectic, symplectic or Poisson manifold, is automatically a presymplectic, symplectic or Poisson action. This result immediately follows from Proposition 3.

5.3. Momentum Maps of Hamiltonian Actions

Proposition 5.

Let ψ be a Hamiltonian action of a finite-dimensional Lie algebra

G

on a presymplectic, symplectic or Poisson manifold

(M, ω)

or

(M, Λ)

. There exists a smooth map

J : M \to G^{*}

, taking its values in the dual space

G^{*}

of the Lie algebra

G

, such that for each

X \in G

the Hamiltonian vector field

ψ (X)

on M admits as Hamiltonian the function

J_{X} : M \to R

, defined by

J_{X} (x) = 〈J (x), X〉, x \in M .

The map J is called a momentum map for the Lie algebra action ψ. When ψ is the action of the Lie algebra

G

of a Lie group G associated to a Hamiltonian action Ψ of a Lie group G, J is called a momentum map for the Hamiltonian Lie group action Ψ.

The proof of that result, which is easy, can be found for example in [8,9,10].

Remark 10.

The momentum map J is not unique:

when $(M, ω)$ is a connected symplectic manifold, J is determined up to addition of an arbitrary constant element in $G^{*}$ ;
when $(M, Λ)$ is a connected Poisson manifold, the momentum map J is determined up to addition of an arbitrary $G^{*}$ -valued smooth map which, coupled with any $X \in G$ , yields a Casimir of the Poisson algebra of $(M, Λ)$ , i.e., a smooth function on M whose Poisson bracket with any other smooth function on that manifold is the function identically equal to 0.

5.4. Noether’s Theorem in Hamiltonian Formalism

Theorem 5 (Noether’s Theorem in Hamiltonian Formalism).

Let

X_{f}

and

X_{g}

be two Hamiltonian vector fields on a presymplectic or symplectic manifold

(M, ω)

, or on a Poisson manifold

(M, Λ)

, which admit as Hamiltonians, respectively, the smooth functions f and g on the manifold M. The function f remains constant on each integral curve of

X_{g}

if and only if g remains constant on each integral curve of

X_{f}

.

Proof.

The function f is constant on each integral curve of

X_{g}

if and only if

i (X_{g}) d f = 0

, since each integral curve of

X_{g}

is connected. We can use the Poisson bracket, even when M is a presymplectic manifold, since the Poisson bracket of two Hamiltonians on a presymplectic manifold still can be defined. So we can write

i (X_{g}) d f = {g, f} = - {f, g} = - i (X_{f}) d g .

☐

Corollary 2 (of Noether’s Theorem in Hamiltonian Formalism).

Let

ψ : G \to A^{1} (M)

be a Hamiltonian action of a finite-dimensional Lie algebra

G

on a presymplectic or symplectic manifold

(M, ω)

, or on a Poisson manifold

(M, Λ)

, and let

J : M \to G^{*}

be a momentum map of this action. Let

X_{H}

be a Hamiltonian vector field on M admitting as Hamiltonian a smooth function H. If for each

X \in G

we have

i (ψ (X)) (d H) = 0

, the momentum map J remains constant on each integral curve of

X_{H}

.

Proof.

This result is obtained by applying Theorem 5 to the pairs of Hamiltonian vector fields made by

X_{H}

and each vector field associated to an element of a basis of

G

. ☐

5.5. Symplectic Cocycles

Theorem 6 (J. M. Souriau [14]).

Let Φ be a Hamiltonian action (either on the left or on the right) of a Lie group G on a connected symplectic manifold

(M, ω)

and let

J : M \to G^{*}

be a momentum map of this action. There exists an affine action A (either on the left or on the right) of the Lie group G on the dual

G^{*}

of its Lie algebra

G

such that the momentum map J is equivariant with respect to the actions Φ of G on M and A of G on

G^{*}

, i.e., such that

J \circ Φ_{g} (x) = A_{g} \circ J (x) f o r a l l g \in G, x \in M .

The action A can be written, with

g \in G

and

ξ \in G^{*}

,

\{\begin{matrix} A (g, ξ) = {Ad}_{g^{- 1}}^{*} (ξ) + θ (g) & if Φ is an action on the left, \\ A (ξ, g) = {Ad}_{g}^{*} (ξ) - θ (g^{- 1}) & if Φ is an action on the right . \end{matrix}

Proof.

Let us assume that Φ is an action on the left. The fundamental vector field

X_{M}

associated to each

X \in G

is Hamiltonian, with the function

J_{X} : M \to R

, given by

J_{X} (x) = 〈J (x), X〉, x \in M,

as Hamiltonian. For each

g \in G

the direct image

{(Φ_{g^{- 1}})}_{*} (X_{M})

of

X_{M}

by the symplectic diffeomorphism

Φ_{g^{- 1}}

is Hamiltonian, with

J_{X} \circ Φ_{g}

as Hamiltonian. An easy calculation shows that this vector field is the fundamental vector field associated to

{Ad}_{g^{- 1}} (X) \in G

. The function

x \mapsto 〈J (x), {Ad}_{g^{- 1}} (X)〉 = 〈{Ad}_{g^{- 1}}^{*} \circ J (x), X〉

is therefore a Hamiltonian for that vector field. These two functions defined on the connected manifold M, which both are admissible Hamiltonians for the same Hamiltonian vector field, differ only by a constant (which may depend on

g \in G

). We can set, for any

g \in G

,

θ (g) = J \circ Φ_{g} (x) - {Ad}_{g^{- 1}}^{*} \circ J (x)

and check that the map

A : G \times G^{*} \to G^{*}

defined in the statement is indeed an action for which J is equivariant.

A similar proof, with some changes of signs, holds when Φ is an action on the right. ☐

Proposition 6.

Under the assumptions and with the notations of Theorem 6, the map

θ : G \to G^{*}

is a cocycle of the Lie group G with values in

G^{*}

, for the coadjoint representation. It means that it satisfies, for all g and

h \in G

,

θ (g h) = θ (g) + {Ad}_{g^{- 1}}^{*} (θ (h)) .

More precisely θ is a symplectic cocycle. It means that its differential

T_{e} θ : T_{e} G \equiv G \to G^{*}

at the neutral element

e \in G

can be considered as a skew-symmetric bilinear form on

G

:

Θ (X, Y) = 〈T_{e} θ (X), Y〉 = - 〈T_{e} θ (Y), X〉 .

The skew-symmetric bilinear form Θ is a symplectic cocycle of the Lie algebra

G

. It means that it is skew-symmetric and satisfies, for all X, Y and

Z \in G

,

Θ ([X, Y], Z) + Θ ([Y, Z], X) + Θ ([Z, X], Y) = 0 .

Proof.

These properties easily follow from the fact that when Φ is an action on the left, for g and

h \in G

,

Φ_{g} \circ Φ_{h} = Φ_{g h}

(and a similar equality when Φ is an action on the right). The interested reader will find more details in [9,12,14]. ☐

Proposition 7.

Still under the assumptions and with the notations of Theorem 6, the composition law which associates to each pair

(f, g)

of smooth real-valued functions on

G^{*}

the function

{f, g}_{Θ}

given by

{f, g}_{Θ} (x) = 〈x, [d f (x), d g (x)]〉 - Θ (d f (x), d g (x)), x \in G^{*},

(

G

being identified with its bidual

G^{* *}

), determines a Poisson structure on

G^{*}

, and the momentum map

J : M \to G^{*}

is a Poisson map, M being endowed with the Poisson structure associated to its symplectic structure.

Proof.

The fact that the bracket

(f, g) \mapsto {f, g}_{Θ}

on

C^{\infty} (G^{*}, R)

is a Poisson bracket was already indicated in Example 4. It can be verified by easy calculations. The fact that J is a Poisson map can be proven by first looking at linear functions on

G^{*}

, i.e., elements in

G

. The reader will find a detailed proof in [12]. ☐

Remark 11.

When the momentum map J is replaced by another momentum map

J_{1} = J + μ

, where

μ \in G^{*}

is a constant, the symplectic Lie group cocycle θ and the symplectic Lie algebra cocycle Θ are replaced by

θ_{1}

and

Θ_{1}

, respectively, given by

\begin{matrix} θ_{1} (g) & = θ (g) + μ - {Ad}_{g^{- 1}}^{*} (μ), g \in G, \\ Θ_{1} (X, Y) & = Θ (X, Y) + 〈μ, [X, Y]〉, X and Y \in G . \end{matrix}

These formulae show that

θ_{1} - θ

and

Θ_{1} - Θ

are symplectic coboundaries of the Lie group G and the Lie algebra

G

. In other words, the cohomology classes of the cocycles θ and Θ only depend on the Hamiltonian action Φ of G on the symplectic manifold

(M, ω)

.

5.6. The Use of Symmetries in Hamiltonian Mechanics

5.6.1. Symmetries of the Phase Space

Hamiltonian Symmetries are often used for the search of solutions of the equations of motion of mechanical systems. The symmetries considered are those of the phase space of the mechanical system. This space is very often a symplectic manifold, either the cotangent bundle to the configuration space with its canonical symplectic structure, or a more general symplectic manifold. Sometimes, after some simplifications, the phase space is a Poisson manifold.

The Marsden-Weinstein reduction procedure [43,44] or one of its generalizations [10] is the method most often used to facilitate the determination of solutions of the equations of motion. In a first step, a possible value of the momentum map is chosen and the subset of the phase space on which the momentum map takes this value is determined. In a second step, that subset (when it is a smooth manifold) is quotiented by its isotropic foliation. The quotient manifold is a symplectic manifold of a dimension smaller than that of the original phase space, and one has an easier to solve Hamiltonian system on that reduced phase space.

When Hamiltonian symmetries are used for the reduction of the dimension of the phase space of a mechanical system, the symplectic cocycle of the Lie group of symmetries action, or of the Lie algebra of symmetries action, is almost always the zero cocycle.

For example, if the group of symmetries is the canonical lift to the cotangent bundle of a group of symmetries of the configuration space, not only the canonical symplectic form, but the Liouville 1-form of the cotangent bundle itself remains invariant under the action of the symmetry group, and this fact implies that the symplectic cohomology class of the action is zero.

5.6.2. Symmetries of the Space of Motions

A completely different way of using symmetries was initiated by Jean-Marie Souriau, who proposed to consider the symmetries of the manifold of motions of the mechanical system. He observed that the Lagrangian and Hamiltonian formalisms, in their usual formulations, involve the choice of a particular reference frame, in which the motion is described. This choice destroys a part of the natural symmetries of the system.

For example, in classical (non-relativistic) mechanics, the natural symmetry group of an isolated mechanical system must contain the symmetry group of the Galilean space-time, called the Galilean group. This group is of dimension 10. It contains not only the group of Euclidean displacements of space which is of dimension 6 and the group of time translations which is of dimension 1, but the group of linear changes of Galilean reference frames which is of dimension 3.

If we use the Lagrangian formalism or the Hamiltonian formalism, the Lagrangian or the Hamiltonian of the system depends on the reference frame: it is not invariant with respect to linear changes of Galilean reference frames.

It may seem strange to consider the set of all possible motions of a system, which is unknown as long as we have not determined all these possible motions. One may ask if it is really useful when we want to determine not all possible motions, but only one motion with prescribed initial data, since that motion is just one point of the (unknown) manifold of motion!

Souriau’s answers to this objection are the following.

1.: We know that the manifold of motions has a symplectic structure, and very often many things are known about its symmetry properties.
2.: In classical (non-relativistic) mechanics, there exists a natural mathematical object which does not depend on the choice of a particular reference frame (even if the decriptions given to that object by different observers depend on the reference frame used by these observers): it is the evolution space of the system.

The knowledge of the equations which govern the system’s evolution allows the full mathematical description of the evolution space, even when these equations are not yet solved.

Moreover, the symmetry properties of the evolution space are the same as those of the manifold of motions.

For example, the evolution space of a classical mechanical system with configuration manifold N is

in the Lagrangian formalism, the space $R \times T N$ endowed with the presymplectic form $d {\hat{ϖ}}_{L}$ , whose kernel is of dimension 1 when the Lagrangian L is hyper-regular,
in the Hamiltonian formalism, the space $R \times T^{*} N$ with the presymplectic form $d {\hat{ϖ}}_{H}$ , whose kernel too is of dimension 1.

The Poincaré-Cartan 1-form

{\hat{ϖ}}_{L}

in the Lagrangian formalism, or

{\hat{ϖ}}_{H}

in the Hamiltonian formalism, depends on the choice of a particular reference frame, made for using the Lagrangian or the Hamiltonian formalism. But their exterior differentials, the presymplectic forms

d {\hat{ϖ}}_{L}

or

d {\hat{ϖ}}_{H}

, do not depend on that choice, modulo a simple change of variables in the evolution space.

Souriau defined this presymplectic form in a framework more general than those of Lagrangian or Hamiltonian formalisms, and called it the Lagrange form. In this more general setting, it may not be an exact 2-form. Souriau proposed as a new Principle, the assumption that it always projects on the space of motions of the systems as a symplectic form, even in relativistic mechanics in which the definition of an evolution space is not clear. He called this new principle the Maxwell Principle.

Bargmann proved that the symplectic cohomology of the Galilean group is of dimension 1, and Souriau proved that the cohomology class of its action on the manifold of motions of an isolated classical (non-relativistic) mechanical system can be identified with the total mass of the system [14], Chapter III, p. 153.

Readers interested in the Galilean group and momentum maps of its actions are referred to the recent book by de Saxcé and Vallée [45].

6. Statistical Mechanics and Thermodynamics

6.1. Basic Concepts in Statistical Mechanics

During the XVIII–th and XIX–th centuries, the idea that material bodies (fluids as well as solids) are assemblies of a very large number of small, moving particles, began to be considered by some scientists, notably Daniel Bernoulli (1700–1782), Rudolf Clausius (1822–1888), James Clerk Maxwell (1831–1879) and Ludwig Eduardo Boltzmann (1844–1906), as a reasonable possibility. Attemps were made to explain the nature of some measurable macroscopic quantities (for example the temperature of a material body, the pressure exerted by a gas on the walls of the vessel in which it is contained), and the laws which govern the variations of these macroscopic quantities, by application of the laws of classical mechanics to the motions of these very small particles. Described in the framework of the Hamiltonian formalism, the material body is considered as a Hamiltonian system whose phase space is a very high dimensional symplectic manifold

(M, ω)

, since an element of that space gives a perfect information about the positions and the velocities of all the particles of the system. The experimental determination of the exact state of the system being impossible, one only can use the probability of presence, at each instant, of the state of the system in various parts of the phase space. Scientists introduced the concept of a statistical state, defined below.

Definition 14.

Let

(M, ω)

be a symplectic manifold. A statistical state is a probability measure μ on the manifold M.

6.1.1. The Liouville Measure on a Symplectic Manifold

On each symplectic manifold

(M, ω)

, with

dim M = 2 n

, there exists a positive measure

λ_{ω}

, called the Liouville measure. Let us briefly recall its definition. Let

(U, φ)

be a Darboux chart of

(M, ω)

Section 4.4.1. The open subset U of M is, by means of the diffeomorphism φ, identified with an open subset

φ (U)

of

R^{2 n}

on which the coordinates (Darboux coordinates) will be denoted by

(p_{1}, \dots, p_{n}, x^{1}, \dots, x^{n})

. With this identification, the Liouville measure (restricted to U) is simply the Lebesgue measure on the open subset

φ (U)

of

R^{2 n}

. In other words, for each Borel subset A of M contained in U, we have

λ_{ω} (A) = \int_{φ (A)} d p_{1} \dots d p_{n} d x^{1} \dots d x^{n} .

One can easily check that this definition does not depend on the choice of the Darboux coordinates

(p_{1}, \dots, p_{n}, x^{1}, \dots, x^{n})

on

φ (A)

. By using an atlas of Darboux charts on

(M, ω)

, one can easily define

λ_{ω} (A)

for any Borel subset A of M.

Definition 15.

A statistical state μ on the symplectic manifold

(M, ω)

is said to be continuous (respectively, is said to be smooth) if it has a continuous (respectively, a smooth) density with respect to the Liouville measure

λ_{ω}

, i.e., if there exists a continuous function (respectively, a smooth function)

ρ : M \to R

such that, for each Borel subset A of M

μ (A) = \int_{A} ρ d λ_{ω} .

Remark 12.

The density ρ of a continuous statistical state on

(M, ω)

takes its values in

R^{+}

and of course satisfies

\int_{M} ρ d λ_{ω} = 1 .

For simplicity we only consider in what follows continuous, very often even smooth statistical states.

6.1.2. Variation in Time of a Statistical State

Let H be a smooth time independent Hamiltonian on a symplectic manifold

(M, ω)

,

X_{H}

the associated Hamiltonian vector field and

Φ^{X_{H}}

its reduced flow. We consider the mechanical system whose time evolution is described by the flow of

X_{H}

.

If the state of the system at time

t_{0}

, assumed to be perfectly known, is a point

z_{0} \in M

, its state at time

t_{1}

is the point

z_{1} = Φ_{t_{1} - t_{0}}^{X_{H}} (z_{0})

.

Let us now assume that the state of the system at time

t_{0}

is not perfectly known, but that a continuous probability measure on the phase space M, whose density with respect to the Liouville measure

λ_{ω}

is

ρ_{0}

, describes the probability distribution of presence of the state of the system at time

t_{0}

. In other words,

ρ_{0}

is the density of the statistical state of the system at time

t_{0}

. For any other time

t_{1}

, the map

Φ_{t_{1} - t_{0}}^{X_{H}}

is a symplectomorphism, therefore leaves invariant the Liouville measure

λ_{ω}

. The probability density

ρ_{1}

of the statistical state of the system at time

t_{1}

therefore satisfies, for any

x_{0} \in M

for which

x_{1} = Φ_{t_{1} - t_{0}}^{X_{H}} (x_{0})

is defined,

ρ_{1} (x_{1}) = ρ_{1} (Φ_{t_{1} - t_{0}}^{X_{H}} (x_{0})) = ρ_{0} (x_{0}) .

Since

{(Φ_{t_{1} - t_{0}}^{X_{H}})}^{- 1} = Φ_{t_{0} - t_{1}}^{X_{H}}

, we can write

ρ_{1} = ρ_{0} \circ Φ_{t_{0} - t_{1}}^{X_{H}} .

Definition 16.

Let ρ be the density of a continuous statistical state μ on the symplectic manifold

(M, ω)

. The number

s (ρ) = \int_{M} ρ log (\frac{1}{ρ}) d λ_{ω}

is called the entropy of the statistical state μ or, with a slight abuse of language, the entropy of the density ρ.

Remark 13.

1.: By convention we state that 0 log0 = 0. With that convention the function $x \mapsto x log x$ is continuous on $R^{+}$ . If the integral on the right hand side of the equality which defines $s (ρ)$ does not converge, we state that $s (ρ) = - \infty$ . With these conventions, $s (ρ)$ exists for any continuous probability density ρ.
2.: The above Definition 16 of the entropy of a statistical state, founded on ideas developed by Boltzmann in his Kinetic Theory of Gases [46], specially in the derivation of his famous (and controversed) Theorem Êta, is too related with the ideas of Claude Shannon [47] on information theory. The use of information theory in thermodynamics was more recently proposed by Jaynes [48,49] and Mackey [18]. For a very nice discussion of the use of probability concepts in physics and application of information theory in quantum mechanics, the reader is referred to the paper by Balian [50].

The entropy

s (ρ)

of a probability density ρ has very remarkable variational properties discussed in the following definitions and proposition.

Definition 17.

Let ρ be the density of a smooth statistical state on a symplectic manifold

(M, ω)

.

1.

For each function f defined on M, taking its values in

R

or in some finite-dimensional vector space, such that the integral on the right hand side of the equality

E_{ρ} (f) = \int_{M} f ρ d λ_{ω}

converges, the value

E_{ρ} (f)

of that integral is called the mean value of f with respect to ρ.

2.

Let f be a smooth function on M, taking its values in

R

or in some finite-dimensional vector space, satisfying the properties stated above. A smooth infinitesimal variation of ρ with fixed mean value of f is a smooth map, defined on the product

] - ε, ε [\times M

, with values in

R^{+}

, where

ε > 0

,

(τ, z) \mapsto ρ (τ, z), τ \in] - ε, ε [, z \in M,

such that

for $τ = 0$ and any $z \in M$ , $ρ (0, z) = ρ (z)$ ,
for each $τ \in] - ε, ε [$ , $z \mapsto ρ_{τ} (z) = ρ (τ, z)$ is a smooth probability density on M such that

$E_{ρ_{τ}} (f) = \int_{M} ρ_{τ} f d λ_{ω} = E_{ρ} (f) .$

3.

The entropy function s is said to be stationary at the probability density ρ with respect to smooth infinitesimal variations of ρ with fixed mean value of f, if for any smooth infinitesimal variation

(τ, z) \mapsto ρ (τ, z)

of ρ with fixed mean value of f

\frac{d s (ρ_{τ})}{d τ} |_{τ = 0} = 0 .

Proposition 8.

Let

H : M \to R

be a smooth Hamiltonian on a symplectic manifold

(M, ω)

and ρ be the density of a smooth statistical state on M such that the integral defining the mean value

E_{ρ} (H)

of H with respect to ρ converges. The entropy function s is stationary at ρ with respect to smooth infinitesimal variations of ρ with fixed mean value of H, if and only if there exists a real

b \in R

such that, for all

z \in M

,

ρ (z) = \frac{1}{P (b)} exp (- b H (z)), with P (b) = \int_{M} exp (- b H) d λ_{ω} .

Proof.

Let

τ \mapsto ρ_{τ}

be a smooth infinitesimal variation of ρ with fixed mean value of H. Since

\int_{M} ρ_{τ} d λ_{ω}

and

\int_{M} ρ_{τ} H d λ_{ω}

do not depend on τ, it satisfies, for all

τ \in] - ε, ε [

,

\int_{M} \frac{\partial ρ (τ, z)}{\partial τ} d λ_{ω} (z) = 0, \int_{M} \frac{\partial ρ (τ, z)}{\partial τ} H (z) d λ_{ω} (z) = 0 .

Moreover an easy calculation leads to

\frac{d s (ρ_{τ})}{d τ} |_{τ = 0} = - \int_{M} \frac{\partial ρ (τ, z)}{\partial τ} |_{τ = 0} (1 + log (ρ (z)) d λ_{ω} (z) .

A well known result in calculus of variations shows that the entropy function s is stationary at ρ with respect to smooth infinitesimal variations of ρ with fixed mean value of H, if and only if there exist two real constants a and b, called Lagrange multipliers, such that, for all

z \in M

,

1 + log (ρ) + a + b H = 0,

which leads to

ρ = exp (- 1 - a - b H) .

By writing that

\int_{M} ρ d λ_{ω} = 1

, we see that a is determined by b:

exp (1 + a) = P (b) = \int_{M} exp (- b H) d λ_{ω} .

☐

Definition 18.

Let

H : M \to R

be a smooth Hamiltonian on a symplectic manifold

(M, ω)

. For each

b \in R

such that the integral on the right side of the equality

P (b) = \int_{M} exp (- b H) d λ_{ω}

converges, the smooth probability measure on M with density (with respect to the Liouville measure)

ρ (b) = \frac{1}{P (b)} exp (- b H)

is called the Gibbs statistical state associated to b. The function

P : b \mapsto P (b)

is called the partition function.

The following proposition shows that the entropy function, not only is stationary at any Gibbs statistical state, but in a certain sense attains at that state a strict maximum.

Proposition 9.

Let

H : M \to R

be a smooth Hamiltonian on a symplectic manifold

(M, ω)

and

b \in R

be such that the integral defining the value

P (b)

of the partition function P at b converges. Let

ρ_{b} = \frac{1}{P (b)} exp (- b H)

be the probability density of the Gibbs statistical state associated to b. We assume that the Hamiltonian H is bounded by below, i.e., that there exists a constant m such that

m \leq H (z)

for any

z \in M

. Then the integral defining

E_{ρ_{b}} (H) = \int_{M} ρ_{b} H d λ_{ω}

converges. For any other smooth probability density

ρ_{1}

such that

E_{ρ_{1}} (H) = E_{ρ_{b}} (H),

we have

s (ρ_{1}) \leq s (ρ_{b}),

and the equality

s (ρ_{1}) = s (ρ_{b})

holds if and only if

ρ_{1} = ρ_{b}

.

Proof.

Since

m \leq H

, the function

ρ_{b} exp (- b H)

satisfies

0 \leq ρ_{b} exp (- b H) \leq exp (- m b) ρ_{b}

, therefore is integrable on M. Let

ρ_{1}

be any smooth probability density on M satisfying

E_{ρ_{1}} (H) = E_{ρ_{b}} (H)

. The function defined on

R^{+}

x \mapsto h (x) = \{\begin{matrix} x log (\frac{1}{x}) & if x > 0 \\ 0 & if x = 0 \end{matrix}

being convex, its graph is below the tangent at any of its points

(x_{0}, h (x_{0}))

. We therefore have, for all

x > 0

and

x_{0} > 0

,

h (x) \leq h (x_{0}) - (1 + log x_{0}) (x - x_{0}) = x_{0} - x (1 + log x_{0}) .

With

x = ρ_{1} (z)

and

x_{0} = ρ_{b} (z)

, z being any element in M, that inequality becomes

h (ρ_{1} (z)) = ρ_{1} (z) log (\frac{1}{ρ_{1} (z)}) \leq ρ_{b} (z) - (1 + log ρ_{b} (z)) ρ_{1} (z) .

By integration over M, using the fact that

ρ_{b}

is the probability density of the Gibbs state associated to b, we obtain

s (ρ_{1}) \leq 1 - 1 - \int_{M} ρ_{1} log ρ_{b} d λ_{ω} = s (ρ_{b}) .

We have proven the inequality

s (ρ_{1}) \leq s (ρ_{b})

. If

ρ_{1} = ρ_{b}

, we have of course the equality

s (ρ_{1}) = s (ρ_{b})

. Conversely if

s (ρ_{1}) = s (ρ_{b})

, the functions defined on M

z \mapsto φ_{1} (z) = ρ_{1} (z) log (\frac{1}{ρ_{1} (z)}) and z \mapsto φ (z) = ρ_{b} (z) - (1 + log ρ_{b} (z)) ρ_{1} (z)

are continuous on M except, maybe, for φ, at points z at which

ρ_{b} (z) = 0

and

ρ_{1} (z) \neq 0

, but the set of such points is of measure 0 since φ is integrable. They satisfy the inequality

φ_{1} \leq φ

. Both are integrable on M and have the same integral. The function

φ - φ_{1}

is everywhere

\geq 0

, is integrable on M and its integral is 0. That function is therefore everywhere equal to 0 on M. We can write, for any

z \in M

,

ρ_{1} (z) log (\frac{1}{ρ_{1} (z)}) = ρ_{b} (z) - (1 + log ρ_{b} (z)) ρ_{1} (z) .

(6)

For each

z \in M

such that

ρ_{1} (z) \neq 0

, we can divide that equality by

ρ_{1} (z)

. We obtain

\frac{ρ_{b} (z)}{ρ_{1} (z)} - log (\frac{ρ_{b} (z)}{ρ_{1} (z)}) = 1 .

Since the function

x \mapsto x - log x

reaches its minimum, equal to 1, for a unique value of

x > 0

, that value being 1, we see that for each

z \in M

at which

ρ_{1} (z) > 0

, we have

ρ_{1} (z) = ρ_{b} (z)

. At points

z \in M

at which

ρ_{1} (z) = 0

, Equation (6) shows that

ρ_{b} (z) = 0

. Therefore

ρ_{1} = ρ_{b}

. ☐

Remark 14.

The maximality property of the entropy function

ρ \mapsto s (ρ)

at a Gibbs state density

ρ_{b}

proven in Proposition 9 of course implies the stationarity of that function at

ρ_{b}

with respect to smooth infinitesimal variations of ρ with fixed mean value of H, proven in Proposition 8. That Proposition therefore could be omitted. We chose to keep it because its proof is much easier than that of Proposition 9, and explains why it is interesting to look at probability densities proportional to exp

(- b H)

for some

b \in R

.

The following proposition shows that a Gibbs statistical state remains invariant under the flow of the Hamiltonian vector field

X_{H}

. One can therefore say that a Gibbs state is a statistical equilibrium state. Of course there exist statistical equilibrium states other than Gibbs states.

Proposition 10.

Let H be a smooth Hamiltonian bounded by below on a symplectic manifold

(M, ω)

,

b \in R

be such that the integral defining the value

P (b)

of the partition function P at b converges. The Gibbs state associated to b remains invariant under the flow of of the Hamiltonian vector field

X_{H}

.

Proof.

The density

ρ_{b}

of the Gibbs state associated to b, with respect to the Liouville measure

λ_{ω}

, is

ρ_{b} = \frac{1}{P (b)} exp (- b H) .

Since H is constant along each integral curve of

X_{H}

,

ρ_{b}

too is constant along each integral curve of

X_{H}

. Moreover, the Liouville measure

λ_{ω}

remains invariant under the flow of

X_{H}

. Therefore the Gibbs probability measure associated to b too remains invariant under that flow. ☐

6.2. Thermodynamic Equilibria and Thermodynamic Functions

6.2.1. Assumptions Made in this Section

Any Hamiltonian H defined on a symplectic manifold

(M, ω)

considered in this section will be assumed to be smooth, bounded by below and such that for any real

b > 0

, each one of the three functions, defined on M,

z \mapsto exp (- b H (z))

,

z \mapsto | H (z) | exp (- b H (z))

and

z \mapsto {(H (z))}^{2} exp (- b H (z))

is everywhere smaller than some function defined on M integrable with respect to the Liouville measure

λ_{ω}

. The integrals which define

P (b) = \int_{M} exp (- b H) d λ_{ω} and E_{ρ_{b}} (H) = \int_{M} H exp (- b H) d λ_{ω}

therefore converge.

Proposition 11.

Let H be a Hamiltonian defined on a symplectic manifold

(M, ω)

satisfying the assumptions indicated in Section 6.2.1. For any real

b > 0

let

P (b) = \int_{M} exp (- b H) d λ_{ω} a n d ρ_{b} = \frac{1}{P (b)} exp (- b H)

be the value at b of the partition function P and the probability density of the Gibbs statistical state associated to b, and

E (b) = E_{ρ_{b}} (H) = \frac{1}{P (b)} \int_{M} H exp (- b H) d λ_{ω}

be the mean value of H with respect to the probability density

ρ_{b}

. The first and second derivatives with respect to b of the partition function P exist, are continuous functions of b given by

\frac{d P (b)}{d b} = - P (b) E (b), \frac{d^{2} P (b)}{d b^{2}} = \int_{M} H^{2} exp (- b H) d λ_{ω} = P (b) E_{ρ_{b}} (H^{2}) .

The derivative with respect to b of the function E exists and is a continuous function of b given by

\frac{d E (b)}{d b} = - \frac{1}{P (b)} \int_{M} {(H - E_{ρ_{b}} (H))}^{2} d λ_{ω} = - E_{ρ_{b}} ({(H - E_{ρ_{b}} (H))}^{2}) .

Let

S (b)

be the entropy

s (ρ_{b})

of the Gibbs statistical state associated to b. The function S can be expressed in terms of P and E as

S (b) = log (P (b)) + b E (b) .

Its derivative with respect to b exists and is a continuous function of b given by

\frac{d S (b)}{d b} = b \frac{d E (b)}{d b} .

Proof.

Using the assumptions Section 6.2.1, we see that the functions

b \mapsto P (b)

and

b \mapsto E_{ρ_{b}} (H) = E (b)

, defined by integrals on M, have a derivative with respect to b which is continuous and which can be calculated by derivation under the sign

\int_{M}

. The indicated results easily follow, if we observe that for any function f on M such that

E_{ρ_{b}} (f)

and

E_{ρ_{b}} (f^{2})

exist, we have the formula, well known in Probability theory,

E_{ρ_{b}} (f^{2}) - {(E_{ρ_{b}} (f))}^{2} = E_{ρ_{b}} ({(f - E_{ρ_{b}} (f))}^{2}) .

☐

6.2.2. Physical Meaning of the Introduced Functions

Let us consider a physical system, for example a gas contained in a vessel bounded by rigid, thermally insulated walls, at rest in a Galilean reference frame. We assume that its evolution can be mathematically described by means of a Hamiltonian system on a symplectic manifold

(M, ω)

whose Hamiltonian H satisfies the assumptions Section 6.2.1. For physicists, a Gibbs statistical state, i.e., a probability measure of density

ρ_{b} = \frac{1}{P (b)} exp (- b H)

on M, is a thermodynamic equilibrium of the physical system. The set of possible thermodynamic equilibria of the system is therefore indexed by a real parameter

b > 0

. The following argument will show what physical meaning can have that parameter.

Let us consider two similar physical systems, mathematically described by two Hamiltonian systems, of Hamiltonians

H_{1}

on the symplectic manifold

(M_{1}, ω_{1})

and

H_{2}

on the symplectic manifold

(M_{2}, ω_{2})

. We first assume that they are independent and both in thermodynamic equilibrium, with different values

b_{1}

and

b_{2}

of the parameter b. We denote by

E_{1} (b_{1})

and

E_{2} (b_{2})

the mean values of

H_{1}

on the manifold

M_{1}

with respect to the Gibbs state of density

ρ_{1, b_{1}}

and of

H_{2}

on the manifold

M_{2}

with respect to the Gibbs state of density

ρ_{2, b_{2}}

. We assume now that the two systems are coupled in a way allowing an exchange of energy. For example, the two vessels containing the two gases can be separated by a wall allowing a heat transfer between them. Coupled together, they make a new physical system, mathematically described by a Hamiltonian system on the symplectic manifold

(M_{1} \times M_{2}, p_{1}^{*} ω_{1} + p_{2}^{*} ω_{2})

, where

p_{1} : M_{1} \times M_{2} \to M_{1}

and

p_{2} : M_{1} \times M_{2} \to M_{2}

are the canonical projections. The Hamiltonian of this new system can be made as close to

H_{1} \circ p_{1} + H_{2} \circ p_{2}

as one wishes, by making very small the coupling between the two systems. The mean value of the Hamiltonian of the new system is therefore very close to

E_{1} (b_{1}) + E_{2} (b_{2})

. When the total system will reach a state of thermodynamic equilibrium, the probability densities of the Gibbs states of its two parts,

ρ_{1, b^{'}}

on

M_{1}

and

ρ_{2, b^{'}}

on

M_{2}

will be indexed by the same real number

b^{'} > 0

, which must be such that

E_{1} (b^{'}) + E_{2} (b^{'}) = E_{1} (b_{1}) + E_{2} (b_{2}) .

By Proposition 11, we have, for all

b > 0

,

\frac{d E_{1} (b)}{d b} \leq 0, \frac{d E_{2} (b)}{d b} \leq 0 .

Therefore

b^{'}

must lie between

b_{1}

and

b_{2}

. If, for example,

b_{1} < b_{2}

, we see that

E_{1} (b^{'}) \leq E_{1} (b_{1})

and

E_{2} (b^{'}) \geq E_{2} (b_{2})

. In order to reach a state of thermodynamic equilibrium, energy must be transferred from the part of the system where b has the smallest value, towards the part of the system where b has the highest value, until, at thermodynamic equilibrium, b has the same value everywhere. Everyday experience shows that thermal energy flows from parts of a system where the temperature is higher, towards parts where it is lower. For this reason physicists consider the real variable b as a way to appreciate the temperature of a physical system in a state of thermodynamic equilibrium. More precisely, they state that

b = \frac{1}{k T}

where T is the absolute temperature and k a constant depending on the choice of units of energy and temperature, called Boltzmann’s constant in honour of the great Austrian scientist Ludwig Eduard Boltzmann (1844–1906).

For a physical system mathematically described by a Hamiltonian system on a symplectic manifold

(M, ω)

, with H as Hamiltonian, in a state of thermodynamic equilibrium,

E (b)

and

S (b)

are the internal energy and the entropy of the system.

6.2.3. Towards Thermodynamic Equilibrium

Everyday experience shows that a physical system, when submitted to external conditions which remain unchanged for a sufficiently long time, very often reaches a state of thermodynamic equilibrium. At first look, it seems that Lagrangian or Hamiltonian systems with time-independent Lagrangians or Hamiltonians cannot exhibit a similar behaviour. Let us indeed consider a mechanical system whose configuration space is a smooth manifold N, described in the Lagrangian formalism by a smooth time-independent hyper-regular Lagarangian

L : T N \to R

or, in the Hamiltonian formalism, by the associated Hamiltonian

H_{L} : T^{*} N \to R

. Let

t \mapsto \vec{x (t)}

be a motion of that system,

\vec{x_{0}} = \vec{x (t_{0})}

and

\vec{x_{1}} = \vec{x (t_{0})}

be the configurations of the system for that motion at times

t_{0}

and

t_{1}

. There exists another motion

t \mapsto \vec{x^{'} (t)}

of the system for which

\vec{x^{'} (t_{0})} = \vec{x_{1}}

and

\vec{x^{'} (t_{1})} = \vec{x_{0}}

: since the equations of motion are invariant by time reversal, the motion

t \mapsto \vec{x^{'} (t)}

is obtained simply by taking as initial condition at time

t_{0}

\vec{x^{'} (t_{0})} = \vec{x (t_{1})}

and

\frac{d \vec{x^{'} (t)}}{d t} |_{t = t_{0}} = - \frac{d \vec{x (t)}}{d t} |_{t = t_{1}}

. Another more serious argument against a kind of thermodynamic behaviour of Lagarangian or Hamiltonian systems rests on the famous recurrence theorem due to Poincaré [51]. This theorem asserts indeed that when the useful part of the phase space of the system is of a finite total measure, almost all points in an arbitrarily small open subset of the phase space are recurrent, i.e., the motion starting of such a point at time

t_{0}

repeatedly crosses that open subset again and again, infinitely many times when

t \to + \infty

.

Let us now consider, instead of perfectly defined states, i.e., points in phase space, statistical states, and ask the question: When at time

t = t_{0}

a Hamiltonian system on a symplectic manifold

(M, ω)

is in a statistical state given by some probability measure of density

ρ_{0}

with respect to the Liouville measure

λ_{ω}

, does its statistical state converge, when

t \to + \infty

, towards the probability measure of a Gibbs state? This question should be made more precise by specifying what physical meaning has a statistical state and in what mathematical sense a statistical state can converge towards the probability measure of a Gibbs state. A positive partial answer was given by Ludwig Boltzmann when, developing his kinetic theory of gases, he proved his famous (but controversed) Êta theorem stating that the entropy of the statistical state of a gas of small particles is a monotonously increasing function of time. This question, linked with time irreversibility in physics, is still the subject of important researches, both by physicists and by mathematicians. The reader is referred to the paper [50] by Balian for a more thorough discussion of that question.

6.3. Examples of Thermodynamic Equilibria

6.3.1. Classical Monoatomic Ideal Gas

In classical mechanics, a dilute gas contained in a vessel at rest in a Galilean reference frame is mathematically described by a Hamiltonian system made by a large number of very small massive particles, which interact by very brief collisions between themselves or with the walls of the vessel, whose motions between two collisions are free. Let us first assume that these particles are material points and that no external field is acting on them, other than that describing the interactions by collisions with the walls of the vessel.

The Hamiltonian of one particle in a part of the phase space in which its motion is free is simply

\frac{1}{2 m} {∥ \vec{p} ∥}^{2} = \frac{1}{2 m} (p_{1}^{2} + p_{2}^{2} + p_{3}^{2}), with \vec{p} = m \vec{v},

where m is the mass of the particle,

\vec{v}

its velocity vector and

\vec{p}

its linear momentum vector (in the considered Galilean reference frame),

p_{1}

,

p_{2}

and

p_{3}

the components of

\vec{p}

in a fixed orhtonormal basis of the physical space.

Let N be the total number of particles, which may not have all the same mass. We use a integer

i \in {1, 2, \dots, N}

to label the particles and denote by

m_{i}

,

\vec{x_{i}}

,

\vec{v_{i}}

,

\vec{p_{i}}

the mass and the vectors position, velocity and linear momentum of the i-th particle.

The Hamiltonian of the gas is therefore

H = \sum_{i = 1}^{N} \frac{1}{2 m_{i}} {∥ \vec{p_{i}} ∥}^{2} + terms involving the collisions between particles and with the walls .

Interactions of the particles with the walls of the vessel are essential for allowing the motions of particles to remain confined. Interactions between particles are essential to allow the exchanges between them of energy and momentum, which play an important part in the evolution with time of the statistical state of the system. However it appears that while these terms are very important to determine the system’s evolution with time, they can be neglected, when the gas is dilute enough, if we only want to determine the final statistical state of the system, once a thermodynamic equilibrium is established. The Hamiltonian used will therefore be

H = \sum_{i = 1}^{N} \frac{1}{2 m_{i}} {∥ \vec{p_{i}} ∥}^{2} .

The partition function is

P (b) = \int_{M} exp (- b H) d λ_{ω} = \int_{D} exp (- b \sum_{i = 1}^{N} \frac{1}{2 m_{i}} {∥ {\vec{p}}_{i} ∥}^{2}) \prod_{i = 1}^{N} (d \vec{x_{i}} d \vec{p_{i}}),

where D is the domain of the

6 N

-dimensional space spanned by the position vectors

\vec{x_{i}}

and linear momentum vectors

\vec{p_{i}}

of the particles in which all the

\vec{x_{i}}

lie within the vessel containing the gas. An easy calculation leads to

P (b) = V^{N} {(\frac{2 π}{b})}^{3 N / 2} \prod_{i = 1}^{N} ({m_{i}}^{3 / 2}) = \prod_{i = 1}^{N} [V {(\frac{2 π m_{i}}{b})}^{3 / 2}],

where V is the volume of the vessel which contains the gas. The probability density of the Gibbs state associated to b, with respect to the Liouville measure, therefore is

ρ_{b} = \prod_{i = 1}^{N} [\frac{1}{V} {(\frac{b}{2 π m_{i}})}^{3 / 2} exp (\frac{- b ∥ \vec{p_{i}} ∥^{2}}{2 m_{i}})] .

We observe that

ρ_{b}

is the product of the probability densities

ρ_{i, b}

for the i-th particle

ρ_{i, b} = \frac{1}{V} {(\frac{b}{2 π m_{i}})}^{3 / 2} exp (\frac{- b ∥ \vec{p_{i}} ∥^{2}}{2 m_{i}}) .

The

2 N

stochastic vectors

\vec{x_{i}}

and

\vec{p_{i}}

,

i = 1, \dots, N

are therefore independent. The position

\vec{x_{i}}

of the i-th particle is uniformly distributed in the volume of the vessel, while the probability measure of its linear momentum

\vec{p_{i}}

is the classical Maxwell–Boltzmann probability distribution of linear momentum for an ideal gas of particles of mass

m_{i}

, first obtained by Maxwell in 1860. Moreover we see that the three components

p_{i 1}

,

p_{i 2}

and

p_{i 3}

of the linear momentum

\vec{p_{i}}

in an orhonormal basis of the physical space are independent stochastic variables.

By using the formulae given in Proposition 11 the internal energy

E (b)

and the entropy

S (b)

of the gas can be easily deduced from the partition function

P (b)

. Their expressions are

E (b) = \frac{3 N}{2 b}, S (b) = \frac{3}{2} \sum_{i = 1}^{N} log m_{i} + (\frac{3}{2} (1 + log (2 π)) + log V) N - \frac{3 N}{2} log b .

We see that each of the N particles present in the gas has the same contribution

\frac{3}{2 b}

to the internal energy

E (b)

, which does not depend on the mass of the particle. Even more: each degree of freedom of each particle, i.e., each of the the three components of the the linear momentum of the particle on the three axes of an orthonormal basis, has the same contribution

\frac{1}{2 b}

to the internal energy

E (b)

. This result is known in physics under the name Theorem of equipartition of the energy at a thermodynamic equilibrium. It can be easily generalized for polyatomic gases, in which a particle may carry, in addition to the kinetic energy due to the velocity of its centre of mass, a kinetic energy due to the particle’s rotation around its centre of mass. The reader can consult the books by Souriau [14] and Mackey [18] where the kinetic theory of polyatomic gases is discussed.

The pressure in the gas, denoted by

Π (b)

because the notation

P (b)

is already used for the partition function, is due to the change of linear momentum of the particles which occurs at a collision of the particle with the walls of the vessel containing the gas (or with a probe used to measure that pressure). A classical argument in the kinetic theory of gases (see for example [52,53]) leads to

Π (b) = \frac{2}{3} \frac{E (b)}{V} = \frac{N}{V b} .

This formula is the well known equation of state of an ideal monoatomic gas relating the number of particles by unit of volume, the pressure and the temperature.

With

b = \frac{1}{k T}

, the above expressions are exactly those used in classical thermodynamics for an ideal monoatomic gas.

6.3.2. Classical Ideal Monoatomic Gas in a Gravity Field

Let us now assume that the gas, contained in a cylindrical vessel of section Σ and length h, with a vertical axis, is submitted to the vertical gravity field of intensity g directed downwards. We choose Cartesian coordinates x, y, z, the z axis being vertical directed upwards, the bottom of the vessel being in the horizontal surface

z = 0

. The Hamiltonian of a free particle of mass m, position and linear momentum vectors

\vec{x}

(components x, y, z) and

\vec{p}

(components

p_{x}

,

p_{y}

and

p_{z}

) is

\frac{1}{2 m} (p_{x}^{2} + p_{y}^{2} + p_{z}^{2}) + m g z .

As in the previous section we neglect the parts of the Hamiltonian of the gas corresponding to collisions between the particles, or between a particle and the walls of the vessel. The Hamiltonian of the gas is therefore

H = \sum_{i = 1}^{N} (\frac{1}{2 m_{i}} (p_{i x}^{2} + p_{i y}^{2} + p_{i z}^{2}) + m_{i} g z_{i}) .

Calculations similar to those of the previous section lead to

\begin{matrix} P (b) & = \prod_{i = 1}^{N} [Σ {(\frac{2 π m_{i}}{b})}^{3 / 2} \frac{1 - exp (- m_{i} g b h)}{m_{i} g b}], \\ ρ_{b} & = \frac{1}{P (b)} exp [- b \sum_{i = 1}^{N} (\frac{∥ \vec{p_{i}} ∥^{2}}{2 m_{i}} + m_{i} g z_{i})] . \end{matrix}

The expression of

ρ_{b}

shows that the

2 N

stochastic vectors

\vec{x_{i}}

and

\vec{p_{i}}

still are independent, and that for each

i \in {1, \dots, N}

, the probability law of each stochastic vector

\vec{p_{i}}

is the same as in the absence of gravity, for the same value of b. Each stochastic vector

\vec{x_{i}}

is no more uniformly distributed in the vessel containing the gas: its probability density is higher at lower altitudes z, and this nonuniformity is more important for the heavier particles than for the lighter ones.

As in the previous section, the formulae given in Proposition 11 allow the calculation of

E (b)

and

S (b)

. We observe that

E (b)

now includes the potential energy of the gas in the gravity field, therefore should no more be called the internal energy of the gas.

6.3.3. Relativistic Monoatomic Ideal Gas

In a Galilean reference frame, we consider a relativistic point particle of rest mass m, moving at a velocity

\vec{v}

. We denote by v the modulus of

\vec{v}

and by c the modulus of the velocity of light. The motion of the particle can be mathematically described by means of the Euler–Lagrange equations, with the Lagrangian

L = - m c^{2} \sqrt{1 - \frac{v^{2}}{c^{2}}} .

The components of the linear momentum

\vec{p}

of the particle, in an orthonormal frame at rest in the considered Galilean reference frame, are

p_{i} = \frac{\partial L}{\partial v^{i}} = \frac{m v^{i}}{\sqrt{1 - \frac{v^{2}}{c^{2}}}}, therefore \vec{p} = \frac{m \vec{v}}{\sqrt{1 - \frac{v^{2}}{c^{2}}}} .

Denoting by p the modulus of

\vec{p}

, the Hamiltonian of the particle is

H = \vec{p} \cdot \vec{v} - L = \frac{m c^{2}}{\sqrt{1 - \frac{v^{2}}{c^{2}}}} = c \sqrt{p^{2} + m^{2} c^{2}} .

Let us consider a relativistic gas, made of N point particles indexed by

i \in {1, \dots, N}

,

m_{i}

being the rest mass of the i-th particle. With the same assumptions as those made in Section 6.3.1, we can take for Hamiltonian of the gas

H = c \sum_{i = 1}^{N} \sqrt{{p_{i}}^{2} + m^{2} c^{2}} .

With the same notations as those of Section 6.3.1, the partition function P of the gas takes the value, for each

b > 0

,

P (b) = \int_{D} exp (- b c \sum_{i = 1}^{N} \sqrt{{(p_{i})}^{2} + m^{2} c^{2}}) \prod_{i = 1}^{N} (d \vec{x_{i}} d \vec{p_{i}}) .

This integral can be expressed in terms of the Bessel function

K_{2}

, whose expression is, for each

x > 0

,

K_{2} (x) = x \int_{0}^{+ \infty} exp (- x ch χ) {sh}^{2} χ ch χ d χ .

We have

\begin{matrix} P (b) & = {(\frac{4 π V c}{b})}^{N} \prod_{i = 1}^{N} ({m_{i}}^{2} K_{2} (m_{i} b c^{2})), \\ ρ_{b} & = \frac{1}{P (b)} exp (- b c \sum_{i = 1}^{N} \sqrt{{p_{i}}^{2} + {m_{i}}^{2} c^{2}}) . \end{matrix}

This probability density of the Gibbs state shows that the

2 N

stochastic vectors

\vec{x_{i}}

and

\vec{p_{i}}

are independent, that each

\vec{x_{i}}

is uniformly distributed in the vessel containing the gas and that the probability density of each

\vec{p_{i}}

is exactly the probability distribution of the linear momentum of particles in a relativistic gas called the Maxwell–Jüttner distribution, obtained by Ferencz Jüttner (1878–1958) in 1911, discussed in the book by the Irish mathematician and physicist Synge [54].

Of course, the formulae given in Proposition 11 allow the calculation of the internal energy

E (b)

, the entropy

S (b)

and the pressure

Π (b)

of the relativistic gas.

6.3.4. Relativistic IDeal Gas of Massless Particles

We have seen in the previous Chapter that in an inertial reference frame, the Hamiltonian of a relativistic point particle of rest mass m is

c \sqrt{p^{2} + m^{2} c^{2}}

, where p is the modulus of the linear momentum vector

\vec{p}

of the particle in the considered reference frame. This expression still has a meaning when the rest mass m of the particle is 0. In an orthonormal reference frame, the equations of motion of a particle whose motion is mathematically described by a Hamiltonian system with Hamiltonian

H = c p = c \sqrt{{p_{1}}^{2} + {p_{2}}^{2} + {p_{3}}^{2}}

are

\{\begin{matrix} \frac{d x^{i}}{d t} & = \frac{\partial H}{\partial p_{i}} = c \frac{p_{i}}{p} \\ \frac{d p_{i}}{d t} & = - \frac{\partial H}{\partial x^{i}} = 0, \end{matrix} (1 \leq i \leq 3),

which shows that the particle moves on a straight line at the velocity of light c. It seems therefore reasonable to describe a gas of N photons in a vessel of volume V at rest in an inertial reference frame by a Hamiltonian system, with the Hamiltonian

H = c \sum_{i = 1}^{N} ∥ \vec{p_{i}} ∥ = c \sum_{i = 1}^{N} \sqrt{{p_{i 1}}^{2} + {p_{i 2}}^{2} + {p_{i 3}}^{2}} .

With the same notations as those used in the previous section, the partition function P of the gas takes the value, for each

b > 0

,

P (b) = \int_{D} exp (- b c \sum_{i = 1}^{N} ∥ \vec{p_{i}} ∥) \prod_{i = 1}^{N} (d \vec{x_{i}} d \vec{p_{i}}) = {(\frac{8 π V}{c^{3} b^{3}})}^{N} .

The probability density of the corresponding Gibbs state, with respect to the Liouville measure

λ_{ω} = \prod_{i = 1}^{N} (d \vec{x_{i}} d \vec{p_{i}})

, is

ρ_{b} = \prod_{i = 1}^{N} (\frac{c^{3} b^{3}}{8 π V}) exp (- b c ∥ \vec{p_{i}} ∥) .

This formula appears in the books by Synge [54] and Souriau [14]. Physicists consider it as not adequate for the description of a gas of photons contained in a vessel at thermal equilibrium because the number of photons in the vessel, at any given temperature, cannot be imposed: it results from the processes of absorption and emission of photons by the walls of the vessel, heated at the imposed temperature, which spontaneously occur. In other words, this number is a stochastic function whose probability law is imposed by Nature. Souriau proposes, in his book [14], a way to account for the possible variation of the number of photons. Instead of using the phase space of the system of N massless relativistic particles contained in a vessel, he uses the manifold of motions

M_{N}

of that system (which is symplectomorphic to its phase space). He considers that the manifold of motions M of a system of photons in the vessel is the disjoint union

M = ⋃_{N \in N} M_{N},

of all the manifolds of motions

M_{N}

of a system of N massless relativistic particles in the vessel, for all possible values of

N \in N

. Fo

N = 0

the manifold

M_{0}

is reduced to a singleton with, as Liouville measure, the measure which takes the value 1 on the only non empty part of that manifold (the whole manifold

M_{0}

). Moreover, since any photon cannot be distinguished from any other photon, two motions of the system with the same number N of massless particles which only differ by the labelling of these particles must be considered as identical. Souriau considers too that since the number N of photons freely adjusts itself, the value of the parameter

b = \frac{1}{k T}

must, at thermodynamic equilibrium, be the same in all parts

M_{N}

of the system,

N \in N

. He uses too the fact that a photon can have two different states of (circular) polarization. With these assumptions the value at any b of the partition function of the system is

P (b) = \sum_{N = 0}^{+ \infty} \frac{1}{N!} {(\frac{16 π V}{c^{3} b^{3}})}^{N} = exp (\frac{16 π V}{c^{3} b^{3}}) .

The number N of photons in the vessel at thermodynamic equilibrium is a stochastic function which takes the value n with the probability

Probability ([N = n]) = \frac{1}{n!} {(\frac{16 π V}{c^{3} b^{3}})}^{n} exp (- \frac{16 π V}{c^{3} b^{3}}) .

The expression of the partition function P allows the calculation of the internal energy, the entropy and all other thermodynamic functions of the system. However, the formula so obtained for the distribution of photons of various energies at a given temperature does not agree with the law, in very good agreement with experiments, obtained by Max Planck (1858–1947) in 1900. An assembly of photons in thermodynamic equilibrium evidently cannot be described as a classical Hamiltonian system. This fact played an important part for the development of quantum mechanics.

6.3.5. Specific Heat of Solids

The motion of a one-dimensional harmonic oscillator can be described by a Hamiltonian system with, as Hamiltonian,

H (p, q) = \frac{p^{2}}{2 m} + \frac{μ q^{2}}{2} .

The idea that the heat energy of a solid comes from the small vibrations, at a microscopic scale, of its constitutive atoms, lead physicists to attempt to mathematically describe a solid as an assembly of a large number N of three-dimensional harmonic oscillators. By dealing separately with each proper oscillation mode, the solid can even be described as an assembly of

3 N

one-dimensional harmonic oscillators. Exanges of energy between these oscillators is allowed by the existence of small couplings between them. However, for the determination of the thermodynamic equilibria of the solid we will, as in the previous section for ideal gases, consider as negligible the energy of interactions between the oscillators. We therefore take for Hamiltonian of the solid

H = \sum_{i = 1}^{3 N} (\frac{{p_{i}}^{2}}{2 m_{i}} + \frac{μ_{i} {q_{i}}^{2}}{2}) .

The value of the paritition function P, for any

b > 0

, is

P (b) = \int_{R^{6 N}} exp [- b \sum_{i = 1}^{3 N} (\frac{{p_{i}}^{2}}{2 m_{i}} + \frac{μ_{i} {q_{i}}^{2}}{2})] \prod_{i = 1}^{3 N} (d p_{i} d q_{i}) = \prod_{i = 1}^{3 N} (\frac{1}{ν_{i}}) b^{- 3 N},

where

ν_{i} = \frac{1}{2 π} \sqrt{\frac{μ_{i}}{m_{i}}}

is the frequency of the i-th harmonic oscillator.

The internal energy of the solid is

E (b) = - \frac{d log P (b)}{d b} = \frac{3 N}{b} .

We observe that it only depends on the the temperature and on the number of atoms in the solid, not on the frequencies

ν_{i}

of the harmonic oscillators. With

b = \frac{1}{k T}

this result is in agreement with the empirical law for the specific heat of solids, in good agreement with experiments at high temperature, discovered in 1819 by the French scientists Pierre Louis Dulong (1785–1838) and Alexis Thérèse Petit (1791–1820).

7. Generalization for Hamiltonian Actions

7.1. Generalized Gibbs States

In his book [15] and in several papers [13,16,17], Souriau extends the concept of a Gibbs state for a Hamiltonian action of a Lie group G on a symplectic manifold

(M, ω)

. Usual Gibbs states defined in Section 6 for a smooth Hamiltonian H on a symplectic manifold

(M, ω)

appear as special cases, in which the Lie group is a one-parameter group. If the symplectic manifold

(M, ω)

is the phase space of the Hamiltonian system, that one-parameter group, whose parameter is the time t, is the group of evolution, as a function of time, of the state of the system, starting from its state at some arbitrarily chosen initial time

t_{0}

. If

(M, ω)

is the symplectic manifold of all the motions of the system, that one-parameter group, whose parameter is a real

τ \in R

, is the transformation group which maps one motion of the system with some initial state at time

t_{0}

onto the motion of the system with the same initial state at another time

(t_{0} + τ)

. We discuss below this generalization.

Notations and Conventions

In this section,

Φ : G \times M \to M

is a Hamiltonian action (for example on the left) of a Lie group G on a symplectic manifold

(M, ω)

. We denote by

G

the Lie algebra of G, by

G^{*}

its dual space and by

J : M \to G^{*}

a momentum map of the action Φ.

Definition 19.

Let

b \in G

be such that the integrals on the right hand sides of the equalities

\begin{matrix} P (b) & = \int_{M} exp (- 〈 J, b 〉) d λ_{ω} a n d \\ E_{J} (b) & = E_{ρ_{b}} (J) = \frac{1}{P (b)} \int_{M} J exp (- 〈 J, b 〉) d λ_{ω} \end{matrix}

converge. The smooth probability measure on M with density (with respect to the Liouville measure

λ_{ω}

on M)

ρ_{b} = \frac{1}{P (b)} exp (- 〈 J, b 〉)

is called the generalized Gibbs statistical state associated to b. The functions

b \mapsto P (b)

and

b \mapsto E_{J} (b)

so defined on the subset of

G

made by elements b for which the integrals defining

P (b)

and

E_{J} (b)

converge are called the partition function associated to the momentum map J and the mean value of J at generalized Gibbs states.

The following Proposition generalizes 9.

Proposition 12.

Let

b \in G

be such that the integrals defining

P (b)

and

E_{J} (b)

in Definition 19 converge, and

ρ_{b}

be the density of the generalized Gibbs state associated to b. The entropy

s (ρ_{b})

, which will be denoted by

S (b)

, exists and is given by

S (b) = log (P (b)) + 〈E_{J} (b), b〉 = log (P (b)) - 〈D (log P (b)), b〉 .

(7)

Moreover, for any other smooth probability density

ρ_{1}

such that

E_{ρ_{1}} (J) = E_{ρ_{b}} (J) = E_{J} (b),

we have

s (ρ_{1}) \leq s (ρ_{b}),

and the equality

s (ρ_{1}) = s (ρ_{b})

holds if and only if

ρ_{1} = ρ_{b}

.

Proof.

Equation (7) follows from

log (\frac{1}{ρ_{b}}) = log (P (b)) + 〈 J, b 〉

, and

D (log P (b)) = - E_{J} (b)

. The remaining of the proof is the same as that of Proposition 9. ☐

Remark 15.

1.: The second part of Equation (7), $S (b) =$ log $(P (b)) - 〈D (log P (b)), b〉$ , expresses the fact that the functions log $(P (b))$ and $- S (b)$ are Legendre transforms of each other: they are linked by the same relation as the relation which links a smooth Lagrangian L and the associated energy $E_{L}$ .
2.: The Liouville measure $λ_{ω}$ remains invariant under the Hamiltonian action Φ, since the symplectic form ω itself remains invariant under that action. However, we have not a full analogue of Proposition 10 because the momentum map J does not remain invariant under the action Φ. We only have the partial anologue stated below.
3.: Legendre transforms were used by Massieu in thermodynamics in his very early works [55,56], more systematically presented in [57], in which he introduced his characteristic functions (today called thermodynamic potentials) allowing the determination of all the thermodynamic functions of a physical system by partial derivations of a suitably chosen characteristic function. For a modern presentation of that subject the reader is referred to [58,59], Chapter 5, pp. 131–152.

Proposition 13.

Let

b \in G

be such that the integrals defining

P (b)

and

E_{J} (b)

in Definition 19 converge. The generalized Gibbs state associated to b remains invariant under the restriction of the Hamiltonian action Φ to the one-parameter subgroup of G generated by b, {exp

(τ b) | τ \in R}

.

Proof.

The orbits of the action on M of the subgroup

\{exp (τ b) | τ \in R\}

of G are the integral curves of the Hamiltonian vector field whose Hamiltonian is

〈 J, b 〉

, which of course is constant on each of these curves. Therefore the proof of Proposition 10 is valid for that subgroup. ☐

7.2. Generalized Thermodynamic Functions

Assumptions Made in this Section

Notations and conventions being the same as in Section 7.1, let Ω be the largest open subset of the Lie algebra

G

of G containing all

b \in G

satisfying the following properties:

the functions defined on M, with values, respectively, in $R$ and in the dual $G^{*}$ of $G$ ,

$z \mapsto exp (- 〈J (z), b〉) and z \mapsto J (z) exp (- 〈J (z), b〉)$

are integrable on M with respect to the Liouville measure $λ_{ω}$ ;
moreover their integrals are differentiable with respect to b, their differentials are continuous and can be calculated by differentiation under the sign $\int_{M}$ .

It is assumed in this section that the considered Hamiltonian action Φ of the Lie group G on the symplectic manifold

(M, ω)

and its momentum map J are such that the open subset Ω of

G

is not empty. This condition is not always satisfied when

(M, ω)

is a cotangent bundle, but of course it is satisfied when it is a compact manifold.

Proposition 14.

Let

Φ : G \times M \to M

be a Hamiltonian action of a Lie group G on a symplectic manifold

(M, ω)

satisfying the assumptions indicated in Section 7.2. The partition function P associated to the momentum map J and the mean value

E_{J}

of J for generalized Gibbs states Definition 19 are defined and continuously differentiable on the open subset Ω of

G

. For each

b \in Ω

, the differentials at b of the functions P and log P (which are linear maps defined on

G

, with values in

R

, in other words elements of

G^{*}

) are given by

D P (b) = - P (b) E_{J} (b), D (log P) (b) = - E_{J} (b) .

For each

b \in Ω

, the differential at b of the map

E_{J}

(which is a linear map defined on

G

, with values in its dual

G^{*}

) is given by

〈D E_{J} (b) (Y), Z〉 = 〈E_{J} (b), Y〉 〈E_{J} (b), Z〉 - E_{ρ_{b}} (〈 J, Y 〉 〈 J, Z 〉), with Y and Z \in G,

where we have written, as in Definition 17,

E_{ρ_{b}} (〈 J, Y 〉 〈 J, Z 〉) = \frac{1}{P (b)} \int_{M} 〈 J, Y 〉 〈 J, Z 〉 exp (- 〈 J, b 〉) d λ_{ω} .

At each

b \in Ω

, the differential of the entropy function S Proposition 12, which is a linear map defined on

G

, with values in

R

, in other words an element of

G^{*}

, is given by

〈D S (b), Y〉 = 〈D E_{J} (b) (Y), b〉, Y \in G .

Proof.

By assumptions Section 7.2, the differentials of P and

E_{J}

can be calculated by differentiation under the sign

\int_{M}

. Easy (but tedious) calculations lead to the indicated results. ☐

Corollary 3.

With the same assumptions and notations as those in Proposition 14, for any

b \in Ω

and

Y \in G

,

〈D E_{J} (b) (Y), Y〉 = - \frac{1}{P (b)} \int_{M} {〈J - E_{J} (b), Y〉}^{2} d λ_{ω} \leq 0 .

Proof.

This result follows from the well known result in Probability theory already used in the proof of Proposition 11. ☐

The momentum map J of the Hamiltonian action Φ is not uniquely determined: for any constant

μ \in G^{*}

,

J_{1} = J + μ

too is a momentum map for Φ. The following proposition indicates how the generalized thermodynamic functions P,

E_{J}

and S change when J is replaced by

J_{1}

.

Proposition 15.

With the same assumptions and notations as those in Proposition 14, let

μ \in G^{*}

be a constant. When the momentum map J is replaced by

J_{1} = J + μ

, the open subset Ω of

G

remains unchanged, while the generalized thermodynamic functions P,

E_{J}

and S, are replaced, respectively, by

P_{1}

,

E_{J_{1}}

and

S_{1}

, given by

P_{1} (b) = exp (- 〈 μ, b 〉) P (b), E_{J_{1}} (b) = E_{J} (b) + μ, S_{1} (b) = S (b) .

The Gibbs satistical state and its density

ρ_{b}

with respect to the Liouville measure

λ_{ω}

remain unchanged.

Proof.

We have

exp (- 〈 J + μ, b 〉) = exp (- 〈 μ, b 〉) exp (- 〈 J, b 〉) .

The indicated results follow by easy calculations. ☐

The following proposition indicates how the generalized thermodynamic functions P,

E_{J}

and S vary along orbits of the adjoint action of the Lie group G on its Lie algebra

G

.

Proposition 16.

The assumptions and notations are the same as those in Proposition 14. The open subset Ω of

G

is an union of orbits of the adjoint action of G on

G

. In other words, for each

b \in Ω

and each

g \in G

,

{Ad}_{g} b \in Ω

. Moreover, let

θ : G \to G^{*}

be the symplectic cocycle of G for the coadjoin action of G on

G^{*}

such that, for any

g \in G

,

J \circ Φ_{g} = {Ad}_{g^{- 1}}^{*} \circ J + θ (g) .

Then for each

b \in Ω

and each

g \in G

\begin{matrix} P ({Ad}_{g} b) & = exp (〈θ (g^{- 1}), b〉) P (b) = exp (- 〈{Ad}_{g}^{*} θ (g), b〉) P (b), \\ E_{J} ({Ad}_{g} b) & = {Ad}_{g^{- 1}}^{*} E_{J} (b) + θ (g), \\ S ({Ad}_{g} b) & = S (b) . \end{matrix}

Proof.

We have

\begin{matrix} P ({Ad}_{g} b) & = \int_{M} exp (- 〈 J, {Ad}_{g} b 〉) d λ_{ω} = \int_{M} exp (- 〈 {Ad}_{g}^{*} J, b 〉) d λ_{ω} \\ = \int_{M} exp (- 〈J \circ Φ_{g^{- 1}} - θ (g^{- 1}, b〉) d λ_{ω} \\ = exp (〈θ (g^{- 1}), b〉) P (b) = exp (- 〈{Ad}_{g}^{*} θ (g), b〉) P (b), \end{matrix}

since

θ (g^{- 1}) = - {Ad}_{g}^{*} θ (g)

. By using Propositions 14 and 12, the other results easily follow. ☐

Remark 16.

The equality

E_{J} ({Ad}_{g} b) = {Ad}_{g^{- 1}}^{*} E_{J} (b) + θ (g)

means that the map

E_{J} : Ω \to G^{*}

is equivariant with respect to the adjoint action of G on the open subset Ω of its Lie algebra

G

and its affine action on the left on

G^{*}

(g, ξ) \mapsto {Ad}_{g^{- 1}}^{*} ξ + θ (g), g \in G, ξ \in G^{*} .

Proposition 17.

The assumptions and notations are the same as those in Proposition 14. For each

b \in Ω

and each

X \in G

, we have

\begin{matrix} 〈E_{J} (b), [X, b]〉 & = 〈Θ (X), b〉, \\ D E_{J} (b) ([X, b]) & = - {ad}_{X}^{*} E_{J} (b) + Θ (X), \end{matrix}

where

Θ = T_{e} θ : G \to G^{*}

is the 1-cocycle of the Lie algebra

G

associated to the 1-cocycle θ of the Lie group G.

Proof.

Let us set

g = exp (τ X)

in the first equality in Proposition 16, derive that equality with respect to τ, and evaluate the result at

τ = 0

. We obtain

D P (b) ([X, b]) = - P (b) 〈Θ (X), b〉 .

Since, by the first equality of Proposition 14,

D P (b) = - P (b) E_{J} (b)

, the first stated equality follows.

Let us now set

g = exp (τ X)

in the second equality in Proposition 16, derive that equality with respect to τ, and evaluate the result at

τ = 0

. We obtain the second equality stated. ☐

Corollary 4.

With the assumptions and notations of Proposition 17, let us define, for each

b \in Ω

, a linear map

Θ_{b} : G \to G^{*}

by setting

Θ_{b} (X) = Θ (X) - {ad}_{X}^{*} E_{J} (b) .

The map

Θ_{b}

is a symplectic 1-cocycle of the Lie algebra

G

for the coadjoint representation, which satisfies

Θ_{b} (b) = 0 .

Moreover if we replace the momentum map J by

J_{1} = J + μ

, with

μ \in G^{*}

constant, the 1-cocycle

Θ_{b}

remains unchanged.

Proof.

For X, Y and Z in

G

, we have since Θ is a 1-cocycle,

\sum_{circ (X, Y, Z)}

meaning a sum over circular permutations of X, Y and Z, using the Jacobi identity in

G

, we have

\begin{matrix} \sum_{circ (X, Y, Z)} 〈Θ_{b} (X), [Y, Z]〉 & = \sum_{circ (X, Y, Z)} 〈- {ad}_{X}^{*} E_{J} (b), [Y, Z]〉 \\ = \sum_{circ (X, Y, Z)} 〈- E_{J} (b), [X, [Y, Z]]〉 \\ = 0 . \end{matrix}

The linear map

Θ_{b}

is therefore a 1 cocycle, even a symplectic 1-cocycle since for all X and

Y \in G

,

〈Θ_{b} (X), Y〉 = - 〈Θ_{b} (Y), X〉

.

Using the first equality stated in Proposition 17, we have for any

X \in G

〈Θ_{b} (b), X〉 = 〈Θ (b) - {ad}_{b}^{*} E_{J} (b), X〉 = - 〈Θ (X), b〉 + 〈E_{J} (b), [X, b]〉 = 0 .

If we replace J by

J_{1} = J + μ

, the map

X \mapsto Θ (X)

is replaced by

X \mapsto Θ_{1} (X) = Θ (X) + {ad}_{X}^{*} μ

and

E_{J} (b)

by

E_{J_{1}} (b) = E_{J} (b) + μ

, therefore

Θ_{b}

remains unchanged. ☐

The following lemma will allow us to define, for each

b \in Ω

, a remarkable symmetric bilinear form on the vector subspace

[b, G] = \{[b, X]; X \in G\}

of the Lie algebra

G

.

Lemma 1.

Let Ξ be a 1-cocycle of a finite-dimensional Lie algebra

G

for the coadjoint representation. For each

b \in ker Ξ

, let

F_{b} = [G, b]

be the set of elements

X \in G

which can be written

X = [X_{1}, b]

for some

X_{1} \in G

. Then

F_{b}

is a vector subspace of

G

, and the value of the right hand side of the equality

Γ_{b} (X, Y) = 〈Ξ (X_{1}), Y〉, with X_{1} \in G, X = [X_{1}, b] \in F_{b}, Y \in F_{b},

depends only on X and Y, not on the choice of

X_{1} \in G

such that

X = [X_{1}, b]

. That equality defines a bilinear form

Γ_{b}

on

F_{b}

which is symmetric, i.e., satisfies

Γ_{b} (X, Y) = Γ_{b} (Y, X) for all X and Y \in F_{b} .

Proof.

Let

X_{1}

and

X_{1}^{'} \in G

be such that

[X_{1}, b] = [X_{1}^{'}, b] = X

. Let

Y_{1} \in G

be such that

[Y_{1}, b] = Y

. We have

\begin{matrix} 〈Ξ (X_{1} - X_{1}^{'}), Y〉 & = 〈Ξ (X_{1} - X_{1}^{'}), [Y_{1}, b]〉 \\ = - 〈Ξ (Y_{1}), [b, X_{1} - X_{1}^{'}]〉 - 〈Ξ (b), [X_{1} - X_{1}^{'}, Y_{1}]〉 \\ = 0 \end{matrix}

since

Ξ (b) = 0

and

[b, X_{1} - X_{1}^{'}] = 0

. We have shown that

〈Ξ (X_{1}), Y〉 = 〈Ξ (X_{1}^{'}), Y〉

. Therefore

Γ_{b}

is a bilinear form on

F_{b}

. Similarly

\begin{matrix} 〈Ξ (X_{1}), Y〉 & = 〈Ξ (X_{1}), [Y_{1}, b]〉 = - 〈Ξ (Y_{1}), [b, X_{1}]〉 - 〈Ξ (b), [X_{1}, Y_{1}]〉 = 〈Ξ (Y_{1}), X〉, \end{matrix}

which proves that

Γ_{b}

is symmetric. ☐

Theorem 7.

The assumptions and notations are the same as those in Proposition 14. For each

b \in Ω

, there exists on the vector subspace

F_{b} = [G, b]

of elements

X \in G

which can be written

X = [X_{1}, b]

for some

X_{1} \in G

, a symmetric negative bilinear form

Γ_{b}

given by

Γ_{b} (X, Y) = 〈Θ_{b} (X_{1}), Y〉, with X_{1} \in G, X = [X_{1}, b] \in F_{b}, Y \in F_{b},

where

Θ_{b} : G \to G^{*}

is the symplectic 1-cocycle defined in Corollary 4.

Proof.

We have seen in Corollary 4 that

b \in ker Θ_{b}

. The fact that the equality given in the statement above defines indeed a symmetric bilinear form on

F_{b}

directly follows from Lemma 1. We only have to prove that this symmetric bilinear form is negative. Let

X \in F_{b}

and

X_{1} \in G

such that

X = [X_{1}, b]

. Using Proposition 17 and Corollary 3, we have

\begin{matrix} Γ_{b} (X, X) & = 〈Θ_{b} (X_{1}), [X_{1}, b]〉 = 〈Θ (X_{1}) - {ad}_{X_{1}}^{*} E_{J} (b), [X_{1}, b]〉 = 〈D E_{J} (b) [X_{1}, b], [X_{1}, b]〉 \\ \leq 0 . \end{matrix}

The symmetric bilinear form

Γ_{b}

on

F_{b}

is therefore negative. ☐

Remark 17.

The symmetric negative bilinear forms encountered in Theorem 7 and Corollary 3 seem to be linked with the Fisher metric in information geometry discussed in [31,60,61].

7.3. Examples of Generalized Gibbs States

7.3.1. Action of the Group of Rotations on a Sphere

The symplectic manifold

(M, ω)

considered here is the two-dimensional sphere of radius R centered at the origin O of a three-dimensional oriented Euclidean vector space

\vec{E}

, equipped with its area element as symplectic form. The group G of rotations around the origin (isomorphic to

SO (3)

) acts on the sphere M by a Hamiltonian action. The Lie algebra

G

of G can be identified with

\vec{E}

, the fundamental vector field on M associated to an element

\vec{b}

in

G \equiv \vec{E}

being the vector field on M whose value at a point

m \in M

is given by the vector product

\vec{b} \times \vec{O m}

. The dual

G^{*}

of

G

will be too identified with

\vec{E}

, the coupling by duality being given by the Euclidean scalar product. The momentum map

J : M \to G^{*} \equiv \vec{E}

is given by

J (m) = - R \vec{O m}, m \in M .

Therefore, for any

\vec{b} \in G \equiv \vec{E}

,

〈J (m), \vec{b}〉 = - R \vec{O m} \cdot \vec{b} .

Let

\vec{b}

be any element in

G \equiv \vec{E}

. To calculate the partition function

P (\vec{b})

we choose an orthonormal basis

(\vec{e_{x}}, \vec{e_{y}}, \vec{e_{z}})

of

\vec{E}

such that

\vec{b} = ∥ \vec{b} ∥ \vec{e_{z}}

, with

∥ \vec{b} ∥ \in R^{+}

, and we use angular coordinates

(φ, θ)

on the sphere M. The coordinates of a point

m \in M

are

x = R cos θ cos φ, y = R cos θ sin φ, z = R sin θ .

We have

P (\vec{b}) = \int_{0}^{2 π} (\int_{- π / 2}^{π / 2} R^{2} exp (R ∥ \vec{b} ∥ sin θ d θ) d φ = \frac{4 π R}{∥ \vec{b} ∥} sh (R ∥ \vec{b} ∥) .

The probability density (with respect to the natural area measure on the sphere M) of the generalized Gibbs state associated to

\vec{b}

is

ρ_{b} (m) = \frac{1}{P (\vec{b})} exp (\vec{O m} \cdot \vec{b}), m \in M .

We observe that

ρ_{b}

reaches its maximal value at the point

m \in M

such that

\vec{O m} = \frac{R \vec{b}}{∥ \vec{b} ∥}

and its minimal value at the diametrally opposed point.

7.3.2. The Galilean Group, Its Lie Algebra and Its Actions

In view of the presentation, made below, of some physically meaningful generalized Gibbs states for Hamiltonian actions of subgroups of the Galilean group, we recall in this section some notions about the space-time of classical (non-relativistic) mechanics, the Galilean group, its Lie algebra and its Hamiltonian actions. The interested reader will find a much more detailed treatment on these subjects in the book by Souriau [14] or in the recent book by de Saxcé and Vallée [45]. The paper [62] presents a nice application of Galilean invariance in thermodynamics.

The space-time of classical mechanics is a four-dimensional real affine space which, once an inertial reference frame, units of length and time, orthonormal bases of space and time are chosen, can be identified with

R^{4} \equiv R^{3} \times R

(coordinates x, y, z, t). The first three coordinates x, y and z can be considered as the three components of a vector

\vec{r} \in R^{3}

, therefore an element of space-time can be denoted by

(\vec{r}, t)

. However, as the action of the Galilean group will show, the splitting of space-time into space and time is not uniquely determined, it depends on the choice of an inertial reference frame. In classical mechanics, there exists an absolute time, but no absolute space. There exists instead a space (which is an Euclidean affine three-dimensional space) for each value of the time. The spaces for two distinct values of the time should be considered as disjoint.

The space-time being identified with

R^{3} \times R

as explained above, the Galilean group G can be identified with the set of matrices of the form

(\begin{matrix} A & \vec{b} & \vec{d} \\ 0 & 1 & e \\ 0 & 0 & 1 \end{matrix}), with A \in SO (3), \vec{b} and \vec{d} \in R^{3}, e \in R,

(8)

the vector space

R^{3}

being oriented and endowed with its usual Euclidean structure, the matrix

A \in SO (3)

acting on it.

The action of the Galilean group G on space-time, identified as indicated above with

R^{3} \times R

, is the affine action

(\begin{matrix} \vec{r} \\ t \\ 1 \end{matrix}) \mapsto (\begin{matrix} A & \vec{b} & \vec{d} \\ 0 & 1 & e \\ 0 & 0 & 1 \end{matrix}) (\begin{matrix} \vec{r} \\ t \\ 1 \end{matrix}) = (\begin{matrix} A \vec{r} + t \vec{b} + \vec{d} \\ t + e \\ 1 \end{matrix}) .

The Lie algebra

G

of the Galilean group G can be identified with the space of matrices of the form

(\begin{matrix} j (\vec{ω}) & \vec{β} & \vec{δ} \\ 0 & 0 & ε \\ 0 & 0 & 0 \end{matrix}), with \vec{ω}, \vec{β} and \vec{δ} \in R^{3}, ε \in R .

(9)

We have denoted by

j (\vec{ω})

the

3 \times 3

skew-symmetric matrix

j (\vec{ω}) = (\begin{matrix} 0 & - ω_{z} & ω_{y} \\ ω_{z} & 0 & - ω_{x} \\ - ω_{y} & ω_{x} & 0 \end{matrix}) .

The matrix

j (\vec{ω})

is an element in the Lie algebra

s o (3)

, and its action on a vector

\vec{r} \in R^{3}

is given by the vector product

j (\vec{ω}) \vec{r} = \vec{ω} \times \vec{r} .

Let us consider a mechanical system made by a point particle of mass m whose position and velocity at time t, in the reference frame allowing the identification of space-time with

R^{3} \times R

, are the vectors

\vec{r}

and

\vec{v} \in R^{3}

. The action of an element of the Galilean group on

\vec{r}, \vec{v}

and t can be written as

(\begin{matrix} \vec{r} & \vec{v} \\ t & 1 \\ 1 & 0 \end{matrix}) \mapsto (\begin{matrix} A & \vec{b} & \vec{d} \\ 0 & 1 & e \\ 0 & 0 & 1 \end{matrix}) (\begin{matrix} \vec{r} & \vec{v} \\ t & 1 \\ 1 & 0 \end{matrix}) = (\begin{matrix} A \vec{r} + t \vec{b} + \vec{d} & A \vec{v} + \vec{b} \\ t + e & 1 \\ 1 & 0 \end{matrix}) .

Souriau has shown in his book [14] that this action is Hamiltonian, with the map J, defined on the evolution space of the particle, with value in the dual

G^{*}

of the Lie algebra

G

of the Galilean group, as momentum map

J (\vec{r}, t, \vec{v}, m) = m (\vec{r} \times \vec{v}, \vec{r} - t \vec{v}, \vec{v}, \frac{1}{2} {∥ \vec{v} ∥}^{2}) .

Let

b = (\begin{matrix} j (\vec{ω}) & \vec{β} & \vec{δ} \\ 0 & 0 & ε \\ 0 & 0 & 0 \end{matrix})

be an element in

G

. Its coupling with

J (\vec{r}, t, \vec{v}, m) \in G^{*}

is given by the formula

〈J (\vec{r}, t, \vec{v}, m), b〉 = m (\vec{ω} \cdot (\vec{r} \times \vec{v}) - (\vec{r} - t \vec{v}) \cdot \vec{β} + \vec{v} \cdot \vec{δ} - \frac{1}{2} {∥ \vec{v} ∥}^{2} ε) .

7.3.3. One-Parameter Subgroups of the Galilean Group

In his book [14], Souriau has shown that when the considered Lie group action is the action of the full Galilean group on the space of motions of an isolated mechanical system, the open subset Ω of the Lie algebra

G

of the Galilean group on which the conditions specified in Section 7.2 are satisfied is empty. In other words, generalized Gibbs states of the full Galilean group do not exist. However, generalized Gibbs states for one-parameter subgroups of the Galilean group do exist which have an interesting physical meaning.

Let us consider an element b of

G

such that in its matrix expression (expression (9) above) we have

ε \neq 0

. The one-parameter subgroup

G_{1}

of the Galilean group generated by b is the set of matrices

exp (τ b)

, with

τ \in R

. We have

exp (τ b) = (\begin{matrix} A (τ) & \vec{b} (τ) & \vec{d} (τ) \\ 0 & 1 & τ ε \\ 0 & 0 & 1 \end{matrix}),

with

\begin{matrix} A (τ) & = exp (τ j (\vec{ω})), \\ \vec{b} (τ) & = (\sum_{n = 1}^{\infty} \frac{τ^{n}}{n!} {(j (\vec{ω}))}^{n - 1}) \vec{β}, \\ \vec{d} (τ) & = (\sum_{n = 1}^{\infty} \frac{τ^{n}}{n!} {(j (\vec{ω}))}^{n - 1}) \vec{δ} + ε (\sum_{n = 2}^{\infty} \frac{τ^{n}}{n!} {(j (\vec{ω}))}^{n - 2}) \vec{β}, \end{matrix}

with the usual convention that

{(j (\vec{ω}))}^{0}

is the unit matrix.

The physical meaning of this one-parameter subgroup of the Galilean group can be understood as follows. Let us call fixed the affine Euclidean reference frame of space

(O, \vec{e_{x}}, \vec{e_{y}}, \vec{e_{z}})

used to represent, at time

t = 0

, a point in space by a vector

\vec{r}

or by its three components x, y and z. Let us set

τ = \frac{t}{ε}

. For each time

t \in R

, the action of

A (τ) = A (\frac{t}{ε})

maps the fixed reference frame

(O, \vec{e_{x}}, \vec{e_{y}}, \vec{e_{z}})

onto another affine Euclidean reference frame

(O (t), \vec{e_{x}} (t), \vec{e_{y}} (t), \vec{e_{z}} (t))

, which we call the moving reference frame. The velocity and the acceleration of the relative motion of the moving reference frame with respect to the fixed reference frame is given, at time

t = 0

, by the fundamental vector field associated to the element b of the Lie algebra

G

of the Galilean group: we see that each point in space has a motion composed of a rotation around the axis through O parallel to

\vec{ω}

, at an angular velocity

\frac{∥ \vec{ω} ∥}{ε}

, and simultaneously a uniformly accelerated motion of translation at an initial velocity

\frac{\vec{δ}}{ε}

and acceleration

\frac{\vec{β}}{ε}

. At time t, the velocity and acceleration of the moving reference frame with respect to its instantaneous position at that time can be described in a similar manner, but instead of O,

\vec{ω}

,

\vec{β}

and

\vec{δ}

we must use the corresponding transformed elements by the action of

A (τ) = A (\frac{t}{ε})

.

7.3.4. A Gas Contained in a Moving Vessel

We consider a mechanical system made by a gas of N point particles, indexed by

i \in {1, 2, \dots, N}

, contained in a vessel with rigid, undeformable walls, whose motion in space is given by the action of the one-parameter subgroup

G_{1}

of the Galilean group made by the

A (\frac{t}{ε})

, with

t \in R

, above described. We denote by

m_{i}

,

\vec{r_{i}} (t)

and

\vec{v_{i}} (t)

the mass, position vector and velocity vector, respectively, of the i-th particle at time t. Since the motion of the vessel containing the gas is precisely given by the action of

G_{1}

, the boundary conditions imposed to the system are invariant by that action, which leaves invariant the evolution space of the mechanical system, is Hamiltonian and projects onto a Hamiltonian action of

G_{1}

on the symplectic manifold of motions of the system. We can therefore consider the generalized Gibbs states of the system, as discussed in Section 7.1. We must evaluate the momentum map J of that action and its coupling with the element

b \in G

. As in Section 6.3.1 we will neglect, for that evaluation, the contributions of the collisions of the particles between themselves and with the walls of the vessel. The momentum map can therefore be evaluated as if all particles were free, and its coupling

〈 J, b 〉

with b is the sum

\sum_{i = 1}^{N} 〈 J_{i}, b 〉

of the momentum map

J_{i}

of the i-th particle, considered as free, with b. We have

〈J_{i} (\vec{r_{i}}, t, \vec{v_{i}}, m_{i}), b〉 = m_{i} (\vec{ω} \cdot (\vec{r_{i}} \times \vec{v_{i}}) - (\vec{r_{i}} - t \vec{v_{i}}) \cdot \vec{β} + \vec{v_{i}} \cdot \vec{δ} - \frac{1}{2} {∥ \vec{v_{i}} ∥}^{2} ε) .

Following Souriau [14], Chapter IV, pp. 299–303, we observe that

〈 J_{i}, b 〉

is invariant by the action of

G_{1}

. We can therefore define

\vec{r_{i 0}}

,

t_{0}

and

\vec{v_{i 0}}

by setting

(\begin{matrix} \vec{r_{i 0}} & \vec{v_{i 0}} \\ t_{0} & 1 \\ 1 & 0 \end{matrix}) = exp (- \frac{t}{ε} b) (\begin{matrix} \vec{r_{i}} & \vec{v_{i}} \\ t & 1 \\ 1 & 0 \end{matrix})

and write

〈J_{i} (\vec{r_{i}}, t, \vec{v_{i}}, m_{i}), b〉 = 〈J_{i} (\vec{r_{i 0}}, t_{0}, \vec{v_{i 0}}, m_{i}), b〉 .

The vectors

\vec{r_{i 0}}

and

\vec{v_{i 0}}

have a clear physical meaning: they are the vectors

\vec{r_{i}}

and

\vec{v_{i}}

as seen by an observer moving with the moving affine Euclidean reference frame

(O (t), \vec{e_{x}} (t), \vec{e_{y}} (t), \vec{e_{z}} (t))

. Moreover, as can be easily verified,

t_{0} = 0

of course. We therefore have

\begin{matrix} 〈J_{i} (\vec{r_{i}}, t, \vec{v_{i}}, m_{i}), b〉 & = m_{i} (\vec{ω} \cdot (\vec{r_{i 0}} \times \vec{v_{i 0}}) - \vec{r_{i 0}} \cdot \vec{β} + \vec{v_{i 0}} \cdot \vec{δ} - \frac{1}{2} {∥ \vec{v_{i 0}} ∥}^{2} ε) \\ = m_{i} (\vec{v_{i 0}} \cdot (\vec{ω} \times \vec{r_{i 0}} + \vec{δ}) - \vec{r_{i 0}} \cdot \vec{β} - \frac{1}{2} {∥ \vec{v_{i 0}} ∥}^{2} ε) \end{matrix}

where we have used the well known property of the mixed product

\vec{ω} \cdot (\vec{r_{i 0}} \times \vec{v_{i 0}}) = \vec{v_{i 0}} \cdot (\vec{ω} \times \vec{r_{i 0}}) .

Let us set

{\vec{U}}^{*} = \frac{1}{ε} (\vec{ω} \times \vec{r_{i 0}} + \vec{δ}) .

Using

\vec{v_{i 0}} - {\vec{U}}^{*}

and

{\vec{U}}^{*}

instead of

\vec{v_{i 0}}

, we can write

〈J_{i} (\vec{r_{i}}, t, \vec{v_{i}}, m_{i}), b〉 = m_{i} ε (- \frac{1}{2} ∥ \vec{v_{i 0}} - {\vec{U}}^{*} ∥^{2} - \vec{r_{i 0}} \cdot \frac{\vec{β}}{ε} + \frac{1}{2} {∥ {\vec{U}}^{*} ∥}^{2}) .

We observe that the vector

{\vec{U}}^{*}

only depends on ε,

\vec{ω}

,

\vec{δ}

, which are constants once the element

b \in G

is chosen, and of

\vec{r_{i 0}}

, not on

\vec{v_{i 0}}

. It has a clear physical meaning: it is the value of the velocity of the moving affine reference frame with respect to the fixed affine reference frame, at point

\vec{r_{i 0}}

seen by an observer linked to the moving reference frame. Therefore the vector

\vec{w_{i 0}} = \vec{v_{i 0}} - {\vec{U}}^{*}

is the relative velocity of the i-th particle with respect to the moving affine reference frame, seen by an observer linked to the moving reference frame.

The three components of

\vec{r_{i 0}}

and the three components of

\vec{p_{i 0}} = m_{i} \vec{w_{i 0}}

make a system of Darboux coordinates on the six-dimensional symplectic manilold

(M_{i}, ω_{i})

of motions of the i-th particle. With a slight abuse of notations, we can consider the momentum map

J_{i}

as defined on the space of motions of the i-th particle, instead of being defined on the evolution space of this particle, and write

〈J_{i} (\vec{r_{i 0}}, \vec{p_{i, 0}}), b〉 = - ε (\frac{1}{2 m_{i}} {∥ \vec{p_{i 0}} ∥}^{2} + m_{i} f_{i} (\vec{r_{i 0}})), \vec{p_{i 0}} = m_{i} \vec{w_{i 0}} = m_{i} (\vec{v_{i 0}} - {\vec{U}}^{*}),

(10)

and

f_{i} (\vec{r_{i 0}}) = \vec{r_{i 0}} \cdot \frac{\vec{β}}{ε} - \frac{1}{2 ε^{2}} ∥ \vec{ω} \times \vec{r_{i 0}} ∥^{2} - \frac{\vec{δ}}{ε} \cdot (\frac{\vec{ω}}{ε} \times \vec{r_{i 0}}) - \frac{1}{2 ε^{2}} {∥ \vec{δ} ∥}^{2} .

Equation (10) is well suited for the determination of generalized Gibbs states of the system. Let us set

P_{i} (b) = \int_{M_{i}} exp (- 〈 J_{i}, b 〉) d λ_{ω_{i}}, E_{J_{i}} (b) = \frac{1}{P_{i} (b)} \int_{M_{i}} J_{i} exp (- 〈 J_{i}, b 〉) d λ_{ω_{i}} .

The integrals in the right hand sides of these equalities converge if and only if

ε < 0

. It means that the matrix b belongs to the subset Ω of the one-dimensional Lie algebra of the considered one-parameter subgroup

G_{1}

of the Galilean group on which generalized Gibbs states can be defined if and only if

ε < 0

. Assuming that condition satisfied, we can use Definitions 19. The generalized Gibbs state determined by b has the smooth density, with respect to the Liouville measure

\prod_{i = 1}^{N} λ_{ω_{i}}

on the symplectic manifold of motions

Π_{i = 1}^{N} (M_{i}, ω_{i})

,

ρ (b) = \prod_{i = 1}^{N} ρ_{i} (b), with ρ_{i} (b) = \frac{1}{P_{i} (b)} exp (- 〈 J_{i}, b 〉) .

The partition function, whose expression is

P (b) = \prod_{i = 1}^{N} P_{i} (b),

can be used, with the help of the formulae given in Section 7.2, to determine all the generalized thermodynamic functions of the gas in a generalized thermodynamic equilibrium state.

Remark 18.

1.: The physical meaning of the parameter ε which appears in the expression of the matrix b is clearly apparent in expression (10) of $〈 J_{i}, b 〉$ :

$ε = - \frac{1}{k T},$

T being the absolute temperature and k the Boltzmann’s constant.
2.: The same expression (10) shows that the relative motion of the gas with respect to the moving vessel in which it is contained, seen by an observer linked to that moving vessel, is described by a Hamiltonian system in which the kinetic and potential energies of the i-th particle are, respectively, $\frac{1}{2 m_{i}} {∥ \vec{p_{i 0}} ∥}^{2}$ and $m_{i} f_{i} (\vec{r_{i 0}})$ . This result can be obtained in another way: by deriving the Hamiltonian which governs the relative motion of a mechanical system with respect to a moving frame, as used by Jacobi [63] to determine the famous Jacobi integral of the restricted circular three-body problem (in which two big planets move on concentric circular orbits around their common center of mass, and a third planet of negligible mass moves in the gravitational field created by the two big planets).
3.: The generalized Gibbs state of the system imposes to the various parts of the system, i.e., to the various particles, to be at the same temperature $T = - \frac{1}{k ε}$ and to be statistically at rest in the same moving reference frame.

7.3.5. Three Examples

1.: Let us set $\vec{ω} = 0$ and $\vec{β} = 0$ . The motion of the moving vessel containing the gas (with respect to the so called fixed reference frame) is a translation at a constant velocity $\frac{\vec{δ}}{ε}$ . The function $f_{i} (\vec{r_{i 0}})$ is then a constant. In the moving reference frame, which is an inertial frame, we recover the thermodynamic equilibrium state of a monoatomic gas discussed in Section 6.3.1.
2.: Let us set now $\vec{ω} = 0$ and $\vec{δ} = 0$ . The motion of the moving vessel containing the gas (with respect to the so called fixed reference frame) is now an uniformly accelerated translation, with acceleration $\frac{\vec{β}}{ε}$ . The function $f_{i} (\vec{r_{i 0}})$ now is

$f_{i} (\vec{r_{i 0}}) = \vec{r_{i 0}} \cdot \frac{\vec{β}}{ε} .$

In the moving reference frame, which is no more inertial, we recover the thermodynamic equilibrium state of a monoatomic gas in a gravity field $\vec{g} = - \frac{\vec{β}}{ε}$ discussed in Section 6.3.2.
3.: Let us now set $\vec{ω} = ω \vec{e_{z}}$ , $\vec{β} = 0$ and $\vec{δ} = 0$ . The motion of the moving vessel containing the gas (with respect to the so called fixed reference frame) is now a rotation around the coordinate z axis at a constant angular velocity $\frac{ω}{ε}$ . The function $f_{i} (\vec{r_{i 0}})$ is now

$f_{i} (\vec{r_{i 0}}) = - \frac{ω^{2}}{2 ε^{2}} {∥ \vec{e_{z}} \times \vec{r_{i 0}} ∥}^{2} .$

The length $Δ = ∥ \vec{e_{z}} \times \vec{r_{i, 0}} ∥$ is the distance between the i-th particle and the axis of rotation of the moving frame (the coordinate z axis). Moreover, we have seen that $ε = \frac{- 1}{k T}$ . Therefore in the generalized Gibbs state, the probability density $ρ_{i} (b)$ of presence of the i-th particle in its symplectic manifold of motion $M_{i}, ω_{i}$ , with respect to the Liouville measure $λ_{ω_{i}}$ , is

$ρ_{i} (b) = \frac{1}{P_{i} (b)} exp (- 〈 J_{i}, b 〉) = Constant \cdot exp (- \frac{1}{2 m_{i} k T} {∥ \vec{p_{i 0}} ∥}^{2} + \frac{m_{i}}{2 k T} {(\frac{ω}{ε})}^{2} Δ^{2}) .$

This formula describes the behaviour of a gas made of point particles of various masses in a centrifuge rotating at a constant angular velocity $\frac{ω}{ε}$ : the heavier particles concentrate farther from the rotation axis than the lighter ones.

7.3.6. Other Applications of Generalized Gibbs States

Applications of generalized Gibbs states in thermodynamics of continua, with the use of affine tensors, are presented in the papers by de Saxcé [64,65].

Several applications of generalized Gibbs states of subgroups of the Poincaré group were considered by Souriau. For example, he presents in his book [14], Chapter IV, p. 308, a generalized Gibbs which describes the behaviour of a gas in a relativistic centrifuge, and in his papers [15,16], very nice applications of such generalized Gibbs states in Cosmology.

Acknowledgments

I address my thanks to Alain Chenciner for his interest and his help to study the works of Claude Shannon, to Roger Balian for his comments and his explanations about thermodynamic potentials, and to Frédéric Barbaresco for his kind invitation to participate in the GSI 2015 conference and his encouragements. My warmest thanks to the anonymous referees whose very careful and benevolent reading of my work allowed me to correct several mistakes and to improve this paper.

Conflicts of Interest

The author declares no conflict of interest.

References

Abraham, R.; Marsden, J.E. Foundations of Mechanics, 2nd ed.; American Chemical Society: Washington, DC, USA, 1978. [Google Scholar]
Arnold, V.I. Mathematical Methods of Classical Mechanics, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 1978. [Google Scholar]
Cannas da Silva, A. Lectures on Symplectic Geometry; Springer: Berlin/Heidelberg, Germany, 2001. [Google Scholar]
Guillemin, V.; Sternberg, S. Symplectic Techniques in Physics; Cambridge University Press: Cambridge, UK, 1984. [Google Scholar]
Holm, D. Geometric Mechanics, Part I: Dynamics ans Symmetry; World Scientific: Singapore, 2008. [Google Scholar]
Holm, D. Geometric Mechanics, Part II: Rotating, Translating and Rolling; World Scientific: Singapore, 2008. [Google Scholar]
Iglesias, P. Symétries et Moment; Éditions Hermann: Paris, France, 2000. (In French) [Google Scholar]
Laurent-Gengoux, C.; Pichereau, A.; Vanhaecke, P. Poisson Structures; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
Libermann, P.; Marle, C.-M. Symplectic Geometry and Analytical Mechanics; Springer: Berlin/Heidelberg, Germany, 1987. [Google Scholar]
Ortega, J.-P.; Ratiu, T.-S. Momentum Maps and Hamiltonian Reduction; Birkhäuser: Boston, MA, USA; Basel, Switzerland; Berlin, Germany, 2004. [Google Scholar]
Vaisman, I. Lectures on the Geometry of Poisson Manifolds; Springer: Berlin/Heidelberg, Germany, 1994. [Google Scholar]
Marle, C.-M. Symmetries of hamiltonian systems on symplectic and poisson manifolds. In Similarity and Symmetry Methods, Applications in Elasticity and Mechanics of Materials; Ganghoffer, J.-F., Mladenov, I., Eds.; Springer: Berlin/Heidelberg, Germany, 2014; pp. 183–269. [Google Scholar]
Souriau, J.-M. Définition covariante des équilibres thermodynamiques. Supplemento al Nuovo Cimento 1966, 4, 203–216. (In French) [Google Scholar]
Souriau, J.-M. Structure des Systèmes Dynamiques; Dunod: Malakoff, France, 1969. (In French) [Google Scholar]
Souriau, J.-M. Mécanique Statistique, Groupes de Lie et Cosmologie. In Géométrie Symplectique et Physique Mathématique; CNRS Éditions: Paris, France, 1974; pp. 59–113. (In French) [Google Scholar]
Souriau, J.-M. Géométrie symplectique et Physique mathématique. In Deux Conférences de Jean-Marie Souriau, Colloquium de la Société Mathématique de France, Paris, France, 19 February–12 November 1975. (In French)
Souriau, J.-M. Mécanique Classique et Géométrie Symplectique; Dunod: Malakoff, France, 1984. (In French) [Google Scholar]
Mackey, G.W. The Mathematical Foundations of Quantum Mechanics; W. A. Benjamin, Inc.: New York, NY, USA, 1963. [Google Scholar]
Newton, I. Philosophia Naturalis Principia Mathematica; Translated in French by Émilie du Chastelet (1756); London, UK, 1687. (In French) [Google Scholar]
Lagrange, J.L. Mécanique Analytique, 1st ed.; La veuve de Saint-Pierre: Paris, France, 1808; reprinted by Jacques Gabay: Paris, France, 1989. (In French) [Google Scholar]
Hamilton, W.R. On a general method in Dynamics. In Sir William Rowan Hamilton Mathematical Works, Volume II; Cambridge University Press: Cambridge, UK, 1940; pp. 247–308. [Google Scholar]
Hamilton, W.R. Second essay on a general method in Dynamics. In Sir William Rowan Hamilton Mathematical Works, Volume II; Cambridge University Press: Cambridge, UK, 1940; pp. 95–144. [Google Scholar]
Bérest, P. Calcul des Variations Application à la Mécanique et à la Physique; Ellipses/Éditions Marketing: Paris, France, 1997. (In French) [Google Scholar]
Bourguignon, J.-P. Calcul Variationnel; Éditions de l’École Polytechnique: Paris, France, 1991. (In French) [Google Scholar]
Lanczos, C.S. The Variational Principles of Mechanics, 4th ed.; Reprinted by Dover, New York, 1970; University of Toronto Press: Toronto, ON, Canada, 1970. [Google Scholar]
Malliavin, P. Géométrie Différentielle Intrinsèque; Éditions Hermann: Paris, France, 1972. (In French) [Google Scholar]
Sternberg, S. Lectures on Differential Geometry; Prentice-Hall: Upper Saddle River, NJ, USA, 1964. [Google Scholar]
Kosmann-Schwarzbach, Y. The Noether Theorems; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
Poincaré, H. Sur une forme nouvelle des équations de la Méanique. C. R. Acad. Sci. 1901, 7, 369–371. [Google Scholar]
Marle, C.-M. On Henri Poincaré’s note “Sur une forme nouvelle des équations de la Mécanique”. J. Geom. Symmetry Phys. 2013, 29, 1–38. [Google Scholar]
Barbaresco, F. Symplectic structure of information geometry: Fisher metric and euler-poincaré equation of souriau lie group thermodynamics. In Geometric Science of Information: Second International Conference, GSI 2015, Proceedings; Nielsen, F., Barbaresco, F., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2015; Volume 9389, pp. 529–540. (In French) [Google Scholar]
Lagrange, J.-L. Mémoire sur la Théorie Générale de la Variation des Constantes Arbitraires Dans Tous les Problèmes de Mécanique; Lu le 13 mars 1809 à l’Institut de France; Dans Œuvres de Lagrange; Gauthier-Villars: Paris, France, 1877; Volume VI, pp. 771–805. (In French) [Google Scholar]
Lagrange, J.-L. Second Mémoire sur la Théorie de la Variation des Constantes Arbitraires Dans les Problèmes de Mécanique; Gauthier-Villars: Paris, France, 1877; Volume VI, pp. 809–816. (In French) [Google Scholar]
Tulczyjew, W.M. Hamiltonian systems, Lagrangian systems and the Legendre transformation. Symp. Math. 1974, 14, 247–258. [Google Scholar]
Tulczyjew, W.M. Geometric Formulations of Physical Theories; Monographs and Textbooks in Physical Science; Bibliopolis: Napoli, Italy, 1989. [Google Scholar]
Lichnerowicz, A. Les variétés de Poisson et leurs algèbres de Lie associées. J. Differ. Geom. 1977, 12, 253–300. (In French) [Google Scholar]
Lichnerowicz, A. Les variétés de Jacobi et leurs algèbres de Lie associées. J. Math. Pures Appl. 1979, 57, 453–488. (In French) [Google Scholar]
Kirillov, A. Local lie algebras. Russ. Math. Surv. 1976, 31, 55–75. [Google Scholar] [CrossRef]
Poisson, S.D. Sur la variation des constantes arbitraires dans les questions de mécanique. Mémoire lu le 16 octobre 1809 à l’Institut de France. Journal de L’École Polytechnique quinzième cahier, tome VIII. 266–344. (In French)
Koszul, J.-L. Crochet de Schouten-Nijenhuis et cohomologie. In É. Cartan et les Mathématiques D’aujourd’hui; Astérisque, numéro hors série; Société Mathématique de France: Paris, France, 1985; pp. 257–271. (In French) [Google Scholar]
Marle, C.-M. Calculus on Lie algebroids, Lie groupoids and Poisson manifolds. Dissertationes Mathematicae 457, Institute of Mathematics, Polish Academy of Sciences (Warszawa), 2008; arXiv:0806.0919. [Google Scholar]
Weinstein, A. The local structure of Poisson manifolds. J. Differ. Geom. 1983, 18, 523–557. [Google Scholar]
Marsden, J.E.; Weinstein, A. Reduction of symplectic manifolds with symmetry. Rep. Math. Phys. 1974, 5, 121–130. [Google Scholar] [CrossRef]
Meyer, K. Symmetries and integrals in mechanics. In Dynamical Systems; Peixoto, M., Ed.; Academic Press: New York, NY, USA, 1973; pp. 259–273. [Google Scholar]
De Saxcé, G.; Vallée, C. Galilean Mechanics and Thermodynamics of Continua; John Wiley & Sons: Hoboken, NJ, USA, 2016. [Google Scholar]
Boltzmann, L.E. Leçons sur la Théorie des gaz. Available online: http://iris.univ-lille1.fr/handle/1908/1523 (accessed on 11 October 2016). (In French)
Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423, 623–656. [Google Scholar] [CrossRef]
Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev. 1957, 106, 620–630. [Google Scholar] [CrossRef]
Jaynes, E.T. Information theory and statistical mechanics II. Phys. Rev. 1957, 108, 171–190. [Google Scholar] [CrossRef]
Balian, R. Information in statistical physics. Stud. Hist. Philos. Mod. Phys. Part B 2005, 36, 323–353. [Google Scholar] [CrossRef]
Poincaré, H. Sur le problème des trois corps et les équations de la dynamique. Acta Math. 1890. [Google Scholar] [CrossRef]
Kinetic Theory. Available online: http://hyperphysics.phy-astr.gsu.edu/hbase/kinetic/kinthe.html (accessed on 11 October 2016).
Gastebois, G. Théorie Cinétique des Gaz. Available online: http://gilbert.gastebois.pagesperso-orange.fr/java/gaz/gazparfait/theorie_gaz.pdf (accessed on 11 October 2016). (In French)
Synge, J.L. The Relativistic Gas; North Holland Publishing Company: Amsterdam, The Netherlands, 1957. [Google Scholar]
Massieu, F. Sur les Fonctions caractéristiques des divers fluides. C. R. Acad. Sci. Paris 1869, 69, 858–862. (In French) [Google Scholar]
Massieu, F. Addition au précédent Mémoire sur les Fonctions caractéristiques. C. R. Acad. Sci. Paris 1869, 69, 1057–1061. (In French) [Google Scholar]
Massieu, F. Thermodynamique. Mémoire sur les Fonctions Caractéristiques des Divers Fluides et sur la Théorie des Vapeurs; Académie des Sciences: Paris, France, 1876; pp. 1–92. (In French) [Google Scholar]
Balian, R. François Massieu et les Potentiels Thermodynamiques; Évolution des Disciplines et Histoire des Découvertes; Académie des Sciences: Paris, France, 2015. (In French) [Google Scholar]
Callen, H.B. Thermodynamics and an Introduction to Thermostatics, 2nd ed.; John Wiley and Sons: New York, NY, USA, 1985. [Google Scholar]
Barbaresco, F. Koszul information geometry and Souriau geometric temperature/capacity of lie group thermodynamics. Entropy 2014, 16, 4521–4565. [Google Scholar] [CrossRef]
Barbaresco, F. Geometric theory of heat from Souriau lie groups thermodynamics and koszul hessian geometry: Applications in information geometry for exponential families. Entropy 2016. [Google Scholar] [CrossRef]
De Saxcé, G.; Vallée, C. Bargmann group, momentum tensor and Galilean invariance of Clausius-Duhem inequality. Int. J. Eng. Sci. 2012, 50, 216–232. [Google Scholar] [CrossRef]
Jacobi, C.G.J. Sur le mouvement d’un point et sur un cas particulier du problème des trois corps. C. R. Acad. Sci. 1836, 3, 59–61. (In French) [Google Scholar]
De Saxcé, G. Entropy and structure for the thermodynamic systems. In Geometric Science of Information, Second International Conference GSI 2015 Proceedings; Nielsen, F., Barbaresco, F., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2015; Volume 9389, pp. 519–528. [Google Scholar]
De Saxcé, G. Link between lie group statistical mechanics and thermodynamics of continua. Entropy 2016. [Google Scholar] [CrossRef]

© 2016 by the author; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Marle, C.-M. From Tools in Symplectic and Poisson Geometry to J.-M. Souriau’s Theories of Statistical Mechanics and Thermodynamics. Entropy 2016, 18, 370. https://doi.org/10.3390/e18100370

AMA Style

Marle C-M. From Tools in Symplectic and Poisson Geometry to J.-M. Souriau’s Theories of Statistical Mechanics and Thermodynamics. Entropy. 2016; 18(10):370. https://doi.org/10.3390/e18100370

Chicago/Turabian Style

Marle, Charles-Michel. 2016. "From Tools in Symplectic and Poisson Geometry to J.-M. Souriau’s Theories of Statistical Mechanics and Thermodynamics" Entropy 18, no. 10: 370. https://doi.org/10.3390/e18100370

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

From Tools in Symplectic and Poisson Geometry to J.-M. Souriau’s Theories of Statistical Mechanics and Thermodynamics †

Abstract

1. Introduction

1.1. Contents of the Paper, Sources and Further Reading

1.2. Notations

2. The Lagrangian Formalism

2.1. The Configuration Space and the Space of Kinematic States

2.2. The Euler–Lagrange Equations

2.3. Hamilton’s Principle of Stationary Action

2.4. The Euler-Cartan Theorem

3. Lagrangian Symmetries

3.1. Assumptions and Notations

3.2. The Noether Theorem in Lagrangian Formalism

3.3. The Lagrangian Momentum Map

3.4. The Euler–Poincaré Equation

4. The Hamiltonian Formalism

4.1. Hyper-Regular Lagrangians

Assumptions Made in this Section

4.2. Presymplectic Manifolds

Presymplectic Manifolds in Mechanics

4.3. The Hamilton Equation

4.3.1. The Tulczyjew Isomorphisms

4.3.2. Lagrangian Submanifolds

4.4. The Hamiltonian Formalism on Symplectic and Poisson Manifolds

4.4.1. The Hamilton Formalism on Symplectic Manifolds

4.4.2. Properties of Poisson Manifolds

5. Hamiltonian Symmetries

5.1. Presymplectic, Symplectic and Poisson Maps and Vector Fields

5.2. Lie Algebras and Lie Groups Actions

5.3. Momentum Maps of Hamiltonian Actions

5.4. Noether’s Theorem in Hamiltonian Formalism

5.5. Symplectic Cocycles

5.6. The Use of Symmetries in Hamiltonian Mechanics

5.6.1. Symmetries of the Phase Space

5.6.2. Symmetries of the Space of Motions

6. Statistical Mechanics and Thermodynamics

6.1. Basic Concepts in Statistical Mechanics

6.1.1. The Liouville Measure on a Symplectic Manifold

6.1.2. Variation in Time of a Statistical State

6.2. Thermodynamic Equilibria and Thermodynamic Functions

6.2.1. Assumptions Made in this Section

6.2.2. Physical Meaning of the Introduced Functions

6.2.3. Towards Thermodynamic Equilibrium

6.3. Examples of Thermodynamic Equilibria

6.3.1. Classical Monoatomic Ideal Gas

6.3.2. Classical Ideal Monoatomic Gas in a Gravity Field

6.3.3. Relativistic Monoatomic Ideal Gas

6.3.4. Relativistic IDeal Gas of Massless Particles

6.3.5. Specific Heat of Solids

7. Generalization for Hamiltonian Actions

7.1. Generalized Gibbs States

Notations and Conventions

7.2. Generalized Thermodynamic Functions

Assumptions Made in this Section

7.3. Examples of Generalized Gibbs States

7.3.1. Action of the Group of Rotations on a Sphere

7.3.2. The Galilean Group, Its Lie Algebra and Its Actions

7.3.3. One-Parameter Subgroups of the Galilean Group

7.3.4. A Gas Contained in a Moving Vessel

7.3.5. Three Examples

7.3.6. Other Applications of Generalized Gibbs States

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

From Tools in Symplectic and Poisson Geometry to J.-M. Souriau’s Theories of Statistical Mechanics and Thermodynamics^†