Higher Order Geometric Theory of Information and Heat Based on Poly-Symplectic Geometry of Souriau Lie Groups Thermodynamics and Their Contextures: The Bedrock for Lie Group Machine Learning

Barbaresco, Frédéric

doi:10.3390/e20110840


by

Frédéric Barbaresco

Department of Advanced Radar Concepts, Thales Land Air Systems, Voie Pierre-Gilles de Gennes, 91470 Limours, France

Entropy 2018, 20(11), 840; https://doi.org/10.3390/e20110840

Submission received: 9 August 2018 / Revised: 23 September 2018 / Accepted: 9 October 2018 / Published: 2 November 2018

(This article belongs to the Special Issue Joseph Fourier 250th Birthday: Modern Fourier Analysis and Fourier Heat Equation in Information Sciences for the XXIst century)


Abstract

We introduce a poly-symplectic extension of Souriau Lie groups thermodynamics based on the higher-order model of statistical physics introduced by Ingarden. This extended model could be used for small data analytics and machine learning on Lie groups. The Souriau geometric theory of heat is well adapted to describe the probability density (maximum entropy Gibbs density) of data living on groups or on homogeneous manifolds. For small data analytics (rarefied gases, sparse statistical surveys, …), the maximum entropy density should take into account higher-order moment constraints (the Gibbs density is not defined by the first moment alone, since fluctuations require second-order and higher moments), as introduced by Ingarden. We use a poly-symplectic model introduced by Christian Günther, replacing the symplectic form by a vector-valued form. The poly-symplectic approach generalizes the Noether theorem, the existence of moment mappings, the Lie algebra structure of the space of currents, the (non-)equivariant cohomology and the classification of G-homogeneous systems. The formalism is covariant, i.e., no special coordinates or coordinate systems on the parameter space are used to construct the Hamiltonian equations. We underline the contextures of these models and the process used to build these generic structures. We also introduce a more synthetic Koszul definition of the Fisher metric, based on the Souriau model, that we name the Souriau-Fisher metric. This Lie groups thermodynamics is the bedrock for Lie group machine learning, providing a fully covariant maximum entropy Gibbs density based on representation theory (symplectic structure of coadjoint orbits for the Souriau non-equivariant model associated to a class of co-homology).

Keywords:

higher order thermodynamics; Lie groups thermodynamics; homogeneous manifold; poly-symplectic manifold; dynamical systems; non-equivariant cohomology; Lie group machine learning; Souriau-Fisher metric

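“Inviter les savants géomètres à traiter nos problèmes avec le soucis de la commodité et de l’agrément: qu’ils écartent tout ce qui n’a rien à voir avec la pénétration de l’esprit, seule qualité dont nous faisons grand cas et que nous nous sommes proposé d’éprouver et de couronner” (In English: To invite learned geometers to treat our problems with a concern for convenience and pleasure: let them set aside everything that has nothing to do with the penetration of the mind, the only quality that we value highly and that we have set out to test and to crown.)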
—Blaise Pascal—Deuxième Lettre sur la roulette, Paris, 19 Juillet 1658 [1]

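“Nous avons fait de la Dynamique un cas particulier de la Thermodynamique, une Science qui embrasse dans des principes communs tous les changements d’état des corps, aussi bien les changements de lieu que les changements de qualités physiques” (In English: We have made Dynamics a special case of Thermodynamics, a Science that embraces in common principles all the changes of state of bodies, changes of place as well as changes of physical qualities.)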
—Pierre Duhem, Sur les équations générales de la Thermodynamique, 1891 [2]

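“Nous prenons le mot mouvement pour désigner non seulement un changement de position dans l’espace, mais encore un changement d’état quelconque, lors même qu’il ne serait accompagné d’aucun déplacement… De la sorte, le mot mouvement s’oppose non pas au mot repos, mais au mot équilibre.” (In English: We take the word motion to designate not only a change of position in space, but also any change of state, even if it is not accompanied by any displacement… In this way, the word motion is opposed not to the word rest, but to the word equilibrium.)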
—Pierre Duhem, Commentaire aux principes de la Thermodynamique, 1894 [3]

1. Introduction

These two quotations from Pierre Duhem (see [4] for English translations) refer to Aristotle’s definition of “motion” (which can be found in The Physics), which designates not only a change of position in space but also any change of state, even if not accompanied by any displacement. In this view, dynamics appears as a special case of General Thermodynamics [2,3,5], which describes with common principles all changes in the state of a body, both changes of place and changes in physical qualities. Referring to Duhem’s “Energetics”, Stefano Bordini writes in [6]: “This theoretical design led Duhem to rediscover and reinterpret the tradition of Aristotle’s natural philosophy and Pascal’s epistemology … This outcome was surprising and clearly echoed the Aristotelian language and concept of motion as change and transformation: within the framework of Aristotelian natural philosophy, motion in the modern physical sense was actually a special case of the general concept of motion. The mathematisation of thermodynamics coincided with a generalisation of mechanics, and this generalisation led to an unexpected connection between modern mathematical physics and ancient natural philosophy” (see [7,8] for more developments on the links between the philosophies of Aristotle, Pascal and Duhem). This conceptual and epistemological point of view was illuminated 75 years later by Jean-Marie Souriau through the symplectic model of geometric mechanics applied to statistical mechanics and used to build a “Lie groups thermodynamics” of dynamical systems, where the Gibbs density is covariant with respect to the action of the Lie group on the system (dynamical groups such as the Galileo group). This Souriau theory is based on tools related to the non-equivariant model associated to a class of co-homology and to affine representations of Lie groups and Lie algebras (the latter approach was independently studied in mathematics by Koszul to characterize the geometry of homogeneous convex cones [9,10,11]). Duhem [12] and Souriau [13,14] also both studied how to extend thermodynamics to continuous media.

In this paper, we will explore and compare the joint geometric contextures shared by information theory (based on Koszul’s information geometry) and heat theory (based on Souriau’s Lie groups thermodynamics) to highlight their common elementary structures. Classically, analogies between mathematical or physical models are addressed by comparing their “structures”, defined as the arrangement of and relations between the parts or elements, or as the way in which the parts are arranged or organized. My personal concept of “contexture” is more general and phenomenological and could be defined as the act, process, or manner of weaving parts into a whole. We have thus replaced the relations between objects by the act of building these relations. Based on Souriau’s general definition of entropy as the Legendre transform of the logarithm of the generalized Laplace transform, and on the symplectic structure associated to Lie group coadjoint orbits, we will see how the geometric structures of information and heat theories are generated by these Souriau “generative processes”. We will extend these contextures to the vector-valued case based on the poly-symplectic model of higher order Souriau Lie groups thermodynamics.

In this paper, we identify the Riemannian metric introduced by Souriau based on co-homology, in the framework of “Lie groups thermodynamics”, as an extension of the classical Fisher metric introduced in information geometry. We have observed that the Souriau metric preserves the Fisher metric structure as the Hessian of the minus logarithm of a partition function, where the partition function is defined as a generalized Laplace transform on a convex cone. Souriau’s definition of the Fisher metric extends the classical one to the case of Lie groups or homogeneous manifolds. Souriau developed “Lie groups thermodynamics” in the framework of homogeneous symplectic manifolds in geometric statistical mechanics for dynamical systems, but, as observed by Souriau, the equations of this model are no longer linked to the symplectic manifold and depend only on the Lie group and the associated co-cycle.

This analogy with the Fisher metric opens potential applications in machine learning, where the Fisher metric is used by information geometry to define the “natural gradient”, a tool that reduces the sensitivity of ordinary stochastic gradient descent to rescaling or changes of variable in parameter space [15,16,17,18,19,20,21,22]. In machine learning revised by the natural gradient of information geometry, the ordinary gradient is corrected by the inverse of the Fisher matrix. Amari has theoretically proved the asymptotic optimality of the natural gradient compared to the classical gradient. With the Souriau approach, the Fisher metric could be extended, through the Souriau-Fisher metric, to design natural gradients for data on homogeneous manifolds.

Information geometry has been derived from the invariant geometrical structure involved in statistical inference. The Fisher metric defines a Riemannian metric as the Hessian of two dual potential functions, linked to dually coupled affine connections in a manifold of probability distributions. With the Souriau model, this structure is extended, preserving the Legendre transform between two dual potential functions parametrized in the Lie algebra of the group acting transitively on the homogeneous manifold.

Classically, the parameter θ of a probabilistic model is optimized, based on a sequence of observations y_{t}, by an online gradient descent:

θ_{t} \leftarrow θ_{t - 1} - η_{t} \frac{\partial l_{t} {(y_{t})}^{T}}{\partial θ}

(1)

with learning rate η_{t} and loss function l_{t} = - \log p (y_{t} / {\hat{y}}_{t}). This simple gradient descent has two drawbacks: it uses the same non-adaptive learning rate for all parameter components, and it is not invariant with respect to a re-encoding of the parameters, which induces different learning rates. Amari introduced the natural gradient to restore this invariance, making the descent insensitive to the characteristic scale of each parameter direction. The gradient descent is corrected by I {(θ)}^{- 1}, where I is the Fisher information matrix with respect to the parameter θ, given by:

\begin{array}{l} I (θ) = [g_{i j}] \\ \text{with } g_{i j} = {[- E_{y \sim p (y / θ)} [\frac{\partial^{2} \log p (y / θ)}{\partial θ_{i} \partial θ_{j}}]]}_{i j} = {[E_{y \sim p (y / θ)} [\frac{\partial \log p (y / θ)}{\partial θ_{i}} \frac{\partial \log p (y / θ)}{\partial θ_{j}}]]}_{i j} \end{array}

(2)

with natural gradient:

θ_{t} \leftarrow θ_{t - 1} - η_{t} I {(θ)}^{- 1} \frac{\partial l_{t} {(y_{t})}^{T}}{\partial θ}

(3)
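As an illustration of the update in Equation (3), the following minimal sketch fits a univariate Gaussian by online natural gradient descent. The choice of model, the (mu, sigma) parameterization, the learning-rate schedule and all function names are assumptions made for this example only; they are not taken from the paper.

```python
import numpy as np

def grad_loss(theta, y):
    """Gradient of l = -log p(y | mu, sigma) for a univariate Gaussian."""
    mu, sigma = theta
    return np.array([-(y - mu) / sigma**2,
                     1.0 / sigma - (y - mu)**2 / sigma**3])

def fisher(theta):
    """Fisher information matrix of N(mu, sigma^2) in the (mu, sigma) chart."""
    _, sigma = theta
    return np.diag([1.0 / sigma**2, 2.0 / sigma**2])

def natural_gradient_step(theta, y, lr):
    """One update theta <- theta - lr * I(theta)^{-1} * grad, as in Equation (3)."""
    return theta - lr * np.linalg.solve(fisher(theta), grad_loss(theta, y))

# Synthetic observations (hypothetical data: true mean 3, true standard deviation 2).
rng = np.random.default_rng(0)
data = rng.normal(loc=3.0, scale=2.0, size=5000)

theta = np.array([0.0, 1.0])                  # initial guess (mu, sigma)
for t, y in enumerate(data, start=1):
    theta = natural_gradient_step(theta, y, lr=1.0 / (t + 10))

print(theta)  # estimate of (mu, sigma)
```

In this chart the inverse Fisher matrix rescales each component of the gradient by a factor proportional to sigma squared, so the step size adapts to the local scale of the density, which is the behaviour the natural gradient is meant to provide.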

Amari has proved that the Riemannian metric in an exponential family is the Fisher information matrix defined by:

g_{i j} = - {[\frac{\partial^{2} Φ}{\partial θ_{i} \partial θ_{j}}]}_{i j} with Φ (θ) = - \log \int_{ℝ} e^{- 〈 θ, y 〉} d y

(4)

and the dual potential, the Shannon entropy, is given by the Legendre transform:

S (η) = 〈 θ, η 〉 - Φ (θ) with η_{i} = \frac{\partial Φ (θ)}{\partial θ_{i}} and θ_{i} = \frac{\partial S (η)}{\partial η_{i}}

(5)
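The duality in Equations (4) and (5) can be checked numerically on a one-parameter example. The sketch below assumes a density supported on y ≥ 0 (so that the integral converges for θ > 0), for which Φ(θ) = log θ in closed form; this choice of support and the helper names are assumptions for illustration.

```python
import numpy as np

def phi(theta):
    """Phi(theta) = -log int_0^inf exp(-theta*y) dy = log(theta), for theta > 0."""
    return np.log(theta)

def d(f, x, h=1e-4):
    """Central finite-difference first derivative."""
    return (f(x + h) - f(x - h)) / (2 * h)

def d2(f, x, h=1e-4):
    """Central finite-difference second derivative."""
    return (f(x + h) - 2 * f(x) + f(x - h)) / h**2

theta = 0.5
eta = d(phi, theta)     # dual coordinate eta = dPhi/dtheta (the mean, 1/theta)
g = -d2(phi, theta)     # Fisher metric g = -d^2 Phi / dtheta^2 (equal to 1/theta^2)

def entropy(eta_value):
    """S(eta) = <theta, eta> - Phi(theta), using theta = 1/eta for this family."""
    theta_value = 1.0 / eta_value
    return theta_value * eta_value - phi(theta_value)

theta_back = d(entropy, eta)   # inverse Legendre map theta = dS/deta

print(eta, g, entropy(eta), theta_back)
```

For this family the entropy evaluates to 1 - log θ, which is the Shannon entropy of the exponential density with rate θ, and differentiating S with respect to η recovers θ as stated in Equation (5).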

In geometric statistical mechanics, Souriau has developed a “Lie groups thermodynamics” of dynamical systems where the (maximum entropy) Gibbs density is covariant with respect to the action of the Lie group. In the Souriau model, previous structures of information geometry are preserved:

I (β) = - \frac{\partial^{2} Φ}{\partial β^{2}} \text{ with } Φ (β) = - \log \int_{M} e^{- 〈 β, U (ξ) 〉} d λ

(6)

S (Q) = 〈 β, Q 〉 - Φ (β) with Q = \frac{\partial Φ (β)}{\partial β} \in g^{*} and β = \frac{\partial S (Q)}{\partial Q} \in g

(7)

In the Souriau Lie groups thermodynamics model, β is a “geometric” (Planck) temperature, element of the Lie algebra g of the group, and Q is a “geometric” heat, element of the dual Lie algebra g^{*} of the group. Souriau has proposed a Riemannian metric that we have identified as a generalization of the Fisher metric:

I (β) = [g_{β}] with g_{β} ([β, Z_{1}], [β, Z_{2}]) = {\tilde{Θ}}_{β} (Z_{1}, [β, Z_{2}])

(8)

with {\tilde{Θ}}_{β} (Z_{1}, Z_{2}) = \tilde{Θ} (Z_{1}, Z_{2}) + 〈 Q, a d_{Z_{1}} (Z_{2}) 〉 where a d_{Z_{1}} (Z_{2}) = [Z_{1}, Z_{2}]

(9)

The tensor \tilde{Θ} used to define this extended Fisher metric is defined by the moment map J (x), from M (homogeneous symplectic manifold) to the dual Lie algebra g^{*}, given by:

\tilde{Θ} (X, Y) = J_{[X, Y]} - \{J_{X}, J_{Y}\} \text{ with } J (x) : M \to g^{*} \text{ such that } J_{X} (x) = 〈 J (x), X 〉, X \in g

(10)

This tensor \tilde{Θ} is also defined in the tangent space of the cocycle θ (g) \in g^{*} (this cocycle appears due to the non-equivariance of the coadjoint operator A d_{g}^{*}, the action of the group on the dual Lie algebra):

Q (A d_{g} (β)) = A d_{g}^{*} (Q) + θ (g)

(11)

\begin{array}{lll} \tilde{Θ} (X, Y) : & g \times g \to ℝ & \text{with } Θ (X) = T_{e} θ (X (e)) \\ & X, Y \mapsto 〈 Θ (X), Y 〉 & \end{array}

(12)

In Souriau’s Lie groups thermodynamics, the invariance by re-parameterization of information geometry is replaced by invariance with respect to the action of the group. An element g of the group acts on an element β \in g of the Lie algebra through the adjoint operator A d_{g}. Under this action of the group, β \mapsto A d_{g} (β), the entropy S (Q) and the Fisher metric I (β) are invariant:

β \in g \to A d_{g} (β) \Rightarrow \begin{cases} S [Q (A d_{g} (β))] = S (Q) \\ I [A d_{g} (β)] = I (β) \end{cases}

(13)
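The invariance of the entropy in Equation (13) can be illustrated on a simple equivariant case. The sketch below assumes the rotation group SO(3) acting on the unit sphere S^2 (a coadjoint orbit), with U(ξ) = ξ and the Lie algebra identified with R^3, so that the adjoint action is β ↦ Rβ and the cocycle θ(g) vanishes; the potential Φ then has the closed form used in the code. This setting and all names are assumptions chosen for illustration, not the general non-equivariant model of the paper.

```python
import numpy as np

def phi(beta):
    """Phi(beta) = -log int_{S^2} exp(-<beta, xi>) d(lambda) = -log(4*pi*sinh|beta|/|beta|)."""
    b = np.linalg.norm(beta)
    return -np.log(4.0 * np.pi * np.sinh(b) / b)

def heat(beta, h=1e-5):
    """Geometric heat Q = dPhi/dbeta, by central finite differences."""
    q = np.zeros(3)
    for i in range(3):
        e = np.zeros(3)
        e[i] = h
        q[i] = (phi(beta + e) - phi(beta - e)) / (2 * h)
    return q

def entropy(beta):
    """S = <beta, Q> - Phi(beta)."""
    return beta @ heat(beta) - phi(beta)

def rotation(axis, angle):
    """Rodrigues formula for a rotation matrix in SO(3)."""
    k = axis / np.linalg.norm(axis)
    K = np.array([[0.0, -k[2], k[1]],
                  [k[2], 0.0, -k[0]],
                  [-k[1], k[0], 0.0]])
    return np.eye(3) + np.sin(angle) * K + (1.0 - np.cos(angle)) * (K @ K)

beta = np.array([0.3, -1.2, 0.7])
R = rotation(np.array([1.0, 2.0, -1.0]), 0.9)
beta_rot = R @ beta            # Ad_g(beta) under the identification so(3) ~ R^3

print(entropy(beta), entropy(beta_rot))      # entropy is unchanged by the group action
print(heat(beta_rot) - R @ heat(beta))       # heat is equivariant: Q(Ad_g beta) = Ad*_g Q
```

In this equivariant example the heat transforms as Q(Ad_g β) = Ad*_g Q with no additional cocycle term, and the entropy takes the same value at β and Ad_g(β).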

In the case of small data analytics, we propose to parameterize the (maximum entropy) Gibbs density with higher order “geometric” temperatures β_{k} and higher order heats Q_{k}, which parameterize the higher order entropy S (Q_{1}, \dots, Q_{n}) and the dual potential function Φ (β_{1}, \dots, β_{n}):

\begin{array}{l} S (Q_{1}, \dots, Q_{n}) = \sum_{k = 1}^{n} 〈 β_{k}, Q_{k} 〉 - Φ (β_{1}, \dots, β_{n}) \\ with β_{k} = \frac{\partial S (Q_{1}, \dots, Q_{n})}{\partial Q_{k}} and Q_{k} = \frac{\partial Φ (β_{1}, \dots, β_{n})}{\partial β_{k}} \\ where Φ (β_{1}, \dots, β_{n}) = - \log \int_{M} e^{- \sum_{k = 1}^{n} 〈 β_{k}, U^{k} (ξ) 〉} d ω \end{array}

(14)
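A minimal numerical sketch of Equation (14) for n = 2 is given below. It assumes M = R (truncated to a finite quadrature grid), U^1(ξ) = ξ and U^2(ξ) = ξ^2, so that the Gibbs density is Gaussian and the result can be compared with known closed forms; these choices and the helper names are assumptions for illustration only.

```python
import numpy as np

# Quadrature grid standing in for M = R (assumption: the tails beyond |xi| = 12 are negligible).
xi = np.linspace(-12.0, 12.0, 20001)
d_xi = xi[1] - xi[0]

def phi(b1, b2):
    """Phi(beta_1, beta_2) = -log int exp(-b1*xi - b2*xi^2) d(xi), for b2 > 0."""
    return -np.log(np.sum(np.exp(-b1 * xi - b2 * xi**2)) * d_xi)

def heats(b1, b2, h=1e-5):
    """Higher order heats Q_k = dPhi/dbeta_k, by central finite differences."""
    q1 = (phi(b1 + h, b2) - phi(b1 - h, b2)) / (2 * h)
    q2 = (phi(b1, b2 + h) - phi(b1, b2 - h)) / (2 * h)
    return q1, q2

b1, b2 = 0.8, 0.5
q1, q2 = heats(b1, b2)
S = b1 * q1 + b2 * q2 - phi(b1, b2)      # higher order entropy of Equation (14)

# Direct moments of the Gibbs density p = exp(Phi - b1*xi - b2*xi^2).
p = np.exp(phi(b1, b2) - b1 * xi - b2 * xi**2)
m1 = np.sum(xi * p) * d_xi               # first moment  E[xi]
m2 = np.sum(xi**2 * p) * d_xi            # second moment E[xi^2]

print(q1, m1)
print(q2, m2)
print(S)
```

With these temperatures the density is a Gaussian of mean -β_1/(2β_2) and variance 1/(2β_2), so the heats Q_1 and Q_2 coincide with its first and second moments and S coincides with the Gaussian differential entropy.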

We will show in this paper that the geometric approach to statistical thermodynamics introduced by Souriau offers an advantage over traditional formulations. Classical thermodynamics was developed for static systems, taking into account only the time evolution, but for dynamical systems (e.g., a centrifuge), this statistical physics is no longer valid because the Gibbs density (the maximum entropy density) is not covariant. When the only symmetry is time translation, only the energy is preserved, but for dynamical systems on which a group acts, the invariants are given by the components of the “moment map” (a geometrization of the Noether theorem, which provides invariants whenever there are symmetries). The “moment map” was introduced in parallel by Kostant in mathematics and by Souriau in physics. Souriau developed the non-equivariant case and applied it to statistical mechanics. The main advantage of “Lie groups thermodynamics” of dynamical systems is that this statistical physics is a coordinate-free model preserving invariances with respect to the action of the (dynamical) Lie group acting on the system. In the appendix, we give the development of centrifuge thermodynamics with the classical approach given by Roger Balian, and show that with the Souriau approach the problem is solved simply by applying the Lie groups thermodynamics equations through a moment map computation, whereas in the classical case one must introduce additional terms for all the moments (energy, angular momentum, …) through additional Lagrange hyper-parameters, which correspond to the components of Souriau’s “geometric” (Planck) temperature.

Before developing all these models, and because this topic requires transverse knowledge of many concepts developed in different disciplines, such as statistical physics and thermodynamics, information geometry, symplectic mechanics and multi-symplectic geometry, we suggest that readers first study the following books and papers:

Introduction to Statistical Physics and Thermodynamics: [2,3,23,24,25,26]
Introduction to Higher Order Thermodynamics: [27,28,29,30,31,32,33,34,35,36,37,38,39,40]
Introduction to Information Geometry: [9,10,11,15,16,17,18,19,20,21,22,41,42]
Introduction to Symplectic Mechanics: [43,44,45,46,47]
Introduction to Multi-Symplectic Geometry: [48,49,50,51,52,53,54,55]

The geometric definition and extension of the Fisher metric have recently been studied in the framework of quantum information geometry, but this community seems unaware of Souriau’s work in the 70s on Lie groups thermodynamics for the study of the statistical physics of dynamical systems based on symplectic geometry and co-homology tools, and in particular of the non-equivariant case developed by Souriau and Koszul. We can refer to the following recent works on the symplectic formulation of Fisher information theory [56,57,58,59].

The structure of the paper is the following:

In Section 1, we introduce the seminal ideas of symplectic geometry used in mechanics and in statistical mechanics, as introduced by Jean-Marie Souriau during the 60s. Starting from previous work of François Gallissot extending Cartan’s results on the integral invariant (a theorem on the types of differential forms generating the equations of motion of a material point that are invariant under the transformations of the Galilean group), we present the Lagrange 2-form and moment map elaborated by Souriau to build a geometric mechanics theory, in which a dynamical system is represented by a foliation of the evolution space, determined by an antisymmetric covariant second order tensor. Souriau applied this tool to statistical mechanics to build a thermodynamics of dynamical systems, where the classical notion of Gibbs canonical ensemble is extended to a homogeneous symplectic manifold on which a Lie group (dynamical group) has a symplectic action. In the case of the Galileo group, the symmetry is broken, and new “co-homological” Souriau relations should be verified in the Lie algebra of the group.
In Section 2, we synthesize results on higher order thermodynamics based on higher order temperatures and heats, as introduced by Ingarden and Jaworski for mesoscopic systems. This model is based on a higher order maximum entropy Gibbs density definition that constrains the solution with respect to higher order moments.
In Section 3, we develop the “Lie groups thermodynamics” model, designed to describe the Gibbs state for dynamical systems, where Souriau introduced the concept of the co-adjoint action of a group on its momentum space, which allows physical observables like energy, heat and momentum or moment to be defined as pure geometrical objects. The Souriau model then generalizes the Gibbs equilibrium state to all symplectic manifolds that have a dynamical group, with a “geometric” (Planck) temperature as an element of the Lie algebra and a “geometric heat” as an element of the dual Lie algebra. We have observed that Souriau introduced a symmetric tensor that is an extension of the classical Fisher metric in information geometry. This new Souriau-Fisher metric is invariant with respect to the action of the group. These equations are universal, because they do not depend on the symplectic manifold but only on the dynamical group and its associated two-cocycle. Souriau called this model “Lie groups thermodynamics”.
In Section 4, we give an extended Koszul study of Souriau’s non-equivariant model associated to a class of co-homology. Koszul deepened the Souriau model, considering purely algebraic and geometric developments of geometric mechanics. Koszul defined a skew symmetric bilinear form by a closed expression depending only on the cocycle and related to the Souriau antisymmetric bilinear map introduced previously in Section 3. This Koszul study of the non-equivariance of the moment map, and of the existence of an affine action of G on g*, is the cornerstone of the Souriau theory of Lie groups thermodynamics.
In Section 5, at this stage of the presentation of Souriau Lie groups thermodynamics, we introduce a generalized Souriau definition of entropy as the Legendre transform of the logarithm of the Laplace transform, making the connection with information geometry. This definition is a general contexture that can be extended to highly abstract spaces while preserving the Legendre structure, provided that we are able to generalize the Laplace transform.
In Section 6, we illustrate Souriau’s Lie groups thermodynamics for a centrifuge system. The main idea of Souriau was to define the Gibbs states for one-parameter subgroups of the Galilean group, because he proved that the action of the full Galilean group on the space of motions of an isolated mechanical system is not related to any equilibrium Gibbs state (the open subset of the Lie algebra associated to this Gibbs state is empty).
In Section 7, we define a higher-order model of Lie groups thermodynamics based on a poly-symplectic vector-valued approach. This multi-symplectic extension is based on a multi-valued one that preserves the notion of (poly-)moment map built by Günther on an n-symplectic model. We replace the symplectic form of the Souriau model by a vector-valued form that is called poly-symplectic. We consider the non-equivariance of the poly-moment map by introducing a poly-cocycle. We finally conclude with the poly-symplectic extension of the definition of the Souriau-Fisher metric.
In Section 8, we conclude with potential extension to Lie group machine learning.

To facilitate understanding of previous results, we add some additional complements:

In Appendix A, we recall a synthesis of Günther’s poly-symplectic model with its initial notation.
In Appendix B, we develop the computation of the Fisher metric for the multivariate Gaussian density, to establish links with Souriau’s Lie groups Gibbs density model.
In Appendix C, we give more details on the Legendre transform, the basic tool of information geometry and Souriau Lie groups thermodynamics. In particular, we give the projective geometry definition of the Legendre transform by Chasles, as a reciprocal polar with respect to a paraboloid.
In Appendix D, we give the solution of the thermodynamics of a centrifuge system, given by Roger Balian based on a classical approach, to make the link with the Souriau approach.
In Appendix E, we recall the main proofs of Souriau’s Lie groups thermodynamics and its poly-symplectic extension.
In Appendix F, we present another Souriau statistical physics model, developed for the relativistic thermodynamics of continua, which preserves the Legendre transform and where the temperature is given by a Killing vector.

2. Seminal Idea of Symplectic Geometry in Mechanics and in Statistical Mechanics by Gallissot and Souriau

The symplectic structure was introduced in mathematics much earlier than the word “symplectic”, in the works of the French physicist Joseph Louis Lagrange (see his paper on the slow changes of the orbital elements of planets in the solar system), who showed that this geometry is a fundamental tool in the mathematical model of any problem in mechanics. Jean-Marie Souriau has shown that Lagrange’s parentheses (nowadays called Lagrange brackets) are the components of the canonical symplectic 2-form on the manifold of motions of the mechanical system, in the chart of that manifold [60,61].

Jean-Marie Souriau, who graduated from ENS Ulm in 1942, was the nephew of the philosopher Etienne Souriau (graduated from ENS Ulm in 1912, ranked first at the agrégation, a collaborator of Gaston Bachelard at the Sorbonne in Paris, PhD supervisor of the filmmaker Eric Rohmer), author of “Les Structures de l’oeuvre d’art”, and the grandson of the philosopher Paul Souriau (graduated from ENS Ulm in 1873), author of “Esthétique du mouvement” and of a Latin thesis « De motus perceptione »; both worked on “aesthetics”. He was also the grand-nephew of the literary historian Maurice Souriau, the editor of a critical edition of Blaise Pascal’s “Pensées” (awarded four prizes by the Académie Française). Paul, Etienne and Jean-Marie Souriau were all motivated to explore aesthetic questions about “motion structures” (which we could summarize by the triptych: the Aesthetics of Motion of Paul Souriau, the Structure of Aesthetics of Etienne Souriau and the Structure of Motion of Jean-Marie Souriau). Jean-Marie Souriau’s book “Structure des Systèmes Dynamiques” (SSD) was elaborated in Carthage and Marseille, where Souriau lived with his wife Christiane Souriau-Hoebrecht. In 1952, Souriau obtained a position at the Institut des Hautes Études de Tunis (8 rue de Rome, Tunis) (see Figure 1), and in 1958 he returned to Marseille to take a position at the Faculté des Sciences. The manuscript was given to the publisher Dunod in 1969 but was published only in 1970 (2019 is the 50th anniversary of this book, and tributes will be given at two events, FGSI’19 [62] and SOURIAU 2019 [63]).

As for the source of the book’s title, it was written at the apogee or “acme” of STRUCTURALISM in anthropology, sociology, linguistics, philosophy and epistemology in France (Levi-Strauss, Barthes, Foucault, Althusser, Lacan, …). The word “structure” was in the air at the time, fashionable and on everyone’s lips, as described by François Dosse in “Histoire du structuralisme I & II”. After Souriau’s PhD defence at ONERA in 1953 (I have a copy of his PhD), his PhD supervisor André Lichnerowicz made one comment: “you have many anti-symmetrical forms in your calculations, you should be interested in symplectic structures”.

As early as 1966, influenced by François Gallissot’s work, Souriau applied his theory of geometric mechanics to statistical mechanics, in what he called “Lie groups thermodynamics”, developed in Chapter IV of his book “Structure of Dynamical Systems” [43,64]. We have discovered that Souriau and Gallissot both attended the 1954 International Congress of Mathematicians (ICM’54) in Amsterdam. We could assume that they discussed Gallissot’s 1952 paper introducing three types of differential forms generating the equations of motion of a material point that are invariant under the transformations of the Galilean group, and their links with the Poincaré-Cartan integral invariant. This seminal work of Gallissot helped Souriau to formulate his new geometric mechanics and its extension to geometric statistical physics. Using Lagrange’s viewpoint, in Souriau statistical mechanics, a statistical state is a probability measure on the manifold of motions. As we can read in his book, Souriau was influenced by François Gallissot in introducing the Lagrange(-Souriau) 2-form.

In place of the classical mechanical equations of a material point subjected to a force F, defined by its mass m and its position r at time t, the second order differential equation m \frac{d^{2} r}{d t^{2}} = F is rewritten as a system of first order differential equations in the phase space (\begin{matrix} r \\ v \end{matrix}):

m \frac{d v}{d t} = F \text{ and } v = \frac{d r}{d t}

(15)

If the force F is derived from a potential w, we have classical equations:

\begin{array}{l} \begin{cases} L = \frac{1}{2} m v^{2} - w & \text{(Lagrangian)} \\ H = \frac{1}{2} m v^{2} + w & \text{(Hamiltonian)} \end{cases} \\ \text{with } A = \int_{t_{0}}^{t_{1}} L d t \end{array} \text{ and Hamilton's equations } \begin{cases} \frac{d q_{i}}{d t} = \frac{\partial H}{\partial p_{i}} \\ \frac{d p_{i}}{d t} = - \frac{\partial H}{\partial q_{i}} \end{cases} \text{ with } \begin{cases} r = [\begin{matrix} q_{1} \\ q_{2} \\ q_{3} \end{matrix}] \\ m v = [\begin{matrix} p_{1} \\ p_{2} \\ p_{3} \end{matrix}] \end{cases}

(16)
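The canonical equations dq/dt = ∂H/∂p and dp/dt = -∂H/∂q of Equation (16) can be integrated numerically. The sketch below assumes a one-dimensional harmonic potential w(q) = k q^2 / 2 with hypothetical values of m and k, and uses the symplectic Euler scheme, which is one possible integrator among many and is not prescribed by the text.

```python
import numpy as np

m, k = 1.0, 4.0   # hypothetical mass and spring constant, with w(q) = k*q^2/2

def dH_dq(q):
    """Partial derivative of H = p^2/(2m) + w(q) with respect to q."""
    return k * q

def dH_dp(p):
    """Partial derivative of H with respect to p (the velocity v = p/m)."""
    return p / m

def hamiltonian(q, p):
    """H = (1/2) m v^2 + w, written in terms of p = m v."""
    return p**2 / (2 * m) + 0.5 * k * q**2

def symplectic_euler(q, p, dt, steps):
    """Integrate dq/dt = dH/dp, dp/dt = -dH/dq with the symplectic Euler scheme."""
    for _ in range(steps):
        p = p - dt * dH_dq(q)   # momentum update uses the current position
        q = q + dt * dH_dp(p)   # position update uses the new momentum
    return q, p

q0, p0 = 1.0, 0.0
period = 2 * np.pi * np.sqrt(m / k)
q1, p1 = symplectic_euler(q0, p0, dt=period / 10000, steps=10000)

print(q1, p1)                                        # state after one period
print(hamiltonian(q1, p1) - hamiltonian(q0, p0))     # energy drift over one period
```

After one period of the oscillator the state returns close to its initial value and the energy drift stays small, which is the expected behaviour for a Hamiltonian flow discretized with a symplectic scheme.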

The idea of Lagrange, rediscovered by Souriau, was to treat time t like the other variables. One should then use the 7-dimensional space V (evolution space) (see Figure 2):

y = (\begin{matrix} t \\ r \\ v \end{matrix})

(17)

Classical system of first order differential equations in phase space can then be rewritten in evolution space V by the homogeneous form:

\begin{cases} m δ v - F δ t = 0 \\ δ r - v δ t = 0 \end{cases}

(18)

At each point y of V, these equations define the tangent direction to the curve x described by the point y during the evolution of the system. These curves are the leaves (lines of force) of the field of directions defined by the equations of the homogeneous form, as defined for foliated manifolds. See [43], for more details on definition of the different derivatives used.

A dynamical system is then represented by a foliation of the evolution space, where the foliation is determined by an antisymmetric covariant second order tensor, denoted by σ and called the Lagrange-Souriau 2-form. The components of this tensor are expressions known as Lagrange brackets. σ is considered as a bilinear operator on tangent vectors of V. If we choose two such vectors:

δ y = (\begin{matrix} δ t \\ δ r \\ δ v \end{matrix}) \text{ and } δ' y = (\begin{matrix} δ' t \\ δ' r \\ δ' v \end{matrix})

(19)

σ associates to them an antisymmetric scalar product:

σ (δ y) (δ' y) = 〈 m δ v - F δ t, δ' r - v δ' t 〉 - 〈 m δ' v - F δ' t, δ r - v δ t 〉

(20)

In the Souriau-Lagrange model, σ is a 2-form on the evolution space V, and the differential equation of motion δ y \in ε implies:

σ (δ y) (δ' y) = 0, \forall δ' y

(21)

which can be written as:

σ (δ y) = 0 \text{ or } δ y \in \ker (σ)

(22)
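Equations (18), (20) and (22) can be checked numerically: a tangent vector satisfying the homogeneous equations of motion lies in the kernel of σ. The following sketch assumes a hypothetical mass and a constant force (uniform gravity); the function names are illustrative only.

```python
import numpy as np

m = 2.0                                   # hypothetical mass
F = np.array([0.0, 0.0, -9.81 * m])       # hypothetical constant force (uniform gravity)

def sigma(y, dy, dy2):
    """Lagrange-Souriau 2-form of Equation (20) at y = (t, r, v) in the 7-dimensional space V."""
    v = y[4:7]
    dt, dr, dv = dy[0], dy[1:4], dy[4:7]
    dt2, dr2, dv2 = dy2[0], dy2[1:4], dy2[4:7]
    return (np.dot(m * dv - F * dt, dr2 - v * dt2)
            - np.dot(m * dv2 - F * dt2, dr - v * dt))

def motion_direction(y):
    """Tangent direction solving Equation (18): (dt, dr, dv) = (1, v, F/m)."""
    v = y[4:7]
    return np.concatenate(([1.0], v, F / m))

rng = np.random.default_rng(1)
y = rng.normal(size=7)                                  # arbitrary point of V
a, b = rng.normal(size=7), rng.normal(size=7)           # arbitrary tangent vectors

print(sigma(y, a, b) + sigma(y, b, a))                  # antisymmetry: the sum vanishes
print(sigma(y, motion_direction(y), b))                 # motion direction lies in ker(sigma)
```

The second printed value vanishes for any choice of the second tangent vector, which is the statement δy ∈ ker(σ) of Equation (22).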

For study of this Souriau-Lagrange 2-form, readers should see the papers of Obădeanu [65,66,67].

Souriau observed that this 2-form was introduced by Lagrange, in a different language, in his study of celestial mechanics in 1808. Souriau was also influenced by François Gallissot, who used this 2-form in [68,69]. We will see in the following Souriau’s “moment map μ” in the dual Lie algebra of the group G, and the study of the coadjoint orbits of G. For the definition of the moment map, we refer to [45]. Souriau extended this model to thermodynamics. For this new phenomenological approach to mechanics, thermodynamics and information theory, we can refer to Souriau’s introduction to his paper “Quantique? Alors c’est géométrique” [70] and to a video of his talk [71]:

“Plaçons-nous d’abord dans le cadre de la mécanique classique. Étudions un système mécanique isolé, non dissipatif—nous dirons brièvement une «chose». L’ensemble des mouvements de cette «chose» est une variété symplectique. Pourquoi? Il suffit de se reporter à la Mécanique Analytique de Lagrange (1811); l’espace des mouvements y est traité comme variété différentiable; les coordonnées covariantes et contravariantes de la forme symplectique y sont écrites (Ce sont les “parenthèses“ et “crochets“ de Lagrange). Évoquons maintenant la géométrie du 20 éme siècle. Soit G un groupe difféologique (par exemple un groupe de Lie); μ un moment de G (un moment, c’est une 1-forme invariante à gauche sur G); alors l’action du groupe sur μ engendre canoniquement un espace symplectique (ces groupes pourront avoir une dimension infinie). Présomption épistémologique: derrière chaque «chose» est caché un groupe G (sa “source“), et les mouvements de la «chose» sont simplement des moments de G (doublet latin mnémotechnique : momentum-movimentum). L’isolement de la «chose» indique alors que le groupe de Poincaré (respectivement de Galilée-Bargman) est inséré dans G; voilà l’origine des grandeurs conservées relativistes (respectivement classiques) associées à un mouvement x: elles constituent simplement le moment induit sur le groupe spacio-temporel par le moment-mouvement x.” (In English: Let’s put ourselves first in the framework of classical mechanics. Let’s study an isolated, non-dissipative mechanical system—we will briefly say a “thing”. The set of movements of this “thing” is a symplectic manifold. Why? It is enough to refer to the Analytical Mechanics of Lagrange (1811); the space of movements is treated as a differentiable manifold; the covariant and contravariant coordinates of the symplectic form are written there (these are the “parentheses” and “brackets” of Lagrange). Let’s now talk about the geometry of the 20th century. 
Let G be a diffeological group (for example a Lie group); μ a moment of G (a moment is a left invariant 1-form on G); then the action of the group on μ canonically generates a symplectic space (these groups can have an infinite dimension). Epistemological presumption: behind each “thing” is hidden a group G (its “source”), and the movements of the “thing” are simply moments of G (mnemonic Latin doublet: momentum-movimentum). The isolation of the “thing” then indicates that the group of Poincaré (respectively Galileo-Bargman) is inserted in G; here is the origin of the relativistic (respectively classical) conserved magnitudes associated with a movement x: they simply constitute the moment induced on the spacio-temporal group by the moment-motion x.)

“Il y a un théorème qui remonte au XXème siècle. Si on prend une orbite coadjointe d’un groupe de Lie, elle est pourvue d’une structure symplectique. Voici un algorithme pour produire des variétés symplectiques: prendre des orbites coadjointes d’un groupe. Donc cela laisse penser que derrière cette structure symplectique de Lagrange, il y avait un groupe caché. Prenons le mouvement classique d’un moment du groupe, alors ce groupe est très «gros» pour avoir tout le système solaire. Mais dans ce groupe est inclus le groupe de Galilée, et tout moment d’un groupe engendre des moments d’un sous-groupe. On va retrouver comme cela les moments du groupe de Galilée, et si on veut de la mécanique relativiste, cela va être du groupe de Poincaré. En fait avec le groupe de Galilée, il y a un petit problème, ce ne sont pas les moments du groupe de Galilée qu’on utilise, ce sont les moments d’une extension centrale du groupe de Galilée, qui s’appelle le groupe de Bargman, et qui est de dimension 11. C’est à cause de cette extension, qu’il y a cette fameuse constante arbitraire figurant dans l’énergie. Par contre quand on fait de la relativité restreinte, on prend le groupe de Poincaré et il n’y a plus de problèmes car parmi les moments il y a la masse et l’énergie c’est mc². Donc le groupe de dimension 11 est un artéfact qui disparait, quand on fait de la relativité restreinte.” (In English: There is a theorem dating back to the twentieth century. If we take a coadjoint orbit of a Lie group, it is provided with a symplectic structure. Here is an algorithm to produce symplectic manifolds: take coadjoint orbits from a group. So it suggests that behind this symplectic structure of Lagrange, there was a hidden group. Take the classic movement of a moment of the group, so this group is very “big” to have the whole solar system. But in this group is included the Galileo group, and any moment of a group generates moments of a subgroup. 
We will find like that the moments of the group of Galileo, and if we want relativistic mechanics, it will be Poincaré group. In fact with Galileo group, there is a small problem, it is not the moments of the Galileo group that are used, it is the moments of a central extension of the Galileo group, which is called the Bargman group, and that is of dimension 11. It is because of this extension, that there is this famous arbitrary constant appearing in the energy. On the other hand, when we do special relativity, we take Poincaré group and there are no more problems because among the moments there is the mass and the energy is mc². So the 11-dimensional group is an artifact that disappears, when we do special relativity.)

François Gallissot has observed that in his famous lessons on integral invariants, Elie Cartan has shown that all the properties of the differential equations of the dynamics of holonomic systems result from the existence of the integral invariant:

\int ω with ω = \sum_{i} p_{i} d q_{i} - H d t

(23)

Thus every holonomic system whose forces derive from a force function is associated to a form

ω

, the equations of motion being the characteristics of the exterior form

d ω

. Around 1950, the theory of exterior forms on differentiable manifolds has been established on new foundations under the influence of topologists. The question was then to wonder:

if classical mechanics could benefit from these models by placing an exterior form of degree two at its base
if, thanks to the notion of manifold, the notion of connection could be introduced in a more natural way
if the paradoxical indeterminations/impossibilities in the Lagrangian framework could be explained more clearly
if the problem of integration of the equations of motion, generated by a form Ω of degree two, could be clarified.

To reach these various objectives, Gallissot first re-examined the logical bases on which Galilean mechanics is built. He thus showed that, when one seeks forms that generate the equations of motion of a material point and are invariant under the transformations of the Galilean group, the most interesting one is an exterior form of degree two defined on a manifold

E^{3} \times E \times T

(

E^{3}

Euclidean space,

T

temporal). Gallissot had shown that any holonomic parametric system with n degrees of freedom is associated with a form

Ω

of degree 2n defined on a differentiable manifold whose characteristics are the equations of the movement. This form is expressed by means of 2n Pfaff forms and by dt, the Hamiltonian form being a simple special case. He gave a summary of how we can get rid of the servitude of coordinates in the study of dynamical systems and the important role played by the operator

i ()

(the antiderivation introduced by Cartan), the characteristic field E of the form

Ω

being defined by the relation

i (E) Ω = 0

. Gallissot has then introduced the following theorem:

Theorem 1.

There are three types of differential forms generating equations of movement of a material point invariant in the transformations of the Galilean group:

\begin{array}{l} A : \begin{cases} s = \frac{1}{2 m} \sum_{i = 1}^{3} {(m d v_{i} - F_{i} d t)}^{2} \\ e = \frac{m}{2} \sum_{j = 1}^{3} {(d x_{j} - v_{j} d t)}^{2} \end{cases} \\ B : f = \sum_{1}^{3} δ_{i j} (d x_{i} - v_{i} d t) (m d v_{j} - F_{j} d t) with δ_{i j} the Kronecker symbol \\ C : ω = \sum_{1}^{3} δ_{i j} (m d v_{i} - F_{i} d t) \land (d x_{j} - v_{j} d t) \end{array}

(24)

If we consider the last form “C”:

ω = \sum_{1}^{3} δ_{i j} (m d v_{i} - F_{i} d t) \land (d x_{j} - v_{j} d t) = m δ_{i j} d v_{i} \land d x_{j} - m δ_{i j} v_{i} d v_{j} \land d t + δ_{i j} F_{i} d x_{j} \land d t

(25)

The condition

d ω = 0

constrains the Pfaff form

δ_{i j} F_{i} d x_{j}

to be closed, and thus to reduce to the differential of a function

U

:

ω = m δ_{i j} d v_{i} \land d x_{j} - d H \land d t

(26)

with H = T - U and T = \frac{1}{2} \sum_{i = 1}^{3} m {(v_{i})}^{2}

(27)
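The passage from (25) to (26) can be spelled out term by term. The short derivation below is only a sketch: it assumes that the closed Pfaff form is written as the differential of the force function U, and that the mass m is constant.

```latex
\begin{aligned}
m\,\delta_{ij}\, v_i\, dv_j \wedge dt
  &= d\Big(\tfrac{1}{2}\sum_{i=1}^{3} m\, v_i^{2}\Big) \wedge dt = dT \wedge dt, \\
\delta_{ij}\, F_i\, dx_j \wedge dt
  &= dU \wedge dt, \\
\omega
  &= m\,\delta_{ij}\, dv_i \wedge dx_j - dT \wedge dt + dU \wedge dt
   = m\,\delta_{ij}\, dv_i \wedge dx_j - d(T - U) \wedge dt .
\end{aligned}
```

With H = T - U this is exactly the expression (26).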

This shows that

ω

is exact, being the exterior derivative of a 1-form

ω^{*}

:

ω = d ω^{*} with ω^{*} = \sum_{i = 1}^{3} m v_{i} d x_{i} - H d t

(28)

The form

ω^{*}

generates the Elie Cartan integral invariant (23), with p_{i} = m v_{i}.

In Chapter IV of his book, Souriau applied this model, based on symplectic geometry, to statistical mechanics. Souriau observed that Gibbs equilibrium is not covariant with respect to the dynamical groups of physics. To resolve this symmetry breaking, Souriau introduced a new “geometric theory of heat” where the equilibrium states are indexed by a parameter

β

with values in the Lie algebra of the group, generalizing the Gibbs equilibrium states, where

β

plays the role of a geometric (Planck) temperature. Souriau observed that the group of time translations of classical thermodynamics is not a normal subgroup of the Galileo group, which shows that a dynamical system that is conservative in one inertial reference frame need not be conservative in another. Based on this fact, Souriau generalized the formulation of the Gibbs principle to make it compatible with Galilean relativity in classical mechanics and with Poincaré relativity in relativistic mechanics. The maximum entropy principle is preserved, and the Gibbs density is still given by the density of maximum entropy (among the equilibrium states for which the average value of the energy takes a prescribed value, the Gibbs measures are those which have the largest entropy), but with a new principle: “If a dynamical system is invariant under a Lie subgroup G’ of the Galileo group, then the natural equilibria of the system form the Gibbs ensemble of the dynamical group G’”. The classical notion of Gibbs canonical ensemble is extended to a homogeneous symplectic manifold on which a Lie group (the dynamical group) has a symplectic action. In the case of the Galileo group, the symmetry is broken, and new “cohomological” relations must be satisfied in the Lie algebra of the group. A natural equilibrium state is thus characterized by an element of the Lie algebra of the Lie group, determining the equilibrium temperature

β

. The entropy

s (Q)

, parametrized by

Q

the geometric heat (mean of energy

U

, element of the dual Lie algebra) is defined by the Legendre transform of the Massieu potential given by

Φ (β)

, parametrized by

β

(

Φ (β)

is the minus logarithm of the partition function

ψ_{Ω} (β)

):

s (Q) = 〈 β, Q 〉 - Φ (β) with Φ (β) = - \log \int_{M} e^{- 〈 β, U (ξ) 〉} d ω, Q = \frac{\partial Φ}{\partial β} \in g^{*} and β = \frac{\partial s}{\partial Q} \in g

(29)

Souriau proposed to study statistical mechanics from the new point of view of symplectic geometry, completing the work of Poincaré and Cartan on integral invariants, reinventing the Lagrangian symplectic form in place of the classical variational formulation, and geometrizing the Noether theorem with the moment map as the new conserved quantity. Firstly, Souriau’s Lie groups thermodynamics gives a geometrical status to the (Planck) temperature and to the entropy, with a new general definition of the Fisher metric. Secondly, Souriau’s relativistic thermodynamics of continua provides a geometrization of the second principle through the permanence of the entropy current, whose flux has positive divergence [13,14,72,73,74]. This second model of Souriau’s thermodynamics is described in the Appendix. Other authors have studied this relativistic thermodynamics of continua [75,76,77,78,79,80,81,82].

While Ingarden [83,84], Mrugala [85,86,87,88,89] and Arnold [90] have worked since the 1980s on giving geometric structures to thermodynamics, Souriau’s Lie groups thermodynamics was ignored for more than 50 years until it was recently recovered in [23,91].

3. Higher Order Thermodynamics Based on Higher Order Temperatures

We will generalize Souriau’s theory [43,64], reconsidered in [23] and linked to information geometry in [91], in the framework of higher order thermodynamics as introduced by Ingarden [29,30,31] and Jaworski [32,33,34,35] for mesoscopic systems. We also refer to other publications by Ingarden [36,37,38,39,40], Jaworski [92,93,94] and Nakagomi [95] on higher order thermodynamics. The Gibbs canonical state results from the maximum entropy principle when the statistical mean value of the energy is assumed to be known. A Polish school has studied maximum entropy inference with higher-order moments of the energy (when not only mean values but also higher-order statistical moments of some physical quantities are taken into account). Ingarden in 1963 and Jaworski in 1981 introduced the concept of second and higher-order temperatures, by assuming a distribution function that includes information not only on the average of the energy but also on higher-order moments, in particular the second moment related to fluctuations. This case should be considered in situations where fluctuations are not negligible, such as near phase transitions or critical points, in metastable states, or in systems with a small number of degrees of freedom. Ingarden’s idea is that if we can measure more details, such as the first n cumulants of the energy, we can then introduce n higher-order temperatures, as the Lagrange multipliers obtained when the entropy is maximized under these constraints:

P_{(β_{1}, β_{2})} = \frac{1}{Z (β_{1}, β_{2})} e^{- β_{1} . H - β_{2} {(H - U)}^{2}} = e^{β_{0} - β_{1} . H - β_{2} {(H - U)}^{2}}

(30)

Ingarden proposed that if we can measure the second cumulant of the energy (the fluctuation of the energy), the equilibrium state is not the canonical state, but requires two temperatures. Ingarden argues that for a macroscopic system there is very little difference between the two states, and that a mesoscopic or microscopic system would be needed to detect the higher-order temperature. Jaworski [27,28] has shown that the contribution to the total entropy arising from the extra information carried by the higher-order moments is o(N) when N tends to infinity with the ratio N/V held constant, where N is the number of particles and V the volume. The main result of Jaworski is that, from a purely thermodynamic point of view, the information corresponding to the higher-order moments of extensive physical quantities is not essential and can be neglected in the maximum entropy procedure. Jaworski showed that maximum entropy inference has a certain stability property with respect to information corresponding to higher-order moments of extensive quantities. This can serve as an argument in favor of the maximum entropy method in statistical physics, and helps to understand why these methods are successful. Streater [96] preferred to say that the states with generalized temperatures are not in equilibrium, assuming that the final state, at large times, will be the canonical or grand canonical state depending on mixing properties. Streater [96] contends that this occurs even for a mesoscopic system, such as a few atoms, adding that his approach is equivalent to the Ingarden model if the relaxation time from the state with generalized temperatures to the final equilibrium is very long.

Some examples of higher order maximum Entropy are given by Ingarden:

● 1st Example of Higher Order Maximum Entropy Density:

Density of maximum Entropy

S (P) = - \int_{- \infty}^{+ \infty} P (x) \log P (x) d x

(31)

under the constraints:

P (x) \geq 0, \int_{- \infty}^{+ \infty} P (x) d x = 1 and E (x^{2 n}) = \int_{- \infty}^{+ \infty} x^{2 n} P (x) d x = σ^{2 n}

(32)

is given by:

P (x) = \frac{1}{2 {(2 n)}^{\frac{1}{2 n}} σ . Γ (1 + 1 / 2 n)} \exp (- \frac{x^{2 n}}{2 n σ^{2 n}}) = f_{n} (x)

(33)

with the following parameters:

β_{n} = \frac{1}{2 n σ^{2 n}}, Z (β_{n}) = \frac{2 Γ (1 + 1 / 2 n)}{β_{n}^{1 / 2 n}}, S (P) = \log Z (β_{n}) + \frac{1}{2 n}

(34)

where:

E (x^{2 k - 1}) = 0 and - \frac{\partial \log Z (β_{k})}{\partial β_{k}} = σ^{2 k} = E (x^{2 k}) = \frac{{(2 n)}^{k / n} σ^{2 k} Γ (1 + (2 k + 1) / 2 n)}{(2 k + 1) Γ (1 + 1 / 2 n)}

(35)

We illustrate this higher order maximum entropy density in Figure 3.
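The closed forms (33) and (34) can be checked numerically. The following is a minimal sketch, not taken from the source: the function names, the midpoint quadrature and the truncation of the real line to a finite interval are all illustrative assumptions.

```python
import math

def log_f_n(x, n, sigma):
    """Log of the density in Equation (33), for the constraint E(x^{2n}) = sigma^{2n}."""
    z = 2.0 * (2 * n) ** (1.0 / (2 * n)) * sigma * math.gamma(1.0 + 1.0 / (2 * n))
    return -(x ** (2 * n)) / (2 * n * sigma ** (2 * n)) - math.log(z)

def integrate(g, lo, hi, steps=20_000):
    """Composite midpoint rule on [lo, hi]."""
    h = (hi - lo) / steps
    return h * sum(g(lo + (i + 0.5) * h) for i in range(steps))

def summary(n, sigma, half_width=10.0):
    """Return (total mass, E(x^{2n}), entropy S, log Z + 1/(2n)) for the density f_n."""
    # Assumption: the tails beyond half_width * sigma are negligible.
    lo, hi = -half_width * sigma, half_width * sigma
    f = lambda x: math.exp(log_f_n(x, n, sigma))
    mass = integrate(f, lo, hi)
    moment = integrate(lambda x: x ** (2 * n) * f(x), lo, hi)
    entropy = -integrate(lambda x: f(x) * log_f_n(x, n, sigma), lo, hi)
    # Parameters of Equation (34)
    beta_n = 1.0 / (2 * n * sigma ** (2 * n))
    log_z = math.log(2.0 * math.gamma(1.0 + 1.0 / (2 * n)) / beta_n ** (1.0 / (2 * n)))
    return mass, moment, entropy, log_z + 1.0 / (2 * n)
```

For n = 1 the density reduces to the Gaussian of variance σ², which gives a quick sanity check of the normalization constant.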

● 2nd Example of Higher Order Maximum Entropy Density:

Density of maximum Entropy

S (P) = - \int_{0}^{+ \infty} P (x) \log P (x) d x

under the constraints:

P (x) \geq 0, \int_{0}^{+ \infty} P (x) d x = 1 and E (x^{n}) = \int_{0}^{+ \infty} x^{n} P (x) d x = σ^{n}

(36)

is given by:

P (x) = \frac{1}{n^{\frac{1}{n}} σ . Γ (1 + 1 / n)} \exp (- \frac{x^{n}}{n σ^{n}}) = f_{n} (x)

(37)

with the following parameters:

β_{n} = \frac{1}{n σ^{n}}, Z (β_{n}) = \frac{Γ (1 + 1 / n)}{β_{n}^{1 / n}}, S (P) = \log Z (β_{n}) + \frac{1}{n}

(38)

where:

- \frac{\partial \log Z (β_{k})}{\partial β_{k}} = σ^{k} = E (x^{k}) = \frac{n^{k / n} σ^{k} Γ (1 + (k + 1) / n)}{(k + 1) Γ (1 + 1 / n)}

(39)

We illustrate this higher order maximum entropy density in Figure 4.

As early as 1963, Ingarden introduced this concept of higher order temperatures for statistical systems such as those of thermodynamics. In physics, the concept of temperature is connected with the mean value of the kinetic energy of molecules in an ideal gas. For a general physical system with interactions among particles (the case of a non-ideal gas, a liquid or a solid), the equilibrium probability distribution depends on the temperature T as the only statistical parameter of the Gibbs state:

P_{β} (x) = \frac{1}{Z (β)} e^{- β . H (x)}

with

β = \frac{1}{k_{B} T}

and

H (x) = H (p, q)

where q is the position, p the mechanical momentum and

k_{B}

the Boltzmann constant (a factor to ensure that

β . H

is dimensionless). If there are no stochastic interactions between particles (ideal gas), the partition function Z is integrable and we obtain a Gaussian distribution in momentum space, as a consequence of the limit theorem for large N. The ideal gas model of Boltzmann can fail if the number of particles is not large enough, as in mesoscopic systems, and also if the interactions between particles are not weak enough. The Gibbs hypothesis can also fail when stochastic interactions with the environment are not sufficiently weak. As remarked by Ingarden, thermal Gibbs equilibrium has never been observed in large and complex systems (cosmic systems, Earth’s atmosphere, biological organisms); what is observed instead is turbulence, flows or pumping, which are described by replacing the classical approach with a local temperature and the concept of thermodynamic flows (non-equilibrium thermodynamics and thermo-hydrodynamics). This is inconsistent with the classical concept of temperature which is, by definition, global/intensive and does not depend on position. R.S. Ingarden proposed to treat the stationary case using the concept of higher order temperatures given by the Gibbs density:

P_{(β_{1}, \dots, β_{n})} (x) = \frac{1}{Z (β_{1}, \dots, β_{n})} e^{- β_{1} . H (x) - β_{2} {(H (x) - U)}^{2} - \dots - β_{n} {(H (x) - U)}^{n}}

(40)

where

U = E (H)

is the mean energy. This mean energy has been introduced to preserve the invariance of the total energy with respect to an arbitrary additive constant, and

β_{0} = - \log Z (β_{1}, \dots, β_{n})

the constant of normalization. The new constants

β_{k}

are said to be β-temperatures of order k.

H (x)

is usually defined as a quadratic function of x. The probability distribution is uniquely defined by its statistical moments, which should be measured experimentally. But if the number of values is too high to make this method practical, we are only able to measure the lowest moments up to some order (if we can neglect the higher orders that do not change the result to a given accuracy), and to fix the β-temperatures, defined as Lagrange multipliers, by maximization of the entropy of the distribution

S = - \int P_{(β_{1}, \dots, β_{n})} (x) \log P_{(β_{1}, \dots, β_{n})} (x) d x

, with the given moments as constraints. R.S. Ingarden observed that entropy maximization randomizes the higher moments in a symmetric way, cancels any possible bias with respect to their special values, and gives the best estimate to a given accuracy. The values of

β

can be found by:

E (x^{k}) = \frac{\partial β_{0}}{\partial β_{k}} = - \frac{\partial \log Z}{\partial β_{k}} with E (x^{k}) = Z^{- 1} \int x^{k} e^{- \sum_{j = 1}^{n} β_{j} x^{j}} d x = \int x^{k} P_{(β_{1}, \dots, β_{n})} (x) d x

(41)

Z = \int e^{- \sum_{k = 1}^{n} β_{k} x^{k}} d x and the relation : S = \sum_{k = 1}^{n} β_{k} E (x^{k}) + \log Z = \sum_{k = 1}^{n} β_{k} \frac{\partial β_{0}}{\partial β_{k}} - β_{0}

(42)
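The relation between the moments and the derivatives of the normalization constant β_0 = -log Z, together with the entropy formula (42), can be illustrated numerically. The sketch below is an assumption-laden illustration rather than Ingarden's procedure: the integration interval, the quadrature, the finite-difference step and the sample values of the β-temperatures are all hypothetical choices.

```python
import math

LO, HI, STEPS = -8.0, 8.0, 20_000  # truncated integration domain (an assumption)

def weight(x, betas):
    """Unnormalized density exp(-sum_k beta_k x^k), with betas = (beta_1, ..., beta_n)."""
    return math.exp(-sum(b * x ** k for k, b in enumerate(betas, start=1)))

def integrate(g):
    """Composite midpoint rule on [LO, HI]."""
    h = (HI - LO) / STEPS
    return h * sum(g(LO + (i + 0.5) * h) for i in range(STEPS))

def beta0(betas):
    """Normalization constant beta_0 = -log Z."""
    return -math.log(integrate(lambda x: weight(x, betas)))

def moment(k, betas):
    """E(x^k) under the normalized density."""
    z = integrate(lambda x: weight(x, betas))
    return integrate(lambda x: x ** k * weight(x, betas)) / z

def d_beta0(k, betas, eps=1e-5):
    """Central finite difference of beta_0 with respect to beta_k."""
    up, down = list(betas), list(betas)
    up[k - 1] += eps
    down[k - 1] -= eps
    return (beta0(up) - beta0(down)) / (2.0 * eps)

def entropy(betas):
    """S = -integral of P log P, computed directly from the density."""
    z = integrate(lambda x: weight(x, betas))
    def integrand(x):
        p = weight(x, betas) / z
        return -p * math.log(p) if p > 0.0 else 0.0
    return integrate(integrand)

# Hypothetical beta-temperatures; the even leading term keeps Z finite.
BETAS = (0.3, 0.5, 0.0, 0.1)
```

Comparing `moment(k, BETAS)` with `d_beta0(k, BETAS)` for k = 1, ..., 4, and `entropy(BETAS)` with the right-hand side of (42), gives agreement up to discretization error.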

Ingarden applied this model to linguistic statistics, assuming that higher order temperatures appear because of the rather strong statistical correlations between phonemes and words as elements of these statistics. He justified this choice by observing that, in the case of word statistics, the existence of strong correlations is established by grammatical or semantic studies [9]. Ingarden conjectured that his higher order thermodynamics is the model of statistically interacting systems, of biological living systems, and of small systems, although calculation and observation are more difficult.

Ingarden’s higher order temperatures can be defined even when no variation is considered, as soon as the probability distribution depends on more than one parameter. Ingarden observed that the Gibbs assumption can fail when the number of components of the sum does not go to infinity, when these components are not stochastically independent, or when stochastic interactions with the environment are not sufficiently weak. In all these cases, we never observe absolute thermal equilibrium of Gibbs type but only flows or turbulence. Non-equilibrium thermodynamics could be indirectly addressed by means of higher order temperatures.

4. Model of Souriau Lie Groups Thermodynamics

For an introduction to symplectic geometry, we refer to Marle’s book [45] and Koszul’s book [44]. In 1969, Souriau [43,64] introduced the concept of the co-adjoint action of a group on its momentum space, based on work on the orbit method, which makes it possible to define physical observables like energy, heat and momentum or moment as purely geometrical objects. The moment map is a constant of the motion and is associated with symplectic cohomology. As a first step towards new foundations of thermodynamics, Souriau defined a Gibbs canonical ensemble on a symplectic manifold M for a Lie group action on M. In classical statistical mechanics, a state is given by a solution of the Liouville equation on the phase space, the partition function. As symplectic manifolds carry a completely continuous measure, invariant by diffeomorphisms, the Liouville measure λ, all statistical states will be the product of the Liouville measure by the scalar function given by the generalized partition function

e^{Φ (β) - 〈 β, U (ξ) 〉}

defined by the energy

U

(defined in the dual of the Lie algebra of this dynamical group) and the geometric temperature

β

, where

Φ

is a normalizing constant such that the total probability mass is equal to 1,

Φ (β) = - \log \int_{M} e^{- 〈 β, U (ξ) 〉} d λ

. Souriau then generalizes the Gibbs equilibrium state to all symplectic manifolds that have a dynamical group. Souriau observed that if this theory is applied to the Galileo group, the symmetry is broken. For each temperature

β

, element of the Lie algebra

g

, Souriau has introduced a tensor

{\tilde{Θ}}_{β}

, equal to the sum of the cocycle

\tilde{Θ}

and the heat coboundary (with [.,.] Lie bracket):

{\tilde{Θ}}_{β} (Z_{1}, Z_{2}) = \tilde{Θ} (Z_{1}, Z_{2}) + 〈 Q, a d_{Z_{1}} (Z_{2}) 〉

(43)

This tensor

{\tilde{Θ}}_{β}

has the following properties:

\tilde{Θ} (X, Y) = 〈 Θ (X), Y 〉

where the map

Θ

is the symplectic one-cocycle of the Lie algebra

g

with values in

g^{*}

, with

Θ (X) = T_{e} θ (X (e))

where

θ

is the one-cocycle of the Lie group G.

\tilde{Θ} (X, Y)

is constant on M and the map

\tilde{Θ} (X, Y) : g \times g \to ℜ

is a skew-symmetric bilinear form, and is called the symplectic two-cocycle of Lie algebra

g

associated to the moment map

J

, with the following properties:

\tilde{Θ} (X, Y) = J_{[X, Y]} - \{J_{X}, J_{Y}\} with J the Moment Map

(44)

\tilde{Θ} ([X, Y], Z) + \tilde{Θ} ([Y, Z], X) + \tilde{Θ} ([Z, X], Y) = 0

(45)

where

J_{X}

is the linear map from

g

to differentiable functions on

M

:

g \to C^{\infty} (M, R), X \to J_{X}

and the associated differentiable map

J

, called moment(um) map:

J : M \to g^{*}, x \mapsto J (x) such that J_{X} (x) = 〈 J (x), X 〉, X \in g

(46)
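A concrete instance may help to read Equation (44). The sketch below is an illustration chosen here, not an example from the source: it takes the rotation group SO(3) acting on the phase space of a particle, identifies so(3) and its dual with R^3 (Lie bracket given by the cross product), uses the angular momentum J(q, p) = q × p as moment map, and adopts the Poisson bracket convention {f, g} = ∂f/∂q · ∂g/∂p - ∂f/∂p · ∂g/∂q. Under these assumptions the two-cocycle vanishes, which is the equivariant case.

```python
def cross(a, b):
    """Cross product in R^3, used both for q x p and for the so(3) bracket."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def moment_map(q, p):
    """J(q, p) = q x p, the angular momentum, seen as an element of so(3)* ~ R^3."""
    return cross(q, p)

def J_X(X, q, p):
    """Component J_X(x) = <J(x), X> of the moment map, as in Equation (46)."""
    return dot(moment_map(q, p), X)

def poisson_bracket(X, Y, q, p):
    """{J_X, J_Y}, using the exact gradients dJ_X/dq = p x X and dJ_X/dp = X x q."""
    dq_X, dp_X = cross(p, X), cross(X, q)
    dq_Y, dp_Y = cross(p, Y), cross(Y, q)
    return dot(dq_X, dp_Y) - dot(dp_X, dq_Y)

def two_cocycle(X, Y, q, p):
    """Right-hand side of Equation (44): J_[X,Y] - {J_X, J_Y}, with [X, Y] = X x Y."""
    return J_X(cross(X, Y), q, p) - poisson_bracket(X, Y, q, p)
```

For the Galileo group, by contrast, the text indicates that this quantity does not vanish, which is the origin of the cohomological corrections discussed in this section.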

The geometric temperature, element of the algebra

g

, is in the kernel of the tensor

{\tilde{Θ}}_{β}

:

β \in Ker {\tilde{Θ}}_{β} such that {\tilde{Θ}}_{β} (β, β) = 0, \forall β \in g

(47)

The following symmetric tensor

g_{β} ([β, Z_{1}], [β, Z_{2}]) = {\tilde{Θ}}_{β} (Z_{1}, [β, Z_{2}])

, defined on all values of

a d_{β} (.) = [β, .]

is positive definite, and defines an extension of the classical Fisher metric in information geometry (as the Hessian of the logarithm of the partition function):

g_{β} ([β, Z_{1}], Z_{2}) = {\tilde{Θ}}_{β} (Z_{1}, Z_{2}), \forall Z_{1} \in g, \forall Z_{2} \in Im (a d_{β} (.))

(48)

with:

g_{β} (Z_{1}, Z_{2}) \geq 0, \forall Z_{1}, Z_{2} \in Im (a d_{β} (.))

(49)

These equations are universal, because they are not dependent on the symplectic manifold but only on the dynamical group G, the symplectic two-cocycle

Θ

, the temperature

β

and the heat

Q

. Souriau called it “Lie groups thermodynamics” (see Figure 5 and Figure 6).

Theorem 2. [Souriau Theorem of Lie Groups Thermodynamics]

Let

Ω

be the largest open proper subset of

g

, Lie algebra of G, such that

\int_{M} e^{- 〈 β, U (ξ) 〉} d λ

and

\int_{M} ξ . e^{- 〈 β, U (ξ) 〉} d λ

are convergent integrals. This set

Ω

is convex and is invariant under every transformation

A d_{g} (.)

. Then, the fundamental equations of Lie groups thermodynamics are given by the action of the group:

Action of Lie group on Lie algebra:

β \to A d_{g} (β)

(50)
Characteristic function after Lie group action:

Φ \to Φ - 〈 θ (g^{- 1}), β 〉

(51)
Invariance of entropy with respect to action of Lie group:

s \to s

(52)
Action of Lie group on geometric heat:

Q \to a (g, Q) = A d_{g}^{*} (Q) + θ (g)

(53)
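These transformation rules can be illustrated on a simple equivariant case. The sketch below rests on assumptions that are not in the source: the group is SO(3), the symplectic manifold is the unit sphere (a coadjoint orbit) with the area measure standing in for the Liouville measure, the moment map is U(ξ) = ξ, the cocycle θ vanishes, and so(3) and its dual are identified with R^3 so that Ad_g and Ad*_g both act as the rotation g. Under these assumptions the integral of exp(-〈β, ξ〉) over the sphere equals 4π sinh|β| / |β|.

```python
import math

def norm(v):
    return math.sqrt(sum(c * c for c in v))

def massieu(beta):
    """Phi(beta) = -log of the integral of exp(-<beta, xi>) over the unit sphere."""
    b = norm(beta)  # assumed nonzero
    return -math.log(4.0 * math.pi * math.sinh(b) / b)

def heat(beta):
    """Geometric heat Q = dPhi/dbeta, the mean of xi under the Gibbs density."""
    b = norm(beta)
    scale = -(1.0 / math.tanh(b) - 1.0 / b) / b
    return tuple(scale * c for c in beta)

def entropy(beta):
    """s = <beta, Q> - Phi(beta), the Legendre transform of the Massieu potential."""
    q = heat(beta)
    return sum(x * y for x, y in zip(beta, q)) - massieu(beta)

def rotate_z(v, angle):
    """Rotation about the z axis; with so(3) ~ R^3 it realizes both Ad_g and Ad*_g."""
    c, s = math.cos(angle), math.sin(angle)
    return (c * v[0] - s * v[1], s * v[0] + c * v[1], v[2])
```

With θ = 0, rule (51) leaves Φ unchanged, rule (53) reduces to Q being rotated along with β, and rule (52) can be observed by evaluating `entropy` before and after `rotate_z`.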

Souriau’s equations of Lie groups thermodynamics are summarized in the following figures.

In the framework of Lie group action on a symplectic manifold, equivariance of moment could be studied to prove that there is a unique action a(.,.) of the Lie group

G

on the dual

g^{*}

of its Lie algebra for which the moment map

J

is equivariant, that means for each

x \in M

:

J (Φ_{g} (x)) = a (g, J (x)) = A d_{g}^{*} (J (x)) + θ (g)

(54)

When the group is not abelian (non-commutative group), the symmetry is broken, and new “cohomological” relations should be verified in Lie algebra of the group. A natural equilibrium state will thus be characterized by an element of the Lie algebra of the Lie group, determining the equilibrium temperature

β

. The entropy

s (Q)

, parametrized by

Q

the geometric heat (mean of energy

U

, element of the dual Lie algebra) is defined by the Legendre transform [97,98,99,100,101,102,103] of the Massieu potential

Φ (β)

parametrized by

β

(

Φ (β)

is the minus logarithm of the partition function

ψ_{Ω} (β)

):

s (Q) = 〈 β, Q 〉 - Φ (β) with \begin{cases} Q = \frac{\partial Φ}{\partial β} \in g^{*} \\ β = \frac{\partial s}{\partial Q} \in g \end{cases}

(55)

\begin{array}{l} p_{Gibbs} (ξ) = e^{Φ (β) - 〈 β, U (ξ) 〉} = \frac{e^{- 〈 β, U (ξ) 〉}}{\int_{M} e^{- 〈 β, U (ξ) 〉} d ω}, Q = \frac{\partial Φ (β)}{\partial β} = \frac{\int_{M} U (ξ) e^{- 〈 β, U (ξ) 〉} d ω}{\int_{M} e^{- 〈 β, U (ξ) 〉} d ω} = \int_{M} U (ξ) p_{Gibbs} (ξ) d ω \\ with Φ (β) = - \log \int_{M} e^{- 〈 β, U (ξ) 〉} d ω \end{array}

(56)

Souriau completed his “geometric heat theory” by introducing a 2-form in the Lie algebra, that is a Riemannian metric tensor in the values of adjoint orbit of

β

,

[β, Z]

with

Z

an element of the Lie algebra. This metric is given for

(β, Q)

:

g_{β} ([β, Z_{1}], [β, Z_{2}]) = 〈 Θ (Z_{1}), [β, Z_{2}] 〉 + 〈 Q, [Z_{1}, [β, Z_{2}]] 〉

(57)

where

Θ

is a cocycle of the Lie algebra, defined by

Θ = T_{e} θ

with

θ

a cocycle of the Lie group defined by

θ (M) = Q (A d_{M} (β)) - A d_{M}^{*} Q

.

We observe that Souriau Riemannian metric, introduced with symplectic cocycle, is a generalization of the Fisher metric, that we call the Souriau-Fisher metric, that preserves the property to be defined as a Hessian of the partition function logarithm

g_{β} = - \frac{\partial^{2} Φ}{\partial β^{2}} = \frac{\partial^{2} \log ψ_{Ω}}{\partial β^{2}}

as in classical information geometry. We will establish the equality of two terms, between Souriau definition based on Lie group cocycle

Θ

and parameterized by the “geometric heat” Q (element of the dual Lie algebra) and the “geometric temperature” β (element of the Lie algebra), and the Hessian of the characteristic function

Φ (β) = - \log ψ_{Ω} (β)

with respect to the variable β:

g_{β} ([β, Z_{1}], [β, Z_{2}]) = 〈 Θ (Z_{1}), [β, Z_{2}] 〉 + 〈 Q, [Z_{1}, [β, Z_{2}]] 〉 = \frac{\partial^{2} \log ψ_{Ω}}{\partial β^{2}}

(58)

If we differentiate the following relation from Souriau’s theorem

Q (A d_{g} (β)) = A d_{g}^{*} (Q) + θ (g)

, we obtain:

\frac{\partial Q}{\partial β} (- [Z_{1}, β], .) = \tilde{Θ} (Z_{1}, [β, .]) + 〈 Q, a d_{Z_{1}} ([β, .]) 〉 = {\tilde{Θ}}_{β} (Z_{1}, [β, .])

(59)

- \frac{\partial Q}{\partial β} ([Z_{1}, β], Z_{2}) = \tilde{Θ} (Z_{1}, [β, Z_{2}]) + 〈 Q, a d_{Z_{1}} ([β, Z_{2}]) 〉 = {\tilde{Θ}}_{β} (Z_{1}, [β, Z_{2}])

(60)

\Rightarrow - \frac{\partial Q}{\partial β} = g_{β} ([β, Z_{1}], [β, Z_{2}])

(61)

As the entropy is defined by the Legendre transform of the characteristic function, this Souriau-Fisher metric is also equal to the inverse of the Hessian of the “geometric entropy”

s (Q)

with respect to the variable Q:

\frac{\partial^{2} s (Q)}{\partial Q^{2}}

.

For the maximum entropy density (Gibbs density), the following three terms coincide:

\frac{\partial^{2} \log ψ_{Ω}}{\partial β^{2}}

that describes the convexity of the log-likelihood function,

I (β) = - E [\frac{\partial^{2} \log p_{β} (ξ)}{\partial β^{2}}]

the Fisher metric that describes the covariance of the log-likelihood gradient, whereas

I (β) = E [(ξ - Q) {(ξ - Q)}^{T}] = Var (ξ)

that describes the covariance of the observables.
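The coincidence of these three quantities can be checked on a toy model. The sketch below is a simplification made here for illustration: β is a scalar, the observable ξ takes finitely many hypothetical values, the density is proportional to exp(-βξ), and derivatives are approximated by finite differences.

```python
import math

SUPPORT = (0.0, 1.0, 2.0, 3.5, 5.0)  # hypothetical values of the observable xi

def log_psi(beta):
    """Logarithm of the partition function psi(beta) = sum over xi of exp(-beta * xi)."""
    return math.log(sum(math.exp(-beta * xi) for xi in SUPPORT))

def log_p(xi, beta):
    """Log of the Gibbs probability of the value xi."""
    return -beta * xi - log_psi(beta)

def hessian_log_psi(beta, eps=1e-4):
    """Second derivative of log psi by a central finite difference."""
    return (log_psi(beta + eps) - 2.0 * log_psi(beta) + log_psi(beta - eps)) / eps ** 2

def fisher_information(beta, eps=1e-6):
    """Expected squared score, the score being d log p / d beta (finite difference)."""
    total = 0.0
    for xi in SUPPORT:
        score = (log_p(xi, beta + eps) - log_p(xi, beta - eps)) / (2.0 * eps)
        total += math.exp(log_p(xi, beta)) * score ** 2
    return total

def variance(beta):
    """Variance of the observable xi under the Gibbs probabilities."""
    probs = [math.exp(log_p(xi, beta)) for xi in SUPPORT]
    mean = sum(p * xi for p, xi in zip(probs, SUPPORT))
    return sum(p * (xi - mean) ** 2 for p, xi in zip(probs, SUPPORT))
```

For any β the three functions return the same value up to finite-difference error, which is the scalar counterpart of the identity stated above.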

We can also observe that the Fisher metric

I (β) = - \frac{\partial Q}{\partial β}

is exactly the Souriau metric defined through symplectic cocycle:

I (β) = {\tilde{Θ}}_{β} (Z_{1}, [β, Z_{2}]) = g_{β} ([β, Z_{1}], [β, Z_{2}])

(62)

The Fisher metric

I (β) = - \frac{\partial^{2} Φ (β)}{\partial β^{2}} = - \frac{\partial Q}{\partial β}

has been considered by Souriau as a generalization of “heat capacity”. Souriau called it

K

the “geometric capacity”.

We could observe that Souriau Lie groups thermodynamics is compatible with Balian and Valentin’s theory of thermodynamics [24], which is obtained by symplectization, in dimension 2n + 2, of a contact manifold of dimension 2n + 1. All elements of the Souriau geometric temperature vector are multiplied by the same gauge parameter. The Balian and Valentin model was first explored in [104] and has recently been developed by van der Schaft and Maschke in [26,105].

5. Extended Koszul Study of Souriau Non-Equivariant Model Associated to a Class of Cohomology

Koszul deepened Souriau’s model in his book “Introduction to symplectic geometry” [44], as explained in [10]. In the historical foreword of this book, Koszul writes: “The development of analytical mechanics provided the basic concepts of symplectic structures. The term symplectic structure is due largely to analytical mechanics. But in this book, the applications of symplectic structure theory to mechanics is not discussed in any detail”. Koszul considers in this book purely algebraic and geometric developments of the geometric/analytic mechanics developed during the 1960s, especially in Jean-Marie Souriau’s works, detailed in chapters 4 and 5. The originality of this book lies in the fact that Koszul develops new points of view and proofs not considered initially by Souriau, nor afterwards by the geometric mechanics community.

To highlight the importance of this Koszul book, we will illustrate the links of the detailed tools, including demonstrations or original Koszul extensions, with Souriau’s Lie groups thermodynamics. Koszul originally developed Souriau’s model, in the case of non-equivariance, of the action of the group G on the moment map. As explained in [106] by Thomas Delzant at the 2010 CIRM conference “Action Hamiltoniennes: invariants et classification”, organized with Michel Brion: “The definition of the moment map is due to Jean-Marie Souriau…. In the book of Souriau, we find a proof of the proposition: the map J is equivariant for an affine action of G on g* whose linear part is Ad*…. In Souriau’s book, we can also find a study of the non-equivariant case and its applications to classical and quantum mechanics. In the case of the Galileo group operating in the phase space of space-time, obstruction to equivariance (a class of cohomology) is interpreted as the inert mass of the object under study”. We can uniquely define the moment map up to an additive constant of integration, that can always be chosen to make the moment map equivariant (a moment map is G-equivariant, when G acts on g* via the coadjoint action) if the group is compact or semi-simple. In 1969, Souriau has considered the non-equivariant case where the coadjoint action must be modified to make the map equivariant by a 1-cocycle on the group with values in dual Lie algebra g*.

The seminal idea of the moment map appeared in the second volume of Sophus Lie’s book, published in 1890, where it was developed for homogeneous canonical transformations. Professor Marsden summarized the development of this concept by Jean-Marie Souriau and Bertram Kostant, based on the testimonies of both: “In Kostant’s 1965 Phillips lectures at Haverford, and in the 1965 U.S.–Japan Seminar, Kostant introduced the momentum map to generalize a theorem of Wang and thereby classified all homogeneous symplectic manifolds; this is called today ‘Kostant’s coadjoint orbit covering theorem’…. Souriau introduced the momentum map in his 1965 Marseille lecture notes and put it in print in 1966. The momentum map finally got its formal definition and its name, based on its physical interpretation, by Souriau in 1967. Souriau also studied its properties of equivariance, and formulated the coadjoint orbit theorem. The momentum map appeared as a key tool in Kostant’s quantization lectures in 1970 [46], and Souriau discussed it at length in 1970 in his book [43]. Kostant and Souriau realized its importance for linear representations, a fact apparently not foreseen by Lie”. The reference date of Souriau’s book is 1970, but it was published by Dunod in 1969. Jean-Louis Koszul knew the works of Souriau and Kostant very well; as early as 1958, Koszul gave a survey of Kostant’s first works at a Bourbaki seminar [47].

In Chapter 4 of this book, Koszul calls a symplectic G-space a symplectic manifold (M; ω) on which a Lie group G acts by a symplectic action (an action that leaves the symplectic form ω unchanged). Koszul then introduces the moment map μ (Souriau’s invention) of a Hamiltonian action of the Lie algebra g and develops its properties. Koszul also defines the Souriau 2-cocycle: since the difference of two moments of the same Hamiltonian action is a locally constant map on M, he shows that when μ is a moment map, for every pair (a; b) of elements of g, the function

c_{μ} (a, b) = {〈 μ, a 〉, 〈 μ, b 〉} - 〈 μ, [a, b] 〉

is locally constant on M, defining an antisymmetric bilinear map from g × g to H⁰(M; R) which satisfies Jacobi’s identity. This is the 2-cocycle introduced by Jean-Marie Souriau in geometric mechanics, which plays a fundamental role in Souriau’s Lie groups thermodynamics in defining an extension of the Fisher metric from information geometry: the “Fisher-Souriau metric”.

The antisymmetric bilinear map (31) and (32), with definition (27) and (28), introduced by Souriau is exactly equal to the mathematical object extensively studied in Chapter 4 of Koszul’s book:

c_{μ} (a, b) = {〈 μ, a 〉, 〈 μ, b 〉} - 〈 μ, [a, b] 〉

(63)

In this book, Koszul has studied this antisymmetric bilinear map considering the following developments. For any moment map

μ

, Koszul defines the skew symmetric bilinear form

c_{μ} (a, b)

on Lie algebra by:

c_{μ} (a, b) = 〈 d θ_{μ} (a), b 〉, a, b \in g

(64)

Koszul observes that if he uses:

θ_{μ} (s t) = μ (s t x) - A d_{s t}^{*} μ (x) = θ_{μ} (s) + A d_{s}^{*} μ (t x) - A d_{s}^{*} A d_{t}^{*} μ (x) = θ_{μ} (s) + A d_{s}^{*} θ_{μ} (t)

(65)

by developing

d μ (a x) = {}^{t}{a d}_{a} μ (x) + d θ_{μ} (a), x \in M, a \in g

, he obtains:

〈 d μ (a x), b 〉 = 〈 μ (x), [a, b] 〉 + 〈 d θ_{μ} (a), b 〉 = {〈 μ, a 〉, 〈 μ, b 〉} (x), x \in M, a, b \in g

(66)

He has then:

c_{μ} (a, b) = {〈 μ, a 〉, 〈 μ, b 〉} - 〈 μ, [a, b] 〉 = 〈 d θ_{μ} (a), b 〉, a, b \in g

(67)

and the property:

c_{μ} ([a, b], c) + c_{μ} ([b, c], a) + c_{μ} ([c, a], b) = 0, a, b, c \in g

(68)

Koszul concludes by observing that if the moment map is transformed as

μ' = μ + ϕ

then we have:

c_{μ'} (a, b) = c_{μ} (a, b) - 〈 ϕ, [a, b] 〉

(69)

Finally using

c_{μ} (a, b) = {〈 μ, a 〉, 〈 μ, b 〉} - 〈 μ, [a, b] 〉 = 〈 d θ_{μ} (a), b 〉, a, b \in g

, Koszul highlights the property that:

{μ^{*} (a), μ^{*} (b)} = {〈 μ, a 〉, 〈 μ, b 〉} = μ^{*} ([a, b] + c_{μ} (a, b)) = μ^{*} {a, b}_{c_{μ}}

(70)
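As a small numerical illustration of the cocycle $c_{μ}$, the sketch below takes a one-dimensional particle at t = 0 and the abelian group generated by boosts and space translations, so that [a, b] = 0 and the cocycle reduces to the Poisson bracket of the two moment components. The moment component $-m q α + p δ$ follows the sign pattern of the Galilean coupling used later in the paper; the bracket convention $\{f, g\} = ∂_q f ∂_p g - ∂_p f ∂_q g$ and all function names are assumptions made for this example only.

```python
# Sketch (assumed conventions): Koszul's cocycle
#   c_mu(a, b) = {<mu, a>, <mu, b>} - <mu, [a, b]>
# for boosts and translations of a 1D particle at t = 0. The Lie algebra is
# abelian, so [a, b] = 0 and only the Poisson bracket survives.

def poisson_bracket(f, g, q, p, h=1e-4):
    """Canonical bracket {f, g} = df/dq dg/dp - df/dp dg/dq (central differences)."""
    df_dq = (f(q + h, p) - f(q - h, p)) / (2 * h)
    df_dp = (f(q, p + h) - f(q, p - h)) / (2 * h)
    dg_dq = (g(q + h, p) - g(q - h, p)) / (2 * h)
    dg_dp = (g(q, p + h) - g(q, p - h)) / (2 * h)
    return df_dq * dg_dp - df_dp * dg_dq

def moment_component(mass, a):
    """<mu, a> for a = (alpha, delta): boost rate alpha, translation rate delta."""
    alpha, delta = a
    return lambda q, p: -mass * q * alpha + p * delta

def cocycle(mass, a, b, q, p):
    """c_mu(a, b) evaluated at the phase-space point (q, p)."""
    bracket = poisson_bracket(moment_component(mass, a),
                              moment_component(mass, b), q, p)
    pairing_with_lie_bracket = 0.0  # [a, b] = 0 for this abelian algebra
    return bracket - pairing_with_lie_bracket
```

With these conventions the value does not depend on (q, p) and is proportional to the mass, which is the sense in which the obstruction to equivariance is read as the inert mass. Since [a, b] = 0, adding a constant ϕ to μ cannot remove it.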

In Chapter 4, Koszul introduces the equivariance of the moment map μ. Based on the definitions of the adjoint and coadjoint representations of a Lie group or a Lie algebra, Koszul proves that when (M; ω) is a connected Hamiltonian G-space and

μ : M \to g^{*}

a moment of the action of G, there exists an affine action of G on g*, whose linear part is the coadjoint action, for which the moment μ is equivariant. This affine action is obtained by modifying the coadjoint action by means of a cocycle. This notion is also developed in Chapter 5 for studying Poisson manifolds.

Defining classical operation

A d_{s} a = s a s^{- 1}, s \in G, a \in g

,

a d_{a} b = [a, b], a \in g, b \in g

and

A d_{s}^{*} = {}^{t}{A d}_{s^{- 1}}, s \in G

with classical properties:

A d_{\exp a} = \exp (- a d_{a}), a \in g or A d_{\exp a}^{*} = \exp {}^{t}{(a d_{a})}, a \in g

(71)

Koszul considers:

x \mapsto s x, x \in M, μ : M \to g^{*}

(72)

From which, he obtains:

〈 d μ (v), a 〉 = ω (a x, v)

(73)

Koszul then studies

μ \circ s_{M} - A d_{s}^{*} \circ μ : M \to g^{*}

, and develops:

d 〈 A d_{s}^{*} \circ μ, a 〉 = 〈 A d_{s}^{*} d μ, a 〉 = 〈 d μ, A d_{s^{- 1}} a 〉

(74)

〈 d μ (v), A d_{s^{- 1}} a 〉 = ω (s^{- 1} a s x, v) = ω (a s x, s v) = 〈 d μ (s v), a 〉 = (d 〈 μ \circ s_{M}, a 〉) (v)

(75)

d 〈 A d_{s}^{*} \circ μ, a 〉 = d 〈 μ \circ s_{M}, a 〉 and then proves that d 〈 μ \circ s_{M} - A d_{s}^{*} \circ μ, a 〉 = 0

(76)

Koszul considers the cocycle given by

θ_{μ} (s) = μ (s x) - A d_{s}^{*} μ (x), s \in G

, and observes that:

θ_{μ} (s t) = θ_{μ} (s) + A d_{s}^{*} θ_{μ} (t), s, t \in G

(77)

From this action of the group on dual Lie algebra:

G \times g^{*} \to g^{*}, (s, ξ) \mapsto s ξ = A d_{s}^{*} ξ + θ_{μ} (s)

(78)

Koszul introduces the following properties:

μ (s x) = s μ (x) = A d_{s}^{*} μ (x) + θ_{μ} (s), \forall s \in G, x \in M

(79)

G \times g^{*} \to g^{*}, (e, ξ) \mapsto e ξ = A d_{e}^{*} ξ + θ_{μ} (e) = ξ + μ (x) - μ (x) = ξ

(80)

\begin{array}{l} (s_{1} s_{2}) ξ = A d_{s_{1} s_{2}}^{*} ξ + θ_{μ} (s_{1} s_{2}) = A d_{s_{1}}^{*} A d_{s_{2}}^{*} ξ + θ_{μ} (s_{1}) + A d_{s_{1}}^{*} θ_{μ} (s_{2}) \\ (s_{1} s_{2}) ξ = A d_{s_{1}}^{*} (A d_{s_{2}}^{*} ξ + θ_{μ} (s_{2})) + θ_{μ} (s_{1}) = s_{1} (s_{2} ξ), \forall s_{1}, s_{2} \in G, ξ \in g^{*} \end{array}

(81)

Koszul’s study of the equivariance of the moment map μ, and of the existence of an affine action of G on g*, whose linear part is the coadjoint action and for which the moment μ is equivariant, is the cornerstone of Souriau’s theory of geometric mechanics and Lie groups thermodynamics.
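The group-action property of the affine action can be checked numerically on a simple case. The sketch below is only an illustration under stated assumptions: it takes G = SO(3), identifies g* with R³ so that the coadjoint action of a rotation s is the rotation itself, and uses a coboundary cocycle θ(s) = ϕ − Ad*ₛ ϕ built from an arbitrary constant ϕ (the situation in which the moment map can be made equivariant by a shift). The names `PHI`, `theta` and `affine_action` are hypothetical.

```python
import math

def rot_z(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_x(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def matvec(a, v):
    return [sum(a[i][k] * v[k] for k in range(3)) for i in range(3)]

# Arbitrary constant element of g* ~ R^3 (an assumption for this example).
PHI = [0.5, -1.0, 2.0]

def theta(s):
    """Coboundary cocycle theta(s) = phi - Ad*_s phi, with Ad*_s = rotation s."""
    rotated = matvec(s, PHI)
    return [PHI[i] - rotated[i] for i in range(3)]

def affine_action(s, xi):
    """Affine action s.xi = Ad*_s xi + theta(s)."""
    rotated = matvec(s, xi)
    shift = theta(s)
    return [rotated[i] + shift[i] for i in range(3)]

def close(u, v, tol=1e-12):
    return all(abs(x - y) < tol for x, y in zip(u, v))
```

For two rotations `s1`, `s2`, one can check that `theta(matmul(s1, s2))` equals θ(s1) + Ad*ₛ₁ θ(s2), and that `affine_action(matmul(s1, s2), xi)` agrees with `affine_action(s1, affine_action(s2, xi))`. A genuinely non-trivial cohomology class, as for the Galilean group, is not covered by this example.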

We compare Souriau and Koszul notations in Figure 7.

We should also mention Muriel Casalis’ papers [41,42] on this topic.

6. Souriau Model of Generalized Entropy Based on Legendre and Laplace Transforms

At this step in the development of Souriau’s Lie groups thermodynamics, we introduce Souriau’s generalized definition of entropy. Souriau first defines the “Laplace transform”:

Let

E

be a finite-dimensional vector space and

μ

a measure on its dual

E^{*}

, then the function given by:

α \mapsto \int_{E^{*}} e^{M α} μ (M) d M

(82)

is defined for all

α \in E

such that the integral is convergent. This function is called the (generalized) Laplace transform. This transform

F

of the measure

μ

is differentiable inside its definition set

d e f (F)

. Its p-th derivative is given by the following convergent integral for every point inside

d e f (F)

:

F^{(p)} (α) = \int_{E^{*}} e^{M α} M \otimes M \dots \otimes M μ (M) d M

(83)

Theorem 3. [Souriau Theorem]

Let

E

be a finite-dimensional vector space,

μ

a non-zero positive measure on the dual space

E^{*}

,

F

its Laplace transform, then:

-: $F$ is a semi-definite convex function,

$F (α) > 0, \forall α \in d e f (F)$

(84)
-: $f = \log F$ is convex and semi-continuous
-: Let $α$ be an interior point of $d e f (F)$; then:

$D^{2} (f) (α) \geq 0$

(85)

$D^{2} (f) (α) = \frac{1}{F (α)} \int_{E^{*}} e^{M α} {[M - D (f) (α)]}^{\otimes^{2}} μ (M) d M$

(86)

$D^{2} (f) (α) invertible \Leftrightarrow Affine envelope (μ) = E^{*}$

(87)
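The statements of the theorem can be checked on a toy case. The sketch below assumes E = R and a discrete positive measure with finitely many atoms (points and weights chosen arbitrarily), so that the Laplace transform is a finite sum. It compares a finite-difference second derivative of f = log F with the variance of M under the weight $e^{Mα} μ(M)$ normalised by F(α). All names are illustrative.

```python
import math

def laplace(alpha, points, weights):
    """F(alpha) = sum_k w_k exp(M_k alpha) for a discrete measure on E* = R."""
    return sum(w * math.exp(m * alpha) for m, w in zip(points, weights))

def log_laplace(alpha, points, weights):
    """f = log F, well defined because F > 0."""
    return math.log(laplace(alpha, points, weights))

def tilted_mean(alpha, points, weights):
    """D(f)(alpha): mean of M under the normalised weight exp(M alpha) mu / F."""
    total = laplace(alpha, points, weights)
    return sum(w * m * math.exp(m * alpha)
               for m, w in zip(points, weights)) / total

def tilted_variance(alpha, points, weights):
    """D^2(f)(alpha): variance of M under the same normalised weight."""
    total = laplace(alpha, points, weights)
    mean = tilted_mean(alpha, points, weights)
    return sum(w * (m - mean) ** 2 * math.exp(m * alpha)
               for m, w in zip(points, weights)) / total

def second_difference(f, x, h=1e-4):
    """Central finite-difference approximation of f''(x)."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)
```

For atoms at three distinct points the variance is strictly positive, in line with the invertibility condition; for a measure carried by a single point its affine envelope is not all of E* and the variance vanishes.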

See [107], for links between dual convex functions and optimization.

Before introducing Entropy, Souriau introduced the following lemma:

Lemma 1.

Let

X

be a locally compact space and let

λ

be a positive measure on

X

, having

X

as support, then the following function

Φ

is convex:

Φ (h) = \log \int_{X} e^{h (x)} λ (x) d x, \forall h \in C (X)

(88)

such that the integral converges.

The integral is strictly positive when it converges, which ensures the existence of its logarithm. The epigraph of

Φ

is the set of

(\begin{matrix} h \\ y \end{matrix})

such that

\int_{X} e^{h (x) - y} λ (x) d x \leq 1

. The convexity of the exponential shows that this epigraph is convex. Finally, Souriau introduced the “negentropy” as the Legendre transform of the function

Φ

:

Definition 1. [Souriau Entropy Definition]

We call “Boltzmann Law” (relative to

λ

) every measure

μ

on

X

such that the set of real values:

μ (h) - Φ (h), h \in d e f (Φ) and h is μ-integrable

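(89)

is bounded above. The upper bound of this set, which is the Legendre transform of Φ evaluated at μ, is the negentropy of μ.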

This definition of entropy by Souriau is a general scheme that can be extended to highly abstract spaces while preserving the Legendre structure [108], provided that a generalized Laplace transform can be defined. These Laplace and Legendre transforms are the core contextures of the theory of information and heat: they generate well-defined structures in which the definition of “average value” is preserved. Jean-Marie Souriau explained this contexture property in the following sentence:
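On a finite set X the Legendre transform of Φ can be evaluated explicitly, which gives a concrete reading of the negentropy. The sketch below assumes X has finitely many points, λ has positive weights and μ is a probability measure with full support; under these assumptions the supremum of μ(h) − Φ(h) is reached at h = log(μ/λ) and equals Σ μ log(μ/λ). The function names are illustrative.

```python
import math
import random

def phi(h, lam):
    """Phi(h) = log sum_k lam_k exp(h_k) on a finite set X."""
    return math.log(sum(l * math.exp(hk) for hk, l in zip(h, lam)))

def objective(h, mu, lam):
    """mu(h) - Phi(h), the quantity whose supremum defines the negentropy."""
    return sum(m * hk for m, hk in zip(mu, h)) - phi(h, lam)

def optimal_h(mu, lam):
    """Maximiser h = log(mu / lam), assuming mu_k > 0 and lam_k > 0."""
    return [math.log(m / l) for m, l in zip(mu, lam)]

def negentropy(mu, lam):
    """Closed form sum_k mu_k log(mu_k / lam_k) for a probability measure mu."""
    return sum(m * math.log(m / l) for m, l in zip(mu, lam))

def sample_objectives(mu, lam, count, seed):
    """Values of mu(h) - Phi(h) at random test functions h."""
    rng = random.Random(seed)
    return [objective([rng.uniform(-3.0, 3.0) for _ in mu], mu, lam)
            for _ in range(count)]
```

Every sampled value stays below `negentropy(mu, lam)`, and the entropy is the opposite of this supremum.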

“Il est évident que l’on ne peut définir de valeurs moyennes que sur des objets appartenant à un espace vectoriel (ou affine); donc, si bourbakiste que puisse sembler cette affirmation, que l’on n’observera et ne mesurera de valeurs moyennes que sur des grandeurs appartenant à un ensemble possédant physiquement une structure affine. Il est clair que cette structure est nécessairement unique, sinon les valeurs moyennes ne seraient pas bien définies.” (In English: It is obvious that one can only define average values on objects belonging to a vector (or affine) space; therefore, however Bourbakist this assertion may seem, one will observe and measure average values only on quantities belonging to a set that physically possesses an affine structure. It is clear that this structure is necessarily unique; otherwise the average values would not be well defined.)

See also papers of Kostant [109] and Leray [100] for generalized Laplace transforms.

7. Illustration of Souriau Thermodynamics of a Centrifuge System

Duhem [110,111,112,113] and Poincaré [114] studied statistical mechanics models of centrifuges. We illustrate Souriau’s Lie groups thermodynamics with Souriau Gibbs states for Hamiltonian actions of subgroups of the Galilean group, as presented in Souriau’s book [43] and more recently by Charles-Michel Marle [23].

Consider a Galilean Lie group:

(\begin{matrix} A & \vec{b} & \vec{d} \\ 0 & 1 & e \\ 0 & 0 & 1 \end{matrix}) with {\begin{cases} A \in S O (3) : rotation \\ \vec{b} \in R^{3} : boost \\ \vec{d} \in R^{3} : space translation \\ e : time translation \end{cases}

(90)

Galilean Lie algebra:

(\begin{matrix} j (\vec{ω}) & \vec{α} & \vec{δ} \\ 0 & 0 & ε \\ 0 & 0 & 0 \end{matrix}) with {\begin{cases} \vec{ω} = (\begin{matrix} ω_{x} \\ ω_{y} \\ ω_{z} \end{matrix}), \vec{α} and \vec{δ} \in R^{3}, ε \in R \\ j (\vec{ω}) = (\begin{matrix} 0 & - ω_{z} & ω_{y} \\ ω_{z} & 0 & - ω_{x} \\ - ω_{y} & ω_{x} & 0 \end{matrix}) \in so (3), j (\vec{ω}) \vec{r} = \vec{ω} \times \vec{r} \end{cases}

(91)

Action of Lie group:

(\begin{matrix} A & \vec{b} & \vec{d} \\ 0 & 1 & e \\ 0 & 0 & 1 \end{matrix}) (\begin{matrix} \vec{r} \\ t \\ 1 \end{matrix}) = (\begin{matrix} A \vec{r} + t \vec{b} + \vec{d} \\ t + e \\ 1 \end{matrix}) with \vec{r} = (\begin{matrix} x \\ y \\ z \end{matrix})

(92)

Galilean transformation on position and speed is given by:

(\begin{matrix} \vec{r}' & \vec{v}' \\ t' & 1 \\ 1 & 0 \end{matrix}) = (\begin{matrix} A & \vec{b} & \vec{d} \\ 0 & 1 & e \\ 0 & 0 & 1 \end{matrix}) (\begin{matrix} \vec{r} & \vec{v} \\ t & 1 \\ 1 & 0 \end{matrix}) = (\begin{matrix} A \vec{r} + t \vec{b} + \vec{d} & A \vec{v} + \vec{b} \\ t + e & 1 \\ 1 & 0 \end{matrix})

(93)
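The matrix form of the Galilean group and its action on a space-time point can be written down directly. The sketch below stores a group element as a 5 × 5 nested list with blocks (A, b, d; 0, 1, e; 0, 0, 1) and applies it to the column (r, t, 1); the helper names are illustrative and not taken from the paper.

```python
import math

def rot_z(angle):
    """Rotation about the z axis, an element of SO(3)."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def galilei_matrix(A, b, d, e):
    """5x5 matrix with rotation A, boost b, space translation d, time translation e."""
    rows = [list(A[i]) + [b[i], d[i]] for i in range(3)]
    rows.append([0.0, 0.0, 0.0, 1.0, e])
    rows.append([0.0, 0.0, 0.0, 0.0, 1.0])
    return rows

def compose(g1, g2):
    """Group product, i.e. the ordinary matrix product of two 5x5 matrices."""
    return [[sum(g1[i][k] * g2[k][j] for k in range(5)) for j in range(5)]
            for i in range(5)]

def act(g, r, t):
    """Apply g to the space-time point (r, t); returns (A r + t b + d, t + e)."""
    column = list(r) + [t, 1.0]
    image = [sum(g[i][k] * column[k] for k in range(5)) for i in range(5)]
    return image[:3], image[3]
```

Applying `compose(g1, g2)` to a point gives the same result as applying `g2` and then `g1`, and the two bottom rows of the product keep the form (0, 1, e1 + e2) and (0, 0, 1).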

Souriau proved that this action is Hamiltonian, with the map J, defined on the evolution space of the particle, with values in the dual g* of the Lie algebra g, as momentum map:

J (\vec{r}, t, \vec{v}, m) = m (\begin{matrix} \vec{r} \times \vec{v} & 0 & 0 \\ \vec{r} - t \vec{v} & 0 & 0 \\ \vec{v} & \frac{1}{2} {‖ \vec{v} ‖}^{2} & 0 \end{matrix}) = m {\vec{r} \times \vec{v}, \vec{r} - t \vec{v}, \vec{v}, \frac{1}{2} {‖ \vec{v} ‖}^{2}} \in g^{*}

(94)

where the coupling formula is given by:

\begin{array}{l} 〈 J (\vec{r}, t, \vec{v}, m), β 〉 = 〈 m {\vec{r} \times \vec{v}, \vec{r} - t \vec{v}, \vec{v}, \frac{1}{2} {‖ \vec{v} ‖}^{2}}, {\vec{ω}, \vec{α}, \vec{δ}, ε} 〉 \\ 〈 J (\vec{r}, t, \vec{v}, m), β 〉 = m (\vec{ω} . (\vec{r} \times \vec{v}) - (\vec{r} - t \vec{v}) . \vec{α} + \vec{v} . \vec{δ} - \frac{1}{2} {‖ \vec{v} ‖}^{2} ε) \end{array}

(95)

with:

Z = (\begin{matrix} j (\vec{ω}) & \vec{α} & \vec{δ} \\ 0 & 0 & ε \\ 0 & 0 & 0 \end{matrix}) = {\vec{ω}, \vec{α}, \vec{δ}, ε} \in g

(96)
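A quick way to see why J is the natural observable here is that each of its four components is constant along the motion of a free particle. The sketch below encodes J as the tuple m (r × v, r − t v, v, ½‖v‖²) and the coupling with the sign pattern ω·(r × v) − (r − t v)·α + v·δ − ½‖v‖² ε, then evaluates both along r(t) = r₀ + t v₀. The function names are illustrative.

```python
def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def moment(r, t, v, m):
    """Components of J: m * (r x v, r - t v, v, |v|^2 / 2)."""
    angular = [m * x for x in cross(r, v)]
    passage = [m * (ri - t * vi) for ri, vi in zip(r, v)]
    linear = [m * vi for vi in v]
    energy = 0.5 * m * dot(v, v)
    return angular, passage, linear, energy

def pairing(J, beta):
    """<J, beta> for beta = (omega, alpha, delta, eps)."""
    angular, passage, linear, energy = J
    omega, alpha, delta, eps = beta
    return (dot(omega, angular) - dot(passage, alpha)
            + dot(linear, delta) - energy * eps)

def free_motion(r0, v0, t):
    """Position and velocity of a free particle at time t."""
    return [ri + t * vi for ri, vi in zip(r0, v0)], list(v0)
```

Evaluating `moment` at two different times along `free_motion` returns the same four components up to rounding, so the pairing with any fixed β is also unchanged.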

Souriau gave the proof of the Galilean moment map for a free particle, starting from the definition of the moment map:

σ (d p) (δ p) = - d 〈 J, Z 〉, \forall d p

(97)

and the definition of tangent vector field:

Z_{V} (p) = δ [a_{V} (p)]

(98)

Z = (\begin{matrix} j (\vec{ω}) & \vec{α} & \vec{δ} \\ 0 & 0 & ε \\ 0 & 0 & 0 \end{matrix}) \in g \underset{Z_{V} (p) = δ [a_{V} (p)]}{\Rightarrow} {\begin{cases} δ t = ε \\ δ r_{j} = \vec{ω} \times r_{j} + \vec{α} t + \vec{δ} \\ δ v_{j} = \vec{ω} \times v_{j} + \vec{α} \end{cases}

(99)

Then, as the general Lagrange 2-form for a force F is:

d p = (\begin{matrix} d t \\ d r \\ d v \end{matrix}) and δ p = (\begin{matrix} δ t \\ δ r \\ δ v \end{matrix}) \Rightarrow σ (d p) (δ p) = 〈 m d v - F d t, δ r - v δ t 〉 - 〈 m δ v - F δ t, d r - v d t 〉

(100)

If F is equal to zero, we obtain:

\begin{array}{l} σ (d p) (δ p) = \sum_{j} 〈 m d v, \vec{ω} \times r_{j} + \vec{α} t + \vec{δ} - v ε 〉 - 〈 m (\vec{ω} \times v_{j} + \vec{α}), d r - v d t 〉 \\ σ (d p) (δ p) = - d 〈 J, Z 〉 = - d J_{Z} = - d H \end{array}

(101)

and the co-cycle is given by:

θ (g) = J (A d_{g} Z) - A d_{g}^{*} (J (Z)) = {\vec{d} \times \vec{b}, \vec{d} - \vec{b} e, \vec{b}, \frac{1}{2} {‖ \vec{b} ‖}^{2}}

(102)

Souriau’s main idea was to define the Gibbs states for one-parameter subgroups of the Galilean group. Souriau proved that the action of the full Galilean group on the space of motions of an isolated mechanical system is not related to any equilibrium Gibbs state (the open subset of the Lie algebra associated with this Gibbs state is empty). The 1-parameter subgroup of the Galilean group generated by an element β of the Lie algebra is the set of matrices:

\exp (τ β) = (\begin{matrix} A (τ) & \vec{b} (τ) & \vec{d} (τ) \\ 0 & 1 & τ ε \\ 0 & 0 & 1 \end{matrix}) with {\begin{cases} A (τ) = \exp (τ j (\vec{ω})) and \vec{b} (τ) = (\sum_{i = 1}^{\infty} \frac{τ^{i}}{i!} {(j (\vec{ω}))}^{i - 1}) \vec{α} \\ \vec{d} (τ) = (\sum_{i = 1}^{\infty} \frac{τ^{i}}{i!} {(j (\vec{ω}))}^{i - 1}) \vec{δ} + ε (\sum_{i = 2}^{\infty} \frac{τ^{i}}{i!} {(j (\vec{ω}))}^{i - 2}) \vec{α} \end{cases}

(103)

and:

β = (\begin{matrix} j (\vec{ω}) & \vec{α} & \vec{δ} \\ 0 & 0 & ε \\ 0 & 0 & 0 \end{matrix}) \in g

(104)

Then, the Gibbs state for a gas enclosed in a moving box can be computed by Souriau’s formula. If we fix the affine Euclidean reference frame

(0, {\vec{e}}_{x}, {\vec{e}}_{y}, {\vec{e}}_{z})

at

t = 0

, if we set the value

τ = t / ε

, moving frame

(0, {\vec{e}}_{x} (t), {\vec{e}}_{y} (t), {\vec{e}}_{z} (t))

velocity and acceleration are given by the vector field related to

β

element of the Lie algebra. For each point, we can associate a rotation speed

‖ \vec{ω} ‖ / ε

, a speed

\vec{δ} / ε

and an acceleration

\vec{α} / ε

. Consider a gas made of N point particles, indexed by i ∈ {1, 2, …, N}, enclosed in a box with rigid and undeformable walls, whose motion is described by the action of the 1-parameter subgroup of the Galilean group,

A (t / ε)

where t ∈ R. We denote by

m_{i}, r_{i} (t), v_{i} (t)

, respectively, the mass, position vector and velocity vector of the i-th particle at time t. If we assume free particles and neglect the contributions of collisions between the particles and of collisions with the walls, then we can write:

〈 J, β 〉 = \sum_{i = 1}^{N} 〈 J_{i}, β 〉 with 〈 J_{i} ({\vec{r}}_{i}, t, {\vec{v}}_{i}, m_{i}), β 〉 = m_{i} (\vec{ω} . ({\vec{r}}_{i} \times {\vec{v}}_{i}) - ({\vec{r}}_{i} - t {\vec{v}}_{i}) . \vec{α} + {\vec{v}}_{i} . \vec{δ} - \frac{1}{2} {‖ {\vec{v}}_{i} ‖}^{2} ε)

(105)

The important idea is to observe that

〈 J_{i}, β 〉

is invariant under the action of the 1-parameter subgroup. The proof of

〈 J_{i}, β 〉

invariance is based on Souriau’s equation for the lack of equivariance, with its cocycle. If the action of the 1-parameter subgroup is

\exp (\frac{t}{ε} β)

, according to Souriau equation:

a (g, J) = A d_{g}^{*} (J) + θ (g)

(106)

We obtain:

〈 J_{i} (p), β 〉 = 〈 A d_{g}^{*} (J_{i} (p_{0})), β 〉 + 〈 θ (g), β 〉 = 〈 J_{i} (p_{0}), A d_{g^{- 1}} β 〉 + 〈 θ (g), β 〉

which can be reduced by using the properties:

{\begin{cases} A d_{g^{- 1}} β = β \\ 〈 θ (g), β 〉 = 0 \end{cases} \Rightarrow 〈 J_{i} (p), β 〉 = 〈 J_{i} (p_{0}), β 〉

(107)

and:

\begin{array}{l} at t = 0 then 〈 J_{i} ({\vec{r}}_{i}, t, {\vec{v}}_{i}, m_{i}), β 〉 = m_{i} (\vec{ω} . ({\vec{r}}_{i 0} \times {\vec{v}}_{i 0}) - {\vec{r}}_{i 0} . \vec{α} + {\vec{v}}_{i 0} . \vec{δ} - \frac{1}{2} {‖ {\vec{v}}_{i 0} ‖}^{2} ε) \\ = m_{i} ({\vec{v}}_{i 0} . (\vec{ω} \times {\vec{r}}_{i 0} + \vec{δ}) - {\vec{r}}_{i 0} . \vec{α} - \frac{1}{2} {‖ {\vec{v}}_{i 0} ‖}^{2} ε) \end{array}

(108)

To obtain Souriau’s Gibbs maximum entropy density, we use the following change of variables, which completes the square in the velocity:

{\vec{U}}^{*} = \frac{1}{ε} (\vec{ω} \times {\vec{r}}_{i 0} + \vec{δ})

(109)

〈 J_{i} ({\vec{r}}_{i}, t, {\vec{v}}_{i}, m_{i}), β 〉 = m_{i} ε (- \frac{1}{2} {‖ {\vec{v}}_{i 0} - {\vec{U}}^{*} ‖}^{2} - {\vec{r}}_{i 0} . \frac{\vec{α}}{ε} + \frac{1}{2} {‖ {\vec{U}}^{*} ‖}^{2})

(110)

We can then write:

\begin{array}{l} 〈 J_{i} ({\vec{r}}_{i 0}, {\vec{p}}_{i 0}), β 〉 = - ε (\frac{1}{2 m_{i}} {‖ {\vec{p}}_{i 0} ‖}^{2} + m_{i} f_{i} ({\vec{r}}_{i 0})) with ε = - \frac{1}{κ T} \\ with {\begin{cases} {\vec{p}}_{i 0} = m_{i} {\vec{w}}_{i 0} = m_{i} ({\vec{v}}_{i 0} - {\vec{U}}^{*}) \\ f_{i} ({\vec{r}}_{i 0}) = {\vec{r}}_{i 0} . \frac{\vec{α}}{ε} - \frac{1}{2 ε^{2}} {‖ \vec{ω} \times {\vec{r}}_{i 0} ‖}^{2} - \frac{\vec{δ}}{ε} . (\frac{\vec{ω}}{ε} \times {\vec{r}}_{i 0}) - \frac{1}{2 ε^{2}} {‖ \vec{δ} ‖}^{2} \end{cases} \end{array}

(111)

and finally, the Souriau Gibbs density is given by:

ρ (β) = \prod_{i = 1}^{N} ρ_{i} (β) with ρ_{i} (β) = \frac{1}{P_{i} (β)} \exp (- 〈 J_{i}, β 〉)

(112)

P_{i} (β) = \int_{M_{i}} \exp (- 〈 J_{i}, β 〉) d λ_{ω_{i}}, Q_{i} (β) = \int_{M_{i}} J_{i} \exp (- 〈 J_{i}, β 〉) d λ_{ω_{i}} and P (β) = \prod_{i = 1}^{N} P_{i} (β)

(113)

If we consider the case of the centrifuge (as for a butter churn, a device used to convert cream into butter), the parameters of the Galilean group Lie algebra reduce to:

\begin{array}{l} \vec{ω} = ω {\vec{e}}_{z}, \vec{α} = 0 and \vec{δ} = 0 \\ Rotation speed : \frac{ω}{ε} \end{array} with β = (\begin{matrix} j (\vec{ω}) & \vec{α} & \vec{δ} \\ 0 & 0 & ε \\ 0 & 0 & 0 \end{matrix}) \in g

(114)

with variables:

f_{i} ({\vec{r}}_{i 0}) = - \frac{ω^{2}}{2 ε^{2}} {‖ {\vec{e}}_{z} \times {\vec{r}}_{i 0} ‖}^{2} with Δ = ‖ {\vec{e}}_{z} \times {\vec{r}}_{i 0} ‖ distance to axis z

(115)

We obtain the closed form for maximum entropy Souriau-Gibbs density:

ρ_{i} (β) = \frac{1}{P_{i} (β)} \exp (- 〈 J_{i}, β 〉) = c s t . \exp (- \frac{1}{2 m_{i} κ T} {‖ {\vec{p}}_{i 0} ‖}^{2} + \frac{m_{i}}{2 κ T} {(\frac{ω}{ε})}^{2} Δ^{2})

(116)

This equation describes the behaviour of a gas made of point particles of various masses in a centrifuge rotating at a constant angular velocity and explains the observation that the heavier particles concentrate farther from the rotation axis than the lighter ones. Souriau made reference to thermodynamics of butter churn (see Figure 8).
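The mass dependence of the radial profile can be made concrete with a few lines of code. The sketch below keeps only the position-dependent factor exp(m Ω² Δ² / (2 κ T)) of the density, where Ω stands for the rotation speed ω/ε and Δ for the distance to the axis; the function names and the numerical values used in the usage note are hypothetical.

```python
import math

KAPPA = 1.380649e-23  # Boltzmann constant in J/K

def radial_exponent(mass, rotation_speed, distance, temperature):
    """Position-dependent exponent m * Omega^2 * Delta^2 / (2 * kappa * T)."""
    return (mass * rotation_speed ** 2 * distance ** 2
            / (2.0 * KAPPA * temperature))

def enrichment(mass, rotation_speed, r_inner, r_outer, temperature):
    """Ratio of the spatial density at r_outer to the density at r_inner."""
    outer = radial_exponent(mass, rotation_speed, r_outer, temperature)
    inner = radial_exponent(mass, rotation_speed, r_inner, temperature)
    return math.exp(outer - inner)
```

For example, with illustrative masses of 4e-26 kg and 8e-26 kg, Ω = 1000 rad/s, T = 300 K, and radii 0 and 0.1 m, both ratios exceed 1 and the heavier species has the larger one; the logarithms of the two ratios are in the ratio of the masses.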

Souriau’s Lie groups thermodynamics gives correct results when applied to subgroups of the Galileo group, as in the previous example of a cylindrical box containing a fluid, with an invariance subgroup of dimension 2 (rotation about the axis, time translation) providing a 2-dimensional Souriau (Planck) temperature vector. Souriau observed that the process by which a refrigerated centrifuge transmits its own temperature vector to its content has two names, thermal conduction and viscosity, depending on the temperature-vector component that is considered. Conduction and viscosity should therefore be unified in a fundamental theory of irreversible processes (a theory that remains to be constructed).

In the Appendix, we develop a solution given by Roger Balian [25] for the previous case of centrifuge thermodynamics, based on classical methods. Balian recovers the same Gibbs density, but by introducing an additional Lagrange hyper-parameter associated with the total angular momentum. Balian computed the Boltzmann-Gibbs distribution without knowing the Souriau equations (exercise 7b of). Balian started by considering the constants of motion, which are the energy and the component

J_{z}

of the total angular momentum

J = \sum_{i} (r_{i} \times p_{i})

. Balian observed that he must add to the Lagrange parameter, given by the (Planck) temperature

β

for energy, an additional one associated with

J_{z}

. He identifies this additional multiplier with

- β ω

by evaluating the mean velocity at each point. He then obtained the same results by changing the frame of reference, writing the Lagrangian and the Hamiltonian in the rotating frame, and writing down the canonical equilibrium in that frame. He uses the resulting distribution to find, through integration over the momenta, an expression for the particle density as a function of the distance from the cylinder axis. The main advantage of the Souriau model is that a covariant Gibbs density for dynamical systems can be defined simply by applying the formulas, without any additional considerations [64].

8. Higher-Order Model of Lie Groups Thermodynamics Based on Poly-Symplectic Vector Valued Model

As observed by Souriau in Chapter IV of [43], the Gaussian density is a maximum entropy density of 1st order. For the multivariate Gaussian density, this remark becomes clear if we replace the classical parameterization

z

and

(m, R)

by the new parameterization, linked to information geometry coordinates,

ξ

and

β

:

\begin{array}{l} p_{(m, R)} (z) = \frac{1}{{(2 π)}^{n / 2} \det {(R)}^{1 / 2}} e^{- \frac{1}{2} {(z - m)}^{T} R^{- 1} (z - m)} = \frac{1}{{(2 π)}^{n / 2} \det {(R)}^{1 / 2} e^{\frac{1}{2} m^{T} R^{- 1} m}} e^{- [- m^{T} R^{- 1} z + \frac{1}{2} z^{T} R^{- 1} z]} \\ p_{(m, R)} (z) = p_{\hat{ξ}} (ξ) = \frac{1}{Z} e^{- 〈 β, ξ 〉} with ξ = [\begin{matrix} z \\ z z^{T} \end{matrix}], \hat{ξ} = [\begin{matrix} E [z] \\ E [z z^{T}] \end{matrix}] = [\begin{matrix} m \\ R + m m^{T} \end{matrix}] \\ and β = [\begin{matrix} - R^{- 1} m \\ \frac{1}{2} R^{- 1} \end{matrix}] = [\begin{matrix} a \\ H \end{matrix}] where 〈 β, ξ 〉 = a^{T} z + z^{T} H z = T r [z a^{T} + z z^{T} H^{T}] \\ with \log (Z) = \frac{n}{2} \log (2 π) + \frac{1}{2} \log \det (R) + \frac{1}{2} m^{T} R^{- 1} m and S (\hat{ξ}) = 〈 \hat{ξ}, β 〉 - Φ (β) \\ \hat{ξ} = Θ (β) = \frac{\partial Φ (β)}{\partial β} and β = Θ^{- 1} (\hat{ξ}) with Φ (β) = - \log ψ_{Ω} (β) = - \log \int_{Ω^{*}} e^{- 〈 β, ξ 〉} d ξ \\ Fisher : I (β) = \frac{\partial^{2} \log ψ_{Ω} (β)}{\partial β^{2}} = E [\frac{\partial \log p_{β} (ξ)}{\partial β} {\frac{\partial \log p_{β} (ξ)}{\partial β}}^{T}] = E [(ξ - \hat{ξ}) {(ξ - \hat{ξ})}^{T}] \end{array}

(117)

We can observe in previous equations that classical multivariate Gaussian density, classically expressed by

p_{(m, R)} (z) = \frac{1}{{(2 π)}^{n / 2} \det {(R)}^{1 / 2}} e^{- \frac{1}{2} {(z - m)}^{T} R^{- 1} (z - m)}

could be rewritten in a new parameterization in a Gibbs density form

p_{\hat{ξ}} (ξ) = \frac{1}{Z} e^{- 〈 β, ξ 〉}

with tensor variable

ξ = [\begin{matrix} z \\ z z^{T} \end{matrix}]

, where

\hat{ξ} = E [ξ] = [\begin{matrix} m \\ R + m m^{T} \end{matrix}]

and tensor parameterization

β = [\begin{matrix} - R^{- 1} m \\ \frac{1}{2} R^{- 1} \end{matrix}] = [\begin{matrix} a \\ H \end{matrix}]

with the following definition of the duality bracket given by

〈 β, ξ 〉 = a^{T} z + z^{T} H z = T r [z a^{T} + z z^{T} H^{T}]

also written in the initial parameterization

〈 β, ξ 〉 = - m^{T} R^{- 1} z + \frac{1}{2} z^{T} R^{- 1} z = T r [- z m^{T} R^{- 1} + \frac{1}{2} z z^{T} R^{- 1}]

. To understand the meaning of these tensors, we can consider them as isomorphic to the following respective matrices

ξ = [\begin{matrix} z z^{T} & z \\ 0_{1 \times n} & 0 \end{matrix}]

,

\hat{ξ} = [\begin{matrix} R + m m^{T} & m \\ 0_{1 \times n} & 0 \end{matrix}]

and

β = [\begin{matrix} \frac{1}{2} R^{- 1} & - R^{- 1} m \\ 0_{1 \times n} & 0 \end{matrix}]

with

〈 β, ξ 〉 = T r [β ξ^{T}]

(see [91] for more details).

Z is the classical normalization constant that is equal to

\log (Z) = \frac{n}{2} \log (2 π) + \frac{1}{2} \log \det (R) + \frac{1}{2} m^{T} R^{- 1} m

. In this new parameterization, we can express the entropy by Legendre transform

S (\hat{ξ}) = 〈 \hat{ξ}, β 〉 - Φ (β)

of Massieu characteristic function

Φ (β) = - \log ψ_{Ω} (β) = - \log \int_{Ω^{*}} e^{- 〈 β, ξ 〉} d ξ

(minus logarithm of partition function

ψ_{Ω} (β) = \int_{Ω^{*}} e^{- 〈 β, ξ 〉} d ξ

), with the Souriau (Planck) geometric temperature given by

β = Θ^{- 1} (\hat{ξ})

where the function

Θ^{- 1} (.)

is the inverse of the function given by

\hat{ξ} = Θ (β) = \frac{\partial Φ (β)}{\partial β}

(the temperature is also given by

β = \frac{\partial S (\hat{ξ})}{\partial \hat{ξ}}

given by the Legendre transform, where we recover the classical definition of entropy by Clausius

d S = \frac{d Q}{T}

when

β = \frac{1}{T}

and

\hat{ξ} = Q

the heat). We can also define the Fisher metric of information geometry by

I (β) = \frac{\partial^{2} \log ψ_{Ω} (β)}{\partial β^{2}}

or

I (β) = - E [\frac{\partial^{2} \log p_{β} (ξ)}{\partial β^{2}}] = E [\frac{\partial \log p_{β} (ξ)}{\partial β} {\frac{\partial \log p_{β} (ξ)}{\partial β}}^{T}] = E [(ξ - \hat{ξ}) {(ξ - \hat{ξ})}^{T}]

. From this development, we can observe that classical multivariate Gaussian Density

p_{\hat{ξ}} (ξ) = \frac{1}{Z} e^{- 〈 β, ξ 〉}

is a maximum entropy Gibbs density of 1st order with respect to the tensorial variable

\hat{ξ} = E [ξ] = [\begin{matrix} m \\ R + m m^{T} \end{matrix}]

. Classically Gaussian density is considered as a maximum entropy Gibbs density of 2nd order where

p_{(m, R)} (z) = \frac{1}{{(2 π)}^{n / 2} \det {(R)}^{1 / 2}} e^{- \frac{1}{2} {(z - m)}^{T} R^{- 1} (z - m)}

is the density that maximizes the entropy

- \int p_{(m, R)} (z) \log p_{(m, R)} (z) d z

under the constraints that first two moments are known

m = \int z . p_{(m, R)} (z) d z

and

R = \int (z - m) {(z - m)}^{T} . p_{(m, R)} (z) d z

. The question is then, could we define a Gaussian density of higher order?
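The reparameterization can be verified numerically in the scalar case n = 1. The sketch below, with illustrative function names, compares the classical density with the Gibbs form exp(−(a z + H z²)) / Z, where a = −m/R, H = 1/(2R) and log Z = ½ log(2π) + ½ log R + ½ m²/R, and recovers the first-order moments E[z] = m and E[z²] = R + m² by a simple trapezoidal rule.

```python
import math

def gaussian_pdf(z, m, R):
    """Classical parameterization by the mean m and the variance R."""
    return math.exp(-0.5 * (z - m) ** 2 / R) / math.sqrt(2.0 * math.pi * R)

def natural_parameters(m, R):
    """beta = (a, H) = (-m / R, 1 / (2 R))."""
    return -m / R, 0.5 / R

def log_partition(m, R):
    """log Z = 0.5 log(2 pi) + 0.5 log R + 0.5 m^2 / R (scalar case)."""
    return 0.5 * math.log(2.0 * math.pi) + 0.5 * math.log(R) + 0.5 * m * m / R

def gibbs_pdf(z, m, R):
    """Gibbs form exp(-<beta, xi>) / Z with xi = (z, z^2)."""
    a, H = natural_parameters(m, R)
    return math.exp(-(a * z + H * z * z) - log_partition(m, R))

def expectation(g, m, R, half_width=12.0, steps=20000):
    """Trapezoidal estimate of E[g(z)] under gibbs_pdf on m +/- half_width sigma."""
    sigma = math.sqrt(R)
    lower = m - half_width * sigma
    dz = 2.0 * half_width * sigma / steps
    total = 0.0
    for k in range(steps + 1):
        z = lower + k * dz
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * g(z) * gibbs_pdf(z, m, R)
    return total * dz
```

The two densities agree pointwise, and the expectation of ξ = (z, z²) gives the pair (m, R + m²), which is the scalar version of the tensor $\hat{ξ}$.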

We have seen that Souriau recast the classical maximum entropy approach by replacing the Lagrange parameters with a single geometric “temperature vector”, an element of the Lie algebra. In parallel, Ingarden introduced second and higher order temperatures of the Gibbs state, which could be extended to Souriau’s theory of thermodynamics. The question is then how to extend the Souriau model to define a higher-order Lie groups thermodynamics. For this purpose, we propose to consider multi-symplectic geometry, and more particularly poly-symplectic geometry [115]. The generalization of variational problems to several variables was developed by Volterra in two papers [116,117], where two different generalizations of the Hamilton system of equations are introduced. In parallel, De Donder [53] also studied this approach in a geometrical framework based on Elie Cartan’s idea of an invariant structure with no dependence on local coordinates, and based on an affine multisymplectic manifold. We can also formalize multisymplectic geometry with an extension of the Poincaré-Cartan invariant integrals. Frédéric Hélein has observed that the fact that different theories can coexist was considered jointly by Lepage [54], Dedecker [118,119] and Kijowski [92,93,94]. The Lepage–Dedecker theory was developed by Hélein [120], and the modern formulation using the multisymplectic (n + 1)-form as the fundamental structure of the theory starts with Kijowski’s papers. The geometrical multisymplectic approach uses the generalized Legendre correspondence introduced by Lepage and Dedecker and the Hamiltonian formalism developed by Hélein [55]. We can also refer to the poly-symplectic formulation of physical systems by Carathéodory [121] and Weyl [122].

Among all multi-symplectic models, the most natural multi-valued one that preserves the notion of (poly-)moment map was initiated by Günther, based on the n-symplectic model. Günther showed that the symplectic structure on the phase space remains valid if we replace the symplectic form by a vector-valued form, called poly-symplectic. The Günther formalism is based on the notion of a poly-symplectic form, which is a vector-valued generalization of symplectic forms. The Hamiltonian formalism for multiple integral variational problems and field theory is presented in a global geometric setting. In this poly-symplectic formalism, Günther introduced Hamiltonian equations, canonical transformations, Lagrange systems, symmetries, field-theoretic moment mappings, and a classification of G-homogeneous field-theoretic systems on a generalization of coadjoint orbits.

Günther has defined six conditions for a multidimensional Hamiltonian formalism:

C0: For each field system, an evolution space can be constructed, which describes the states of the system completely.
C1: The evolution space carries a geometric structure, which assigns to each function (Hamiltonian density) its Hamiltonian equations.
C2: The geometry of the evolution space gives ‘canonical transformations’, i.e., the general symmetry group of a system independently of the choice of Hamiltonian density.
C3: The formalism is covariant, i.e., no special coordinates or coordinate systems on the parameter space are used to construct the Hamiltonian equations.
C4: There is an equivalence between regular Lagrange systems and certain (regular) Hamiltonian systems.
C5: For one dimensional parameter space the theory reduces to the ordinary Hamiltonian formalism on symplectic manifolds in classical mechanics.

Günther observed that the Hamiltonian field theory of Marsden is not covariant, because C3 is not satisfied, which causes problems in relativistic theories, and that the multisymplectic approach of Tulczyjew, based on the general theory of Dedecker, does not satisfy C1 and C2.

The key idea of Günther for this generalized Hamiltonian formalism is to replace the symplectic form of classical mechanics by a vector-valued, so-called poly-symplectic form, with the properties that:

the evolution space of a classical field will appear as the dual of a jet bundle, which carries naturally a polysymplectic structure.
canonical transformations are bundle isomorphisms leaving this poly-symplectic form invariant.

The polysymplectic approach recovers all classical results, generalizes the Noether theorem based on canonical transformations, and preserves the existence of momentum mappings. Christian Günther’s work was inspired by the symplectic formulation of classical mechanics by Souriau and by the work of Edelen [52,123] and Rund [124] on a local Hamiltonian formulation of field theory. Edelen’s work is a coordinate version of the local polysymplectic approach of Günther.

In work initiated by Günther [48,49] and based on the n-symplectic model [50,51], it has been shown that the symplectic structure on the phase space is preserved if we replace the symplectic form by a vector-valued form, which is called polysymplectic.

In Günther’s poly-symplectic model, we set:

P : space of field values, ϕ : U \to P

and we consider the bundle of linear maps from Rⁿ into the tangent spaces of P:

I^{n} P ≅ H o m (R^{n}, T P) ≅ T P \otimes R^{n *}

(118)

If a basis of Rⁿ is chosen, elements of this bundle can be interpreted as n-tuples of tangent vectors of P, and there is the isomorphism:

I^{n} P ≅ \oplus_{1}^{n} T P

(119)

The natural projection is given by:

τ_{P}^{n} : I^{n} P \to P

(120)

The cojet space

H o m (R^{n}, T P)

carries a natural Rⁿ-valued:

one-form: $Θ_{0}$ (canonical one-form):

$Θ_{0} = \sum_{i = 1}^{n} p_{i} d q \otimes \frac{\partial}{\partial x_{i}}$

(121)
two-form: $Ω_{0} = - d Θ_{0}$ closed & non-degenerate (canonical polysymplectic form)

$Ω_{0} = \sum_{i = 1}^{n} d q \land d p_{i} \otimes \frac{\partial}{\partial x_{i}}$

(122)

Definition 2.

A closed nondegenerate Rⁿ-valued two-form Ω on a manifold M is called a polysymplectic form. The pair (M, Ω) is a polysymplectic manifold.
A polysymplectic form Ω on a manifold M is called a standard form iff M has an atlas of canonical charts for Ω, i.e., charts in which locally Ω is written as the canonical evaluation form on P x Lin (P,Rⁿ). (M, Ω) is called a standard polysymplectic manifold.

Souriau’s classification of symplectic homogeneous spaces by coadjoint orbits belongs to the major achievements in Hamiltonian mechanics. Günther has extended these results to polysymplectic manifolds. Let

A d : G \times L G \to L G

be the adjoint action. We denote by

A d^{n}

the induced action on

L i n (R^{n}, L G)

:

\begin{array}{l} A d_{g}^{n} : G \times L i n (R^{n}, L G) \to L i n (R^{n}, L G) \\ A d_{g}^{n} (f) (x) = A d_{g} (f (x)), f \in L i n (R^{n}, L G), x \in R^{n}, g \in G \end{array}

(123)

The dual of

A d^{n}

is denoted by

A d_{g}^{(n) *}

:

A d^{(n) *} : G \times L G^{*} \otimes R^{n} \to L G^{*} \otimes R^{n}

(124)

Corollary 1. [Günther Corollary]

Let

J^{(n)} : M \to L i n (L G, R^{n}) = L G^{*} \otimes R^{n}

be the moment map. Then there is a smooth map

θ^{(n)}

:

θ^{(n)} : G \to L G^{*} \otimes R^{n}, θ^{(n)} (g) = J^{(n)} (Φ_{g} (x)) - A d_{g}^{(n) *} (J^{(n)} (x))

(125)

with the following properties:

θ^{(n)}

is a 1-cocycle, i.e., for all

g, h \in G

:

θ^{(n)} (g h) = A d_{h}^{(n) *} (θ^{(n)} (g)) + θ^{(n)} (h)

(126)

Theorem 4. [Günther Theorem (Vector-Valued Extension of Souriau Theorem)]

The map:

\begin{array}{l} a : G \times L G^{*} \otimes R^{n} \to G \times L G^{*} \otimes R^{n} \\ a (g, η) = A d_{g}^{(n) *} η + θ^{(n)} (g) \end{array}

(127)

is an affine operation of

G

on

L G^{*} \otimes R^{n}

, and commutes for all

g \in G

.

This extension by Günther defines an action of G over

g^{*} \times \overset{(n)}{\dots} \times g^{*}

called n-coadjoint action:

Definition 3.

\begin{array}{l} A d_{g}^{* (n)} : & G \times (g^{*} \times \overset{(n)}{\dots} \times g^{*}) \to g^{*} \times \overset{(n)}{\dots} \times g^{*} \\ g \times μ_{1} \times \dots \times μ_{n} \mapsto A d_{g}^{* (n)} (μ_{1}, \dots, μ_{n}) = (A d_{g}^{*} μ_{1}, \dots, A d_{g}^{*} μ_{n}) \end{array}

(128)

Let

μ = (μ_{1}, \dots, μ_{n})

be a poly-momentum, an element of

g^{*} \times \overset{(n)}{\dots} \times g^{*}

. We can define an n-coadjoint orbit

O_{μ} = O_{(μ_{1}, \dots, μ_{n})}

at the point

μ

, for which the canonical projection

\Pr_{k} : g^{*} \times \overset{(n)}{\dots} \times g^{*} \to g^{*}, (ν_{1}, \dots, ν_{n}) \mapsto ν_{k}

induces a smooth map between the n-coadjoint orbit

O_{μ}

and the coadjoint orbit

O_{μ_{k}}

:

π_{k} : O_{μ} = O_{(μ_{1}, \dots, μ_{n})} \to O_{μ_{k}}

that is a surjective submersion with

\cap_{k = 1}^{n} K e r T π_{k} = \{0\}

.
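The n-coadjoint action of Definition 3 can be illustrated on a small case. The following is a minimal sketch, not part of the original development, assuming G = SO(3) and the usual identification of so(3)* with R³, under which the coadjoint action reduces to μ ↦ Rμ. The helper names (`rot_z`, `rot_x`, `n_coadjoint`, `proj`) are hypothetical.

```python
import math

def rot_z(t):
    """Rotation about the z axis by angle t (an element of SO(3))."""
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_x(t):
    """Rotation about the x axis by angle t (an element of SO(3))."""
    c, s = math.cos(t), math.sin(t)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def matvec(A, v):
    return [sum(A[i][k] * v[k] for k in range(3)) for i in range(3)]

def norm(v):
    return math.sqrt(sum(c * c for c in v))

def n_coadjoint(g, mu):
    """Componentwise action (Ad*_g mu_1, ..., Ad*_g mu_n), as in Eq. (128).

    Assumption of this sketch: so(3)* is identified with R^3, so Ad*_g is
    multiplication by the rotation matrix g.
    """
    return tuple(matvec(g, mu_k) for mu_k in mu)

def proj(k, nu):
    """Canonical projection Pr_k onto the k-th factor (0-based index here)."""
    return nu[k]

g, h = rot_z(0.7), rot_x(-1.2)
mu = ([1.0, 0.0, 0.0], [0.0, 2.0, 1.0])       # a poly-momentum with n = 2

lhs = n_coadjoint(matmul(g, h), mu)           # action of the product g h
rhs = n_coadjoint(g, n_coadjoint(h, mu))      # successive actions of h, then g

# Each projection of a point of the n-coadjoint orbit stays on the coadjoint
# orbit of mu_k, which for SO(3) is the sphere of radius |mu_k|.
norms_before = [norm(m) for m in mu]
norms_after = [norm(proj(k, lhs)) for k in range(len(mu))]
```

In this example the two components of the poly-momentum are moved by the same group element, which is why the n-coadjoint orbit is in general smaller than the product of the individual coadjoint orbits.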

Proposition 1.

Extending Souriau’s approach, equivariance of the poly-moment map is recovered through a unique action a(.,.) of the Lie group

G

on

g^{*} \times \overset{(n)}{\dots} \times g^{*}

for which the polymoment map

J^{(n)} = (J^{1}, \dots, J^{n}) : M \to g^{*} \times \overset{(n)}{\dots} \times g^{*}

verifies, for all

x \in M

and

g \in G

:

J^{(n)} (Φ_{g} (x)) = a (g, J^{(n)} (x)) = A d_{g}^{* (n)} (J^{(n)} (x)) + θ^{(n)} (g)

(129)

with:

A d_{g}^{* (n)} (J^{(n)} (x)) = (A d_{g}^{*} J^{1}, \dots, A d_{g}^{*} J^{n})

(130)

and:

θ^{(n)} (g) = (θ^{1} (g), \dots, θ^{n} (g))

(131)

θ^{(n)} (g)

is a poly-symplectic one-cocycle.

Definition 4.

We define a poly-symplectic two-cocycle

{\tilde{Θ}}^{(n)} = ({\tilde{Θ}}^{1}, \dots, {\tilde{Θ}}^{n})

with

{\tilde{Θ}}^{k} (X, Y) = 〈 Θ^{k} (X), Y 〉 = J_{[X, Y]}^{k} - \{J_{X}^{k}, J_{Y}^{k}\}

(132)

where:

Θ^{k} (X) = T_{e} θ^{k} (X (e))

(133)

Finally, we propose to define the poly-symplectic Souriau-Fisher metric.

Definition 5.

g_{β} ([β, Z_{1}], Z_{2}) = d i a g {[{\tilde{Θ}}_{β_{k}} (Z_{1}, Z_{2})]}_{k}, \forall Z_{1} \in g, \forall Z_{2} \in Im (a d_{β} (.)), β = (β_{1}, \dots, β_{n})

(134)

with

{\tilde{Θ}}_{β_{k}} (Z_{1}, Z_{2}) = - \frac{\partial Φ (β_{1}, \dots, β_{n})}{\partial β_{k}} = {\tilde{Θ}}^{k} (Z_{1}, Z_{2}) + 〈 Q_{k}, a d_{Z_{1}} (Z_{2}) 〉

(135)

is a poly-symplectic extension of Souriau-Fisher Metric.

Compared to the Souriau model, the heat is replaced by its counterpart in the previous polysymplectic model:

Q = (Q_{1}, \dots, Q_{n}) \in g^{*} \times \overset{(n)}{\dots} \times g^{*} with Q_{k} = \frac{\partial Φ (β_{1}, \dots, β_{n})}{\partial β_{k}} = \frac{\int_{M} U^{\otimes k} (ξ) . e^{- \sum_{j = 1}^{n} 〈 β_{j}, U^{\otimes j} (ξ) 〉} d ω}{\int_{M} e^{- \sum_{j = 1}^{n} 〈 β_{j}, U^{\otimes j} (ξ) 〉} d ω}

(136)

Proposition 2.

The characteristic function:

Φ (β_{1}, \dots, β_{n}) = - \log \int_{M} e^{- \sum_{k = 1}^{n} 〈 β_{k}, U^{\otimes k} (ξ) 〉} d ω

(137)

exists.

Proof.

We extrapolate the results of Souriau, who proved in [1,2] that

\int_{M} U^{\otimes k} (ξ) . e^{- 〈 β_{k}, U^{\otimes k} (ξ) 〉} d ω

is locally normally convergent using multi-linear norm

‖ U^{\otimes k} ‖ = \underset{U}{S u p} {〈 E, U 〉}^{k}

and where

U^{\otimes k} = U \otimes \overset{(k)}{\dots} \otimes U

is defined as a tensorial product [43]. □

Entropy is defined by the Legendre transform of the Souriau-Massieu characteristic function:

Definition 6.

The poly-entropy is given by Legendre transform of the poly-symplectic characteristic function:

S (Q_{1}, \dots, Q_{n}) = \sum_{k = 1}^{n} 〈 β_{k}, Q_{k} 〉 - Φ (β_{1}, \dots, β_{n}) where β_{k} = \frac{\partial S (Q_{1}, \dots, Q_{n})}{\partial Q_{k}}

(138)

The Gibbs density can then be extended with respect to higher-order temperatures.

Definition 7.

Gibbs density is defined as the maximum entropy density of poly-Entropy:

p_{G i b b s} (ξ) = e^{Φ (β_{1}, \dots, β_{n}) - \sum_{k = 1}^{n} 〈 β_{k}, U^{\otimes k} (ξ) 〉} = \frac{e^{- \sum_{k = 1}^{n} 〈 β_{k}, U^{\otimes k} (ξ) 〉}}{\int_{M} e^{- \sum_{k = 1}^{n} 〈 β_{k}, U^{\otimes k} (ξ) 〉} d ω}

(139)
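Equations (136) to (139) can be explored numerically in the simplest setting. The sketch below is only an illustration under strong assumptions that are not in the text: M is the real line, the moment map is the scalar U(ξ) = ξ (so that U^{⊗k}(ξ) = ξ^k), the measure dω is the Lebesgue measure truncated to a finite interval, and integrals are approximated by a midpoint rule. The function names are hypothetical.

```python
import math

def integrate(f, lo=-12.0, hi=12.0, steps=24000):
    """Midpoint rule on [lo, hi]; the truncation is an assumption of this sketch."""
    h = (hi - lo) / steps
    return h * sum(f(lo + (i + 0.5) * h) for i in range(steps))

def energy(xi, betas):
    """sum_k <beta_k, U^{(x)k}(xi)> for scalar U(xi) = xi, i.e. sum_k beta_k * xi**k."""
    return sum(b * xi ** (k + 1) for k, b in enumerate(betas))

def poly_gibbs(betas):
    """Return (Phi, [Q_1..Q_n], S, density) for higher-order temperatures betas.

    The integral converges only if the highest-order term is even with a
    positive coefficient (for instance n = 2 with beta_2 > 0).
    """
    Z = integrate(lambda x: math.exp(-energy(x, betas)))
    phi = -math.log(Z)                                      # Eq. (137)
    Q = [integrate(lambda x, k=k: x ** (k + 1) * math.exp(-energy(x, betas))) / Z
         for k in range(len(betas))]                        # Eq. (136)
    S = sum(b * q for b, q in zip(betas, Q)) - phi          # Eq. (138)
    density = lambda x: math.exp(phi - energy(x, betas))    # Eq. (139)
    return phi, Q, S, density

# beta_1 = 0 and beta_2 = 1/2 give the standard Gaussian density.
phi, Q, S, p = poly_gibbs([0.0, 0.5])
total_mass = integrate(p)
```

With these values the first two moments are those of the standard normal law and the poly-entropy coincides with its Shannon entropy, which gives a quick sanity check before trying higher orders such as `poly_gibbs([0.0, 0.5, 0.0, 0.1])`.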

9. Conclusions and Possible Extensions

We have introduced contextures of geometric theory of information and heat based on Souriau’s approach, but information geometry is at the interface between different geometries. First, information geometry is at the intersection between “Riemannian geometry”, “complex geometry” and “symplectic geometry”. Based on seminal work of Cartan on homogeneous domains and other works [125,126,127,128], information geometry is jointly founded by (see Figure 9):

Geometry of Jean-Marie Souriau: Study of homogeneous symplectic manifolds geometry with the action of dynamical groups. Introduction of the Lie groups thermodynamics in statistical mechanics [43,44].
Geometry of Jean-Louis Koszul: Study of homogeneous bounded domains geometry, symmetric homogeneous spaces and sharp convex cones. Introduction of an invariant 2-form [9,10,11,97,98,129].
Geometry of Erich Kähler: Study of differential manifolds geometry equipped with a unitary structure satisfying a condition of integrability. The homogeneous Kähler case studied by André Lichnerowicz [130].

We have extended Souriau’s Lie groups thermodynamics by a vector-valued model based on poly-symplectic geometry, introducing higher order Souriau-Gibbs density with higher order Souriau temperatures, and elements of Lie algebra. This model preserves all contextures of Souriau’s thermodynamics with covariance of Gibbs density with respect to dynamical groups in physics. Poly-moment maps are compliant with the Noether theorem generalization in vector-valued cases.

The Jean-Marie Souriau model and equations were extensively studied in the Koszul lecture “Introduction to Symplectic Geometry”, given in China in 1986 and published in Chinese (see Figure 10). This book should be translated into English in 2019. Chuan Yu Ma has written about the Koszul book: “This beautiful, modern book should not be absent from any institutional library. …. During the past eighteen years there has been considerable growth in the research on symplectic geometry. Recent research in this field has been extensive and varied. This work has coincided with developments in the field of analytic mechanics. Many new ideas have also been derived with the help of a great variety of notions from modern algebra, differential geometry, Lie groups, functional analysis, differentiable manifolds and representation theory. [Koszul’s book] emphasizes the differential-geometric and topological properties of symplectic manifolds. It gives a modern treatment of the subject that is useful for beginners as well as for experts.”

In geometrical mechanics, we have encountered the Galileo group of classical mechanics:

[\begin{matrix} \vec{x}' \\ t' \\ 1 \end{matrix}] = [\begin{matrix} R & \vec{u} & \vec{w} \\ 0 & 1 & e \\ 0 & 0 & 1 \end{matrix}] [\begin{matrix} \vec{x} \\ t \\ 1 \end{matrix}], R \in S O (3), \vec{u}, \vec{w} \in R^{3}, e \in R

(140)

and its central extension given by the Bargmann group:

[\begin{matrix} R & \vec{u} & 0 & \vec{w} \\ 0 & 1 & 0 & e \\ - {\vec{u}}^{t} R & - \frac{{‖ \vec{u} ‖}^{2}}{2} & 1 & f \\ 0 & 0 & 0 & 1 \end{matrix}]

(141)

and the Poincaré group in relativity. We then observe that the affine group or its subgroups are a cornerstone of different disciplines, such as:

In robotics, the special Euclidean group SE(3), which is the homogeneous Galileo group (robotics also considers the group of similitudes SIM(3)):

$[\begin{matrix} Z' \\ 1 \end{matrix}] = [\begin{matrix} Ω & t \\ 0 & 1 \end{matrix}] [\begin{matrix} Z \\ 1 \end{matrix}], \begin{cases} Ω \in S O (3) \\ t \in R^{3} \end{cases}$

(142)
In information geometry, the general affine group A(n,R) is involved for exponential families:

$[\begin{matrix} Z' \\ 1 \end{matrix}] = [\begin{matrix} A & t \\ 0 & 1 \end{matrix}] [\begin{matrix} Z \\ 1 \end{matrix}], \begin{cases} A \in G L (n) \\ t \in R^{n} \end{cases}$

(143)

with the particular case of the Gaussian density, obtained through the Cholesky factorization of the covariance matrix, where the square root of the covariance matrix is a triangular matrix with positive elements on its diagonal (such matrices form a group):

$[\begin{matrix} Y \\ 1 \end{matrix}] = [\begin{matrix} R^{1 / 2} & m \\ 0 & 1 \end{matrix}] [\begin{matrix} X \\ 1 \end{matrix}], \begin{cases} R^{1 / 2} \in T_{n}^{+} \\ (R^{1 / 2} : Cholesky factor of R) \\ m \in R^{n} \end{cases}$

(144)
In the study of homogeneous bounded domains, the simplest one being the Poincaré upper half-plane:

$[\begin{matrix} X' \\ 1 \end{matrix}] = [\begin{matrix} a & b \\ 0 & 1 \end{matrix}] [\begin{matrix} X \\ 1 \end{matrix}], a \in R_{+}^{*} and b \in R$

(145)
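The homogeneous-matrix form shared by Equations (142) to (145) can be made concrete on the smallest case, Equation (145). The sketch below is illustrative only; the helper names (`hom`, `mul`, `act`, `inv`) are hypothetical. It shows that composing two elements of the group amounts to multiplying their 2 × 2 matrices, and that the set of matrices with a > 0 is closed under product and inverse.

```python
def hom(a, b):
    """Homogeneous matrix [[a, b], [0, 1]] of the map X -> a X + b, with a > 0."""
    if a <= 0.0:
        raise ValueError("a must be strictly positive")
    return [[a, b], [0.0, 1.0]]

def mul(M, N):
    """Product of two 2x2 matrices (composition of the affine maps)."""
    return [[sum(M[i][k] * N[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def act(M, x):
    """Apply the affine map encoded by M to the scalar x."""
    return M[0][0] * x + M[0][1]

def inv(M):
    """Inverse element: X -> (X - b) / a."""
    a, b = M[0][0], M[0][1]
    return hom(1.0 / a, -b / a)

g, h = hom(2.0, 1.0), hom(0.5, -3.0)
x = 4.0
composed = act(mul(g, h), x)      # action of the product g h
sequential = act(g, act(h, x))    # h first, then g
identity = mul(g, inv(g))
```

The same pattern extends to Equations (142) to (144) by replacing the scalar a with a rotation, an invertible matrix, or a Cholesky factor.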

As illustrated in Figure 11, Jean-Marie Souriau developed these models in Carthage, Tunisia, and in Marseille, France, during the 1950s and 1960s. Jean-Marie Souriau was motivated by group invariance, not only in physics but also in neuroscience. Souriau’s intuition was highly prescient, because this area of neuroscience was developed a few decades later by Alain Berthoz at the Collège de France (http://public.weconext.eu/academie-sciences/2017-10-03_5a7/video_id_002/index.html) and by Daniel Bennequin (https://www.youtube.com/watch?v=a-ctwxBpJxE) to study the brain’s sense of movement. Souriau’s text contains very interesting remarks on geometry and neuroscience:

“I said to myself, by dint of meeting groups everywhere, that there is something hidden underneath. The metaphysical category of groups that hovers in the empyrean of mathematics, which we discover and adore, must be connected with something closer to us. Listening to many talks by neurophysiologists, I ended up learning the primitive role of the displacement of objects. We know how to manipulate these displacements mentally with great virtuosity. This allows us to manipulate ourselves, to walk, to run, to jump, to catch ourselves when we fall, and so on. This is not true only for us, it is true also for monkeys; they are much more adroit than we are at anticipating the results of a displacement. For some elementary “reading” operations, they are even ten times faster than us. Many neurophysiologists think that there is a special structure genetically inscribed in the brain, the wiring of a group … When there is an earthquake, we witness the death of Space. … We live with our habits, which we think universal. … Neuroscience rarely deals with geometry … For monkeys living in trees, some properties of the Euclidean group are better wired in their brains.” (translated from the French)

Our new research directions will concern the extension of “Le Hasard et la Courbure (Randomness and Curvature)” (title of Yann Ollivier’s HDR), which we have synthesized in the Souriau-Fisher metric, to “Le Hasard et la Torsion (Randomness and Torsion)”, based on Élie Cartan’s works founded on the Cosserat brothers’ model of elasticity [125,126,127,131].

“There is a Cosmology with which general Thermodynamics presents an unmistakable analogy; this Cosmology is Peripatetic Physics … Among the attributes of substance, Peripatetic Physics confers equal importance on the category of quantity and on the category of quality; now, through its numerical symbols, general Thermodynamics likewise represents both the various magnitudes of quantities and the various intensities of qualities. Local motion is, for Aristotle, only one of the forms of general motion, whereas the Cartesian, atomistic and Newtonian Cosmologies agree in that the only possible motion is change of place in space. And here general Thermodynamics treats, in its formulas, a host of modifications such as variations of temperature and changes of electrical state or of magnetization, without seeking in the least to reduce these variations to local motion” (translated from the French)
—Pierre Duhem—La théorie Physique: son objet, sa structure [132].

“For the theory of knowledge, but also for the sciences, the notion of perspective is fundamental. Now, the experience gained in algebraic geometry, in number theory, and in abstract algebra leads me to attempt a mathematical formulation of this notion, so as to overcome geometry by means of reasoning of geometric origin. It seems to me indeed that the tendency towards abstraction observed in today’s mathematics, far from being the enemy of intuition, has the deep meaning of leaving intuition in order to have it reborn in an alliance between the “spirit of geometry” and the “spirit of finesse”, an alliance made possible by the enormous reserves of pure mathematics that Pascal and Goethe could not yet suspect” (translated from the French)
—Erich Kähler—Sur la théorie des corps purement algébriques, 1952.

Funding

This research received no external funding.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Günther’s Polysymplectic Model

In this appendix, we recall a synthesis of Christian Günther’s poly-symplectic model, using his original notation [48].

We set:

\begin{array}{l} Q : space of field values \\ φ : U \to Q \end{array}

(A1)

We consider the bundle of linear maps from Rⁿ into the tangent spaces of Q:

I^{n} Q ≅ H o m (R^{n}, T Q) ≅ T Q \otimes R^{n *}

(A2)

If a basis of Rⁿ is chosen, elements of this bundle can also be interpreted as n-tuples of tangent vectors of Q, and there is the isomorphism:

I^{n} Q ≅ \oplus_{1}^{n} T Q

(A3)

The natural projection is given by:

τ_{Q}^{n} : I^{n} Q \to Q

(A4)

In analogy to the canonical forms on the cotangent bundle, the cojet space

H o m (R^{n}, T Q)

carries a natural Rⁿ-valued:

one-form: $Θ_{0}$ (canonical one-form)
two-form: $Ω_{0} = - d Θ_{0}$ closed & non-degenerate (canonical polysymplectic form)

In the natural bundle coordinates the canonical forms on

H o m (R^{n}, T Q)

have the local representation:

Θ_{0} = \sum_{i = 1}^{n} p_{i} d q \otimes \frac{\partial}{\partial x_{i}}

(A5)

Ω_{0} = \sum_{i = 1}^{n} d q \land d p_{i} \otimes \frac{\partial}{\partial x_{i}}

(A6)

The map induced by a diffeomorphism f leaves the canonical one-form and two-form invariant:

\begin{array}{l} f : Q \to Q and I^{n *} f : H o m (T Q, R^{n}) \to H o m (T Q, R^{n}) \\ {(I^{n *} f)}^{*} Θ_{0} = Θ_{0} and {(I^{n *} f)}^{*} Ω_{0} = Ω_{0} \end{array}

(A7)

Definition A1.

A closed nondegenerate Rⁿ-valued two-form Ω on a manifold M is called a polysymplectic form. The pair (M, Ω) is a polysymplectic manifold.

The classification of linear polysymplectic forms is not trivial, because two polysymplectic forms are not necessarily locally equivalent.

Definition A2.

A polysymplectic form Ω on a manifold M is called a standard form iff M has an atlas of canonical charts for Ω, i.e., charts in which locally Ω is written as the canonical evaluation form on Q x Lin (Q,Rⁿ). (M, Ω) is called a standard polysymplectic manifold.

The polysymplectic structure provides the procedure which assigns to a function on M, the Hamiltonian, its associated Hamiltonian equations. Let (M, Ω) be a polysymplectic manifold:

\begin{cases} Ω^{b} : T M \to H o m (T M, R^{n}) \\ w_{m} \mapsto Ω_{(v_{m})}^{b} (w_{m}) = Ω (v_{m}, w_{m}) \end{cases} and \begin{cases} Ω^{#} : H o m (T M, R^{n}) \to T^{*} M \\ X_{m} \mapsto Ω^{#} (X_{m}) = t r (Ω^{b} \circ X_{m}) \\ with t r (Ω^{b} \circ X_{m}) . v_{m} = - t r (Ω^{b} (v_{m}) \circ X_{m}) \end{cases}

(A8)

An affine subbundle of

H o m (R^{n}, T Q)

is defined by:

Ω^{# - 1} (d H) = \{X_{m} \in H o m (R^{n}, T Q) \mid Ω^{#} (X_{m}) = d H (m)\}

(A9)

Definition A3.

Ω^{# - 1} (d H)

is called the system of Hamiltonian partial differential equations associated with the Hamiltonian function H. A smooth map

ψ : U \to M

is a solution of

Ω^{# - 1} (d H)

iff:

T_{u} ψ \in Ω^{# - 1} (d H (ψ (u))) \forall u \in U

(A10)

Theorem A1.

Let (M, Ω) be a standard polysymplectic manifold, (p,q) canonical coordinates for Ω on M, and H a Hamiltonian function. A smooth map

ψ : U \to M

is a solution of

Ω^{# - 1} (d H)

iff in canonical coordinates:

t r d p (u) = - \frac{\partial H}{\partial q} (ψ (u)) and D q (u) = \frac{\partial H}{\partial p} (ψ (u))

(A11)

If a basis

e_{1}, \dots, e_{n}

of Rⁿ is chosen and

p (u) = (p_{1} (u), \dots, p_{n} (u))

with respect to this basis, then the equations take the form:

\sum_{i = 1}^{n} \frac{\partial p_{i}}{\partial x_{i}} (u) = - \frac{\partial H}{\partial q} (ψ (u)) and \frac{\partial q}{\partial x_{i}} (u) = \frac{\partial H}{\partial p_{i}} (ψ (u))

(A12)

Proof.

\begin{array}{l} X (ψ (u)) = D ψ (u) \in L i n (R^{n}, T_{ψ (u)} M) \\ X (m) = X_{q} (m) + X_{p} (m), X_{q} (m) \in L i n (R^{n}, Q), X_{p} (m) \in L i n (R^{n}, L i n (Q, R^{n})) \\ v (m) = \dot{q} (m) + \dot{p} (m), \dot{q} (m) \in Q, \dot{p} (m) \in L i n (Q, R^{n}) \end{array}

(A13)

\begin{array}{l} Ω^{#} (X) . v = t r Ω^{b} \circ X (v) = - t r Ω^{b} (v) \circ X \\ Ω^{b} (\dot{q}, \dot{p}) . (\dot{q}, \dot{p}) = \dot{p} (\dot{q}), (\dot{q}, \dot{p}) \in T M \\ Ω^{#} (X) . (\dot{q}, \dot{p}) = - t r (X_{p} (\dot{q}) - \dot{p} \circ X_{q}) = d H (\dot{q}, \dot{p}) \\ d H = \frac{\partial H}{\partial q} d q + \sum_{i = 1}^{n} \frac{\partial H}{\partial p_{i}} d p_{i} \Rightarrow - t r X_{p} = \frac{\partial H}{\partial q}, \frac{\partial H}{\partial p} = X_{q} \end{array}

(A14)

□

Example A1.

Consider a scalar field where

n = 4, Q = R and M = R \times R^{4}

with scalar coordinates

(q, p_{1}, \dots, p_{4})

Let

H (q, p_{1}, \dots, p_{4}) = \frac{1}{2} \sum_{i = 1}^{4} p_{i}^{2} + m q^{2}

be a Hamiltonian on M. The canonical polysymplectic form Ω is given by:

Ω = \sum_{i = 1}^{4} d q \land d p_{i} \otimes \frac{\partial}{\partial x_{i}}

(A15)

The Hamiltonian equations for a scalar field:

ψ (x_{1}, \dots, x_{4}) = (q (x_{1}, \dots, x_{4}), p_{1} (x_{1}, \dots, x_{4}), \dots, p_{4} (x_{1}, \dots, x_{4}))

(A16)

are:

\sum_{i = 1}^{4} \frac{\partial p_{i}}{\partial x_{i}} = m q and \frac{\partial q}{\partial x_{i}} = p_{i}

(A17)

Definition A4.

Let (M, Ω) be a polysymplectic manifold,

Ω^{#} (X) = d H

,

H

is called a momentum tensor iff

t r d H = d H

(A18)

Proposition A1.

X \neg Θ_{0} = 0, d (t r L_{X} Θ_{0}) = 0 and t r L_{X} Θ_{0} = - d (H - t r (X \neg Θ_{0}))

(A19)

Proof.

\begin{array}{l} Θ_{0} = \sum_{i} p_{i} d q \otimes \frac{\partial}{\partial x_{i}} and X = X_{q} \frac{\partial}{\partial q} + \sum_{i} X_{p_{i}} \frac{\partial}{\partial p_{i}} \\ \Rightarrow X \neg Θ_{0} = \sum_{i} p_{i} X_{q} \otimes \frac{\partial}{\partial x_{i}} \end{array}

(A20)

\begin{array}{l} t r L_{X} Θ_{0} = t r (d X \neg Θ_{0} + X \neg d Θ_{0}) \\ t r (d X \neg Θ_{0} + X \neg d Θ_{0}) = - d H + t r d X \neg Θ_{0} \end{array}

(A21)

□

Souriau’s classification of symplectic homogeneous spaces by coadjoint orbits belongs to the major achievements in Hamiltonian mechanics. C. Günther has extended these results to polysymplectic manifolds. Let

A d : G \times L G \to L G

be the adjoint action. We denote by

A d^{n}

the induced action on

L i n (R^{n}, L G)

:

\begin{array}{l} A d^{n} : & G \times L i n (R^{n}, L G) \to L i n (R^{n}, L G) \\ A d_{g}^{n} (f) (x) = A d_{g} (f (x)), f \in L i n (R^{n}, L G), x \in R^{n}, g \in G \end{array}

(A22)

The dual of

A d^{n}

is denoted by

A d^{#}

:

A d^{#} : G \times L G^{*} \otimes R^{n} \to L G^{*} \otimes R^{n}, A d_{g}^{#} (α) = α \circ A d_{g}^{n}

(A23)

λ (A d_{g} u) = Λ_{g}^{*} (λ (u)) \Rightarrow Λ_{g}^{*} λ^{n} (f) = λ^{n} (A d_{g}^{n} f) for all g \in G, f \in L i n (R^{n}, L G)

(A24)

Proposition A2 [Günther Proposition].

Let

Λ : G \times M \to M

be a strongly polysymplectic group action with momentum map

μ : M \to L i n (L G, R^{n}) = L G^{*} \otimes R^{n}

. Assume

M

is connected. Then the map:

\begin{array}{l} M \to L G^{*} \otimes R^{n} \\ m \mapsto μ (Λ_{g} m) - A d_{g}^{#} (μ (m)) \end{array}

(A25)

is constant on

M

for all

g \in G

.

Corollary A1.

There is a smooth map

χ

:

χ : G \to L G^{*} \otimes R^{n}, χ (g) = μ (Λ_{g} m) - A d_{g}^{#} (μ (m))

(A26)

with the following properties:

$χ$ is a 1-cocycle, i.e., for all $g, h \in G$:

$χ (g h) = A d_{h}^{#} (χ (g)) + χ (h)$

(A27)
The bilinear map $φ$ on $L G$, $φ : = L_{χ} : L G \to L G^{*} \otimes R^{n}, φ : L G \times L G \to R^{n}$, is a 2-cocycle:

$φ (u, [v, w]) + φ (v, [w, u]) + φ (w, [u, v]) = 0, \forall u, v, w \in L G$

(A28)

Proof.

\begin{array}{l} χ (h g) = μ \circ Λ_{h g} (m) - A d_{h g}^{#} μ (m) \\ χ (h g) = μ \circ Λ_{g} (Λ_{h} m) - A d_{g}^{#} \circ μ (Λ_{h} m) + A d_{g}^{#} \circ μ (Λ_{h} m) - A d_{g}^{#} A d_{h}^{#} \circ μ (m) \\ χ (h g) = χ (g) + A d_{g}^{#} (χ (h)) \end{array}

(A29)

□

Theorem A2. [Günther Theorem (Vector-valued extension of Souriau Theorem)]

Let

Λ : G \times M \to M

be a polysymplectic action with momentum map

μ : M \to L G^{*} \otimes R^{n}

. Then the map:

\begin{array}{l} Ξ : G \times L G^{*} \otimes R^{n} \to G \times L G^{*} \otimes R^{n} \\ Ξ (g, η) = A d_{g}^{#} η + χ (g) \end{array}

(A30)

is an affine operation of

G

on

L G^{*} \otimes R^{n}

, and commutes for all

g \in G

and

μ

is G-equivariant.

Proof.

\begin{array}{l} Ξ (g h, η) = χ (g h) + A d_{g h}^{#} η = χ (h) + χ (g) \circ A d_{h} + A d_{h}^{#} \circ A d_{g}^{#} η \\ Ξ (g h, η) = χ (h) + A d_{h}^{#} (χ (g) + A d_{g}^{#} η) = Ξ (h, Ξ (g, η)) \end{array}

(A31)

Ξ

is an action.

\begin{array}{l} Ξ_{g} \circ μ (m) = χ (g) + A d_{g}^{#} \circ μ (m) \\ Ξ_{g} \circ μ (m) = μ (Λ_{g} m) - A d_{g}^{#} (μ (m)) + A d_{g}^{#} μ (m) = μ \circ Λ_{g} (m) \end{array}

(A32)

□

Christian Günther in a never found 1987 paper wrote that “The mathematical framework developed in this paper is used in a separate publication to provide a rigorous foundation for field theory”. For a more recent study of Günther’s poly-symplectic model, we make reference to [133].

Appendix B. Fisher Metric for Multivariate Gaussian Density

In the following, we illustrate information geometry for the multivariate Gaussian density:

p_{\hat{ξ}} (ξ) = \frac{1}{{(2 π)}^{n / 2} \det {(R)}^{1 / 2}} e^{- \frac{1}{2} {(z - m)}^{T} R^{- 1} (z - m)}

(A33)

If we develop:

\begin{array}{l} \frac{1}{2} {(z - m)}^{T} R^{- 1} (z - m) & = \frac{1}{2} [z^{T} R^{- 1} z - m^{T} R^{- 1} z - z^{T} R^{- 1} m + m^{T} R^{- 1} m] \\ = \frac{1}{2} z^{T} R^{- 1} z - m^{T} R^{- 1} z + \frac{1}{2} m^{T} R^{- 1} m \end{array}

(A34)

We can write the density as a Gibbs density:

\begin{array}{l} p_{\hat{ξ}} (ξ) = \frac{1}{{(2 π)}^{n / 2} \det {(R)}^{1 / 2} e^{\frac{1}{2} m^{T} R^{- 1} m}} e^{- [- m^{T} R^{- 1} z + \frac{1}{2} z^{T} R^{- 1} z]} = \frac{1}{Z} e^{- 〈 ξ, β 〉} \\ ξ = [\begin{matrix} z \\ z z^{T} \end{matrix}] and β = [\begin{matrix} - R^{- 1} m \\ \frac{1}{2} R^{- 1} \end{matrix}] = [\begin{matrix} a \\ H \end{matrix}] with 〈 ξ, β 〉 = a^{T} z + z^{T} H z = T r [z a^{T} + H^{T} z z^{T}] \end{array}

(A35)

We can then rewrite density with canonical variables:

\begin{array}{l} p_{\hat{ξ}} (ξ) = \frac{1}{\int_{Ω^{*}} e^{- 〈 ξ, β 〉} . d ξ} e^{- 〈 ξ, β 〉} = \frac{1}{Z} e^{- 〈 ξ, β 〉} with \log (Z) = \frac{n}{2} \log (2 π) + \frac{1}{2} \log \det (R) + \frac{1}{2} m^{T} R^{- 1} m \\ ξ = [\begin{matrix} z \\ z z^{T} \end{matrix}], \hat{ξ} = [\begin{matrix} E [z] \\ E [z z^{T}] \end{matrix}] = [\begin{matrix} m \\ R + m m^{T} \end{matrix}], β = [\begin{matrix} a \\ H \end{matrix}] = [\begin{matrix} - R^{- 1} m \\ \frac{1}{2} R^{- 1} \end{matrix}] with 〈 ξ, β 〉 = T r [z a^{T} + H^{T} z z^{T}] \\ R = E [(z - m) {(z - m)}^{T}] = E [z z^{T} - m z^{T} - z m^{T} + m m^{T}] = E [z z^{T}] - m m^{T} \end{array}

(A36)

The first potential function (free energy/logarithm of characteristic function) is given by:

ψ_{Ω} (β) = \int_{Ω^{*}} e^{- 〈 ξ, β 〉} . d ξ and Φ (β) = - \log ψ_{Ω} (β) = \frac{1}{2} [- \frac{1}{2} T r [H^{- 1} a a^{T}] + \log [{(2)}^{n} \det H] - n \log (2 π)]

(A37)

We verify the relation between the first potential function and moment:

\begin{array}{l} \frac{\partial Φ (β)}{\partial β} = \frac{\partial [- \log ψ_{Ω} (β)]}{\partial β} = \int_{Ω^{*}} ξ \frac{e^{- 〈 ξ, β 〉}}{\int_{Ω^{*}} e^{- 〈 ξ, β 〉} . d ξ} . d ξ = \int_{Ω^{*}} ξ . p_{\hat{ξ}} (ξ) . d ξ = \hat{ξ} \\ \frac{\partial Φ (β)}{\partial β} = [\begin{matrix} \frac{\partial Φ (β)}{\partial a} \\ \frac{\partial Φ (β)}{\partial H} \end{matrix}] = [\begin{matrix} m \\ R + m m^{T} \end{matrix}] = \hat{ξ} \end{array}

(A38)
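Relation (A38) can be checked numerically in dimension one. The following sketch is an illustration under assumptions that are not in the text: n = 1, the integral defining ψ_Ω is truncated to a finite interval and evaluated with a midpoint rule, and the derivatives of Φ are approximated by central finite differences. The function name `phi` and the test values of m and R are hypothetical.

```python
import math

def phi(a, H, lo=-15.0, hi=15.0, steps=30000):
    """Phi(beta) = -log of the integral of exp(-(a z + H z^2)) dz, for n = 1."""
    h = (hi - lo) / steps
    Z = h * sum(math.exp(-(a * z + H * z * z))
                for z in (lo + (i + 0.5) * h for i in range(steps)))
    return -math.log(Z)

# Mean and variance chosen for the example, and the matching canonical
# parameters a = -R^{-1} m and H = R^{-1} / 2 from (A35).
m, R = 0.3, 0.8
a, H = -m / R, 0.5 / R

eps = 1e-4
dphi_da = (phi(a + eps, H) - phi(a - eps, H)) / (2 * eps)   # should approach m
dphi_dH = (phi(a, H + eps) - phi(a, H - eps)) / (2 * eps)   # should approach R + m^2
```

Because Φ is obtained here by direct integration and not from a closed form, the check does not depend on the constant terms of the potential.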

The second potential function (Shannon entropy) is given as a Legendre transform of the first one:

\begin{array}{l} S (\hat{ξ}) = 〈 \hat{ξ}, β 〉 - Φ (β) with \frac{\partial Φ (β)}{\partial β} = \hat{ξ} and \frac{\partial S (\hat{ξ})}{\partial \hat{ξ}} = β \\ S (\hat{ξ}) = - \int_{Ω^{*}} \frac{e^{- 〈 ξ, β 〉}}{\int_{Ω^{*}} e^{- 〈 ξ, β 〉} . d ξ} \log \frac{e^{- 〈 ξ, β 〉}}{\int_{Ω^{*}} e^{- 〈 ξ, β 〉} . d ξ} . d ξ = - \int_{Ω^{*}} p_{\hat{ξ}} (ξ) \log p_{\hat{ξ}} (ξ) . d ξ \end{array}

(A39)

S (\hat{ξ}) = - \int_{Ω^{*}} p_{\hat{ξ}} (ξ) \log p_{\hat{ξ}} (ξ) . d ξ = \frac{1}{2} [\log [{(2)}^{- n} \det [H^{- 1}]] + n \log (2 π . e)] = \frac{1}{2} [\log \det [R] + n \log (2 π . e)]

(A40)
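The equality between the Legendre transform (A39) and the Shannon entropy (A40) can also be verified in dimension one. This is a minimal sketch under the same kind of assumptions as any quadrature check (n = 1, truncated domain, midpoint rule); the function name `gaussian_entropies` is hypothetical.

```python
import math

def gaussian_entropies(m, R, lo=-15.0, hi=15.0, steps=30000):
    """Return the entropy of a 1-D Gaussian computed in three ways.

    legendre: <xi_hat, beta> - Phi(beta), first line of (A39)
    shannon:  minus the integral of p log p, second line of (A39)
    closed:   (1/2) [log R + log(2 pi e)], last expression of (A40) with n = 1
    """
    a, H = -m / R, 0.5 / R                       # canonical parameters (A35)
    h = (hi - lo) / steps
    zs = [lo + (i + 0.5) * h for i in range(steps)]
    w = [math.exp(-(a * z + H * z * z)) for z in zs]
    Z = h * sum(w)
    phi = -math.log(Z)
    legendre = a * m + H * (R + m * m) - phi
    shannon = -h * sum((wi / Z) * math.log(wi / Z) for wi in w if wi > 0.0)
    closed = 0.5 * (math.log(R) + math.log(2 * math.pi * math.e))
    return legendre, shannon, closed

legendre, shannon, closed = gaussian_entropies(0.3, 0.8)
```

The three values agree up to quadrature error, and none of them depends on the mean m, as expected for a Gaussian density.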

This remark was made by Jean-Marie Souriau in his book as early as 1969. He observed, as illustrated in Figure A1, that if we take a vector with tensor components

ξ = (\begin{matrix} z \\ z \otimes z \end{matrix})

, components of

\hat{ξ}

will provide moments of the first and second order of the density of probability

p_{\hat{ξ}} (ξ)

. He used this change of variable

z' = H^{1 / 2} z + H^{- 1 / 2} a

, to compute the logarithm of the characteristic function

Φ (β)

(see Figure A1 extracted from Souriau Book):

Figure A1. Introduction of potential function for multivariate Gaussian law in Souriau book.

Appendix C. Geometric Definition of Legendre Transform by Chasles as Reciprocal Polar with Respect to a Paraboloid

The Legendre transform plays a central role related to duality and convexity. Adrien-Marie Legendre [102] introduced the Legendre transform to solve a minimal surface problem posed by Monge (Monge asked him to consolidate his proof), with a link to Poncelet duality [103]. Chasles and Darboux interpreted the Legendre transform as a reciprocal polar with respect to a paraboloid (re-used by Hadamard and Fréchet in the calculus of variations). Before Legendre, Alexis Clairaut introduced the Clairaut equation, which was developed by Maurice Fréchet to characterize «distinguished densities» (densities with parameters whose covariance matrix reaches the Fréchet-Cramér-Rao bound) [9].

The Legendre transform converts a function defined by its values at points into a function defined by its tangents, as illustrated in Figure A2.

Figure A2. Legendre Transform and duality. (a) Classical Geometry; (b) Plücker Geometry; (c) Legendre Transform.

Darboux gave in his book an interpretation due to Chasles: “which amounts, following a remark of Mr. Chasles, to substituting for the surface its reciprocal polar with respect to a paraboloid” (translated from the French). In the lectures «Leçons sur le calcul des variations», Hadamard, followed by Vessiot, used the reciprocal polar of the figurative and the figuratrice. This has also been developed by Belgodère, as presented by Cartan, in «Extrémale d’une surface» [134,135]. Polarity in the plane is a transformation taking points to lines and, dually, lines to points. A polarity preserves incidence and has degree 2. For a point P (the pole), a conic polarity transforms it into its image, a line p (the polar), as follows: from P we draw the two tangents to the conic, which touch it at the points Q and R. Connecting Q and R with a line p, we obtain the polar line of the pole P. A self-conjugate point Q is incident with its polar q; that is, Q lies on q.

Geometric interpretation of the Legendre transform by reciprocal polar with respect to a paraboloid is given by the following simple development. First, let’s consider the surface:

z = f (x, y) with p = \frac{\partial z}{\partial x} and q = \frac{\partial z}{\partial y}

(A41)

We consider the equation of the paraboloid:

x^{2} + y^{2} = 2 z

(A42)

The reciprocal polar with respect to the paraboloid has coordinates:

X, Y, Z

The polar plane of this point with respect to the paraboloid

X x + Y y - z - Z = 0

should coincide with the tangent plane of the surface at the point

(x_{0}, y_{0}, z_{0})

:

z - z_{0} = p_{0} (x - x_{0}) + q_{0} (y - y_{0}) \Rightarrow p_{0} x + q_{0} y - z - (p_{0} x_{0} + q_{0} y_{0} - z_{0}) = 0

(A43)

This equality provides:

X = p_{0}, Y = q_{0}, Z = p_{0} x_{0} + q_{0} y_{0} - z_{0}

(A44)

This is the Legendre transform. So in classical thermodynamics, the Legendre transform

S (Q) = 〈 β, Q 〉 - Φ (β)

is linked with the reciprocal polar with respect to the paraboloid:

Q^{2} = 2 S (Q)

(A45)
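The construction (A41)-(A44) can be checked numerically. The following minimal sketch assumes an illustrative convex surface `f` (not taken from the text) and verifies that the polar plane of the point (X, Y, Z) with respect to the paraboloid (A42) coincides with the tangent plane of the surface. The function names are hypothetical.

```python
# Illustrative convex surface z = f(x, y); this particular f is an assumption.
def f(x, y):
    return x**2 + x * y + 2.0 * y**2


def grad_f(x, y):
    """Return (p, q) = (dz/dx, dz/dy) for the surface above."""
    return 2.0 * x + y, x + 4.0 * y


def legendre_point(x0, y0):
    """Return (X, Y, Z) of (A44) at the point (x0, y0, f(x0, y0))."""
    p0, q0 = grad_f(x0, y0)
    return p0, q0, p0 * x0 + q0 * y0 - f(x0, y0)


def tangent_plane_z(x, y, x0, y0):
    """Height of the tangent plane at (x0, y0), evaluated at (x, y)."""
    p0, q0 = grad_f(x0, y0)
    return f(x0, y0) + p0 * (x - x0) + q0 * (y - y0)


x0, y0 = 0.7, -0.3
X, Y, Z = legendre_point(x0, y0)

# Every point of the tangent plane must satisfy the polar-plane equation
# X x + Y y - z - Z = 0; three non-collinear points are enough to check it.
residuals = [
    X * x + Y * y - tangent_plane_z(x, y, x0, y0) - Z
    for (x, y) in [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
]
```

The residuals vanish up to rounding, which is the statement that the two planes are the same.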

We can develop other properties of the Legendre transform. Let

z = f (x, y) with p = \frac{\partial z}{\partial x} and q = \frac{\partial z}{\partial y}

and

X = p, Y = q, Z = p x + q y - z

be its Legendre transform.

We compute the first derivative of Z:

d Z = P d X + Q d Y with P = \frac{\partial Z}{\partial X} and Q = \frac{\partial Z}{\partial Y}

(A46)

Z = p x + q y - z \Rightarrow d Z = p d x + q d y - d z + x d p + y d q \underset{\begin{array}{l} d z = p d x + q d y \\ X = p, Y = q \end{array}}{\Rightarrow} d Z = x d X + y d Y \Rightarrow P = x, Q = y

(A47)

We compute the 2nd derivative of Z:

R = \frac{\partial^{2} Z}{\partial X^{2}} = \frac{\partial P}{\partial X} = \frac{\partial x}{\partial X}, S = \frac{\partial^{2} Z}{\partial X \partial Y} = \frac{\partial P}{\partial Y} = \frac{\partial Q}{\partial X} = \frac{\partial x}{\partial Y} = \frac{\partial y}{\partial X}, T = \frac{\partial^{2} Z}{\partial Y^{2}} = \frac{\partial Q}{\partial Y} = \frac{\partial y}{\partial Y}

(A48)

\begin{array}{l} \begin{cases} d X = r d x + s d y \\ d Y = s d x + t d y \end{cases} \text{ with } r = \frac{\partial^{2} z}{\partial x^{2}}, t = \frac{\partial^{2} z}{\partial y^{2}}, s = \frac{\partial^{2} z}{\partial x \partial y} \\ \Rightarrow \begin{cases} d x = \frac{t}{r t - s^{2}} d X - \frac{s}{r t - s^{2}} d Y \\ d y = \frac{- s}{r t - s^{2}} d X + \frac{r}{r t - s^{2}} d Y \end{cases} \\ \Rightarrow \begin{cases} R = \frac{\partial x}{\partial X} = \frac{t}{r t - s^{2}} \\ S = \frac{\partial x}{\partial Y} = \frac{- s}{r t - s^{2}} \\ T = \frac{\partial y}{\partial Y} = \frac{r}{r t - s^{2}} \end{cases} \Rightarrow \begin{cases} r = \frac{T}{R T - S^{2}} \\ s = \frac{- S}{R T - S^{2}} \\ t = \frac{R}{R T - S^{2}} \end{cases} \end{array}

(A49)
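Relation (A49) states that the matrix of second derivatives of Z is the inverse of the matrix of second derivatives of z. A minimal sketch of this check follows; the numerical values and the function name are illustrative assumptions, and S denotes the mixed derivative as in (A49).

```python
def legendre_hessian(r, s, t):
    """Second derivatives (R, S, T) of Z from (r, s, t) of z, following (A49)."""
    det = r * t - s * s
    if det == 0.0:
        raise ValueError("r t - s^2 = 0: the Legendre transform is degenerate")
    return t / det, -s / det, r / det


r, s, t = 2.0, 1.0, 4.0          # hypothetical second derivatives of z
R, S, T = legendre_hessian(r, s, t)

# Product of [[r, s], [s, t]] with [[R, S], [S, T]]: should be the identity.
product = [
    [r * R + s * S, r * S + s * T],
    [s * R + t * S, s * S + t * T],
]

# Applying the same map twice returns the original values (involution).
r_back, s_back, t_back = legendre_hessian(R, S, T)
```

The degenerate case `r t - s^2 = 0` is rejected explicitly, since the Legendre transform is not defined there.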

The link with contact transformations is then the following. Consider new variables X, Y, Z, with P and Q the derivatives of Z with respect to X and Y. The problem of finding in which cases these five quantities can be expressed as functions of x, y, z, p and q is the same as the problem of finding five functions X, Y, Z, P and Q of the five independent variables x, y, z, p and q satisfying the differential equation:

d Z - P d X - Q d Y = ρ (d z - p d x - q d y)

(A50)

where

ρ

is a function of x,y,z,p and q.

Proof.

\begin{cases} p = \frac{\partial z}{\partial x} \\ q = \frac{\partial z}{\partial y} \end{cases} \Rightarrow d z - p d x - q d y = 0 \Rightarrow d Z = P d X + Q d Y \Rightarrow \begin{cases} P = \frac{\partial Z}{\partial X} \\ Q = \frac{\partial Z}{\partial Y} \end{cases}

(A51)

and the reciprocal:

ρ = \frac{\partial Z}{\partial z} - P \frac{\partial X}{\partial z} - Q \frac{\partial Y}{\partial z}

(A52)

The link with the Ampère transformation is given by the following developments. Consider the Ampère transformation:

\begin{array}{l} d z - p d x - q d y = d (z - q y) - p d x + y d q \\ \text{Set } \begin{cases} Z = z - q y, X = x, Y = q \\ P = p, Q = - y \end{cases} \Rightarrow d Z - P d X - Q d Y = d z - p d x - q d y \end{array}

(A53)

Then

ρ = 1

, and we have a contact transformation, which remains valid when the Legendre transform no longer is (when

r t - s^{2} = 0

, p and q are not independent).

The link between the Legendre transformation and the Ampère transformation is then deduced. The Legendre transform is obtained from the same equality:

\begin{array}{l} d z - p d x - q d y = d (z - q y) - p d x + y d q \\ \text{Set } \begin{cases} Z = z - q y, X = x, Y = q \\ P = p, Q = - y \end{cases} \Rightarrow d Z - P d X - Q d Y = d z - p d x - q d y \end{array}

(A54)

We can set:

\begin{array}{l} X = p, Y = q, Z = z - p x - q y \\ P = - x, Q = - y \end{array}

(A55)

□
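The contact condition (A50) can be tested numerically on random points and increments. The sketch below is an illustration under stated assumptions: it evaluates both sides of (A50) for the Ampère transformation (A53), and for the Legendre transform written with the sign convention of (A44), Z = p x + q y - z, P = x, Q = y, for which the factor is ρ = -1. The helper names are hypothetical.

```python
import random


def base_form(pt, d):
    """Value of dz - p dx - q dy on the increment d = (dx, dy, dz, dp, dq)."""
    x, y, z, p, q = pt
    dx, dy, dz, dp, dq = d
    return dz - p * dx - q * dy


def ampere_form(pt, d):
    """dZ - P dX - Q dY for Z = z - q y, X = x, Y = q, P = p, Q = -y."""
    x, y, z, p, q = pt
    dx, dy, dz, dp, dq = d
    dZ = dz - q * dy - y * dq
    dX, dY = dx, dq
    P, Q = p, -y
    return dZ - P * dX - Q * dY


def legendre_form(pt, d):
    """dZ - P dX - Q dY for Z = p x + q y - z, X = p, Y = q, P = x, Q = y."""
    x, y, z, p, q = pt
    dx, dy, dz, dp, dq = d
    dZ = p * dx + x * dp + q * dy + y * dq - dz
    dX, dY = dp, dq
    P, Q = x, y
    return dZ - P * dX - Q * dY


rng = random.Random(0)
samples = [
    ([rng.uniform(-1, 1) for _ in range(5)], [rng.uniform(-1, 1) for _ in range(5)])
    for _ in range(100)
]

# Ampere: rho = +1.  Legendre in the (A44) convention: rho = -1.
ampere_ok = all(abs(ampere_form(pt, d) - base_form(pt, d)) < 1e-12 for pt, d in samples)
legendre_ok = all(abs(legendre_form(pt, d) + base_form(pt, d)) < 1e-12 for pt, d in samples)
```

Both transformations satisfy (A50) with a constant factor, so both are contact transformations; only the sign of ρ depends on the convention chosen for Z.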

For complementary studies on the Legendre transform, we can make reference to [99,101].

Appendix D. Centrifuge Thermodynamics by Roger Balian Based on Classical Approach

Balian has studied the case of gas enclosed in a vessel rotating with an angular velocity

ω

in thermal equilibrium, and proved that the density of the gas is proportional to

e^{\frac{m ω^{2} r^{2}}{2 k T}}

, with classical approach. The density is increased at the periphery due to centrifugal effects.

Balian computed the Boltzmann-Gibbs distribution without knowing the Souriau equations (exercise 7b of [25]). Balian started by considering the constants of motion that are the energy and the component

J_{z}

of the total angular momentum

J = \sum_{i} (r_{i} \times p_{i})

. Balian observed that he must add to the Lagrangian parameter, given by (Planck) temperature

β

for energy, an additional one associated with

J_{z}

. He identifies this additional multiplier with

- β ω

by evaluating the mean velocity at each point. He then derived the same results by changing the frame of reference, writing the Lagrangian and the Hamiltonian in the rotating frame and the canonical equilibrium in that frame. He used the resulting distribution to find, by integrating over the momenta, an expression for the particle density as a function of the distance from the cylinder axis. The fluid carried along by the walls of the rotating vessel acquires a non-vanishing average angular momentum

〈 J_{z} 〉

around the axis of rotation, that is a constant of motion. In order to be able to assign to it a definite value, Balian proposed to associate with it a Lagrangian multiplier

λ

, in exactly the same way as we classically associate the multiplier

β

with the energy in canonical equilibrium. The average

〈 J_{z} 〉

will be a function of

λ

. The Gibbs density for rotating gas is given by Balian as:

D = \frac{1}{Z} e^{- β H - λ J_{z}} = \frac{1}{Z} \exp \left\{ - \sum_{i} \left[ \frac{β p_{i}^{2}}{2 m} + λ (x_{i} p_{y_{i}} - y_{i} p_{x_{i}}) \right] \right\}

(A56)

With the energy and the average angular momentum given by:

U = - \frac{\partial \ln Z}{\partial β} with β = \frac{1}{k T} and 〈 J_{z} 〉 = - \frac{\partial \ln Z}{\partial λ}

(A57)

The Lagrangian parameter

λ

has a mechanical nature. To identify this parameter, Balian compared the microscopic and macroscopic descriptions of fluid mechanics. He described the single-particle reduced density by:

\begin{array}{ll} f (r, p) & \propto \exp \left\{ - \frac{β p^{2}}{2 m} - λ (x p_{y} - y p_{x}) \right\} \\ & = \exp \left\{ - \frac{β}{2 m} {\left( p + \frac{m}{β} [λ \times r] \right)}^{2} + \frac{m λ^{2}}{2 β} (x^{2} + y^{2}) \right\} \end{array}

(A58)

Whence Balian finds the velocity distribution at a point r to be proportional to:

\exp \left\{ - \frac{m}{2 k T} {\left( v + \frac{1}{β} [λ \times r] \right)}^{2} \right\}

(A59)

The mean velocity of the fluid at the point r is equal to:

〈 v 〉 = - \frac{1}{β} [λ \times r]

(A60)

and can be identified with the velocity

[ω \times r]

in a uniform rotation with angular velocity

ω

. By comparison, Balian put

ω = - \frac{λ}{β}

. Balian made the remarks that “The angular momentum is imparted to the gas when the molecules collide with the rotating walls, which changes the Maxwell distribution at every point, shifting its origin. The walls play the role of an angular momentum reservoir. Their motion is characterized by a certain angular velocity, and the angular velocities

ω

of the fluid and of the walls become equal at equilibrium, exactly like the equalization of the temperature through energy exchanges”.
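The algebra behind (A58)-(A60) can be verified numerically. This minimal sketch assumes arbitrary illustrative values (units with k = 1) and hypothetical helper names; it checks the completion of the square in the momentum, the identification of the mean velocity with a rigid rotation, and the resulting radial density profile.

```python
import math


def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def exponent_direct(r, p, beta, lam, m):
    """First form of the exponent in (A58)."""
    x, y, _ = r
    p2 = sum(c * c for c in p)
    return -beta * p2 / (2.0 * m) - lam * (x * p[1] - y * p[0])


def exponent_completed(r, p, beta, lam, m):
    """Second form of (A58), after completing the square in p."""
    x, y, _ = r
    lam_cross_r = cross((0.0, 0.0, lam), r)
    shifted = [pc + (m / beta) * c for pc, c in zip(p, lam_cross_r)]
    s2 = sum(c * c for c in shifted)
    return -beta * s2 / (2.0 * m) + m * lam**2 / (2.0 * beta) * (x * x + y * y)


def density_ratio(radius, omega, m, kT):
    """n(radius) / n(0) for the rotating gas: exp(m omega^2 radius^2 / 2kT)."""
    return math.exp(m * omega**2 * radius**2 / (2.0 * kT))


beta, lam, m = 2.0, -0.6, 1.5      # hypothetical values, units with k = 1
omega = -lam / beta                # identification omega = -lambda / beta
r = (0.4, -0.2, 0.1)
p = (0.3, 0.5, -0.7)

gap = exponent_direct(r, p, beta, lam, m) - exponent_completed(r, p, beta, lam, m)
mean_v = tuple(-c / beta for c in cross((0.0, 0.0, lam), r))   # (A60)
rigid_v = cross((0.0, 0.0, omega), r)                          # omega x r
```

Integrating the completed square over the momenta leaves the factor `exp(m lam^2 (x^2 + y^2) / (2 beta))`, which is the density profile returned by `density_ratio` once `omega = -lam / beta`.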

Considering the invariance principle, Balian observed that the Lagrangian can be taken as remaining unchanged under any change of reference frame, because the stationary action principle is independent of the frame. To compare the Hamiltonians in the two frames, he wrote the Lagrangian of a single particle with position

r'

and the velocity

v'

in the rotating frame:

L_{1} = \frac{1}{2} m v^{2} = \frac{1}{2} m {(v' + [ω \times r'])}^{2}

(A61)

Balian then considered the conjugate momentum of

r'

:

p' = \frac{\partial L_{1}}{\partial v'} = m (v' + [ω \times r'])

(A62)

and the Hamiltonian in the rotating frame:

H_{1}' = (p' . v') - L_{1} = \frac{p'^{2}}{2 m} - (ω . [r' \times p'])

(A63)
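The Legendre transformation leading from (A61) to (A63) can be checked on numbers. The sketch below uses arbitrary illustrative values for the mass, the angular velocity and the state of the particle; the helper names are hypothetical.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Scalar product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


m = 2.0                      # hypothetical mass
omega = (0.0, 0.0, 0.7)      # hypothetical angular velocity
r_rot = (0.5, -1.2, 0.3)     # position r' in the rotating frame
v_rot = (0.1, 0.4, -0.2)     # velocity v' in the rotating frame

# (A61): the Lagrangian is the kinetic energy expressed with v = v' + omega x r'.
v_lab = tuple(v + w for v, w in zip(v_rot, cross(omega, r_rot)))
lagrangian = 0.5 * m * dot(v_lab, v_lab)

# (A62): conjugate momentum of r'.
p_rot = tuple(m * v for v in v_lab)

# (A63): Hamiltonian, first as p'.v' - L, then in closed form.
h_legendre = dot(p_rot, v_rot) - lagrangian
h_closed = dot(p_rot, p_rot) / (2.0 * m) - dot(omega, cross(r_rot, p_rot))
```

The two expressions agree because `p'.(omega x r') = omega.(r' x p')`, the cyclic property of the scalar triple product.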

The Gibbs density in the rotating frame is then given by:

D = \frac{1}{Z} e^{- β H'}

(A64)

where H’ is the sum over N particles:

H' = \sum_{i = 1}^{N} (\frac{p_{i}'^{2}}{2 m_{i}} - (ω . [r_{i}' \times p_{i}']))

(A65)

At this step, Balian observed that to switch back to the original coordinates,

p'

and

[r' \times p']

can be derived from

p

and

[r \times p]

, respectively, by means of the same change of coordinates that leads from

r

to

r'

. Balian then got:

H' = H - (ω . J)

(A66)

and identified density D with the earlier expression, provided

λ = - β ω

.

Balian observed that, as in the case of equilibrium of a gas in a gravitational field, the result could have been obtained by a macroscopic calculation from thermodynamics and fluid mechanics, using locally the perfect gas law and the balance between the forces, here centrifugal forces and pressure gradients. Balian recalled that we should fix the value of these Lagrange multipliers by requiring that on the average the angular and linear momenta vanish. For symmetry reasons these quantities vanish at the same time as the corresponding multipliers, and we have:

〈 J_{z} 〉 = - \frac{\partial \ln Z}{\partial λ} = N m ω R^{2} [\frac{1}{1 - \exp (- \frac{m ω^{2} R^{2}}{2 k T})} - \frac{2 k T}{m ω^{2} R^{2}}] \underset{ω \to 0}{\sim} \frac{1}{2} ω N m R^{2}

(A67)

and the energy:

U = - \frac{\partial \ln Z}{\partial β} = \frac{3}{2} N k T + \frac{1}{2} ω 〈 J_{z} 〉

(A68)
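Formula (A67) can be cross-checked against a numerical derivative of the partition function. The sketch below is an illustration under stated assumptions: an ideal gas in a cylinder of radius R, for which the ω-dependent part of ln Z per particle is the logarithm of the radial integral of exp(m ω² r² / 2kT) r dr, and λ = -βω at fixed β. Function names and values are hypothetical.

```python
import math


def mean_jz(omega, n_particles, m, radius, kT):
    """Closed form (A67) for the average angular momentum."""
    a = m * omega**2 * radius**2 / (2.0 * kT)
    bracket = 1.0 / (-math.expm1(-a)) - 1.0 / a
    return n_particles * m * omega * radius**2 * bracket


def log_z_rotation(omega, m, radius, kT):
    """omega-dependent part of ln Z per particle (assumed ideal gas in a cylinder):
    ln of the integral of exp(m omega^2 r^2 / 2kT) r dr over 0 <= r <= radius."""
    a = m * omega**2 * radius**2 / (2.0 * kT)
    return math.log(math.expm1(a) * kT / (m * omega**2))


def mean_jz_numeric(omega, n_particles, m, radius, kT, h=1e-5):
    """With lambda = -beta omega at fixed beta, -d(ln Z)/d(lambda) = kT d(ln Z)/d(omega)."""
    slope = (log_z_rotation(omega + h, m, radius, kT)
             - log_z_rotation(omega - h, m, radius, kT)) / (2.0 * h)
    return n_particles * kT * slope


N, m, R, kT = 1000, 1.0, 1.0, 1.0      # hypothetical values
closed = mean_jz(0.8, N, m, R, kT)
numeric = mean_jz_numeric(0.8, N, m, R, kT)

# Small-omega limit: <J_z> ~ (1/2) omega N m R^2 (uniform cylinder).
small_omega = 0.01
limit_ratio = mean_jz(small_omega, N, m, R, kT) / (0.5 * small_omega * N * m * R**2)
```

`math.expm1` is used instead of `exp(x) - 1` to avoid cancellation errors at small ω.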

Balian observed that in the change of frame, the linear momentum

m v'

is no longer equal to the momentum

p'

because the velocity

v = p / m

in the fixed frame is transformed in

v' = p' / m - [ω \times r']

in the rotating frame. Balian made the analogy with a particle of charge

q

in a magnetic field characterized by a velocity

(p - q A) / m

.

Balian wrote “Whereas positions and velocities are physical quantities, momenta have a certain amount of arbitrariness which is connected with the fact that we can change the Lagrangian by adding to it a time derivative without changing the equations of motion.” Balian gave the example of a Galilean transformation with velocity

u

with the procedure where the Lagrangian is assumed to be invariant

p_{i}' = p_{i}

whereas

v_{i}' = v_{i} - u

, the Hamiltonian becomes

H' = H - 〈 u, P 〉

, where

P

is the total momentum. Balian observed that another procedure, which better exhibits the Galilean invariance, consists in adding to the Lagrangian the ineffective term:

- \sum_{i} m_{i} \left( (v_{i}' . u) + \frac{1}{2} u^{2} \right) = \frac{d}{d t} \left( \sum_{i} m_{i} \left( \frac{1}{2} u^{2} t - (r_{i} . u) \right) \right)

(A69)

When we change coordinates

(r_{i}, v_{i})

to

(r_{i}', v_{i}')

, the momentum which is conjugate to

r_{i}'

is

p_{i}'' = p_{i} - m_{i} u = m_{i} v_{i}^{'}

and not

p_{i}' = p_{i}

and the Hamiltonian

H'' = H - (u . P) + \frac{1}{2} M u^{2}

has in terms of the

p_{i}^{''}

exactly the same form as

H

in terms of the

p_{i}

.
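The statement that H'' has the same form in the shifted momenta can be illustrated numerically. This minimal sketch keeps only the kinetic part of the Hamiltonian, on the assumption that interaction potentials depend on relative positions and are unchanged by the boost; masses, momenta and the boost velocity are arbitrary illustrative values.

```python
def dot(a, b):
    """Scalar product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


masses = [1.0, 2.5, 0.7]                                   # hypothetical masses
momenta = [(0.3, -0.1, 0.8), (-1.2, 0.4, 0.0), (0.5, 0.5, -0.9)]
u = (0.2, -0.3, 0.1)                                       # boost velocity

# Kinetic Hamiltonian, total momentum and total mass in the original frame.
H = sum(dot(p, p) / (2.0 * m) for m, p in zip(masses, momenta))
P = tuple(sum(p[k] for p in momenta) for k in range(3))
M = sum(masses)

# Shifted momenta p'' = p - m u, and the same kinetic form evaluated on them.
shifted = [tuple(pk - m * uk for pk, uk in zip(p, u)) for m, p in zip(masses, momenta)]
H_same_form = sum(dot(p, p) / (2.0 * m) for m, p in zip(masses, shifted))

# H'' = H - (u . P) + (1/2) M u^2
H_formula = H - dot(u, P) + 0.5 * M * dot(u, u)
```

The equality of `H_same_form` and `H_formula` is the Galilean invariance of the form of the Hamiltonian.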

Balian presented these arguments as a microscopic justification of such a calculation and wrote “As in the case of equilibrium of a gas in a gravitational field, we could have obtained the result by a macroscopic calculation from thermodynamics and fluid mechanics, using locally the perfect gas laws and the balance between the forces, here centrifugal forces and pressure gradients”.

Balian observed that usually no conditions are required on the Lagrange multipliers for dynamical constants of motion such as the angular or the linear momentum. Balian proposed to fix the values of these multipliers by requiring that on the average the angular and linear momenta vanish. Balian observed that for symmetry reasons, these quantities vanish at the same time as the corresponding multipliers, and we have:

\begin{array}{ll} 〈 J_{z} 〉 & = - \frac{\partial \ln Z}{\partial λ} = N m ω R^{2} \left[ \frac{1}{1 - e^{- m ω^{2} R^{2} / 2 k T}} - \frac{2 k T}{m ω^{2} R^{2}} \right] \\ & \underset{ω \to 0}{\sim} \frac{1}{2} ω N m R^{2} \end{array}

(A70)

The angular momentum

〈 J_{z} 〉

is to lowest order in

ω

the same as for the rotation of a cylinder with uniform density, which has a moment of inertia equal to

\frac{1}{2} N m R^{2}

. The energy contains a contribution due to the motion, and is given by:

U = - \frac{\partial \ln Z}{\partial β} = \frac{3}{2} N k T + \frac{1}{2} ω 〈 J_{z} 〉

(A71)

The entropy also depends on the rotational velocity, but only to order

ω^{4}

. It decreases with

ω

, as the rotation produces changes in density which increase the spatial order.

Appendix E. Proof of Convergence for Poly-Symplectic Model Based on Souriau Proof

Jean-Marie Souriau has given the following definition:

Definition A5. [Souriau Generalized Temperature Definition]

Let

G

be a Lie group acting on a symplectic manifold

(M, ω)

by a Hamiltonian action

Γ : G \times M \to M

,

g

its Lie algebra and

J : M \to g^{*}

a moment map of the action. A generalized temperature is an element

β \in g

such that the integral:

\int_{M} e^{- 〈 β, J 〉} d λ_{ω}

(A72)

is normally convergent.

Normal convergence means that there exists an open neighborhood

V

of

β

in

g

, and a function

f : M \to ℜ^{+}

integrable on

M

relative to Liouville measure

λ_{ω}

, such that:

e^{- 〈 β', J 〉} \leq f, \forall β' \in V

(A73)

Lebesgue theorem on dominated convergence gives the proof.

Jean-Marie Souriau then introduced the following proposition:

Proposition A3. [Souriau Differentiability Proposition]

Let

Ω

be the set of generalized temperatures, assumed non-empty. Then

Ω

is a convex open subset of the Lie algebra

g

that does not depend on the choice of the moment map

J

associated with the Hamiltonian action. The partition function

I_{0} : Ω \to ℜ

given by

I_{0} (β) = \int_{M} e^{- 〈 β, J 〉} d λ_{ω}

is infinitely differentiable on

Ω

. Its n-th differential is given by the tensorial integral:

I_{n} (β) = \int_{M} J^{\otimes n} e^{- 〈 β, J 〉} d λ_{ω}

(A74)

and is normally convergent.

Let

$β_{0}, β_{1} \in Ω$,
$V_{0}, V_{1}$ neighborhoods of $β_{0}, β_{1}$ respectively, and
$f_{0}, f_{1}$ positive integrable functions on $M$ such that:

$\begin{cases} e^{- 〈 β_{0}', J 〉} \leq f_{0}, \text{ if } β_{0}' \in V_{0} \\ e^{- 〈 β_{1}', J 〉} \leq f_{1}, \text{ if } β_{1}' \in V_{1} \end{cases}$

(A75)

\forall λ \in [0, 1], V_{λ} = \{ (1 - λ) β_{0}' + λ β_{1}' \mid β_{0}' \in V_{0}, β_{1}' \in V_{1} \}

is a neighborhood of

β_{λ}

given by

β_{λ} = (1 - λ) β_{0} + λ β_{1}

, and the function

f_{λ} = (1 - λ) f_{0} + λ f_{1}

is integrable on

M

and

e^{- 〈 β_{λ}', J 〉} \leq f_{λ}, \forall β_{λ}' \in V_{λ}

. Then

β_{λ} \in Ω

proving that

Ω

is convex.

The n-th differential of

e^{- 〈 β, J 〉}

is given by:

D^{n} (e^{- 〈 β, J 〉}) = {(- 1)}^{n} J^{\otimes n} e^{- 〈 β, J 〉}

(A76)

Selecting a norm on the Lie algebra

g

and considering the sup norm on the space

L (g, ℜ)

of n-multilinear forms on

g

, we can deduce on

g^{*}

and on

{[g^{*}]}^{\otimes n}

a norm of multilinear maps:

‖ J^{\otimes n} ‖ = \underset{‖ β ‖ = 1}{\sup} {| 〈 β, J 〉 |}^{n}

(A77)

Let:

β \in Ω, ε > 0 and e^{- 〈 β', J 〉} \leq f, if β' \in g and ‖ β' - β ‖ \leq ε

(A78)

Let

β'' \in g and ‖ β'' - β ‖ \leq \frac{ε}{2}

, for all

X \in g and ‖ X ‖ = 1

, then:

‖ 〈 X, J 〉 ‖ \leq \frac{2 n}{ε} e^{\frac{ε}{2 n} ‖ 〈 X, J 〉 ‖} \Rightarrow {‖ 〈 X, J 〉 ‖}^{n} e^{- 〈 β'', J 〉} \leq {(\frac{2 n}{ε})}^{n} e^{- 〈 β'' \pm \frac{ε}{2} X, J 〉}

(A79)

The last relation is established by considering:

\forall α \in ℜ, \forall n \in ℕ, {\left| \frac{2 α}{n} \right|}^{n} \leq {\left| 2 \sinh \left( \frac{α}{n} \right) \right|}^{n} = {\left| e^{\frac{α}{n}} - e^{- \frac{α}{n}} \right|}^{n} = \left| \sum_{p = 0}^{n} {(- 1)}^{p} C_{n}^{p} e^{- [- 1 + 2 \frac{p}{n}] α} \right|

(A80)
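Inequality (A80) and its binomial expansion can be checked numerically on a grid of values. This is a minimal sketch with hypothetical function names; it tests the relation for a few integers n ≥ 1 and real α.

```python
import math


def power_term(alpha, n):
    """Left-hand side |2 alpha / n|^n of (A80)."""
    return abs(2.0 * alpha / n) ** n


def sinh_term(alpha, n):
    """Middle term |2 sinh(alpha / n)|^n of (A80)."""
    return abs(2.0 * math.sinh(alpha / n)) ** n


def binomial_term(alpha, n):
    """Right-hand side of (A80): binomial expansion of (e^{a/n} - e^{-a/n})^n."""
    total = sum(
        (-1) ** p * math.comb(n, p) * math.exp(-(-1.0 + 2.0 * p / n) * alpha)
        for p in range(n + 1)
    )
    return abs(total)


grid = [(alpha / 4.0, n) for alpha in range(-12, 13) for n in range(1, 7)]
inequality_holds = all(
    power_term(a, n) <= sinh_term(a, n) + 1e-12 for a, n in grid
)
expansion_matches = all(
    abs(sinh_term(a, n) - binomial_term(a, n)) < 1e-9 for a, n in grid
)
```

The inequality itself only relies on |x| ≤ |sinh x| for every real x.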

If we select

X \in g

and

α = 〈 X, J 〉

:

{\left| \frac{2}{n} \right|}^{n} e^{- 〈 β, J 〉} {| 〈 X, J 〉 |}^{n} \leq \left| \sum_{p = 0}^{n} {(- 1)}^{p} C_{n}^{p} e^{- 〈 β - [2 \frac{p}{n} - 1] X, J 〉} \right|

(A81)

e^{- 〈 β, J 〉} \leq f \Rightarrow e^{- 〈 β, J 〉} {| 〈 X, J 〉 |}^{n} \leq n^{n} f, if ‖ β - β_{0} ‖ \leq \frac{ε}{2}, ‖ X ‖ \leq \frac{ε}{2}

(A82)

For

X

of unit norm, applying the previous inequality to the vector

\frac{ε}{2} X

gives:

{‖ 〈 X, J 〉 ‖}^{n} e^{- 〈 β, J 〉} \leq {(\frac{2 n}{ε})}^{n} f

(A83)

In

{‖ 〈 X, J 〉 ‖}^{n} e^{- 〈 β'', J 〉} \leq {(\frac{2 n}{ε})}^{n} e^{- 〈 β'' \pm \frac{ε}{2} X, J 〉}

, the sign

\pm

is selected such that

〈 \pm ε X, J 〉 \geq 0

.

As

‖ β - β'' \pm \frac{ε}{2} X ‖ \leq ε

, the final result is deduced:

‖ D^{n} (e^{- 〈 β'', J 〉}) ‖ \leq {[\frac{2 n}{ε}]}^{n} f \Rightarrow ‖ J^{\otimes n} e^{- 〈 β'', J 〉} ‖ \leq {[\frac{2 n}{ε}]}^{n} f

(A84)

It proves that the n-differential of

e^{- 〈 β, J 〉}

is normally integrable on

M

with respect to Liouville measure, the partition function is infinitely differentiable on

Ω

.

By considering the Taylor expansion of the exponential function:

e^{α} - 1 - α = \frac{α^{2}}{2} e^{λ α}, λ \in [0, 1]

(A85)

From which, we deduce that:

e^{- 〈 β - X, J 〉} J^{\otimes n} - e^{- 〈 β, J 〉} J^{\otimes n} - e^{- 〈 β, J 〉} J^{\otimes n + 1} (X) = \frac{1}{2} e^{- 〈 β - λ X, J 〉} J^{\otimes n + 2} (X) (X)

(A86)

where

T (X)

means the contraction of a covariant tensor with vector

X

. Then:

‖ J^{\otimes n + 2} e^{- 〈 β, J 〉} ‖ \leq {\left[ \frac{2 (n + 2)}{ε} \right]}^{n + 2} f \Rightarrow \left\| \frac{1}{2} e^{- 〈 β - λ X, J 〉} J^{\otimes n + 2} (X) (X) \right\| \leq \frac{1}{2} {\left[ \frac{2 (n + 2)}{ε} \right]}^{n + 2} f {‖ X ‖}^{2}

(A87)

By integration on

M

and using

\int_{M} f d λ_{ω} = a < + \infty

, we obtain:

‖ I_{n} (β - X) - I_{n} (β) - I_{n + 1} (β) (X) ‖ \leq \frac{a}{2} {\left[ \frac{2 (n + 2)}{ε} \right]}^{n + 2} {‖ X ‖}^{2} if β \in B (β_{0}, \frac{ε}{4}) and ‖ X ‖ \leq \frac{ε}{4}

(A88)

It proves that the function

I_{n} : β \in g \to ℜ

is continuous and differentiable in a neighborhood of

β_{0}

, and its derivative is given by

I_{n + 1}

. Then

I_{0}

is an infinitely differentiable function with

I_{n}

as n-th derivative.
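A toy case makes the proposition concrete. The sketch below assumes, purely for illustration, the plane M = ℜ² with the rotation group acting on it and the moment map J = (q² + p²)/2; neither is taken from the text. With the Liouville measure dq dp = 2π dj, the integrals become I_n(β) = 2π ∫ jⁿ e^{-βj} dj = 2π n!/β^{n+1}, the set Ω is the open half-line β > 0, and differentiation in β gives -I_{n+1}, the sign being the factor (-1)ⁿ of (A76).

```python
import math


def I_n(beta, n, j_max=40.0, steps=40000):
    """Trapezoid estimate of I_n(beta) = 2 pi * integral of j^n exp(-beta j), j >= 0."""
    if beta <= 0.0:
        raise ValueError("beta is outside Omega = (0, +inf): the integral diverges")
    h = j_max / steps
    total = 0.0
    for k in range(steps + 1):
        j = k * h
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * j**n * math.exp(-beta * j)
    return 2.0 * math.pi * h * total


def I_n_exact(beta, n):
    """Closed form 2 pi n! / beta^(n + 1) for the toy moment map."""
    return 2.0 * math.pi * math.factorial(n) / beta ** (n + 1)


beta = 1.5
relative_errors = [
    abs(I_n(beta, n) - I_n_exact(beta, n)) / I_n_exact(beta, n) for n in range(3)
]

# Central difference of I_0 in beta: should equal -I_1(beta).
step = 1e-4
dI0 = (I_n(beta + step, 0) - I_n(beta - step, 0)) / (2.0 * step)
```

The truncation at `j_max` is harmless here because the integrand decays exponentially; for β close to 0 a larger cut-off would be needed, which mirrors the loss of normal convergence at the boundary of Ω.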

These demonstrations can be extended to the poly-symplectic model of Souriau Lie groups thermodynamics by considering the poly-symplectic partition function:

I_{0}^{\mathrm{poly}} = \int_{M} e^{- \sum_{k = 1}^{n} 〈 β_{k}, J^{\otimes k} (ξ) 〉} d λ_{ω}

(A89)

and its n-th derivatives given by:

I_{n, i}^{\mathrm{poly}} = \frac{\partial^{n} I_{0}^{\mathrm{poly}}}{\partial β_{i}^{n}} = \int_{M} J^{\otimes n i} e^{- \sum_{k = 1}^{n} 〈 β_{k}, J^{\otimes k} 〉} d λ_{ω}

(A90)

where

J^{\otimes k} = J \otimes \overset{(k)}{J} \dots \otimes J

is defined as a tensorial product.
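A second-order toy case illustrates how the higher-order term changes the domain of convergence. The sketch below assumes, for illustration only, a one-dimensional Lie algebra with the moment map J = (q² + p²)/2 on the plane, so that J^{⊗2} reduces to j² and the poly-symplectic partition function becomes 2π ∫ exp(-β₁ j - β₂ j²) dj over j ≥ 0. The function names are hypothetical.

```python
import math


def poly_partition(beta1, beta2, j_max=30.0, steps=30000):
    """Trapezoid estimate of 2 pi * integral of exp(-beta1 j - beta2 j^2), j >= 0.
    The fixed cut-off j_max is adequate for moderate beta1 only."""
    if beta2 <= 0.0:
        raise ValueError("this sketch only handles beta2 > 0")
    h = j_max / steps
    total = 0.0
    for k in range(steps + 1):
        j = k * h
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * math.exp(-beta1 * j - beta2 * j * j)
    return 2.0 * math.pi * h * total


def poly_partition_exact(beta1, beta2):
    """Closed form obtained by completing the square in j (beta2 > 0)."""
    prefactor = 0.5 * math.sqrt(math.pi / beta2)
    return (2.0 * math.pi * prefactor
            * math.exp(beta1**2 / (4.0 * beta2))
            * math.erfc(beta1 / (2.0 * math.sqrt(beta2))))


# beta1 < 0 would make the first-order integral diverge; beta2 > 0 restores convergence.
beta1, beta2 = -1.0, 0.5
numeric = poly_partition(beta1, beta2)
exact = poly_partition_exact(beta1, beta2)
```

In this toy case the second-order multiplier controls convergence, so β₁ may take either sign, which is the kind of enlargement of the admissible set that higher-order moment constraints bring.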

Appendix F. Relativistic Souriau Thermodynamics of Continua

We summarize in this appendix the Souriau relativistic thermodynamics of fluids. This Souriau model of the relativistic thermodynamics of continua gives a solution to Duhem’s general thermodynamics: “Nous avons fait de la dynamique un cas particulier de la thermodynamique, une Science qui embrasse dans des principes communs tous les changements d’état des corps, aussi bien les changements de lieu que les changements de qualités physiques”. (In English: “We made dynamics a special case of thermodynamics, a science that embraces in common principles all changes of state of bodies, changes of place as well as changes in physical qualities”.)

The objective is not to survey all the literature on this topic. We give this model to compare Souriau’s approaches related to invariance and symmetries in thermodynamics. I think that this is the first time that the Souriau relativistic model is presented in English. My objective is to underline that Souriau replaced the geometric temperature of “Lie groups thermodynamics”, where the temperature is an element of the Lie algebra, by a temperature defined as a Killing vector. I also underline that, in both models, Souriau was motivated to search for solutions where the “Legendre transform” structure is preserved between Massieu thermodynamic potentials.

Kinematics is defined by the vector field

Θ

and the measurement of the number of molecules. Using two state functions, Souriau built a (thermo-)dynamics according to two principles: conservation of the Noetherian quantities attached to the Poincaré group, and positive entropy production. Such a dissipative fluid has movements in which the entropy production is zero;

Θ

is then a Killing vector; the equations of motion integrate fully; Souriau recovered in particular the results of kinetic theory at equilibrium. This method can be used to study perfect fluids; Souriau recovered the classic Lichnerowicz results; moreover, we can build, even in the non-isentropic case, a space-time 2-form

Ω

which is an integral invariant (in the sense of Cartan-Poincaré) of the temperature vector

Θ

; this provides a generalization of Helmholtz’s theorem. In weakly dissipative movements, the two viscosity coefficients and the thermal conductivity coefficient occur naturally; they are accompanied by two other coefficients that may be measurable on actual fluids.

Jean-Marie Souriau first considered the kinematics of a relativistic simple fluid, described by the space-time vector field given by the temperature vector

X \mapsto Θ

with:

Θ = U ε \text{ with } \begin{cases} U : \text{unit four-vector} \\ ε = \frac{1}{T} > 0 \ (\text{Boltzmann } k = 1) \end{cases}

(A91)

Θ

generates a one-parameter group of diffeomorphisms of space-time E₄; the group’s orbits (the current lines of the fluid) form an abstract space V₃, which has the structure of a manifold of dimension 3, characterized by the fact that the following projection is a restricted submersion:

X \in E_{4} \mapsto x \in V_{3}

(A92)

Consider the Lie derivative of the metric tensor g (for the vector field

X \in E_{4} \mapsto Θ

):

\begin{cases} γ = \frac{1}{2} δ_{L} g \\ δ X = Θ \end{cases}

(A93)

The Killing formula gives the symmetric tensor:

γ_{λ μ} = \frac{1}{2} [\partial_{λ} Θ_{μ} + \partial_{μ} Θ_{λ}]

(A94)

Let us consider a positive density

n

on the quotient manifold

V_{3}

:

x \in V_{3} \mapsto n

(A95)

Integral of

n

on

V_{3}

gives the number of molecules. Its reciprocal image by projection is defined by:

X \in E_{4} \mapsto N

(A96)

Particle conservation is given by:

\partial_{λ} N^{λ} = 0 with N = U n

(A97)

Direction of

U

or

Θ

defines a foliation of space-time E₄. The leaves are the current lines, solutions of:

\frac{d X}{d s_{c}} = U

(A98)

We illustrate in Figure A3 Souriau’s model of the thermodynamics of continua.

Figure A3. Souriau’s model of Thermodynamics of Continua.

Thermodynamic 1st principle in this model is given by:

\partial_{λ} T^{λ μ} = 0 with T^{λ μ} = T^{μ λ}

(A99)

The energy-momentum density tensor

T^{λ μ}

has been built by Souriau using the kinematic quantities, so as to satisfy the second principle.

Lemma A1 [Souriau Lemma].

Let

(n, ε) \mapsto ζ

be a differentiable function; then there is a symmetric tensor

{\overset{⌢}{T}}^{λ μ}

such that:

\partial_{λ} [N^{λ} ζ] = - {\overset{⌢}{T}}^{λ μ} γ_{λ μ} with Θ = U ε and N = U n

(A100)

{\overset{⌢}{T}}^{λ μ} = \frac{n^{2}}{ε} \frac{\partial ζ}{\partial n} [g^{λ μ} - U^{λ} U^{μ}] - n \frac{\partial ζ}{\partial ε} U^{λ} U^{μ}

(A101)

We assume that there exists a function

φ = φ (n, Θ, γ)

that is convex in γ and such that the energy-momentum density is given by:

T^{λ μ} = \frac{\partial φ}{\partial γ_{λ μ}}

(A102)

If we assume that

{γ_{λ μ} = 0} \Rightarrow {T^{λ μ} = {\overset{⌢}{T}}^{λ μ}}

then the following vector has a positive divergence:

S^{λ} = N^{λ} ζ + T^{λ μ} Θ_{μ}

(A103)

The Thermodynamic 2nd principle is given by:

\partial_{λ} S^{λ} \geq 0

(A104)

Proof is given by:

\begin{array}{l} \partial_{λ} S^{λ} = [T^{λ μ} - {\overset{⌢}{T}}^{λ μ}] γ_{λ μ} \\ \partial_{λ} S^{λ} = \left\{ φ (γ) - φ (0) - {\overset{⌢}{T}}^{λ μ} γ_{λ μ} \right\} + \left\{ φ (0) - φ (γ) - T^{λ μ} (- γ_{λ μ}) \right\} \geq 0 \end{array}

(A105)

Souriau proposed to define the dynamics of the fluid by means of the two functions

ζ

and

φ

which give at each point the energy tensor

T^{λ μ}

and the entropy flux

S^{λ}

by the following formulas. Once these functions are determined, we have 5 equations to determine the 5 variables

(n, Θ^{λ})

and, moreover, the flux

S^{λ}

; the inequality

\partial_{λ} S^{λ} \geq 0

expresses the 2nd principle.

\begin{cases} T^{λ μ} = \frac{\partial φ (n, Θ, γ)}{\partial γ_{λ μ}} \\ S^{λ} = N^{λ} ζ (n, ε) + T^{λ μ} Θ_{μ} \text{ with } Θ = U ε \text{ and } N = U n \\ γ_{λ μ} = \frac{1}{2} [\partial_{λ} Θ_{μ} + \partial_{μ} Θ_{λ}] \\ \partial_{λ} T^{λ μ} = 0 \text{ and } \partial_{λ} N^{λ} = 0 \end{cases}

(A106)

Souriau has then considered the case of non-dissipative movements. If

φ

is strictly convex for variable

γ

then:

\partial_{λ} S^{λ} = 0 \Leftrightarrow γ_{λ μ} = 0 \Leftrightarrow Θ \text{ is an infinitesimal isometry}

(A107)

For a non-dissipative solution of the equations of motion,

Θ

is a Killing vector, associated with an element of the Lie algebra of the Poincaré group:

Θ = \begin{bmatrix} Λ & Γ \\ 0 & 0 \end{bmatrix}

(A108)

with:

\left. \begin{matrix} Θ_{λ} = Λ_{λ μ} X^{μ} + Γ_{λ} & (Λ_{λ μ} + Λ_{μ λ} = 0) \\ Θ = U ε & \end{matrix} \right\} \Rightarrow U^{λ}, ε

(A109)
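The statement that Θ = Λ.X + Γ with antisymmetric Λ has a vanishing Killing tensor (A94) can be checked by finite differences. This minimal sketch assumes covariant Cartesian components in flat space-time, so that the derivatives are plain partial derivatives; the matrices and the evaluation point are arbitrary illustrative values and the function names are hypothetical.

```python
def theta(X, Lam, Gam):
    """Covariant components Theta_l = Lam[l][m] X^m + Gam[l] (flat space-time assumed)."""
    return [sum(Lam[l][m] * X[m] for m in range(4)) + Gam[l] for l in range(4)]


def killing_tensor(X, Lam, Gam, h=1e-5):
    """gamma_{lm} = (d_l Theta_m + d_m Theta_l) / 2, by central differences (A94)."""
    grad = [[0.0] * 4 for _ in range(4)]          # grad[l][m] = d_l Theta_m
    for l in range(4):
        X_plus, X_minus = list(X), list(X)
        X_plus[l] += h
        X_minus[l] -= h
        t_plus, t_minus = theta(X_plus, Lam, Gam), theta(X_minus, Lam, Gam)
        for m in range(4):
            grad[l][m] = (t_plus[m] - t_minus[m]) / (2.0 * h)
    return [[0.5 * (grad[l][m] + grad[m][l]) for m in range(4)] for l in range(4)]


# Antisymmetric Lambda: an element of the Lie algebra of the Poincare group.
Lam = [[0.0, 0.3, -0.2, 0.1],
       [-0.3, 0.0, 0.5, 0.0],
       [0.2, -0.5, 0.0, 0.4],
       [-0.1, 0.0, -0.4, 0.0]]
Gam = [1.0, 0.0, -0.5, 0.2]
X = [0.3, 1.0, -2.0, 0.5]

gamma_killing = killing_tensor(X, Lam, Gam)
max_gamma = max(abs(g) for row in gamma_killing for g in row)

# Adding a symmetric part (a dilation along one axis) breaks the Killing property.
Lam_dilated = [row[:] for row in Lam]
Lam_dilated[1][1] = 0.1
gamma_dilated = killing_tensor(X, Lam_dilated, Gam)
```

Only the antisymmetric part of Λ leaves γ equal to zero, which is why non-dissipative motions are parametrized by the Lie algebra of the Poincaré group.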

The equations of motion integrate through an arbitrary constant:

ζ + \frac{\partial ζ}{\partial n} n = \mathrm{const} \Rightarrow n

(A110)

The thermodynamic quantities are the following:

specific molecular volume:

$u = \frac{1}{n}$

(A111)
specific mass:

$ρ = - n \frac{\partial ζ}{\partial ε} = - \frac{1}{u} \frac{\partial ζ}{\partial ε}$

(A112)
pressure:

$p = - \frac{n^{2}}{ε} \frac{\partial ζ}{\partial n} = \frac{1}{ε} \frac{\partial ζ}{\partial u}$

(A113)

In the case of zero entropy production:

\begin{array}{l} \partial_{λ} S^{λ} = 0 \Rightarrow \begin{cases} γ = 0 \\ Θ = Λ . X + Γ \end{cases} \Rightarrow \begin{cases} U^{λ} \partial_{λ} ε = 0 \Rightarrow \exists ε, x \in V_{3} \mapsto ε \\ \partial_{λ} U^{λ} = 0 \Rightarrow [\partial_{λ} N^{λ} = 0 \Rightarrow U^{λ} \partial_{λ} n = 0] \Rightarrow \exists n, x \in V_{3} \mapsto n \\ ε U^{λ} \partial_{λ} U_{μ} + \partial_{μ} ε = 0 \end{cases} \\ \Rightarrow \text{the variables } n \text{ and } ε \text{ are constant on current lines} \end{array}

(A114)

We can also deduce the following equations:

\begin{cases} Θ = Λ . X + Γ \\ \partial_{λ} N^{λ} = 0 \end{cases} \Rightarrow U^{λ} \partial_{λ} \left[ \frac{n^{2}}{ε} \frac{\partial ζ}{\partial n} \right] = 0 \text{ and } U^{λ} \partial_{λ} \left[ n \frac{\partial ζ}{\partial n} \right] = 0

(A115)

From tensor computation, Souriau has computed the energy-momentum density currents:

\begin{array}{l} \partial_{λ} N^{λ} = 0 \Rightarrow \partial_{λ} [N^{λ} ζ] = N^{λ} \partial_{λ} ζ = U^{λ} n [\frac{\partial ζ}{\partial n} \partial_{λ} n + \frac{\partial ζ}{\partial ε} \partial_{λ} ε] \\ γ_{λ μ} = \frac{1}{2} [\partial_{λ} Θ_{μ} + \partial_{μ} Θ_{λ}] = \frac{ε}{2} [\partial_{λ} U_{μ} + \partial_{μ} U_{λ}] + \frac{1}{2} [U_{λ} \partial_{μ} ε + U_{μ} \partial_{λ} ε] \\ \Rightarrow g^{λ μ} γ_{λ μ} = ε \partial_{λ} U^{λ} + U^{λ} \partial_{λ} ε \end{array}

(A116)

with the following developments:

\begin{array}{l} U unitary \Rightarrow U_{λ} U^{λ} = g_{λ μ} U^{λ} U^{μ} = 1 \Rightarrow U^{λ} \partial_{μ} U_{λ} = 0 \Rightarrow U^{λ} U^{μ} γ_{λ μ} = U^{λ} \partial_{λ} ε \\ \partial_{λ} N^{λ} = 0 \Rightarrow U^{λ} \partial_{λ} n + \partial_{λ} U^{λ} n = 0 \\ \Rightarrow T^{λ μ} = \frac{n^{2}}{ε} \frac{\partial ζ}{\partial n} [g^{λ μ} - U^{λ} U^{μ}] - n \frac{\partial ζ}{\partial ε} U^{λ} U^{μ} \end{array}

(A117)

For this non-dissipative movement, we can prove:

\begin{cases} U^{λ} \partial_{λ} \left[ \frac{n^{2}}{ε} \frac{\partial ζ}{\partial n} \right] = 0 \\ U^{λ} \partial_{λ} \left[ n \frac{\partial ζ}{\partial n} \right] = 0 \end{cases} \text{ and } \begin{cases} U^{λ} \partial_{λ} ε = 0 \\ \partial_{λ} U^{λ} = 0 \\ ε U^{λ} \partial_{λ} U_{μ} + \partial_{μ} ε = 0 \end{cases}

(A118)

\begin{array}{l} T^{λ μ} = \frac{n^{2}}{ε} \frac{\partial ζ}{\partial n} [g^{λ μ} - U^{λ} U^{μ}] - n \frac{\partial ζ}{\partial ε} U^{λ} U^{μ} \\ \Rightarrow \partial_{λ} T^{λ μ} = g^{λ μ} \left\{ \partial_{λ} \left[ \frac{n^{2}}{ε} \frac{\partial ζ}{\partial n} \right] + \frac{\partial_{λ} ε}{ε} \left[ \frac{n^{2}}{ε} \frac{\partial ζ}{\partial n} + n \frac{\partial ζ}{\partial ε} \right] \right\} = \frac{n}{ε} g^{λ μ} \partial_{λ} \left\{ n \frac{\partial ζ}{\partial n} + ζ \right\} \end{array}

(A119)

\Rightarrow \begin{cases} \partial_{λ} T^{λ μ} = 0 \\ \partial_{λ} N^{λ} = 0 \end{cases} \text{ integrable on } \begin{cases} n \text{ constant on current lines} \\ n \frac{\partial ζ}{\partial n} + ζ \text{ constant in space-time} \end{cases}

(A120)

Souriau has proved that the entropy vector preserves the Legendre transform:

\begin{array}{l} \begin{cases} S^{λ} = N^{λ} ζ + T^{λ μ} Θ_{μ} \\ T^{λ μ} = \frac{n^{2}}{ε} \frac{\partial ζ}{\partial n} [g^{λ μ} - U^{λ} U^{μ}] - n \frac{\partial ζ}{\partial ε} U^{λ} U^{μ} \\ Θ = U ε \text{ and } N = U n \end{cases} \Rightarrow S^{λ} = N^{λ} \left[ ζ - ε \frac{\partial ζ}{\partial ε} \right] \\ S^{λ} = N^{λ} s \Rightarrow s = ζ - ε \frac{\partial ζ}{\partial ε} \end{array}

(A121)

with the entropy per molecule:

s = ζ + ρ u ε

(A122)

ζ

is the Massieu potential (Massieu characteristic function):

\begin{array}{l} ζ = - \frac{F}{T} = - \frac{u ρ - T s}{T} \text{ with } F : \text{Helmholtz free energy} \\ ζ + \frac{\partial ζ}{\partial n} n = - \frac{G}{T} = - \frac{F + p u}{T} \text{ with } G : \text{Gibbs-Duhem free energy} \end{array}

(A123)
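The relations (A112), (A113), (A121), (A122) and (A123) can be illustrated on a simple Massieu potential. The sketch below assumes, for illustration only, a non-relativistic ideal gas with rest mass (units with k = c = 1, additive constants dropped), ζ(n, ε) = -ln n - (3/2) ln ε - m ε; this choice of ζ and all names are assumptions, not taken from the text. The pressure is computed from the first expression in (A113).

```python
import math

MASS = 1.0   # hypothetical rest mass per molecule, units with k = c = 1


def zeta(n, eps):
    """Illustrative Massieu potential of an ideal gas (an assumption)."""
    return -math.log(n) - 1.5 * math.log(eps) - MASS * eps


def dzeta_dn(n, eps, h=1e-6):
    """Central-difference estimate of the partial derivative in n."""
    return (zeta(n + h, eps) - zeta(n - h, eps)) / (2.0 * h)


def dzeta_deps(n, eps, h=1e-6):
    """Central-difference estimate of the partial derivative in eps."""
    return (zeta(n, eps + h) - zeta(n, eps - h)) / (2.0 * h)


n, eps = 2.0, 0.5                  # density and inverse temperature (T = 2)
T = 1.0 / eps
u = 1.0 / n                        # specific molecular volume (A111)

rho = -n * dzeta_deps(n, eps)                     # specific mass (A112)
p = -(n**2 / eps) * dzeta_dn(n, eps)              # pressure (A113)
s_legendre = zeta(n, eps) - eps * dzeta_deps(n, eps)   # (A121)
s_direct = zeta(n, eps) + rho * u * eps                # (A122)

# (A123): Helmholtz and Gibbs free energies per molecule.
F = u * rho - T * s_legendre
massieu_from_F = -F / T
gibbs_side = -(F + p * u) / T
gibbs_from_zeta = zeta(n, eps) + n * dzeta_dn(n, eps)
```

For this ζ the sketch recovers the perfect gas law p = n T and the energy density ρ = n (m + 3T/2), and the entropy per molecule is the Legendre transform of ζ in the variable ε.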

The link with Souriau 2-form and Poincaré-Cartan integral invariant is given by the following developments. Consider the 1-form given by enthalpy:

H_{λ} = h U_{λ} with h = \frac{p + ρ}{n} = u [p + ρ]

(A124)

Its exterior derivative is the 2-form:

Ω_{λ μ} = \partial_{λ} H_{μ} - \partial_{μ} H_{λ}

(A125)

The equations of motion are replaced by:

\begin{cases} \partial_{λ} N^{λ} = 0 \\ \partial_{λ} T^{λ μ} = 0 \end{cases} \Rightarrow \begin{cases} \partial_{λ} N^{λ} = 0 \\ Ω_{λ μ} Θ^{μ} + \partial_{λ} s = 0 \end{cases}

(A126)

Ω

is a Poincaré-Cartan integral invariant of the field:

\begin{array}{l} Ω_{λ μ} Θ^{μ} + \partial_{λ} s = 0 \Rightarrow \begin{cases} δ s = 0 \\ δ_{L} Ω = 0 \end{cases} \text{ for } δ X = Θ \\ \text{if } \partial_{λ} s = 0 \text{ (isentropic movement)} \Rightarrow Θ \in \ker (Ω) \end{array}

(A127)

Souriau then considered weakly dissipative movements. If the function

φ = φ (n, Θ, γ)

is not known, it can be approximated by its second-order expansion in the variable

γ

:

φ = φ_{0} + {\overset{⌢}{T}}^{λ μ} γ_{λ μ} + \frac{1}{2} C^{λ μ, v q} γ_{λ μ} γ_{v q} \Rightarrow T^{λ μ} = \frac{\partial φ}{\partial γ_{λ μ}} = {\overset{⌢}{T}}^{λ μ} + C^{λ μ, v q} γ_{v q}

(A128)

Entropy production is given by:

\begin{array}{l} \partial_{λ} S^{λ} = [T^{λ μ} - {\overset{⌢}{T}}^{λ μ}] γ_{λ μ} = C^{λ μ, v q} γ_{λ μ} γ_{v q} \\ \text{Onsager reciprocity} \Rightarrow C^{λ μ, v q} = C^{v q, λ μ} \end{array}

(A129)

The 55 transport coefficients

C^{λ μ, v q}

are reduced to 5 coefficients (by the symmetries of the fluid and Onsager reciprocity): A, B, C, E and F.

Souriau then obtained the relativistic (Fourier) equation of heat. Let us consider the stress tensor:

\begin{array}{l} τ_{j k} = - T_{j k} = δ_{j k} \left[ - p + λ_{\mathrm{vis}} \partial_{l} v^{l} - B \frac{\partial ε}{\partial t} \right] + μ_{\mathrm{vis}} [\partial_{j} v_{k} + \partial_{k} v_{j}] \\ (j, k = 1, 2, 3 \text{ and } v_{j} \text{ the velocity, zero at the point considered}) \end{array}

(A130)

With the equations given by:

Heat Flux:

$T^{j 0} = F {\left[ \overrightarrow{\mathrm{grad}} \, ε - ε \frac{\partial \vec{v}}{\partial t} \right]}^{j}$

(A131)
Specific Mass-Energy:

$T^{00} = ρ + C \frac{\partial ε}{\partial t} - B ε \, \mathrm{div} (\vec{v})$

(A132)

with:

$λ_{\mathrm{vis}} = \left[ A - \frac{2 E}{3} \right] ε, μ_{\mathrm{vis}} = E ε, ε = \frac{1}{T}$ and thermal conductivity: $\frac{F}{T^{2}}$

The coefficients A, B, C, E and F are functions of ε and n, and the convexity of φ implies:

A > 0, C > 0, E > 0, F > 0, | B | < \sqrt{A C}

(A133)
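The inequalities (A133) are easy to test for a given set of coefficients. The sketch below uses hypothetical function names and arbitrary sample values. It also illustrates a standard fact: the conditions A > 0, C > 0 and |B| < √(AC) are together equivalent to positive definiteness of the symmetric matrix [[A, B], [B, C]]. Whether that matrix is the block of the Hessian of φ intended in the text is an assumption.

```python
import math

def satisfies_convexity(A, B, C, E, F):
    """Return True when all the inequalities of (A133) hold."""
    # Short-circuit evaluation guarantees A * C > 0 before the square root.
    return A > 0 and C > 0 and E > 0 and F > 0 and abs(B) < math.sqrt(A * C)

def block_is_positive_definite(A, B, C):
    """Sylvester criterion for the symmetric matrix [[A, B], [B, C]]."""
    return A > 0 and A * C - B * B > 0

# Arbitrary illustrative samples (A, B, C, E, F).
samples = [
    (2.0, 1.0, 3.0, 1.0, 1.0),   # all inequalities hold
    (2.0, 3.0, 3.0, 1.0, 1.0),   # |B| exceeds sqrt(A C)
    (-1.0, 0.0, 3.0, 1.0, 1.0),  # A is not positive
    (2.0, 1.0, 3.0, -1.0, 1.0),  # E is not positive
]
results = [satisfies_convexity(*s) for s in samples]
print(results)
```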

References

Chatelain, J.M. Pascal, le Coeur et la Raison; Bibliothèque nationale de France: Paris, France, 2016. [Google Scholar]
Duhem, P. Sur les équations générales de la thermodynamique. In Annales Scientifiques de l’École Normale Supérieure; Ecole Normale Supérieure: Paris, France, 1891; Volume 8, pp. 231–266. (In French) [Google Scholar]
Duhem, P. Commentaire aux principes de la Thermodynamique—Troisième partie. J. Math. Appl. 1894, 10, 207–286. (In French) [Google Scholar]
Needham, P. Commentary on the Principles of Thermodynamics by Pierre Duhem; Boston Studies in the Philosophy of Science; Needham, P., Ed.; Springer: Berlin, Germany, 2011. [Google Scholar]
Duhem, P. Commentaire aux principes de la Thermodynamique—Première partie. J. Math. Appl. 1892, 8, 269–330. (In French) [Google Scholar]
Bordoni, S. From thermodynamics to philosophical tradition: Pierre Duhem’s research between 1891 and 1896. Lettera Matematica 2017, 5, 261–266. [Google Scholar] [CrossRef]
Stoffel, J.-F. Pierre Duhem: Un savant-philosophe dans le sillage de Blaise Pascal. Revista Portuguese de Filosofia 2007, 63, 275–307. [Google Scholar] [CrossRef]
Le Ferrand, H.; Mazliak, L. Pierre Duhem (1861–1916) et ses Contemporains Institut Henri Poincaré, 14 Septembre 2016; Organisée par Hervé Le Ferrand (Dijon)—Laurent Mazliak: Paris, France, 2016. [Google Scholar]
Barbaresco, F. Jean-Louis Koszul and the elementary structures of Information Geometry. In Geometric Structures of Information Geometry; Nielsen, F., Ed.; Springer: Berlin, Germany, 2018. [Google Scholar]
Barbaresco, F. Koszul Lecture Contemporaneity: Elementary Structures of Information Geometry and Geometric Heat Theory. In Introduction to Symplectic Geometry; Koszul, J.L., Ed.; Springer: Berlin, Germany, 2018. [Google Scholar]
Barbaresco, F. Jean-Louis Koszul et les Structures Elémentaires de la Géométrie de l’Information. Revue SMAI Matapli. SMAI, Ed.; 2018, Volume 116. Available online: https://www.see.asso.fr/pdf_viewer/22381/m (accessed on 2 November 2018).
Duhem, P. Recherches sur l‘élasticité. Ann. Ecole Norm. 1905, 22, 143–217. [Google Scholar] [CrossRef]
Souriau, J.M. Thermodynamique Relativiste des Fluides; Rendiconti del Seminario Matematico; Università Politecnico di Torino: Torino, Italy, 1978; Volume 35, pp. 21–34. [Google Scholar]
Souriau, J.M. Milieux continus de dimension 1, 2 ou 3: Statique et dynamique. In Proceedings of the 13eme Congrès Français de Mécanique, Poitiers, France, 1–5 September 1997; pp. 41–53. [Google Scholar]
Amari, S.I. Natural gradient works efficiently in learning. Neural Comput. 1998, 10, 251–276. [Google Scholar] [CrossRef]
Amari, S.I.; Nagaoka, H. Methods of Information Geometry; Harada, D., Ed.; Translations of Mathematical Monographs; American Mathematical Society: Providence, RI, USA, 2000; Volume 191. [Google Scholar]
Pascanu, R.; Bengio, Y. Natural gradient revisited. arXiv 2013, arXiv:1301.3584v1. [Google Scholar]
Martens, J. New insights and perspectives on the natural gradient method. arXiv 2014, arXiv:1412.1193. [Google Scholar]
Ollivier, Y. Riemannian metrics for neural networks I: Feedforward networks. Inf. Inference 2015, 4, 108–153. [Google Scholar] [CrossRef]
Amari, S.I. Information Geometry and Its Applications; Applied Mathematical Sciences; Springer: Berlin, Germany, 2016. [Google Scholar]
Ollivier, Y.; Arnold, L.; Auger, A.; Hansen, N. Information-geometric optimization algorithms: A unifying picture via invariance principles. J. Mach. Learn. Res. 2017, 18, 1–65. [Google Scholar]
Ollivier, Y.; Marceau-Caron, G. Natural Langevin dynamics for neural networks. In Geometric Science of Information (GSI 2017); Nielsen, F., Barbaresco, F., Eds.; Lecture Notes in Computer Science 10589; Springer: Berlin, Germany, 2017; pp. 451–459. [Google Scholar]
Marle, C.-M. From Tools in Symplectic and Poisson Geometry to J.-M. Souriau’s Theories of Statistical Mechanics and Thermodynamics. Entropy 2016, 18, 370. [Google Scholar] [CrossRef]
Balian, R.; Valentin, P. Hamiltonian structure of thermodynamics with gauge. Eur. Phys. J. B 2001, 21, 269–282. [Google Scholar] [CrossRef]
Balian, R. From Microphysics to Macrophysics, 2nd ed.; Springer: Berlin, Germany, 2007; Volume I. [Google Scholar]
Der Schaft, A.; Maschke, B. Homogeneous Hamiltonian Control Systems Part I: Geometric Formulation; Elsevier: Amsterdam, The Netherlands, 2018; Volume 51, pp. 1–6. [Google Scholar]
Jaworski, W. Information thermodynamics with the second order temperatures for the simplest classical systems. Acta Phys. Pol. 1981, 60, 645–659. [Google Scholar]
Jaworski, W. Higher-order moments and the maximum entropy inference: The thermodynamical limit approach. J. Phys. A Math. Gen. 1987, 20, 915–926. [Google Scholar] [CrossRef]
Ingarden, H.S.; Meller, J. Temperatures in linguistics as a model of thermodynamics. Open Syst. Inf. Dyn. 1994, 2, 211–230. [Google Scholar] [CrossRef]
Ingarden, R.S.; Nakagomi, T. The second order extension of the Gibbs state. Open Syst. Inf. Dyn. 1992, 1, 259–268. [Google Scholar] [CrossRef]
Ingarden, R.S.; Kossakowski, A.; Ohya, M. Information Dynamics and Open Systems; Classical and Quantum Approach, Fundamental Theories of Physics; Springer: Berlin, Germany, 1997; Volume 86. [Google Scholar]
Jaworski, W.; lngarden, R.S. On the partition function in information thermodynamics with higher order temperatures. Bull. Acad. Pol. Sci. Sér. Phys. Astron. 1980, 1, 28–119. [Google Scholar]
Jaworski, W. On Information Thermodynamics with Temperatures of the Second Order. Master’s Thesis, Institute of Physics, Nicolaus Copernicus University, Torun, Poland, 1981. (In Polish). [Google Scholar]
Jaworski, W. On the thermodynamic limit in information thermodynamics with higher-order temperatures. Acta Phys. Pol. 1983, A63, 3–19. [Google Scholar]
Jaworski, W. Investigation of the Thermodynamic Limit for the States Maximizing Entropy under Auxiliary Conditions for Higher-Order Statistical Moments. Ph.D. Thesis, Institute of Physics, Nicolaus Copernicus University, Torun, Poland, 1983. (In Polish). [Google Scholar]
Ingarden, R.S.; Kossakowski, A. Statistical thermodynamics with higher order temperatures for ideal gases of bosons and fermions. Acta Phys. Pol. 1965, 28, 499–511. [Google Scholar]
Ingarden, R.S.; Tamassy, L. On parabolic geometry and irreversible macroscopic time. Rep. Math. Phys. 1993, 32, 11–33. [Google Scholar] [CrossRef]
Ingarden, R.S. Towards mesoscopic thermodynamics: Small systems in higher-order states. Open Syst. Inf. Dyn. 1993, 1, 75–102. [Google Scholar] [CrossRef]
Ingarden, R.S.; Janyszek, H.; Kossakowski, A.; Kawaguchi, T. Information geometry of quantum statistical systems. Tensor Ns 1982, 37, 105–111. [Google Scholar]
Ingarden, R.S.; Kossakowski, A. On the connection of nonequilibrium information thermodynamics with non-hamiltonian quantum mechanics of open systems. Ann. Phys. 1975, 89, 451–485. [Google Scholar] [CrossRef]
Casalis, M. Familles Exponentielles Naturelles Invariantes par un Groupe. Ph.D. Thesis, l’Université Paul Sabatier, Toulouse, France, 1990. [Google Scholar]
Casalis, M. Familles Exponentielles Naturelles sur Rd Invariantes par un Groupe. Int. Stat. Rev. 1991, 59, 241–262. [Google Scholar] [CrossRef]
Souriau, J.-M. Structures des Systèmes Dynamiques; Dunod: Paris, France, 1970. [Google Scholar]
Koszul, J.L. Introduction to Symplectic Geometry; Science Press: Beijing, China, 1986. (In Chinese), translated by SPRINGER in English, 2018 [Google Scholar]
Marle, C.M. Géométrie Symplectique et Géométrie de Poisson; Mathématiques en Devenir, Calvage & Mounet: Paris, France, 2018. [Google Scholar]
Kostant, B. Quantization and Unitary Representations; Lecture Notes in Math. 170; Springer: Berlin, Germany, 1970. [Google Scholar]
Koszul, J.L.; Travaux, D.B. Kostant sur les Groupes de Lie Semi-Simples; Séminaire Bourbaki: Paris, France, 1958–1960; pp. 329–337.
Gunther, C. The polysymplectic Hamiltonian formalism in field theory and calculus of variations I: The local case. J. Differ. Geom. 1987, 25, 23–53. [Google Scholar] [CrossRef]
Munteanu, F.; Rey, A.M.; Salgado, M. The Günther’s formalism in classical field theory: Momentum map and reduction. J. Math. Phys. 2004, 5, 1730–1751. [Google Scholar] [CrossRef]
Awane, A. k-symplectic structures. J. Math. Phys. 1992, 33, 4046–4052. [Google Scholar] [CrossRef]
Awane, A.M. Goze, Pfaffian Systems, k-Symplectic Systems; Springer: Berlin, Germany, 2000. [Google Scholar]
Edelen, D.G.B. The invariance group for Hamiltonian systems of partial differential equations. Arch. Rational Mech. Anal. 1961, 5, 95–176. [Google Scholar] [CrossRef]
De Donder, T. Théorie Invariante du Calcul des Variations, Nuov. ed.; Gauthiers–Villars: Paris, France, 1935. [Google Scholar]
Lepage, T. Sur les champs géodésiques du calcul des variations. Bull. Acad. R. Belg. Classes Sci. 1936, 22. [Google Scholar]
Hélein, F. Multisymplectic formalism and the covariant phase space. In Variational Problems in Differential Geometry; Bielawski, R., Houston, K., Speight, M., Eds.; London Mathematical Society Lecture Note Series 394; Cambridge University Press: Cambridge, UK, 2012. [Google Scholar]
Facchi, P.; Kulkarni, R.; Manko, V.I.G.; Marmo, S.E.C.G.; Ventriglia, F. Classical and quantum Fisher information in the geometrical formulation of quantum mechanics. Phys. Lett. A 2010, 374, 4801. [Google Scholar] [CrossRef]
Contreras, E.; Schiavina, M. On the geometry of mixed states and the quantum information tensor. J. Math. Phys. 2016, 57, 062209. [Google Scholar] [CrossRef]
Luati, A. Maximum Fisher information in mixed state quantum systems. Ann. Stat. 1770, 32, 2004. [Google Scholar] [CrossRef]
Contreras, E.; Schiavina, M. Kähler fibrations in quantum information theory. arXiv 2018, arXiv:1801.09793. [Google Scholar]
Souriau, J.-M. La structure symplectique de la mécanique décrite par Lagrange en 1811. Math. Sci. Hum. 1986, 94, 45–54. [Google Scholar]
Marle, C.M. The inception of Symplectic Geometry: The works of Lagrange and Poisson during the years 1808–1810. Lett. Math. Phys. 2009, 90, 3. [Google Scholar] [CrossRef]
Barbaresco, F.; Boyom, M. Foundations of Geometric Structure of Information. In Proceedings of the FGSI’19, IMAG lab (Institut Montpelliérain Alexander Grothendieck), Montpellier, France, 4–6 February 2019; Available online: https://fgsi2019.sciencesconf.org/ (accessed on 11 November 2018).
Szczeciniarz, J.-J.; Iglesias-Zemmour, P. SOURIAU 2019 Conference, SPHERE, Université Paris-Diderot, Paris, France, 27–31 May 2019. Available online: http://souriau2019.fr/ (accessed on 1 November 2018).
Souriau, J.-M. Mécanique statistique, groupes de Lie et cosmologie, Colloques int. du CNRS numéro 237. In Proceedings of the Géométrie Symplectique et Physique Mathématique, Aix-en-Provence, France, 24–28 June 1974; pp. 59–113. [Google Scholar]
Obădeanu, V. Structures géométriques associées a certains systèmes dynamiques. Balkan J. Geom. Appl. 2000, 5, 81–89. [Google Scholar]
Obădeanu, V. Systèmes Dynamiques et Structures Géométriques Associées; Universitatea din Timișoara, Facultatea de Matematică: Timișoara, Romania, 1999. [Google Scholar]
Obădeanu, V. Systèmes Biodynamiques et Lois de Conservation Applications au Systèmes de Neurones; Universitatea din Timișoara, Facultatea de Matematicӑ: Timișoara, Romania, 1994. [Google Scholar]
Gallissot, F. Les formes extérieures en mécanique. Annales de l’Institut Fourier 1952, 4, 145–297. [Google Scholar] [CrossRef]
Gallissot, F. les formes extérieures et la mécaniques des milieux continus. Annales de l’Institut Fourier 1958, 8, 291–335. [Google Scholar] [CrossRef]
Souriau, J.M. C’est quantique? Donc c’est Géométrique. Feuilletages—Quantification Géométrique: Textes des Journées D’étude des 16 et 17 Octobre 2003. Available online: http://semioweb.msh-paris.fr/f2ds/docs/feuilletages/Jean-Marie_Souriau3.pdf (accessed on 1 November 2018).
Souriau, J.M. C’est Quantique? Donc c’est Géométrique. Feuilletages—Quantification Géométrique Video. 2003. Available online: https://www.youtube.com/watch?time_continue=417&v=vZeidrBPljM (accessed on 1 November 2018).
Souriau, J.M. Géométrie et Relativité. Collection Enseignement des Sciences; Hermann: Paris, France, 1964. [Google Scholar]
Souriau, J.M. Thermodynamique et géométrie. Lecture Notes Math. 1976, 676, 369–397. [Google Scholar]
Souriau, J.M.; Iglesias, P. Le Chaud, le Froid et la Géométrie, Groupe de Contact de Géométrie Différentielle et de Topologie Algébrique du FNRS; Université de Liège: Liège, Belgium, 1980. [Google Scholar]
Stueckelberg, E.C.G.; Wanders, G. Thermodynamique en Relativité Générale. Helv. Phys. Acta 1953, 26, 307–316. [Google Scholar]
Lichnerowicz, A. Théories Relativistes de la Gravitation et de L’électromagnétisme; Relativité Générale et Théories Unitaires; Masson et Cie: Paris, France, 1955. [Google Scholar]
Vallée, C. Lois de Comportement des Milieux Continus Dissipatifs Compatibles avec la Physique Relativiste. Ph.D. Thesis, University of Poitiers, Poitiers, France, 1978. [Google Scholar]
Vallée, C. Relativistic thermodynamics of continua. Int. J. Eng. Sci. 1981, 19, 589–601. [Google Scholar] [CrossRef]
Garrel, J. Tensorial Local-Equilibrium Axion and Operator of Evolution. Il Nuovo Cimento 1986, 94, 119–139. [Google Scholar] [CrossRef]
Anile, A.; Choquet-Bruhat, Y. Relativistic Fluid Dynamics; Lecture Notes in Mathematics; Springer: Berlin, Germany, 1989. [Google Scholar]
De Saxcé, G.; Vallée, C. Galilean Mechanics and Thermodynamics of Continua; Wiley-ISTE: Hoboken, NJ, USA, 2016. [Google Scholar]
de Saxcé, G. 5-Dimensional Thermodynamics of Dissipative Continua. In Models, Simulation, and Experimental Issues in Structural Mechanics; Springer: Berlin, Germany, 2017. [Google Scholar]
Ingarden, R.S. Information geometry in functional spaces of classical ad quantum finite statistical systems. Int. J. Eng. Sci. 1981, 19, 1609–1616. [Google Scholar] [CrossRef]
Ingarden, R.S. Information Geometry of Thermodynamics. Trans. Tenth Prague Conf. 1988, 10, 421–428. [Google Scholar]
Mrugala, R. On equivalence of two metrics in classical thermodynamics. Physica 1984, 125A, 631–639. [Google Scholar] [CrossRef]
Mrugala, R. Riemannian and Finslerian geometry in thermodynamics. Open Syst. Inf. Dyn. 1992, 1, 379–396. [Google Scholar] [CrossRef]
Mrugala, R. On a special family of thermodynamic processes and their invariants. Rep. Math. Phys. 2000, 46, 461–468. [Google Scholar] [CrossRef]
Mrugała, R. On contact and metric structures on thermodynamic spaces. RIMS Kokyuroku 2000, 1142, 167–181. [Google Scholar]
Mrugała, R. Structure group U(n) x 1 in thermodynamics. J. Phys. A Math. Gen. 2005, 38, 10905. [Google Scholar] [CrossRef]
Arnold, V.I. Contact geometry: The geometrical method of Gibbs’s thermodynamics. In Proceedings of the Gibbs Symposium, New Haven, CT, USA, 15–17 May 1989; Caldi, D.G., Mostow, G.D., Eds.; Yale University: New Haven, CT, USA, 1989; pp. 163–179. [Google Scholar]
Barbaresco, F. Geometric Theory of Heat from Souriau Lie Groups Thermodynamics and Koszul Hessian Geometry. Entropy 2016, 18, 386. [Google Scholar] [CrossRef]
Kijowski, W. A finite dimensional canonical formalism in the classical field theory. Commun. Math. Phys. 1973, 30, 99–128. [Google Scholar] [CrossRef]
Kijowski, W. Multiphase spaces and gauge in the calculus of variations. Bulletin de L Academie Polonaise des Sciences-Serie des Sciences Mathematiques Astronomiques et Physiques 1974, 22, 1219–1225. [Google Scholar]
Kijowski, W.; Szczyrba, W. A canonical structure for classical field theories. Commun. Math Phys. 1976, 46, 183–206. [Google Scholar] [CrossRef]
Nakagomi, T. Mesoscopic version of thermodynamic equilibrium condition. Another approach to higher order temperatures. Open Syst. Inf. Dyn. 1992, 1, 233–241. [Google Scholar] [CrossRef]
Nencka, H.; Streater, R.F. Information Geometry for some Lie algebras. Infin. Dimens. Anal. Quantum Probab. Relat. Top. 1999, 2, 441–460. [Google Scholar] [CrossRef]
Sampieri, U. Lie group structures and reproducing kernels on homogeneous siegel domains. Annali di Matematica Pura ed Applicata 1988, 152, 1–19. [Google Scholar] [CrossRef]
Alexeevsky, D. Vinberg’s Theory of Homogeneous Convex Cones: Developments and Applications; Transformation groups 2017. Conference dedicated to Prof. Ernest B. Vinberg on the occasion of his 80th birthday, Moscow, December 2017 [Video]. Available online: http://www.mathnet.ru/present19121 (accessed on 1 November 2018).
Trépreau, J.-M. Transformation de Legendre et pseudoconvexité avec décalage. J. Fourier Anal. Appl. 1995, 1, 569–588. [Google Scholar]
Leray, J. Le calcul differentiel et intégral sur une variété analytique complexe. Bull. Soc. Math. France 1952, 87, 81–180. [Google Scholar]
Brenier, Y. Un algorithme rapide pour le calcul de transformées de Legendre-Fenchel discrètes. C. R. Acad. Sci. Paris 1989, 308, 587–589. [Google Scholar]
Legendre, A.M. Mémoire Sur L’intégration de Quelques Equations aux Différences Partielles; Mémoires de l’Académie des Sciences: Paris, France, 1787; pp. 309–351. [Google Scholar]
Konstantatou, M.; McRobie, A. Reciprocal constructions using conic sections and Poncelet duality. In Proceedings of the IASS 2016 Tokyo Symposium: Spatial Structures in the 21st Century—Graphic Statics, Tokyo, Japan, 26–30 September 2016. [Google Scholar]
Benayoun, L. Méthodes Géométriques pour L’étude des Systèmes Thermodynamiques et la Génération D’équations D’état. Ph.D. Thesis, Institut National Polytechnique de Grenoble, Grenoble, France, 1999. [Google Scholar]
Der Schaft, A.; Maschke, B. Homogeneous Hamiltonian Control Systems Part II: Application to thermodynamic systems. IFAC-PapersOnLine 2018, 51, 7–12. [Google Scholar]
Delzant, T.; Wacheux, C. Action Hamiltoniennes: Invariants et Classification; Organisé par Michel Brion et Thomas Delzant, CIRM: Luminy, France, 2010; Volume 1, pp. 23–31. [Google Scholar]
Moreau, J.J. Fonctions convexes duales et points proximaux dans un espace hilbertien. C. R. Acad. Sci. Paris 1962, 255, 2897–2899. [Google Scholar]
Libermann, P. Legendre foliations on contact manifolds. Differ. Geom. Appl. 1991, 1, 57–76. [Google Scholar] [CrossRef]
Kostant, B.; Sahi, S. The Capelli identity, tube domains, and the generalized Laplace transform. Adv. Math. 1991, 87, 71–92. [Google Scholar] [CrossRef]
Duhem, P. Sur la stabilité d’un système animé d’un mouvement de rotation, Comptes rendus, t. CXXXII, séance du 29 Avril 1901. 1021.
Duhem, P. Sur la stabilité de l’équilibre d’une masse fluide animée d’un mouvement de rotation. J. Math. 1901, VII, 311–330. [Google Scholar]
Duhem, P. Stabilité pour des perturbations quelconques, d’un système animé d’un mouvement de rotation uniforme. C. R. 1902, CXXXIV, 23. [Google Scholar]
Duhem, P. Sur la stabilité pour des perturbations quelconques, d’un système animé d’un mouvement de rotation uniforme. Journal de Mathématiques pures et Appliquées 1902, VIII, 5. [Google Scholar]
Poincaré, H. Sur l’équilibre d’une masse fluide animée d’un mouvement de rotation, chap. 14, Stabilité des ellipsoïdes. Acta Mathematica 1885, VII, 366–367. [Google Scholar]
Barbaresco, F. Poly-symplectic Model of Higher Order Souriau Lie Groups Thermodynamics for Small Data Analytics. In Geometric Science of Information; Springer: Berlin, Germany, 2017; Volume 10589, pp. 432–441. [Google Scholar]
Volterra, V. Sulle Equazioni Differenziali che Provengono da Questiono di Calcolo delle Variazioni; Serise IV; Tip. della R. Accademia dei Lincei: Roma, Italy, 1890; Volume VI, pp. 42–54. [Google Scholar]
Volterra, V. Sopra una Estensione della Teoria Jacobi-Hamilton del Calcolo delle Variazioni; Serise IV; Tip. della R. Accademia dei Lincei: Roma, Italy, 1890; Volume VI, pp. 127–138. [Google Scholar]
Dedecker, P. Calcul des variations, formes différentielles et champs géodésiques. Géométrie Différentielle 1953, 52, 17. [Google Scholar]
Dedecker, P. On the generalization of symplectic geometry to multiple integrals in the calculus of variations. In Differential Geometrical Methods in Mathematical Physics; Bleuler, K., Reetz, A., Eds.; Lect. Notes Maths; Springer: Berlin, Germany, 1977; Volume 570, pp. 395–456. [Google Scholar]
Hélein, F.; Kouneiher, J. Covariant Hamiltonian formalism for the calculus of variations with several variables: Lepage–Dedecker versus De Donder–Weyl. Adv. Theor. Math. Phys. 2004, 8, 565–601. [Google Scholar] [CrossRef]
Carathéodory, C. Uber die Extremalen und geod ätischen Felder in der Variationsrechnung der mehrfachen Integrale. Acta Sci. Math. (Szeged) 1929, 4, 193–216. [Google Scholar]
Weyl, H. Geodesic fields in the calculus of variations. Ann. Math. 1935, 36, 607–629. [Google Scholar] [CrossRef]
Edelen, D.G.B. Nonlocal Variations and Local Invariance of Fields; American Elsevier: New York, NY, USA, 1969. [Google Scholar]
Rund, H. The Hamilton-Jacobi Theory in the Calculus of Variations; Van Nostrand: Princeton, NJ, USA, 1966. [Google Scholar]
Cartan, E. Sur les espaces à connexion affine et la théorie de la relativité généralisée, partie I. Ann. Ec. Norm 1923, 40, 325–412. [Google Scholar]
Cartan, E. Sur les espaces à connexion affine et la théorie de la relativité généralisée (suite). Ann. Ec. Norm. 1924, 41, 1–25. [Google Scholar]
Cartan, E. Sur les espaces connexion affine et la théorie de la relativité généralisée partie II. Ann. Ec. Norm. 1925, 42, 17–88. [Google Scholar]
Cartan, E. La Méthode du Repère Mobile, la Théorie des Groupes Continus et les Espaces Généralisés; Exposés de Géométrie, No. 5; Hermann: Paris, France, 1935. [Google Scholar]
Alekseevsky, D. Vinberg’s Theory of Homogeneous Convex Cones: Developments and Applications, Transformation Groups 2017. Conference Dedicated to Prof. Ernest B. Vinberg on the Occasion of His 80th Birthday, Moscow, December 2017. Available online: https://www.mccme.ru/tg2017/slides/alexeevsky.pdf (accessed on 1 November 2018).
Lichnerowicz, A.; Medina, A. On Lie groups with left-invariant symplectic or Kählerian structures. Lett. Math. Phys. 1988, 16, 225–235. [Google Scholar] [CrossRef]
Scholz, E.E. Cartan’s attempt at bridge-building between Einstein and the Cosserats – or how translational curvature became to be known as torsion. arXiv 2018, arXiv:1810.03872v1 [math.HO]. [Google Scholar]
Duhem, P. La théorie physique: Son objet, sa structure, Vrin Edition ed. 2007. Available online: https://books.openedition.org/enseditions/6077 (accessed on 2 November 2018).
Conteras, I.; Alba, N.M. Poly-Poisson sigma models and their relational poly-symplectic groupoids. J. Math. Phys. 2018, 59, 072901. [Google Scholar] [CrossRef]
Belgodère, P. Courbure moyenne généralisée. C. R. Acad. Sci. Paris 1944, 218, 739–740. [Google Scholar]
Belgodère, P. Extremales d’une intégrale de surface Sg(p, q)dxdy. C. R. Acad. Sci. Paris 1944, 219, 272–273. [Google Scholar]

Figure 1. Institut des Hautes Etudes de Tunis, 8 rue de Rome, where Souriau developed his theory of Geometric Mechanics and Lie Groups Thermodynamics (http://www.ina.fr/video/AFE01000164).

Figure 2. Evolution space V, Space of motions U and classical space time.

Figure 3. Higher order maximum entropy density for constraints (32) from Ingarden’s paper.

Figure 4. Higher order maximum entropy density for constraints (36) from Ingarden’s paper.

Figure 5. Global Souriau scheme of Lie groups thermodynamics.

Figure 6. Broken symmetry on geometric heat Q due to adjoint action of the group on temperature β as an element of the Lie algebra.

Figure 7. Comparison of the Souriau equations (column on the left) and Koszul equations (column on the right).

Figure 8. Simplest use case of Souriau’s Lie groups thermodynamics: the thermodynamics of a centrifuge such as a butter churn (a device used to convert cream into butter). (a) Butter churn centrifuge with horizontal axis; (b) butter churn centrifuge with vertical axis.

Figure 9. Three Sources of Geometric Structures for Information and Heat.

Figure 10. Koszul lecture “Introduction to Symplectic Geometry”, where the Souriau model of non-equivariance is developed.

Figure 11. Mediterranean sources of Souriau’s book on the structure of dynamical systems: Carthage and Massilia, where Souriau wrote this text and developed the theory.

© 2018 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

