Thermodynamics à la Souriau on Kähler Non-Compact Symmetric Spaces for Cartan Neural Networks

Fré, Pietro G.; Sorin, Alexander S.; Trigiante, Mario

doi:10.3390/e28040365

Open AccessArticle

Thermodynamics à la Souriau on Kähler Non-Compact Symmetric Spaces for Cartan Neural Networks

by

Pietro G. Fré

^1,2,*,

Alexander S. Sorin

³

and

Mario Trigiante

^4,5

¹

Dipartimento di Fisica, Università di Torino, Via P. Giuria 1, I-10125 Torino, Italy

²

Additati & Partners Consulting s.r.l.,Via Filippo Pacini 36, I-51100 Pistoia, Italy

³

Center for Quantum Science and Technology, Tel-Aviv University, Tel Aviv 69978, Israel

⁴

Dipartimento DISAT, Politecnico di Torino, Corso Duca degli Abruzzi 24, I-10129 Torino, Italy

⁵

Istituto Nazionale di Fisica Nucleare (INFN), Sezione di Torino, I-10125 Torino, Italy

^*

Author to whom correspondence should be addressed.

Entropy 2026, 28(4), 365; https://doi.org/10.3390/e28040365

Submission received: 18 December 2025 / Revised: 4 March 2026 / Accepted: 10 March 2026 / Published: 24 March 2026

(This article belongs to the Collection Feature Papers in Information Theory)

Download

Browse Figures

Versions Notes

Abstract

In this paper, we clarify several issues concerning the abstract geometrical formulation of thermodynamics on non-compact symmetric spaces

U / H

that are the mathematical model of hidden layers in the new paradigm of Cartan Neural Networks. We introduce a clear-cut distinction between the generalized thermodynamics associated with Integrable Dynamical Systems and the challenging proposal of Gibbs probability distributions on

U / H

provided by generalized thermodynamics à la Souriau. Our main result is the proof that

U / H

.s supporting such Gibbs distributions are only the Kähler ones. Furthermore, for the latter, we solve the problem of determining the space of temperatures, namely, of Lie algebra elements for which the partition function converges. The space of generalized temperatures is the orbit under the adjoint action of

U

of a positivity domain in the Cartan subalgebra

C_{c} \subset H

of the maximal compact subalgebra

H \subset U

. We illustrate how our explicit constructions for the Poincaré and Siegel planes might be extended to the whole class of Calabi–Vesentini manifolds utilizing Paint Group symmetry. Furthermore, we claim that Rao’s, Chentsov’s, and Amari’s Information Geometry and the thermodynamical geometry of Ruppeiner and Lychagin are the very same thing. In particular, we provide an explicit study of thermodynamical geometry for the Poincaré plane. The key feature of the Gibbs probability distributions in this setup is their covariance under the entire group of symmetries U. The partition function is invariant against

U

transformations, and the set of its arguments, namely the generalized temperatures, can always be reduced to a minimal set whose cardinality is equal to the rank of the compact denominator group

H \subset U

.

Keywords:

generalized thermodynamics; Lie groups; symplectic geometry; Cartan neural networks; contact geometry; Shannon entropy; Cartan Hadamard manifolds; non compact symmetric spaces; partition function; Siegel plane

1. Introduction

The purpose of the present paper is to clarify all the proper relations, identifications, and, when necessary, clear-cut distinctions among several mathematical constructions that have been recently introduced by different researchers into the mathematical formulation of Machine Learning and that admit, as their own conceptual pivot, the notions of Lie group, Hamiltonian dynamical system and the geometrical rephrasing of thermodynamical equilibrium states. This is particularly relevant in view of the new paradigm of Cartan Neural Networks [1,2,3,4], intrinsically characterized by the identification of the network hidden layers with as many non-compact symmetric spaces

U_{i} / H_{i}

, each metrically equivalent to an appropriate solvable Lie group. The generalized notion of Gibbs state provides the proper way to introduce Gaussian-like probability distributions on non-compact symmetric spaces that, in Cartan Neural Network architectures, constitute the hidden layers. Hence, we start with a short summary of the new CaNN paradigm, highlighting its geometrical and group-theoretical strategic aspects. After that, we introduce the other mathematical ingredients of our discussion and clarification plan.

Let us remark since the very beginning that the generalized Gibbs probability distributions, whose structure, properties and appropriate construction principles are the main target of the present investigation, have been introduced into Machine Learning about a decade ago as a preferred method of Machine Learning analyzis of electromagnetic signals such as those involved in radar technologies [5] (see also the review [6]). More generally, these probability distributions, which are covariant with respect to the full group of isometries of the manifold on which they are defined and fit into geometrical thermodynamics (alias information geometry), are promising tools for Machine Learning architectures concerned with all kinds of electromagnetic signals and also all kinds of temporal sequences.

1.1. Cartan Neural Networks: A New Paradigm

In [1], whose authors’ list includes two of us, a new mathematical paradigm was introduced for the engineering of neural network architectures under the name of PGTS (PGTS is an acronym for Paint Group Tits–Satake) theory of non-compact symmetric spaces. The essential points of this paradigm are:

The systematic substitution of the Euclidean $R^{n}$ space with a non-compact symmetric coset manifold $U / H$ , where $U$ is a simple non-compact Lie group, and $H$ is the maximal compact subgroup of $U$ . All these manifolds are Cartan–Hadamard manifolds and are metrically equivalent to a specific solvable Lie group manifold $S_{U / H}$ .
The grouping of these manifolds into Tits Satake universality classes, which provides an ideal mathematical definition of neural layers.
The systematic suppression of point-wise activation functions like the sigmoid and its close relatives, the necessary non-linearity being universally provided by generalized exponential maps from Lie Algebras to the corresponding Lie Groups and the generalized logarithm maps that are the inverse of the former.

In a twin pair of papers [2,3], it was shown how a generic multi-layer neural network can be cast into a form that implements points (1), (2), and (3) of the above paradigm. This class of neural network architectures based on the above principles was named Cartan neural networks (CaNNs) in honor of the monumental achievement of Èlie Cartan, who obtained the complete classification of all symmetric spaces and their one-to-one correspondence with the classification of real forms of simple complex Lie algebras [7,8,9,10]. In [2,3], the general scheme of supervised learning for a classification task within CaNNs was addressed. In [4], many additional mathematical general conceptions and constructions were included in the toolbox for the engineering of CaNNs, encompassing, in particular, the general theory of codimension one separators, harmonic analyzis on non-compact

U / H

, Tits Satake fibre bundles and tessellation groups. We refer the reader to [4] for further details on such topics.

Covariance of CaNNs

In [2], an algorithm was described in which each datum is linearly mapped (with a matrix

W_{0}

, target of learning) to the solvable coordinate vector

Υ

labeling a point in a non-compact symmetric space

U_{1} / H_{1}

. The latter is the first layer in a sequence of similar layers

U_{i} / H_{i}

, each a non-compact symmetric space with (a priori different) dimension

d_{i}

. As discussed at length in [2], the general scheme also allows the non-compact rank and the type of the various

U_{i} / H_{i}

to be different; yet, thanks to the fundamental property of metric equivalence with a suitable solvable group

S_{i}

, we can constrain the map from one layer to the next one to be a group homomorphism derived from a linear homomorphism of the corresponding solvable Lie algebras. More specifically, denoting by

K_{i}

the map from the ith-layer, described by the space

U_{i} / H_{i}

, to the subsequent one

U_{i + 1} / H_{i + 1}

, this map is described by a group homomorphism between

S_{i}

and

S_{i + 1}

while the corresponding push forward map

K_{i *}

is a linear homomorphism between the solvable Lie algebras

S o l v_{i}, S o l v_{i + 1}

, generating

S_{i}

and

S_{i + 1}

, respectively:

\forall X, Y \in S o l v_{i} : K_{i *} [X, Y] = [K_{i *} X, K_{i *} Y] \in S o l v_{i + 1} .

If

d_{i} \leq d_{i + 1}

,

K_{i}

can be characterized as an isometric inclusion [11] having the defining property that, if

g_{i}

and

g_{i + 1}

denote the Riemannian metrics on

U_{i} / H_{i}

and

U_{i + 1} / H_{i + 1}

, respectively,

\forall X, Y \in S o l v_{i} : g_{i} (X, Y) = g_{i + 1} (K_{i *} X, K_{i *} Y) .

As shown in [11], the mapping

K_{i *}

between the tangent spaces at corresponding points on the two manifolds, separately isomorphic to

S o l v_{i}

and

S o l v_{i + 1}

, is injective. If, on the other hand,

d_{i} > d_{i + 1}

, as a mapping between a linear space

S o l v_{i}

and a lower-dimensional one

S o l v_{i + 1}

,

K_{i *}

has a non-trivial kernel. If we define the metric

g_{i}^{(0)}

on

S_{i}

by the property:

\forall X, Y \in S o l v_{i} : g_{i}^{(0)} (X, Y) = g_{i + 1} (K_{i *} X, K_{i *} Y),

g_{i}^{(0)}

is singular and thus does not coincide with

g_{i}

. Nevertheless, being

K_{i *}

, a homomorphism between Lie algebras,

Ker (K_{i *})

, of dimension

d_{i} - d_{i + 1}

, is an ideal of

S o l v_{i}

consisting of the zero-norm vectors with respect to

g_{i}^{(0)}

, orthogonal to all the other vectors with respect to the same singular metric. Restricted to the solvable Lie algebra

S o l v_{i}^{'} \equiv S o l v_{i} ⊖ Ker (K_{i *})

,

g_{i}^{(0)}

coincides with

g_{i}

and, therefore, when

d_{i} > d_{i + 1}

,

K_{i}

can be characterized as an isometry between

S_{i}^{'} \equiv exp (S o l v_{i}^{'})

, with metric

g_{i}

restricted to

S o l v_{i}^{'} \times S o l v_{i}^{'}

, and

S_{i + 1}

.

This general characterization of

K_{i} : U_{i} / H_{i} \to U_{i + 1} / H_{i + 1}

as an isometric mapping implies its general covariance with respect to the transformations of both the

U_{i}

and the

U_{i + 1}

group. The action of the two groups on

K_{i}

can be formally described as follows:

K_{i} \to U_{i + 1} \circ K_{i} \circ U_{i} .

(1)

The Relevance of Covariance

Covariance, as expressed in Equation (1), is the fundamental conceptual and architectural advance provided by CaNN.s that, as explained in [2], are not just one proposal among others, rather they constitute the unique available scheme, allowed by basic theorems of differential geometry, able to dispose off the point-wise activation functions (contradictory with any sort of covariance), to maintain the existence of a unique distance function on each network layer and to preserve indispensable non-linearity.

1.2. The Mathematical Basis of CaNN

Having introduced the new paradigm, we summarize the mathematical key items that constitute its foundation.

The strategic metric equivalence with solvable groups

As discussed at length in the foundational paper [1] and already recalled above, the strategic element that allows the construction of CaNN.s, with all the properties mentioned above, is the metric equivalence of all non-compact symmetric spaces

U / H

with a suitable solvable Lie subgroup

S_{U / H} \subset U

which is a generalization, in each different

U

-case, of the Borel subgroup, applying to the case

U = SL (N, R)

. This metric equivalence amounts to the statement that all non-compact symmetric spaces are Alekseveskian Normal Riemannian Manifolds.

Alekseevsky Normal Manifolds and Solvable Lie Groups

Following the original viewpoint of Alekseevsky [12,13], we say that a Riemannian manifold

(M, g)

is normal if it admits a completely solvable Lie group

S_{M} ≃ exp [{Solv}_{M}]

of isometries that acts on the manifold in a simply transitive manner (i.e., for every 2 points in the manifold there is one and only one group element connecting them). The group

S_{M}

is then generated by a so-called normal metric Lie algebra, that is, a completely solvable Lie algebra

{Solv}_{M}

endowed with an Euclidean, positive definite,

S o l v

-invariant, symmetric form

<, >

. The main tool to classify and study the normal homogeneous spaces is provided by the theorem [8,14] that states that if a Riemannian manifold

(M, g)

admits a transitive normal solvable group of isometries

exp [{Solv}_{M}]

, then it is metrically equivalent to this solvable group manifold

\begin{matrix} M & ≃ & exp [{Solv}_{M}], \\ g ∣_{e \in M} & = & <, > \end{matrix}

(2)

where

<, >

is the Euclidean metric defined on the normal solvable Lie algebra

{Solv}_{M}

.

The original conjecture of Alekseevsky was just restricted to quaternionic Kähler manifolds and stated that any such manifold

M

that was also homogeneous and of negative Ricci curvature should be normal, in the over-mentioned sense, namely a transitive solvable group of isometries

exp [{Solv}_{M}]

should exist, that could be identified with the manifold itself. Note that the actual group of isometries

U

of

M

could be much larger than the solvable group,

U \supset exp [{Solv}_{M}],

(3)

as it is, for instance, the case for all symmetric spaces

M = \frac{U}{H}

; yet, the solvable normed Lie algebra

({Solv}_{M}, <, >)

had to exist. The problem of classifying the considered manifolds was turned in this way into the problem of classifying the normal metric solvable Lie algebras

(S o l v, <, >)

. Note that in Alekseevsky’s case, the symmetric form

<, >

was not only required to be positive definite but also quaternionic Kähler. Alekseevsky’s conjecture actually applies to much more general homogeneous Riemannian manifolds than the quaternionic ones: For instance, it applies to all those endowed with a special Kähler geometry or with a real special one, as the classification of de Wit et al. [15,16,17] demonstrated. It also applies to the symmetric spaces appearing in the scalar sector of extended supergravities with more than eight supercharges. For all these manifolds, there exists the corresponding normal metric algebra

(S o l v, <, >)

; in other words, they are normal. Actually, by explicit construction, as discussed in [1], all non-compact symmetric spaces

U / H

with

U

a simple non-compact Lie group and

H \subset U

its maximal compact subgroup are normal Aleksveskian manifolds.

1.3. The Link with Symplectic Geometry and Generalized Thermodynamics

Analyzing the mathematical foundations of CaNNs, we are naturally led to observe a natural link with symplectic geometry and generalized thermodynamics. In this subsection, we unveil, also historically, such a conceptual path.

Integrability of geodesic equations, Poisson, and symplectic manifolds

The mathematical theory that links non-compact symmetric manifolds to Normed Solvable Lie Algebras was pioneered by mathematicians but then it was extensively developed within the context of supergravity, since all scalar manifolds of extended supergravity Lagrangians happen to be non-compact symmetric spaces and their solvable representations played a decisive role in the systematic resolution of several problems, in particular the construction of cosmic billiards and extremal black-hole solutions. It was in this framework that the integrability of geodesic equations for such manifolds was analyzed by Fré and Sorin in terms of a Lax pair equation [18] in 2006, and also a Poisson manifold viewpoint underlying such integrability was introduced by the same authors in 2009 [19]. The complete integrability and the explicit integration of geodesic equations in

U / H

symmetric spaces is an essential brick in the construction of Cartan Neural Network architectures and it is discussed at length in the foundational paper [1], where it is shown that the explicit result for the solvable coordinates, as functions

Υ (t)

of the affine parameter t, can be obtained directly in terms of the initial data, namely the starting point and the so named matrix of conserved Noether charges Q, bypassing the solution of Lax equation. It might seem from this that the Poisson manifold viewpoint [19] is interesting yet unnecessary in the context of Machine Learning, but such a conclusion is too hasty and incorrect for the following reason. The virtue of the Poisson/symplectic approach to the geodesic problem is that it puts it into the perspective of dynamical systems and at the same time of geometric thermodynamics, creating a triple link among the geodesics on

U / H

, the symplectic/contact geometry of thermodynamics, and the contact structures of fluid-dynamics [20,21]. As we are going to see, a recent research line in Machine Learning introduces, in a different setup and under the name of Gibbs States for Lie Groups, basic structures of geometric thermodynamics, so that a conceptual clarification of all the relations is quite appropriate and useful in order to combine Cartan Neural Networks with statistical conceptions as those advocated in the mentioned research line.

Generalized Geometric Thermodynamics

A process of fundamental importance in Chemistry is the separation of different substances that are present in gas mixtures and multi-component liquids. From a conceptual point of view, any separation method is based on the thermodynamics of mixtures of different components coexisting in different phases. Since the 19th century, this phenomenon has been carefully conceptualized by great chemists, physicists, and mathematicians, and its study has become a focal point of statistical mechanics and classical thermodynamics. Gibbs’ rule of phases and the use, at the level of statistical mechanics, of the grand canonical ensemble (see Appendix C.3), with the introduction of the chemical potential are two fundamental junctures in this affair. However, the discouraging and critical aspect in this area of knowledge is that the exact calculation of canonical and grand canonical partition functions is of extreme difficulty when the particles forming the chemicals of our interest interact with each other, i.e., always, in the case of real and non-ideal substances. The cases of exact computation of the partition function are isolated and rare, reducing essentially to those of the classical ideal gas, the quantum free gases of bosons or fermions, and the two-dimensional Ising model for ferromagnetism. In all other cases, there is a plethora of approximation methods and sophisticated perturbative or approximate computational techniques. The object of primary interest for thermodynamic calculations is the equation of state, i.e., the relation between both extensional (such as volume V, entropy S, and internal energy U) and intensive (such as temperature T and chemical potentials

μ_{i}

) thermodynamic quantities that is valid in equilibrium states (see Appendix C for a summary of classical thermodynamics). Equations of state can be derived exactly from the partition function if one knows the latter. Thus, alternating with attempts at direct calculations of certain partition functions, there has been, over the last century, a great deal of modeling activity, both theoretical and experimental, aimed at constructing mathematical formulations of equations of state in want of the missing partition function. However, such equations of state are just phenomenological models, and a deeper understanding of their rationale is necessary. Thanks to the work, hitherto little known outside a small community of specialists, of an even smaller number of low-temperature physicists and mathematical physicists, there exists a surprisingly innovative geometric view of classical thermodynamics that provides a more intrinsic view of thermodynamic states and seems able, by classical geometric means to provide mesoscopic information about real gases and liquids, while also defining a conceptual frame of reference in which phenomenological equations of state can be evaluated and possibly modified in a more systematic and profound way, particularly taking into account the possible isometries of the Riemannian metric surprisingly associated with the space of thermodynamic variables.

The problem is quite general, also in systems of different nature; for example, in Big Data sets, if one arrives at a thermodynamical limit description, one can wonder about the advantages of a geometrical formulation of thermodynamics.

In view of applications to the research compound of Geometric Deep Learning, which implies the use of predetermined metrics, it is a stimulating perspective to compare the metric setup of Geometrical Thermodynamics and that of Information Geometry, in particular focusing on the symplectic structure that can migrate from Thermodynamics to Data Science.

The small group of low-temperature physicists and mathematical physicists to whom we owe the entire body of developments related to the geometric view of thermodynamics we have referred to consists of three senior founders the American George Ruppeiner affiliated with the New College of Florida in Sarasota and the two Russians Valentin Lychagin and Mikhail Roop, plus a small cohort of adherents consisting of their occasional collaborators, Ph.D. students, postdocs, and so on. It is very interesting to read Ruppeiner’s autobiographical article [22] written in April 2016 for the commemorative volume in honor of Horst Meyer’s 90th birthday, who unfortunately passed away a few months later. In this article, the author recounts how, in the years 1975–1980 when he was a Ph.D. student at Duke University conducting low-temperature gas experiments in Meyer’s laboratory, devoting most of his efforts to perfecting himself in low-temperature experimental physics, he nonetheless had a drive to take General Relativity Courses and deepen his knowledge of differential geometry. A spark was ignited in his mind, he recounts, when he read in Physics Today an article by Frank Weinhold [23] in which a metric form was introduced in the context of thermodynamic variables, something hitherto considered peregrine and absurd. Ruppeiner, on the other hand, regarded it as a serious suggestion and deemed that a Riemannian view of thermodynamics could be constructed and could also be useful in the analyzis of critical phenomena. Gradually, continuing on the path he had taken, he arrived at constructing two-dimensional metrics in the temperature-density plane that were consistent with the principles of thermodynamics and went so far as to calculate the curvature scalar R of such metrics. A salient moment in the development of his thought was when he arrived at the physical interpretation of R:

| R | \propto ξ^{3}

(4)

where

ξ

is the statistical correlation length, which, as everyone knows, tends to infinity in the vicinity of critical points and phase transitions. Of course, ideal gases correspond to flat metrics with zero curvature

R = 0

and no critical points. Thus, thermodynamic curvature became a classical indicator of molecular interactions at the mesoscopic level, and in the first two decades of the 21st century, Ruppeiner contributed a series of very interesting papers on the use of Riemannian geometry in the study of thermodynamics and its critical phenomena: [22,24,25,26,27,28]. More recently, but following reflections developed over the years and expounded in his 2018 lectures at Wisla [29], Valentin Lychagin, a Russian mathematician for a long time professor at Tromso University in Norway, identified and systematically expounded within the framework of information theory, an interesting connection between contact geometry and thermodynamics, characterizing possible thermodynamic equilibrium states as Legendrian subvarieties of contact varieties. Because of the complex and general relationships between contact varieties and symplectic varieties (see Appendix A for a summary of contact and symplectic geometry), thermodynamic states can also be interpreted as Lagrangian subvarieties of symplectic varieties, and the canonical symplectic form on them naturally connects to a Riemannian metric, which is the one hypothesized and studied by Ruppeiner.

1.4. Gibbs States and Lie Group Generalized Thermodynamics

In a series of papers of which we quote only a small selection [30,31,32,33,34], that is most informative about the main idea, a group of French authors, including Charles-Michel Marle, Frédéric Barbaresco, Yann Cabanes and Pierre-Yves Lagrave, relying on old ideas of late Jean-Marie Souriau, have introduced the notion of Gibbs States of Mechanical Systems with Symmetries and of Lie Group Thermodynamics which bears a close similarity with the geometrical formulation of thermodynamics as expounded in Lychagin’s lectures [29], yet is quite distinct from it. As we explain in the main body of the present article, the original distinctive idea of Lie Group Thermodynamics is the definition of a subspace

Ω \subset G

of the Lie algebra of symplectic Killing vector fields

X \in G

that leave invariant a symplectic manifold

(M, ω)

in the sense that the Lie derivative along them of the symplectic form vanishes

L_{X} ω = 0

(5)

(compare Equation (5) with Definition A11 in Appendix A.7 of Liouville vector fields) such that the following integral (the partition function) is convergent:

\forall β \in Ω : Z (β) \equiv \int_{M} exp [- β \cdot P (Φ)] d λ (Φ) < \infty

(6)

where

Φ

are the coordinates on the 2n-dimensional differentiable manifold

M

;

d λ (Φ) \equiv \underset{n - times}{\underset{︸}{ω \land ω \land \dots \land ω}}

(7)

is its Liouville integration measure; and

P (Φ)

is the moment map (see below for the discussion of such a concept):

\forall k \in G P (Φ) : k ⟶ P_{k} (Φ) \in C^{\infty} (M)

(8)

The subspace

Ω \subset G

of the symmetry Lie algebra is named the space of generalized temperatures.

1.4.1. Symplectic Moment Map

On a d-dimensional Riemannian space

M

, admitting an isometry Lie algebra

U

, moment maps can be defined as linear mappings between

U

and smooth functions on

M

, valued in the holonomy algebra

Hol (M)

:

\begin{matrix} X \in U ⟶ P_{X} \in Hol (M) \times C^{\infty} (M), \\ \forall k_{X}^{i} Killing vector ⟶ {(P_{X})}_{i}^{j} \equiv \nabla_{i} k_{X}^{j} \in Hol (M) . \end{matrix}

(9)

satisfying certain equivariance conditions. (For a general characterization of the moment maps, see, for instance, https://arxiv.org/abs/1605.05559, accessed on 17 December 2025). For several applications, one considers moment maps with values in a specific subalgebra

H_{0}

of

Hol (M)

. For instance, if

M

is Kähler, Special Kähler, or Hyper-Kähler,

H_{0} = u (1)

, Lie algebra of the

U (1)

group of Kähler transformations. In this case, the compact

u (1)

generator has an invertible action on the tangent space to

M

. The curvature

K

of the corresponding

U (1)

-connection is a non-singular closed 2-form which provides a symplectic structure on the manifold, namely a maximal rank, closed 2-form

ω = K

. In general, the existence of a symplectic 2-form is not related to the metric properties of the manifold, or to the very existence of a metric. On a Riemannian manifold, endowed with a symplectic 2-form, we require the latter to be consistent with the Riemannian structure and, in particular, with its isometries. This is only possible if the Lie derivative of

ω

, with respect to all Killing vectors, vanishes. The only possibility for this is that the manifold be of Kähler type and

ω

is proportional to the Kähler 2-form

K

, since, in this case, the

u (1)

subalgebra, defining

K

, is in the center of the Holonomy algebra. Kähler, Special Kähler, and Hyper-Kähler manifolds are important ingredients in supergravity/supersymmetric gauge theories. Indeed, they play a very important role in the whole landscape of supersymmetric field theories: in particular, they are the basic building blocks in the construction of scalar potentials (see, for instance, the “Physics Report” [35], the book [36], and the general paper [37]). Moment maps also play a fundamental role in the resolution of singularities via Kähler and Hyper-Kähler quotients à la Kronheimer (for a review, see, for instance, the lecture notes [38] and all the vast literature quoted therein). In the series of papers [30,31,32,33,34], the authors rely on moment maps as a fundamental ingredient in the construction of partition functions for symplectic manifolds and the explicit examples they present, namely the cases of the hyperbolic plane and of the Siegel plane (see also [1] for the role that the latter might play in Machine Learning), the symplectic structure utilized to define the moment maps is the one provided by the Kähler 2-form; hence, it applies to the very manifold

M

, which in the considered examples is indeed Kählerian, rather then to its tangent bundle

TM

which, instead, is the geometrical substratum of the geodesic dynamical system that can be defined on every Riemannian manifold

M

.

1.4.2. Coadjoint Orbits

In the discussion of the thermodynamics that might be associated with symmetric spaces

U / H

, another source of possible conceptual confusions is given by the symplectic structure, named after Kirillov–Kostant–Souriau, that can be defined on coadjoint orbits of any Lie group

G

. This matter is presented in a crystal clear form in chapter 5 of the book [39].

As we just anticipated, and we show systematically in Section 2 and Section 3 we can define generalized temperatures and partition functions on a smooth manifold

M

, whenever the latter is endowed with a bona fide symplectic structure, namely a closed antisymmetric 2-form

ω

of maximal rank, and we have a Lie group

G

acting on

M

by means of diffeomorphisms:

\forall g \in G D (g) : M ⟶ M; \forall g_{1}, g_{2} \in G D (g_{1} \cdot g_{2}) = D (g_{1}) \circ D (g_{2})

(10)

which are generated by Hamiltonian vector fields, namely Killing vector fields

k_{A}

(see Definition (5) of symplectic Killing vector fields), satisfying its Lie Algebra

G

:

[k_{B}, k_{C}] = f_{B C}^{A} k_{A}; A, B, C = 1, 2, \dots, \dim G

(11)

Indeed, each symplectic Killing vector field

k

on the symplectic manifold

M

is associated with a moment map

P_{k}

, as already anticipated in Equation (8), which is a function on the manifold

M

:

P_{k} : M ⟶ R

(12)

locally satisfying the condition:

i_{k} \cdot ω = d P_{k}

(13)

where

i_{X} \cdot

is the contraction operation along the vector field

X

acting on any p-form and

d

is the exterior derivative also acting on any p-form.

The moment maps are Hamiltonians and can be used to define partition functions as in Equation (6), where a candidate generalized temperature is any element of the Lie algebra

G

.

Keeping this essential point in mind, we turn to coadjoint orbits of a Lie group

G

.

As explained in [39], for any Lie group

G

, one can define the dual

G^{★}

of its Lie algebra

G

, which is also a vector space of the same dimension, and from the adjoint action of the group on

G

:

\forall g \in G, \forall X \in G : {Adj}_{g} (X) \equiv g^{- 1} X g \in G

(14)

We obtain the coadjoint action of

G

on

G^{★}

as follows:

\forall g \in G, \forall X \in G, \forall λ \in G^{*}; {CoAdj}_{g} (λ) (X) \equiv λ ({Adj}_{g} (X)) = λ (g^{- 1} X g)

(15)

Fixing a particular element

λ \in G^{★}

, the coadjoint orbit

O_{λ}

is defined as the subset of elements of

G^{★}

that are images of

λ

under the coadjoint action of some group element of

G

:

\forall λ \in G^{★}; O_{λ} = \{μ \in G^{★} ∣ \exists g \in G / {CoAdj}_{g} (λ) = μ\}

(16)

Equation (16) can be decoded in a more friendly and usable way if, on the Lie algebra

G

, which is a vector space, we introduce a non-degenerate positive definite symmetric scalar product

〈, 〉

so that any element of

G^{★}

, by definition a linear functional on

G

, can be described as follows:

\forall λ \in G^{★}, \forall X \in G : λ (X) = 〈 λ^{†}, X 〉 where λ^{†} \in G

(17)

Given a set of generators

T_{A}

that form a basis for the vector space

G

, we have the symmetric, invertible, positive definite matrix

κ_{A B}

defined below, together with its inverse:

\begin{matrix} κ_{A B} & \equiv & 〈 T_{A}, T_{B} 〉 & ; & κ^{A B} \equiv {(κ^{- 1})}^{A B} \end{matrix}

(18)

and Equation (17) becomes:

λ^{†} = λ^{A} T_{A}; X = X^{B} T_{B}; λ (X) = λ^{A} X^{B} κ_{A B}

(19)

The adjoint representation of the Lie group is explicitly given by the adjoint matrix defined below:

g^{- 1} T_{A} g = A {(g)}_{A}^{B} T_{B}

(20)

and the coadjoint representation is defined by the position:

〈 CA {(g)}_{A}^{P} T_{P}, T_{R} 〉 = 〈 T_{A}, A {(g)}_{R}^{Q} T_{Q} 〉

(21)

which matrix-wise implies:

CA (g) = κ \cdot A (g) \cdot κ^{- 1}

(22)

If

λ^{A}

are the coefficients of the element

λ^{†} \in G

that defines the orbit, the latter is formed by all those elements of

G

that have the following form:

μ^{†} (g) = μ^{A} (g) T_{A} \equiv \underset{μ^{A} (g)}{\underset{︸}{λ^{B} CA {(g)}_{B}^{A}}} T_{A} \in G; g \in G

(23)

One might think that, as g varies in the group

G

for a generic choice of

λ^{†}

, the element

μ^{†} (g)

will span the entire Lie algebra

G

. If this were true, the coadjoint orbit would be diffeomorphic to the Lie group

G

. However, this is not true, since there is always a non trivial Lie subgroup

S (λ^{†}) \subset G

for which the adjoint action on

λ^{†}

is trivial:

g^{- 1} λ^{†} g = λ^{†} iff g \in S (λ^{†})

(24)

That in Equation (24) is the very definition of the stabilizer subgroup of the Lie algebra element

λ^{†}

, and such a subgroup is never trivial since it includes at least the one-dimensional subgroup generated by

λ^{†}

itself; for special choices of

λ^{†}

, the stabilizer can be much larger. This means that the coadjoint orbit

O_{λ}

is always diffeomorphic to a coset manifold, namely

G / S (λ^{†})

. The non-degenerate symplectic form

ω

of Kirillov–Kostant–Souriau is not defined on

G

; rather, it is defined on each coadjoint orbit labeled by a Lie algebra element

λ

, namely on a coset manifold

G / S (λ^{†})

. Hence, the symplectic manifolds that admit a Hamiltonian action of the group

G

are already all captured by the scan of all coset manifolds

G / H

where the subgroup

H \subset G

stabilizes some non-trivial element

λ^{†}

of the Lie algebra

G

.

Just for completeness, let us mention that the Kirillov–Kostant–Souriau symplectic 2-form

ω^{K K S}

defined on a coadjoint orbit is very simply given. Let

t_{A}^{♯}

be the realization on the orbit

O_{λ}

, namely on the coset manifold

G / S (λ^{†})

of the invariant vector fields

t_{A}

spanning the Lie algebra

G

. They form a basis of sections of the tangent bundle

T O_{λ}

. Hence, a 2-form

ω

is completely defined if we give its value on any pair of such vector fields. The Kirillov–Kostant–Souriau form is defined by setting

ω_{λ}^{K K S} (t_{A}^{♯}, t_{B}^{♯}) = f_{A B}^{C} κ_{C E} λ^{E}

(25)

We will restrict ourselves to the case in which

G = U

is a semisimple, isometry group of a symmetric space

U / H

. In this case,

κ_{A B}

is non-singular, and H is the maximal compact subgroup of

U

. The KKS symplectic form is invariant under

U

only if the element

λ^{†}

is central in

H

, the Lie algebra of

H

, namely if

U / H

is Kähler and

λ^{†}

corresponds to the Kähler

u (1)

generator.

1.5. Clearcut Distinctions

Notwithstanding whether the manifold

M

has a symplectic structure or not, its tangent bundle

TM

always has the symplectic structure associated with the Hamiltonian description of geodesic equations on

M

. Here comes the first essential distinction. Whenever we have a canonical dynamical system on a symplectic manifold

({SM}_{2 n}, ω)

, like the geodesic one where

{SM}_{2 n} = {TM}_{n}

, we can construct standard thermodynamics in geometrical formulation, starting from the minimization of the Shannon entropy functional and arriving at Gibbs states of the form:

\begin{matrix} G (λ, Φ) & = & \frac{exp [- λ \cdot H (Φ)]}{Z (λ)} \\ Z (λ) & \equiv & \int_{SM} exp [- λ \cdot H (Φ)] d λ (Φ) \end{matrix}

(26)

where

H (Φ)

is the multiplet of k Hamiltonians in involution admitted by the dynamical system (if the dynamical system is Liouville integrable, one has

k = n

, the dimension of the symplectic manifold being

2 n

. In general,

1 \leq k < n

, and it is just 1 for a generic dynamical system without conserved charges; including the standard one defined by the Legendre transform of the Lagrangian):

H (Φ) = \{H_{1} (Φ), \dots, H_{k} (Φ)\}; \underset{Poisson bracket}{\underset{︸}{\{H_{i}, H_{j}\}}} = 0 \forall i, j

(27)

λ \in R^{k}

is a vector of generalized temperatures, and

Z (λ)

is the partition function.

Following the conception reviewed in Section Conditional Minimalization of Information and the Partition Function and making reference to Equation (39) and following ones, we should also note that the stochastic vector variable

X (q)

, of which we fix the average value in order to define a probability distribution that extremizes the Shannon functional with constraints, does not need to be a set of Hamiltonians in involution and there is no need of integrability of any dynamical system in order to introduce a generalized thermodynamics. In this case, the space of events (see Appendix B for the basic definitions of probability theory) is a symplectic manifold endowed with Hamiltonian vector fields. We can use the moment-maps of the latter as a convenient set of stochastic variables in order to introduce a generalized thermodynamics by fixing their average values. Yet this is only a subclass of examples in a general class.

The Lie group thermodynamics advocated in [30,31,32,33,34] leads to Gibbs states of the form:

\begin{matrix} G (β, Y) & = & \frac{exp [- β \cdot P (Y)]}{Z (β)} \\ Z (β) & \equiv & \int_{M} exp [- β \cdot P (Y)] dg [Y] \end{matrix}

(28)

where

Y

denotes the coordinates of the Riemannian manifold

(M, g)

and the generalized temperature

β \in Ω

is an element of the Lie algebra

G

of the isometry group

G

such that the integral defining the partition function is convergent. Let us observe that in Equation (28)

dg [Y]

is the Riemannian integration measure which coincides with the Liouville measure (7) if

(M, g)

is a Kähler manifold and the Kähler 2-form

K

is utilized to define the symplectic structure on

M

.

A third possibility, which is the conceptual framework underlining [30,31,32,33,34], is to use the setup of Equation (28) using, however, as substratum manifold

M

some coadjoint orbit of

O_{λ^{†}}

under the action of a group

G

of some special Lie algebra element

λ^{†} \in G

. As extensively discussed in the previous subsection, coadjoint orbits are, anyhow, coset manifolds, and it is conceptually much more economic to start from the coset manifold structure, asking oneself the question: Given

G / H

, what is the Lie algebra element

λ^{†} \in G

that is stabilized by the chosen subgroup

H

? The answer is fairly simple. In view of the discussion of the previous section, it is clear that

λ^{†} \in H \subset G

since the one-parameter group generated by

λ^{†}

must be contained in

H

. Therefore, the condition is just:

[H, λ^{†}] = 0

(29)

The solution of the constraint (29) is immediate. The Lie algebra

H

must have the following structure:

H = H^{'} \oplus H_{0}; H_{0} = span [λ^{†}]

(30)

and the unidimensional Lie algebra

H_{0}

is either

R

or

u (1)

depending on whether the Lie algebra element

λ^{†}

is non-compact or compact.

1.5.1. Kähler Non-Compact Symmetric Spaces

In view of Cartan Neural Networks, where the relevant manifolds are non-compact symmetric spaces

U / H

, with

H \subset U

, the maximal compact subgroup of a non-compact simple Lie group, it follows that

H

is compact and

H_{0} = u (1)

. This has a universal and simple interpretation: the presence in the isotropy subgroup

H

of a factor

U (1)

simply means that

U / H

is a Kähler manifold, and that the symplectic structure is provided by the Kähler 2-form.

1.5.2. Hence Two Cases

Summarizing the previous discussion, we conclude that there are just two distinct cases of geometrical thermodynamics related to non-compact symmetric spaces

U / H

(A): The thermodynamics associated with the Geodesic Dynamical System (GDS) on $U / H$ , where the symplectic structure is that provided by the phase-space of the GDS, existing for all manifolds and in particular for all symmetric spaces $U / H$ .
(B): Kähler thermodynamics on the symmetric spaces $U / H$ defined by

$\begin{matrix} G_{K} (β, Y) & = & \frac{exp [- β \cdot P (Y)]}{Z_{K} (β)} \end{matrix}$

(31)

$\begin{matrix} Z_{K} (β) & \equiv & \int_{U / H} exp [- β \cdot P (Y)] \underset{n - times}{\underset{︸}{K \land K \land \dots \land K}} \end{matrix}$

(32)

$\begin{matrix} \dim \frac{U}{H} & = & 2 n; n \in N \end{matrix}$

(33)

$\begin{matrix} K & = & K ä hler 2 - form \end{matrix}$

(34)

$\begin{matrix} i_{k_{A}} K & = & d P_{A} \end{matrix}$

(35)

where $P (Y)$ denotes the vector of moment maps $P_{A} (Y)$ associated with a basis of Killing vectors $k_{A}$ that correspond to a basis $T_{A}$ of generators of the $U$ Lie algebra and $β = β^{A} T_{A} \in Ω \subset U$ is a generalized temperature vector such that the partition function integral (32) converges.

1.6. Relevance for Cartan Neural Networks

In the new paradigm of Cartan Neural Networks, all the manifolds that model the hidden layers and to which data are injected are diffeomorphic to as many solvable Lie groups

S

. Furthermore, they have a simple group

U

of isometries, which gives rise to a Lie algebra

U

. For all these manifolds, the geodesic dynamical system is completely integrable, and one can construct a nice algebraic resetting of the corresponding Hamiltonian setup liable to be used in the study of Gibbs states of the conventional type defined by Equation (26). The use of such Gibbs states in Machine Learning algorithms based on the CaNNs paradigm is a perspective to be investigated with care. However, it must be noted that, as we show in the sequel, the Gibbs probability distribution depends only on the momenta (velocities) and not on the positions in the manifold

U / H

. Hence, if one is interested in probability distributions on the very manifold to which data are mapped, the Geodesic Dynamical System thermodynamics seems to be of little use.

On the other hand, in the organization of non-compact symmetric spaces into Tits Satake universality classes, there are entire classes consisting of Kähler manifolds, for instance, the

r = 2

class (see [1] for details on the classification):

M^{[2, q]} \equiv \frac{SO (2, 2 + q)}{SO (2) \times SO (2 + q)}

(36)

Hence, for such cases, the possibility of defining Kähler thermodynamics and corresponding Gibbs states à la Souriau as in Equation (31) arises. Once again, the use of such generalized Gibbs states in Machine Learning algorithms has to be studied, yet its viability is guaranteed by the already existent applications to radar signal analyzis and to other time series discussed in the thesis [40] and in all references quoted therein (in particular [5]). Indeed, such Gibbs states provide a Gaussian-like probability distribution on the very manifold

U / H

, rather than on the fibres of its tangent bundle.

The perspective of Gibbs states à la Souriau for non-compact symmetric spaces

U / H

that are Kählerian requires a systematic theoretical study, which seems to be so far missing, namely that of an intrinsic characterization of the subspace

Ω

of generalized temperatures inside the relevant

U

algebras. We consider such a study an interesting priority and address it in the present paper, both in its general form and in the case study of two examples. We found a general answer that was so far missing: the space of generalized temperature is the adjoint orbit of the positivity domain in the space of the Cartan subalgebra of the compact subalgebra

H

(see Section 7).

Furthermore, let us also recall that in [31], Barbaresco has claimed that the geometry of Lie Group thermodynamics is to be identified with the Riemannian Geometry of Information introduced several years ago by Rao [41] and Chentsov [42] (see the review paper [43] and the book [44] whose author is frequently credited for the introduction of Information Geometry in the Data Science community). Once the conceptual framework is clarified, as we hope to have done with the present paper, the relation between Kähler thermodynamics and the Riemannian metric naturally associated with equilibrium states, via the general setup of generalized geometrical thermodynamics, becomes clear and universal, as we are going to show.

1.7. Outline of This Paper

The present paper is organized as follows. First, we briefly recall the basic principles of generalized thermodynamics and of its link with Shannon’s information functional. Next, we analyze the general structure of the Geodesic Dynamical System and its specialization to the case of symmetric spaces

U / H

. This is instrumental in completing the Poissonian structure on dual solvable Lie algebras into a full symplectic structure on the tangent bundle of non-compact symmetric spaces. In this perspective, we can study examples of generalized thermodynamics associated with integrable dynamical systems and show that they are too simple and of little interest for Machine Learning applications. We come next to discuss generalized thermodynamics à la Souriau, and it is in this context that we obtain our most relevant results that are summarized in the conclusive Section 8. We do not anticipate them here. We just say that, according to our opinion, thanks to a strategic use of the metric equivalence of non-compact symmetric spaces

U / H

with appropriate solvable Lie groups, we have established generalized thermodynamics à la Souriau on clear general principles for all non-compact symmetric spaces that are Kähler manifolds, introducing, in this way, a new powerful weapon for Machine Learning algorithms.

We have equipped our paper with several mathematical and physical appendices in order to make it self-contained and readable to a larger audience.

2. Shannon Information Entropy and the Partition Function

In his celebrated 1948 paper [45], Claude Elwood Shannon introduced what is called the entropy of information relative to a probability density

ρ

defined on some measurable space

Ω

(see Appendix B for a summary of the fundamental principle and concepts of probability theory as exposed in standard textbooks like [46]).

Let

q \in Ω

be a point in the stochastic space we consider; let

d μ (q)

be the integration measure on

Ω

; and let

ρ (q) \in [0, 1]

be the value in

q

of the probability density that is obviously normalized as follows:

N [ρ] \equiv \int_{Ω} ρ (q) d μ (q) = 1

(37)

The measure of information contained in the probability distribution

ρ

was defined by Shannon by means of the following functional:

I [ρ] \equiv - \int_{Ω} ρ (q) log [ρ (q)] d μ (q)

(38)

Conditional Minimalization of Information and the Partition Function

The precise conceptual connection between Information Theory and Statistical Mechanics and thus with Thermodynamics can be made through the notion of conditional minimization introduced by Jaynes in 1957 who, in the papers [47,48], clarified transparently and definitively the logical relationship between Shannon’s theory and Statistical Thermodynamics. Thanks to recent works [29,49,50,51,52], this relationship is further clarified in geometric terms and completes the design of the conceptual framework in which the association of a Riemannian metric with Thermodynamics obtains a solid foundation.

We pose the following problem: determine the probability distribution that extremizes the functional

I [ρ]

under the following two conditions:

(A): The correct normalization (37) should hold true.
(B): The average value of a certain stochastic vector $X$ should be fixed to a certain precise vector $x \in V$ :

〈 X 〉 \equiv \int_{Ω} X (q) ρ (q) d μ (q) = x \in V

(39)

The classical way to solve this problem is to use variational calculus in the presence of Lagrange multipliers. One introduces

r + 1

multipliers:

λ_{0}

associated with the normalization constraint (37) and

r = \dim V

multipliers

λ^{i}

that we can regard as the components of a vector in

λ \in V^{★}

that are associated with the constraints (39). Thus, the new functional to be extremized is as follows:

F [ρ] = - I [ρ] - λ_{0} (N [ρ] - 1) + λ \cdot (〈 X 〉 - x)

(40)

The variation of the functional in

δ ρ

yields

\frac{δ F [ρ]}{δ ρ} = log [ρ] + 1 - λ_{0} + λ \cdot X = 0

(41)

which implies:

ρ (q) = exp [λ_{0} - 1 - λ \cdot X (q)]

(42)

Imposing the normalization constraint (37) fixes the value of

λ_{0}

so that the final expression of the extremal probability distribution is the following:

ρ_{e x} (q) = \frac{exp [- λ \cdot X (q)]}{Z (λ)}

(43)

where

Z (λ) \equiv \int_{Ω} exp [- λ \cdot X (q)] d μ (q)

(44)

is the Partition Function and, for reasons that will become immediately clear, the following object

H^{s t o c h} (λ) = - log [Z (λ)]

(45)

is named the stochastic Hamiltonian. As a consequence of the definition (45), the value

x

imposed to the stochastic vector

X

is obtained from the Hamiltonian by means of a derivative:

x = d_{λ} H^{s t o c h} (λ) \Rightarrow short hand for x_{i} = \frac{\partial}{\partial λ^{i}} H^{s t o c h} (λ_{1}, \dots, λ_{r})

(46)

Calculating Shannon Entropy Functional (38) on the extremal probability distribution (43) with elementary algebra, we obtain

- I [ρ_{e x}] = H^{s t o c h} (λ) - λ \cdot x = H^{s t o c h} (λ) - λ^{i} \frac{\partial}{\partial λ^{i}} H^{s t o c h} (λ)

(47)

which has the form of a Legendre transform. Hence, the Shannon functional plays the same role as that of a Lagrangian; the stochastic Hamiltonian is indeed a Hamiltonian; the intensive variables of Thermodynamics (i.e., the Lagrange multipliers

λ

) are the momenta; and the average values

x^{i}

are the coordinates.

Next, we turn to classical thermodynamics. We refer to Appendix C for a recollection of its basic concepts and constructions, which are presented in order to fix notation and also for the benefit of those readers who are not physicists by education. In the next Section 3, we illustrate the geometric reformulation of classical thermodynamics in the context of contact and symplectic geometry, which leads to the introduction of the new notion of thermodynamical curvature.

Note that what is named metric of Information Geometry in the Machine Learning literature is the following Hessian obtained from a parameterized by a vector

λ = {λ_{1}, \dots, λ_{r}}

probability distribution

ρ_{λ} (X (q))

of the stochastic variable

X (q)

over the manifold of events:

d s_{i n f o}^{2} \equiv \frac{\partial^{2}}{\partial λ^{i} \partial λ^{j}} log [ρ_{λ} (X (q))] d λ^{i} \times d λ^{j}

(48)

When the probability distribution is the generalized Gibbs one of Equation (43), we find

d s_{i n f o}^{2} \equiv \frac{\partial^{2}}{\partial λ^{i} \partial λ^{j}} H^{s t o c h} (λ) d λ^{i} \times d λ^{j}

(49)

As we will show in the next section, the metric (49) coincides with the thermodynamics metric introduced in geometrical thermodynamics, much before the advent of Machine Learning contributions.

3. Geometrical Structure of Thermodynamics

In this section, we present the reformulation of classical thermodynamic laws in geometrical terms, based on what we explained in Section 2, where we elucidated the relation between information theory and statistical mechanics. It is now time to show how classical thermodynamic laws are linked with the notion of a contact manifold

(M^{2 n + 1}, ξ_{α})

, defined by a suitable contact 1-form

α

and certain Legendrian submanifolds

L_{n} \subset M^{2 n + 1}

of the latter, specifically defined and identifiable with Lagrangian submanifolds

L_{n} \subset S_{2 n}

of a symplectic manifold

(S_{2 n}, ω)

that is canonically associated with the contact one according to the scheme (A43). The Lagrangian vision leads to the definition of a canonical Riemannian metric induced on

L_{n}

, which is the most relevant novelty of the introduced conceptual framework.

Indeed, calculating the thermodynamical curvature is a new powerful investigation tool in all applications.

The intuition of the relevance of thermodynamical curvature as a probe of molecular interactions at the mesoscopic level is indeed, as we stressed in the introductory Section 1, particularly due to Ruppeiner.

There is, however, something even more pertinent that should be stressed. Recalling the fundamental relation between Information Theory and Statistical Mechanics outlined in Section 2 and contextually illustrated in Appendix C it appears that the geometric reformulation of classical thermodynamics has a much wider scope than physical or chemical systems. Indeed, any conditioned probability distribution describing whatever phenomena and fitting to whatever Big Data system defines a thermodynamical setup and would lead to equations of state if we knew the probability distribution and were able to calculate the partition function. The geometrical formulation of the equations of state as embedding functions of Lagrangian submanifolds is a scheme that can be utilized in an inverse engineering procedure to work out the probability distribution and possibly learn it from Data Behavior. This is a challenging possibility for Deep Learning, completely unexplored at the present moment.

3.1. The Geometric Reformulation

To develop the program announced above, we need only to collect the ideas already introduced, focusing on the standard Darboux expression of a contact form given in Equation (A34) and on Equation (47), which shows that the Functional measuring Information

I [ρ_{e x}]

is related to the stochastic Hamiltonian

H (λ)

by a Legendre transform. Summarizing, we can say that in thermodynamics, we have

n + 1 \geq 3

extensive variables collectively denoted

x_{0}, x_{i}

(

i = 1, \dots, n

) which explicitly might be identified as follows:

Internal Energy U;
Entropy S;
Volume V;
Molar Fractions $N_{ℓ}$ ( $ℓ = 1, \dots, n - 3$ );

and n-intensive variables; collectively denoted

λ^{i}

and explicitly identified as:

Temperature T;
Pressure P;
Chemical Potentials $μ^{ℓ}$ ( $ℓ = 1, \dots, n - 3$ ).

The first principle of thermodynamics, combined with the second, can be formulated by stating that the following differential form vanishes:

0 \approx \tilde{α} \equiv d U - T d S + P d V - \sum_{ℓ = 1}^{n - 3} μ^{ℓ} d N_{ℓ} = d x_{0} + \sum_{i = 1}^{n} λ^{i} d x_{i}

(50)

The last form à la Darboux of

\tilde{α}

follows from the identification of

x_{0}

with the internal energy U and the remaining coordinates as follows

λ = \{- T, P, - μ^{ℓ}\}

e

x^{i} = \{S, V, N_{ℓ}\}

. Obviously, because of its form, à la Darboux

\tilde{α}

satisfies the defining condition in order to be a contact 1-form, namely:

\underset{n times}{\underset{︸}{d \tilde{α} \land d \tilde{α} \land \dots \land d \tilde{α}}} \land \tilde{α} \neq 0

(51)

To better conjugate the emerging contact geometry underlying thermodynamics with the Equation (47) that identifies, minus a multiplicative factor, the information measure with thermodynamic entropy (see Equation (A93)), it is convenient to multiply the form

\tilde{α}

introduced in Equation (50) times a factor

1 / (k_{B} T)

, obtaining in this way:

\begin{matrix} α & = & {(k_{B} T)}^{- 1} \tilde{α} = - k_{B} d S + {(k_{B} T)}^{- 1} d U + {(k_{B} T)}^{- 1} P d V - \sum_{ℓ = 1}^{n - 3} {(k_{B} T)}^{- 1} μ^{ℓ} d N_{ℓ} \\ = & d I - \sum_{i = 1}^{n} λ^{i} d x_{i} \end{matrix}

(52)

where the new definition of the

2 n + 1

coordinates is the following one:

λ = \{- \frac{1}{k_{B} T}, - \frac{P}{k_{B} T}, \frac{μ^{ℓ}}{k_{B} T}\}; x = \{U, V, N_{ℓ}\}; x_{0} = I

(53)

Obviously, the 1-form

α

satisfies the same condition (51) as

\tilde{α}

and, therefore, it is also a contact 1-form. Thus, we have defined a contact manifold

(M_{2 n + 1}, ξ)

, where

ξ = \ker α

is the contact structure

M_{2 n + 1} = R^{2 n + 1}

has

2 n + 1

coordinates

\{I, λ, x\}

, the variable I being, at this stage, a free coordinate, just as

λ

and

x

.

3.1.1. Legendrian Submanifolds

Definition A7 of Legendrian submanifolds given in Appendix A states that they are isotropic submanifolds of maximal dimension n of a contact manifold

M_{2 n + 1}

. On the other hand, we recall that an isotropic submanifold is a submanifold such that its tangent bundle is in the kernel of the contact form, namely, the 1-form

α

vanishes on each Legendrian submanifold. The great intuition of the authors of [49,50] has been that of identifying, independently from the utilized coordinates and hence in an intrinsic way, the thermodynamic equilibrium states with the points of specific Legendrian submanifolds of the ambient space. In terms of the theory of conditional minimization discussed in Section Conditional Minimalization of Information and the Partition Function, it is very simple to define the Legendrian submanifolds that represent the thermodynamic equilibrium states.

Definition 1.

Any admissible thermodynamic state can be identified with a point in the following n-dimensional submanifold of the contact manifold:

L_{n} = \{I = I (λ, x), x_{i} = \frac{\partial}{\partial λ^{i}} H (λ)\} \subset M_{2 n + 1}

(54)

Theorem 1.

The submanifold

L_{n}

defined by means of Equation (54) is isotropic and hence Legendrian.

Proof.

The proof is extremely simple. It suffices to recall Equation (47). Using that relation, we can evaluate the total differential

d I

, as follows:

d I = d I (λ, x) = d (H (λ) - λ \cdot x) = (\frac{\partial}{\partial λ^{i}} H (λ) - x^{i}) d λ^{i} - λ^{i} d x_{i} = - λ^{i} d x_{i}

(55)

Hence, on the submanifold (54), we have

d I + λ^{i} d x_{i} = 0

□

3.1.2. The Lagrangian Submanifold and Its Metric

Given the original contact variety

M^{2 n + 1}

with the contact 1-form given by the presentation (52), we see at once that the Reeb vector field is

R = \frac{\partial}{\partial I}

(56)

In fact, it satisfies the two conditions:

α (R) = 1; d α (R, X) = 0 \forall X \in Γ [{TM}^{2 n + 1}, M^{2 n + 1}]

(57)

On the other hand, from the general discussion in Appendix A.8, we know that every

2 n

-dimensional submanifold of a contact manifold

M^{2 n + 1}

that is transverse to the Reeb vector field of the latter is a symplectic manifold

S^{2 n}

whose symplectic 2-form is the restriction to

S^{2 n}

of the exterior differential of the contact 1-form:

ω = d α ∣_{S^{2 n}}

(58)

Hence, applying these general notions to the case at hand, we see that the symplectic variety transverse to Reeb’s vector (56) is given by the following projection map:

π : M^{2 n + 1} \to S^{2 n}; π (I, λ, x) = (λ, x)

(59)

and the symplectic 2-form is as follows:

ω = - \sum_{i = 1}^{n} d λ^{i} \land d x_{i}; d α = π^{★} (ω)

(60)

Let us now consider the Legendrian submanifold

L_{n} \subset M^{2 n + 1}

which contains the thermodynamic equilibrium states introduced in Definition 1. It is obvious that we can consider its image through the projection map (59):

S^{2 n} \supset L_{n} \equiv π (L_{n})

(61)

The important result is that the submanifold

L_{n}

thus defined is a Lagrangian submanifold, namely one on which the symplectic form completely vanishes. The demonstration of this fact follows immediately from the definition. In fact, we can translate Equation (61) into the following constructive definition:

L_{n} = \{x^{i} = \frac{\partial}{\partial λ^{i}} H (λ)\}

(62)

Using the latter, we find:

{ω|}_{L_{n}} = - \sum_{i, j = 1}^{n} d λ^{i} \land d λ^{j} \partial_{i} \partial_{j} H (λ) = 0

(63)

which follows because of the commutativity of the partial derivatives. We can therefore conclude that the thermodynamic equilibrium states are immersed in a Lagrangian submanifold of the thermodynamic symplectic space

(S^{2 n}, Ω)

, the coordinates of this latter being

λ

and

x

in terms of traditional thermodynamic variables, specified by relations in (53).

3.1.3. The Canonical Riemannian Metric on the Lagrangian Submanifold

The Lagrangian submanifold

L_{n}

is naturally equipped with a Riemannian canonical metric, which is the image

d s_{L_{n}}^{2} = ι^{★} (d s_{c a n}^{2})

(64)

through the pull-back

ι^{★}

of the immersion map:

ι : L_{n} \overset{ι}{↪} S^{2 n}

(65)

of the canonical flat metric on the symplectic ambient manifold:

d s_{c a n}^{2} = \frac{1}{2} \sum_{i = 1}^{n} (d λ^{i} \otimes d x^{i} + d x^{i} \otimes d λ^{i})

(66)

The canonical Riemannian metric (66) on the ambient symplectic space corresponds to assuming the standard complex structure and standard symplectic 2-form matrices

I = (\begin{matrix} - i 1_{n \times n} & 0 \\ 0 & i 1_{n \times n} \end{matrix}); K = \frac{1}{2} (\begin{matrix} 0 & i 1_{n \times n} \\ - i 1_{n \times n} & 0 \end{matrix})

(67)

so that, according to the general theory, the canonical Hermitian metric is indeed given by the matrix:

G = K \cdot I

(68)

The Riemannian metric (64) is the one that was promoted to an investigation tool of mesoscopic physical chemistry of critical phenomena, in particular by Ruppeiner and collaborators. This metric would be perfectly defined and calculable if we explicitly knew the stochastic Hamiltonian

H (λ)

, since by means of the immersion equations we get:

d s_{L_{n}}^{2} = - H_{i j} (λ) d λ^{i} \times d λ^{j}; H_{i j} (λ) \equiv \partial_{i} \partial_{j} H

(69)

where

H_{i j}

is named the Hessian. As we already anticipated above, the metric (69) exactly coincides with the metric (49) and, hence, with the metric (48) named the Information Geometry metric in the Machine Learning literature, when the probability distribution is the generalized Gibbs distribution (43). This is also reminiscent of the AMSY symplectic formulation of Toric Kähler Geometry in the action-angle coordinates, for which we refer the reader to [53,54] and, for a review, to [38] and to the original references there cited. The problem is that the stochastic Hamiltonian is by definition the negative of the logarithm of the partition function

Z (λ)

and generally beyond the reach of explicit analytical computation, as we have repeatedly pointed out. In the absence of this generally inaccessible tool, there was an extensive research activity in devising and proposing phenomenological equations of state, each of which provides an explicit recipe for calculating the metric (64). In the next subsection, we will examine this methodology in general for the case where

n = 2

, corresponds to the canonical ensemble and is the most frequently utilized in the field of phenomenological equations of state.

3.1.4. The Lagrangian Submanifold in the Two-Dimensional Case and Its Riemannian Structure

In order to illustrate the general concepts and as a term of comparison with the subsequent examples of geometrical thermodynamics on Riemannian manifolds and, in particular, on symmetric spaces, we provide here a brief sketch of the geometrical treatment for physical thermodynamics. In the simplest situation, we deal with a 4-dimensional symplectic space with coordinates

T, P, U, V

, where the former two are intensive quantities (temperature and pressure), and the latter two are extensive ones: internal energy and volume (we recall that the fifth coordinate to complete the odd-dimensional contact manifold is the entropy S). In the even-dimensional space, the symplectic 2-form

ω

is the following:

ω \equiv d [T^{- 1}] \land d U + d [T^{- 1} P] \land d V

(70)

The following two are assumed to be the embedding equations of the two-dimensional Lagrangian variety:

(A): The thermic equation

$f (P, T, V, U) \equiv P - A (T, V)$

(71)
(B): The caloric equation

$g (P, T, V, U) \equiv U - B (T, V)$

(72)

The first condition is the true equation of state. The second must be found in agreement with the constraint that the hypersurface cut out by the two constraints in the symplectic manifold should be Lagrangian (namely, should make the 2-form

ω

vanish). Introducing the shorthand

w \equiv {P, T, V, U}

, we can write

{SM}_{4} \supset L_{2} = \{w \in {SM}_{4} ∣ f (w) = 0 and g (w) = 0\}

(73)

The surface

L_{2}

is Lagrangian if the symplectic 2-form vanishes when restricted to it, which is equivalent to saying that the Poisson bracket of the two embedding functions is zero:

ω |_{L_{2}} = 0 \Leftrightarrow \{f, g\} = 0

(74)

Substituting the equations of state (71) and (72) into the symplectic form (70), we obtain:

ω |_{L_{2}} = \frac{d T \land d V (T \partial_{T} A (T, V) - A (T, V) - \partial_{V} B (T, V))}{T^{2}}

(75)

Hence, the constraint to be satisfied by the two immersion functions

A (T, V), B (T, V)

in order, for The immersed submanifold to be Lagrangian is that the image through the projection

π

of a Legendrian submanifold

L_{2}

immersed in the contact manifold

M_{5}

should be the following:

T \partial_{T} A (T, V) - A (T, V) - \partial_{V} B (T, V) = 0

(76)

Therefore, the canonical Riemannian metric on the Lagrangian submanifold

L_{2}

is the following:

d s_{L_{2}}^{2} = d [T^{- 1}] \otimes d B (T, V) + d [T^{- 1} A (T, V)] \otimes d V

(77)

which makes sense if and only if the constraint (76) is satisfied. We can verify Equation (76) in the familiar case of Ideal Gases, recalling Equations (A112)–(A114) from which we see that in the ideal gas case, we have:

A (T, V) = \frac{k_{B} N T}{V}; B (T, V) = \frac{3}{2} k_{B} T

(78)

In this case, the thermodynamical metric (77) becomes:

d s_{I G}^{2} = - k_{B} (\frac{3}{2} {(\frac{d T}{T})}^{2} + N {(\frac{d V}{V})}^{2})

(79)

which is obviously a flat metric. Indeed, it suffices to change variables (

T = \sqrt{(} \frac{2}{3}) log [x]

,

V = \frac{1}{\sqrt{N}} log [y]

) and (79) becomes proportional to the standard Euclidean metric on

R^{2}

.

In Appendix C.5, as an illustrative counterexample, we briefly discuss the van der Waals model of a real gas equation of state, and we show that the immersion functions of the Lagrangian equilibrium submanifold (78) are substituted by the new immersion functions (A122), which also satisfy the Lagrangian constraint (76), as they should. Correspondingly, the flat thermodynamical metric (79) is replaced by its van der Waals equivalent, shown in Equation (A123), which is not flat. Its curvature, shown in Figure A2, has a non-trivial behavior and displays a singularity along the critical curve separating the gas from the liquid phase.

3.2. General Conclusion of This Section

What we have shown above is the use of the geometrical definition of equilibrium states as Lagrangian submanifolds in the context of equations of state for conventional thermodynamical systems. Yet we should keep in mind that the identification of the thermodynamical metric with the Hessian of the stochastic Hamiltonian displayed in Equation (69) is general and applies to any Gibbs state probability distribution of whatever type. Indeed, given any dynamical system, we can define the partition function as in Equation (26), and we obtain the stochastic Hamiltonian from Equation (45). This is also true for the geodesic dynamical system that we discuss in the next section and for Kähler thermodynamics of non-compact symmetric spaces discussed in later sections. The thermodynamical curvature, as we are going to see, also exists in this case and might display singular behaviors signaling critical phenomena.

4. The Geodesic Dynamical System

As recalled in the introduction and fully explained in [2], Cartan Neural Networks are based on the scheme:

V_{i n p u t} \underset{injection}{\underset{︸}{\overset{ι_{[Q, Λ]}}{↪}}} \underset{hidden layers}{\underset{︸}{U_{1} / H_{1} \overset{{\hat{K}}_{[W_{1}, Ψ_{2}]}^{1}}{⟶} U_{2} / H_{2} \overset{{\hat{K}}_{[W_{2}, Ψ_{3}]}^{2}}{⟶} \dots \overset{{\hat{K}}_{[W_{N - 1}, Ψ_{N}]}^{N - 1}}{⟶} U_{N} / H_{N}}} \underset{output map}{\underset{︸}{⇛ V_{o u t p u t}}}

(80)

where the hidden layers

U_{i} / H_{i}

are non-compact symmetric spaces, metrically equivalent to as many solvable Lie groups

S_{i} \subset U_{i}

, and the maps

K_{i} : U_{i} / H_{i} \to U_{i + 1} / H_{i + 1}

are isometric mappings endowed with general covariance with respect to the transformations of both the

U_{i}

and the

U_{i + 1}

group. The initial injection map and the final output map depend on the task for which the network architecture is designed and its categorical type. For instance in the frequent case of the classification task (e.g., for images) implemented with a simple CaNN, the initial datum is regarded as a single vector

Ξ_{i}

(e.g., the list of all pixels) and the injection map is a linear relation between the solvable coordinates (see [1,2] for the general definition of solvable coordinates on

U / H

manifolds)

Y_{A}

of a point in the first hidden layer and the components of the datum vector:

Y_{A} = Q_{A}^{i} Ξ_{i} + Λ_{A}

(81)

where

Q_{A}^{i}

is a

\dim [U_{1} / H_{1}] \times \dim [V_{i n p u t}]

matrix and

Λ_{A}

a

\dim [U_{1} / H_{1}]

-dimensional vector, both being targets of the learning process (see section 6.3 of [2]). In the same case of the classification task and in the same categorical type of a simple CaNN, the output map is schematically described below (see Equation (6.11) of [2]):

\underset{output map}{\underset{︸}{⇛ V_{o u t p u t}}} = \underset{partition}{\underset{︸}{\overset{S}{⟶} M_{+} \cup M_{-}}} \underset{\log . regr .}{\underset{︸}{\overset{σ}{⟶} {[0, 1]}_{o u t}}}

(82)

where

\overset{S}{⟶}

is the partition map of the last layer into two or more disjoined components, induced by one or more separator submanifolds, defined in Definition 6.1 of [2], whose general constructive theory is instead presented in [4]. After separation, the final classification output is obtained with the probabilistic setup of the logistic regression or of its multi-component generalization, i.e., the softmax.

In the case of the Convolutional Cartan Neural Networks outlined in section 1.4 of [4], the hidden layers are chosen as

U_{i} / H_{i} = M^{[r, q_{i}]}

, where

M^{[r, q]} \equiv \frac{SO (r, r + q)}{SO (r) \times SO (r + q)}

(83)

Fixing r once for all, the hidden layers have to be regarded as the total spaces of Tits Satake vector bundles sharing the same base manifold, namely the Tits Satake submanifold

M^{[r, 1]}

:

\begin{matrix} M^{[r, q]} & = & tot [E^{[r (q - 1)]}] \\ E^{[r (q - 1)]} & \overset{π_{T S}}{⟶} & M^{[r, 1]} \end{matrix}

(84)

that have structural group

G_{struc} = SO (r) \times SO (q - 1)

(85)

and having as standard fibre a vector space of dimension

r \times (q - 1)

F = V^{(r ∣ q - 1)}

(86)

in the mentioned direct product representation of

G_{struc}

.

In any case, the important feature of the manifolds corresponding to all the inner layers of the network is what we already emphasized, namely that they are Cartan–Hadamard manifolds, hence diffeomorphic to

R^{n}

and each metrically equivalent to a suitable solvable Lie subgroup manifold

S \subset U

. As such, all the

U_{i} / H_{i}

admit a uniquely defined distance function

d (u, v)

between any two points

u, v \in U_{i} / H_{i}

and such a distance function is the length of the unique geodesic arc starting at u and ending in v. The existence of a distance function is essential for all Machine Learning algorithms, and it is for this reason that the properties of the geodesics, their general construction for

U_{i} / H_{i}

symmetric manifolds, and the structure of the distance function were carefully analyzed in [1].

In the present section, we reconsider the problem of geodesics from the point of view of Hamiltonian mechanics. This is a necessary step in order to transform the space of geodesics into a symplectic manifold and introduce the thermodynamical structures discussed in Section 2 and Section 3. We begin with a general framework by recasting the geodesic problem on a generic Riemannian manifold

(M, g)

into the form of a Hamiltonian dynamical system. Then we specialize such a general setup to the case of

U / H

non-compact symmetric spaces, metrically equivalent to appropriate solvable Lie groups,

S_{U / H}

, and show how the general framework specializes to such a case, revealing additional relevant properties.

4.1. The Geodesic Dynamical System in General

Let

(M, g)

be a generic finite-dimensional Riemannian space

\dim_{R} (M) = d < \infty

and let us consider the problem of deriving the second-order differential equations whose solutions are its geodesic curves:

γ : R ⟶ M

(87)

In each coordinate patch

x^{α} = {x^{1}, \dots, x^{d}}

, every geodesic is described by d functions

x^{α} (t)

, where

t \in R

is the affine parameter. As explained in [55] (volume 1, page 145), the well-known geodesic differential equations can be derived as the Euler–Lagrange equations of a mechanical system whose Lagrangian is the following:

L (x, \dot{x}) = \frac{1}{2} g_{α β} (x) {\dot{x}}^{α} {\dot{x}}^{β}

(88)

having denoted by

{\dot{x}}^{α} \equiv d x^{α} / d t

the generalized velocities and by

g_{α β} (x)

the metric tensor. We can easily convert the Lagrangian system (88) into a Hamiltonian one, introducing, as usual, the canonical momenta:

p_{α} \equiv \frac{\partial L}{\partial {\dot{x}}^{α}} = g_{α β} (x) {\dot{x}}^{β} \Rightarrow {\dot{x}}^{α} = g^{α β} (x) p_{β}

(89)

where

g^{α β} (x)

is the inverse metric, and defining the Hamiltonian as the Legendre transform of the Lagrangian:

H (p, x) \equiv {\dot{x}}^{α} p_{α} - L (x, \dot{x}) = \frac{1}{2} g^{α β} (x) p_{α} p_{β}

(90)

According to the definitions and conventions of Appendix A.7, we define the canonical symplectic manifold by introducing the

2 d

canonical coordinates

Z^{Λ} = \{p_{α}, x^{β}\}; α, β = 1, \dots, d

(91)

and the symplectic 2-form

ω \equiv ω_{Λ Σ} d Z^{Λ} \land d Z^{Σ}; ω_{Λ Σ} = \frac{1}{2} (\begin{matrix} 0_{d \times d} & 1_{d \times d} \\ - 1_{d \times d} & 0_{d \times d} \end{matrix})

(92)

which agrees with Definition A9, being closed, non-degenerate, and of maximal rank as in Equation (A35). In terms of canonical momenta and canonical coordinates, one has:

ω = \sum_{α = 1}^{d} d p_{α} \land d x^{α}

(93)

and according to the Definition A9, the total space

tot [TM]

of the tangent bundle

TM \overset{π}{⟶} M

(94)

to the Riemannian manifold

(M, g)

equipped with the 2-form

ω

, locally defined in each coordinate patch U, by the expression (93), becomes a symplectic

2 d

-dimensional manifold

(tot [TM], ω)

, namely, the phase-space of the geodesic dynamical system. Since

ω

is non-degenerate, the symplectic manifold

(tot [TM], ω)

is also a Poisson manifold. Indeed, in every coordinate patch, it suffices to invert the matrix

ω_{Λ Σ}

so as to obtain the Poissonian bivector:

π^{Λ Σ} = {(ω^{- 1})}^{Λ Σ}

(95)

and given any function

f (Y) \in C^{\infty} (tot [TM])

, according to Equation (A39), we can associate with it the corresponding Hamiltonian vector field:

X_{f} \equiv π^{Λ Σ} \partial_{Λ} f \partial_{Σ}; \partial_{Γ} \equiv \frac{\partial}{\partial Z^{Γ}}

(96)

and we obtain the Poisson bracket fulfilling the properties listed in its Definition A10 by setting:

\forall f, g \in C^{\infty} (tot [TM]) : \{f, g\} = ω (X_{f}, X_{g})

(97)

like in Equation (A40).

The standard geometric geodesic equation:

{\ddot{x}}^{α} + Γ_{β γ}^{α} (x) {\dot{x}}^{β} {\dot{x}}^{γ} = 0

(98)

where

Γ_{β γ}^{α} (x) = \frac{1}{2} g^{α μ} (\partial_{β} g_{γ μ} + \partial_{γ} g_{β μ} - \partial_{μ} g_{β γ})

(99)

denotes the Christoffel symbols, i.e., the components of the Levi Civita connection on

(M, g)

, and is retrieved from the Hamiltonian equations:

\begin{matrix} {\dot{x}}^{α} & = & \{H, x^{α}\} & = & g^{α ν} p_{ν} \\ {\dot{p}}_{α} & = & \{H, p_{α}\} & = & - \frac{1}{2} \partial_{α} g^{ρ σ} p_{ρ} p_{σ} \end{matrix}

(100)

4.2. The Geodesic Dynamical System for Non-Compact Symmetric Spaces

In the case of non-compact symmetric spaces

U / H

, thanks to their metric equivalence with a suitable solvable group manifold

S_{U / H}

, the structure of the geodesic dynamical system can be recast into a more algebraic and very convenient form. To this effect, we observe that the unique Einstein

U

-invariant metric on

U / H

can be written as:

d s^{2} \equiv g = κ_{A B} e^{A} \times e^{B}

(101)

where

κ_{A B} = κ_{B A}

is a constant symmetric matrix and

e^{A} = e {(Υ)}_{α}^{A} d Y^{α}

(102)

are the left-invariant Maurer–Cartan 1-forms on the group manifold

S_{U / H}

, satisfying the Maurer–Cartan equations of the solvable Lie algebra

S o l v_{U / H}

:

d e^{A} + \frac{1}{2} f_{B C}^{A} e^{B} \land e^{C} = 0

(103)

having denoted by

Υ

the solvable coordinates, namely the parameters of the solvable Lie group

S_{U / H} \subset U

, and by

f_{B C}^{A}

, the solvable Lie algebra structure constants (see [1,2] for further details). Metric equivalence is nothing more than Equations (101) and (103). Indeed, let

t_{A} = e {(Υ)}_{A}^{α} \frac{\partial}{\partial Y^{α}}

(104)

be a basis of sections of the tangent bundle

T (U / H)

made of left-invariant vector fields dual to the left-invariant one-forms

e^{A}

:

e^{A} (t_{B}) \equiv e {(Υ)}_{α}^{A} e {(Υ)}_{B}^{α} = δ_{B}^{A}; [t_{B}, t_{C}] = f_{B C}^{A} t_{A}

(105)

The linear combinations with constant coefficients of the

t_{A}

vector fields constitute a vector space endowed with a Lie bracket that is the very definition of the Lie algebra

S o l v_{U / H}

of the solvable group

S_{U / H}

(see, for instance, [9]):

X \in S o l v_{U / H} \Leftrightarrow X = X^{A} t_{A}; X^{A} \in R

(106)

and we obtain:

\forall X, Y \in S o l v_{U / H} : g (X, Y) = κ_{A B} X^{A} Y^{B} \equiv κ (X, Y)

(107)

In this way, we see that once reduced to the left-invariant vector fields, the metric is a scalar product on the solvable Lie Algebra, and the symmetric matrix

κ_{A B}

provides the coefficients of the corresponding symmetric quadratic form. Conversely, any quadratic form

k (,)

on

S o l v_{U / H}

induces a metric on the solvable Lie group

S_{U / H}

:

{d s}_{k}^{2} = k_{A B} e^{A} \times e^{B}

(108)

that is

S_{U / H}

-invariant but not necessarily

U

-invariant, nor Einstein. So any positive definite quadratic bilinear

k (,)

equips

S_{U / H}

with the structure of an Alekseevskian normal Riemannian space [2,12,13,56], yet there is only one quadratic form that corresponds to the unique Einstein metric (The Einstein metric is unique up to a homothety, namely up to a overall constant rescaling of the metric tensor or of the vielbein and such is the corresponding invariant quadratic form on the solvable Lie algebra) of the symmetric space

U / H

.

The structure of Equation (101) being clarified, we reconsider the generic form of the geodesic dynamical system Lagrangian (88), and we rewrite it as:

L_{g e o U H} = \frac{1}{2} κ_{A B} e {(Υ)}_{α}^{A} {\dot{Y}}^{α} e {(Υ)}_{β}^{B} {\dot{Y}}^{β}

(109)

Next, we introduce the anholonomic Lagrangian velocities. The anholonomic Lagrangian velocities are defined as the velocities in an anholonomic basis and, as such, should not be intended as time derivatives of coordinates. By an abuse of notation, we still denote them using an upper dot.

{\dot{q}}^{A} \equiv e {(Υ)}_{α}^{A} {\dot{Y}}^{α} = i_{\partial_{t}} e^{A}

(110)

which are just the contraction of the Maurer–Cartan 1-forms with the time derivative. Similarly, we introduce the anholonomic Hamiltonian momenta as:

p_{A} \equiv \frac{\partial L_{g e o U H}}{\partial {\dot{q}}^{A}} = κ_{A B} {\dot{q}}^{B}

(111)

Then the Hamiltonian is defined as usual by the Legendre transform

H_{g e o U H} = {\dot{q}}^{A} p_{A} - L_{g e o U H} = \frac{1}{2} κ^{A B} p_{A} p_{B}

(112)

where, according to standard conventions,

κ^{A B}

is the inverse of the quadratic form matrix

κ_{A B}

:

κ^{A B} κ_{B C} = δ_{C}^{A}

(113)

4.2.1. The Symplectic 2-Form

Next, we have to convert the standard symplectic form of Equation (93) to the new basis of Hamiltonian coordinates:

Φ^{Λ} = \{p_{A}, Y^{α}\}

(114)

We write the identification:

ω = ω_{Λ Σ} (Z) d Z^{Λ} \land d Z^{Σ} = {\hat{ω}}_{Λ Σ} (Φ) d Φ^{Λ} \land d Φ^{Σ}

(115)

so that our convention is that we name

ω_{Λ Σ}

the components of

ω

in the old Hamiltonian basis and

{\hat{ω}}_{Λ Σ}

the components of the same 2-form in the new Hamiltonian basis. The derivation of

{\hat{ω}}_{Λ Σ}

is the simple and direct calculation sketched below:

\begin{matrix} - ω & = & d Y^{α} \land d p_{α} = d Y^{α} \land d [g_{α β} {\dot{Y}}^{β}] = d Y^{α} \land d [e_{α}^{P} (Υ) κ_{P Q} e_{β}^{Q} (Υ) {\dot{Y}}^{β}] \\ = & d Y^{α} \land d Y^{μ} \partial_{μ} e_{α}^{P} (Υ) (κ_{P Q} e_{β}^{Q} (Υ) {\dot{Y}}^{β})) + d Y^{α} e_{α}^{P} (Υ) \land d [κ_{P Q} e_{β}^{Q} (Υ) {\dot{Y}}^{β}] \\ = & - d e^{A} p_{A} + e^{A} \land d p_{A} \\ = & \frac{1}{2} f_{B C}^{A} e^{B} \land e^{C} + e^{A} \land d p_{A} \end{matrix}

(116)

Summarizing, we have the simple and elegant formula:

ω = - \frac{1}{2} f_{B C}^{A} e^{B} \land e^{C} p_{A} - e^{A} \land d p_{A}

(117)

which is defined on the total space of the tangent bundle of the symmetric space, coinciding with the total space of the tangent bundle of the corresponding solvable Lie group manifold:

T (U / H) ≃ T (S_{U} / H)

. The 2-form

ω

is closed and of maximal rank:

\begin{matrix} d ω & = & 0 \\ \underset{d - times}{\underset{︸}{ω \land ω \land \dots \land ω}} & \neq & 0; d \equiv \dim_{R} [\frac{U}{H}] = \dim_{R} [S_{U / H}] \end{matrix}

(118)

The first line in Equation (118) follows from the consistency of the Maurer–Cartan Equation (103) while the second is evident since we get

ω \land ω \land \dots \land ω = \underset{\neq 0}{\underset{︸}{const}} \times \underset{Vol (M_{2 d})}{\underset{︸}{e^{1} \land e^{2} \land \dots \land e^{d} \land d p_{1} \land \dots \land d p_{d}}}

(119)

where by

Vol (M_{2 d})

we have denoted the top

2 d

-form on the manifold

M_{2 d}

defined as the total space of the tangent bundle

T (S_{U} / H)

:

M_{2 d} = tot [T (S_{U} / H)]

(120)

Hence, the pair

(M_{2 d}, ω)

as defined by Equations (117) and (120) is a bona-fide symplectic manifold as defined and illustrated in Appendix A.7 and such a statement is true for the solvable Lie group

S_{U / H}

singled out by any non-compact symmetric space

U / H

with

U

simple as thoroughly discussed in [1,2].

4.2.2. The Poissonian Bi-Vector

According to the general theory discussed in Appendix A.7 and partially already recalled above in Equations (95)–(97), every symplectic manifold is also a Poissonian manifold, although the reverse is not true. Indeed, the Poissonian bivector

π^{Λ Σ}

can be obtained from the symplectic 2-form components as in Equation (95). Therefore, our next task is that of retrieving the Poissonian bivector starting from the symplectic 2-form in Equation (117). This is easily done. In the coordinate basis (114) the

2 d \times 2 d

matrix

{\hat{ω}}_{Λ Σ}

has the following structure:

\begin{matrix} {\hat{ω}}_{Λ Σ} & = & (\begin{matrix} 0_{d \times d} & {\hat{ω}}_{β}^{M} \\ {\hat{ω}}_{α}^{N} & {\hat{ω}}_{α β} \end{matrix}) \\ {\hat{ω}}_{α β} & = & - \frac{1}{2} f_{B C}^{A} p_{A} e_{α}^{B} e_{β}^{C}; {\hat{ω}}_{β}^{M} = \frac{1}{2} e_{β}^{M}; {\hat{ω}}_{α}^{N} = - \frac{1}{2} e_{α}^{N} \end{matrix}

(121)

The Poissonian bivector is an antisymmetric matrix

π^{Λ Σ}

such that:

ω_{Λ Σ} π^{Σ Δ} = δ_{Λ}^{Δ}

(122)

We immediately find

\begin{matrix} π_{Λ Σ} & = & (\begin{matrix} π_{M N} & π_{M}^{β} \\ π_{N}^{α} & 0_{d \times d} \end{matrix}) \\ π^{M N} & = & - 2 f_{M N}^{A} p_{A}; π_{M}^{β} = - 2 e_{M}^{β}; π_{N}^{α} = 2 e_{N}^{α} \end{matrix}

(123)

where

e_{α}^{N}

are the components of the left-invariant Maurer–Cartan forms

e^{M}

and

e_{M}^{β}

, the components of their dual left-invariant vector fields on the solvable Lie group manifold:

e^{A} = e_{α}^{A} d Y^{α};; t_{B} = e_{B}^{β} \frac{\partial}{\partial Y^{β}}; e^{A} (t_{B}) = e_{α}^{A} e_{B}^{α} = δ_{B}^{A}

(124)

that generate right-translations.

4.2.3. Hamiltonian Vector Fields and the Poisson Bracket

Having determined the Poissonian bivector, we can write the explicit form of the Hamiltonian vector field associated with any function

φ (Φ) \in C^{\infty} (tot [T (S_{U} / H)])

. Recalling Equation (96), we set:

\begin{matrix} φ (Φ) & \to & X_{φ} & = & π^{Λ Σ} \partial_{Λ} φ \frac{\partial}{\partial Φ^{Σ}} \\ = & - 2 p_{A} f_{B C}^{A} \frac{\partial φ}{\partial p_{B}} \frac{\partial}{\partial p_{C}} - 2 \frac{\partial φ}{\partial p_{M}} t_{M} + 2 t_{M} φ \frac{\partial}{\partial p_{M}} \end{matrix}

(125)

and we define the Poisson bracket as:

\begin{matrix} \forall φ (Φ), ψ (Φ) \in C^{\infty} (tot [T (S_{U} / H)]) & : & \{φ, ψ\} & \equiv & ω (X_{φ}, X_{ψ}) \\ = & - 2 (p_{A} f_{B C}^{A} \frac{\partial φ}{\partial p_{B}} \frac{\partial ψ}{\partial p_{C}} + \frac{\partial φ}{\partial p_{M}} t_{M} ψ - t_{M} φ \frac{\partial ψ}{\partial p_{M}}) \end{matrix}

(126)

4.2.4. Symplectic Moment Map

Given a vector field

k \in Γ [TM, M]

, namely a section of the tangent bundle to (Here and in the following lines we are always talking about the solvable Lie group

S

metrically equivalent to the symmetric space

U / H

, and for notation simplicity we drop the subscript

U / H

).

M = tot [TS]

(127)

which has the symplectic structure specified by the 2-form (117); we define the moment map:

μ : k ⟶ μ_{k} (Φ) \in C^{\infty} (M)

(128)

by imposing the condition:

\forall f (Φ) \in C^{\infty} : k f = \{μ_{k}, f\} = ω (k, X_{f})

(129)

where

X_{f}

is the Hamiltonian vector field associated with the function f (see Equation (125)). Hence, the moment map

μ_{k}

is a solution to the following differential equation:

k = - 2 f_{B C}^{A} p_{A} \frac{\partial μ_{k}}{\partial p_{B}} \frac{\partial}{\partial p_{C}} - 2 \frac{\partial μ_{k}}{\partial p_{M}} t_{M} + 2 t_{M} μ_{k} \frac{\partial}{\partial p_{M}}

(130)

Consider, in particular, the vector fields

k_{N} \equiv t_{N} + v_{N}

(131)

where

t_{N}

are the purely horizontal, right-invariant vector fields defined over the solvable group manifold

S

that generate its solvable Lie algebra

S o l v

since

[t_{N}, t_{R}] = f_{N R}^{A} t_{R}

(132)

while

v_{N} \equiv f_{N B}^{A} p_{A} \frac{\partial}{\partial p_{B}}

(133)

are purely vertical vector fields that, as a consequence of the Jacobi identities, satisfy the commutation relations of

S o l v

:

[v_{N}, v_{R}] = f_{N R}^{A} v_{R}

(134)

and commute with the horizontal partners:

[v_{N}, t_{R}] = 0

(135)

Hence, the vector fields

k_{N}

also satisfy the solvable Lie algebra commutation relations:

[k_{N}, k_{R}] = f_{N R}^{A} k_{R}

(136)

and are the infinitesimal generators for the action of the solvable Lie group

S

on the total space of its tangent bundle, namely the phase-space of the geodesic dynamical system. We claim that for the vector fields

k_{N}

, the appropriate moment map is the following

μ_{N} \equiv μ_{t_{N}} = - \frac{1}{2} p_{N}

(137)

4.2.5. Relation with the Nomizu Operator

Once the metric form is given, the construction of geometry and of the associated geodesic equations follows uniquely. The issue is just that of calculating the Levi–Civita connection of the metric g induced on the manifold by the form

<, >

defined on the solvable Lie algebra

S o l v

. One way of describing this Levi–Civita connection is by means of the so called Nomizu operator acting on

S o l v

. The latter is defined as follows:

\begin{matrix} L & : & S o l v \otimes S o l v \to S o l v, \\ \forall X, Y, Z \in S o l v & : & 2 < L_{X} Y, Z > = < [X, Y], Z > - < X, [Y, Z] > - < Y, [X, Z] > \end{matrix}

(138)

The Riemann curvature operator on

S o l v

can be expressed as

Riem (X, Y) = [L_{X}, L_{Y}] - L_{[X, Y]}

(139)

If we introduce a basis of abstract generators

{T_{A}}

for

S o l v

and the corresponding structure constants

[T_{A}, T_{B}] = f_{A B}^{C} T_{C}

(140)

that are the same as those appearing in Equations (103) and (105) together with the metric tensor

< T_{A}, T_{B} > = κ_{A B}

(141)

the connection defined by Equation (138) leads to the following connection coefficients:

\begin{matrix} L_{A} T_{B} & = & Γ_{A B}^{C} T_{C} \\ Γ_{A B}^{C} & = & f_{A B}^{C} - κ_{A D} κ^{C E} f_{B E}^{D} - κ_{B D} κ^{C E} f_{A E}^{D} \end{matrix}

(142)

which are constant numbers.

Given the connection coefficients, the differential geodesic equations can be written immediately. In the chosen basis, the tangent vector to the geodesic is described by n fields

Π^{A} (t) \equiv κ^{A B} p_{B} (t)

(143)

which depend on the affine parameter t along the curve. The geodesic equation is given by the following first-order differential system:

\frac{d}{d t} Π^{A} + Γ_{B C}^{A} Π^{B} Π^{C} = 0

(144)

The above equation contains two pieces of data:

(1): the structure constants of the solvable Lie algebra $f_{A B}^{C}$ ;
(2): the constant tensor $κ_{A B}$ defining the norm on the solvable Lie algebra.

Equation (144) can be obtained as Hamiltonian equations from the definition (112) of the Hamiltonian

H_{g e o U H}

and the explicit expression of the Poisson bracket (126):

\begin{matrix} \partial_{t} Π^{A} & = & \{H_{g e o U H}, Π^{A}\} \end{matrix}

(145)

\begin{matrix} \partial_{t} {\dot{Y}}^{α} & = & \{H_{g e o U H}, Y^{α}\} = e_{M}^{α} Π^{M} \end{matrix}

(146)

Note that the first Equation (145) yielding Equation (144) was obtained in [18] using the definition (126) of Poisson brackets reduced to functions only of the momenta

p_{A}

, namely:

{\{φ (p), ψ (p)\}}_{r e d} = - 2 p_{A} f_{B C}^{A} \frac{\partial φ}{\partial p_{B}} \frac{\partial ψ}{\partial p_{C}}

(147)

It was observed in [18] that Equation (147) equips the dual vector space

S o l v^{★}

to any solvable Lie algebra

S o l v

with the structure of a Poisson manifold, which is not a symplectic manifold since the bivector provided by the solvable structure constants

f_{B C}^{A}

is not invertible. Yet, as recalled in [18], when the solvable Lie algebra is the Borel subalgebra of the special linear group

S o l v = B_{N} \equiv B [sl (N, R)]

(148)

namely the space of

N \times N

triangular traceless matrices, it was proved by Arkhangel’skii in [57] that the Hamiltonian system based on the Poisson bracket (147) is always Liouville integrable since it possesses the required number of Hamiltonians in involution. Specifically, distinguishing the even case

B_{2 ν}

from the odd one

B_{2 ν + 1}

(where

ν \in N

), we have that for

B_{2 ν}

there are

ν^{2} + ν - 1

Hamiltonians in involution, while for

B_{2 ν + 1}

the number of such objects is

ν^{2} + 2 ν

. Furthermore, in both cases, among the Hamiltonian in involution there are respectively

r = ν - 1

and

r = ν

Casimirs

C_{i} (p)

(

i = 1, \dots, r

), namely such functions of the momenta

p

that have vanishing Poisson bracket with all of them:

\forall i, A : \{p_{A}, C_{i} (p)\} = 0

(149)

According to the general theory of dynamical systems, one can define level surfaces of the Casimirs:

L_{k_{1}, \dots, k_{r}} \subset S o l v^{★}; \forall p \in L_{k_{1}, \dots, k_{r}} : C_{i} (p) = k_{i} = const \in R

(150)

and, as explained in [18], such level surfaces always have an even dimension and become symplectic manifolds since when restricted to them the Poissonian bi-vector becomes invertible.

Although this is very interesting, as one sees from the above discussion, it is only one aspect of the full story. Indeed, the complete space that includes not only the momenta

p_{A}

but also the coordinates

Y^{α}

is always symplectic, and the full definition of the Poisson bracket is that given in Equation (126). Yet the nice point about Arkhangel’skii Hamiltonians is that those functions of the momenta

p

that are in involution with respect to the reduced Poisson bracket (147) remain in involution also with respect to the full Poisson bracket (126), so that they are constant along any geodesic.

5. A Master Example for the Geodesic Dynamical System: $SL (3, R) / SO (3)$

As a concrete illustration of the above-discussed concepts and constructions we choose the 5-dimensional symmetric space:

M_{5} \equiv \frac{SL (3, R)}{SO (3)}

(151)

The reasons for such a choice are several:

$M_{5}$ is the smallest symmetric space with a non-compact rank $r > 1$ and a non-trivial solvable Lie algebra $S o l v_{5}$ .
$M_{5}$ belongs to the mother series of non-compact symmetric spaces $\frac{SL (N, R)}{SO (N)}$ that, thanks to the triangular embedding, explained in [1,2], contains all the members of the other series as submanifolds.
$M_{5}$ is not a Kähler manifold, yet its 10-dimensional tangent bundle ${TM}_{5}$ has the symplectic structure discussed in Section 4.2 and Section 5 as any other tangent bundle. This allows to illustrate the distinction among the symplectic manifold of the GDS utilized here with respect to what is done in paper [32], where, as we extensively stressed in the introduction, the moment maps and the thermodynamical states are defined with respect to the Souriau symplectic 2-form (25) constructed on coadjoint orbits and also with respect to what was done in the above mentioned papers [18,57], where the Poisson structure is defined only on the standard fibre of the tangent bundle to $TU / H$ . As we stressed in the introduction, the Souriau case corresponds to thermodynamics on Käehler manifolds.

The Solvable Lie Algebra Generators

An explicit basis for the solvable Lie algebra

S o l v_{5}

of traceless, upper triangular matrices in 3-dimension is the following one:

\begin{matrix} T_{1} & = & (\begin{matrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & - 1 \end{matrix}) & ; & T_{2} & = & (\begin{matrix} 0 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & - 1 \end{matrix}) & ; T_{3} & = & (\begin{matrix} 0 & 1 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{matrix}) & ; & T_{4} & = & (\begin{matrix} 0 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{matrix}) \\ T_{5} & = & (\begin{matrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{matrix}) & ; \end{matrix}

(152)

where the diagonal

T_{1, 2}

are Cartan generators, while

T_{3, 4}

correspond to the simple roots

α_{1, 2}

and

T_{5}

is associated with the highest root

α_{1} + α_{2}

.

The solvable Lie group generic element

Next, it is very simple to write the generic element of the solvable Lie group

S_{5} ≅ exp [S o l v_{5}]

obtained according to the rules of the exponential map

Σ

as defined in [1,2].

S_{5} ∋ L (Υ) \equiv Σ [Y^{A} T_{A}] = \prod_{A = 1}^{5} exp [Y^{A} T_{A}] = (\begin{matrix} e^{Y^{1}} & e^{Y^{1}} Y^{3} & e^{Y^{1}} (Y^{3} Y^{4} + Y^{5}) \\ 0 & e^{Y^{2}} & e^{Y^{2}} Y^{4} \\ 0 & 0 & e^{- Y^{1} - Y^{2}} \end{matrix})

(153)

The Left Invariant Cartan 1-form Matrix on

S_{5}

Given the generic solvable Lie group element

L (Υ)

, we easily compute the solvable Lie algebra valued left-invariant 1-form

Θ

:

\begin{matrix} Θ & \equiv & L^{- 1} (Υ) \cdot d L (Υ) \\ = & (\begin{matrix} d Y^{1} & d Y^{3} + Y^{3} (d Y^{1} - d Y^{2}) & d Y^{5} + Y^{4} d Y^{3} + Y^{3} Y^{4} d Y^{1} - Y^{3} Y^{4} d Y^{2} + 2 Y^{5} d Y^{1} + Y^{5} d Y^{2} \\ 0 & d Y^{2} & d Y^{4} + Y^{4} (d Y^{1} + 2 d Y^{2}) \\ 0 & 0 & - d Y^{1} - d Y^{2} \end{matrix}) \end{matrix}

(154)

The Maurer–Cartan Forms 1-forms on

S_{5}

The left-invariant Maurer–Cartan forms

e^{A}

are obtained by decomposing the upper triangular traceless matrix

Θ

, whose matrix elements are 1-forms on the solvable Lie group manifold

S_{5}

, along the generator basis (A2):

\begin{matrix} Θ & = & e^{A} T_{A} \\ e^{1} & = & d Y^{1} \\ e^{2} & = & d Y^{2} \\ e^{3} & = & d Y^{3} + Y^{3} (d Y^{1} - d Y_{2}) \\ e^{4} & = & d Y^{4} + Y^{4} (d Y^{1} + 2 d Y_{2}) \\ e^{5} & = & d Y^{5} + Y^{4} d Y^{3} + Y^{3} Y^{4} d Y^{1} - Y^{3} Y^{4} d Y^{2} + 2 Y^{5} d Y^{1} + Y^{5} d Y^{2} \end{matrix}

(155)

By construction, the Maurer–Cartan forms

e^{A}

satisfy the Maurer–Cartan equations:

d e^{A} + \frac{1}{2} f_{B C}^{A} e^{B} \land e^{C} = 0

(156)

where

f_{B C}^{A}

are the structure constants of the solvable Lie algebra

S o l v_{5}

:

[T_{B}, T_{C}] = f_{B C}^{A} T_{A}

(157)

The vielbein of the unique

SL (3, R)

invariant Einstein metric on the symmetric space

M_{5}

As explained above and in [1], the unique Einstein

U

-invariant metric on the symmetric manifold is algebraically obtained from an orthonormal basis of

K

generators in the Cartan decomposition of the

U

-Lie algebra:

U = H \oplus K

, where

K

constitutes an irreducible representation of the maximal compact subalgebra

H

under its adjoint action. In our case

H = so (3)

and the 5-dimensional vector space

K

corresponds to the

J = 2

irrep of the three-dimensional rotation group. An orthonormal basis is provided by the following symmetric matrices:

\begin{matrix} K^{1} & = & (\begin{matrix} \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & - \frac{1}{\sqrt{2}} \end{matrix}) & ; & K^{2} & = & (\begin{matrix} - \frac{1}{\sqrt{6}} & 0 & 0 \\ 0 & \sqrt{\frac{2}{3}} & 0 \\ 0 & 0 & - \frac{1}{\sqrt{6}} \end{matrix}) & ; K^{3} & = & (\begin{matrix} 0 & \frac{1}{\sqrt{2}} & 0 \\ \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & 0 \end{matrix}) & ; & K^{4} & = & (\begin{matrix} 0 & 0 & 0 \\ 0 & 0 & \frac{1}{\sqrt{2}} \\ 0 & \frac{1}{\sqrt{2}} & 0 \end{matrix}) \\ K^{5} & = & (\begin{matrix} 0 & 0 & \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 \\ \frac{1}{\sqrt{2}} & 0 & 0 \end{matrix}) & ; \end{matrix}

(158)

that satisfy the following relation:

Tr (K^{I} \cdot K^{J}) = δ^{I J}; I, J = 1, 2 \dots, 5

(159)

The vielbein determining the metric is obtained from the Cartan left invariant matrix 1-form (154) by setting

V^{I} \equiv Tr (K^{I} \cdot Θ)

(160)

and the metric is

\begin{matrix} d s^{2} & \equiv & δ_{I J} V^{I} \times V^{J} = g_{α β} (Υ) d Y_{α} d Y_{β} \\ = & \frac{1}{2} \{3 d Y^{2} + {(2 d Y^{1} + d Y^{2})}^{2} + {(d Y^{3} + Y^{3} (d Y^{1} - d Y^{2}))}^{2} + {(d Y^{4} + Y^{4} (d Y^{1} + 2 d Y^{2}))}^{2} \\ + {(d Y^{5} + Y^{4} d Y^{3} + Y^{3} Y^{4} d Y^{1} - Y^{3} Y^{4} d Y^{2} + 2 Y^{5} d Y^{1} + Y^{5} d Y^{2})}^{2}\} \end{matrix}

(161)

The vielbein 1-forms

V^{I}

are linear combinations of the Maurer–Cartan 1-forms

e^{A}

:

V^{I} = ν_{A}^{I} e^{a}; ν = (\begin{matrix} \sqrt{2} & \frac{1}{\sqrt{2}} & 0 & 0 & 0 \\ 0 & \sqrt{\frac{3}{2}} & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & 0 & \frac{1}{\sqrt{2}} & 0 \\ 0 & 0 & 0 & 0 & \frac{1}{\sqrt{2}} \end{matrix})

(162)

So that the metric can also be written as:

\begin{matrix} d s^{2} & = & κ_{A B} e^{A} \times e^{B} \\ κ & \equiv & ν^{T} \cdot ν = (\begin{matrix} 2 & 1 & 0 & 0 & 0 \\ 1 & 2 & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{2} & 0 & 0 \\ 0 & 0 & 0 & \frac{1}{2} & 0 \\ 0 & 0 & 0 & 0 & \frac{1}{2} \end{matrix}) \end{matrix}

(163)

5.1. Hamiltonians in Involution and Generalized Thermodynamics

As we stressed in Section 4.2.5, the general definition (126) of the Poisson bracket on the total space of the tangent bundle can be reduced to functions

f (p)

that depend only on the momenta, namely on the vertical fibre coordinates, and one obtains the reduced formula (147) which still satisfies all the properties mentioned in Definition A10 of Appendix A.7 in order to define a Poisson bracket. In the case of the here considered master model, Equation (147) takes the following explicit form for any pair of functions

F (p), G (p)

:

\begin{matrix} {\{F (p), G (p)\}}_{r e d} & = & - 2 p_{3} (- \partial_{3} F \partial_{1} G + \partial_{3} F \partial_{2} G + \partial_{1} F \partial_{3} G - \partial_{2} F \partial_{3} G) \\ - 2 p_{4} (- \partial_{4} F \partial_{1} G - 2 \partial_{4} F \partial_{2} G + \partial_{1} F \partial_{4} G + 2 \partial_{2} F \partial_{4} G) \\ - 2 p_{5} (- 2 \partial_{5} F \partial_{1} G - \partial_{5} F \partial_{2} G - \partial_{4} F \partial_{3} G + \partial_{3} F \partial_{4} G + 2 \partial_{1} F \partial_{5} G + \partial_{2} F \partial_{5} G) \end{matrix}

(164)

where we have used the shorthand notation:

\partial_{a} f \equiv \frac{\partial f (p)}{\partial p_{a}}

(165)

Utilizing the constructive recipe introduced by Arkhangel’skii in [57] and recalled in Equation (2.49) of [18]), we obtain the following three Hamiltonians in involution:

\begin{matrix} H_{1} & = & \frac{1}{3} (p_{1}^{2} - p_{2} p_{1} + p_{2}^{2} + 3 (p_{3}^{2} + p_{4}^{2} + p_{5}^{2})) \end{matrix}

(166)

\begin{matrix} H_{2} & = & \frac{1}{27} (- 2 p_{1}^{3} + 3 p_{2} p_{1}^{2} + 3 (p_{2}^{2} - 3 (p_{3}^{2} - 2 p_{4}^{2} + p_{5}^{2})) p_{1} - 2 p_{2}^{3} - 54 p_{3} p_{4} p_{5} - 9 p_{2} (p_{3}^{2} + p_{4}^{2} - 2 p_{5}^{2})) \end{matrix}

(167)

\begin{matrix} H_{3} & = & \frac{1}{3} (p_{1} - 2 p_{2} + \frac{3 p_{3} p_{4}}{p_{5}}) \end{matrix}

(168)

\begin{matrix} 0 & = & {\{H_{i}, H_{j}\}}_{r e d} i, j = 1, 2, 3 \end{matrix}

(169)

The first crucial observation is that the quadratic first Hamiltonian

H_{1} = H_{g e o U H}

coincides with the quadratic Hamiltonian (112) obtained from the Legendre transform of the geodesic dynamical system Lagrangian. Indeed, in our case, we have that

κ

is given by the second of Equation (163), and one immediately verifies that

\begin{matrix} H_{1} & = & \frac{1}{2} {(κ^{- 1})}^{A B} p_{A} p_{B} \equiv \frac{1}{2} κ^{A B} p_{A} p_{B} = H_{g e o U H} \\ κ^{- 1} & = & (\begin{matrix} \frac{2}{3} & - \frac{1}{3} & 0 & 0 & 0 \\ - \frac{1}{3} & \frac{2}{3} & 0 & 0 & 0 \\ 0 & 0 & 2 & 0 & 0 \\ 0 & 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 0 & 2 \end{matrix}) \end{matrix}

(170)

This implies that the method of Hamiltonians in involution selects the unique invariant norm on the solvable Lie algebra that corresponds to the unique

U

-symmetric Einstein metric on the equivalent symmetric space

SL (N, R) / SO (N)

. This fact was already pointed out in [18].

The second important observation is that the functions depending only on the momenta

p_{A}

that are in involution with respect to the reduced Poisson bracket are in involution with respect to the complete Poisson bracket, so that they are conserved along all trajectories, namely all geodesic lines.

The case of the Casimir

H_{3}

is different. It commutes with all the momenta, yet it does not commute with all the coordinates

Y^{α}

. Indeed, we have:

\{H_{3}, p_{A}\} = 0; \{H_{3}, Y^{α}\} = \frac{\partial H_{3}}{\partial p_{A}} e_{A}^{α} (Υ)

(171)

5.2. Generalized Thermodynamics for a Geodesic Dynamical System on $U / H$

As we see from Equation (171), there is no way of fixing consistently the Hamiltonian

H_{3}

to a constant value on the whole ten-dimensional space with canonical coordinates

{p_{A}, Y^{α}}

, obtaining in this way a reduced symplectic manifold of dimension eight. This conclusion is illustrated explicitly in the present master example, yet it is true for all manifolds

SL (N, R) / SO (N)

since in all cases the Casimirs

C_{i} (p)

that have a vanishing reduced Poisson bracket with the momenta

p

, have, with the manifold coordinates

Y^{α}

, a nonvanishing Poisson bracket:

\{C_{i} (p), Y^{α}\} = \frac{\partial C_{i} (p)}{\partial p_{A}} e_{A}^{α} (Υ)

(172)

Nevertheless, one can introduce, as in the classical statistical mechanics of free gases, a Gibbs state probability distribution that minimizes the Shannon functional as we discussed in Section 3 and we announced in Section 1.5. Indeed, given the set of conserved Hamiltonians in involution that depend only on the momenta

p

and that we denote

H_{i} (p)

, we can construct the Gibbs state probability distribution defined by Equations (26) and (27). For any symmetric space

U / H

of non-compact type, we have:

\begin{matrix} G (λ, V) & = & \frac{exp [- λ \cdot H (p)]}{Z (λ, V)} \end{matrix}

(173)

\begin{matrix} Z (λ, V) & = & \int_{M_{2 d}} exp [- λ \cdot H (p)] d λ (p, Υ) \end{matrix}

(174)

\begin{matrix} d λ (p, Υ) & = & \underset{Vol (M_{2 d})}{\underset{︸}{e^{1} \land e^{2} \land \dots \land e^{d} \land d p_{1} \land \dots \land d p_{d}}} \end{matrix}

(175)

where

M_{2 d}

is the total space of the tangent bundle as specified in Equation (120) and

d λ (p, Υ)

is the Liouville integration measure on such a space presented in Equation (119). Since all the Hamiltonians depend only on the momenta

p_{A}

and not on the coordinates

Y^{α}

, we are in a situation similar to that of Ideal Gases (see Appendix C.4), and the partition function (174) factorizes into the product of two integrals (or summations):

Z (λ, V) = \underset{ζ (λ)}{\underset{︸}{\int_{T} exp [- λ \cdot H (p)] d p_{1} \land \dots \land d p_{d}}} \times \underset{V = volume}{\underset{︸}{\int_{Box \subset S} e^{1} \land e^{2} \land \dots \land e^{d}}}

(176)

Discussion on the Box Conception

In relation to the factorization in Equation (176), we have to pause for a moment and analyze the comparison with Ideal Gas Statistical Mechanics reviewed in Appendix C.4. In both cases, the basic Hamiltonian is a quadratic form in the momenta, and the integral over

p

is a multiple-Gaussian integral, the multiplicity being the number of degrees of freedom

n_{f}

. In the Ideal Gas case, this number is

n_{f} = 3 \times N

where N is the number of molecules composing the gas, namely of the order of the Avogadro number. In the geodesic dynamical system case, the number of degrees of freedom is

n_{f} = d

, namely a figure of the dimension of the symmetric space

U / H

or, if you prefer, of the corresponding solvable Lie group

S_{U / H}

. In the Ideal Gas case, the integral V is the volume of the portion of physical space

R^{3}

in which the sample of gas is confined, and the integration on it occurs N times, one for each molecule composing the gas. Hence, the factor that multiplies

ζ (λ)

is

V^{N}

. For the gases, the Gibbs state distribution expresses the probability that the N-particles are in a state where each of them has a given momentum, and it is at a given point. The independence of the Hamiltonian from the coordinates implies that such probability is insensitive to the position, namely, all positions have the same probability, while the momenta follow a Gaussian distribution. In the case of the geodesic dynamical system, the Gibbs state describes the probability that a geodesic starts at a given point with a given initial tangent vector (the momentum). The coordinate independence of the Hamiltonians implies that all points of the base manifold have the same probability, while the initial tangent vectors follow, as far as we consider only the quadratic Hamiltonian, a Gaussian distribution. As more Hamiltonians are introduced, the distribution becomes more complicated and more structured in momentum space, yet remains flat in coordinate space. In both cases, this flatness is the consequence of translation invariance, with respect to

R^{3}

in the Ideal Gas case, with respect to the solvable Lie group

S_{U / H}

in the case of the geodesic dynamical system on

U / H

. Just as in the Ideal Gas case the Box is the portion of physical space in which the gas is confined, in the same way for the Geodesic Dynamical System case the Box is the portion of base manifold to which we confine the possible initial points of the geodesics of our interest: indeed the statistical distribution described by our thermodynamics is a statistical distribution in the space of geodesics.

5.2.1. Generalized Thermodynamics for the Chosen Master Example

In order to study the generalized thermodynamics of our master example, where the conserved Hamiltonians are given by Equations (166)–(168), we have to calculate the nontrivial part of the partition function:

ζ (λ) = \int exp [- \sum_{i = 1}^{3} λ_{i} H_{i} (p)] d^{5} p

(177)

To this effect, it is convenient to diagonalize the quadratic form in

p_{A}

that corresponds to the first Hamiltonian. Since we have

- λ_{1} H_{1} = λ_{1} \underset{= k^{A B}}{\underset{︸}{{(- \frac{1}{2} κ^{- 1})}^{A B}}} p_{A} p_{B}

(178)

we need to diagonalize the above-defined matrix

k

. An easy calculation shows that the following matrix:

U \equiv (\begin{matrix} 0 & 0 & 0 & - 1 & \sqrt{3} \\ 0 & 0 & 0 & 1 & \sqrt{3} \\ 0 & 0 & 1 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 & 0 \end{matrix})

(179)

satisfies the condition:

U^{T} \cdot k \cdot U = - 1_{5 \times 5}

(180)

Hence, defining new variables

w \equiv U p

(181)

we obtain

- w^{T} w = - \frac{1}{2} H_{1} (p)

(182)

Explicitly, the change of variables is as follows

\begin{matrix} w_{1} & = & p_{5} \\ w_{2} & = & p_{4} \\ w_{3} & = & p_{3} \\ w_{4} & = & \frac{1}{2} (p_{2} - p_{1}) \\ w_{5} & = & \frac{p_{1} + p_{2}}{2 \sqrt{3}} \end{matrix}; \begin{matrix} p_{1} & = & \sqrt{3} w_{5} - w_{4} \\ p_{2} & = & w_{4} + \sqrt{3} w_{5} \\ p_{3} & = & w_{3} \\ p_{4} & = & w_{2} \\ p_{5} & = & w_{1} \end{matrix}

(183)

In terms of new variables

w

, the three Hamiltonians have the following expression:

\begin{matrix} H_{1} (w) & = & w_{1}^{2} + w_{2}^{2} + w_{3}^{2} + w_{4}^{2} + w_{5}^{2} \\ H_{2} (w) & = & (w_{4} + \frac{w_{5}}{\sqrt{3}}) w_{1}^{2} - 2 w_{2} w_{3} w_{1} + \frac{1}{9} ((3 \sqrt{3} w_{5} - 9 w_{4}) w_{2}^{2} + 2 \sqrt{3} w_{5} (w_{5}^{2} - 3 (w_{3}^{2} + w_{4}^{2}))) \\ H_{3} (w) & = & \frac{w_{2} w_{3}}{w_{1}} - w_{4} - \frac{w_{5}}{\sqrt{3}} \end{matrix}

(184)

Hence, the partition function reduces to the following multiple integral

ζ (λ_{1}, λ_{2}, λ_{3}) = \int_{- \infty}^{\infty} d w_{1} \int_{- \infty}^{\infty} d w_{2} \int_{- \infty}^{\infty} d w_{3} \int_{- \infty}^{\infty} d w_{4} \int_{- \infty}^{\infty} d w_{5} exp [- \sum_{i = 1}^{3} λ_{i} H_{i} (w)]

(185)

Although we have not explored all the possible strategies to calculate the integral in the case that the three generalized temperatures

λ_{i}

are all different from zero, so such a calculation appears. On the other hand, the calculation is rather easy if we put

λ_{2} = 0

, while keeping

λ_{1} \neq 0

and

λ_{3} \neq 0

. We present such an explicit result that provides an illustrative example of what the geodesic thermodynamics on a non-compact symmetric space can be.

With an iterative Gaussian integration, we obtain:

ζ (λ_{1}, 0, λ_{3}) = \frac{2 π^{5 / 2} e^{\frac{λ_{3}^{2}}{12 λ_{1}}}}{λ_{1}^{5 / 2}} \Rightarrow Z (λ_{1}, 0, λ_{3}, V) = \frac{2 π^{5 / 2} e^{\frac{λ_{3}^{2}}{12 λ_{1}}}}{λ_{1}^{5 / 2}} V

(186)

and consequently the stochastic Hamiltonian (see Equation (45)) reads as follows:

H ∣_{λ_{2} = 0} = - log (\frac{2 π^{5 / 2} e^{\frac{λ_{3}^{2}}{12 λ_{1}}}}{λ_{1}^{5 / 2}}) - log (V) = - \frac{λ_{3}^{2}}{12 λ_{1}} - \frac{5}{2} log (\frac{π}{λ_{1}}) - log (V) - log (2)

(187)

Recalling Equation (47), the Shannon Information Entropy (which coincides with minus the thermodynamical entropy S) is the following:

I = H - λ_{1} \frac{\partial H}{\partial λ_{1}} - λ_{3} \frac{\partial H}{\partial λ_{3}} = \frac{5 log (λ_{1})}{2} - log (2 V) - \frac{5}{2} - \frac{5 log (π)}{2}

(188)

Finally, the Gibbs state probability distribution is

G (λ_{1}, λ_{3}, V, p, Υ) = \frac{λ_{1}^{5 / 2} exp (\frac{1}{12} (- \frac{λ_{3}^{2}}{λ_{1}} - 4 λ_{3} (p_{1} - 2 p_{2} + \frac{3 p_{3} p_{4}}{p_{5}}) - 4 λ_{1} (p_{1}^{2} - p_{2} p_{1} + p_{2}^{2} + 3 (p_{3}^{2} + p_{4}^{2} + p_{5}^{2}))))}{2 π^{5 / 2} V}

(189)

5.2.2. Final Remarks on the GDS Generalized Thermodynamics of the Master Model $SL (3, R) / SO (3)$

Let us observe that if we put

λ_{3} = 0

and we interpret

λ_{1} = 1 / (k_{B} T)

, the partition function (186) takes apart from constant factors the form of that of an ideal gas (compare the following with Equation (A110)):

Z (T, 0, V) = 2 {(π k_{B} T)}^{5 / 2} V

(190)

where

3 N

is replaced by 5 and N by 1. The meaning of this is quite transparent. It is like we were dealing with a gas made of just one particle (hence

N = 1

) that moves in a space with five dimensions instead of three. Correspondingly, the thermodynamical metric at

λ_{3} = 0

is flat as for ideal gases. On the other hand, if we switch on the generalized temperature

λ_{3}

, we have a surprise. Considering the three thermodynamical coordinates:

t = {λ_{1}, λ_{3}, V}

(191)

and working out the thermodynamical metric from the stochastic Hamiltonian in Equation (187), we obtain:

\begin{matrix} d s_{t h e r m}^{2} & \equiv & \frac{\partial^{2} H}{\partial t^{i} \partial t^{j}} d t^{i} d t^{j} \\ = & \frac{{d V}^{2}}{V^{2}} - \frac{{d λ_{1}}^{2} (15 λ_{1} + {λ_{3}}^{2}) - 2 d λ_{1} d λ_{3} λ_{1} λ_{3} + {d λ_{3}}^{2} {λ_{1}}^{2}}{6 {λ_{1}}^{3}} \end{matrix}

(192)

Clearly, the metric (192) is a metric on a direct product manifold, which is spanned by the two temperatures

λ_{1, 3}

and that spanned by the volume. Furthermore, the metric has a Lorentzian signature

{- 1, - 1, 1}

. Finally, the two-dimensional space associated with the two temperatures is not flat, as in the ideal gas case, it is a constant negative curvature manifold. Indeed, introducing the following dreibein:

\begin{matrix} E & = & \{E^{1}, E^{2}, E^{3}\} \\ E^{1} & = & \frac{\sqrt{15 λ_{1} + {λ_{3}}^{2}}}{\sqrt{6} {λ_{1}}^{3 / 2}} d λ_{1} - \frac{λ_{3}}{\sqrt{6} \sqrt{λ_{1}} \sqrt{15 λ_{1} + {λ_{3}}^{2}}} d λ_{3} \\ E^{2} & = & \frac{\sqrt{\frac{5}{2}}}{\sqrt{15 λ_{1} + {λ_{3}}^{2}}} d λ_{3} \\ E^{3} & = & \frac{1}{V} d V \end{matrix}

(193)

we can write

d s_{t h e r m}^{2} = - {(E^{1})}^{2} - {(E^{2})}^{2} + {(E^{3})}^{2}

(194)

and we derive the following spin-connection

ω^{i j} = (\begin{matrix} 0 & - \frac{15 \sqrt{\frac{3}{2}} λ_{1}^{3 / 2}}{{(λ_{3}^{2} + 15 λ_{1})}^{3 / 2}} E^{2} - \frac{λ_{3}^{3}}{\sqrt{10} {(λ_{3}^{2} + 15 λ_{1})}^{3 / 2}} E^{1} & 0 \\ \frac{15 \sqrt{\frac{3}{2}} λ_{1}^{3 / 2}}{{(λ_{3}^{2} + 15 λ_{1})}^{3 / 2}} E^{2} + \frac{λ_{3}^{3}}{\sqrt{10} {(λ_{3}^{2} + 15 λ_{1})}^{3 / 2}} E^{1} & 0 & 0 \\ 0 & 0 & 0 \end{matrix})

(195)

which yields the following curvature 2-form:

R^{i j} = d ω^{i j} + ω^{i k} \land ω^{p j} η_{k p} = \frac{1}{10} (\begin{matrix} 0 & E^{1} \land E^{2} & 0 \\ - E^{1} \land E^{2} & 0 & 0 \\ 0 & 0 & 0 \end{matrix})

(196)

which clearly demonstrates what we just stated. The two-dimensional thermodynamical subspace spanned by the generalized temperatures

λ_{1}, λ_{3}

is a portion of a constant curvature manifold, namely a portion of a hyperbolic plane, since the negative signature of the metric implies a negative value of the constant curvature. In any case, the constancy of the thermodynamical curvature signals the absence of any critical point and phase transition.

6. Generalized Thermodynamics à la Souriau on Kähler Non-Compact $U / H$ .s

Having clarified the status of generalized thermodynamics associated with the Geodesic Dynamical System and, in general, with Integrable Dynamical Systems, it clearly appears that the corresponding Gibbs Probability Distributions are of little use for Machine Learning algorithms, since they are non-trivial distributions only in momentum space and not on the very manifolds

U / H

that constitute the hidden layers of a neural network. Although nothing can be excluded a priori, what one needs in the ML context are non-trivial probability distributions on the very manifolds

U / H

that constitute the layers of the network. Such distributions are provided by generalized thermodynamics à la Souriau as schematically defined in Equations (32)–(35). Such a generalized thermodynamics replaces the Hamiltonians in involution of integrable systems with the moment-maps

P_{A} (Υ)

of a typically non-abelian, actually semisimple or simple, Lie algebra

U

that has a Poissonian realization:

\{P_{A}, P_{B}\} = f_{A B}^{C} P_{C}

(197)

and exists only on Kähler manifolds, as we extensively discussed in the introduction, analyzing the original conception of Barberesco et al. based on the notion of coadjoint orbits. As one sees, generalized thermodynamics à la Souriau is almost the opposite of the generalized thermodynamics related to integrable systems and with the corresponding Liouville Hamiltonians in involution. Its relevance precisely resides in the non-abelian character of the algebra satisfied by the Hamiltonians, that is, the algebra of Killing vector fields of the corresponding Riemannian manifold. The principle underlying the construction of generalized thermodynamics à la Souriau is another one, totally different from Liouville integrability: it is the convergence of the partition function integral (32), namely

Z_{K} (β) \equiv \int_{U / H} exp [- β \cdot P (Y)] \underset{n - times}{\underset{︸}{K \land K \land \dots \land K}} < \infty

(198)

that poses constraints on the vector

β

of generalized temperatures which, generically, identifies an element of the Lie algebra

U

in the chosen basis

t_{A}

of its generators:

β^{A} t_{A} \in U

(199)

The determination of the subset

Ω \subset U

(typically not a subalgebra), for which the convergence constraint (198) is satisfied, encodes the new quality of this peculiar Kählerian generalized thermodynamics whose use in Machine Learning appears to be much more promising than generalized thermodynamics based on integrable dynamical systems.

We will demonstrate how the conditions defining the subspace

Ω

of generalized temperatures is distinctively handy when one uses solvable coordinates, namely while relying on the metric equivalence of non-compact

U / H

symmetric spaces with their corresponding solvable Lie Group Manifolds

S_{U / H}

. Indeed, the inequalities defining the range of

β

are successively extracted from the negativity requirement of the quadratic term in Gaussian integrals over the real line:

\int_{- \infty}^{\infty} e^{- α_{2} x^{2} - α_{1} x} d x = \frac{\sqrt{π} e^{\frac{α_{1}^{2}}{4 α_{2}}}}{\sqrt{α_{2}}} iff α_{2} > 0

(200)

To arrive at this, we first have to single out, among the non-compact symmetric manifolds U/H, the series of Kählerian ones. There are essentially two series, since, as we explained in the introduction, the Kähler character of a coset manifold

U / H

is uniquely signaled by the presence of a

u (1) ≃ so (2)

addend in the compact subalgebra

H

. Recalling the results and the discussion of the foundational paper [1] we see that the two series are:

M^{[2, q]} \equiv \frac{SO (2, 2 + q)}{SO (2) \times SO (2 + q)}; {SH}_{n} \equiv \frac{Sp (2 n, R)}{U (1) \times SU (n)} \underset{by Cayley map}{\underset{︸}{=}} \frac{USp (n, n)}{U (1) \times SU (n)}

(201)

For the generalized Cayley map, see for instance [36] (page 356, formulae (7.2.12)–(7.2.13)) and more generally [58]. The first series was already mentioned in Equation (36) and corresponds to an entire Tits Satake Universality class, the common Tits Satake submanifold of the entire class being:

M_{T S}^{[2, q]} = M^{[2, 1]} = \frac{SO (2, 3)}{SO (2) \times SO (3)} \underset{\begin{matrix} spinor double \\ covering \end{matrix}}{\underset{︸}{≃}} \frac{Sp (4, R)}{U (1) \times SU (2)}

(202)

As we see from Equation (204), the Tits Satake submanifold of the first series is equivalent, due to the low-dimensional Lie algebra isomorphisms, to the second manifold in the second series. Indeed, the second series is the series of Siegel half spaces of genus n and

\frac{Sp (4, R)}{U (1) \times SU (2)}

is a double covering of

\frac{SO (2, 3)}{SO (2) \times SO (3)}

obtained through the use of the four-dimensional spinor representation of

SO (2, 3)

rather than its five-dimensional vector one (see [1,9]). The first manifold

n = 1

of the second series is also the Tits Satake submanifold of the Hyperbolic Space universality class:

M^{[1, 1 + q]} \equiv \frac{SO (1, 1 + q)}{SO (1 + q)}

(203)

In the above series, the manifolds are not Kählerian except for the case

q = 1

, which also corresponds to the Tits Satake submanifold of the entire class.

M_{T S}^{[1, q]} = M^{[1, 1]} = \frac{SO (1, 2)}{SO (2)} \underset{\begin{matrix} spinor double \\ covering \end{matrix}}{\underset{︸}{≃}} \frac{Sp (2, R)}{U (1)} = \frac{SL (2, R)}{SO (2)} \underset{by Cayley map}{\underset{︸}{=}} \frac{SU (1, 1)}{U (1)}

(204)

Finally, in view of the visions of Paint Group invariance and of its relevance in ML algorithms discussed in [1,4] one realizes that the most interesting Kähler manifolds are those of the series

M^{[2, q]}

in (201) that are also named Calabi–Vesentini manifolds. An interesting point, whose implications for ML are all to be studied, is that the Calabi–Vesentini manifolds times a hyperbolic plane constitute Special Kähler manifolds (see [1,58]) and can be described by a suitable section of the corresponding flat holomorphic symplectic bundle, as recalled in [1].

6.1. The General Setup

Having singled out the series of non-compact symmetric spaces

U / H

that are Kählerian and correspondingly apt to support generalized thermodynamics à la Souriau let us develop the general setup for the construction of this latter.

For all

U / H

, we have the two sinergic Lie algebra decompositions:

\begin{matrix} U & = & H \oplus K \end{matrix}

(205)

\begin{matrix} U & = & H + S o l v_{U / H} \end{matrix}

(206)

where

H

is the maximal compact subalgebra of

U

and

K

constitutes a linear representation of

H

under its adjoint action, but it is not a closed subalgebra:

[H, K] \subset K; [K, K] ⊈ K rather [K, K] \subset H

(207)

while

S o l v_{U / H}

, that has the same dimension as

K

, is a closed Lie subalgebra (a solvable one), but it is not a linear representation of

H

under its adjoint action:

[H, S o l v_{U / H}] ⊈ S o l v_{U / H}; [S o l v_{U / H}, S o l v_{U / H}] \subset S o l v_{U / H}

(208)

In the case when

U / H

is Kählerian, we have the additional essential property:

H = H_{0} \oplus {so}_{c} (2); [H_{0}, H_{0}] \subset H_{0}; [H_{0}, {so}_{c} (2)] = 0

(209)

In all cases (compare with Section 4.2), we construct the

U

-invariant metric on

U / H

utilizing the vielbein extracted from the left-invariant matrix 1-form

Θ

of the metric equivalent solvable Lie group

S_{U / H} \subset U

:

Θ \equiv L^{- 1} (Υ) d L (Υ)

(210)

by projecting it onto an orthonormal basis of

K

generators:

\begin{matrix} V^{A} & \equiv & Tr (Θ \cdot K^{A}); Tr (K^{A} \cdot K^{B}) = δ^{A B} \\ d s_{U / H}^{2} & = & V^{A} \times V^{B} δ_{A B} \end{matrix}

(211)

and we find that we can equivalently write (compare with Equation (101)):

d s_{U / H}^{2} = \underset{const . symm .}{\underset{︸}{κ_{A B}}} e^{A} \times e^{B}

(212)

where the 1-forms

e^{A}

are defined by the expansion of

Θ

along a basis of generators

T_{A}

of the solvable Lie algebra

S o l v_{U / H}

:

Θ = e^{A} T_{A}

(213)

since the relation between the two synergic decompositions (205) and (206) of the same Lie algebra

U

implies that there always exists a constant matrix

ν

such that (compare with Equation (163)):

V^{A} = ν_{B}^{A} e^{B} \Rightarrow κ = ν^{T} \cdot ν

(214)

In the Kählerian case, we have the additional item of the Kähler 2-form whose form is general in terms of the matrix representing the adjoint action of the

{so}_{c} (2)

generator on the space

K

. Let us name

X^{c}

such a generator and construct its d-dimensional matrix representation on

K

:

K_{A B}^{c} = δ_{A C} δ_{B D} Tr ([X^{c}, K^{C}] \cdot K^{D})

(215)

Since

{so}_{c} (2)

is a compact subalgebra, the matrix

K^{c}

is necessarily antisymmetric, and the

H

invariance of the metric guarantees that the Kähler 2-form defined by:

K \equiv K_{A B}^{c} V^{A} \land V^{B}

(216)

is necessarily closed. The proof is simple. By definition of the Levi-Civita connection in vielbein/spin-connection formalism we have:

d V^{A} = ω^{A C} \land V^{C}

(217)

where

ω^{I J} = - ω^{J I}

is valued in the d-dimensional representation

K

of the

H

Lie algebra. Using (217), we obtain

d K = - \underset{= 0}{\underset{︸}{{[K^{c}, ω]}_{I J}}} V^{I} \land V^{J}

(218)

Indeed,

K^{c}

is the

{so}_{c} (2)

subalgebra, and it commutes with the whole

H

-algebra.

The next question concerns the Killing vector fields and their Hamiltonian representation. It is important to stress another clear-cut distinction. The symmetric space

U / H

is diffeomorphic to the solvable Lie group

S_{U / H}

and the latter, as any Lie group manifold

G

, possesses two commuting sets of vector fields

t_{A}^{[L / R]}

satisfying the Lie algebra

G

of the group:

[t_{A}^{[L / R]}, t_{B}^{[L / R]}] = f_{A B}^{C} t_{B}^{[L / R]} C; [t_{A}^{[L]}, t_{B}^{[R]}] = 0

(219)

the left-invariant ones

t_{A} \equiv t_{A}^{[L]}

, dual to the left-invariant 1-forms

e^{B}

, according to Equations (104) and (105), generate right-translations, while the right-invariant ones

t_{A}^{[R]}

, dual to the right-invariant 1-forms

e_{[R]}^{A}

defined by the expansion along a generator basis

T_{A}

of

S o l v_{U / H}

of the right-invariant matrix 1-form:

Θ_{[R]} \equiv d L \cdot L^{- 1} = e_{[R]}^{A} T_{A}; e_{[R]}^{A} (t_{A}^{[R]}) = δ_{B}^{A}

(220)

generate the left-translations. For this reason, only the right-invariant vector fields

t_{A}^{[R]}

are Killing vector fields of the symmetric space metric (211) and, as such, they are also symplectic Killing vector fields for the symplectic structure provided by the Käehler 2-form in Equation (216):

ℓ_{t_{A}^{[R]}} K = 0

(221)

where

ℓ_{t}

denotes the Lie derivative along the vector field

t

. The right-invariant vector fields

t_{A}^{[R]}

are not the only Killing vectors and hence symplectic Killing vectors: Indeed, the symmetric space metric is invariant with respect to the whole group

U

and we need a set of Killing vector fields satisfying the whole

U

Lie algebra. According to the decomposition (206), we just have to add the Killing vector fields spanning the compact subalgebra

H

. The question is how to obtain their explicit expression.

6.1.1. General Construction Method of the Killing Vector Fields

In order to construct the expression in solvable coordinate

Υ

of the Killing vector fields associated with the compact generators, we utilize the following procedure, which is general and applies to any Killing vector field.

From the general theory of coset manifolds (see, for instance, [55], second volume, section 5.2.3, page 114 and the following ones), we know that when we act on the left on a coset representative

L (y)

with any element

g \in U

we have:

g L (y) = L (g (y)) \cdot h (y, g); h (y, g) \in H \subset U

(222)

where

y^{'} = g (y)

is the coordinate of the new point in the coset reached by the g transformation and

h (y, g)

, which lies in the subgroup, is named the compensator. Typically, the determination of the compensator is a cumbersome task; yet, in the solvable parameterization, there is a well-defined universal algorithm for the determination of

g (y)

that bypasses the determination of the compensator. Our coset representative is an element of the solvable Lie group and as demonstrated in [1,2] we can always use for any

U / H

the so-called triangular basis, where the solvable coset representative is an upper triangular matrix. Hence, the solvable coset representative

L (Υ)

is upper triangular, and the matrices

h \in H

of the compact subgroup, in particular the compensators, are all orthogonal

h \cdot h^{T} = 1

. It follows that defining the symmetric matrix:

M (Υ) \equiv L (Υ) \cdot L^{T} (Υ)

(223)

We have

\forall g \in U : g \cdot M (Υ) \cdot g^{T} = M (g (Υ))

(224)

In order to obtain

g (Υ)

, which is our goal, it suffices to utilize the finite recursive Cholewski–Crout algebraic algorithm (see [1,2]) that, given

M (g (Υ))

, uniquely determines the upper triangular matrix

L (g (y))

such that:

M (g (Υ)) = L (g (Υ)) \cdot L^{T} (g (Υ))

(225)

Then applying the inverse of the

Σ

exponential map according to the definitions and conventions of [1,2] we get:

Σ^{- 1} [L (g (Υ))] = g {(Υ)}^{α} T_{α}

(226)

In Equation (226), consider the compact group subgroup elements of the form:

g_{i} [θ] = exp [θ J_{i}]

(227)

where

J_{i}

(

i = 1, \dots, m = \dim H

) is a basis of generators of the compact subalgebra

H

. The

g_{i} [θ]

are elements of the m one-parameter subgroups of

H

. Expanding in a power series of

θ

, we get:

g_{i} [θ] {(Υ)}^{α} = Υ^{α} + f_{i}^{α} (Υ) + O (θ^{2})

(228)

The searched for Killing vector fields are:

k_{i} \equiv f_{i}^{α} (Υ) \frac{\partial}{\partial Y^{α}}

(229)

and note that by construction they satisfy the Lie algebra

H

and are symplectic Killing vector fields, according to the definition (221). Note also that the above construction of the Killing vector fields that was indispensable for the compact subalgebra

H

could also be applied to the determination of the Killing vector fields associated with the solvable Lie subalgebra, if we did not know them independently, or to those associated with the

K

-generators, corresponding to the decomposition (205). Indeed, the above procedure is completely general for any set of generators

J_{Λ}

of the entire Lie algebra

U

.

6.1.2. The General Form of the Moment-Maps

Given any set

k_{Λ}

of Killing vector fields, each uniquely associated with a generator

J_{Λ}

of the Lie algebra

U

via the construction mentioned above in Equations (227) and (229), one obtains the corresponding moment map via another fully general formula which is extremely simple:

\begin{matrix} P & : & U ⟶ C^{\infty} (\frac{U}{H}) \\ P_{Λ} (Υ) & = & \frac{1}{2} Tr [X_{c} \cdot L^{- 1} (Υ) \cdot J_{Λ} \cdot L (Υ)] \end{matrix}

(230)

The functions

P_{Λ} (Υ)

satisfy the necessary condition with respect to the Killing vector fields

k_{Λ}

:

i_{k_{Λ}} K = d P_{Λ} (Υ)

(231)

and their definition (230) has a very simple interpretation; they are the projection onto the

so {(2)}_{c}

central subalgebra defining the Kähler structure of the adjoint transformation of the generator

J_{λ}

. An important comment is the following. Since all the non-compact symmetric spaces

U / H

are Hadamard–Cartan manifolds diffeomorphic to

R^{n}

(see [2]), they can be covered by just one open chart, and the solvable coordinates

Υ

provide such a chart. For this reason, the moment maps in solvable coordinates

P_{Λ} (Υ)

are globally defined functions over the whole manifold and Equation (231) holds true globally.

6.1.3. The Partition Function and the Gibbs Probability Distribution

Equipped with the above general weapons, one can address the conditions on the temperature vector

β

for the convergence of the integral (198) defining the partition function

Z (β)

. The first observation is that in the privileged solvable coordinate chart, the volume form reduces to:

\underset{n times}{\underset{︸}{K \land K \land \dots \land K}} = const \times \underset{Cartan coordinates}{\underset{︸}{d Y^{1} \land \dots \land d Y^{r_{n . c .}}}} \underset{nilpotent coordinates}{\underset{︸}{d Y^{r_{n . c .} + 1} \land \dots \land d Y^{2 n}}}

(232)

where

r_{n . c .}

is the non-compact rank of the considered Kählerian symmetric space

U / H

. If we choose the first series in Equation (201), then the non-compact rank is always

r_{n . c .}

and we just have two solvable coordinates associated with the unique two Cartan generators, while all the other coordinates

Y^{r_{n . c .} + 1}, \dots, Y^{2 n}

are associated with nilpotent generators corresponding to long and short roots, the latter organized in two multiplets assigned to the fundamental representation of the Paint Group

G_{Paint} = SO (q)

. On the contrary, all the Kähler manifolds in the second series are maximally split, and the non-compact rank increases linearly in n. In any case, the strategy to calculate the partition function is that of starting with the integration one-by-one on the nilpotent coordinates, which leads each time to a Gaussian integral of the form (200) and to a corresponding positivity constraint on the

β

vector. As we show in the two explicitly constructed examples, the integration of the nilpotents for

r_{n . c .} = 2

can be explicitly performed analytically until we reach a function to be integrated on the Cartan coordinates, which can be verified to be explicitly positive definite and exponentially decreasing at infinity so as to guarantee partition function convergence. In the simplest case of the hyperbolic plane, we can also perform the last integration analytically, obtaining the partition function in closed form and the thermodynamical metric. For the Siegel

n = 2

plane, the last integration over the Cartan fields has to be done numerically, yet all the thermodynamical quantities are accessible as compiled functions. The extension of the results for the Siegel plane to all the manifolds of the first series in (201) via a convenient use of Paint Group invariance is an issue of further research. In Appendix D, we present the preliminary setup calculation for the case

q = 2

.

6.2. Generalized Thermodynamics à la Souriau of the Poincaré–Lobachevsky Hyperbolic Plane $H_{2}$

Let us begin with the simplest example, namely with the hyperbolic plane

H_{2}

. Utilizing the representation:

H_{2} = \frac{SL (2, R)}{SO (2)}

(233)

and following all the conventions of [1], we write the generators of the full

SL (2, R)

Lie algebra in two ways, namely in the orthogonal decomposition:

sl (2, R) = so (2) \oplus K

(234)

and in the solvable subalgebra decomposition:

sl (2, R) = so (2) + S o l v_{2}

(235)

For the 1-dimensional compact subalgebra

H = so (2)

, both in Equations (234) and (235), we choose the same generator:

X_{c} = (\begin{matrix} 0 & 1 \\ - 1 & 0 \end{matrix})

(236)

The two generators of the orthogonal non-compact space

K

are given by:

K_{1} = (\begin{matrix} \frac{1}{\sqrt{2}} & 0 \\ 0 & - \frac{1}{\sqrt{2}} \end{matrix}), K_{2} = (\begin{matrix} 0 & \frac{1}{\sqrt{2}} \\ \frac{1}{\sqrt{2}} & 0 \end{matrix})

(237)

while our chosen basis of generators for the solvable Lie algebra is:

S o l v_{2} = span \{T_{1} T_{2}\}; T_{1} = (\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}); T_{2} = (\begin{matrix} 0 & 1 \\ 0 & 0 \end{matrix})

(238)

As explained in [1], the general form of a solvable Lie group element is:

L (Υ) = (\begin{matrix} e^{Y_{1}} & e^{Y_{1}} Y_{2} \\ 0 & e^{- Y_{1}} \end{matrix})

(239)

and the left-invariant matrix 1-form decomposes as follows in the solvable Lie algebra basis:

Θ \equiv L^{- 1} d L = e^{1} T_{1} + e^{2} T_{2}

(240)

where the two left-invariant 1-forms

e^{1, 2}

have the following appearance:

\begin{matrix} e^{1} & = & d Y_{1} \\ e^{2} & = & 2 Y_{2} d Y_{1} + d Y_{2} \end{matrix}

(241)

The zweibein of the 2-dimensional space is instead defined by the projection of the left-invariant 1-form along the

K

generators:

V^{i} = T r [Θ \cdot K_{i}]

(242)

and one obtains the relation:

V = ν e; ν = (\begin{matrix} \sqrt{2} & 0 \\ 0 & \frac{1}{\sqrt{2}} \end{matrix})

(243)

so that the norm form on the solvable Lie algebra mentioned in Equation (101) is as follows:

κ = ν^{T} \cdot ν = (\begin{matrix} 2 & 0 \\ 0 & \frac{1}{2} \end{matrix})

(244)

Defining the adjoint action of the unique

so (2)

generator on the

K_{i}

generators and hence on the zweibein

V

:

K^{c} = {adj}_{X_{c}} [K] = Tr ([X_{c}, K_{i}] . K_{j}) = (\begin{matrix} 0 & - 1 \\ 1 & 0 \end{matrix})

(245)

we obtain the Kähler 2-form:

K = K_{i j}^{c} V^{i} \land V^{j} = - 2 V^{1} \land V^{2} = - 2 d Y_{1} \land d Y_{2}

(246)

whose closure

d K = 0

is immediately verified.

The Kähler metric and its inverse are immediately calculated from the above data:

g_{i j} = (\begin{matrix} 2 (Y_{2}^{2} + 1) & Y_{2} \\ Y_{2} & \frac{1}{2} \end{matrix}); g^{- 1 | i j} = (\begin{matrix} \frac{1}{2} & - Y_{2} \\ - Y_{2} & 2 (Y_{2}^{2} + 1) \end{matrix})

(247)

and the complex structure tensor in the solvable coordinate basis is obtained from the explicit form of the Kähler 2-form provided in Equation (246):

J_{c} = K^{c} \cdot g^{- 1} = (\begin{matrix} Y_{2} & - 2 (Y_{2}^{2} + 1) \\ \frac{1}{2} & - Y_{2} \end{matrix}); J_{c}^{2} = - 1

(248)

The metric (247) is explicitly Hermitian with respect to the complex structure (248):

J_{c} \cdot g \cdot J_{c}^{T} = g

(249)

The left-invariant 1-forms

e

on the solvable Lie group manifold

S_{2}

, in terms of which we have constructed the Kähler metric, satisfy the Maurer–Cartan equation written below that defines the structure constant of the solvable Lie algebra

S o l v_{2}

d e^{1} = 0; d e^{2} + 2 e^{1} \land e^{2} = 0

(250)

The same Maurer–Cartan Equation (250) is also satisfied by the right-invariant 1-forms obtained by decomposing the right-invariant matrix 1-form

d L \cdot L

.

Following all the procedures detailed in the previous Section 6.1 we obtain the explicit form of the three Killing vector fields in solvable coordinates:

\begin{matrix} k_{0} & = & e^{2 Y_{1}} Y_{2} \frac{\partial}{\partial Y^{1}} + (e^{- 2 Y_{1}} - e^{2 Y_{1}} (Y_{2}^{2} + 1)) \frac{\partial}{\partial Y^{2}} \\ k_{1} & = & \frac{\partial}{\partial Y^{1}} \\ k_{2} & = & exp [- 2 Y^{1}] \frac{\partial}{\partial Y^{2}} \end{matrix}

(251)

where

k_{0}

is the Killing vector field associated with compact generator

X_{c}

, while

k_{1, 2}

are the Killing vector fields associated with the solvable Lie algebra generators

T_{1, 2}

. The corresponding moment maps calculated by means of Equation (230) are the following ones:

\begin{matrix} P_{0} (Υ) & = & \frac{1}{2} e^{- 2 Y_{1}} (- e^{4 Y_{1}} (Y_{2}^{2} + 1) - 1) \\ P_{1} (Υ) & = & - Y_{2} \\ P_{2} (Υ) & = & - \frac{1}{2} e^{- 2 Y_{1}} \end{matrix}

(252)

6.2.1. Calculation of the Partition Function

For simplicity of writing, naming

α, β, γ

the three components

β^{0}, β^{1}, β^{2}

of the temperature vector

β

, and

x = Y_{1}, y = Y_{2}

the two solvable coordinates, the partition function to be computed is the following:

Z (α, β, γ) = \int_{- \infty}^{\infty} d x \int_{- \infty}^{\infty} d y exp [- \frac{1}{2} e^{- 2 x} (α + γ + α e^{4 x} (y^{2} + 1) + 2 β e^{2 x} y)]

(253)

and using the convergence condition of the Gaussian integrals recalled in Equation (200), we get the following constraints:

α > 0; α (α + γ) - β^{2} > 0

(254)

When the above conditions are satisfied, the integrals are easily calculated, and we obtain:

Z (α, β, γ) = \frac{π e^{- \sqrt{α (α + γ) - β^{2}}}}{\sqrt{α (α + γ) - β^{2}}}

(255)

The corresponding Gibbs probability distribution takes the following appearance:

G (α, β, γ, Y^{1}, Y^{2}) = \frac{\sqrt{α (α + γ) - β^{2}} exp [\sqrt{α (α + γ) - β^{2}} - \frac{1}{2} e^{- 2 Y_{1}} (α e^{4 Y_{1}} (Y_{2}^{2} + 1) + α + 2 β e^{2 Y_{1}} Y_{2} + γ)]}{π}

(256)

Equations (254)–(256) take a nicer form by a linear redefinition of the temperatures:

α = δ + ζ; γ = - 2 ζ; β = β

(257)

where

δ, ζ

are the new temperature parameters. We get:

\begin{matrix} δ & > & 0; δ^{2} - β^{2} - ζ^{2} > 0 \end{matrix}

(258)

\begin{matrix} Z (δ, β, ζ) & = & \frac{π e^{- \sqrt{δ^{2} - β^{2} - ζ^{2}}}}{\sqrt{δ^{2} - β^{2} - ζ^{2}}} \end{matrix}

(259)

\begin{matrix} G (δ, β, ζ, Y^{1}, Y^{2}) & = & \frac{\sqrt{- β^{2} + δ^{2} - ζ^{2}} exp (\sqrt{- β^{2} + δ^{2} - ζ^{2}} - \frac{1}{2} e^{- 2 Y_{1}} (2 β e^{2 Y_{1}} Y_{2} + e^{4 Y_{1}} (Y_{2}^{2} + 1) (δ + ζ) + δ - ζ))}{π} \end{matrix}

(260)

It appears from Equation (258) that the Souriau

Ω \subset sl (2, R)

subspace of allowed temperatures is a cone in the three-dimensional Lie algebra space:

= \{(\begin{matrix} β & δ - ζ \\ - δ - ζ & - β \end{matrix}) \in sl (2, R) ∣ δ > 0, δ^{2} - β^{2} - ζ^{2} > 0\}

(261)

as it is shown in Figure 1.

6.2.2. Visualization of the Gibbs Probability Distributions

In the perspective of Data Science applications, the temperature vectors

β = {δ, β, ζ} \in Ω

define, as the Gibbs states (260), probability distributions over the symmetric space

U / H = SL (2, R) / O (2)

, namely the Poincaré hyperbolic plane. Such Gibbs states are the appropriate generalizations to not-flat Cartan–Hadamard spaces of the familiar Gaussian distributions pertaining to flat space. The temperature vector models such distributions. It is very helpful to visualize the Gibbs states (260) utilizing the disk model of the Poincaré plane:

Disk = \{\{x, y\} \in R^{2} ∣ x^{2} + y^{2} < 1\}

(262)

The relation between the solvable coordinates

Y_{1}, Y_{2}

, and the coordinates

x, y

is provided by the following formula (see [1,4]):

Y_{2} = \frac{4 y}{x^{2} + y^{2} - 1}, Y_{1} = log (- \frac{x^{2} + y^{2} - 1}{x^{2} - 2 x + y^{2} + 1})

(263)

Substituting Equation (263) in Equation (260) and furthermore utilizing, polar coordinates in the

β, ζ

plane of the temperature space:

β = μ cos (θ); ζ = μ sin (θ); 0 < μ < δ

(264)

we obtain the following three-parameter family of probability distributions over the disk (262):

\begin{matrix} G (δ, μ, θ, x, y) = \frac{\sqrt{δ^{2} - μ^{2}}}{π} \times \\ \times exp [\sqrt{δ^{2} - μ^{2}} - \frac{{(x^{2} - 2 x + y^{2} + 1)}^{2} (δ - μ sin (θ) + \frac{(\frac{16 y^{2}}{{(x^{2} + y^{2} - 1)}^{2}} + 1) {(x^{2} + y^{2} - 1)}^{4} (δ + μ sin (θ))}{{(x^{2} - 2 x + y^{2} + 1)}^{4}} + \frac{8 μ y cos (θ) (x^{2} + y^{2} - 1)}{{(x^{2} - 2 x + y^{2} + 1)}^{2}})}{2 {(x^{2} + y^{2} - 1)}^{2}}] \end{matrix}

(265)

We present in Figure 2 a few examples of plots of such probability distributions.

6.2.3. The Kähler Geothermodynamic Metric and Curvature

The convex cone conditions (258) defining the Souriau temperature space in the case

so (2, 1) ≃ su (1, 1) ≃ sl (2, R)

were found, in different notations and setups, also by the authors of [30,31,32,33,34]. These authors interpreted the cone as the future-directed light cone of three-dimensional Minkowski space, yet this is just a special feature of the low-dimensional case of

U / H

Kähler manifolds under consideration. What is general is that the temperature associated with the

{so (2)}_{c}

subalgebra defining the Kähler structure must be strictly positive, and then the temperatures associated with the other generators receive constraints in terms of the latter in order to maintain convergence of the other Gaussian integrals.

Independent of such observations, Gibbs states are parameterized probability distributions that can be used to interpolate data by fitting their parameters, namely their temperatures. Generalized thermodynamics provides a metric on the space of temperatures and therefore yields a distance between two distributions, each corresponding to an equilibrium state.

According to the general theory discussed in previous sections, we have the stochastic Hamiltonian:

H^{s t o c h} \equiv - log [Z (δ, β, ζ)] = \sqrt{- β^{2} + δ^{2} - ζ^{2}} + \frac{1}{2} log [- β^{2} + δ^{2} - ζ^{2}] - log (π)

(266)

and according to Equation (47) we calculate Shannon Information Functional:

I = H^{s t o c h} - (δ \frac{\partial}{\partial δ} + β \frac{\partial}{\partial β} + ζ \frac{\partial}{\partial ζ}) H^{s t o c h} = \frac{1}{2} (log [\frac{δ^{2} - β^{2} - ζ^{2}}{π^{2}}] - 2)

(267)

As we see the Information Functional tends to

- \infty

on the boundary of the cone of Figure 1, confirming that the norm

N (β)

of the temperature vector is the analogue of the inverse thermodynamical temperature

N (β) = \sqrt{δ^{2} - β^{2} - ζ^{2}} \sim \frac{1}{T}

(268)

and that Shannon Information Functional

I

is just minus the Thermodynamical Entropy S:

I \sim - S

(269)

When temperature T goes to ∞, the Thermodynamical Entropy S goes to infinity, and the information content of the Gibbs probability distribution is largely negative since we have maximal disorder. On the contrary, when

T \to 0

, which means that

N (β) \to \infty

the Information Entropy tends to ∞ as well, logarithmically slowly, since we have a lot of information. As an illustration of this basic feature, the reader is referred to Figure 3.

Recalling next the general form of the thermodynamical metric expressed in terms of the Hessian of the stochastic Hamiltonian (see Equation (69)), we write

\begin{matrix} d s_{g e o t h e r m}^{2} & = & \frac{\partial^{2} H^{s t o}}{\partial β^{i} \partial β^{j}} d β^{i} \times d β^{j} \\ = & \frac{1}{N^{4}} \times \{{d β}^{2} (- (β^{2} + (N + 1) (δ - ζ) (δ + ζ))) + 2 (N + 2) β d β \times (δ d δ - ζ d ζ) \\ - {d δ}^{2} (δ^{2} + β^{2} (N + 1) + ζ^{2} (N + 1)) + 2 (N + 2) δ \times ζ \times d δ \times d ζ + {d ζ}^{2} ((N + 1) (β - δ) (β + δ) - ζ^{2})\} \end{matrix}

(270)

where

N

is a shorthand for

N (β)

as defined in Equation (268) and

β \equiv {δ, β, ζ}

.

We are interested in calculating the Riemannian curvature of the metric (270), but in order to better understand the intrinsic properties of the curvature 2-form, we prefer to study it in the anholonomic basis provided by the vielbein formalism. For this reason, we need to construct a suitable dreibein that reproduces (270) as the sum of squares of appropriate 1-forms:

d s_{g e o t h e r m}^{2} = - \sum_{i = 1}^{3} V^{i} \times V^{i}

(271)

With some work, we have found that the following dreibein fulfils its own job:

\begin{matrix} V^{1} & = & \frac{1}{N^{2} \sqrt{N + 2} (δ - ζ)} \times \{- β d β (N + 2) (δ - ζ) + d δ (β^{2} (N + 1) + (δ - ζ) (δ - ζ (N + 1))) \\ + d ζ ((δ - ζ) (δ - ζ + δ N) - β^{2} (N + 1))\} \\ V^{2} & = & \frac{\sqrt{N + 1} (d β (δ - ζ) + β (d ζ - d δ))}{N (δ - ζ)} \\ V^{3} & = & (d δ - d ζ) \sqrt{\frac{- N^{2} + N + 2}{(4 - N^{2}) {(δ - ζ)}^{2}}} \end{matrix}

(272)

and using the MATHEMATICA code Vielbgrav23 developed by one of us (see [9]), we have found the explicit form of the spin connection and of the curvature 2-form:

d V^{i} + ω^{i j} \land V^{k} η_{j k} = 0; R^{i j} \equiv d ω^{i j} + ω^{i k} \land ω^{ℓ j} η_{k ℓ}; η = diag (- 1, - 1, - 1)

(273)

We got:

R^{i j} = (\begin{matrix} 0 & G (β) V^{2} \land V^{3} + F (β) V^{1} \land V^{2} & Q (β) V^{1} \land V^{3} \\ - G (β) V^{2} \land V^{3} - F (β) V^{1} \land V^{2} & 0 & P (β) V^{2} \land V^{3} + G (β) V^{1} \land V^{2} \\ - Q (β) V^{2} \land V^{3} & - P (β) V^{2} \land V^{3} - G (β) V^{1} \land V^{2} & 0 \end{matrix})

(274)

where the four coefficients

F (β), G (β), Q (β), P (β)

are not constants; rather, they have a non-trivial dependence on the temperature vector components. However, they display the

SO {(2)}_{c}

invariance of the geothermic metric. Indeed using the polar parameterization (264) of the two non-compact temperatures

β, ζ

, we find that the intrinsic components of the curvature 2-form,

F (β), G (β), Q (β), P (β)

, depend only on

δ, μ

, and do not depend on the angle

θ

. Explicitly, we found:

\begin{matrix} F & = & \frac{N_{F}}{D_{F}} \\ N_{F} & = & - (δ^{4} - 2 δ^{2} μ^{2} - 4 δ^{2} + μ^{4} + 4 μ^{2}) (δ^{8} - 4 δ^{6} μ^{2} + 71 δ^{6} + 6 δ^{4} μ^{4} - 213 δ^{4} μ^{2} + 384 δ^{4} - 4 δ^{2} μ^{6} \\ + 213 δ^{2} μ^{4} - 768 δ^{2} μ^{2} - 426 δ^{2} μ^{2} \sqrt{δ^{2} - μ^{2}} + 426 δ^{2} \sqrt{δ^{2} - μ^{2}} - 426 μ^{2} \sqrt{δ^{2} - μ^{2}} + 104 \sqrt{δ^{2} - μ^{2}} \\ - 13 μ^{6} \sqrt{δ^{2} - μ^{2}} + 39 δ^{2} μ^{4} \sqrt{δ^{2} - μ^{2}} + 213 μ^{4} \sqrt{δ^{2} - μ^{2}} + 284 δ^{2} + 13 δ^{6} \sqrt{δ^{2} - μ^{2}} - 39 δ^{4} μ^{2} \sqrt{δ^{2} - μ^{2}} \\ + 213 δ^{4} \sqrt{δ^{2} - μ^{2}} + μ^{8} - 71 μ^{6} + 384 μ^{4} - 284 μ^{2} + 16) \\ D_{F} & = & 4 (\sqrt{δ^{2} - μ^{2}} + 1) {(\sqrt{δ^{2} - μ^{2}} + 2)}^{3} {(\sqrt{δ^{2} - μ^{2}} + δ^{2} - μ^{2})}^{2} \times \\ \times (\sqrt{δ^{2} - μ^{2}} - δ^{2} + μ^{2} + 2) (3 \sqrt{δ^{2} - μ^{2}} + δ^{2} - μ^{2} + 2) \end{matrix}

(275)

\begin{matrix} G & = & \frac{N_{G}}{D_{G}} \\ N_{G} & = & (δ^{2} - μ^{2}) {(\frac{- δ^{2} + μ^{2} + 4}{\sqrt{δ^{2} - μ^{2}} - δ^{2} + μ^{2} + 2})}^{3 / 2} (δ^{6} - 3 δ^{4} μ^{2} + 26 δ^{4} + 3 δ^{2} μ^{4} \\ - 52 δ^{2} μ^{2} - 16 δ^{2} μ^{2} \sqrt{δ^{2} - μ^{2}} + 44 δ^{2} \sqrt{δ^{2} - μ^{2}} - 44 μ^{2} \sqrt{δ^{2} - μ^{2}} + 20 \sqrt{δ^{2} - μ^{2}} + 8 μ^{4} \sqrt{δ^{2} - μ^{2}} \\ + 41 δ^{2} + 8 δ^{4} \sqrt{δ^{2} - μ^{2}} - μ^{6} + 26 μ^{4} - 41 μ^{2} + 4) \\ D_{G} & = & 2 (\sqrt{δ^{2} - μ^{2}} + 1) {(\sqrt{δ^{2} - μ^{2}} + 2)}^{5 / 2} {(\sqrt{δ^{2} - μ^{2}} + δ^{2} - μ^{2})}^{2} (3 \sqrt{δ^{2} - μ^{2}} + δ^{2} - μ^{2} + 2) \end{matrix}

(276)

\begin{matrix} Q & = & \frac{N_{Q}}{D_{Q}} \\ N_{Q} & = & {(δ^{2} - μ^{2} - 4)}^{4} (δ^{6} - 3 δ^{4} μ^{2} + 25 δ^{4} + 3 δ^{2} μ^{4} - 50 δ^{2} μ^{2} - 16 δ^{2} μ^{2} \sqrt{δ^{2} - μ^{2}} + 38 δ^{2} \sqrt{δ^{2} - μ^{2}} \\ - 38 μ^{2} \sqrt{δ^{2} - μ^{2}} + 8 \sqrt{δ^{2} - μ^{2}} + 8 μ^{4} \sqrt{δ^{2} - μ^{2}} + 28 δ^{2} + 8 δ^{4} \sqrt{δ^{2} - μ^{2}} - μ^{6} + 25 μ^{4} - 28 μ^{2}) \\ D_{Q} & = & 4 {(\sqrt{δ^{2} - μ^{2}} + 2)}^{6} {(\sqrt{δ^{2} - μ^{2}} - δ^{2} + μ^{2} + 2)}^{4} \end{matrix}

(277)

\begin{matrix} P & = & \frac{N_{P}}{D_{P}} \\ N_{P} & = & {(δ^{2} - μ^{2} - 4)}^{2} (δ^{2} - μ^{2}) (14 δ^{10} - 70 δ^{8} μ^{2} + 340 δ^{8} + 140 δ^{6} μ^{4} - 1360 δ^{6} μ^{2} + 1562 δ^{6} - 140 δ^{4} μ^{6} \\ + 2040 δ^{4} μ^{4} - 4686 δ^{4} μ^{2} + 1864 δ^{4} + 70 δ^{2} μ^{8} - 1360 δ^{2} μ^{6} + 4686 δ^{2} μ^{4} - 3728 δ^{2} μ^{2} \\ - 4030 δ^{2} μ^{2} \sqrt{δ^{2} - μ^{2}} + 1210 δ^{2} \sqrt{δ^{2} - μ^{2}} - 1210 μ^{2} \sqrt{δ^{2} - μ^{2}} + 136 \sqrt{δ^{2} - μ^{2}} - μ^{10} \sqrt{δ^{2} - μ^{2}} \\ + 5 δ^{2} μ^{8} \sqrt{δ^{2} - μ^{2}} + 89 μ^{8} \sqrt{δ^{2} - μ^{2}} - 356 δ^{2} μ^{6} \sqrt{δ^{2} - μ^{2}} - 869 μ^{6} \sqrt{δ^{2} - μ^{2}} + 2607 δ^{2} μ^{4} \sqrt{δ^{2} - μ^{2}} \\ + 2015 μ^{4} \sqrt{δ^{2} - μ^{2}} + 524 δ^{2} + δ^{10} \sqrt{δ^{2} - μ^{2}} \\ - 5 δ^{8} μ^{2} \sqrt{δ^{2} - μ^{2}} + 89 δ^{8} \sqrt{δ^{2} - μ^{2}} - 356 δ^{6} μ^{2} \sqrt{δ^{2} - μ^{2}} + 869 δ^{6} \sqrt{δ^{2} - μ^{2}} + 10 δ^{6} μ^{4} \sqrt{δ^{2} - μ^{2}} \\ - 2607 δ^{4} μ^{2} \sqrt{δ^{2} - μ^{2}} + 2015 δ^{4} \sqrt{δ^{2} - μ^{2}} - 10 δ^{4} μ^{6} \sqrt{δ^{2} - μ^{2}} + 534 δ^{4} μ^{4} \sqrt{δ^{2} - μ^{2}} - 14 μ^{10} + 340 μ^{8} \\ - 1562 μ^{6} + 1864 μ^{4} - 524 μ^{2} + 16) \\ D_{P} & = & 4 {(\sqrt{δ^{2} - μ^{2}} + 1)}^{2} {(\sqrt{δ^{2} - μ^{2}} + 2)}^{3} {(\sqrt{δ^{2} - μ^{2}} + δ^{2} - μ^{2})}^{2} \times \\ \times {(\sqrt{δ^{2} - μ^{2}} - δ^{2} + μ^{2} + 2)}^{2} {(3 \sqrt{δ^{2} - μ^{2}} + δ^{2} - μ^{2} + 2)}^{2} \end{matrix}

(278)

The behavior of these four functions is displayed in two pictures in Figure 4.

As it becomes clear from the previous detailed discussion, the generalized thermodynamics à la Souriau yields a 3-dimensional temperature space equipped with a completely non-trivial metric which describes the distance between different Gibbs state probability distributions. The richness and non-triviality of this generalized thermodynamics is to be contrasted with the essentially trivial thermodynamics (Ideal-Gas-like) associated with the geodesic dynamical system and with any other conceivable integrable dynamical system. What one needs in Machine Learning algorithms are probability distributions on the very base manifold, constituting the mathematical model of the hidden layers. Probability distributions on the fibres of the tangent bundle are not that useful in this context.

6.3. Generalized Thermodynamics à la Souriau of the Siegel Half Plane ${SH}_{2}$

The solvable coordinate description of the Siegel half-plane

{SH}_{2}

and its theory are presented in section 7.2 of the foundational paper [1] written by two of us together with Ugo Bruzzo. To that paper and to that section, we refer the reader for all the items we use here to derive the generalized thermodynamics à la Souriau of this manifold. First of all, we recall from the introduction to section 7.2 of [1] two conceptual points that are relevant from the perspective of Machine Learning applications.

In the four papers [1,2,3,4] by means of which the new paradigm of Cartan Neural Networks was elaborated and presented to the scientific community, we mainly focused on the series of

U / H

manifolds of the type:

M^{[r, q]} \equiv \frac{SO (r, r + q)}{SO (r) \times SO (r + q)}

(279)

In order to enlighten the role of non-maximal split manifolds having a non-trivial Tits Satake projection, it is conjectured to be a universal mechanism for clustering of data. In particular we analyzed the cases of non-compact rank

r = 1, 2

and in [1], at the beginning of section 7.2, we wrote:

We focus on the case

r = 2

of the series of symmetric manifolds (279) in a completely synoptic setup with respect to our previous treatment of the case

r = 1

. The motivation for this synopsis is twofold:

1.: On one hand we want to stress that the $r = 1$ case is completely aligned with all the subsequent $r > 1$ ones and that the Tits Satake projection, which went unnoticed and unexploited by the authors in [59,60,61], is actually the conceptual back-bone for all the members of the considered series of manifolds.
2.: On the other hand we want to emphasize that the $r = 2$ and $r = 1$ cases are twins inside the entire series since their respective Tits Satake submanifolds $M^{[1, 1]}, M^{[2, 1]}$ are just the first and the second instance of a Siegel upper complex plane, which is the appropriate generalization of the Lobachevsky-Poincaré hypebolic plane. Instead, for values $r > 2$ , the Tits Satake submanifold, that, by definition, is always a maximally split symmetric space, is not a further instance of a Siegel upper complex plane. Indeed the appearance of the first two Siegel planes is strictly linked with the low rank sporadic isomorphisms of simple Lie algebras.

As we already noticed above from the point of view of Souriau generalized thermodynamics, the relevant

U / H

are the Kählerian ones, which essentially means the two infinite series mentioned in Equation (201). The second series (the Siegel planes) is made of maximally split symmetric manifolds, while the first series displays, for

q > 1

, the Tits Satake projection mechanism gives rise to non-trivial Paint Group invariances. Hence, if we want to join the Tits Satake projection with generalized thermodynamics à la Souriau, we have to choose the first series

M^{[2, q]}

. Yet, due to low-dimensional Lie algebra isomorphism, the Tits Satake submanifold

M^{[2, 1]}

for the TS universality class

M^{[2, q]}

is locally isomorphic (by double covering) to the second element of the second series, namely the Siegel plane

{SH}_{2}

, as already emphasized in Equation (204). These preliminary observations are instrumental to understand the reason behind our following exposition which, summarizing and importing the results of section 7.2 of [1], emphasizes the double description of

{SH}_{2}

in terms of the 4-dimensional spinor representation of

SO (2, 3)

, identified with the

Sp (4, R)

fundamental representation, and in terms of the 5-dimensional vector, defining, representation of

SO (2, 3)

. This is due to our interest in extending the results inherent to the Tits Satake submanifold to the full TS-universality class by relying on Paint Group covariance. Indeed, in Appendix D we provide a preliminary study of the generalized thermodynamics setup for the case

M^{[2, 2]}

in order to emphasize the role of Paint Group in this context.

After these preliminary clarifications, we start the analysis of

{SH}_{2}

following section 7.2 of our foundational paper. The relation between the spinor and the vector representation that provides the local isomorphism of

SO (2, 3)

with

Sp (4, R)

is given by the gamma-matrices

Γ_{i}

,

i = 1, \dots, 5

. Those well adapted to the triangular basis, which is that where the preserved metric with signature

(2, 3)

is the

η_{t}

-matrix in Equation (7.34) of [1] are displayed in Equation (7.41) of the same paper. The charge conjugation matrix

C_{s}

that becomes the symplectic invariant form of

Sp (4, R)

is displayed in Equation (7.37) of the same paper. The 10 generators

J_{i j}

of the

so (2, 3) ≃ sp (4, R)

Lie algebra are defined in Equation (7.39) of [1] as commutators of gamma matrices. The explicit expression of the double covering of

SO (2, 3)

group by means of

Sp (4, R)

group is provided by Equation (7.44) of the same reference that we repeat here for the reader’s convenience:

\forall S \in Sp (4, R) : O_{j}^{i} [S] \equiv \frac{1}{4} Tr (Γ_{i}^{T} S^{- 1} Γ_{j} S) \in SO (2, 3)

(280)

The fundamental next step is the construction of the solvable coset representative both in the vector and in the spinor representation. We utilize the notations and the results of [1] section 7.2. Hence, we use the letter W instead of

Υ

for the vector of solvable coordinates, namely of parameters of the solvable Lie group metrically equivalent to

U / H

. Thinking of

SO (2, 3)

as the Tits Satake subgroup of

SO (2, 2 + 2 s)

the solvable coordinate vector is the one given in Equation (7.52) of [1], namely:

W = \{w_{1}, w_{2}, w_{3}, w_{4}, w_{5}, \underset{(2 s - 1)}{\underset{︸}{0, \dots, 0}}, w_{6}, \underset{(2 s - 1)}{\underset{︸}{0, \dots, 0}}\}

(281)

the zeros corresponding to the Tits Satake projection. The corresponding solvable group element constructed with the

Σ

-exponential map (see a discussion of its definition in [2]) in terms of the solvable coordinates is the following one:

L (W) = (\begin{matrix} e^{w_{1}} & \frac{e^{w_{1}} w_{3}}{\sqrt{2}} & \frac{1}{2} e^{w_{1}} (w_{3} w_{6} + \sqrt{2} w_{5}) & \frac{1}{8} e^{w_{1}} (- \sqrt{2} w_{3} {w_{6}}^{2} + 4 \sqrt{2} w_{4} - 4 w_{5} w_{6}) & - \frac{1}{4} e^{w_{1}} (2 w_{3} w_{4} + {w_{5}}^{2}) \\ 0 & e^{w_{2}} & \frac{e^{w_{2}} w_{6}}{\sqrt{2}} & - \frac{1}{4} e^{w_{2}} {w_{6}}^{2} & - \frac{e^{w_{2}} w_{4}}{\sqrt{2}} \\ 0 & 0 & 1 & - \frac{w_{6}}{\sqrt{2}} & - \frac{w_{5}}{\sqrt{2}} \\ 0 & 0 & 0 & e^{- w_{2}} & - \frac{e^{- w_{2}} w_{3}}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & e^{- w_{1}} \end{matrix})

(282)

which is Equation (7.54) of [1]. Imposing the identification:

O [S_{s} (W)] = L (W)

(283)

One finds a unique solution for the

S_{s}

that is the following:

S_{s} [W] = (\begin{matrix} e^{\frac{1}{2} (- w_{1} - w_{2})} & \frac{1}{2} e^{\frac{1}{2} (w_{2} - w_{1})} w_{6} & \frac{1}{4} e^{\frac{1}{2} (w_{1} + w_{2})} (w_{5} w_{6} - 2 \sqrt{2} w_{4}) & \frac{1}{4} e^{\frac{1}{2} (w_{1} - w_{2})} (2 w_{5} + \sqrt{2} w_{3} w_{6}) \\ 0 & e^{\frac{1}{2} (w_{2} - w_{1})} & \frac{1}{2} e^{\frac{1}{2} (w_{1} + w_{2})} w_{5} & \frac{e^{\frac{1}{2} (w_{1} - w_{2})} w_{3}}{\sqrt{2}} \\ 0 & 0 & e^{\frac{1}{2} (w_{1} + w_{2})} & 0 \\ 0 & 0 & - \frac{1}{2} e^{\frac{1}{2} (w_{1} + w_{2})} w_{6} & e^{\frac{1}{2} (w_{1} - w_{2})} \end{matrix})

(284)

6.3.1. The Siegel Upper Plane

For completeness, we import from [1] the necessary concepts and formulae for the Siegel upper plane. This is particularly important in order to compare our results with those of Barbaresco et al. [30,31,32,33,34]. The Siegel upper complex plane of degree (or genus) g is the generalization to higher dimensions of the Lobachevsky–Poincaré hyperbolic plane.

Just as the standard hyperbolic plane with the Poincaré metric is a complex analytic realization of a maximally split symmetric space, namely

SL (2, R) / SO (2)

, in the same way, the upper Siegel plane of degree g is the complex analytic realization of the symmetric space:

M_{S i e g e l} = \frac{Sp (2 g, R)}{S [U (1) \times U (g)]}

(285)

The key observation is the following. Just in the same way as the fractional linear transformation:

z \to \tilde{z} \equiv \frac{a z + b}{c z + d}; (\begin{matrix} a & b \\ c & d \end{matrix}) \in PSL (2, R)

(286)

maps complex numbers z with strictly positive imaginary part into complex numbers

\tilde{z}

with the same property, the fractional linear matrix transformation:

Z_{g \times g} \to {\tilde{Z}}_{g \times g} \equiv (A_{g \times g} Z_{g \times g} + B_{g \times g}) \cdot {(C_{g \times g} Z_{g \times g} + D_{g \times g})}^{- 1}; (\begin{matrix} A_{g \times g} & B_{g \times g} \\ C_{g \times g} & D_{g \times g} \end{matrix}) \in Sp (2 g, R)

(287)

maps complex symmetric matrices:

Z_{g \times g} = X_{g \times g} + i Y_{g \times g}; Z_{g \times g}^{T} = Z_{g \times g}

(288)

whose imaginary part

Y_{g \times g}

is positive definite (namely has strictly positive eigenvalues) into complex symmetric matrices

{\tilde{Z}}_{g \times g}

with the same property. The relations among the

g \times g

blocks:

A^{T} C = C^{T} A; B^{T} D = D^{T} B; A^{T} D - C^{T} B = 1

(289)

following from the very definition of the

Sp (2 g, R)

group, are instrumental in the lengthy yet straightforward proof of what was stated above.

The number of real components of

Z_{g \times g}

exactly matches the dimension of the maximally split symmetric space defined in Equation (285) so that the upper Siegel plane constitutes its holomorphic realization. Furthermore, the choice of the Borel solvable subgroup inside

Sp (2 g, R)

provides a convenient parameterization of the matrix

Z_{g \times g}

. Indeed, this latter is the orbit under the fractional linear action of the Borel subgroup of the special matrix

Z_{0} = i 1_{g \times g}

.

Applying this idea to the case

g = r = 2

which is ours, and utilizing the parameterization of the Borel solvable subgroup provided in Equation (284), we obtain:

\begin{matrix} Z & = & X + i Y \\ X & = & (\begin{matrix} \frac{1}{8} (w_{6} (4 w_{5} + \sqrt{2} w_{3} w_{6}) - 4 \sqrt{2} w_{4}) & \frac{1}{4} (2 w_{5} + \sqrt{2} w_{3} w_{6}) \\ \frac{1}{4} (2 w_{5} + \sqrt{2} w_{3} w_{6}) & \frac{w_{3}}{\sqrt{2}} \end{matrix}) \\ Y & = & (\begin{matrix} \frac{1}{4} e^{- w_{1} - w_{2}} (e^{2 w_{2}} w_{6}^{2} + 4) & \frac{1}{2} e^{w_{2} - w_{1}} w_{6} \\ \frac{1}{2} e^{w_{2} - w_{1}} w_{6} & e^{w_{2} - w_{1}} \end{matrix}) \end{matrix}

(290)

According to Equation (287) the action of any element

g \in Sp (4, R)

on the coset manifold is the fractional linear transformation of the symmetric complex matrix:

Z = (\begin{matrix} z & ω \\ ω & ζ \end{matrix}); z, ω, ζ \in C

(291)

which represents the entire manifold. When Z is diagonal, namely when

ω = 0

, the two remaining complex entries

z, ζ

represent the coordinates of two hyperbolic upper planes. The subgroup

Γ \subset Sp (4, R)

which respects diagonality, namely the condition

ω = 0

is

Γ = SL (2, R) \times SL (2, R)

.

It is convenient to recall Equation (290), which provides the parameterization of the complex matrix (291) in terms of the solvable coordinates

w_{1}, w_{2}, w_{3}, w_{4}, w_{5}, w_{6}

that can be summarized by stating:

\begin{matrix} z & = & \frac{1}{8} (- 4 \sqrt{2} w_{4} + w_{6} (4 w_{5} + \sqrt{2} w_{3} w_{6}) + 2 i e^{- w_{1} - w_{2}} (e^{2 w_{2}} w_{6}^{2} + 4)) \\ ζ & = & \frac{w_{3}}{\sqrt{2}} + i e^{w_{2} - w_{1}} \\ ω & = & \frac{1}{4} (2 w_{5} + 2 i e^{w_{2} - w_{1}} w_{6} + \sqrt{2} w_{3} w_{6}) \end{matrix}

(292)

As one sees from Equation (292), the diagonalization condition of the matrix Z corresponds to setting

w_{5} = w_{6} = 0

, which implies that the other two solvable coordinates

w_{3}, w_{4}

obtain the interpretation of real parts of the complex coordinates

ζ

and z, respectively.

The three complex numbers

z, ζ, ω

can be utilized as complex coordinates of the symmetric space and the Kähler metric can be derived from a suitable Kähler potential. Similarly, the Kählerian moment maps for all the Killing vector fields can be obtained from the Kähler potential, just as the Kähler 2-form. This, however, is not the best approach for our goals. In order to construct the generalized thermodynamics à la Souriau, it is much more convenient to utilize real solvable coordinates and obtain the Kähler 2-form just as we did in the Poincaré case from the unique

U (1)

generator. This is what we do in next subsection.

6.3.2. The Kähler 2-Form, the Killing Vector Fields, and the Moment Maps

In Order to construct all the items of generalized thermodynamics we need the vielbein, the Kähler 2-form and the moment maps of all Killing vectors. To this effect we need a well-normalized basis of generators of the full

U

Lie algebra. Such a basis is presented in two versions in the spinor representation in Table 1 and in the vector representation in Table 2.

The generators in the two lists are in one-to-one correspondence and satisfy the Lie algebra commutation relations with the very same structure constants. The 10 generators, irrespectively of their label s or v are ordered in the following way:

T_{1, \dots, 6}^{s / v} = K_{i}

(293)

are the 6 non-compact coset generators spanning the vector subspace

K

in the orthogonal decomposition

U = H \oplus K

(294)

Furthermore, the first two generators

T_{1, 2}^{s / v}

are the two non-compact Cartan generators.

The generators:

T_{7, 8, 9}^{s / v} = H_{1, 2, 3}

(295)

are the generators of the

su (2) ≃ so (3)

subalgebra of

H = su (2) \oplus u (1)

.

Finally,

T_{10}^{s / v} = H_{0}

(296)

is the

u (1) ≃ so (2)

generator responsible for the Kähler structure.

Introducing the left-invariant 1-form in either the spinor or the vector form and projecting it onto the

K

-subspace, we find the sechsbein of the 6-dimensional space which turns out to be the same in the two cases, as it should. We choose the vector form and we write:

\begin{matrix} Θ^{v} & \equiv & L^{- 1} (W) \cdot d L (W) \\ e^{i} & = & Tr (Θ^{v} \cdot K_{i}^{†}) \\ δ_{i j} & = & Tr (K_{i} \cdot K_{j}^{†}); Tr (H_{i} \cdot K_{j}^{†}) = 0 normalization of the adjoints K_{j}^{†} \end{matrix}

(297)

, and explicitly we have:

\begin{matrix} e^{1} & = & d w_{1} \\ e^{2} & = & d w_{2} \\ e^{3} & = & \frac{1}{2} (d w_{3} + d w_{1} w_{3} - d w_{2} w_{3}) \\ e^{4} & = & \frac{1}{8} (w_{6}^{2} (- (d w_{3} + d w_{1} w_{3} - d w_{2} w_{3})) - 2 \sqrt{2} w_{6} (d w_{5} + d w_{1} w_{5}) + 4 (d w_{4} + (d w_{1} + d w_{2}) w_{4})) \\ e^{5} & = & \frac{1}{4} (2 d w_{5} + 2 d w_{1} w_{5} + \sqrt{2} w_{6} (d w_{3} + d w_{1} w_{3} - d w_{2} w_{3})) \\ e^{6} & = & \frac{1}{2} (d w_{6} + d w_{2} w_{6}) \end{matrix}

(298)

In this case, the sechsbein coincide with the left-invariant 1-forms and satisfy the Maurer–Cartan equations of the solvable Lie group; however, this is not particularly relevant for our goals. The explicit form of the metric is given by:

d s_{S i e g e l}^{2} = \sum_{i = 1}^{6} e^{i} \times e^{i}

(299)

The Kähler 2-form is obtained by constructing the adjoint representation of the generator

H_{0}

on the subspace

K

. Working in the vector representation and defining the normalized generators:

K_{i} = \frac{1}{\sqrt{2}} T_{i}^{v} (i, 1, \dots, 6); Tr (K_{i} \cdot K_{j}) = δ_{i j}

(300)

we obtain

[H_{0}, K_{i}] = {({AdjH}_{0})}_{i j} K_{j}

(301)

where

{AdjH}_{0} = (\begin{matrix} 0 & 0 & - \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} & 0 & 0 \\ \frac{1}{\sqrt{2}} & - \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 \\ - \frac{1}{\sqrt{2}} & - \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & - 1 \\ 0 & 0 & 0 & 0 & 1 & 0 \end{matrix})

(302)

and the Kähler 2-form takes the form:

\begin{matrix} K & = & {({AdjH}_{0})}_{i j} e^{i} \land e^{j} \\ = & - \sqrt{2} e^{1} \land e^{3} + \sqrt{2} e^{1} \land e^{4} + \sqrt{2} e^{2} \land e^{3} + \sqrt{2} e^{2} \land e^{4} - 2 e^{5} \land e^{6} \end{matrix}

(303)

The explicit expression of the Kähler 2 in the solvable coordinate basis is obtained from Equation (303) by substituting the explicit form of the sechsbein (298); so doing, one immediately verifies that the 2-form

K

is closed, namely

d K = 0

.

Then, utilizing the vector representation and utilizing the method described in Section 6.1.1 we derive the explicit form of the 10 Killing vector fields expressed in terms of the solvable coordinate basis associated with the Lie algebra generators, as ordered and displayed in Table 2. We obtain the following result. The 6 Killing vector fields, generating the coset translations associated with the

K

-generators, have the following explicit form:

\begin{matrix} k_{1} & = & \partial_{1} \\ k_{2} & = & \partial_{2} \\ k_{3} & = & - \frac{1}{2} e^{w_{1} - w_{2}} w_{3} \partial_{1} + \frac{1}{2} e^{w_{1} - w_{2}} w_{3} \partial_{2} + (\frac{1}{2} e^{w_{1} - w_{2}} (w_{3}^{2} + 2) + e^{w_{2} - w_{1}}) \partial_{3} + \frac{1}{4} e^{w_{1} - w_{2}} (w_{5} - w_{6}) (w_{5} + w_{6}) \partial_{4} \\ - \frac{e^{w_{1} - w_{2}} w_{6}}{\sqrt{2}} \partial_{5} + \frac{e^{w_{1} - w_{2}} w_{5}}{\sqrt{2}} \partial_{6} \\ k_{4} & = & - \frac{1}{2} e^{w_{1} + w_{2}} w_{4} \partial_{1} + \frac{1}{4} e^{w_{1} + w_{2}} (\sqrt{2} w_{5} w_{6} - 2 w_{4}) \partial_{2} + \frac{1}{4} e^{w_{1} + w_{2}} (w_{5}^{2} + \sqrt{2} w_{3} w_{6} w_{5} - w_{6}^{2}) \partial_{3} \\ + (\frac{1}{16} e^{w_{1} + w_{2}} (8 w_{4}^{2} - 4 \sqrt{2} w_{5} w_{6} w_{4} + {(w_{6}^{2} + 4)}^{2}) + e^{- w_{1} - w_{2}}) \partial_{4} \\ + \frac{e^{w_{1} + w_{2}} w_{6} (w_{6}^{2} + 4)}{4 \sqrt{2}} \partial_{5} - \frac{e^{w_{1} + w_{2}} w_{5} (w_{6}^{2} + 4)}{4 \sqrt{2}} \partial_{6} \\ k_{5} & = & - \frac{1}{2} e^{w_{1}} w_{5} \partial_{1} - \frac{e^{w_{1}} w_{3} w_{6}}{2 \sqrt{2}} \partial_{2} - \frac{e^{w_{1}} (w_{3}^{2} + 2) w_{6}}{2 \sqrt{2}} \partial_{3} + \frac{e^{w_{1}} w_{6} (w_{6}^{2} + 2 w_{3} w_{4} + 4)}{4 \sqrt{2}} \partial_{4} \\ + (\frac{1}{4} e^{w_{1}} (w_{5}^{2} + 2 w_{6}^{2} + 2 w_{3} w_{4} + 4) + e^{- w_{1}}) \partial_{5} + \frac{e^{w_{1}} (w_{3} (w_{6}^{2} + 4) - 4 w_{4})}{4 \sqrt{2}} \partial_{6} \\ k_{6} & = & - \frac{1}{2} e^{w_{2}} w_{6} \partial_{2} - \frac{1}{2} e^{w_{2}} (\sqrt{2} w_{5} + w_{3} w_{6}) \partial_{3} + (\frac{e^{- w_{2}} w_{5}}{\sqrt{2}} + \frac{1}{2} e^{w_{2}} w_{4} w_{6}) \partial_{4} \\ + \frac{e^{- w_{2}} (e^{2 w_{2}} w_{4} - w_{3})}{\sqrt{2}} \partial_{5} + (\frac{1}{4} e^{w_{2}} (w_{6}^{2} + 4) + e^{- w_{2}}) \partial_{6} \end{matrix}

(304)

The 3 Killing vector fields closing the

su (2)

Lie subalgebra of the isotropy algebra

H

have the following explicit form:

\begin{matrix} k_{7} & = & \frac{1}{2} e^{w_{2}} w_{6} \partial_{2} + \frac{1}{2} e^{w_{2}} (\sqrt{2} w_{5} + w_{3} w_{6}) \partial_{3} + (\frac{e^{- w_{2}} w_{5}}{\sqrt{2}} - \frac{1}{2} e^{w_{2}} w_{4} w_{6}) \partial_{4} \\ - \frac{e^{- w_{2}} (w_{3} + e^{2 w_{2}} w_{4})}{\sqrt{2}} \partial_{5} + (e^{- w_{2}} - \frac{1}{4} e^{w_{2}} (w_{6}^{2} + 4)) \partial_{6} \\ k_{8} & = & \frac{1}{2} e^{w_{1}} w_{5} \partial_{1} + \frac{e^{w_{1}} w_{3} w_{6}}{2 \sqrt{2}} \partial_{2} + \frac{e^{w_{1}} (w_{3}^{2} + 2) w_{6}}{2 \sqrt{2}} \partial_{3} - \frac{e^{w_{1}} w_{6} (w_{6}^{2} + 2 w_{3} w_{4} + 4)}{4 \sqrt{2}} \partial_{4} \\ + (e^{- w_{1}} - \frac{1}{4} e^{w_{1}} (w_{5}^{2} + 2 w_{6}^{2} + 2 w_{3} w_{4} + 4)) \partial_{5} + \frac{e^{w_{1}} (4 w_{4} - w_{3} (w_{6}^{2} + 4))}{4 \sqrt{2}} \partial_{6} \\ k_{9} & = & \frac{e^{w_{1} - w_{2}} (w_{3} + e^{2 w_{2}} w_{4})}{2 \sqrt{2}} \partial_{1} + \frac{1}{4} e^{w_{1} - w_{2}} (e^{2 w_{2}} (\sqrt{2} w_{4} - w_{5} w_{6}) - \sqrt{2} w_{3}) \partial_{2} \\ + \frac{e^{- w_{1} - w_{2}} (e^{2 w_{2}} (4 - e^{2 w_{1}} (w_{5}^{2} + \sqrt{2} w_{3} w_{6} w_{5} - w_{6}^{2})) - 2 e^{2 w_{1}} (w_{3}^{2} + 2))}{4 \sqrt{2}} \partial_{3} \\ + \frac{e^{- w_{1} - w_{2}} (e^{2 w_{1}} (- 4 w_{5}^{2} + 4 w_{6}^{2} - e^{2 w_{2}} (8 w_{4}^{2} - 4 \sqrt{2} w_{5} w_{6} w_{4} + {(w_{6}^{2} + 4)}^{2})) + 16)}{16 \sqrt{2}} \partial_{4} \\ + \frac{1}{8} e^{w_{1} - w_{2}} w_{6} (4 - e^{2 w_{2}} (w_{6}^{2} + 4)) \partial_{5} + \frac{1}{8} e^{w_{1} - w_{2}} w_{5} (e^{2 w_{2}} (w_{6}^{2} + 4) - 4) \partial_{6} \end{matrix}

(305)

Finally, the Killing vector field associated with the

u (1)

subalgebra of the

H

isotropy algebra has the following explicit form:

\begin{matrix} k_{10} & = & \frac{(e^{w_{1} - w_{2}} w_{3} - e^{w_{1} + w_{2}} w_{4})}{2 \sqrt{2}} \partial_{1} - \frac{1}{4} e^{w_{1} - w_{2}} (\sqrt{2} w_{3} + e^{2 w_{2}} (\sqrt{2} w_{4} - w_{5} w_{6})) \partial_{2} \\ + \frac{e^{- w_{1} - w_{2}} (e^{2 w_{2}} (e^{2 w_{1}} (w_{5}^{2} + \sqrt{2} w_{3} w_{6} w_{5} - w_{6}^{2}) + 4) - 2 e^{2 w_{1}} (w_{3}^{2} + 2))}{4 \sqrt{2}} \partial_{3} \\ + \frac{e^{- w_{1} - w_{2}} (e^{2 w_{1}} (- 4 w_{5}^{2} + 4 w_{6}^{2} + e^{2 w_{2}} (8 w_{4}^{2} - 4 \sqrt{2} w_{5} w_{6} w_{4} + {(w_{6}^{2} + 4)}^{2})) - 16)}{16 \sqrt{2}} \partial_{4} \\ + \frac{1}{8} e^{w_{1} - w_{2}} w_{6} (e^{2 w_{2}} (w_{6}^{2} + 4) + 4) \partial_{5} - \frac{1}{8} e^{w_{1} - w_{2}} w_{5} (e^{2 w_{2}} (w_{6}^{2} + 4) + 4) \partial_{6} \end{matrix}

(306)

Last but not least, we need the moment maps associated with each of the above Killing vector fields. To this effect, we utilize the general method described in Section 6.1.2 and we apply Formula (230). Hence, we write:

P_{Λ} (W) = \frac{1}{2} Tr (T_{10}^{v} \cdot L^{- 1} (W) \cdot T_{Λ}^{v} \cdot L (W)); Λ = 1, \dots, 10

(307)

and the explicit result that we obtain is displayed below:

\begin{matrix} \begin{matrix} P_{1} & = & \frac{1}{16} (4 \sqrt{2} w_{4} - 4 w_{5} w_{6} - \sqrt{2} w_{3} (w_{6}^{2} + 4)) \\ P_{2} & = & \frac{4 w_{4} + w_{3} (w_{6}^{2} + 4)}{8 \sqrt{2}} \\ P_{3} & = & \frac{1}{32} (e^{w_{1} - w_{2}} (\sqrt{2} (w_{6}^{2} + 4) w_{3}^{2} + 4 w_{5} w_{6} w_{3} + 2 \sqrt{2} (w_{5}^{2} + 4)) - 2 \sqrt{2} e^{w_{2} - w_{1}} (w_{6}^{2} + 4)) \\ P_{4} & = & \frac{1}{64} e^{- w_{1} - w_{2}} (16 \sqrt{2} - e^{2 (w_{1} + w_{2})} (8 \sqrt{2} w_{4}^{2} - 8 w_{5} w_{6} w_{4} + \sqrt{2} (w_{5}^{2} + 4) (w_{6}^{2} + 4))) \\ P_{5} & = & \frac{1}{32} e^{w_{1}} (- 4 \sqrt{2} w_{4} w_{5} + \sqrt{2} w_{3} (w_{6}^{2} + 4) w_{5} - 4 w_{3} w_{4} w_{6} + 2 (w_{5}^{2} - 4) w_{6}) - \frac{1}{4} e^{- w_{1}} w_{6} \\ P_{6} & = & \frac{1}{16} e^{- w_{2}} (2 \sqrt{2} (w_{3} - e^{2 w_{2}} w_{4}) w_{6} + w_{5} (e^{2 w_{2}} (w_{6}^{2} + 4) + 4)) \end{matrix} \\ \begin{matrix} P_{7} & = & \frac{1}{16} e^{- w_{2}} (2 \sqrt{2} (w_{3} + e^{2 w_{2}} w_{4}) w_{6} + w_{5} (4 - e^{2 w_{2}} (w_{6}^{2} + 4))) \\ P_{8} & = & - \frac{1}{4} e^{- w_{1}} w_{6} - \frac{1}{32} e^{w_{1}} (- 4 \sqrt{2} w_{4} w_{5} + \sqrt{2} w_{3} (w_{6}^{2} + 4) w_{5} - 4 w_{3} w_{4} w_{6} + 2 (w_{5}^{2} - 4) w_{6}) \\ P_{9} & = & \frac{1}{64} e^{- w_{1} - w_{2}} [- 4 e^{2 w_{2}} (w_{6}^{2} + 4) - 2 e^{2 w_{1}} ((w_{6}^{2} + 4) w_{3}^{2} + 2 \sqrt{2} w_{5} w_{6} w_{3} + 2 (w_{5}^{2} + 4)) \\ + e^{2 (w_{1} + w_{2})} (8 w_{4}^{2} - 4 \sqrt{2} w_{5} w_{6} w_{4} + (w_{5}^{2} + 4) (w_{6}^{2} + 4)) + 16] \end{matrix} \\ \begin{matrix} P_{10} & = & \frac{1}{64} e^{- w_{1} - w_{2}} [- 4 e^{2 w_{2}} (w_{6}^{2} + 4) - 2 e^{2 w_{1}} ((w_{6}^{2} + 4) w_{3}^{2} + 2 \sqrt{2} w_{5} w_{6} w_{3} + 2 (w_{5}^{2} + 4)) \\ - e^{2 (w_{1} + w_{2})} (8 w_{4}^{2} - 4 \sqrt{2} w_{5} w_{6} w_{4} + (w_{5}^{2} + 4) (w_{6}^{2} + 4)) - 16] \end{matrix} \end{matrix}

(308)

In Equation (308), we have separated the moment-maps into the three groups. The first group of six are the moment maps of the

K

translations. The subsequent group of three yields the moment maps of the

su (2)

compact generators, while the last group of just one is the moment map of the

u (1)

generator associated with the Kähler structure.

7. On the Partition Function and Gibbs Distributions in General and for ${SH}_{2}$ in Particular

Before addressing the determination of the partition function and the Gibbs distributions for the case of the Siegel half plane, which is now accessible, since we have prepared all the necessary instruments, we pause for a moment to consider a very important general property of the temperature vectors

β

. We might have anticipated the forthcoming discussion from previous sections, yet we chose to postpone it to the present junction since we are now in a favorable position to illustrate it with a concrete and non-trivial example. Once again, the metric equivalence of the non-compact symmetric spaces with a solvable Lie group manifold

S_{U / H}

and the double synergic decompositions (205) and (206) play an essential role.

The solvable Lie algebra generators can always be reexpressed as suitable linear combinations of the

K_{i}

coset generators and of the

H_{α}

compact subalgebra generators. Let us name

T_{i}

(

i = 1, \dots, 6

) the solvable Lie algebra generators in our

{SH}_{2}

case. With reference to Table 2 we have:

T_{i} = \underset{6 \times 6}{\underset{︸}{Q_{i j}}} K_{j} + \underset{6 \times 4}{\underset{︸}{Q_{i α}}} H_{α} : \{\begin{matrix} T_{1} & = & K_{1} \\ T_{2} & = & K_{2} \\ T_{3} & = & \frac{1}{\sqrt{2}} H_{0} + \frac{1}{\sqrt{2}} H_{3} + K_{3} \\ T_{4} & = & - \frac{1}{\sqrt{2}} H_{0} + \frac{1}{\sqrt{2}} H_{3} + K_{4} \\ T_{5} & = & H_{2} + K_{5} \\ T_{6} & = & H_{1} + K_{6} \end{matrix}

(309)

The same linear transformation applies to the moment maps and allows to find the moment maps of the Killing vectors associated to the solvable Lie algebra generators. Hence instead of using the basis

(K, H)

for the moment-maps and the temperature vector

β

, we can use the basis

(S o l v, H)

and write the argument of the exponential in the partition function as

\hat{β} \cdot \hat{P} (W) = {\hat{β}}^{Λ} {\hat{P}}_{Λ} (W)

(310)

where now

\begin{matrix} {\hat{P}}_{Λ} (W) & = & \frac{1}{2} Tr (H_{0} \cdot L^{- 1} (W) \cdot {\hat{T}}_{Λ} \cdot L (W)); Λ = 1, \dots, 10 \\ {\hat{T}}_{Λ} & = & \{T_{1} \dots, T_{6}, H_{1}, \dots, H_{4}\} \end{matrix}

(311)

Consider next the formal definition of the partition function

\begin{matrix} Z (\hat{β}) & = & \int exp [- \hat{β} \cdot \hat{P} (W)] μ (W) \end{matrix}

(312)

\begin{matrix} μ (W) & \equiv & K \land K \land K ≃ \underset{just a constant}{\underset{︸}{\sqrt{\det [g (W)]}}} d^{6} W \end{matrix}

(313)

where

μ (W)

in Equation (313) is the integration measure that is invariant with respect to isometries of the coset manifold

U / H

. Recalling next the metric equivalence of the coset manifold with the solvable Lie group

S_{U / H}

that has free transitive action on the base manifold, we consider any group element

s_{u} \in S_{U / H}

whose corresponding parameters we name

U

. By definition, the abstract group element is represented in five dimensions by the matrix

L (U)

, and we have:

\begin{matrix} s_{u} : L (W) ⟶ L (U) \cdot L (W) = L (s_{u} (W)) \end{matrix}

(314)

Since the integration measure is invariant under the solvable Lie group translations that are isometries, we can change integration variables and write:

Z (\hat{β}) = \int exp [- \hat{β} \cdot \hat{P} (s_{u} (W))] μ (s_{u} (W)) = \int exp [- \hat{β} \cdot \hat{P} (s_{u} (W))] μ (W)

(315)

Focusing on the argument of the exponential integrand, we have:

\begin{matrix} \hat{β} \cdot \hat{P} (s_{u} (W)) & = & {\hat{β}}^{Λ} \times \frac{1}{2} Tr (H_{0} \cdot L^{- 1} (W) \cdot L^{- 1} (U) \cdot {\hat{T}}_{Λ} \cdot L (U) \cdot L (W)) \\ = & {\hat{β}}^{Λ} \times Adj {(s_{u})}_{Λ}^{Σ} \times \frac{1}{2} Tr (H_{0} \cdot L^{- 1} (W) \cdot {\hat{T}}_{Σ} \cdot L (W)) \\ = & {\hat{β}}^{Λ} \times Adj {(s_{u})}_{Λ}^{Σ} {\hat{P}}_{Σ} (W) \end{matrix}

(316)

The conclusion of the above formal calculation is that the partition function of generalized thermodynamics à la Souriau on Kähler non-compact symmetric spaces

U / H

has the following extremely important symmetry:

\begin{matrix} Z (\hat{β}) & = & Z ({Adj}^{T} (s) \cdot \hat{β}); \forall s \in S_{U / H} \\ {({Adj}^{T} (s) \cdot \hat{β})}^{Σ} & \equiv & {\hat{β}}^{Λ} Adj {(s)}_{Λ}^{Σ} \end{matrix}

(317)

The relevance of the above symmetry is that the co-adjoint action (transpose) of the solvable Lie group can always rotate a generic temperature vector

β

into a new one

β_{c}

that has non-vanishing components only along the compact subalgebra generators

H_{α}

, as we might explicitly illustrate for the Siegel case

{SH}_{2}

, but we skip the somewhat lengthy calculations. This is not the end of the story. There is still the symmetry with respect to the compact isotropy subgroup that we can utilize. Consider the partition function reduced to a compact temperature

β_{c}

, namely:

\begin{matrix} Z (β_{c}) & = & \int exp [- β_{c}^{α} P_{α} (W)] μ (W) \\ β_{c}^{α} P_{α} (W) & = & \sum_{α = 1}^{\dim H} β_{c}^{α} \times \frac{1}{2} Tr (H_{0} \cdot L^{- 1} (W) \cdot H_{α} \cdot L (W)) \end{matrix}

(318)

The transformation of the compact isotropy subgroup

H

are just isometries as all other transformations of

U

, hence they leave the integration measure

μ (W)

invariant and act on the solvable coset representative

L (W)

in the canonical way as follows:

\forall h \in H; h \cdot L (W) = L (h (W)) \cdot H (h, W); H (h, W) \in H

(319)

where

h (W)

are the new solvable coordinates after the

h

-isometry transformation and

H (h, W)

is the H-compensator, which, by definition, also lies in

H

and depends both on the point

W

and on the chosen

h

group element.

With the same strategy utilized above, we change the integration variable

W \to h (W)

, and we rewrite:

\begin{matrix} Z (β_{c}) & = & \int exp [- β_{c}^{α} P_{α} (h (W))] μ (h (W)) \\ = & \int exp [- β_{c}^{α} P_{α} (h (W))] μ (W) \end{matrix}

(320)

Next, using Equations (318) and (319), we obtain:

\begin{matrix} β_{c}^{α} P_{α} (h (W)) & = & \sum_{α = 1}^{\dim H} β_{c}^{α} \times \frac{1}{2} Tr (H_{0} \cdot H (h, W) \cdot L^{- 1} (W) \cdot h^{- 1} \cdot H_{α} \cdot h \cdot L (W) \cdot H^{- 1} (h, W)) \end{matrix}

(321)

\begin{matrix} = & β_{c}^{α} Adj {(h)}_{α}^{β} P_{β} (W) \end{matrix}

(322)

In order to establish the equality of the r.h.s in (321) with that in (322) we utilized three properties:

The cyclic invariance of the trace.
The crucial fact that $H_{0}$ is the center of the compact Lie algebra $H$ , which is the very reason why the considered manifold is Kählerian, so that $H_{0}$ is invariant against any adjoint transformation of $H$ group.
The adjoint H representation in the space of its Lie algebra $H$ :

$\forall h \in H : h^{- 1} \cdot H_{α} \cdot h = Adj {(h)}_{α}^{β} H_{β}$

(323)

The final crucial fact is that by means of a suitable

h

-transformation we can always bring any

H

Lie algebra element into the Cartan subalgebra

C \subset H

and then reduce the

β_{c}

to

β_{c}^{0}

that has non vanishing component only along the Cartan generators, one being

H_{0}

the remaining ones being the Cartan generators of

H^{'}

in the decomposition (30).

7.1. Canonical Form of the Partition Functions and of the Gibbs Probability Distributions, in General

In this way, the space of allowed temperatures turns out to be the

U

(co)adjoint orbit of a proper subset

Ω_{c} C \subset H

of the compact Cartan subalgebra

C

.

Ω_{c}

is provided by those compact Cartan temperatures that have the correct sign in order to guarantee convergence of the Gaussian integrals. As we advocate further on, for the non-maximally split manifolds, where the solvable Lie algebra admits the Paint-Group automorphism group, which is a proper subgroup

G^{Paint} \subset H \subset U

, the essential Cartan temperatures might be further reduced. We postpone this study to the next publication and confine ourselves to the preliminary observations of Appendix D relative to the case

M^{[2, 2]}

.

Learning the lesson that the temperature vector

β

can be reduced to its minimal Cartan form

β_{0}

by means of

U

-isometry transformations, the general form of the Gibbs probability distributions becomes very simple and clear. It always contains as many parameters as the dimension of the

U

Lie group but it can be written in the following very compact and useful way:

\begin{matrix} G (β_{0}; g ∣ W) & \equiv & \frac{exp [- \sum_{i = 0}^{ℓ = rank H^{'}} β_{0}^{i} P_{i} (g [W])]}{Z (β^{0})} \\ β_{0}^{i} & = & temperatures associated with the compact Cartan generators H_{0, 1, \dots, ℓ}^{c} of H \\ P_{i} (W) & = & moment maps of the compact Cartan generators H_{0, 1, \dots, ℓ}^{c} of H \\ g & = & any group element in U \\ g [W] & = & new solvable parameters after a g - isometry transformation \end{matrix}

(324)

In other words, the Cartan temperatures define the Gibbs probability distribution centered around the origin of the coset manifold, namely the identity of the metrically equivalent solvable group. The other temperatures simply would rotate such a distribution (H-transformations) or translate it to be centered around any other point of the manifold (solvable Lie group transformations). Hence instead of introducing such parameters we can simply evaluate the original Gibbs distribution in a transformed point. Both for analytic calculations and for practical applications in Machine Learning this view-point is extremely useful. Stated differently, when reducing the temperature vector to the compact Cartan subalgebra, the suppressed

β

-parameters are replaced by the parameters of a generic

U

-transformation g of the point, appearing in the argument of the exponential distribution.

7.2. Calculation of the Partition Function for the Siegel Plane in Canonical Form

Having clarified that we jus need two compact temperatures associated with two Cartan generators we choose as second Cartan generator the one listed as

H_{3} = T_{9}^{v}

in Table 2. This means that using Equation (308) for the explicit form of the moment maps, the argument of the exponential in the partition function integrand is the following:

\begin{matrix} A \equiv β_{0} \cdot P (W) & = & μ P_{9} (W) + λ P_{10} (W) \\ = & \frac{1}{64} e^{- w_{1} - w_{2}} (λ (- 4 e^{2 w_{2}} (w_{6}^{2} + 4) - 2 e^{2 w_{1}} ((w_{6}^{2} + 4) w_{3}^{2} + 2 \sqrt{2} w_{5} w_{6} w_{3} + 2 (w_{5}^{2} + 4)) \\ - e^{2 (w_{1} + w_{2})} (8 w_{4}^{2} - 4 \sqrt{2} w_{5} w_{6} w_{4} + (w_{5}^{2} + 4) (w_{6}^{2} + 4)) - 16) \\ + μ (- 4 e^{2 w_{2}} (w_{6}^{2} + 4) - 2 e^{2 w_{1}} ((w_{6}^{2} + 4) w_{3}^{2} + 2 \sqrt{2} w_{5} w_{6} w_{3} + 2 (w_{5}^{2} + 4)) \\ + e^{2 (w_{1} + w_{2})} (8 w_{4}^{2} - 4 \sqrt{2} w_{5} w_{6} w_{4} + (w_{5}^{2} + 4) (w_{6}^{2} + 4)) + 16)) \end{matrix}

(325)

where we have named

β_{9} = μ

and

β_{10} = λ

. For calculation convenience it is useful to redefine

w_{1, 2} = log [ρ_{1, 2}

. In this way we get:

\begin{matrix} A & = & \frac{N_{A}}{D_{A}} \\ N_{A} & = & ρ_{1}^{2} (- (ρ_{2}^{2} (8 w_{4}^{2} - 4 \sqrt{2} w_{5} w_{6} w_{4} + (w_{5}^{2} + 4) (w_{6}^{2} + 4)) (λ - μ) \\ + 2 ((w_{6}^{2} + 4) w_{3}^{2} + 2 \sqrt{2} w_{5} w_{6} w_{3} + 2 (w_{5}^{2} + 4)) (λ + μ))) - 4 (4 (λ - μ) + ρ_{2}^{2} (w_{6}^{2} + 4) (λ + μ)) \\ D_{A} & = & 64 ρ_{1} ρ_{2} \end{matrix}

(326)

We have to calculate the 6-integrals of

exp [A]

on the nilpotent coordinate

w_{3}, w_{4}, w_{5}, w_{6}

and finally on

ρ_{1}, ρ 2

. We begin with the

w_{3}

integral that is perfectly Gaussian and imposes the convergence condition:

λ + μ > 0

(327)

Next, we calculate also the integrand on

w_{4}

which is again Gaussian and imposes the second convergence condition:

λ - μ > 0

(328)

We get

\begin{matrix} \int_{- \infty}^{\infty} d w_{4} \int_{- \infty}^{\infty} d w_{3} exp [- A] & = & \frac{16 π e^{- B}}{ρ_{1} \sqrt{λ - μ} \sqrt{(w_{6}^{2} + 4) (λ + μ)}} \\ B & = & \frac{N_{B}}{64 ρ_{1} ρ_{2}} \\ N_{B} & = & 16 λ - 16 μ + ρ_{2}^{2} ρ_{1}^{2} w_{5}^{2} w_{6}^{2} (- (λ - μ)) + 4 ρ_{2}^{2} (w_{6}^{2} + 4) (λ + μ) \\ + \frac{ρ_{1}^{2} (ρ_{2}^{2} (w_{5}^{2} + 4) {(w_{6}^{2} + 4)}^{2} (λ - μ) + 16 (w_{5}^{2} + w_{6}^{2} + 4) (λ + μ))}{w_{6}^{2} + 4} \end{matrix}

(329)

The next integration on the nilpotent coordinate

w_{5}

is just Gaussian and it does not impose further contraints on the two temperatures

μ

and

λ

. We get the following:

\begin{matrix} \int_{- \infty}^{\infty} d w_{5} \frac{16 π e^{- B}}{ρ_{1} \sqrt{λ - μ} \sqrt{(w_{6}^{2} + 4) (λ + μ)}} & = & C \\ C & = & \frac{64 π^{3 / 2} exp (- \frac{4 (ρ_{1}^{2} (λ + μ) + λ - μ) + ρ_{2}^{2} (w_{6}^{2} + 4) (ρ_{1}^{2} (λ - μ) + λ + μ)}{16 ρ_{1} ρ_{2}})}{ρ_{1} \sqrt{(w_{6}^{2} + 4) (λ - μ) (λ + μ)} \sqrt{ρ_{2} ρ_{1} (λ - μ) + \frac{4 ρ_{1} (λ + μ)}{ρ_{2} (w_{6}^{2} + 4)}}} \end{matrix}

(330)

The fourth integration on the nilpotent variable

w_{6}

can also be performed and yields an analytical result:

\begin{matrix} F (ρ_{1}, ρ_{2}, λ, μ) \equiv \int_{- \infty}^{\infty} d w_{6} C \\ = & \frac{64 π^{3 / 2} \sqrt{λ - μ} exp (- \frac{λ^{2} + (λ - μ) (ρ_{1}^{2} (ρ_{2}^{2} (λ - μ) + λ + μ) + ρ_{2}^{2} (λ + μ)) - 6 λ μ + μ^{2}}{8 ρ_{1} ρ_{2} (λ - μ)}) K_{0} (\frac{((λ - μ) ρ_{1}^{2} + λ + μ) ((λ - μ) ρ_{2}^{2} + λ + μ)}{8 (λ - μ) ρ_{1} ρ_{2}})}{\sqrt{ρ_{2} (λ + μ)} {(\frac{ρ_{1} (λ - μ)}{ρ_{2}^{2} (λ - μ) + λ + μ})}^{3 / 2} {(ρ_{2}^{2} (λ - μ) + λ + μ)}^{3 / 2}} \end{matrix}

(331)

where

K_{0} (x)

is the Bessel function of type K and index 0.

Unfortunately the last two integrals on the remaining two variables

ρ_{1, 2}

or better on their logarithms

w_{1, 2}

cannot be done analytically and one has to perform them numerically introducing in this way compiled functions. By plotting the integrand we can however very easily verify that it always dacays exponentially to zero in all directions so that the integral is always convergent (see Figure 5).

For this reason, one can define the partition function as a compiled function by performing numerically on a computer the last two integrals:

Z (λ, μ) = \int_{- \infty}^{\infty} d w_{1} \int_{- \infty}^{\infty} d w_{2} F (exp [w_{1}], exp [w_{2}], λ, μ)

(332)

In Figure 6, we display a plot of the partition function and of minus its logarithm, namely of the stochastic Hamiltonian.

8. Conclusions

Machine Learning and so-named Artificial Intelligence algorithms rely on two mathematical pillars: Differential Geometry on one side and Probability Theory on the other. The first is needed to model the spaces where data are to be encoded, the second to classify and elaborate the data by assigning a probability to their actual location in such spaces and to their correlations. Hence, the two mathematical pillars have to be reconciled with one another and solidly entangled. A statistical viewpoint on whatever set of objects always leads to thermodynamics, in a generalized sense, and, already for some decades, an abstract geometrical formulation of thermodynamics has been developed that starts from Shannon’s information entropy and leads to the identification of thermodynamical equilibrium states with Lagrangian submanifolds of symplectic manifolds, also endowing them with a canonical Riemannian structure. In the different data science literature, the same Riemannian structure has been developed under the name of Fisher’s information geometry. In such a variegated conceptual landscape and in strong correlation with the newly introduced paradigm of Cartan neural networks, where all the hidden layers of neural networks are modeled as non-compact symmetric spaces

U / H

, that are all Cartan Hadamard manifolds and, consequently, are equipped with a canonical distance function, an important issue for Machine Learning is that of Gaussian-like probability distributions on the encoding spaces. Starting from the hints provided by the work of a group of French authors [30,31,32,33,34], who suggested the use of Gibbs states related to the Lie Group Thermodynamics proposed long ago by Souriau, in the present paper, we took on ourselves the following task:

Clarify the relation of Geometrical Thermodynamics à la Ruppeiner [22,24,25,26,27,28] and Lychagin [29] with Souriau’s proposals [30,34] and with Information Geometry [43].
Distinguish between Souriau non-abelian thermodynamics and the geometrical thermodynamics associated with Integrable Dynamical Systems, in particular the Geodesic Dynamical System associated with the calculation of geodesics on the same $U / H$ symmetric spaces that enter the Machine Learning game as hidden layer models.
Investigate the basic principle of Souriau’s thermodynamics, that is, the characterization of the locus $Ω \subset U$ in the relevant Lie algebra, whose elements are possible generalized temperatures in the sense that for them the partition function integral is convergent.
Clarify the role of the coadjoint orbit conception, Souriau’s favorite one, that turns out to be equivalent to the more practical and algorithmic conception based on coset manifolds.

We think that we have attained all the goals we aimed at. Indeed, our results can be summarized as follows.

(1)

We have established the identity of Fisher’s Information metric, given as the Hessian of a certain matrix with the metric obtained as the Hessian of the stochastic Hamiltonian

H^{s t o} (λ)

, derived in Lychagin’s approach as the canonical Riemannian metric on Lagrangian submanifolds of a symplectic manifold where, by definition, the symplectic 2-form

ω

vanishes identically. Such Lagrangian submanifolds are the thermodynamical equilibrium states, and the 1st and 2nd Principle of Thermodynamics are incorporated in their very definition. These notions are fully general and equally apply to any generalized thermodynamics, non-abelian algebra, as it happens in the thermodynamics à la Souriau.

(2)

With respect to the Poissonian structures on the dual

S o l v^{★}

of solvable Lie algebras

S o l v

utilized also by two of us (P.F. and A.S.) in their 2009 paper [18] on the integrability of the geodesic equations on non-compact symmetric spaces

U / H

and investigated by Arkhangelsky in [57], where he derived their Hamiltonians in involution, we show here that such a Poissonian structure is only half of the full story, since it is defined only on the momentum subspace of phase space. Introducing also the coordinates, which is what one should always do, there is a complete symplectic manifold with a symplectic 2-form of maximal rank, and what one describes is just the geodesic dynamical system in Hamiltonian formalism. Arkhangelsky Hamiltonians depend only on the momenta, but they are Hamiltonians in involution also with respect to the complete symplectic structure. The geometric thermodynamics associated with such integrable dynamical systems can be constructed, but it is essentially uninteresting for three reasons:

(a): The dependence of the partition function on volume is factorized, and the equation of state resembles the trivial one of Ideal Gases.
(b): The degrees of freedom are few, and a statistical description seems inappropriate.
(c): Last but not least, the Gibbs probability distributions have a non-trivial structure only in momentum space, namely along the fibres of the tangent bundle $T U / H$ , not on the very base manifold $U / H$ . All that is of little appeal for Machine Learning applications, where one looks for probability distributions (Gibbs states) on $U / H$ .

(3)

The searched for Gaussian-like probability distributions on

U / H

are instead provided by the construction of Gibbs states à la Souriau. This requires a symplectic structure on the very manifold

U / H

and not on its tangent bundle. After demonstrating that a coadjoint

U

-orbit of some element

b \in U

is always diffeomorphic and algebraically equivalent to a coset manifold

U / H

, where

H \subset U

is the stabilizer of the element

b

in

U

, we abandon the coadjoint orbit conception, and we focus on non-compact symmetric spaces

U / H

. In order to have the symplectic structure and construct Souriau thermodynamics,

H

must be the stabilizer of some Lie algebra element, and this, as we show, implies that

H

has a

u (1)

which endows the symmetric space with a Kähler structure, and the Kähler 2-form

K

is the required symplectic 2-form. In this way, we come to the conclusion that the relevant non-compact symmetric spaces are the Kähler ones corresponding only to two infinite series, the Siegel half-planes

{SH}_{n}

and the Calabi–Vesentini manifolds

M^{[2, q]}

mentioned in Equation (201). The first series is composed of maximally split manifolds of increasing non-compact rank, while the second constitutes a Tits Satake universality class having the Siegel

{SH}_{2}

manifold as universal Tits Satake submanifold. In application to Machine Learning, if one wants to take advantage of the Paint Group symmetry (see [1]) and its potentiality in data clustering, the Calabi–Vesentini choice is preferred.

(4)

The central point of Souriau’s generalized thermodynamics, namely the determination of the subspace

Ω \subset U

of allowed temperature vectors was also solved by us in a simple and elegant way.

Ω

is just the the adjoint $U$ orbit of a positivity chamber in the Cartan subalgebra $C \subset H$ of the compact subalgebra $H \subset U$ . One fixes the sign of the ℓ independent temperatures associated with ℓ generators of

C

, and by an adjoint transformation of

U

, generates all the other possible temperature vectors that respect convergence of the partition function integral. This property, apart from solving the convergence issue, is also of practical relevance. Indeed, the true temperatures are just the compact Cartan ones; all the others are an effect of translation of the central point of a Gibbs probability distribution to any other in the manifold by means of an isometry. Hence, we can always utilize the same partition function depending on a very small number of temperatures and change the point in the Gibbs distribution from a given one to its image under any element of the isometry group

U

.

(5)

For the case of the Poincaré plane, we have explicitly constructed the partition function depending on three temperatures and even studied the 3-dimensional thermodynamical Riemannian metric, showing that it is non-trivial and not that of a space. For the case of the Siegel half-plane

{SH}_{2}

, we have reduced the partition function to an integral in two variables, whose integrand is a combination of exponentials, roots, and Bessel functions. The integral is convergent, and one can construct a compiled function of which we have shown a plot. The only problem is the computing velocity of the utilized computer.

What to Do Next

As we have shown in Appendix D, the results obtained here for the Tits Satake submanifold

{SH}_{2}

are liable to be extended to the entire universality class of Calabi–Vesentini manifolds by careful use of Paint Group invariance. This is certainly the most urgent task in the program. Succeeding in that, we will have Gibbs probability distributions for each of the candidate hidden layers of a Cartan Neural Network based on Calabi–Vesentini manifolds. As Barbaresco et al. have shown [30,31,32,33,34], geometric thermodynamics à la Souriau can be utilized to study time/sequences, in particular those provided by radar data. This is just the tip of the iceberg. On one side, there are many other possible sequential data; on the other side, the use of Gibbs probability distributions on the hidden layers, or more generally on the manifold where various types of data can be mapped, introduces a powerful new weapon for designing algorithm architectures. Furthermore, recalling the results of [1] so far not yet used, one activity direction might be that of restricting the Gibbs probability distributions to infinite discrete subsets of the Calabi–Vesentini manifold, corresponding to orbits of the origin under the action of any of the large class of discrete subgroups of

SO (2, 2 + q)

that were classified and constructed in the foundational paper [1].

Final Comment

When the present article was ready for posting on arXiv and for submission to a Journal we learned about the beautiful and inspiring paper by Laurent Bonnasse-Gahot and Jean Pierre Nadal [62] focused on geometry of the internal representation, which mathematically means Fisher’s information geometry, namely, what we have here shown to be identical with the thermodynamical geometry of Lychagin, Roop, and Ruppeiner, in whose general scheme we could also fit the Gibbs states à la Souriau on the Kähler non-compact symmetric spaces

U / H

. It follows that an additional challenging direction of further investigation is the possible application of [62] general methods to data mapped to Kähler non-compact symmetric spaces and to the Gibbs states defined over them.

Author Contributions

Conceptualization, P.G.F., A.S.S. and M.T.; Methodology, P.G.F., A.S.S. and M.T.; Formal analysis, P.G.F., A.S.S. and M.T.; Investigation, P.G.F., A.S.S. and M.T.; Writing—review & editing, P.G.F., A.S.S. and M.T. All authors have read and agreed to the published version of the manuscript.

Funding

The work of A. Sorin is supported in part by the Center for Integration in Science of the Israel Ministry of Aliyah and Integration.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

P.G. Fré acknowledges support by the Company Additati & Partners Consulting s.r.l. during the development of the present study.

Conflicts of Interest

Author Pietro Fré was employed by the company Additati & Partners Consulting s.r.l. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A. Basic Structures of Contact and Symplectic Geometry

In this appendix, we recall the basic structures of symplectic and contact geometry that are the two tightly connected arenas both for the geometric reformulation of thermodynamics and for dynamical systems, i.e., the underlying mathematical basis of all phenomena studied in stochastic processes, data science, and (geometric) deep learning. Indeed, the prototype of a symplectic manifold is the phase space of a dynamical system.

Appendix A.1. Contact Geometry

Contact Geometry is at the same time a new and ancient chapter in Mathematics, since it originates in classical results dating back to Goursat, Darboux, Lie, and other Masters of the 19th century, but has been vigorously developed in the last two to three decades by a relatively small community of mathematicians. Regarding such structures, there is an extensive literature provided by the following bibliographical references and the additional articles they cite [63,64,65,66,67,68,69,70,71,72]. To summarize, it can be said that Contact Geometry is a mathematical theory that aims at establishing an intrinsic geometric-topological characterization of non-integrability and, in a sense, is formulated in an inverted perspective from that usual in cohomological theories and integrable systems theory.

Not surprisingly, Contact Geometry comes into play whenever one intends to study chaos and disorder rather than order, as in integrable systems. Disorder and lack of information are characteristics of turbulent regimes in Fluid Dynamics and are essential attributes of statistical thermodynamic systems; it is therefore quite natural that Contact Geometry has relations to both Fluid Dynamics and Thermodynamics.

Contact Geometry deals exclusively with Differentiable Manifolds of odd dimension

M_{2 n + 1}

and, on the other hand, it has a symbiotic relationship with Simplectic Manifolds

S_{2 n + 2}

and

S_{2 n}

in the two adjacent even dimensions, the upper and the lower one.

The fundamental notions of Contact Geometry are easily stated and require, to be assimilated, nothing more than the most basic and elementary concepts of Differential Geometry. In the concise summary we present in this section, we closely follow the excellent review article [66].

Appendix A.2. Contact Structures

Let us consider an odd-dimensional differentiable manifolds

M_{2 n + 1}

and its tangent bundle:

{TM}_{2 n + 1} \overset{π}{⟶} M_{2 n + 1}; \forall p \in M_{2 n + 1}, π^{- 1} (p) \sim R^{2 n + 1}

(A1)

By definition, the transition function between any pair of local trivializations of the tangent vector bundle is provided by the inverse Jacobian of the transition function

ψ_{i j} (x)

between the two corresponding open charts

(U_{i}, φ_{i})

and

(U_{j}, φ_{j})

in any atlas that covers the entire manifold

M_{2 n + 1}

(see, for instance, [9]).

The space of the sections of the tangent bundle

Γ [{TM}_{2 n + 1}, M_{2 n + 1}]

is made up of all the vector fields, whose local description is in terms of first-order differential operators:

\begin{matrix} X \in Γ [{TM}_{2 n + 1}] ⇓ \\ X = X^{μ} (x) \frac{\partial}{\partial x^{μ}} : In any open chart U with coordinates x^{1} \dots x^{2 n + 1} \end{matrix}

(A2)

The cotangent bundle

T^{★} M_{2 n + 1} \overset{π_{★}}{⟶} M_{2 n + 1}; \forall p \in M_{2 n + 1}, π_{★}^{- 1} (p) \sim {(R^{2 n + 1})}^{★}

(A3)

is the dual of the tangent one, in the sense that its fibre over any point

p \in M_{2 n + 1}

of the basis space is the dual of the fibre vector space over the same point as the tangent fibre, that is, the space of linear functionals over the latter:

\forall p \in M_{2 n + 1} : π_{★}^{- 1} (p) = space of linear functionals on π^{- 1} (p)

(A4)

By construction, the transition function between two local trivializations of the cotangent bundle is provided by the direct Jacobian of the two corresponding open charts

(U_{i}, φ_{i})

and

(U_{j}, φ_{j})

in any atlas that covers the entire base manifold

M_{2 n + 1}

.

The space of sections of the cotangent bundle

Γ [T^{★} M_{2 n + 1}, M_{2 n + 1}]

is made by differential 1-forms whose local description is recalled below:

\begin{matrix} ω \in Γ [T^{★} M_{2 n + 1}, M_{2 n + 1}] ⇓ \\ ω = ω_{μ} (x) d x^{μ} : In any open chart U with coordinates x^{1} \dots x^{2 n + 1} \end{matrix}

(A5)

In terms of these fundamental concepts, which apply to differentiable manifolds of any dimension, whether even or odd, the concept of hyperplane bundles can be introduced. A hyperplane bundle is a reduction of the tangent bundle where the fibres above each point constitute a codimension one subspace of the tangent space above the same point, the transition functions being derived accordingly:

\begin{matrix} HY \overset{P}{⟶} M & ; & \forall p \in M, P^{- 1} (p) \subset π^{- 1} (p) where T M \overset{π}{⟶} M \\ \dim_{R} M = m & ; & \dim_{R} π^{- 1} (p) = m; \dim_{R} P^{- 1} (p) = m - 1 \end{matrix}

(A6)

A simple way to construct a hyperplane bundle is by means of a section of the cotangent bundle, namely by means of a 1-form

ω \in Γ [T^{★} M, M]

. The desired hyperplane sub-bundle

{HY}^{ω} \subset TM

of the tangent bundle is implicitly defined by specifying the space of all of its sections

Γ [{HY}^{ω}, M]

, namely by specifying all vector fields that are sections of

{HY}^{ω}

. In a precise mathematical language, let

X \in Γ [TM, M]

be a vector field, then we write:

X \in Γ [{HY}^{ω}, M] iff X \in \ker ω i . e ., ω (X) \equiv 0 (everywhere)

(A7)

Definition A1.

Given an odd-dimensional manifold

M_{2 n + 1}

, a contact structure on

M_{2 n + 1}

is a rank

2 n

sub-bundle

ξ \overset{P}{⟶} M_{2 n + 1}

of tangent bundle

{TM}_{2 n + 1} \overset{π}{⟶} M_{2 n + 1}

that can be identified with the hyperplane bundle

{HY}^{α}

where the 1-form α satisfies the following condition:

α \land \underset{n - volte}{\underset{︸}{d α \land \dots \land d α}} \neq 0 (everywhere on M_{2 n + 1})

(A8)

The 1-form α is named the contact form of the contact structure.

Definition A2.

A contact manifold is a pair

(M_{2 n + 1}, ξ)

made by an odd-dimensional manifold and a contact structure

ξ \overset{P}{⟶} M_{2 n + 1}

.

Some observations are obligatory in connection with the two definitions above. The first is that the same contact structure can be defined by different contact forms

α

,

α^{'}

,…. In fact, all multiples of a given contact form by means of a scalar function that does not vanish at any point

λ : M_{2 n + 1} \to R

define the same contact structure. Secondly, it is perfectly possible that the same odd-dimensional manifold

M_{2 n + 1}

admits more than one contact structure. The classification of such contact structures modulo diffeomorphism connected to the identity map is an interesting and relevant mathematical problem for odd-dimensional manifolds, just as it is interesting and relevant, the classification of complex structures that exist on an even-dimensional manifold. It is therefore obligatory to single out the concept of contactomorphism.

Definition A3.

Let

(M, ξ)

and

(N, χ)

be two contact manifolds and let

φ : M ⟶ N

(A9)

be a diffeomorphism of the former onto the latter manifold (obviously,

M

and

N

must have the same dimensions in order for φ to possibly exist). Let α be a contact form that defines ξ and let β be a contact form that defines χ. The considered diffeomorphism φ is named a contactomorphism if and only if:

φ^{★} (β) = λ α

(A10)

where

φ^{★}

is the pull-back map and

λ : M ⟶ R

(A11)

is a nowhere vanishing function on

M

. If a contactomorphism between them does exist, the two considered manifolds are named contactomorphic.

In the Definition A3, the two manifolds

M

and

N

might be the same manifold. In this case, what we are considering is the transformation of a contact structure into another one by means of a contactomorphism.

Definition A4.

Given two contact structures ξ and χ on the same manifold

M_{2 n + 1}

they must be identified as the same contact structure if a contactomorphism does exist that maps one into the other.

In conclusion the relevant mathematical problem is that of classifying contact structures on a manifold

M_{2 n + 1}

modulo contactomorphisms.

Appendix A.3. Integrability and Frobenius Theorem

We do not go into the details of Frobenius theorem (see for example esempio [73] e [66]) but we merely summarize the concepts underlying its formulation. We begin by noting that any vector field

X

on a differentiable manifold

M

of any dimension defines its own integral curves

I_{X}

, i.e., those curves that at each of their points admit the local value of the vector field

X

as a tangent vector. Since any

p \in M

lies on some integral curve

I_{X}

, we are guaranteed that a single vector field induces a foliation of the manifold

M

into one-dimensional submanifolds.

On the other hand, it is a more complicated matter to determine whether or not a rank

r > 1

sub-bundle

E ⟶ M

of the tangent bundle does or does not induce a foliation of

M

. To this effect, utilizing a not completely rigorous, yet intuitive and qualitatively correct way of speaking, by means of the wording foliation one means the covering of the manifold with a family of leaves, i.e., submanifolds all diffeomorphic to each other,

F_{ν} \subset M

having dimension equal to the rank r of the sub-bundle

E

, each of which can be regarded as the level hypersurface of r functions

u_{i} (p)

(

i = 1, \dots, r

) that originate from the integration of a basis of sections

X_{i}

of the sub-bundle

E ⟶ M

:

\begin{matrix} F_{ν} & = & \{p \in M ∣ u_{i} (p) = ν_{i}\}; ν \equiv {ν_{1}, \dots ν_{r}}; ν_{i} = real constants \\ \nabla u_{i} (p) & = & X_{i} ∣_{p} \end{matrix}

(A12)

When this situation is realized one says that the sub-bundle

E ⟶ M

is integrable. Frobenius Theorem establishes the necessary conditions for such an integrability.

Theorem A1.

Let

M

be a differentiable manifold and let

E ⟶ M

be a rank

r > 1

sub-bundle of the tangent bundle

TM

. The necessary and sufficient condition in order for

E

to be integrable is the following one:

\forall X, Y \in Γ [E, M] : [X, Y] \in Γ [E, M]

(A13)

In the case where

E ⟶ M

is a hyperplane bundle defined by a 1-form

ω

, Frobenius integrability condition can be formulated as follows

ω \land d ω = 0

(A14)

In order to appreciate the equivalence of the condition (A14) with condition (A13) it suffices to recall Cartan’s formula for the value of

d ω

on two arbitrary vector fields

X, Y

:

d ω (X, Y) = \frac{1}{2} (X ω (Y) - Y ω (X) - ω ([X, Y]))

(A15)

Let

Z \notin Γ [E, M]

and

X, Y \in Γ [E, M]

; next evaluate the 3-forma

ω \land d ω

on the triplet

Z, X, Y

of vector fields. We find:

ω \land d ω (Z, X, Y) \propto \underset{\neq 0}{\underset{︸}{ω (Z)}} ω ([X, Y])

(A16)

Hence Equation (A14) implies

ω ([X, Y]) = 0

that on its turn means

[X, Y] \in Γ [E, M]

.

In this way we see that a contact structure defined by a contact 1-form is the exact opposite of an integrable sub-bundle. Indeed, one can show that it corresponds to maximum nonintegrability.

In Section 4.2, considering the integrable dynamical systems constructed on normed solvable Lie algebras and solvable groups, we get deeper into the meaning of Frobenius theorem. For integrable systems the foliation of the differentiable manifold is provided by the level set surfaces of a maximal set of Hamiltonian functions in involution whose corresponding vector fields span the integrable sub-bundle of the tangent bundle.

Appendix A.4. Isotropic Submanifolds of a Contact Manifold and Non Integrability

Let us introduce the following:

Definition A5.

Let

(M_{2 n + 1}, ξ)

be a contact manifold and

L \subset M_{2 n + 1}

be one of its submanifolds. Let us consider the tangent bundle of such a submanifold

TL \overset{π_{τ}}{⟶} L

and the contact structure

ξ \overset{π_{ξ}}{⟶} M

. The submanifold

L

is named isotropic if and only if

\forall p \in L : π_{τ}^{- 1} (p) \subset π_{ξ}^{- 1} (p)

(A17)

Equivalently, if the contact structure ξ is defined by contact 1-form α, the submanifold

L

is isotropic if any vector field

X

that is tangent to

L

, is contained in

\ker α

:

X \in Γ [TL, L] \Rightarrow α (X) = 0

(A18)

Let us introduce the further:

Definition A6.

Let

(M_{2 n + 1}, ξ)

be a contact manifold and

{\tilde{M}}_{2 m + 1} \subset M_{2 n + 1}

be an odd-dimensional submanifold with codimension

2 (n - m) \geq 0

. Let α be the contact 1-form that defines the contact structure ξ and let ι be the inclusion map:

ι : {\tilde{M}}_{2 m + 1} ⟶ M_{2 n + 1}

(A19)

In this case

({\tilde{M}}_{2 m + 1}, χ)

is named a contact sbmanifold of

(M_{2 n + 1}, ξ)

if the contact structure χ on

{\tilde{M}}_{2 m + 1}

is defined by the contact form

ι^{★} α

, namely if

χ = \ker ι^{★} α

(A20)

A result of the highest relevance both for applications in Fluid Dynamics and for the Geometrization of Thermodynamics is the following theorem due to Arnol’d:

Theorem A2.

Let

(M_{2 n + 1}, ξ)

be a contact manifold in

2 n + 1

-dimensions and

L \subset M_{2 n + 1}

an isotropic submanifold. Then

\dim L \leq n

.

In order to prove the Theorem A2 the following Lemma is needed

Lemma A1.

Let

(M_{2 n + 1}, ξ)

be a contact manifold whose contact sctruture ξ is defined as

\ker α

, in terms of a contact 1-form α. As a consequence of the condition (A8) included in the definition, it follows that

d α ∣_{ξ} \neq 0

and for each point

p \in M_{2 n + 1}

the

2 n

-dimensional fibre

ξ_{p} \subset T_{p} M_{2 n + 1}

is a vector space endowed with an antisymmetric 2-form of maximal rank massimale (namely without vanishing eigenvalues) which is exactly provided by the restriction to

ξ_{p}

of

d α

i.e.,

d α ∣_{ξ_{p}}

. Hence the contact structure is a symplectic bundle with respect to the 2-form

d α ∣_{ξ}

.

The proof of the lemma is almost evident from its own formulation.

Proof.

In order to prove the theorem we consider the inclusion map:

ι : L ⟶ M_{2 n + 1}

(A21)

and also the pull-back of the contact 1-form on the isotropic submanifold By definition

ι^{★} α = 0

. Hence we have

ι^{★} d α = 0

. In any point

p \in L

, the tangent space

T_{p} L

is a subspace of the symplectic space

ξ_{p}

on which the symplectic 2-form vanishes

d α ∣_{ξ_{p}}

. Using elementary linear algebra we conclude that such a subspace has maximal dimension one-half of the dimension of

ξ_{p}

. Indeed it suffices to put, by means of change of basis the antisymmetric 2-form in canonical form:

(\begin{matrix} 0_{n \times n} & 1_{n \times n} \\ - 1_{n \times n} & 0_{n \times n} \end{matrix})

and the statement becomes evident. This proves the theorem. □

What are the consequences of this theorem? It states that if we have a contact structure

ξ

, induced by a contact 1-form

α

, then we can exclude a foliation of the contact manifold into hypersufaces

Σ_{h} \subset M_{2 n + 1}

codimension one:

M_{2 n + 1} ⋍ Σ_{h} \times R_{h}

(A22)

such that for each

h \in R

the tangent bundle of

Σ_{h}

be contained in the contact structure. Indeed if this were true se any leave

Σ_{h}

of the foliation would be an isotropica submanifold of dimension

2 \times n

which is exactly what is ruled out by the theorem.

Definition A7.

An isotropic submanifold

L \subset M_{2 n + 1}

of a

(2 n + 1)

-dimensional contact manifold that has the maximal allowed dimension n is named a Legendrian submanifold.

An example relevant for Fluid Dynamics

In Fluid Dynamics the relevant manifold

M_{3}

, is locally isomorphic to our three dimensional Euclidean space

E^{3} ≅ R^{3}

since fluids of interest flow in a portion

M_{3} \subset R^{3}

of such a space that can be compact (the fluid is confined in some container) or partially non compact (river, lake, ocean, atmosphere). In any case we have

n = 1

. Hence if the flow, which is a vector field

U \in Γ [{TM}_{3}, M_{3}]

induces a contact structure on

M_{3}

(in the next subsection we will see that this happens when

U

is proportional to Reeb vector of a contact form

α

), then the unique Legendrian submanifolds are one-dimensional so that the ambient space does not admit a foliation into 2-dimensional submanifolds that contain the flows or that are transverse to them. This is the geometrical foundation of turbulence: the flows are chaotic.

An example relevant for Thermodynamics

As we are going to see later on the relevant contact manifold for thermodynamics is not the Euclidean physical space rather the space of thermodynamical variables (both extensive and intensive) that are in number of:

2 n + 1 \equiv 2 m + 3

(

n = m + 1

). The simplest minimal case corresponds to

m = 1

and the 5 thermodynamic variables are

\{U, S, V, T, P\}

, namely internal energy, entropy, volume, temperature and pressure. As we will explain the thermodynamical space

M_{2 m + 3}

è is always endowed with a canonical contact 1-form that summarizes the principles of classical thermodynamics and available equilibrium thermodynamical states correspond to all possible Legendrian submanifolds

L_{E} \subset M_{2 m + 3}

that have, as a consequence of Theorem A2 and of Definition A7 the following dimension:

\dim L_{E} = m + 1

(A23)

In the simplest case

m + 1 = 2

so that the Legendrian subvarieties are portions of an

R^{2}

-plane. It is in such planes that one studies the phase diagrams of chemicals. In the case of multiphase, multicomponent mixtures we have

m > 1

and phase diagrams develops in spaces of higher dimensions.

Big Data spaces

It is to be investigated whether certain Big Data spaces might be endowed with a hidden contact structure responsible for chaotic motions and phase transitions.

Appendix A.5. The Reeb Vector

Let us now come to the definition of the Reeb vector which shifts the definition of a contact structure from the cotangent to the tangent bundle of the considered

M_{2 n + 1}

manifold.

Definition A8.

Associated wih a contact form α we always have the so named Reeb vector field

R_{α}

, defined by two conditions:

\begin{matrix} α (R_{α}) = λ (x) = nowhere vanishing function on M_{2 n + 1} \\ \forall X \in Γ [{TM}_{2 n + 1}, M_{2 n + 1}] : d α (R_{α}, X) = 0 \end{matrix}

(A24)

If the contact manifold

M_{2 n + 1}

is endowed with a Riemanniana metric g (as it is the case of the Euclidean space

R^{2 n + 1}

), then the contact form

α

and its Reeb vector field

R_{α}

are related to each other by the operation of raising and lowering of world indices. Suppose we start from the Reeb vector field:

R = R^{μ} \frac{\partial}{\partial x^{μ}}

(A25)

The corresponding

α

is retrieved by setting:

α = Ω^{[R]} \equiv g_{μ ν} R^{μ} d x^{ν}

(A26)

and the condition (A8) that such a 1-form should be a contact form becomes the following equation on the Reeb field components:

ϵ^{λ μ_{1} ν_{1} μ_{2} ν_{2} \dots μ_{n} ν_{m}} R_{λ} \partial_{μ_{1}} R_{ν_{1}} \partial_{μ_{2}} R_{ν_{2}} \dots \partial_{μ_{n}} R_{ν_{n}} \neq 0 nowhere vanishes

(A27)

In the opposite direction, if one start from the contact form

α

, le componens of the Reeb vector field are obtained by setting:

R_{α}^{μ} = g^{μ ν} α_{ν}

(A28)

One should note that the nowhere vanishing function

λ

mentioned in the Definition A8 is simply the squared norm of the Reeb vector field or of the contact form that make sense only in a Riemannian space and there they coincide:

λ = {‖ R ‖}^{2} = {‖ Ω^{[R]} ‖}^{2} \equiv g_{μ ν} R^{μ} R^{ν}

(A29)

Contact structures for

n = 1

and hydrodynamical flows

Let us now consider the case relevant to the hydrodynamics of three-dimensional contact manifolds

(M_{3} ξ_{α})

, where, by the notation

ξ_{α}

, we refer to the contact form

α

that defines the contact structure

ξ

. The consequence of Theorem A2, as we have already pointed out, is that in these contact varieties, the Legendrian subvarieties are all one-dimensional, that is, they are curves or, as it is usual to refer to them in this context knots.

Therefore in three dimensions, there are two types of knots the Legendrian knots, whose tangent vector belongs to

\ker α

and the transverse knots whose tangent vector is parallel to the Reeb field vector at every point in their trajectory.

Furthermore in D = 3 condition (A27) becomes:

ϵ^{λ μ ν} R_{λ} \partial_{μ} R_{ν} \neq 0

(A30)

The standard contact structure on

R^{3}

Three-dimensional flat space, whose coordinates we denote

x, y, z

is equipped with a standard contact structure that admits the following contact 1-form

α_{s} = d z + x d y

(A31)

A picture of the local planes defining the contact structure (A31) is shown in Figure A1.

In hydrodynamics, a vector field

U

that we identify with the velocity field of a fiuid is potentially interesting for the study of chaotic regimes if it is the Reeb field of a contact 1-form

α

. If our work arena is a Riemanianna variety endowed with a metric g cioè

(M_{3}, g)

we can always invert the procedure la procedura and define the contact form

α

by writing:

α = Ω^{[U]} \equiv g_{i j} U^{i} (x, t) d x^{j}

(A32)

where

U^{i} (x, t)

are the three components of the velocity field at time t and the Latin indices

i, j = 1, 2, 3

identify the three coordinates

x^{i} = {x, y, z}

. In this way the first of the two conditions (A24) is automatically satisfied:

i_{U} Ω^{[U]} = {‖ U ‖}^{2} > 0

. It remains to be seen if

Ω^{[U]}

is a true contact form, namely if

Ω^{[U]} \land d Ω^{[U]} \neq 0

. It is at this level that one finds the relation with Beltrami equations which makes sense only in dimension

D = 3

and hence for

n = 1

. In the thermodynamic case

n \geq 2

: hence we stop the discussion here and we refer the interested reader to [21].

Figure A1. Schematic vision of the standard contact structure in

R^{3}

.

Figure A1. Schematic vision of the standard contact structure in

R^{3}

.

Appendix A.6. Darboux Theorem and the Case of Thermodynamics

A classical theorem due to Darboux, the proof of which we omit by referring the reader to [66] where it is presented, highlights the important fact that the standard contact structure of

R^{3}

mostrata nella Equation (A31) and illustrated graphically in Figure A1 is not an arbitrary choice, bensl corresponds to the canonical local form of any contact structure on any contact manifold.

Theorem A3.

Let

(M_{2 n + 1}, ξ)

be

(2 n + 1)

-dimensional contact manifold and let α be a contact 1-forma that defines

ξ = \ker α

. Let

p \in M_{2 n + 1}

be any point of the manifold and

U \subset M_{2 n + 1}

an open neighborhood of p. Then we can always find a local homomorphism:

φ : U \to R^{2 n + 1}

such that, naming

{x_{0}, x_{i}, y_{i}}

,

(i = 1, \dots, n)

the coordinates of

φ (U) \subset R^{2 n + 1}

we get:

α ∣_{U} = d x_{0} + \sum_{i = 1}^{n} y^{i} d x_{i}

(A33)

In the case

n = 1

Equation (A33) reproduces Equation (A31).

Darboux’s canonical form is the most inspiring one to suggest a connection between contact geometry and thermodynamics. As we shall see in more detail later, in classical thermodynamics it is natural to distinguish two types of variables, the extensive ones such as

U, S, V, N

(internal energy, entropy, volume, number of particles or of moles) and the intensive ones such as

T, P, μ

(temperature, pressure, potenziale chemical potential). We can identify the coordinates

x_{i} = {S, V, N, \dots}

with the extensive variables and the coordinates

y_{i} = {T, P, μ \dots}

with the intensive ones, further idenfying the priviledged extensive variable

x_{0} = U

with internal energy. For a reason that will become clear later, after the discussion of probability measures, it is convenient to generally rename the intensive variables as

λ_{i}

since in the general view of information/probability theory they play the role of Lagrange multipliers. Thus the contact differential 1-form

α_{t e r m o} \equiv d x_{0} + \sum_{μ = 1}^{3} λ^{i} d x_{i}

(A34)

can be used to formulate the first and second Principles of Thermodynamics stating that

α_{t e r m o}

should vanish. Such vanishing must be understood in a correct mathematical way: it vanishes on Legendrian submanifolds

L_{E} \subset M_{2 m + 3}

which represent equilibrium thermodynamical states whose embedding functions in the ambient space

M_{2 m + 3}

are the physical laws defining such states.

Appendix A.7. Symplectic and Poisson Manifolds

In the further study of the immersion of Legendrian submanifolds in the thermodynamic contact manifold

M_{2 m + 3}

an important role is played by the symbiotic relationship of all contact manifolds

M_{2 n + 1}

with the adiacent symplectic manifolds that are also Poissonian.

Hence let us begin with the following:

Definition A9.

A symplectic manifold is a pair

({SM}_{2 n + 2}, ω)

of a smooth manifold

{SM}_{2 n + 2}

in dimension

2 n + 2

and one differential 2-form ω that is closed, non degenerate (namely admitting no vanishing eigenvalue) and of maximal rank:

d ω = 0; ω \land ω \land \dots \land ω \neq 0 everywhere on {SM}_{2 n + 2}

(A35)

On a symplectic manifold we have a naturally defined antisymmetric 2-form on the space of sections of the tangent bundle, i.e., on vector fields:

\begin{matrix} ω & : & Γ [{TSM}_{2 n + 2}, {SM}_{2 n + 2}] \times Γ [{TSM}_{2 n + 2}, {SM}_{2 n + 2}] ⟶ C^{(\infty)} ({SM}_{2 n + 2}) \\ \forall X, Y & \in & Γ [{TSM}_{2 n + 2}, {SM}_{2 n + 2}], ω (X, Y) \in C^{(\infty)} ({SM}_{2 n + 2}) \end{matrix}

(A36)

Poisson manifolds are instead defined as it follows:

Definition A10.

A Poisson manifold

({PM}_{ℓ}, {,})

is a pair of a smooth manifold

{PM}_{ℓ}

of dimension ℓ and a Poisson bracket

{,}

which is a binary operation on the space of smooth functions defined over the manifold:

{,} : C^{(\infty)} ({PM}_{ℓ}) \times C^{(\infty)} ({PM}_{ℓ}) ⟶ C^{(\infty)} ({PM}_{ℓ})

(A37)

endowed with the following three properties:

(1): Antisymmetry ${f, g} = - {g, f}$ , $\forall f, g \in C^{(\infty)} ({PM}_{ℓ})$
(2): Jacobi Identity ${f, {g, h}} + {g, {h, f}} + {h, {f, g}} = 0$ , $\forall f, g, h \in C^{(\infty)}$ $({PM}_{ℓ})$
(3): Leibniz rule ${f, g . h} = {f, g} h + g {f, h}$ , $\forall f, g, h \in C^{(\infty)} ({PM}_{ℓ})$

The first two properties mentioned in Definition A10 guarantee that the space of functions on a Poisson variety becomes a Lie algebra when it is equipped with a Poisson bracket. On the other hand, the third property implies that every function

f \in C^{(\infty)}

is associated by the Poisson bracket with a derivation of the commutative algebra of functions on the manifold, which is, by definition, a vector field

X_{f}

; this latter is named the Hamiltonian vector field of the function f.

Locally, in each open chart

{x_{1}, \dots, x_{j}}

, the Poisson bracket takes the following form:

{f, g} = π^{i j} (x) \frac{\partial f}{\partial x^{i}} \frac{\partial g}{\partial x^{j}}; π^{i, j} (x) = - π^{j i} (x)

(A38)

where the antisymmetric countervariant vector

π^{i j} (x)

is usually called a bivector. Thus the Hamiltonian vector field

X_{f}

is easily identified as:

X_{f} = π^{i j} \partial_{i} f \partial_{j}

(A39)

Suppose now that the dimension of the Poisson variety is

ℓ = 2 n + 2

and that the bivector

π^{j i} (x)

is an everywhere invertible matrix. Posing

ω = π_{k ℓ}^{- 1} d x^{k} \land d x^{ℓ}

we obtain a symplectic 2-form of maximal rank that is closed as a consequence of the Jacobi identities satisfied by the bivector. In this way we see that such a Poisson manifold is a symplectic manifold and in particular we can write:

{f, g} = ω (X_{f}, X_{g})

(A40)

Definition A11.

Let

({SM}_{2 n + 2}, ω)

be a symplectic manifold. A Liouville vector field X is a vector field for which the following condition holds:

L_{X} ω = ω

(A41)

where

L_{X} ω \equiv i_{X} d ω + d (i_{X} ω)

denotes the Lie derivative of the 2-form ω along the vector field X.

Note that, being

ω

closed, if X is a Liouvile vector field, we have:

d (i_{X} ω) = L_{X} ω = ω

.

Appendix A.8. The Relation Between Contact Manifolds and Symplectic Manifolds

Let us consider a symplectic manifold

({SM}_{2 n + 2}, ω)

and let us assume that it admits at least one Liouville vector field

L

. Furthermore let

Σ_{L} \subset {SM}_{2 n + 2}

be the hypersurface which is transverse to the Liouville field

L

. Then we realize that

Σ_{L}

is a contact manifold with contact 1-form

α = i_{L} ω

. Since

Σ_{L}

is transverse to

L

, the 1-form

α

vanishes along

L

and, on the contrary, it never vanishes on the tangent bundle

T Σ_{L}

. In order to verify that

α

is a bona fide contact form we just have to perform the following calculation:

\begin{matrix} α \land \underset{n - times}{\underset{︸}{d α \land \dots \land d α}} & = & i_{L} ω \land \underset{n - times}{\underset{︸}{d i_{L} ω \land \dots \land d i_{L} ω}} = i_{L} ω \land \underset{n - times}{\underset{︸}{ω \land \dots \land ω}} = \frac{1}{n + 1} i_{L} (\underset{(n + 1) - times}{\underset{︸}{ω \land \dots \land ω}}) \\ = & {Vol}_{Σ_{L}} \end{matrix}

(A42)

The last equality holds because

ω^{n + 1}

is the volume form of the ambient symplectic manifold forma di volume della varietà simplettica ambiente la and the hypersurface

Σ_{L}

is, by hypothesis, transverse to the Liouville vector field.

Reversely, given a contact manifold

(M_{2 n + 1}, ξ)

with contact 1-form

α

and Reeb vector field

R

, any hypersurface

Σ \subset M_{2 n + 1}

that is transverse to the Reeb vector, automatically acquires the structure of a symplectic manifold with symplectic 2-form

\tilde{ω} = d α ∣_{Σ}

.

Therefore, we can have odd-dimensional contact manifolds lying in between two adjacent even-dimensional symplectic manifolds as it is shown in the following diagram:

\begin{matrix} ({SM}_{2 n}, \tilde{ω} = d α) & \overset{ι}{↪} & (M_{2 n + 1}, α = i_{L} ω) & \overset{ι}{↪} & ({SM}_{2 n + 2}, ω) \\ ⇓ & ⇓ & ⇓ \\ symplectic & contact & symplectic \\ transverse to Reeb field & transverse to Liouville field \end{matrix}

(A43)

The pattern described in Equation (A43) is reminescent of what happens with sasakian manifolds that lye in between two Kähler manifolds of adiacent even dimensions. This is not surprising since Kähler manifolds are particular instances of symplectic manifolds where the symplectic form is simply the Kähler 2-form.

We stop here, for the moment with the exposition of fundamental geometric concepts, since we have to turn to the other leg of our constructions, namely, Probability Theory. We shall come back to geometry in later chapter about diffusion theory that establishes a solid link between Markov random processes and Riemannian Geometry.

Appendix B. Fundaments of Probability Theory

In this appendix we recall the fundamental concepts, which will be indispensable to us, with regard to measurement and probability theory.

Appendix B.1. σ-Algebras and Probability Measures

Definition A12.

Given a set Ω one defines σ-algebra on Ω a family

A

of subsets

A_{i} \subset Ω

such that:

1.: $\emptyset \in A$ and $Ω \in A$ .
2.: If $A \in A$ then its complement $A^{c} \equiv Ω - A$ also belongs to the same family $A^{c} \in A$ .
3.: If the elements of a denumerable family of sets ${\{A_{i}\}}_{i \in N}$ belong to $A$ then also their union belongs to it:

$A = ⋃_{i = 1}^{\infty} A_{i} \in A$

Definition A13.

The pair

(Ω, A)

where Ω is a set and

A

is a σ-algebra on Ω is named a measurable space.

In Probability Theory the elements of the

σ

-algebra,

X \in A

are named events while the points

p \in Ω

are named experiments.

Definition A14.

One names Part Algebra of a set Ω and denotes it with the symbol

P (Ω)

the set of all subsets equipped with the boolean algebraic operations of union, intersection and complement.

Let us now consider two sets

Ω_{1}

ed

Ω_{2}

; any map:

ϕ : Ω_{1} \to Ω_{2}

(A44)

induces a pullback map

ϕ_{★}^{- 1}

on the corresponding Part Algebras:

ϕ_{★}^{- 1} : P (Ω_{2}) \to P (Ω_{1})

(A45)

One can verify that the map

ϕ_{★}^{- 1}

satisfies the following properties

\begin{matrix} ϕ_{★}^{- 1} (X ⋃ Y) & = & ϕ_{★}^{- 1} (X) ⋃ ϕ_{★}^{- 1} (Y) \\ ϕ_{★}^{- 1} (X ⋂ Y) & = & ϕ_{★}^{- 1} (X) ⋂ ϕ_{★}^{- 1} (Y) \\ ϕ_{★}^{- 1} (X^{c}) & = & ϕ_{★}^{- 1} {(X)}^{c} \end{matrix}

(A46)

Hence

ϕ_{★}^{- 1}

è is a morphism of boolean algebras.

Once this general concepts have been established one can introduce the following notion of probability measure by means of the following:

Definition A15.

Let

(Ω, A)

be a measurable space. A probability measure on

(Ω, A)

is a map

p : A \to [0, 1] \subset R

(A47)

that satisfies the following properties:

$p (\emptyset) = 0$ and $p (Ω) = 1$
$p (⋃_{i} X_{i}) = \sum_{i} p (X_{i})$ for all denumerable unions of disjoint parts, i.e., such that $X_{i} ⋂ X_{j} = \emptyset$ se $i \neq j$ .

When we have a triplet

(Ω, A, p)

we say that we have a stochastic space and the value

p (X)

is the probability of the event X.

Appendix B.2. Stochastic Functions, Stochastic Vectors and Distributions

Let us begin with:

Definition A16.

If

T

is a separable topological space, one names Borel Algebra of

T

, denoted

B (T)

the σ-algebra made by all denumerable unions, intersections and complements of open subsets

U \subset T

.

In particular, on all varieties

R^{n}

we have the ball-topology which, for the real line

R

, reduces to the topology of open intervals

] x, y [\subset R

, and the correspondent Borel algebra is clearly defined. Hence, using as

σ

-algebra the natural Borel algebra

B (R)

, we have that the pair

(R, B (R))

, makes a measurable space

Let us then consider a stochastic space

(Ω, A, p)

and a map:

ψ : Ω \to R

(A48)

which to any point

p \in Ω

of the set

Ω

associates a real number (its coordinate). Because of what we discussed above, the pullback map

ψ_{★}^{- 1} : B (R) \to A

(A49)

is a morphism of boolean algebras that explicitly associates an element

X \in A

to every element of the Borel algebra of the real line. Thus, composing the maps we define:

p_{ψ} \equiv p \circ ψ_{★}^{- 1}

(A50)

which is a map from the Borel algebra of the real line to the interval

[0, 1]

:

p_{ψ} : B (R) \to [0, 1]

(A51)

This is what we name a stochastic function. In practice to every open interval

] x, y [

the stochastic function associates a number between 0 ed 1 which is the probability that while doing a measuring experiment the measured value happens to be in the considered interval. In this way one can consider stochastic functions that are discontinuous, step-wise and the like, yet they are Lebesgue integrable thanks to the measurability of the support space.

Probability Density

An interesting case is when the stochastic function can be described in terms of a probability density given by an integrable function

ρ_{ψ} (q)

on the real line such that:

p_{ψ} ([a, b]) = \int_{a}^{b} ρ_{ψ} (q) d q

(A52)

For the probability density to be well defined, it is necessary that the probability density

ρ_{ψ} (q)

be properly normalized:

\int_{- \infty}^{+ \infty} ρ_{ψ} (q) d q = 1

(A53)

Under these conditions, one can define the average value of any function

f (q)

of the stochastic variable

q \in R

writing

〈 f 〉 \equiv \int_{- \infty}^{+ \infty} f (q) ρ_{ψ} (q) d q

(A54)

Stochastic Vector

In a similar way we can define stochastic vectors.

Consider a finite dimensional vector space

V

:

\dim_{R} V = r < \infty

(A55)

and a basis

e_{i}

(

i = 1 \dots, r

) of vectors such that

\forall X \in V : X = \sum_{i = 1}^{r} X^{i} (χ) e_{i}

(A56)

where the components of the vector are thought of as functions of

χ \in Ω

, the space of events over which we defined the probability measure. By reasoning entirely analogous to that above, each component

X^{i} (χ)

can be thought of as a probability density

X_{ψ}^{i} (q)

on a space

R^{n}

where n is the number of coordinates necessary to identify a point

χ \in Ω

, namely the dimension of the set

Ω

, if this latter can be thought of as a differentiable manifold. As in the previous case what we are constructing is, for each compenent

w^{i}

of the stochastic vector, a map:

ψ^{i} : Ω \to R^{n}

(A57)

which, by pullback, induces a map:

ψ_{★}^{- 1 | i} : B (R^{n}) \to A

(A58)

By composition of maps we obtain

p_{ψ^{i}} \equiv p \circ ψ_{★}^{- 1 | i}

(A59)

which is a map from the Borel algebra of

R^{n}

to the interval

[0, 1]

:

p_{ψ^{i}} : B (R^{n}) \to [0, 1]

(A60)

In this way we have defined aa stochastic vector, namely a map:

Ψ : Ω \to V

(A61)

Also for stochastic vectors the most favorable and smooth situation occurs when each of the vector components is substituted by an integrable probability density:

X (q) = \sum_{i = 1}^{r} ρ_{Ψ}^{i} (q) e_{i}; \int \int \dots \int_{R^{n}} ρ_{Ψ}^{i} (q) \underset{\equiv d μ (q)}{\underset{︸}{d^{n} q}} = 1

(A62)

where with

d μ (q)

we have denoted the integration measure on the space

Ω

which might be more elaborate and contain a factor

\sqrt{\det g}

when

Ω

is a Riemannian manifold endowed with a metric.

Appendix C. A Summary of Classical Thermodynamics and Statistical Mechanics

In this appendix in order to establish the notations and systematically organize the geometrical treatments of both classical thermodynamics and statistical mechanics, we introduce in a very synthetic way the basic concepts of both disciplines [74,75] making when necessary reference to the general conceptual frameworks discussed in the previous appendix and in the main text. Indeed the main motivation of the present summary is to show how Shannon Information entropy and classical thermodynamical entropy do indeed coincide emphasizing that a thermodynamical, geometrical view is closely inherent to any Big Data system.

Appendix C.1. Thermodynamical Potentials and State Functions

In macroscopic thermodynamics one utilizes the following extensive quantities:

U = internal energy of a thermodynamical system
S = entropy
V = volume occupied by the system under consideration, for instance a mixture of gases or a certain quantity of a liquid or of a solid.
N = the total number of particles composing the system, ifor instance the number of molecules of a gas that can be measured in various units, among which the most frequently utilized is the number of moles of the chemical compound under investigation. Alternatively when the system is a mixture of more than one component one utilizes:
$N_{i}$ = the total number of particles of the i-th component of the mixture, typically measured in number or fractions of moles.

Extensive quantities means that partitioning the system into subsystems the considered quantity is subdivided into percentual fractions. In other words, for instance the entropy of a system composed of two subsystems A and B is the entropy of A plus the entropy of B:

S_{A \cup B} = S_{A} + S_{B}

(A63)

Similarly can be said of the internal energy U, of the volume V and of the numbers of particles

N_{i}

or fractions of moles.

The intensive quantities of classical thermodynamics do not have the same property and they are instead characterized by the fact that in the equilibrium states they assume the same value in every part of the system, large or small. They are

T = temperature that determines the average energy per particle.
P = pressure which, as in mechanics, is the force per unit area.
$μ$ = chemical potential which is the intensive variable canonically conjugate to the number of particles N.
$μ_{i}$ = chemical potentials of the various components in the various phases as happens in multicomponent and multiphase mixtures.

Appendix C.2. Thermodynamical Constants

Before we begin our summary of classical thermodynamics and statistical mechanics, it is appropriate to recall the definition and numerical value of universal constants, both fundamental and empirical.

Definition A17.

One mole of a chemical substance

X

is the quantity of atoms or of molecules of that substance

X

necessary to form a mass

M

numerically equal in grams to the weight

w_{X}

of an atom or a molecule of that substance

X

expressed in atomic mass units.

Thanks to such a definition one mole of any chemical substance

X

always contains the same number of atoms or of molecules that is the Avogadro number:

N_{A} = 6.002214076 \times 10^{23}

(A64)

In view of the Definition (A20) Avogadro number can be seen as the conversion factor from the standard mass unit, namely the gram and the atomic mass unit

u

1 g = N_{A} u

(A65)

The Equation of State (A115) that shortly after we rigorously derive from the partition function for a classical system of free identical particles, coincides with the Equation of State of diluted gases, empirically known since long time in the following form (It was experimentally determined by Émile Clapeyron in 1834):

P V = n R T

(A66)

where P is the pressure, n denotes the number of moles of the gas that are present in the considered volume V, and R is a universal physical constant with the following value:

R = 8.31446261815324 \frac{J}{m o l \times K}

(A67)

Definition A18.

Boltzmann constant, that appears in all formulae of statistical mechanics, is defined as the ratio of the ideal gas constant R and Avogadro number (A119):

k_{B} \equiv \frac{R}{N_{A}} = 1.380649 \times 10^{- 16} e r g K^{- 1}

(A68)

Thanks to Definition A19, the empirical form of the equation of state (A66) and that derived from the Statistical Mechanics of free particles do coincide since by substituting (A118) into Equation (A115), we obtain the ratio

N / N_{A}

between the number of particles and Avogadro number, which is by definition the number of moles of the considered chemical substance:

n = \frac{N}{N_{A}} = # di moli

(A69)

The First and Second Principles of Thermodynamics

Classical thermodynamics is axiomatically formulated through two principles that are expressed in differential form and concern the infinitesimal changes in the quantities introduced in the previous paragraph when, by means of heat supply or subtraction,

\pm d Q

an infinitesimal transformation of the thermodynamic system is carried out from its equilibrium state.

The First Principle

The first principle asserts that if we call

d W

the mechanical work absorbed or given up by the system, in an infinitesimal transformation, the following relationship holds

d Q = d U - d W

(A70)

Typically, mechanical work produces a change in the volume V of the system, and since pressure is force per unit area, we have

d W = P d V

so that the canonical formulation of the first principle is the following:

d Q = d U - P d V

(A71)

The Second Principle

The second principle of thermodynamics can be formulated in several equivalent ways. The most concise is as follows:

\oint_{C} \frac{d Q}{T} = 0

(A72)

which translates to saying that if we integrate the differential form

\frac{d Q}{T}

along a closed path in the space of state variables, that is, by performing through exchanges of work and heat a transformation that takes the system from the initial state to a final state equal to the initial one, then we obtain zero. This implies that, unlike heat differential

d Q

, the combination

\frac{d Q}{T}

is an exact differential, namely, it is the differential of a new state function that we have already anticipated and which takes the name of entropy S. Hence, we can write:

d Q = T d S

(A73)

where S is a function of state, for instance

S = S (T, V, N)

.

Thus, one introduces the following thermodynamic potentials, which are all functions of state:

(1): Internal Energy

$U = U (T, V, N)$

(A74)
(2): Helmholtz Free Energy

$F \equiv U - T S$

(A75)
(3): Entalpy

$H \equiv U + P V$

(A76)
(4): Gibbs potential

$G \equiv H - T S = U + P V - T S = F + P V$

(A77)

The various thermodynamic potentials are all related to each other by relationships and are useful in describing thermodynamic transformations of various kinds, but their most important justification is the relationship of three of them, with the partition function, respectively, of the three possible ensembles in the microscopic, statistical mechanics description of macroscopic thermodynamic systems.

Appendix C.3. The Three Ensembles of Statistical Mechanics

Let us begin with the first of the three ensembles, which is perhaps the most fundamental of the three because it aims to directly explain disorder in terms of the enormous number of microscopic configurations corresponding to the same macroscopic state.

Appendix C.3.1. The Microcanonical Ensemble

Considering a number

N ≫ 1

of particles (molecules) that are collected in a volume V and have an overall energy E, we identify the internal energy of the system with the said energy:

U = E

(A78)

and the entropy of the same system with:

S (U, V) = k_{B} log (N_{E})

(A79)

where

k_{B}

is the Boltzmann constant, and by definition

N_{E} = # microscopic states that have an overall energy E

(A80)

From the first principle of thermodynamics, if we know the entropy function

S (U, V)

we retrieve the temperature since:

{\frac{\partial S}{\partial U}|}_{V = c o s t} = \frac{1}{T}

(A81)

Using the same logical procedure, we instead obtain:

T {\frac{\partial S}{\partial V}|}_{U = c o s t} = P

(A82)

and all the equations of thermodynamics can be reconstructed from the knowledge of the function

S (U, V)

defined by means of the identification in Equation (A79).

Appendix C.3.2. The Canonical Ensemble

In the microcanonical ensemble, the total energy

U = E

, the volume V, and the number of particles N are kept constant. In the canonical ensemble, on the other hand, the fixed energy condition is relaxed and only the volume V and the number of particles N constituting the system are kept fixed. Instead of energy, the temperature parameter T is introduced. With these elements, one then constructs the so-called canonical partition function. In the following way. Let

Σ (N, V)

be the set of all possible microscopic states (at the level of classical mechanics, we can say configurations in phase space) that can be constructed with

N ⋙ 1

particles (typically molecules) in volume V. Each state, i.e., each element

σ \in Σ (N, V)

is endowed with a specific energy that we will denote

E_{σ}

. The canonical partition function is then written as the following sum:

Z_{N} (T, V) \equiv \sum_{σ \in Σ (N, V)} exp [- \frac{E_{σ}}{k_{B} T}]

(A83)

The connection with classical thermodynamics is made through the following identification:

F (T, V, N) = A (T, V, N) \equiv - k_{B} T log [Z_{N} (T, V)]

(A84)

where

F (T, V, N)

is Helmholtz free energy introduced in Equation (A75) and

A (T, V, N)

is the name we give to the combination to its left. The rationale for this identification becomes clear if we reason in terms of probabilities and refer to the general scheme described in Appendix B in particular recalling Equations (43) and (44). The set of all states

Σ (N, V)

is, in the present case, the measurable space

Ω

and the energy

E_{σ}

of a state

σ \in Ω

is the relevant stochastic vector

X

. In this case the vector space

V

is one-dimensional because the energy E is a scalar quantity at least in classical mechanics. Finally we can identify:

λ = \frac{1}{k_{B} T}

(A85)

and we see that the probability of the system being in state

σ

, given by:

p (σ) = \frac{exp [- \frac{E_{σ}}{k_{B} T}]}{Z_{N} (T, V)}

(A86)

coincides with that defined in the general Formula (43). For the same valid reason, in the general case, this probability density is correctly normalized to 1 if we sum over all states.

Note that the measurable set

Σ (N, V)

is typically a variety of large dimensions and therefore the variables

q

that identify its points are many. For example, when the constituent particles of the system can be considered classical particles,

Σ (N, V)

is the phase space for a system of N particles with

6^{N}

dimensions. Each configurations

σ

in the phase space has a given energy

E_{σ}

.

We can now estimate the average energy of the system in the ensemble by writing:

〈 E 〉 = \sum_{σ \in Σ (N, V)} E_{σ} p (σ) \equiv k_{B} T^{2} \partial_{T} log [Z_{N} (T, V)]

(A87)

The fundamental conceptual identification is that between the thermodynamic internal energy and the average energy of the canonical ensemble calculated in Equation (A87):

U = 〈 E 〉

(A88)

Referring again to the general formulas in Appendix B we can compare Equation (A87) with the general Equation (46). Setting

H (λ, N) = - log [Z (T, N)]

(A89)

and considering relation (A85) according to (46), we find:

〈 E 〉 = \frac{\partial}{\partial λ} H (λ, N) = - \frac{\partial T}{\partial λ} \partial_{T} log [Z (T, N)] = k_{B} T^{2} \partial_{T} log [Z_{N} (T, V)]

(A90)

which exactly reproduces Equation (A87).

Let us compare Equation (A84) with the general Equation (47). Using the relation (A85) we get:

I [p] = - log [Z (T, N)] - \frac{1}{k_{B} T} U

(A91)

and multiplying both the left and the right-hand side by

k_{B} T

, we find:

U + T (k_{B} I [p]) = A (T, V, N) \equiv - k_{B} T log [Z_{N} (T, V)]

(A92)

Hence, recalling the definition of Helmholtz free energy (A75), we realize that the identification (A84) is the correct one if Shannon information entropy

I [p]

is identified with thermodynamical entropy modulo the factor

- k_{B}

:

S (T, V, N) = - k_{B} I [p]

(A93)

In classical thermodynamics, the identification between the function

A (T, V, N)

and Helmholtz free energy

F (T, V, N)

follows from the fact that we can show that F and A satisfy the same differential relation with internal energy.

Let us begin with the thermodynamic definition (A75). Utilizing both the first and second principles of thermodynamics, we get:

\begin{matrix} d F & = & d U - d T S - T d S \end{matrix}

(A94)

\begin{matrix} = & T d S - P d V - d T S - T d S = - P d V - S d T \end{matrix}

(A95)

from which we work out the relations:

P = - {\frac{\partial F}{\partial V}|}_{T = c o s t}; S = - {\frac{\partial F}{\partial T}|}_{V = c o s t}

(A96)

Equation (A96) implies that the definition (A75) can be rewritten as the following differential equation:

F = U + T {\frac{\partial F}{\partial T}|}_{V = c o s t}

(A97)

Reconsidering now Equation (A87) that provides the expression for internal energy, with obvious manipulations, we can write what follows:

〈 E 〉 = U = A - T {\frac{\partial A}{\partial T}|}_{V = c o s t}

(A98)

which coincides with (A97) if

F = A

. Thus, the interpretation (A84) is fully justified within the setup of classical thermodynamics.

In the previous discussion, we emphasized the deeper sense of the classical identification in relation to Information Theory. In any case, having established identification (A84), the thermodynamic variables are all identified in the canonical ensemble as well. Summarizing, we have:

\begin{matrix} P & = & \partial_{V} (k_{B} T log [Z_{N} (T, V)]) \\ S & = & \partial_{T} (k_{B} T log [Z_{N} (T, V)]) \\ U & = & k_{B} T^{2} \partial_{T} log [Z_{N} (T, V)] \\ F (T, V, N) & = & - k_{B} T log [Z_{N} (T, V)] \end{matrix}

(A99)

Keeping the number N of particles or of moles fixed, the thermodynamic state state of the system is always a point in a two-dimensional variety for which we can use either the native variables

(T, V)

which are the arguments of the partition function or any other pair of independent variables whose relations to

(T, V)

we can derive by inverting the relations (A99), whenever this is analytically possible.

Appendix C.3.3. The Grand Canonical Ensemble and the Gibbs Potential

In the formulation of statistical mechanics through the canonical grand ensemble, not only is the energy of states not fixed, but neither is the total number of particles or moles of the substance. Thus, the following grand partition function is written:

Z (T, V, μ) = \sum_{N = 0}^{\infty} z^{N} Z_{N} (T, V)

(A100)

where the variable

z = exp [\frac{μ}{k_{B} T}]

is named the fugacity and the symbol

μ

is called the chemical potential. The relationship between the statistical description by the canonical grand partition function and classical thermodynamics is encoded in the identification:

Φ (T, V, μ) = - k_{B} T log [Z (T, V, μ)] = U - T S - μ N

(A101)

In the grand canonical description of thermodynamics as well as in the canonical description, the internal energy of the system is not an a priori fixed datum, but rather it takes an average value that we calculate with the exact analogue of the Formula (A87)

U = k_{B} T^{2} \partial_{T} log [Z (T, V, μ)]

(A102)

The same thing happens with the number of particles (or moles), which is not fixed but just assumes an average value calculated below:

〈 N 〉 = - k_{B} T \partial_{μ} log [Z (T, V, μ)]

(A103)

Identification (A101) also allows us to write the other analogs of Equation (A99)

\begin{matrix} P & = & \partial_{V} (k_{B} T log [Z (T, V, μ)]) \\ U & = & k_{B} T^{2} \partial_{T} log [Z (T, V, μ)] \\ S & = & \partial_{T} (k_{B} T log [Z (T, V, μ)]) \\ N & = & - k_{B} T \partial_{μ} log [Z (T, V, μ)] \end{matrix}

(A104)

and suggests the introduction of a new thermodynamic potential, called the Gibbs potential, which has the following definition

G (T, V, μ) = P V + U - T S

(A105)

Appendix C.4. Statistical Mechanics of Ideal Gases

After the general exposition of the previous section, we briefly present the fundamental example of the ideal gas of identical particles with masses all equal and not interacting with each other. As we recalled above, this is one of the rare cases where the partition function can be explicitly computed in closed form.

Our system consists of N non-relativistic particles, each with mass

m

, that are free to move inside a volume V, which, for simplicity, we regard as a cube with side

ℓ = \sqrt[3]{V}

.

The phase space, in the Hamiltonian approach, is a space of dimension

2 \times 3 \times N

whose coordinates are the N pairs

(p_{i}, q_{i})

where

p_{i} = {p_{i, x}, p_{i, y}, p_{i, z}}

is the momentum vector and

q_{i} = {q_{i, x}, q_{i, y}, q_{i, z}}

is the position vector of the i-th particle (

i = 1, \dots, N

).

The classical Hamiltonian of the system is very simple and is as follows:

H (p) = \sum_{i = 1}^{N} \frac{p_{x, i}^{2} + p_{y, i}^{2} + p_{z, i}^{2}}{2 m}

(A106)

In accordance with the general principles stated in previous sections, since the Hamiltonian represents the energy of the system, introducing Planck’s constant h as the unit of measurement, the canonical partition function is written as follows:

Z_{N} (T, V) = \frac{1}{N! h^{3 N}} \int exp [- \frac{H (p)}{k_{B} T}] d^{3 N} p \times \int d^{3 N} q

(A107)

Since the Hamiltonian does not depend on the coordinates

q

, the last integral in

d^{3 N} q

is calculated immediately, and we get

V^{N}

. Since the Hamiltonian is a sum of independent quadratic terms, the integral over moments is transformed into a product of integrals, and we have:

Z_{N} (T, V) = \frac{V^{N}}{N!} \times \prod_{i = 1}^{N} P_{i, x} P_{i, y} P_{i, z}

(A108)

, where

P_{i, x} = \int_{- \infty}^{+ \infty} \frac{1}{h} exp [- \frac{p_{i, x}^{2}}{2 m k_{B} T}] d p_{i, x}

(A109)

and similarly for

x \to y

ed

x \to z

. Everything then reduces to the calculation of a single Gaussian integral. Through the change of variable

t = p / \sqrt{2 m k_{B} T}

, we obtain the final result:

Z_{N} (T, V) = \frac{V^{N}}{N!} {(\frac{2 π k_{B} T m}{h^{2}})}^{3 N / 2}

(A110)

At this point, we are in a position to write the Helmholtz free energy as a function of temperature, volume and number of particles (or number of moles). It suffices to use the general Formula (A84) and apply it to the case of the free gas partition function calculated in (A110). We get:

\begin{matrix} F_{I G} (T, V, N) & = & - k_{B} T (N (\frac{3}{2} log (\frac{2 π k_{B} m}{h^{2}}) + \frac{3 log (T)}{2} + log (V)) - log (Γ (N + 1))) \\ \approx & \frac{1}{2} k_{B} N T (- 3 log (\frac{2 π k_{B} m}{h^{2}}) + 2 log (N) - 3 log (T) - 2 log (V) - 2) \end{matrix}

(A111)

where

Γ

denotes the Euler Gamma function e

Γ (N + 1) = N!

and where the second line is obtained from the first using the Stirling approximation

log [Γ [N + 1]] \approx N log [N] - N

which is very accurate for large values of N, as happens in our case.

Starting from Equation (A111) and using the general definitions (A99) and relation (A75), we obtain all thermodynamic state functions for ideal gases.

(Pressure)

$P_{I G} (T, V, N) = - \partial_{V} F_{I G} = \frac{k_{B} N T}{V}$

(A112)
(Entropy)

$S_{I G} (T, V, N) = - \partial_{T} F_{I G} = \frac{1}{2} k N (3 log (\frac{2 π k_{B} m}{h^{2}}) - 2 log (N) + 3 log (T) + 2 log (V) + 5)$

(A113)
(Internal Energy)

$U_{I G} (T, V, N) = F_{I G} (T, V, N) + T S_{I G} (T, V, N) = \frac{3 k_{B} N T}{2}$

(A114)

The Equation of State of Ideal Gases

Equation (A112) gives us the equation of state of ideal gases as a relation between pressure, volume, and temperature at a fixed number of moles N. Leaving out the subscript

I G

since it is clear that we are talking about ideal gases, we have:

P V = k_{B} N T

(A115)

Appendix C.5. The Van Der Waals Model of a Real Gas

The first and still important example of modification of the gas equation of state is due to the Dutch physicist-mathematician van der Waals, who replaced Equation (A115) with the following one where two phenomenological parameters were introduced,

(a, b)

:

(P + \frac{a n^{2}}{V^{2}}) (V - b n) = n R T;

(A116)

In (A116) n denotes the number of moles of the chemical component of the gas, while, as before, P is the pressure, and V is geometrical volume in which the gas is enclosed. Furthermore, R is a universal physical constant with the following value:

R = 8.31446261815324 \frac{J}{m o l \times K}

(A117)

Definition A19.

The Boltzmann constant, already utilized before, and that appears in all formulae of statistical mechanics is defined as the ratio of the ideal gas constant R and the Avogadro number (A119):

k_{B} \equiv \frac{R}{N_{A}} = 1.380649 \times 10^{- 16} e r g K^{- 1}

(A118)

Definition A20.

One mole of a chemical substance

X

is the quantity of atoms or of molecules of that substance

X

necessary to form a mass

M

numerically equal in grams to the weight

w_{X}

of an atom or a molecule of that substance

X

expressed in atomic mass units.

Thanks to such a definition one mole of any chemical substance,

X

always contains the same number of atoms or of molecules that is the Avogadro number:

N_{A} = 6.002214076 \times 10^{23}

(A119)

In view of the Definition (A20) Avogadro number can be seen as the conversion factor from the standard mass unit, namely the gram and the atomic mass unit

u

1 g = N_{A} u

(A120)

The two phenomenological parameters

a > 0

and

b > 0

were introduced to account for two effects. The first effect encrypted in parameter b takes into account the fact that each molecule does not have the entire geometric volume V at its disposal because the repulsive forces of the other molecules that are short-range create interdiction zones whose total volume is the greater, the larger the number of molecules present. The second effect modeled through the introduction of the parameter a is the reduction in the effective force exerted by molecular collisions on the walls of the container because of the attractive force, at longer radius, exerted on each molecule by all the first ones nearby. The physical dimensions of the introduced parameters can be read directly from the equation. Considering the number of moles dimensionless, we have:

[b] = ℓ^{3}; [a] = m ℓ^{7} t^{- 2}

(A121)

where ℓ denotes length; m denotes mass; and t denotes time.

Skipping a lot of classical thermodynamic manipulation, we arrive at the conclusion that for the van der Waals model of a real gas Equation (78) is replaced by

\begin{matrix} A_{W} (T, V) & = & \frac{a b n^{3} - a n^{2} V + n R T V^{2}}{V^{2} (V - b n)} \\ B_{W} (T, V) & = & n \frac{3}{2} R T - \frac{a n^{2}}{V} \end{matrix}

(A122)

Function (A122) satisfies the general constraint (76) of Lagrangian immersion. Correspondingly, the Riemannian metric that is induced on the Lagrangian variety as defined in Equation (77) takes, in the van der Waals case, the following explicit form:

d s_{V D W}^{2} = \frac{n (- \frac{2 T d V^{2} (R T V^{3} - 2 a n {(V - b n)}^{2})}{V^{3} {(V - b n)}^{2}} - 3 R d T^{2})}{2 T^{2}}

(A123)

The metric (A123) can be immediately rewritten in terms of zweibein 1-forms:

d s_{V D W}^{2} = - e^{1} \otimes e^{1} - e^{2} \otimes e^{2}

(A124)

where

\begin{matrix} e^{1} & = & \frac{\sqrt{\frac{3}{2}} d T \sqrt{n R}}{T} \\ e^{2} & = & d V \sqrt{\frac{n (R T V^{3} - 2 α n {(V - β n)}^{2})}{T V^{3} {(V - β n)}^{2}}} \end{matrix}

(A125)

It is very easy to calculate the unique component of the spin connection

ω^{12}

and the curvature 2-form that happens to be the following:

\begin{matrix} R_{V D W} & = & R_{V D W} (T, V) (\begin{matrix} 0 & e^{1} \land e^{2} \\ - e^{1} \land e^{2} & 0 \end{matrix}) \\ R_{V D W} (T, V) & = & \frac{2 a {(V - b n)}^{2} (a n {(V - b n)}^{2} - R T V^{3})}{3 R {(R T V^{3} - 2 a n {(V - b n)}^{2})}^{2}} \end{matrix}

(A126)

One immediately sees that if the two parameters

a, b

are set to zero, the curvature 2-form vanishes. Hencem the equation of state of Ideal Gases corresponds to a flat Lagrangian variety deprived of phase transitions and critical phenomena. It follows that thermodynamic curvature, as it was first proposed by Ruppeiner [24] measures the interaction among molecules and defines critical phenomena through its own critical surfaces and critical curves.

What is essentially new compared with the equation of state for ideal gases is that the function

A_{W} (V, n, T, a, b)

possesses a critical point at which both its first and second derivatives with respect to the volume cancel. The two equations

\partial_{V} A_{W} = 0

and

\partial_{V}^{2} A_{W} = 0

have a single solution for the temperature variable

T

and the volume variable V such that we find a single critical point for the three thermodynamic variables

(P, V, T)

, which is displayed in the following equation:

V_{c} = 3 b n; T_{c} = \frac{8 a}{27 b R} \Rightarrow P_{c} = \frac{a}{27 b^{2}}

(A127)

As can be seen, the critical temperature and pressure depend only on the parameters

a, b

, while the critical volume also depends on the number of moles. If we introduce dimensionless variables

T, v, p

defined as the ratios of

T, V, P

with respect to their critical values

T = T T_{c}; V = v V_{c}; P = p P_{c}

(A128)

in terms of such variables van der Waals equation of state becomes universal:

p = \frac{8 T}{3 v - 1} - \frac{3}{v^{2}} \equiv p_{W} (T, v)

(A129)

It is very interesting to rewrite the function

R_{V D W} (T, V)

in terms of the dimensionless variables introduced in Equation (A128). We get:

R_{V D W} (T, V) = \frac{1}{6 n R} R (T, v) \equiv - \frac{{(1 - 3 v)}^{2} (8 T v^{3} - 9 v^{2} + 6 v - 1)}{{(- 4 T v^{3} + 9 v^{2} - 6 v + 1)}^{2}}

(A130)

The plot of function

R (T, v)

is shown in Figure A2. The singularity line is simply the vanishing locus in the

{v, T}

plane for the denominator in Equation (A130).

Figure A2. The first image shows the two-dimensional surface traced by the curvature scalar (A130) in the three-dimensional space spanned by the coordinates

{v, T, - R}

. One easily sees the singularity line along which thermodynamic curvature diverges to

- \infty

. Such a line projected onto the plane

{v, T}

is displayed in the second image.

Figure A2. The first image shows the two-dimensional surface traced by the curvature scalar (A130) in the three-dimensional space spanned by the coordinates

{v, T, - R}

. One easily sees the singularity line along which thermodynamic curvature diverges to

- \infty

. Such a line projected onto the plane

{v, T}

is displayed in the second image.

Appendix D. The Example of the Kählerian Manifold $M^{[2, 2]}$ with Non Trivial Paint Group

Within the Tits Satake Universality Class (36), we choose the explicit case

q = 2

in order to illustrate how the essential items in the formulation of generalized thermodynamics à la Souriau can be written in Paint Group covariant fashion and hence extended to the whole class. The symmetric space

M^{[2, 2]}

of our example has dimension 8, which, therefore, is also the dimension of the corresponding solvable group

S_{8}

and of the corresponding solvable Lie algebra

{Solv}_{8}

:

\begin{matrix} M^{[2, 2]} & = & \frac{SO (2, 4)}{SO (2) \times SO (4)} \\ \dim [M^{[2, 2]}] & = & 8 = \dim [S_{[2, 2]}] = \dim [{Solv}_{[2, 2]}] \end{matrix}

(A131)

The chosen basis of generators of the solvable Lie algebra are displayed in Table A1 (for more details on their construction and normalization see [1]).

Table A1. The generators of the solvable Lie algebra

{Solv}_{8}

.

Table A1. The generators of the solvable Lie algebra

{Solv}_{8}

.

\begin{matrix} T_{1} & = & (\begin{matrix} 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & - 1 \end{matrix}) & ; & T_{2} & = & (\begin{matrix} 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & - 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}) \end{matrix}

\begin{matrix} T_{3} & = & (\begin{matrix} 0 & \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}) & ; & T_{4} & = & (\begin{matrix} 0 & 0 & 0 & 0 & \frac{1}{\sqrt{2}} & 0 \\ 0 & 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}) \end{matrix}

\begin{matrix} T_{5} & = & (\begin{matrix} 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}) & ; & T_{6} & = & (\begin{matrix} 0 & 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}) \end{matrix}

\begin{matrix} T_{7} & = & (\begin{matrix} 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}) & ; & T_{8} & = & (\begin{matrix} 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}) \end{matrix}

Following the conventions and the theory exposed in [1,2] a generic element of the solvable Lie algebra is parameterized as follows:

{Solv}_{8} ∋ X (Υ) = Y^{1} T_{1} + Y^{2} T_{2} + Y^{3} T_{3} + Y^{4} T_{4} + Y^{5, 1} T_{5} + Y^{5, 2} T_{6} + Y^{6, 1} T_{7} + Y^{6, 2} T_{8}

(A132)

The reason for the special naming of the solvable coordinates

Υ

is the distinction between the long roots (generators

T_{3, 4}

associated with roots

α_{3, 4}

) that have no multiplicity and the short ones that have multiplicity and transform in the fundamental representation of the Paint Group

G_{Paint}

(see [1] for details). For the chosen example, the Paint Group is just

SO (2)

and the solvable generators

T_{5, 6}

form the doublet of painted roots

α_{5}

, while the solvable generators

T_{7, 8}

form the doublet of painted roots

α_{6}

in the root system of

so (2, 3) ≃ sp (4, R)

.

Following the conventions and notations of [1,2], the

Σ

exponential map from the solvable Lie algebra to the solvable group yields the generic element of the solvable group manifold in the form presented in Equation (3.56) of [1], that we repeat here for reader’s convenience:

\begin{matrix} L {(Υ)}^{[2, 1]} = \\ (\begin{matrix} e^{Y_{1}} & \frac{e^{Y_{1}} Y_{3}}{\sqrt{2}} & \frac{1}{2} e^{Y_{1}} (\sqrt{2} U_{1} + Y_{3} V_{1}) & \frac{1}{2} e^{Y_{1}} (\sqrt{2} U_{2} + Y_{3} V_{2}) & - \frac{1}{8} e^{Y_{1}} (4 U \cdot V + \sqrt{2} (Y_{3} V^{2} - 4 Y_{4})) & - \frac{1}{4} e^{Y_{1}} (U^{2} + 2 Y_{3} Y_{4}) \\ 0 & e^{Y_{2}} & \frac{e^{Y_{2}} V_{1}}{\sqrt{2}} & \frac{e^{Y_{2}} V_{2}}{\sqrt{2}} & - \frac{1}{4} e^{Y_{2}} V^{2} & - \frac{e^{Y_{2}} Y_{4}}{\sqrt{2}} \\ 0 & 0 & 1 & 0 & - \frac{V_{1}}{\sqrt{2}} & - \frac{U_{1}}{\sqrt{2}} \\ 0 & 0 & 0 & 1 & - \frac{V_{2}}{\sqrt{2}} & - \frac{U_{2}}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & e^{- Y_{2}} & - \frac{e^{- Y_{2}} Y_{3}}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & 0 & e^{- Y_{1}} \end{matrix}) \end{matrix}

(A133)

In Equation (A133) we have used the notation

U_{i} = Y_{5, i}

and

V_{i} = Y_{6, i}

(in our case

i = 1, 2

), which puts into evidence the existence of two Paint vectors

U, V

in the case

r = 2

and the Paint Group covariant structure of the solvable group element

L {(Υ)}^{[2, s]}

. Indeed, from Equation (A133), it is immediate to deduce the general form of the matrix for any value of

q

.

Starting from Equation (A133), we easily calculate all the further required items necessary for our argumentation. To begin with, we calculate the left-invariant 1-form:

Θ \equiv L^{- 1} d L

and then we project it onto the coset generators in the orthogonal decomposition of the full

U

Lie algebra:

so (2, 4) = \underset{H}{\underset{︸}{so (2) \oplus so (4)}} \oplus K

(A134)

The list of

K

generators is given in Table A2.

Table A2. The coset generators of the of

SO (2, 4) / SO (2) \times SO (4)

.

Table A2. The coset generators of the of

SO (2, 4) / SO (2) \times SO (4)

.

\begin{matrix} K_{1} & = & (\begin{matrix} 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & - 1 \end{matrix}) & ; & K_{2} & = & (\begin{matrix} 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & - 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}) \end{matrix}

\begin{matrix} K_{3} & = & (\begin{matrix} 0 & \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 \\ \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} & 0 \end{matrix}) & ; & K_{4} & = & (\begin{matrix} 0 & 0 & 0 & 0 & \frac{1}{\sqrt{2}} & 0 \\ 0 & 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 & 0 \\ 0 & - \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 \end{matrix}) \end{matrix}

\begin{matrix} K_{5} & = & (\begin{matrix} 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & - \frac{1}{\sqrt{2}} & 0 & 0 & 0 \end{matrix}) & ; & K_{6} & = & (\begin{matrix} 0 & 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & - \frac{1}{\sqrt{2}} & 0 & 0 \end{matrix}) \end{matrix}

\begin{matrix} K_{7} & = & (\begin{matrix} 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 & 0 \\ 0 & \frac{1}{\sqrt{2}} & 0 & 0 & - \frac{1}{\sqrt{2}} & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & - \frac{1}{\sqrt{2}} & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}) & ; & K_{8} & = & (\begin{matrix} 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & \frac{1}{\sqrt{2}} & 0 & 0 & - \frac{1}{\sqrt{2}} & 0 \\ 0 & 0 & 0 & - \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}) \end{matrix}

Since the

K_{i}

are normalized in such a way that

Tr (K_{i} \cdot K_{j}) = δ_{ij}

, we can immediately calculate the vielbein as

V^{i} = Tr (K_{i} Θ); i = 1, \dots, 8

(A135)

The vielbeins are, as they should, linear combinations, with constant coefficients of the left-invariant 1-forms

e^{i}

, defined by the decomposition:

Θ = \sum_{i = 1}^{8} e^{i} T_{i}

(A136)

The latter have the following explicit appearance:

\begin{matrix} e^{1} & = & d Y_{1} \\ e^{2} & = & d Y_{2} \\ e^{3} & = & d Y_{3} + Y_{3} (d Y_{1} - d Y_{2}) \\ e^{4} & = & \frac{1}{4} (Y_{6, 1}^{2} (- d Y_{3}) + Y_{3} Y_{6, 1}^{2} d Y_{2} - 2 \sqrt{2} Y_{6, 1} d Y_{5, 1} - Y_{6, 2}^{2} d Y_{3} + Y_{3} Y_{6, 2}^{2} d Y_{2} - 2 \sqrt{2} Y_{6, 2} d Y_{5, 2} \\ + (- Y_{3} Y_{6, 1}^{2} - 2 \sqrt{2} Y_{5, 1} Y_{6, 1} - Y_{3} Y_{6, 2}^{2} - 2 \sqrt{2} Y_{5, 2} Y_{6, 2} + 4 Y_{4}) d Y_{1} + 4 d Y_{4} + 4 Y_{4} d Y_{2}) \end{matrix}

(A137)

\begin{matrix} e^{5, 1} & = & d Y_{5, 1} + \frac{Y_{6, 1} (d Y_{3} - Y_{3} d Y_{2})}{\sqrt{2}} + (Y_{5, 1} + \frac{Y_{3} Y_{6, 1}}{\sqrt{2}}) d Y_{1} \\ e^{5, 2} & = & d Y_{5, 2} + \frac{Y_{6, 2} (d Y_{3} - Y_{3} d Y_{2})}{\sqrt{2}} + (Y_{5, 2} + \frac{Y_{3} Y_{6, 2}}{\sqrt{2}}) d Y_{1} \\ e^{6, 1} & = & d Y_{6, 1} + Y_{6, 1} d Y_{2} \\ e^{6, 2} & = & d Y_{6, 2} + Y_{6, 2} d Y_{2} \end{matrix}

(A138)

One finds the following relation between the vielbein and the left invariant 1-forms:

V^{i} = ν_{A}^{i} e^{A}; ν = (\begin{matrix} 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{2} & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & \frac{1}{2} & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & \frac{1}{2} & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & \frac{1}{2} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & \frac{1}{2} & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & \frac{1}{2} \end{matrix})

(A139)

This result determines the form of the

κ

matrix in this case and consequently the expression of the

SO (2, 4)

invariant metric on the symmetric space (A131) in terms of the left-invariant 1-forms. Indeed, we have:

\begin{matrix} {ds}^{2} & = & κ_{AB} e^{A} \times e^{B} \\ κ & \equiv & ν^{T} \cdot ν = (\begin{matrix} 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{4} & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & \frac{1}{4} & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & \frac{1}{4} & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & \frac{1}{4} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & \frac{1}{4} & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & \frac{1}{4} \end{matrix}) \end{matrix}

(A140)

to be compared with the result obtained in Equation (163) for the case of the maximally split master example

SL (3, R) / SO (3)

. As in all the other cases, the left-invariant 1-forms satisfy a set of Maurer–Cartan equations:

\begin{matrix} d e^{1} & = & 0 \\ d e^{2} & = & 0 \\ d e^{3} + \frac{1}{2} (2 e^{1} \land e^{3} - 2 e^{2} \land e^{3}) & = & 0 \\ d e^{4} + \frac{1}{2} (2 e^{1} \land e^{4} + 2 e^{2} \land e^{4} - \sqrt{2} e^{5, 1} \land e^{6, 1} - \sqrt{2} e^{5, 2} \land e^{6, 2}) & = & 0 \\ d e^{5, 1} + \frac{1}{2} (2 e^{1} \land e^{5, 1} + \sqrt{2} e^{3} \land e^{6, 1}) & = & 0 \\ d e^{5, 2} + \frac{1}{2} (2 e^{1} \land e^{5, 2} + \sqrt{2} e^{3} \land e^{6, 2}) & = & 0 \\ d e^{6, 1} + e^{2} \land e^{5, 1} & = & 0 \\ d e^{6, 2} + e^{2} \land e^{6, 2} & = & 0 \end{matrix}

(A141)

from which we read off the explicit value of the solvable Lie algebra structure constants

f_{BC}^{A}

since their general form is that given in Equations (156) and (157).

Observing Equation (A142), we immediately see how they can be generalized to any manifold of the TS universality class introducing the Paint index that runs in the fundamental vector representation of the Paint Group

G_{Paint} = SO (q)

. The Paint coariant transcription of the Maurer–Cartan equation is the following:

\begin{matrix} d e^{1} & = & 0 \\ d e^{2} & = & 0 \\ d e^{3} + \frac{1}{2} (2 e^{1} \land e^{3} - 2 e^{2} \land e^{3}) & = & 0 \\ d e^{4} + \frac{1}{2} (2 e^{1} \land e^{4} + 2 e^{2} \land e^{4} - \sqrt{2} \sum_{i = 1}^{q} e^{5, i} \land e^{6, i}) & = & 0 \\ d e^{5, i} + \frac{1}{2} (2 e^{1} \land e^{5, i} + \sqrt{2} e^{3} \land e^{6, i}) & = & 0 \\ d e^{6, i} + e^{2} \land e^{5, i} & = & 0 \end{matrix}

(A142)

and it has the same form as the Maurer–Cartan equations on the Siegel plane, namely on the Tits Satake projection of the manifold. It suffices to restrict the Paint index

i

to the first value

i = 1

.

Appendix D.1. The Kähler 2-Form

The reason why the symmetric spaces (36) are all Kähler manifolds is the presence in the isotropy compact subgroup

H_{c} = SO (2) \times H^{'}

of the factor

SO (2) ≃ U (1)

and the arrangement of the coset generator vector space

K

into a representation

(2 ∣ v)

where 2 is the doublet of

SO (2)

and

v

is some irreducible representation of the other factor

H^{'}

. All coset manifolds where such a situation is realized are Kähler manifolds, since the generator of

SO (2)

in the

K

representation of

H_{c}

can be identified with the complex structure and leads to the explicit expression of the closed Kähler 2-form. In our specific case (A131) the

SO (2)

-generator in the fundamental representation of

SO (2, 4)

is the following one:

X^{c} = (\begin{matrix} 0 & \frac{1}{2} & 0 & 0 & - \frac{1}{2} & 0 \\ - \frac{1}{2} & 0 & 0 & 0 & 0 & \frac{1}{2} \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ \frac{1}{2} & 0 & 0 & 0 & 0 & - \frac{1}{2} \\ 0 & - \frac{1}{2} & 0 & 0 & \frac{1}{2} & 0 \end{matrix})

(A143)

and its representation on the space of

K_{i}

generators and hence on the vielbein

V^{i}

is the following one:

J^{c} = (\begin{matrix} 0 & 0 & - \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 \\ \frac{1}{\sqrt{2}} & - \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 & 0 & 0 \\ - \frac{1}{\sqrt{2}} & - \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & - 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & - 1 \\ 0 & 0 & 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 \end{matrix})

(A144)

The matrix

J^{c}

is obtained from the adjoint action of

X^{c}

on the

K_{i}

coset generators, namely from:

{(J^{c})}_{i j} = \frac{1}{2} Tr ([X^{c}, K_{i}] \cdot K_{j})

(A145)

and it squares to minus the identity:

J^{c} \cdot J^{c} = - 1_{8 \times 8}

(A146)

hence it acts as a complex structure on the cotangent bundle (implying the same for the tangent bundle). Correspondingly, the Kähler 2-form can be written as:

K = \sum_{a = 1}^{8} \sum_{b = 1}^{8} J_{ab}^{c} V^{a} \land V^{b} = - \frac{e^{1} \land e^{3}}{\sqrt{2}} + \frac{e^{1} \land e^{4}}{\sqrt{2}} + \frac{e^{2} \land e^{3}}{\sqrt{2}} + \frac{e^{2} \land e^{4}}{\sqrt{2}} - \frac{1}{2} \sum_{i = 1}^{q} e^{5, i} \land e^{6, i}

(A147)

where, once again, we have utilized the example under investigation to put the Paint invariance structure into evidence. In the present case, the index

i

takes only two values

i = 1, 2

but the Formula (A147) applies to all values of

q

, namely to the entire Tits Satake universality class. In particular, for

q = 1

, Equation (A147) coincides, up to an overall factor, with Equation (303) corresponding to the Siegel plane. One easily verifies that

K

is closed and of maximal rank

d K = 0; K \land K \land K \land K = const e^{1} \land e^{2} \land \dots \land e^{6, 2}

(A148)

just as a consequence of the Maurer–Cartan Equation (A142).

The manifold (A131) equipped with the Kähler 2-form becomes a symplectic manifold

(M^{[2, 2]}, K)

and because of metric equivalence, we can say that the solvable Lie group manifold

S_{[2, 2]}

is a symplectic manifold

(S_{[2, 2]}, K)

.

Appendix D.2. The Hamiltonian Vector Fields and Their Moment-Maps

On the group manifold

S_{[2, 2]}

, we have both the left translations and the right ones, and, correspondingly, we have the left-invariant vector fields

t_{A}^{[L]}

that generate right translations and the right invariant ones

t_{A}^{[R]}

that generate left translations. Both sets of vector fields satisfy the solvable Lie algebra (157) with the structure constants defined by Equation (A142):

[t_{B}^{[L]}, t_{C}^{[L]}] = f_{BC}^{A} t_{A}^{[L]}; [t_{B}^{[R]}, t_{C}^{[R]}] = f_{BC}^{A} t_{A}^{[R]}

(A149)

The metric (A140) is invariant only with respect to the left-translations that are generated by the right-invariant vector fields, not with respect to both, since it is the metric of a coset manifold

U / H

. Correspondingly only

t_{A}^{[R]}

are Killing vectors for the Kähler metric and consequently the Kähler 2-form

K

admits as symplectic Killing vector fields only the set

t_{A}^{[R]}

:

\begin{matrix} 0 & = & ℓ_{t_{A}^{[R]}} K \equiv i_{t_{A}^{[R]}} \underset{= 0}{\underset{︸}{d K}} + d (i_{t_{A}^{[R]}} K) \\ ⇓ \\ i_{t_{A}^{[R]}} K & = & d \underset{moment map}{\underset{︸}{P_{A} (Υ)}} \end{matrix}

(A150)

The second of Equation (A150) corresponds to the definition of the moment map functions

P_{A} (Υ)

, such that

\begin{matrix} P : k & ⟶ & P_{k} (Υ) \in C^{\infty} (S_{[2, 2]}) \\ \forall f (Υ) \in C^{\infty} (S_{[2, 2]}) : k f & = & \{P_{k}, f\} \equiv K (k, X_{f}) \\ X_{f} & \equiv & π^{α β} (Υ) \frac{\partial f}{\partial Y^{α}} \frac{\partial}{\partial Y^{β}}; π^{α β} \equiv {(K^{- 1})}^{α β} \end{matrix}

(A151)

which are the exact analogues of Equations (125), (128), and (129). The difference, making explicit the clear-cut distinction advocated in Section 1.5 is that the symplectic manifold now is the very solvable Lie group manifold, rather than the total space of its tangent bundle, the moment maps are functions of the solvable coordinates

Y^{α}

rather than of the canonical momenta, and the symplectic form is the Kähler 2-form

K

.

The very important point is that the moment maps

μ_{A} (Υ)

that solve the differential equation in the second line of Equation (A150) have the closed form expression for all manifolds of the Tits Satake Universality class (36), discussed in Section 6.1.2 and presented in Equation (230).

We leave to a future publication the explicit calculation of the moment-maps and the study of the partition function that will closely follow the calculations already performed in the case of the Siegel plane. The main motivation for the present appendix was to show that all the structures analyzed in the main text for the Tits Satake projection of the entire universality class have obvious extensions to all members of the class by introducing Paint Group indices in the appropriate places.

References

Bruzzo, U.; Fré, P.G.; Trigiante, M. The Paint Group Tits Satake Theory of Hyperbolic Symmetric Spaces: The distance function, paint invariants and discrete subgroups. arXiv 2025, arXiv:2503.07626. [Google Scholar]
Fré, P.G.; Milanesio, F.; Santoro, M.; Sanguinetti, G. Navigation through Non Compact Symmetric Spaces: A mathematical perspective on Cartan Neural Networks. arXiv 2025, arXiv:2507.16871. [Google Scholar] [CrossRef]
Fré, P.G.; Milanesio, F.; Santoro, M.; Sanguinetti, G. Cartan Networks: Group theoretical Hyperbolic Deep Learning. arXiv 2025, arXiv:2505.24353v1. [Google Scholar] [CrossRef]
Fré, P.; Milanesio, M.; Oyarzo, F.; Santoro, M.; Trigiante, M. Tessellation Groups, Harmonic Analysis on Non Compact Symmetric Spaces and the Heat kernel in view of Cartan Convolutional Neural Networks. arXiv 2025, arXiv:2508.16015v1. [Google Scholar] [CrossRef]
Barbaresco, F. Innovative Tools for Radar Signal Processing based on Cartan’s Geometry of SPD Matrices & Information Geometry. In Proceedings of the 2008 IEEE Radar Conference, Rome, Italy, 26–30 May 2008. [Google Scholar]
Dongho, C.; Duboskiy, P. Travellling Wave-like solutions of the Navier Stokes and related equations. J. Math. Anal. Appl. 1996, 204, 930–939. [Google Scholar]
Cartan, E. Sur une classe remarquable d’espaces de Riemann. Bull. Soc. Math. Fr. 1926, 54, 214–264. [Google Scholar] [CrossRef]
Helgason, S. Differential Geometry and Symmetric Spaces; Academic Press: Cambridge, MA, USA, 1962. [Google Scholar]
Fré, P.G. Discrete, Finite and Lie Groups; De Gruyter: Berlin, Germany; Boston, MA, USA, 2023. [Google Scholar] [CrossRef]
Magnea, U. An introduction to symmetric spaces. arXiv 2002, arXiv:0205288. [Google Scholar] [CrossRef]
Kobayashi, S.; Nomizu, K. Foundations of Differential Geometry; Interscience Publishers: New York, NY, USA, 1963; Volume 1. [Google Scholar]
Alekseevsky, D. Classification of quaternionic spaces with a transitive solvable group of motions. Math. USSR Izv. 1975, 9, 297–339. [Google Scholar] [CrossRef]
Cortés, V. Alekseevskian spaces. Diff. Geom. Appl. 1996, 6, 129–168. [Google Scholar] [CrossRef]
Borel, A.; Tits, J. Groupes Réductifs. Publications Mathématiques de l’IHES 1965, 27, 55–151. [Google Scholar] [CrossRef]
de Wit, B.; Van Proeyen, A. Special geometry, cubic polynomials and homogeneous quaternionic spaces. Commun. Math. Phys. 1992, 149, 307–334. [Google Scholar] [CrossRef]
de Wit, B.; Van Proeyen, A. Broken sigma model isometries in very special geometry. Phys. Lett. 1992, B293, 94–99. [Google Scholar] [CrossRef]
de Wit, B.; Van Proeyen, A. Isometries of special manifolds. arXiv 1995, arXiv:9505097. [Google Scholar]
Fré, P.; Sorin, A.S. Integrability of supergravity billiards and the generalized Toda lattice equation. Nucl. Phys. 2006, B733, 334–355. [Google Scholar] [CrossRef][Green Version]
Fré, P.; Sorin, A.S. Supergravity black holes and billiards and the Liouville integrable structure associated with Borel algebras. J. High Energy Phys. 2010, 2010, 66. [Google Scholar] [CrossRef]
Fré, P.G.; Sorin, A.S. Classification of Arnold-Beltrami flows and their hidden symmetries. Phys. Part. Nucl. 2015, 46, 497–632. [Google Scholar] [CrossRef][Green Version]
Fré, P.; Trigiante, M. Chaos from symmetry, Navier Stokes equations, Beltrami fields and the universal classifying crystallographic group. J. Geom. Phys. 2022, 191, 104884. [Google Scholar] [CrossRef]
Ruppeiner, G.; Seftas, A. Thermodynamic Curvature of the Binary van der Waals Fluid. Entropy 2020, 22, 1208. [Google Scholar] [CrossRef]
Weinhold, F. Metric geometry of thermodynamics. Phys. Today 1976, 29, 23. [Google Scholar] [CrossRef]
Ruppeiner, G. Thermodynamic curvature measures interactions. Am. J. Phys. 2010, 78, 1170–1180. [Google Scholar] [CrossRef]
Ruppeiner, G. Thermodynamic curvature from the critical point to the triple point. Phys. Rev. E 2012, 86, 021130. [Google Scholar] [CrossRef] [PubMed]
Ruppeiner, G.; Sahay, A.; Sarkar, T.; Sengupta, G. Thermodynamic geometry, phase transitions, and the Widom line. Phys. Rev. E 2012, 86, 052103. [Google Scholar] [CrossRef]
Ruppeiner, G. Thermodynamic curvature: Pure fluids to black holes. J. Phys. Conf. Ser. 2013, 410, 012138. [Google Scholar] [CrossRef]
Ruppeiner, G.; Mausbach, P.; May, H.-O. Thermodynamic R-diagrams reveal solid-like fluid states. Phys. Lett. A 2015, 379, 646–649. [Google Scholar] [CrossRef]
Lychagin, V. Contact Geometry, Measurement and Thermodynamics. In Nonlinear PDEs, Their Geometry, and Applications, Proceedings of the Wisla 18 Summer School; Springer Nature: Berlin/Heidelberg, Germany, 2019; pp. 3–54. [Google Scholar] [CrossRef]
Marle, C.-M. From Tools in Symplectic and Poisson Geometry to Souriau’s Theories of Statistical Mechanics and Thermodynamics. Entropy 2016, 18, 370. [Google Scholar] [CrossRef]
Barbaresco, F. Geometric Theory of Heat from Souriau Lie Groups Thermodynamics and Koszul Hessian Geometry: Applications in Information Geometry for Exponential Families. Entropy 2016, 18, 386. [Google Scholar] [CrossRef]
Barbaresco, F. Lie Group Machine Learning and Gibbs Density on Poincaré Unit Disk from Souriau Lie Group Thermodynamics and SU(1,1) Coadjoint Orbits. In Geometric Science of Information; Nielsen, F., Barbaresco, F., Eds.; Springer International Publishing: Cham, Switzerland, 2019; pp. 157–170. [Google Scholar] [CrossRef]
Barbaresco, F. Gaussian Distributions on the Space of Symmetric Positive Definite Matrices from Souriau’s Gibbs State for Siegel Domains by Coadjoint Orbit and Moment Map. In Geometric Science of Information; Nielsen, F., Barbaresco, F., Eds.; Springer International Publishing: Berlin/Heidelberg, Germany, 2021; pp. 245–255. [Google Scholar] [CrossRef]
Marle, C.-M. On Gibbs states of mechanical systems with symmetries. arXiv 2021, arXiv:2012.00582. [Google Scholar] [CrossRef]
Trigiante, M. Gauged supergravities. Phys. Rep. 2017, 680, 1–175. [Google Scholar] [CrossRef]
Fré, P.G. Advances in Geometry and Lie Algebras from Supergravity; Theoretical and Mathematical Physics Book Series; Springer: Berlin/Heidelberg, Germany, 2018. [Google Scholar] [CrossRef]
Andrianopoli, L.; Bertolini, M.; Ceresole, A.; D’Auria, R.; Ferrara, S.; Fre, P.; Magri, T. N = 2 supergravity and N = 2 superYang-Mills theory on general scalar manifolds: Symplectic covariance, gaugings and the momentum map. J. Geom. Phys. 1997, 23, 111–189. [Google Scholar] [CrossRef]
Frè, P.G. Lectures on resolutions à la Kronheimer of orbifold singularities, McKay quivers for Gauge Theories on D3 branes, and the issue of Ricci flat metrics on the resolved three-folds. arXiv 2023, arXiv:2308.14022. [Google Scholar] [CrossRef]
Dwivedi, S.; Herman, J.; Jeffrey, L.; van den Hurk, T. Hamiltonian Group Actions and Equivariant Cohomology; SpringerBriefs in Mathematics; Springer: Berlin/Heidelberg, Germany, 2010. [Google Scholar]
Cabanes, Y. Multidimensional Complex Stationary Centered Gaussian Autoregressive Time Series Machine Learning in Poincaré and Siegel Disks: Application for Audio and Radar Clutter Classification. Ph.D. Thesis, Université de Bordeaux, Bordeaux, France, 2022. [Google Scholar]
Rao, C. Information and Accuracy attainable in the estimation of statistical parameters. Bull. Calcutta Math. Soc. 1945, 37, 81–91. [Google Scholar]
Chentsov, N. Statistical Decision Rules and Optimal Inferences. Trans. Math. Monog. Amer. Math. Soc. Provid. 1982, 53. [Google Scholar]
Nielsen, F. An elementary introduction to information geometry. Entropy 2020, 22, 1100. [Google Scholar] [CrossRef]
Amari, S. Information Geometry and Its Applications. In Applied Mathematical Sciences; Springer: Tokyo, Japan, 2016. [Google Scholar]
Shannon, C. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
Koralov, L.B.; Sinai, Y.G. Theory of Probability and Random Processes; Springer: Berlin/Heidelberg, Germany, 2007. [Google Scholar]
Jaynes, E. Information Theory and Statistical Mechanics. Phys. Rev. 1957, 106, 620–630. [Google Scholar] [CrossRef]
Jaynes, E. Information Theory and Statistical Mechanics ii. Phys. Rev. 1957, 108, 171–190. [Google Scholar] [CrossRef]
Lychagin, V.; Roop, M. Phase transitions in filtration of real gases. arXiv 2019, arXiv:1903.00276. [Google Scholar] [CrossRef]
Lychagin, V.; Roop, M. Steady filtration of Peng-Robinson gas in a porous medium. arXiv 2019, arXiv:1904.08387. [Google Scholar] [CrossRef]
Lychagin, V.; Roop, M. On higher order structures in thermodynamics. Entropy 2020, 22, 1147. [Google Scholar] [CrossRef]
Kushner, A.; Lychagin, V.; Roop, M. Optimal thermodynamic processes for gases. Entropy 2020, 22, 448. [Google Scholar] [CrossRef]
Bianchi, M.; Bruzzo, U.; Fré, P.; Martelli, D. Resolution à la Kronheimer of C³/Γ singularities and the Monge-Ampère equation for Ricci-flat Kähler metrics in view of d3-brane solutions of supergravity. Lett. Math. Phys. 2021, 111, 79. [Google Scholar] [CrossRef]
Bruzzo, U.; Fré, P.; Shahzad, U.; Trigiante, M. D3-brane supergravity solutions from Ricci-flat metrics on canonical bundles of Kähler-einstein surfaces. Lett. Math. Phys. 2023, 113, 64. [Google Scholar] [CrossRef]
Fré, P.G. Gravity, a Geometrical Course; Springer Science & Business Media: Dordrecht, The Netherlands, 2012; Volumes 1–2. [Google Scholar]
Alekseevsky, D.V.; Cortes, V.; Devchand, C.; Van Proeyen, A. Polyvector superPoincare algebras. Commun. Math. Phys. 2004, 253, 385–422. [Google Scholar] [CrossRef]
Arkhangel’skii, A. Completely Integrable Hamiltonian systems on a Group of Triangular Matrices. Math. USSR Sb 1980, 36, 127. [Google Scholar] [CrossRef]
Fre, P. Lectures on special Kahler geometry and electric–magnetic duality rotations. Nucl. Phys. Proc. Suppl. 1996, 45BC, 59–114. [Google Scholar] [CrossRef]
Nickel, M.; Kiela, D. Poincaré Embeddings for Learning Hierarchical Representations. In Advances in Neural Information Processing Systems; Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2017; Volume 30. [Google Scholar]
Nickel, M.; Kiela, D. Learning continuous hierarchies in the Lorentz model of hyperbolic geometry. In Proceedings of the 35th International Conference on Machine Learning; Dy, J., Krause, A., Eds.; PMLR: New York, NY, USA, 2018; Volume 80, pp. 3779–3788. [Google Scholar]
Klimovskaia, A.; Lopez-Paz, D.; Bottou, L.; Nickel, M. Poincaré maps for analyzing complex hierarchies in single-cell data. Nat. Commun. 2020, 11, 2966. [Google Scholar] [CrossRef]
Bonnasse-Gahot, L.; Nadal, J.P. Category learning in deep neural networks: Information content and geometry of internal representations. Phys. Rev. E 2025, 112, 055315. [Google Scholar] [CrossRef] [PubMed]
Jones, S.E.; Gilbert, A.D. Dynamo action in the abc flows using symmetries. Geophys. Astrophys. Fluid Dyn. 2014, 108, 83–116. [Google Scholar] [CrossRef][Green Version]
Etnyre, J.; Ghrist, R. Contact topology and hydrodynamics: Beltrami fields and the seifert conjecture. Nonlinearity 2000, 13, 441–458. [Google Scholar] [CrossRef]
Ghrist, R. On the contact topology and geometry of ideal fluids. In Handbook of Mathematical Fluid Dynamics; North Holland: Amsterdam, The Netherlands, 2007; Volume 4, pp. 1–37. [Google Scholar]
Geiges, H. Contact geometry. Handb. Differ. Geom. 2006, 2, 315–382. [Google Scholar]
Cardona, R.; Miranda, E.; Peralta-Salas, D. Euler flows and singular geometric structures. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2019, 377, 20190034. [Google Scholar] [CrossRef]
Eva, M.; Oms, C. The geometry and topology of contact structures with singularities. arXiv 2021, arXiv:1806.05638. [Google Scholar]
Cardona, R.; Miranda, E. On the volume elements of a manifold with transverse zeroes. Regul. Chaotic Dyn. 2019, 24, 187–197. [Google Scholar] [CrossRef]
Cardona, R.; Miranda, E. Integrable systems and closed one forms. J. Geom. Phys. 2018, 131, 204–209. [Google Scholar] [CrossRef]
Guillemin, V.; Miranda, E.; Pires, A.R. Symplectic and poisson geometry on b-manifolds. Adv. Math. 2014, 264, 864–896. [Google Scholar] [CrossRef]
Pollard, J.; Alexander, G.P. Singular contact geometry and beltrami fields in cholesteric liquid crystals. arXiv 2019, arXiv:1911.10159. [Google Scholar] [CrossRef]
Arnold, V. Metodi Matematici della Meccanica Classica; Editori Riuniti; Edizioni Mir: Roma, Italy, 1979. [Google Scholar]
Huang, K. Statistical Mechanics; John Wiley & Sons: New York, NY, USA; London, UK; Sydney, Australia, 1963. [Google Scholar]
Uhlenbeck, G.; Ford George, G. Lectures in Statistical Mechanics; Lectures in Applied Mathematics; American Mathematical Society: Providence, RI, USA, 1963. [Google Scholar]

Figure 1. The cone

Ω

of Souriau allowed temperature vectors in the

sl (2, R)

Lie algebra space as defined in Equation (261).

Figure 1. The cone

Ω

of Souriau allowed temperature vectors in the

sl (2, R)

Lie algebra space as defined in Equation (261).

Figure 2. Examples of plots of the Gibbs probability distributions (265) over the Poincaré disk, labeled by different set of temperatures. The exponential Gaussian decay toward infinity is visually evident, as much as the deformed bell shape. In the first image, we compare two distributions with the same values of

δ μ

but with a different angle

θ

. In the second image, we compare two distributions that differ in all parameters.

Figure 2. Examples of plots of the Gibbs probability distributions (265) over the Poincaré disk, labeled by different set of temperatures. The exponential Gaussian decay toward infinity is visually evident, as much as the deformed bell shape. In the first image, we compare two distributions with the same values of

δ μ

but with a different angle

θ

. In the second image, we compare two distributions that differ in all parameters.

Figure 3. In this figure, we compare two Gibbs distributions on the Poincaré plane corresponding to a lower and higher value of the norm (268). As one sees for a high value of the norm, the distribution is very sharply shaped around its maximal value, while for a lower norm, it is much broader. For high norm, we know with much more precision the actual location of the stochastic variable in the plane.

Figure 4. In the first and second pictures in this figure, we display the behavior of the four intrinsic curvature components

F, G, Q, P

as respectively seen from the two sides of the vertical plane that has the line

δ = μ

as a base. Such a line corresponds to the boundary of the cone in Figure 1. All the curvature components become singular on such a line.

Figure 4. In the first and second pictures in this figure, we display the behavior of the four intrinsic curvature components

F, G, Q, P

as respectively seen from the two sides of the vertical plane that has the line

δ = μ

as a base. Such a line corresponds to the boundary of the cone in Figure 1. All the curvature components become singular on such a line.

Figure 5. In this figure, we show for a few pairs of values of

λ

and

μ

the integrand

F

in the integration variables

w_{1, 2} = log ρ_{1, 2}

. The bell shape and the uniform exponential decay to zero at infinity in all directions guarantee the convergence of the two remaining integrals on

w_{1, 2}

.

Figure 5. In this figure, we show for a few pairs of values of

λ

and

μ

the integrand

F

in the integration variables

w_{1, 2} = log ρ_{1, 2}

. The bell shape and the uniform exponential decay to zero at infinity in all directions guarantee the convergence of the two remaining integrals on

w_{1, 2}

.

Figure 6. Plots of the numerical evaluations of the Geothermodynamical partition function and stochastic Hamiltonian à la Souriau for the Siegel plane.

Table 1. In this table, we display the complete list of the 10 generators of the

sp (4, R)

Lie algebra.

Table 1. In this table, we display the complete list of the 10 generators of the

sp (4, R)

Lie algebra.

\begin{matrix} T_{1}^{s} & = & (\begin{matrix} - \frac{1}{2} & 0 & 0 & 0 \\ 0 & - \frac{1}{2} & 0 & 0 \\ 0 & 0 & \frac{1}{2} & 0 \\ 0 & 0 & 0 & \frac{1}{2} \end{matrix}) & T_{2}^{s} & = & (\begin{matrix} - \frac{1}{2} & 0 & 0 & 0 \\ 0 & \frac{1}{2} & 0 & 0 \\ 0 & 0 & \frac{1}{2} & 0 \\ 0 & 0 & 0 & - \frac{1}{2} \end{matrix}) \end{matrix}

\begin{matrix} T_{3}^{s} & = & (\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & 0 \\ 0 & \frac{1}{\sqrt{2}} & 0 & 0 \end{matrix}) & T_{4}^{s} & = & (\begin{matrix} 0 & 0 & - \frac{1}{\sqrt{2}} & 0 \\ 0 & 0 & 0 & 0 \\ - \frac{1}{\sqrt{2}} & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}) \end{matrix}

\begin{matrix} T_{5}^{s} & = & (\begin{matrix} 0 & 0 & 0 & \frac{1}{2} \\ 0 & 0 & \frac{1}{2} & 0 \\ 0 & \frac{1}{2} & 0 & 0 \\ \frac{1}{2} & 0 & 0 & 0 \end{matrix}) & T_{6}^{s} & = & (\begin{matrix} 0 & \frac{1}{2} & 0 & 0 \\ \frac{1}{2} & 0 & 0 & 0 \\ 0 & 0 & 0 & - \frac{1}{2} \\ 0 & 0 & - \frac{1}{2} & 0 \end{matrix}) \end{matrix}

\begin{matrix} T_{7}^{s} & = & (\begin{matrix} 0 & \frac{1}{2} & 0 & 0 \\ - \frac{1}{2} & 0 & 0 & 0 \\ 0 & 0 & 0 & \frac{1}{2} \\ 0 & 0 & - \frac{1}{2} & 0 \end{matrix}) & T_{8}^{s} & = & (\begin{matrix} 0 & 0 & 0 & \frac{1}{2} \\ 0 & 0 & \frac{1}{2} & 0 \\ 0 & - \frac{1}{2} & 0 & 0 \\ - \frac{1}{2} & 0 & 0 & 0 \end{matrix}) \end{matrix}

\begin{matrix} T_{9}^{s} & = & (\begin{matrix} 0 & 0 & - \frac{1}{2} & 0 \\ 0 & 0 & 0 & \frac{1}{2} \\ \frac{1}{2} & 0 & 0 & 0 \\ 0 & - \frac{1}{2} & 0 & 0 \end{matrix}) & T_{10}^{s} & = & (\begin{matrix} 0 & 0 & \frac{1}{2} & 0 \\ 0 & 0 & 0 & \frac{1}{2} \\ - \frac{1}{2} & 0 & 0 & 0 \\ 0 & - \frac{1}{2} & 0 & 0 \end{matrix}) \end{matrix}

Table 2. In this table, we display a complete list of the 10 generators of the

so (2, 3)

Lie algebra, which are in one-to-one correspondence with and in the same order as the generators of the

sp (4, R)

Lie algebra listed in Table 1.

Table 2. In this table, we display a complete list of the 10 generators of the

so (2, 3)

Lie algebra, which are in one-to-one correspondence with and in the same order as the generators of the

sp (4, R)

Lie algebra listed in Table 1.

\begin{matrix} T_{1}^{v} & = & \sqrt{2} K_{1} & = & (\begin{matrix} 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & - 1 \end{matrix}) & T_{2}^{v} & = & \sqrt{2} K_{2} & = & (\begin{matrix} 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & - 1 & 0 \\ 0 & 0 & 0 & 0 & 0 \end{matrix}) \end{matrix}

\begin{matrix} T_{3}^{v} & = & \sqrt{2} K_{3} & = & (\begin{matrix} 0 & \frac{1}{\sqrt{2}} & 0 & 0 & 0 \\ \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & - \frac{1}{\sqrt{2}} & 0 \end{matrix}) & T_{4}^{v} & = & \sqrt{2} K_{4} & = & (\begin{matrix} 0 & 0 & 0 & \frac{1}{\sqrt{2}} & 0 \\ 0 & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & 0 \\ \frac{1}{\sqrt{2}} & 0 & 0 & 0 & 0 \\ 0 & - \frac{1}{\sqrt{2}} & 0 & 0 & 0 \end{matrix}) \end{matrix}

\begin{matrix} T_{5}^{v} & = & \sqrt{2} K_{5} & = & (\begin{matrix} 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ \frac{1}{\sqrt{2}} & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & - \frac{1}{\sqrt{2}} & 0 & 0 \end{matrix}) & T_{6}^{v} & = & \sqrt{2} K_{6} & = & (\begin{matrix} 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & \frac{1}{\sqrt{2}} & 0 & - \frac{1}{\sqrt{2}} & 0 \\ 0 & 0 & - \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \end{matrix}) \end{matrix}

\begin{matrix} T_{7}^{v} & = & H_{1} & = & (\begin{matrix} 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & - \frac{1}{\sqrt{2}} & 0 & - \frac{1}{\sqrt{2}} & 0 \\ 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \end{matrix}) & T_{8}^{v} & = & H_{2} & = & (\begin{matrix} 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ - \frac{1}{\sqrt{2}} & 0 & 0 & 0 & - \frac{1}{\sqrt{2}} \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{\sqrt{2}} & 0 & 0 \end{matrix}) \end{matrix}

\begin{matrix} T_{9}^{v} & = & H_{3} & = & (\begin{matrix} 0 & \frac{1}{2} & 0 & \frac{1}{2} & 0 \\ - \frac{1}{2} & 0 & 0 & 0 & - \frac{1}{2} \\ 0 & 0 & 0 & 0 & 0 \\ - \frac{1}{2} & 0 & 0 & 0 & - \frac{1}{2} \\ 0 & \frac{1}{2} & 0 & \frac{1}{2} & 0 \end{matrix}) & T_{10}^{v} & = & H_{0} & = & (\begin{matrix} 0 & \frac{1}{2} & 0 & - \frac{1}{2} & 0 \\ - \frac{1}{2} & 0 & 0 & 0 & \frac{1}{2} \\ 0 & 0 & 0 & 0 & 0 \\ \frac{1}{2} & 0 & 0 & 0 & - \frac{1}{2} \\ 0 & - \frac{1}{2} & 0 & \frac{1}{2} & 0 \end{matrix}) \end{matrix}

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fré, P.G.; Sorin, A.S.; Trigiante, M. Thermodynamics à la Souriau on Kähler Non-Compact Symmetric Spaces for Cartan Neural Networks. Entropy 2026, 28, 365. https://doi.org/10.3390/e28040365

AMA Style

Fré PG, Sorin AS, Trigiante M. Thermodynamics à la Souriau on Kähler Non-Compact Symmetric Spaces for Cartan Neural Networks. Entropy. 2026; 28(4):365. https://doi.org/10.3390/e28040365

Chicago/Turabian Style

Fré, Pietro G., Alexander S. Sorin, and Mario Trigiante. 2026. "Thermodynamics à la Souriau on Kähler Non-Compact Symmetric Spaces for Cartan Neural Networks" Entropy 28, no. 4: 365. https://doi.org/10.3390/e28040365

APA Style

Fré, P. G., Sorin, A. S., & Trigiante, M. (2026). Thermodynamics à la Souriau on Kähler Non-Compact Symmetric Spaces for Cartan Neural Networks. Entropy, 28(4), 365. https://doi.org/10.3390/e28040365

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Thermodynamics à la Souriau on Kähler Non-Compact Symmetric Spaces for Cartan Neural Networks

Abstract

1. Introduction

1.1. Cartan Neural Networks: A New Paradigm

1.2. The Mathematical Basis of CaNN

1.3. The Link with Symplectic Geometry and Generalized Thermodynamics

1.4. Gibbs States and Lie Group Generalized Thermodynamics

1.4.1. Symplectic Moment Map

1.4.2. Coadjoint Orbits

1.5. Clearcut Distinctions

1.5.1. Kähler Non-Compact Symmetric Spaces

1.5.2. Hence Two Cases

1.6. Relevance for Cartan Neural Networks

1.7. Outline of This Paper

2. Shannon Information Entropy and the Partition Function

Conditional Minimalization of Information and the Partition Function

3. Geometrical Structure of Thermodynamics

3.1. The Geometric Reformulation

3.1.1. Legendrian Submanifolds

3.1.2. The Lagrangian Submanifold and Its Metric

3.1.3. The Canonical Riemannian Metric on the Lagrangian Submanifold

3.1.4. The Lagrangian Submanifold in the Two-Dimensional Case and Its Riemannian Structure

3.2. General Conclusion of This Section

4. The Geodesic Dynamical System

4.1. The Geodesic Dynamical System in General

4.2. The Geodesic Dynamical System for Non-Compact Symmetric Spaces

4.2.1. The Symplectic 2-Form

4.2.2. The Poissonian Bi-Vector

4.2.3. Hamiltonian Vector Fields and the Poisson Bracket

4.2.4. Symplectic Moment Map

4.2.5. Relation with the Nomizu Operator

5. A Master Example for the Geodesic Dynamical System: SL ( 3 , R ) / SO ( 3 )

5.1. Hamiltonians in Involution and Generalized Thermodynamics

5.2. Generalized Thermodynamics for a Geodesic Dynamical System on U / H

5.2.1. Generalized Thermodynamics for the Chosen Master Example

5.2.2. Final Remarks on the GDS Generalized Thermodynamics of the Master Model SL ( 3 , R ) / SO ( 3 )

6. Generalized Thermodynamics à la Souriau on Kähler Non-Compact U / H .s

6.1. The General Setup

6.1.1. General Construction Method of the Killing Vector Fields

6.1.2. The General Form of the Moment-Maps

6.1.3. The Partition Function and the Gibbs Probability Distribution

6.2. Generalized Thermodynamics à la Souriau of the Poincaré–Lobachevsky Hyperbolic Plane H 2

6.2.1. Calculation of the Partition Function

6.2.2. Visualization of the Gibbs Probability Distributions

6.2.3. The Kähler Geothermodynamic Metric and Curvature

6.3. Generalized Thermodynamics à la Souriau of the Siegel Half Plane SH 2

6.3.1. The Siegel Upper Plane

6.3.2. The Kähler 2-Form, the Killing Vector Fields, and the Moment Maps

7. On the Partition Function and Gibbs Distributions in General and for SH 2 in Particular

7.1. Canonical Form of the Partition Functions and of the Gibbs Probability Distributions, in General

7.2. Calculation of the Partition Function for the Siegel Plane in Canonical Form

8. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Basic Structures of Contact and Symplectic Geometry

Appendix A.1. Contact Geometry

Appendix A.2. Contact Structures

Appendix A.3. Integrability and Frobenius Theorem

Appendix A.4. Isotropic Submanifolds of a Contact Manifold and Non Integrability

Appendix A.5. The Reeb Vector

Appendix A.6. Darboux Theorem and the Case of Thermodynamics

Appendix A.7. Symplectic and Poisson Manifolds

Appendix A.8. The Relation Between Contact Manifolds and Symplectic Manifolds

Appendix B. Fundaments of Probability Theory

Appendix B.1. σ-Algebras and Probability Measures

Appendix B.2. Stochastic Functions, Stochastic Vectors and Distributions

Appendix C. A Summary of Classical Thermodynamics and Statistical Mechanics

Appendix C.1. Thermodynamical Potentials and State Functions

Appendix C.2. Thermodynamical Constants

The First and Second Principles of Thermodynamics

Appendix C.3. The Three Ensembles of Statistical Mechanics

Appendix C.3.1. The Microcanonical Ensemble

Appendix C.3.2. The Canonical Ensemble

Appendix C.3.3. The Grand Canonical Ensemble and the Gibbs Potential

Appendix C.4. Statistical Mechanics of Ideal Gases

The Equation of State of Ideal Gases

5. A Master Example for the Geodesic Dynamical System: $SL (3, R) / SO (3)$

5.2. Generalized Thermodynamics for a Geodesic Dynamical System on $U / H$

5.2.2. Final Remarks on the GDS Generalized Thermodynamics of the Master Model $SL (3, R) / SO (3)$

6. Generalized Thermodynamics à la Souriau on Kähler Non-Compact $U / H$ .s

6.2. Generalized Thermodynamics à la Souriau of the Poincaré–Lobachevsky Hyperbolic Plane $H_{2}$

6.3. Generalized Thermodynamics à la Souriau of the Siegel Half Plane ${SH}_{2}$

7. On the Partition Function and Gibbs Distributions in General and for ${SH}_{2}$ in Particular

Appendix D. The Example of the Kählerian Manifold $M^{[2, 2]}$ with Non Trivial Paint Group