Next Article in Journal
A New Information-Theoretic Method for Advertisement Conversion Rate Prediction for Large-Scale Sparse Data Based on Deep Learning
Previous Article in Journal
Performance Optimization of a Condenser in Ocean Thermal Energy Conversion (OTEC) System Based on Constructal Theory and a Multi-Objective Genetic Algorithm
Previous Article in Special Issue
Multi-Stage Meta-Learning for Few-Shot with Lie Group Network Constraint
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Lie Group Statistics and Lie Group Machine Learning Based on Souriau Lie Groups Thermodynamics & Koszul-Souriau-Fisher Metric: New Entropy Definition as Generalized Casimir Invariant Function in Coadjoint Representation

by
Frédéric Barbaresco
Key Technology Domain PCC (Processing, Control & Cognition) Representative, Thales Land & Air Systems, Voie Pierre-Gilles de Gennes, F91470 Limours, France
Entropy 2020, 22(6), 642; https://doi.org/10.3390/e22060642
Submission received: 4 March 2020 / Revised: 31 May 2020 / Accepted: 2 June 2020 / Published: 9 June 2020

Abstract

:
In 1969, Jean-Marie Souriau introduced a “Lie Groups Thermodynamics” in Statistical Mechanics in the framework of Geometric Mechanics. This Souriau’s model considers the statistical mechanics of dynamic systems in their “space of evolution” associated to a homogeneous symplectic manifold by a Lagrange 2-form, and defines in case of non null cohomology (non equivariance of the coadjoint action on the moment map with appearance of an additional cocyle) a Gibbs density (of maximum entropy) that is covariant under the action of dynamic groups of physics (e.g., Galileo’s group in classical physics). Souriau Lie Group Thermodynamics was also addressed 30 years after Souriau by R.F. Streater in the framework of Quantum Physics by Information Geometry for some Lie algebras, but only in the case of null cohomology. Souriau method could then be applied on Lie groups to define a covariant maximum entropy density by Kirillov representation theory. We will illustrate this method for homogeneous Siegel domains and more especially for Poincaré unit disk by considering SU(1,1) group coadjoint orbit and by using its Souriau’s moment map. For this case, the coadjoint action on moment map is equivariant. For non-null cohomology, we give the case of Lie group SE(2). Finally, we will propose a new geometric definition of Entropy that could be built as a generalized Casimir invariant function in coadjoint representation, and Massieu characteristic function, dual of Entropy by Legendre transform, as a generalized Casimir invariant function in adjoint representation, where Souriau cocycle is a measure of the lack of equivariance of the moment mapping.

  La thèse de Kirillov, parue en 1962, a suscité immédiatement beaucoup d’intérêt…En outre, quantité de notions naturelles concernant les représentations s’interprètent géométriquement en terme d’orbites coadjointes: restriction à un sous-groupe, induction unitaire, produit tensoriel, mesure de Plancherel, la topologie de l’ensemble représentations unitaires irréductibles… Kirillov s’est vite convaincu, et il a convaincu la communauté mathématique que cette « méthode des orbites » devait être applicable à des groupes bien plus généraux que les groupes nilpotents. Il n’a pas hésité à aborder le cas des groupes de Lie connexes quelconques. Evidemment, des difficultés considérables ont surgi immédiatement. Néanmoins, Kirillov a indiqué une voie d’accès, qui ensuite a été largement utilisée.
- Jacques Dixmier, Brèves remarques sur l’œuvre de A.A. Kirillov
  On comprend ainsi comment Lagrange a pu développer les lois de la Mécanique des systèmes formés de solides sans s’occuper des variations de la température de ces corps et Fourier traiter des variations de la température de ces mêmes corps solides sans s’occuper de leur mouvement; comment on peut étudier le mouvement de la Terre, assimilée à un solide rigide, sans se préoccuper de la température de cet astre et étudier le refroidissement du globe terrestre sans se préoccuper de son mouvement. Une telle indépendance entre les problèmes qui ressortissent à la Mécanique et les problèmes qui ressortissent à la Théorie de la chaleur n’existe plus lorsque les systèmes auxquels on a affaire ne sont plus des systèmes classiques; si, par exemple, au lieu de regarder la Terre comme un solide rigide, d’état invariable, on tient compte des changements de volume, de forme, d’état physique et chimique qui accompagnent son refroidissement, on ne peut plus séparer le problème du mouvement de la Terre et le problème du refroidissement terrestre. … On sait que cette forme de relations supplémentaires avait été introduite par Newton et les géomètres du XVIIIème siècle dans la théorie du son. Ces considérations montrent que les questions qui ressortissent à la Thermodynamique ont dû solliciter l’attention des physiciens dès qu’on a voulu aborder l’étude des systèmes autres que des systèmes classiques; et, en fait, c’est la théorie de la propagation du son dans l’air qui a provoqué Laplace à créer la Thermodynamique.
- P. Duhem, L’intégrale des forces vives en thermodynamique, JMPA 4:5-19, 1898 [1,2,3,4]
  Sous cette aspiration, la physique qui était d’abord une science des “agents” doit devenir une science des “milieux”. C’est en s’adressant à des milieux nouveaux que l’on peut espérer pousser la diversification et l’analyse des phénomènes jusqu’à en provoquer la géométrisation fine et complexe, vraiment intrinsèque…Sans doute, la réalité ne nous a pas encore livré tous ses modèles, mais nous savons déjà qu’elle ne peut en posséder un plus grand nombre que celui qui lui est assigné par la théorie mathématique des groupes
- Gaston Bachelard, Etude sur l’Evolution d’un problème de Physique –La propagation thermique dans les solides, 1928

1. Introduction

The previous French quotes by the Mathematician Jacques Dixmier, the Physicist Pierre Duhem, and the Philosopher Gaston Bachelard are important to introduce the epistemological context of models that will be developed in the paper. Jacques Diximer refers to Alexander Kirillov seminal idea of coadjoint orbits method to consider Lie group representation model. Pierre Duhem makes comments to the origin of the gap between the theory of heat and the theory of Mechanics. Finally, Gaston Bachelard make prediction that new Thermodynamics foundations will be given by groups. We will try in this paper, to prove that these ideas could be reconciled by the Souriau model of Lie groups Thermodynamics through the mathematical structure of Lie algebra cohomology.
After a the state of the art and trends in Machine Learning based on Information Geometry, we will present, in this introduction, the main objective of this paper to jointly apply models from geometric statistical mechanics and tools from Information geometry to solve “Gauss density” definition problem for statistics on Lie groups and homogeneous manifolds. We will also present use-cases motivation for Lie group machine learning illustrating for Doppler statistics analysis with SU(1,1) statistics, and for kinematics data analysis with SE(2) statistics.

1.1. State of the Art and Trends in Machine Learning Based on Information Geometry

The classical simple gradient descent used in Deep Learning has two drawbacks: the use of the same non-adaptive learning rate for all parameter components, and a non-invariance with respect to parameter re-encoding inducing different learning rates. As the parameter space of multilayer networks forms a Riemannian space equipped with Fisher information metric, instead of the usual gradient descent method, the natural gradient or Riemannian gradient method, which takes account of the geometric structure of the Riemannian space, is more effective for learning. The natural gradient preserves this invariance to be insensitive to the characteristic scale of each parameter direction. The Fisher metric defines a Riemannian metric as the Hessian of two dual potential functions (the Entropy and the log-partition function). Yann Ollivier and Gaétan Marceau-Caron provided in 2016 [5] the first experimental results on non-synthetic data sets for the quasi-diagonal Riemannian Natural gradient descents for neural networks introduced previously by Yann Ollivier in [6] (MNIST, SVHN, and FACE data sets). The quasi-diagonal Riemannian algorithms consistently beat simple stochastic gradient gradient descents by a varying margin. The computational overhead with respect to simple backpropagation is around a factor 2, and reach their final performance quickly, thus requiring fewer training epochs and a smaller total computation time. The main goal of natural gradient is to obtain invariance properties, such as, for a neural network, insensitivity of the training algorithm to whether a logistic or tanh activation function is used, or insensitivity to simple changes of variables in the parameters, such as scaling some parameters. In 2017, same authors have introduced the resulting natural Langevin dynamics [7] combining the advantages of natural gradient descent and Fisher-preconditioned Langevin dynamics for large neural networks, validated on MNIST with Fisher matrix preconditioning. With all invariance properties of natural gradient, this Langevin Dynamics avoids overfitting as a regularization method, and replaces classical methods based on a controlled amount of noise to stochastic gradient descents, that ensures convergence to the Bayesian posterior on model parameters. The theoretically optimal covariance of the noise is the inverse Fisher metric, and Y. Ollivier and G. Marceau-Caron have shown how to implement this in practice with neural networks using efficient Fisher metric approximations. In 2017, Yann Ollivier has also introduced TANGO algorithm (True Asymptotic Natural Gradient Optimization) [8], which converges to a true natural gradient descent in the limit of small learning rates, without explicit Fisher matrix estimation, and where in large dimension, small learning rates will be required to approximate the natural gradient well. Y. Ollivier has also shown that it is possible to get arbitrarily close to exact natural gradient descent with a lightweight algorithm. About natural gradient for Deep Learning, we can refer to [9,10]. This year, Shun-ichi Amari [11] has given an elementary geometrical proof that any target function is realized in a sufficiently small neighborhood of any randomly connected deep network, provided the width (the number of neurons in a layer) is sufficiently large.
In this paper, we will introduce how to extend these approaches for data as elements of Lie groups or data lying on a homogeneous manifold where a Lie group acts transitively. This extension is considered in the framework and interconnexion of Souriau “Lie groups Thermodynamics”, Information Geometry and Kirillov representation theory [12] to define probability densities for Lie groups, as Souriau covariant Gibbs densities (density of Maximum of Entropy). We will develop this case for the matrix Lie group SU(1,1) (case with null cohomology) through the computation of Souriau’s moment map, and Kirillov’s orbit method. We will also develop the method for SE(2) Lie group (case with non-null cohomology) where a Souriau cocycle should be taken into account due to the defect of equivariance of the coadjoint action on the moment map.
Supervised learning approaches are based on neural networks whose parameters are estimated by natural gradient algorithms. Non-supervised algorithm are based on clustering by using technics called “k-means” or “Mean-shift” using distance between elements of the dataset. In both cases, if we want to extend these approaches for Lie groups dataset, we have to extend the notion of Gaussian densities and distance between elements. We propose to use Geometric Statistical Model coming from Geometric Statistical Mechanics to introduce “Gauss density” of Lie group elements. Jointly, we can associate a natural distance between these Lie group elements on the Symplectic manifold by means of KKS 2-form, introducing a natural Riemannian metric associated to Fisher Metric from Information Geometry. The objective of this paper is to explain how to use Geometric Statistical Mechanics tools in this context.

1.2. Objectives of this Paper

The purpose of this article is multiple. The work of Professor Jean-Marie Souriau is well known in the field of “Geometric Mechanics” of which he is one of the founders with his book “structure of dynamic systems” published in 1969, and in which he introduced the foundations of Symplectic Geometry. Inside this book, chapter IV dealing with the extension of Geometric Mechanics to Statistical Mechanics, has been little read or misunderstood by this community. We have discovered that this model was part of and generalized another discipline, which is called Information Geometry. We have demonstrated in other previous articles that one could generalize Fisher metric (invariant metric used in Information Geometry) for Lie groups, with this model. It is therefore a question of rehabilitating the work of Jean-Marie Souriau in a broader framework, which concerns statistics and machine learning extended to objects considered as elements of a Lie group or a homogeneous manifold.
The second goal is to solve with these new tools problems that were still unsolved in statistics and machine learning. These unresolved problems concern the definition and calculation of the expression of probability densities, playing the role of Gaussian density, for elements of a Lie group or elements of homogeneous manifolds. In this article, we completely solve the problem for 2 Lie groups very useful in machine learning but also in physics, the Lie groups SU(1,1) and SE(2). The calculation is not a simple application of the Souriau model, because it is necessary to establish the “moment map” associated with these groups and define a Laplace transform on their coadjoint orbits of these groups (action of the group on the dual space of Lie algebra). In a second step, we must use Information Geometry to write these covariant Gibbs densities in the correct parametrization which parametrizes the generalized Gaussian law from statistical moments on the homogeneous symplectic manifold associated with coadjoint orbits. In the case SU(1,1), which corresponds to a case of null cohomology (equivariance of the coadjoint operator on the moment application), as the homogeneous symplectic manifold to the coadjoint orbit is the Poincaré unit disk, we solve jointly, an open problem to define mathematically the notion of Gaussian density in this disk in hyperbolic geometry. With the property that this density is by construction invariant under the action of the group SU(1,1), which is the condition sine qua none to preserve the symmetries and the invariance of the associated Fisher metric. We show that this model achieves a breakthrough in machine learning, because we have a Gibbs density and a Fisher metric invariant by change of parametrization and invariant under the action of symmetries. Gibbs density allows us to extend the classical supervised statistical machine learning algorithms, and Fisher metric allows us to adress unsupervised learning problem as k-means problems in metric space. The model opens the way to machine learning for Lie groups with multiple applications in robotics, sensor signal processing, image processing.
In the last part of the article, based on this model, we give a new “geometric” definition of Entropy by showing that Entropy is an invariant Casimir function in coadjoint representation. The Casimir functions have been widely studied within the framework of Poisson structures and manifolds [13,14,15,16]. This characterization of Entropy is new, because previously Entropy was defined axiomatically. Using this Casimir function property, we show that it is possible to use full geometric approaches to construct the Entropy function only from the structure coefficients of the Lie group associated with the symmetries involved. We show that we can also introduce an Euler-Poincaré equation and its stochastic variant to study other open problems in statistics and thermodynamics. The application of this Casimir characterization, which is demonstrated in this article, are developed in another twin article published in the same special issue with François Gay-Balmaz [17].

1.3. Motivation of Lie Group Machine Learning with Use-Cases

Machine learning is a field of study of artificial intelligence, which is based on statistical approaches to give computers the ability to “learn” from data, that is, to classify data from observations in a supervised or non-supervised way. Machine learning generally has two phases. The first consists in estimating a model from data, called observations. This so-called “training” phase is generally carried out before the practical use of the model. The second phase corresponds to the start of production: the model being determined, new data can then be classified. According to the information available during the learning phase, learning is qualified in different ways. If the data is labeled (that is, the task response is known for that data), it is supervised learning. We speak of classification if the labels are discrete, or of regression if they are continuous. In the most general case, without a label, we seek to determine the underlying structure of the data (which can be a probability density) and it is then unsupervised learning. Machine learning can be applied to different types of data, such as graphs, trees, curves, or more simply feature vectors, which can be continuous or discrete. We propose to extend the approach, when datasets are element of matrix lie groups.
Learning algorithms can be categorized according to their learning mode. For supervised learning, the classes are predetermined and the examples known, and then the system learns to classify according to a classification model. An expert must label examples beforehand. The process takes place in two phases. For unsupervised learning, when the system or operator has only examples, but no label, and the number of classes and their nature have not been predetermined, we speak of unsupervised learning or clustering. No expert is required. The algorithm must discover for itself the more or less hidden structure of the data, by data partitioning and data clustering. The system must cluster the data according to their available attributes, to classify them into homogeneous groups of examples. Similarity is generally calculated according to a distance function between pairs of examples.
We will illustrate two problems of Machine Learning on Lie groups coming from Radar Industry. Target recognition on Radar micro-Doppler data could be modeled by a problem of classification of dataset considered as elements of SU(1,1) Lie group (see Figure 1). Radar complex time series of micro-Doppler observation of data are classically processed on sliding time window to estimate their associated covariance matrices that are characterized by a Toeplitz Hermitian Positive-definitiveness structure. Using a well-known Verblunsky/Trench Theorem, we can parametrize all Toeplitz Hermitian Positive Definite Covariance matrices of stationary Radar Time series in a product space with a real positive axis (for signal power) and a Poincaré polydisk (for Doppler Spectrum shape). If we consider the Poincaré Unit Disk as an homogeneous space where SU(1,1) Lie group acts transitively. Each data in Poincaré unit disk of this polydisk could be then coded by SU(1,1) matrix Lie group element. We have transformed the problem into a statistical learning challenge processing data of SU(1,1) matrix Lie group. Another exemple considers flying object recognition on their kinematics coded in SE(2) or SE(3) Lie Groups. 3D (or 2D) trajectories could be coded by SE(3) (or SE(2)) Lie group time series provided through Invariant Extended Kalman Filter (IEKF) Radar Tracker, that locally estimates displacement of Frenet-Seret frame. Object kinematics will be then coded by time series of SE(3) (or SE(2)) matrix Lie groups characterizing local rotation/translation of Frenet frame along the drone 3D (or 2D) trajectory. Statistics of this SE(3) (or SE(2)) Lie group elements will characterize flight mechanics of different kinds of object (birds, drones, …).
SU(1,1) or SE(2) are also fundamental tools in Image Processing (Sub-Riemannian Geometry of vision with SE(2)), in robotics (rigid bodies statistical analysis with SE(2)), in Natural Langage Processing (methods of graph-embedding in Poincaré disk with SU(1,1)), …. For instance, SU(1,1) Lie group which acts on Poincaré unit Disk is highly studied to embed isometrically a graph in an hyperbolic space. It is used by GAFAM (Google, Facebook, …) for Natural Language Processing by reducing graph analysis to a Machine Learning problem in Hyperbolic Poincaré Unit Disk. Hyperbolic Neural network [18] have been developed in this framework. SU(1,1) Lie group is also fundamental in Quantum physics to describe Coherent states of an electron in a magnetic field for instance [19] and Coherent states in Quantum Optics [20] (some statistical photon-counting aspects of SU(1,1) coherent states are emphasized). SE(2) Lie group is especially fundamental for Geometry of Vision considering sub-Riemannian approaches of the Citti-Petitot-Sarti Model [21] but also also in neuroimagery [22].

1.3.1. SU(1,1) Lie Group Machine Learning for Doppler Data Statistics Analysis

Lie group structure appears naturally on Doppler data, if we consider time series of locally stationary signal and their associated covariance matrix. Covariance matrix is Toeplitz Hermitian Positive Definite. Based on Theorem due to Verblunsky [23,24] and Trench [25], we can parametrize Hermitian Positive Definite Matrix in product space involving the Poincaré unit Polydisk:
φ : T H D P ( n ) R + * × D n 1 R n ( P 0 , μ 1 , , μ n 1 )
where D is the Poincaré Unit Disk:
D = { z = x + i y C / | z | < 1 }
The Poincaré unit disk is an homogeneous bounded domain where the Lie group SU(1,1) act transitively. This Matrix Group is given by:
S U ( 1 , 1 ) = { [ a b b * a * ] / | a | 2 | b | 2 = 1 ,   a , b C }
where SU(1,1) acts on the Poincaré Unit Disk by:
g S U ( 1 , 1 ) g . z = a z + b b * z + a
with Cartan Decomposition of SU(1,1)
( a b b * a * ) = | a | ( 1 z z * 1 ) ( a / | a | 0 0 a * / | a | ) with   z = b ( a * ) 1 , | a | = ( 1 | z | 2 ) 1 / 2
We can observe that z = b ( a * ) 1 could be considered as action of g S U ( 1 , 1 ) on the centre on the unit disk z = g . 0 = b ( a * ) 1 . The principal idea is that we can code any point z = b ( a * ) 1 in the unit disk by an element of the Lie group SU(1,1). Main advantage is that the point position is no longer coded by coordinates but intrinsically by transformation from the orogin 0 to this point. Finally, a covariance matrix of a stationary signal could be coded by (n−1) Matrix SU(1,1) Lie group elements:
  THPD R + * × D n 1 R + * × S U ( 1 , 1 ) n 1 R n ( P 0 , μ 1 , , μ n 1 ) ( P 0 , [ a 1 b 1 b 1 * a 1 * ] , , [ a n 1 b n 1 b n 1 * a n 1 * ] )

1.3.2. SE(2) and SE(3) Lie Groups Machine Learning for Kinematics Data Statistics Analysis

When we consider a 3D trajectory of a mobile target, we can describe this curve by a time evolution of the local Frenet-Serret frame (local frame with tangent vector, normal vector and binormal vector) as illustrated in Figure 2. This frame evolution is described by the Frenet-Serret formula that gives the kinematic properties of the target moving along the continuous, differentiable curve in 3D Euclidean space ℝ3. More specifically, the formulas describe the derivatives of the so-called tangent, normal, and binormal unit vectors in terms of each other.
d d t ( t n b ) = [ 0 κ 0 κ 0 γ 0 γ 0 ] ( t n b )   with   { κ : curvature γ : torsion
We will consider motions determined by exponentials of paths in the Lie algebra. Such a motion is determined by a unit speed space-curve τ ( t ) . Now in a Frenet-Serret motion a point in the moving body moves along the curve and the coordinate frame in the moving body remains aligned with the tangent t , normal n , and binormal b , of the curve. Using the 4-dimensional representation of the Lie group SE(3), the motion can be specified as:
G ( t ) = ( R ( t ) τ ( t ) 0 1 ) S E ( 3 )
where τ ( t ) is the curve and the rotation matrix R ( t ) has the unit vectors t , n , and b as columns:
R ( t ) = ( t n b ) S O ( 3 )
If we introduce the Darboux vector ω = γ t + κ b that we can rewritte from Frenet-Serret Formulas:
d t d t = ω × t   ,   d n d t = ω × n   ,   d b d t = ω × b
Then, we can write with Ω is the 3 × 3 anti-symmetric matrix corresponding to ω :
d R d t = Ω R
We note that d τ ( t ) d t = t and d ω d t = d γ d t t + d κ d t b .
The instantaneous twist of the motion G ( t ) is given by:
S d = d G ( t ) d t G 1 ( t ) = ( Ω υ 0 0 )
This is the Lie algebra element corresponding to the tangent vector to the curve G ( t ) . It is well known that elements of the Lie algebra s e ( 3 ) can be described as lines with a pitch. The fixed axode of a motion G ( t ) S E ( 3 ) is given by the axis of S d as t varies. The instantaneous twist in the moving reference frame is given by S b = G 1 ( t ) S d G ( t ) , that is, by the adjoint action on the twist in the fixed frame. The instantaneous twist S b can also be found from the relation:
S b = G 1 ( t ) d G ( t ) d t
S b = G 1 d G d t = ( R T R T τ 0 1 ) ( Ω R t 0 0 ) = ( R T Ω R R t 0 0 )
We can observe that we could describe a 3D trajectory by a time series of SE(3) Lie group elements:
S E ( 3 ) = { [ R τ 0 1 ] / R S O ( 3 ) , τ R 3 }
with
S O ( 3 ) = { R / R T R = R R T = I , det 2 R = 1 }
Then, the trajectory will be given by the following time series of SE(3) elements:
{ [ R 1 τ 1 0 1 ] , [ R 2 τ 2 0 1 ] , , [ R n τ n 0 1 ] } S E ( 3 ) n

2. New Results Introduced in the Paper

The paper is structured in two parts:
-
1st Part on “Gauss Density on Lie groups”: This part is totally new in Machine learning with an extension of “Gauss densities” (defined as Maximum Entropy model) for Lie groups coupling both Souriau model (introduced in statistical physics domain), with Information Geometry in Geometric Machine Learning domain. We illustrate with two use-cases SU(1,1) and SE(2) that are the most useful Lie groups in Image Processing (Sub-Riemannian Geometry of vision with SE(2)), in robotics (rigid bodies statistical analysis with SE(2)), in Natural Langage Processing (SU(1,1) with methods of graph-embedding in Poincaré disk), …. Some tentatives have been developed to define noise on Lie groups by adding additional Gaussian components on elements of the Lie algebra [26,27,28,29], but these models are not mathematically correct because they do not preserve the symmetries and the moment map associated to these symmetries by the Noether Theorem.
-
2nd part on “Entropy definition extension as Casimir Function”: This part gives a new geometric definition of Entropy as invariant Casimir function in coadjoint representation, explaining the invariance of entropy under the affine coadjoint action on moment map in the dual space of Lie algebra. This definition was not in the paper of Souriau. With this new definition, we can compute Entropy only by structure constraints given by the Lie group. It opens the door to new generalization of Maximum Entropy method and first of all computation of “Gaussian densities” for any Lie group. Applications of this new property is not developed in this paper but in a twin paper in the same special issue [17]. We refert to M. Gromov papers to consider more geometric structures of Entropy [30,31].
The main new results of this paper are the introduction of “Gauss density” for Lie groups or data on homogeneous space where a Lie groups acts transitively, and the full computation for SU(1,1) Lie group. This group acts transitively on the Poincaré unit disk, and so we have also solved an open problem related to Gauss density on this homogeneous space. For this purpose, the main approach has considered an extended definition of classical “Gauss density”, as introduced by Jaynes, in term of density of Maximum Entropy. In this way, the initial problem was transfert to a new one related to the good definition of Entropy for Lie groups. To address this problem, first, we have recalled the classical Euclidean case, where the Entropy S ( η ) could be defined as the Legendre transform of minus the log-partition function Φ ( θ ) = log R e θ , y d y (defined by Laplace transform) by the following equation S ( η ) = θ , η Φ ( θ )   with   η i = Φ ( θ ) θ i   and   θ i = S ( η ) η i . The next step was to explain how to extend the log-partition function for Lie groups. We have then considered the Laplace transform in the framework of Lie group representation theory as introduced by Alexander Kirillov and Geometric Statical Mechanics as modeled by Jean-Marie Souriau. We have preserved the same Legendre structure, and have defined the Entropy S ( Q ) , parametrized on the dual space of the Lie algebra Q g * (called geometric heat), as Legendre transform of minus of the log-partition function Φ ( β ) = log M e J ( ξ ) , β d λ ω , parametrized on the Lie algebra by β g (called geometric Planck Temperature), from a Laplace transform defined on the homogeneous symplectic manifold (associated to the Lie group by the Kirrilov-Kostant-Souriau 2-form called KKS 2-form in the litterature). By introducing the moment map J : M g * , fundamental tool of representation theory introduced by Souriau, we were able to define the log-partition function on the coadjoint orbit of the Lie group, Φ ( β ) = log g * e J ( ξ ) , β d λ ω . The entropy is then given by the Legendre transform S ( Q ) = Q , β Φ ( β )   with   Q = Φ ( β ) β g *   and   β = S ( Q ) Q g . We have then defined the Gauss density for Lie groups as the density that maximizes this Entropy S ( Q ) under the constraint of its associated first moment Q = Φ ( β ) β = M J ( ξ ) p G i b b s ( ξ ) d λ ω . The Gauss density is then established by analogy with thermodynamics as the Gibbs density p G i b b s ( ξ ) = e Φ ( β ) J ( ξ ) , β = e J ( ξ ) , β M e J ( ξ ) , β d λ ω . But this is not enough, because this density is not given in the good parametrization. We have proposed to express the Gibbs density with respect to the 1st statistical moment Q (statistical mean of moment map) by inverting the relation Q = Φ ( β ) β = Θ ( β ) . The Gibbs density p G i b b s , Q ( ξ ) = e Φ ( β ) J ( ξ ) , Θ 1 ( Q ) with β = Θ 1 ( Q ) will provide the extended definition of Gauss density in final good parametrization.
For the time being, no “Gaussian density” was defined on Poincaré unit disk with the mandatory property to be covariant under the action of SU(1,1) Lie group that acts transitively on this homogeneous bounded domain. We have applied the previous model via computation of moment map and developed the full computation of this extended Gauss density for SU(1,1) Lie group, S U ( 1 , 1 ) = { ( a b b * a * ) / a , b C ,   | a | 2 | b | 2 = 1 } and then deduced as consequence the gauss density for the Poincaré unit disk considered as the homogeneous symplectic manifold associated to the coadjoint orbit of the SU(1,1) Lie group via KKS 2 form. Considering the Lie algebra s u ( 1 , 1 ) = { ( i r η η * i r ) / r R , η C } and the dual space of the Lie algebra s u ( 1 , 1 ) * = { ( z x + i y x + i y z ) / x , y , z R } , we have computed the moment map J : D s u * ( 1 , 1 ) defined by J ( z ) . u i = J i ( z , z * ) , that maps D the Poincaré unit disk into a coadjoint orbit in s u * ( 1 , 1 ) , J ( z ) = J 1 ( z , z * ) u 1 * + J 2 ( z , z * ) u 2 * + J 3 ( z , z * ) u 3 * = ρ ( 1 + | z | 2 1 | z | 2 2 z * 1 | z | 2 2 z 1 | z | 2 1 + | z | 2 1 | z | 2 ) g * The moment map J is a diffeomorphism of D onto one sheet of the two-sheeted hyperboloid in s u * ( 1 , 1 ) , determined by the following equation J 1 2 J 2 2 J 3 2 = ρ 2   ,   J 1 ρ   with J 1 u 1 * + J 2 u 2 * + J 3 u 3 * s u * ( 1 , 1 ) . But the full SU(1,1) Lie group is not related to any equilibrium Gibbs state (the open subset of the Lie algebra, associated to this Gibbs state is empty). We have then considered one-parameter subgroups of the Lie group S U ( 1 , 1 ) such that the open subset Λ β = { β g / D e J ( z ) , β d λ ( z ) < + }   is not empty. In the neighborhood of the identity element, the elements of g S U ( 1 , 1 ) can be written as the exponential of an element β of its Lie algebra. If we make the remark that we have the following relation β 2 = ( i r η η * i r ) ( i r η η * i r ) = ( | η | 2 r 2 ) I , we can developed the exponential map by a Taylor expansion of the exponential function, which is given by the following relation g = exp ( ε β ) = k = 0 ( ε β ) k k ! = ( cosh ( ε R ) + i r sinh ( ε R ) R η sinh ( ε R ) R η * sinh ( ε R ) R cosh ( ε R ) i r sinh ( ε R ) R ) with   R 2 = | η | 2 r 2 .
We can observe that one condition is that | η | 2 r 2 > 0 then the subset to consider is given by the subset Λ β = { β = ( i r η η * i r ) , r R , η C / | η | 2 r 2 > 0 }   such that D e J ( z ) , β d λ ( z ) < + . Finally, we have computed the covariant Gibbs density in the unit disk given by β Λ β   and by the moment map of the Lie group S U ( 1 , 1 ) , that could be expressed in the following equation: p G i b b s ( z ) = e J ( z ) , β D e J ( z ) , β d λ ( z ) = = e ρ ( 1 + | z | 2 ( 1 | z | 2 ) 2 z * ( 1 | z | 2 ) 2 z ( 1 | z | 2 ) 1 + | z | 2 ( 1 | z | 2 ) ) , ( i r η η * i r ) D e J ( z ) , β d λ ( z )   with   d λ ( z ) = 2 i ρ d z d z * ( 1 | z | 2 ) 2 . To write the final Gibbs density with respect to its statistical moment, we rewrite the density with Q = E [ J ( z ) ] , by β = Θ 1 ( Q ) g where Q = Φ ( β ) β = Θ ( β ) g * and Q = E [ J ( z ) ] = E [ ρ ( 1 + | w | 2 ( 1 | w | 2 ) 2 w * ( 1 | w | 2 ) 2 w ( 1 | w | 2 ) 1 + | w | 2 ( 1 | w | 2 ) ) ] .
To extend this approach for covariant Gibbs density on Siegel Unit Disk S D = { Z M p q ( C ) / I p Z Z + > 0 } , that is a classical matrix extension of Poincaré unit Disk, we have proposed to consider G = S U ( p , q ) unitary group and the homogeneous space G / K = S U ( p , q ) / S ( U ( p ) , U ( q ) )   with   K = S ( U ( p ) × U ( q ) ) = { ( A 0 0 D ) / A U ( p ) , D U ( q ) , det ( A ) det ( D ) = 1 } and the moment map given by J ( Z ) = i λ ( ( I p Z Z + ) 1 ( p Z Z + q I p ) ( p + q ) Z ( I q Z + Z ) 1 ( p + q ) ( I q Z + Z ) 1 Z + ( p I q + q Z + Z ) ( I q Z + Z ) 1 ) .
After S U ( 1 , 1 ) Lie group (case with null cohomology), we have considered the same model for S E ( 2 ) Lie group with non-null cohomology that needs the use of symplectic one-cocycle to manage the defect of cohomology. We have considered the special Euclidean group S E ( 2 ) = { [ R φ τ 0 1 ] / R φ S O ( 2 ) , τ R 2 } with S O ( 2 ) = { R φ = [ cos φ sin φ sin φ cos φ ] / φ R } , and the Lie algebra s e ( 2 ) of S E ( 2 ) ( ξ , u ) s e ( 2 ) = R × R 2 [ ξ u 0 0 ] s e ( 2 ) with = [ 0 1 1 0 ] , to define the moment map J ( ξ , u ) ( x ) : R 2 s e * ( 2 ) that is given by the expression J ( ξ , u ) ( x ) = J ( x ) . ( ξ , u ) with J ( x ) = 2 ( 1 2 x 2 , x ) ,   x R 2 . Then, the Gibbs density is deduced for generalized temperature β Ω = { ( b , Β ) s e ( 2 ) / b < 0 , Β R 2 } by p G i b b s ( x ) = e J ( x ) , β R 2 e J ( x ) , β d λ ( x ) = e 1 2 b x 2 Β . x R 2 e 1 2 b x 2 Β . x d λ ( x ) , with the log-partition function given by the following expression Φ ( β ) = log R 2 e 1 2 b x 2 Β . x d λ ( x ) = log ( 2 π b e 1 2 b B 2 ) with Q = Φ ( β ) β = ( 1 b Β 2 2 b 2 , 1 b Β ) = Θ ( β ) and where Q Ω * = { ( m , M ) s e * ( 2 ) / m + M 2 2 < 0 } . To obtain the good parametrization related to statical moments, we have inverted the relation β = Θ 1 ( Q ) = ( ( m + 1 2 M 2 ) 1 , ( m + 1 2 M 2 ) 1 M ) , to provide the covariant Gibbs density parametrized by ( m , M ) = E ( J ( x ) ) = E [ 2 ( 1 2 x 2 , x ) ] = [ E ( x 2 ) , 2 E ( x ) ] . The final Gauss density for SE(2) is then p G i b b s ( x ) = e 1 2 x 2 M . x ( m + 1 2 M 2 ) R 2 e 1 2 x 2 M . x ( m + 1 2 M 2 ) d λ ( x ) .
We conclude the paper by a deeper study of Souriau model structure. We observe that Souriau Entropy S ( Q ) defined on coadjoint orbit of the group has a property of invariance S ( A d g # ( Q ) ) = S ( Q ) with respect to Souriau affine definition of coadjoint action A d g # ( Q ) = A d g * ( Q ) + θ ( g ) where θ ( g ) is called the Souriau cocyle. In the framework of Souriau Lie groups Thermodynamics, we can then characterize the Entropy as a generalized Casimir invariant function in coadjoint representation, and Massieu characteristic function (or log-partition function), dual of Entropy by Legendre transform, as a generalized Casimir function in adjoint representation. When M is a Poisson manifold, a function on M is a Casimir function if and only if this function is constant on each symplectic leaf (the non-empty open subsets of the symplectic leaves are the smallest embedded manifolds of M which are Poisson submanifolds) [15]. Classically, the Entropy is defined axiomatically as Shannon or von Neumann Entropies without any geometric structures constraints. In this paper, the Entropy is also presented as solution of the Casimir equation ( a d S Q * Q ) j + Θ ( S Q ) j = C i j k a d ( S Q ) i * Q k + Θ j = 0 with Θ ˜ ( X , Y ) = Θ ( X ) , Y = J [ X , Y ] { J X , J Y } = d θ ( X ) , Y ,   X , Y g , where Θ ( X ) = T e θ ( X ( e ) ) appears in case of non-null cohomology (non-equivariance of coadjoint operator on the moment map), with θ ( g ) the Souriau Symplectic cocycle. The dual space of the Lie algebra foliates into coadjoint orbits that are also the level sets on the entropy. The KKS (Kostant-Kirillov Souriau) 2-form, and the Souriau-Koszul-Fisher metric transform each orbit into a homogeneous Symplectic manifold. The information manifold foliates into level sets of the entropy that could be interpreted in the framework of Thermodynamics by the fact that motion remaining on this complex surfaces is non-dissipative, whereas motion transversal to these surfaces is dissipative, where the dynamic is given by d Q d t = { Q , H } Θ ˜ = a d H Q * Q + Θ ( H Q ) with stable equilibrium when H = S d Q d t = { Q , S } Θ ˜ = a d S Q * Q + Θ ( S Q ) = 0 . We have finally also observed that d S = Θ ˜ β ( H Q , β ) d t where Θ ˜ β ( H Q , β ) = Θ ˜ ( H Q , β ) + Q , [ H Q , β ]   , showing that Entropy production is linked with Souriau tensor related to Fisher metric.
The Casimir equations that we have introduced in non-zero cohomology case are consequences of the constancy of the entropy on adjoint orbits of the Lie algebra and of the equivariance of the map between the set of generalized temperatures and the dual space of the Lie algebra, as introduced by Jean-Marie in his 1974 paper. We explained this fact in the paper by starting elaboration of Casimir equations from the Souriau equation. Casimir equations are then presented in this context, as a fully equivalent form written in a new way, especially in the framework of Souriau Lie groups Thermodynamics. Souriau has not observed that the Entropy is an invariant Casimir function in coadjoint representation, but we can assume that he was fully aware of this invariant structure.
From Souriau equation Q , [ β , Z ] + Θ ˜ ( β , Z ) = 0 published in 1974, we have rewritten as direct consequence this equation on a Casimir form a d S Q * Q + Θ ( S Q ) = 0 . This equation preserves the geometric structures included in Souriau equation but allow us to consider the Entropy from the point of view of Casimir invariant function. The concept of Entropy and the concept of Casimir function were, for the time being, two disjoint concepts that have been developed independently in the past. There is a large literature on Casimir function, especially the russian one that have characterized properties of Casimir function. We refer to Igor V. Shirokov who has proposed a method for constructing invariants of the coadjoint representation of Lie groups with an arbitrary dimension and structure based on local symplectic coordinates on the coadjoint orbits. With Oleg L. Kurnyavko, Igor V. Shirokov has also proposed a general method for constructing invariant Casimir functions. The second reference is about A.T. Fomenko and V.V. Trofimov who have also deeply studied Casimir functions (but in case of null cohomology) and have developed the following equation that we can write for Entropy in null cohomology case S ( A d e t ξ * Q ) = S ( Q ) + n = 1 ( ϕ ( ξ ) ) n S n ! ( Q ) . t n with ϕ : g V e c ( Γ ) a representation of Lie algebras defined on basis ( e 1 , e 2 , , e n ) in g . We refer to a twin paper [17] developing consequences of this new definition of Entropy as an invariant Casimir function. In this twin paper, we study the associated Euler-Poincaré equation d Q d t = a d H Q * Q + Θ ( H Q ) and the stochastic extension based on a new Stratonovich differential equation for the stochastic process given by the following relation by mean of Souriau’s symplectic cocycle d Q + [ a d H Q * Q + Θ ( H Q ) ] d t + i = 1 N [ a d H i Q * Q + Θ ( H i Q ) ] d W i ( t ) = 0 . These kind of stochastic equations have been also studied by Alexis Arnaudon and Daryl Holm but only in the restricted case of null-cohomology [32].
We give references from classical textbooks (as Souriau book and papers) to preprints because different approaches have been developed in parallel to address Lie groups statistics, as soon as mid of last century, but without bridges between these disciplines which have developed specific tools to address this problem. We have limited these references to main and important documents, which are characterized as seminal and as tutorial of their domains. We have preserved references in French, because some works as Souriau Lie groups Thermodynamics model have not been yet largely spread towards the different communities.

3. Learning Inference Lie Groups Thermodynamics and Covariant Gibbs Density

We identify the Riemanian metric introduced by Souriau based on cohomology, in the framework of “Lie groups thermodynamics” as an extension of classical Fisher metric introduced in information geometry. We have observed that Souriau metric preserves Fisher metric structure as the Hessian of the minus logarithm of a partition function, where the partition function is defined as a generalized Laplace transform on a sharp convex cone. Souriau’s definition of Fisher metric extends the classical one in case of Lie groups or homogeneous manifolds. Souriau has developed this “Lie groups thermodynamics” theory in the framework of homogeneous symplectic manifolds in geometric statistical mechanics for dynamical systems, but as observed by Souriau, these model equations are no longer linked to the symplectic manifold but equations only depend on the Lie group and the associated cocycle [33,34]. This analogy with Fisher metric opens potential applications in machine learning, where the Fisher metric is used in the framework of information geometry, to define the “natural gradient” tool for improving ordinary stochastic gradient descent sensitivity to rescaling or changes of variable in parameter space. In machine learning revised by natural gradient of information geometry, the ordinary gradient is designed to integrate the Fisher matrix. Amari has theoretically proved the asymptotic optimality of the natural gradient compared to classical gradient. With the Souriau approach, the Fisher metric could be extended, by Souriau-Fisher metric, to design natural gradients for data on homogeneous manifolds. Information geometry has been derived from invariant geometrical structure involved in statistical inference. The Fisher metric defines a Riemannian metric as the Hessian of two dual potential functions, linked to dually coupled affine connections in a manifold of probability distributions. With the Souriau model, this structure is extended preserving the Legendre transform between two dual potential function parametrized in Lie algebra of the group acting transentively on the homogeneous manifold.

3.1. Inference by Natutal Gradient and Legendre Structure

Classically, to optimize the parameter θ of a probabilistic model, based on a sequence of observations y t , is an online gradient descent:
θ t θ t 1 η t l t ( y t ) T θ
with learning rate η t , and the loss function l t = log p ( y t / y ^ t ) . This simple gradient descent has a first drawback of using the same non-adaptive learning rate for all parameter components, and a second drawback of non invariance with respect to parameter re-encoding inducing different learning rates. Amari has introduced the natural gradient to preserve this invariance to be insensitive to the characteristic scale of each parameter direction. The gradient descent could be corrected by I ( θ ) 1 where I is the Fisher information matrix with respect to parameter θ , given by:
I ( θ ) = [ g i j ]   with   g i j = [ E y p ( y / θ ) [ 2 log p ( y / θ ) θ i θ j ] ] i j
with natural gradient:
θ t θ t 1 η t I ( θ ) 1 l t ( y t ) T θ
Amari has proved that the Riemannian metric in an exponential family is the Fisher information matrix defined by:
g i j = [ 2 Φ θ i θ j ] i j   with   Φ ( θ ) = log R e θ , y d y
and the dual potential, the Shannon entropy, is given by the Legendre transform:
S ( η ) = θ , η Φ ( θ )   with   η i = Φ ( θ ) θ i   and   θ i = S ( η ) η i
We can observe that Φ ( θ ) = log R e θ , y d y = log ψ ( θ ) is linked with the cumulant generating function.
J.L. Koszul and E. Vinberg have introduced an affinely invariant Hessian metric on a sharp convex cone through its characteristic function:
Φ Ω ( θ ) = log Ω * e θ , y d y = log ψ Ω ( θ )   with   θ Ω   sharp   convex   cone ψ Ω ( θ ) = Ω * e θ , y d y   with   Koszul-Vinberg   Characteristic   function
Jean-Louis Koszul has introduced the following forms
1st Koszul form:
α = d Φ Ω ( θ ) = d log ψ Ω ( θ )
2nd Koszul form:
γ = D α = D d log ψ Ω ( θ )
with the following property of positive definitiveness:
( D d log ψ Ω ( x ) ) ( u ) = 1 ψ Ω ( u ) 2 [ Ω * F ( ξ ) 2 d ξ . Ω * G ( ξ ) 2 d ξ ( Ω * F ( ξ ) . G ( ξ ) d ξ ) 2 ] > 0 with   F ( ξ ) = e 1 2 x , ξ   and   G ( ξ ) = e 1 2 x , ξ u , ξ
Koszul has defined the following Diffeomorphism:
η = α = d log ψ Ω ( θ ) = Ω * ξ p θ ( ξ ) d ξ   with   p θ ( ξ ) = e ξ , θ Ω * e ξ , θ d ξ
with preservation of Legendre transform:
S Ω ( η ) = θ , η Φ Ω ( θ )   with   η = d Φ Ω ( θ )   and   θ = d S Ω ( η )

3.2. Souriau Lie Groups Thermodynamique and Souriau-Koszul-Fisher Metric

This relations have been extended by Jean-Marie Souriau in geometric statistical mechanics, where he developed a “Lie groups thermodynamics” of dynamical systems where the (maximum entropy) Gibbs density is covariant with respect to the action of the Lie group. In the Souriau model, previous structures of information geometry are preserved:
I ( β ) = 2 Φ β 2   with   Φ ( β ) = log M e U ( ξ ) , β d λ ω   and   U : M g *
S ( Q ) = Q , β Φ ( β )   with   Q = Φ ( β ) β g *   and   β = S ( Q ) Q g
In the Souriau Lie groups thermodynamics model, β is a “geometric” (Planck) temperature, element of Lie algebra g of the group, and Q is a “geometric” heat, element of the dual space of the Lie algebra g * of the group. Souriau has proposed a Riemannian metric that we have identified as a generalization of the Fisher metric:
I ( β ) = [ g β ]   with   g β ( [ β , Z 1 ] , [ β , Z 2 ] ) = Θ ˜ β ( Z 1 , [ β , Z 2 ] )
with   Θ ˜ β ( Z 1 , Z 2 ) = Θ ˜ ( Z 1 , Z 2 ) + Q , a d Z 1 ( Z 2 )     where   a d Z 1 ( Z 2 ) = [ Z 1 , Z 2 ]
Souriau has proved that all co-adjoint orbit of a Lie group given by O F = { A d g * F = g 1 F g , g G } subset   of   g * , F g * carries a natural homogeneous symplectic structure by a closed G-invariant 2-form. If we define K = A d g * = ( A d g 1 ) * and K * ( X ) = ( a d X ) * with A d g * F , Y = F , A d g 1 Y , g G , Y g , F g * where if X g , A d g ( X ) = g X g 1 g , the G-invariant 2-form is given by the following expression σ Ω ( a d X F , a d Y F ) = B F ( X , Y ) = F , [ X , Y ] , X , Y g . Souriau Foundamental Theorem is that « Every symplectic manifold on which a Lie group acts transitively by a Hamiltonian action is a covering space of a coadjoint orbit ». We can observe that for Souriau model, Fisher metric is an extension of this 2-form in non-equivariant case g β ( [ β , Z 1 ] , [ β , Z 2 ] ) = Θ ˜ ( Z 1 , [ β , Z 2 ] ) + Q , [ Z 1 , [ β , Z 2 ] ]   .
The Souriau additional term Θ ˜ ( Z 1 , [ β , Z 2 ] ) is generated by non-equivariance through Symplectic cocycle. The tensor Θ ˜ used to define this extended Fisher metric is defined by the moment map J ( x ) , application from M (homogeneous symplectic manifold) to the dual space of the Lie algebra g * , given by:
Θ ˜ ( X , Y ) = J [ X , Y ] { J X , J Y }
with   J ( x ) : M g *   such   that   J X ( x ) = J ( x ) , X ,   X g
This tensor Θ ˜ is also defined in tangent space of the cocycle θ ( g ) g * (this cocycle appears due to the non-equivariance of the coadjoint operator A d g * , action of the group on the dual space of the lie algebra; the action of the group on the dual space of the Lie algebra is modified with a cocycle so that the momentu map becomes equivariant relative to this new affine action):
Q ( A d g ( β ) ) = A d g * ( Q ) + θ ( g )
θ ( g ) g * is called nonequivariance one-cocycle, and it is a measure of the lack of equivariance of the moment map.
Θ ˜ ( X , Y ) : g × g with   Θ ( X ) = T e θ ( X ( e ) ) X , Y Θ ( X ) , Y
The cocycle should verify:
θ ( s t ) = J ( ( s t ) . x ) A d s t * J ( x ) θ ( s t ) = [ J ( s . ( t . x ) ) A d s * J ( t . x ) ] + [ A d s * J ( t . x ) A d s * A d t * J ( x ) ] θ ( s t ) = θ ( s ) + A d s * [ J ( t . x ) A d t * J ( x ) ] θ ( s t ) = θ ( s ) + A d s * θ ( t )
We can also compute tangent of one-cocycle θ at neutral element, to compute 2-cocycle Θ :
ζ g , θ ζ ( s ) = θ ( s ) , ζ = J ( s . x ) , ζ A d s * J ( x ) , ζ = J ( s . x ) , ζ J ( x ) , A d s 1 ζ T e θ ζ ( ξ ) = T x J . ξ p ( x ) , ζ + J ( x ) , a d ξ ζ   with   ξ p = X J , ξ T e θ ζ ( ξ ) = X J ( x ) , ξ [ J ( x ) , ζ ] + J ( x ) , [ ξ , ζ ] T e θ ζ ( ξ ) = { J , ξ , J , ζ } + J ( x ) , [ ξ , ζ ] = Θ ( ξ )
We can also write: T x J ( ξ p ( x ) ) = a d ξ * J ( x ) + Θ ( ξ , . )
By differentiating the equation on affine action, we have:
d J ( X x ) = a d X J ( x ) + d θ ( X )   ,   x M , X g
d J ( X x ) , Y = a d X J ( x ) , Y + d θ ( X ) , Y ,   x M , X , Y g d J ( X x ) , Y = J ( x ) , [ X , Y ] + d θ ( X ) , Y = { J , X , J , Y } ( x )   J ( x ) , [ X , Y ] { J , X , J , Y } ( x ) = d θ ( X ) , Y
It can be then deduced that the tensor could be also written:
Θ ˜ ( X , Y ) = J [ X , Y ] { J X , J Y } = d θ ( X ) , Y   ,   X , Y g
with the cocycle property:
Θ ˜ ( [ X , Y ] , Z ) + Θ ˜ ( [ X , Y ] , Z ) + Θ ˜ ( [ X , Y ] , Z ) = 0   ,   X , Y , Z g
By noting the action of the group on the dual space of the Lie algebra:
G × g * g * , ( s , ξ ) s ξ = A d s * ξ + θ ( s )
Associativity is also derived:
( s 1 s 2 ) ξ = A d s 1 s 2 * ξ + θ ( s 1 s 2 ) = A d s 1 * A d s 2 * ξ + θ ( s 1 ) + A d s 1 * θ ( s 2 ) ( s 1 s 2 ) ξ = A d s 1 * ( A d s 2 * ξ + θ ( s 2 ) ) + θ ( s 1 ) = s 1 ( s 2 ξ )   ,   s 1 , s 2 G , ξ g *
This study of the moment map J equivariance, and the existence of an affine action of G on g * , whose linear part is the coadjoint action, for which the moment J is equivariant, is at the cornerstone of Souriau theory of geometric mechanics and Lie groups thermodynamics.

3.3. Souriau Entropy and Souriau-Fisher-Koszul Metric Invariance under the Action of the Group and Covariant Souriau Gibbs Density

In Souriau’s Lie groups thermodynamics, the invariance by re-parameterization in information geometry has been replaced by invariance with respect to the action of the group. When an element of the group g acts on the element β g of the Lie algebra, given by adjoint operator A d g . Under the action of the group A d g ( β ) , the entropy S ( Q ) and the Fisher metric I ( β ) are invariant:
β g A d g ( β ) { S [ Q ( A d g ( β ) ) ] = S ( Q ) I [ A d g ( β ) ] = I ( β )
In the framework of Lie group action on a symplectic manifold, equivariance of moment map could be studied to prove that there is a unique action a(.,.) of the Lie group G on the dual g * of its Lie algebra for which the moment map J is equivariant, that means for each x M :
J ( Φ g ( x ) ) = a ( g , J ( x ) ) = A d g * ( J ( x ) ) + θ ( g )
When coadjoint action is not equivariant, the symmetry is broken, and new “cohomological” relations should be verified in Lie algebra of the group. A natural equilibrium state will thus be characterized by an element of the Lie algebra of the Lie group, determining the equilibrium temperature β . The entropy s ( Q ) , parametrized by Q the geometric heat (mean of energy U , element of the dual space of the Lie algebra) is defined by the Legendre transform of the Massieu potential Φ ( β ) parametrized by β ( Φ ( β ) is the minus logarithm of the partition function ψ Ω ( β ) ).
A Gibbs state, in the usual sense, is a statistical state at which the entropy is stationary with respect to all infinitesimal variations of the statistical state for which the mean value of the energy remains constant. In the sense of Souriau, a generalized Gibbs state is a statistical state at which the entropy is stationary with respect to all infinitesimal variations of the statistical state for which the mean value of the moment map remains constant. This generalization is very natural, since the energy can be considered as the moment map of the Hamiltonian action of the one-dimensional Lie group of time translations. Furthermore, each generalized Gibbs state is associated to an element of the Lie algebra of the group, called by Souriau a generalized temperature, and that the set of possible generalized temperature is not, in general the whole Lie algeba, but an open convex subset of the Lie algebra, which may be empty, for which some integrals encountered in the expression of the generalized Gibbs state are normally convergent. So, for some Lie groups, generalized Gibbs states do not exist, and there is no Souriau Lie groups thermodynamics.
Souriau has then defined a Gibbs density that is covariant under the action of the group:
p G i b b s ( ξ ) = e Φ ( β ) U ( ξ ) , β = e U ( ξ ) , β M e U ( ξ ) , β d λ ω   ,   with   Φ ( β ) = log M e U ( ξ ) , β d λ ω Q = Φ ( β ) β = M U ( ξ ) e U ( ξ ) , β d λ ω M e U ( ξ ) , β d λ ω = M U ( ξ ) p ( ξ ) d λ ω
We can express the Gibbs density with respect to Q by inverting the relation Q = Φ ( β ) β = Θ ( β ) . Then p G i b b s , Q ( ξ ) = e Φ ( β ) U ( ξ ) , Θ 1 ( Q ) with β = Θ 1 ( Q ) . All Souriau equations of Lie groups Thermodynamics are illustrated in Figure 3 and Figure 4.
Souriau completed his “geometric heat theory” by introducing a 2-form in the Lie algebra, that is a Riemannian metric tensor in the values of adjoint orbit of β , [ β , Z ] with Z an element of the Lie algebra. This metric is given for ( β , Q ) :
g β ( [ β , Z 1 ] , [ β , Z 2 ] ) = Θ ( Z 1 ) , [ β , Z 2 ] + Q , [ Z 1 , [ β , Z 2 ] ]
where Θ is a cocycle of the Lie algebra, defined by Θ = T e θ with θ a cocycle of the Lie group defined by θ ( M ) = Q ( A d M ( β ) ) A d M * Q .
We observe that Souriau Riemannian metric, introduced with symplectic cocycle, is a generalization of the Fisher metric, that we call the Souriau-Fisher metric, that preserves the property to be defined as a Hessian of the partition function logarithm g β = 2 Φ β 2 = 2 log ψ Ω β 2 as in classical information geometry. We will establish the equality of two terms, between Souriau definition based on Lie group cocycle Θ and parameterized by “geometric heat” Q (element of the dual space of the Lie algebra) and “geometric temperature” β (element of Lie algebra) and hessian of characteristic function Φ ( β ) = log ψ Ω ( β ) with respect to the variable β (as illustrated in Figure 5):
g β ( [ β , Z 1 ] , [ β , Z 2 ] ) = Θ ( Z 1 ) , [ β , Z 2 ] + Q , [ Z 1 , [ β , Z 2 ] ] = 2 log ψ Ω β 2
If we differentiate this relation of Souriau theorem Q ( A d g ( β ) ) = A d g * ( Q ) + θ ( g ) , this relation occurs:
Q β ( [ Z 1 , β ] , . ) = Θ ˜ ( Z 1 , [ β , . ] ) + Q , A d . Z 1 ( [ β , . ] ) = Θ ˜ β ( Z 1 , [ β , . ] )
Q β ( [ Z 1 , β ] , Z 2 . ) = Θ ˜ ( Z 1 , [ β , Z 2 ] ) + Q , A d . Z 1 ( [ β , Z 2 ] ) = Θ ˜ β ( Z 1 , [ β , Z 2 ] )
Q β = g β ( [ β , Z 1 ] , [ β , Z 2 ] )
As the entropy is defined by the Legendre transform of the characteristic function, a dual metric of the Fisher metric is also given by the hessian of “geometric entropy” S ( Q ) with respect to the dual variable given by Q: 2 S ( Q ) Q 2 .
For the maximum entropy density (Gibbs density), the following three terms coincide: 2 log ψ Ω β 2 that describes the convexity of the log-likelihood function, I ( β ) = E [ 2 log p β ( ξ ) β 2 ] the Fisher metric that describes the covariance of the log-likelihood gradient, whereas I ( β ) = E [ ( ξ Q ) ( ξ Q ) T ] = V a r ( ξ ) that describes the covariance of the observables. We can also observe that the Fisher metric I ( β ) = Q β is exactly the Souriau metric defined through symplectic cocycle:
I ( β ) = Θ ˜ β ( Z 1 , [ β , Z 2 ] ) = g β ( [ β , Z 1 ] , [ β , Z 2 ] )
The Fisher metric I ( β ) = 2 Φ ( β ) β 2 = Q β has been considered by Souriau as a generalization ofheat capacity”. Souriau called it K the “geometric capacity”.

3.4. Covariant Souriau Gibbs Density and Information Manifold Foliation

R.F. Streater has studied in 1999, Information Geometry for some Lie algebra where for certain unitary representation of a Lie algebra, he has defined the statistical manifold of states as convex cone for which the partition function is finite, making reference to Bogoliubov-Kubo-Mori metric. But Streater has only developed the case with null cohomology for so (3) and sl (2,R) Lie alebras. Nevertheless, as observed by R.F. Streater in his paper “Information Geometry for some Lie algebras” [35], referring to Kirillov work and Roger Balian paper, “We can expect further natural structures to arise in this case. Indeed, it is known (*) that the dual to the Lie algebra, which parametrizes the state-space in this case, foliates into coadjoint orbits; there are also the level sets on the entropy; Kirillov form, and the BKM (Bogoliubov-Kubo-Mori) metric, together make each orbit into kähler space, along the lines proposed by Kostant. Motion along these holomorphic directions is nondissipative. The transversal to the orbits is a real half-line, which represents the dissipative direction…We study the case of sl (2,R) in the discrete series of representations. We show the information manifold foliates into level sets of the entropy, each being isometric to H, the Poincaré upper half-plane… The states of constant entropy are the hyperboloids and β is the dissipative coordinate… For an integrable system described by a Lie algebra in a traceable representation, we find that the information manifold foliates into complex spaces; the level sets of entropy can be given a complex structure by the method of Kostant. Motion remaining on the complex surfaces is nondissipative, whereas motion transversal to these surfaces is dissipative. In information geometry, the state is parametrized by the canonical coordinates. Which function of them is measured by a thermometer? In our models, it is reasonable to designate 1 / β to be the temperature; it is a dissipative coordinate, and it increases with time, showing that the system is thermalizing”.

4. Mathematical Definition of Souriau Moment Map

Previously, we have introduced the concept of Souriau’s moment map. In this chapter, we will introduce a mathematical definition of this tool, as defined in Souriau’s book [36] with modern notations [37,38,39,40,41]. Other details on moment map are also given in Jean-Louis Koszul’s Book [42].

4.1. Operations on Vector Fields

Consider a map F : X R M Y R N , y = F ( x ) , the derivative of F at x X , D F : X R N × M is given by:
( δ y 1 δ y N ) = ( y 1 x 1 y 1 x M y N x 1 y N x M ) ( δ x 1 δ x M ) = D F ( x ) ( δ x ) = L i m t 0 F ( x + t δ x ) F ( x ) t
Second derivative is given by the linear map D 2 F : X R N × M × M :
δ [ y x ] = 2 y x 2 ( δ x ) = D 2 F ( x ) ( δ x )
Consider a vector Field V on X R M defined by: V : X R M R M , operations on vector fields are given by adjoint action and Lie bracket:
A d F V ( y ) = d d t [ F e t V F 1 ] ( y ) | t = 0 = D F ( x ) ( V ( x ) )   with   x = F 1 ( y )
[ U , V ] ( x ) = d d s A d e s U V ( x ) | s = 0 = D U ( x ) ( V ( x ) ) D V ( x ) ( U ( x ) )
0-form is a scalar, 1-form are row ω = ( ω 1 ω M ) in dual space. 2-forms can be regarded as antisymmetric matrices ( ω i j ) with ω ( u , v ) = u t ( ω 11 ω 1 M ω M 1 ω M M ) v . m-forms are all scalar multiples of the standard volume form vol, defined by V o l ( v 1 , , v m ) = det ( matrix   with   columns   v 1 , , v m ) .

4.2. Derivative Rules by Sophus Lie, Elie Cartan and Henri Cartan

With the following classical definitions:
  • Pull back: F * ω is a p-form on X
    F * ω ( v 1 , , v p ) = ω F ( x ) ( D F ( x ) ( v 1 ) , , D F ( x ) ( v p ) )
  • Interior product: i V ω is the (p−1)form on M obtained by inserting V ( x ) as the first argument of ω
    i V ω ( v 2 , v p ) = ω ( V ( x ) , v 2 , , v p )
  • Exterior product: θ ω is the (p + 1)-form on X where ω is a p-form and θ is a 1-form on M (where the hat indicates a term to be omitted):
    θ ω ( v 0 , , v p ) = i = 0 p ( 1 ) i θ ( v i ) ω ( v 0 , , v ^ i , , v p )
  • Lie derivative: L V ω is a p-form on M , and L V ω = 0 if the flow of V consists of symmetries of ω :
    L V ω ( v 1 , , v p ) = d d t e t V * ω ( v 1 , , v p ) | t = 0
    d ω is the (p+1)-form on M defined by taking the ordinary derivative of ω and then antisymmetrizing:
  • Exterior derivative:
    d ω ( v 0 , , v p ) = i = 0 p ( 1 ) i ω x ( v i ) ( v 0 , , v ^ i , , v p )
    p = 0 , [ d ω ] i = i ω   ;   p = 1 , [ d ω ] i j = i ω j j ω i   ;   p = 2 , [ d ω ] i j k = i ω j k + j ω k i + k ω i j
From these definitions, the properties of the exterior and Lie Derivative were established by Sophus Lie, Elie Cartan, and Henri Cartan:
L V ω = d i V ω + i V d ω
(Elie Cartan equation)
i [ U , V ] ω = i V L U ω L U i V ω
(Henri Cartan equation)
L [ U , V ] ω = L V L U ω L U L V ω
(Sophus Lie equation)

4.3. Souriau Moment Map

Considering Manifolds and Lie groups, We define the tangent bundle T X of X as the disjoint union of the T x X , or the set of all pairs ( δ x x ) with x X and δ x T x X . If F : X Y is a smooth map between manifolds, its tangent map is the map:
F * ( δ x x ) = ( D F ( x ) ( δ x ) F ( x ) )
A Lie group is a group G with a manifold structure such that the product ( g , h ) g h and the inversion g g 1 are smooth maps from G × G (resp. G) to G . Its Lie algebra is the tangent space g = T e G at the identity element. A smooth action of G on a manifold X is a group morphism:
Φ : G × X D i f f ( X ) ( g , x ) g . x
The orbit of x X is G ( x ) = { g . x : g G } .
The tangent space to an orbit at x :
T x G ( x ) = { Z ( x ) : Z g } = g / g x with Z ( x ) = d d t e t Z ( x ) | t = 0 and where
g x = { Z g : Z ( x ) = 0 }
Let ( M , σ ) be a connected symplectic manifold. A vector field η on M is called symplectic if its flow preserves the 2-form: L η σ = 0 . If we use Elie Cartan’s formula, we can deduce that L η σ = d i η σ + i η d σ = 0 but as d σ = 0 then d i η σ = 0 . We observe that the 1-form i η σ is closed. When this 1-form is exact, there is a smooth function x H on M with:
i η σ = d H
This vector field η is called Hamiltonian and could be defined as symplectic gradient η = S y m p H .
Let a Lie group G that acts on M and that also preserve σ . A moment map exists if these infinitesimal generators are actually hamiltonian, so that a map J : M g * exists with:
i Z X σ = d H Z where
H Z = J ( x ) , Z
The Poisson bracket of two functions H , H is defined by:
{ H , H } = σ ( η , η ) = σ ( S y m p H , S y m p H )   with   i η σ = d H   and i η σ = d H
If G is connected, then the moment map is G-equivariant if and only if it satisfies { H Z , H Z } = H [ Z , Z ] .
Souriau has proved thet every coadjoint orbit of a Lie group is a homogeneous symplectic manifold when endowed with the KKS 2-form σ ( Z ( x ) , Z ( x ) ) = x , [ Z , Z ] , and conversely, every homogeneous symplectic manifold of a connected Lie group G is, up to a possible covering, a coadjoint orbit of some central extension of G. σ is G-invariant.

5. Poincaré Unit Disk, SU(1,1) Lie Group and Souriau Moment Map

We will introduce Souriau moment map for SU(1,1)/K group that acts transitively on Poincaré Unit Disk, based on moment map. More details on computation of moment map for SU(1,1)/K Lie group is given in Appendix A of this document.

5.1. Poincaré Unit Disk and SU(1,1) Lie Group

The group of complex unimodular pseudo-unitary matrices S U ( 1 , 1 ) , is the set of elements u such that [43,44,45,46,47,48,49,50,51,52]:
u M u + = M   with   M = ( + 1 0 0 1 )
We can show that the most general matrix u belongs to the Lie group given by:
G = S U ( 1 , 1 ) = { ( a b b * a * ) / | a | 2 | b | 2 = 1 ,   a , b C }
Its Cartan decomposition is given by:
( a b b * a * ) = | a | ( 1 z z * 1 ) ( a / | a | 0 0 a * / | a | )   with   z = b ( a * ) 1 , | a | = ( 1 | z | 2 ) 1 / 2
( a b b * a * ) ( 1 z z * 1 ) = | a | ( 1 z z * 1 ) ( a / | a | 0 0 a * / | a | )   with   { a = b z * + a z = a z + b b * z + a *
S U ( 1 , 1 ) is associated to group of holomorphic automorphisms of the Poincaré unit disk D = { z = x + i y C / | z | < 1 } in the complex plane, by considering its action on the disk as g ( z ) = ( a z + b ) / ( b * z + a * ) . The following measure on Unit disk:
d μ 0 ( z , z * ) = 1 2 π i d z d z * ( 1 | z | 2 ) 2
is invariant under the action of S U ( 1 , 1 ) captured by the fractional holomorphic transformation:
d z d z * ( 1 | z | 2 ) 2 = d z d z * ( 1 | z | 2 ) 2
The complex unit disk admits a Kähler structure determined by potential function:
Φ ( z , z * ) = log ( 1 z z * )
The invariant 2-form is:
Ω = 1 i 2 Φ ( z , z * ) z z * d z d z * = 1 i d z d z * ( 1 | z | 2 ) 2
which is closed d Ω = 0 . This group S U ( 1 , 1 ) is isomorphic to the group S L ( 2 , R ) as a real Lie group, and the Lie algebra g = 𝖘 u ( 1 , 1 ) is given by:
g = { ( i r η η * i r ) / r R , η C }
with the bases ( u 1 , u 2 , u 3 ) g : u 1 = 1 2 ( 0 i i 0 )   ,   u 2 = 1 2 ( 0 1 1 0 )   ,   u 3 = 1 2 ( i 0 0 i )
with the commutation relation:
[ u 3 , u 2 ] = u 1 , [ u 3 , u 1 ] = u 2 , [ u 2 , u 1 ] = u 3
Dual base on the dual space of the Lie algebra is named ( u 1 * , u 2 * , u 3 * ) g * . The dual vector space g * = 𝖘 u * ( 1 , 1 ) can be identified with the subspace of 𝖘 𝖑 ( 2 , C ) of the form:
g * = { ( z x + i y x + i y z ) = x ( 0 1 1 0 ) + y ( 0 i i 0 ) + z ( 1 0 0 1 ) / x , y , z R }
Coadjoint action of g G on dual space of the Lie algebra ξ g * is written g . ξ .

5.2. Coadjoint Orbit of SU(1,1) and Souriau Moment Map

We will use results of C. Cishahayo and S. de Bièvre [53] and B. Cahen [54,55] for computation of moment map of S U ( 1 , 1 ) . Let r R * + , orbit O ( r u 3 * ) of r u 3 * for the coadjoint action of g G could be identified with the upper half sheet x 3 > 0 of { ξ = x 1 u 1 * + x 2 u 2 * + x 3 u 3 * / x 1 2 x 2 2 + x 3 2 = r 2 } , the two-sheet hyperboloid. The stabilizer of r u 3 * for the coadjoint action of G is torus K = { ( e i θ 0 0 e i θ ) , θ R } . K induces rotations of the unit disk, and leaves 0 invariant. The stabilizer for the origin 0 of unit disk is maximal compact subgroup K of SU(1,1). We can observe [54] that O ( r u 3 * ) = G / K . On the other hand O ( r u 3 * ) = G / K is diffeomorphic to the unit disk D = { z C / | z | < 1 } , then by composition, the Souriau moment map is given by:
J : D O ( r u 3 * ) z J ( z ) = r ( z + z * ( 1 | z | 2 ) u 1 * + z z * i ( 1 | z | 2 ) u 2 * + 1 + | z | 2 ( 1 | z | 2 ) u 3 * )
J is linked to the natural action of G on D (by fractional linear transforms) but also the coadjoint action of G on O ( r u 3 * ) = G / K . J 1 could be interpreted as the stereographic projection from the two-sphere S 2 onto C [56]. In case r = n 2 where n N + , n 2 then the coadjoint orbit is given by O n = O ( ζ n ) with ξ n = n 2 u 3 * g * , with stabilizer of ξ n for coadjoint action the torus K = { ( e i θ 0 0 e i θ ) , θ R } with Lie algebra R u 3 . O n = O ( ζ n ) is associated with a holomorphic discrete series representation π n of G by the KKS (Kirillov-Kostant-Souriau) method of orbits.
J : D O n z J ( z ) = n 2 ( z + z * ( 1 | z | 2 ) u 1 * + z z * i ( 1 | z | 2 ) u 2 * + 1 + | z | 2 ( 1 | z | 2 ) u 3 * )
Group G act on D by homography g . z = ( a b b * a * ) . z = a z + b b * z + a * . This action corresponds with coadjoint action of G on O n . The Kirillov-Kostant-Souriau 2-form of O n is given by:
Ω n ( ζ ) ( X ( ζ ) , Y ( ζ ) ) = ζ , [ X , Y ]   ,   X , Y g   and   ζ O n
and is associated in the frame by J with:
ω n = i n ( 1 | z | 2 ) 2 d z d z *
with the corresponding Poisson Bracket:
{ f , g } = i ( 1 | z | 2 ) 2 ( f z g z * f z * g z )
It has been also observed that there are 3 basic observables generating the S U ( 1 , 1 ) symmetry on classical level:
{ D R z k 3 ( z ) = 1 + | z | 2 1 | z | 2 , { D R z k 1 ( z ) = 1 i z z * 1 | z | 2 , { D R z k 2 ( z ) = z + z * 1 | z | 2
with the Poisson commutation rule:
{ k 3 , k 1 } = k 2 , { k 3 , k 2 } = k 1 , { k 1 , k 2 } = k 3
( k 1 , k 2 , k 3 ) vector points to the upper sheet of the two-sheeted hyperboloid in R 3 given by k 3 2 k 1 2 k 2 2 = 1 , whose the stereographic projection onto the open unit disk is:
{ ( k 1 , k 2 , k 3 ) H + D z = k 2 + i k 1 1 + k 3 = k 3 1 k 3 + 1 e i arg z
Under the action of g G = S U ( 1 , 1 ) = { ( a b b * a * ) / | a | 2 | b | 2 = 1 ,   a , b C } :
( k k 3 k 3 k + ) = ( k 2 + i k 1 k 3 k 3 k 2 i k 1 ) = 1 1 | z | 2 ( 2 z 1 + | z | 2 1 + | z | 2 2 z * ) is transform in:
( k k 3 k 3 k + ) = ( k ( g 1 . z ) k 3 ( g 1 . z ) k 3 ( g 1 . z ) k + ( g 1 . z ) ) = g 1 ( k k 3 k 3 k + ) ( g 1 ) t
This transform can be viewed as the co-adjoint action of S U ( 1 , 1 ) on the coadjoint orbit identified with k 3 2 k 1 2 k 2 2 = 1 . We can also observe that the quotient S U ( 1 , 1 ) / K is isomorphic to the upper sheet of the hyperboloid described by k 3 2 k 1 2 k 2 2 = 1 , by the following parametrization ( τ , φ ) , given by n = ( cosh τ , sinh τ cos φ , sinh τ sin φ ) , and its stereographic projection onto the inside of the unit disk, parametrized by ς = tanh τ 2 e i φ .

6. Covariant Gibbs Density by Souriau Thermodynamics for Poincaré Unit Disk

6.1. Fourier Transform, Laplace Transform and Lie Group Representation Theory

In Souriau Lie Group Thermododynamic, we have to consider Laplace Transform defined on coadjoint orbits to define Massieu Potential Function and Gibbs density. This problem has been solved in the domain of Kirillov Representation Theory. Representation theory studies abstract algebraic structures by representing their elements as linear transformations of vector spaces, and algebraic objects (Lie groups, Lie algebras) by describing its elements by matrices and the algebraic operations in terms of matrix addition and matrix multiplication, reducing problems of abstract algebra to problems in linear algebra. Representation theory generalizes Fourier analysis via harmonic analysis. The modern development of Fourier analysis during XXth century has explored the generalization of Fourier and Fourier-Plancherel formula for non-commutative harmonic analysis, applied to locally compact non-Abelian groups. This has been solved by geometric approaches based on “orbits methods” (Fourier-Plancherel formula for G is given by coadjoint representation of G in dual vector space of its Lie algebra) with many contributors (Dixmier, Kirillov, Bernat, Arnold, Berezin, Kostant, Souriau, Duflo, Guichardet, Torasso, Vergne, Paradan, etc.) [57,58,59,60,61,62,63,64,65,66,67,68].
For classical commutative harmonic analysis, we consider the following groups:
G = Τ n = R n / Z n   for   Fourier   series ,   G = R n   for   Fourier   Transform G   group   character   ( linked   to   e i k x ) : χ : G U   with   U = { z C / | z | = 1 } G ^ = { χ / χ 1 . χ 2 ( g ) = χ 1 ( g ) χ 2 ( g ) }   and   Fourier   transform   is   given   by : φ : G C φ ^ : G ^ C g φ ( g ) = G ^ φ ^ ( χ ) χ ( g ) 1 d χ χ φ ^ ( χ ) = G φ ( g ) χ ( g ) d g
For non-commutative harmonic analysis, Group unitary irreductible representation is U : G U ( H ) with H Hilbert space and character by χ U ( g ) = t r U g . Fourier transform for non-commutative group is U φ = G φ ( g ) U g d g with character χ U ( g ) = t r U φ . If we describe group element with exponential map U ψ = g ψ ( X ) U exp ( X ) d X , we have:
trU ψ = dim τ . μ G . f ( ψ . j 1 ) ψ . j 1 : g g * ,   Four .   Transf .   with   { μ G . f :   Liouville   meas .   on   O = G . f , f g * μ G . f ( ψ . j 1 ) :   Integral   of   ψ . j 1 wrt   μ G . f  
where
j ( X ) = ( det s ( a d X ) ) 1 / 2   with   s ( x ) = n = 0 1 ( 2 n + 1 ) ! ( x 2 ) 2 n = s h ( x 2 ) / ( x 2 )
Kirillov Character formula is:
χ U ( exp ( X ) ) = trU exp ( X ) = j ( X ) 1 O e i f , X d μ O ( f )
O e i f , X d μ O ( f ) = j ( X ) trU exp ( X )   with   j ( X ) = ( det ( e a d X / 2 e a d X / 2 a d X / 2 ) ) 1 / 2  
We will use Kirillov representation theory and his character formula to compute Souriau covariant Gibbs density in the unit Poincaré disk. For any Lie group G , a coadjoint orbit O g * has a canonical symplectic form ω 0 given by KKS 2-form. As seen, if G is finite dimensional, the corresponding volume element defines a G -invariant measure supported on O , which can be interpreted as a tempered distribution. The Fourier transform (where d is the half of the dimension of the orbit O):
( x ) = O g * e i λ , x 1 d ! d ω O d   with   λ g *   and   x g
is Ad G -invariant. When O g * is an integral coadjoint orbit, Kirillov formula, given previously, expresses Fourier transform ( x ) by Kirillov character χ O :
( x ) = j ( x ) χ O ( e x )   where   j ( x ) = det 1 / 2 ( sinh ( a d ( x / 2 ) ) a d ( x / 2 ) )
χ O is, as defined previously, the “Kirillov character” of a unitary representation associated to the orbit.

6.2. Souriau Covariant Gibbs Density in Poincaré Unit Disk for SU(1,1) Lie Group

In the following, we will give the full development to compute the Souriau covariant Gibbs density. As the Gibbs density is not defined for all geometric temperature, as observed by Souriau, we have used his approach by considering a one-parameter subgroup of the Lie group generated by exponential map from a one element of Lie algebra given by geometric temperature. The subset of Lie algebra where the Gibbs density is deduced from the contraints related to this one-parameter subgroup generation.
Considering the Lie group S U ( 1 , 1 ) = { ( a b b * a * ) / a , b C ,   | a | 2 | b | 2 = 1 } and its Lie algebra given by elements s u ( 1 , 1 ) = { ( i r η η * i r ) / r R , η C } . A basis for this Lie algebra s u ( 1 , 1 ) is ( u 1 , u 2 , u 3 ) g with u 1 = i 2 ( 1 0 0 1 ) , u 2 = 1 2 ( 0 1 1 0 )   and   u 3 = 1 2 ( 0 i i 0 ) with [ u 1 , u 3 ] = u 2 , [ u 1 , u 2 ] = u 3 , [ u 2 , u 3 ] = u 1 .
The compact subgroup is generated by u 1 , while u 2 and u 3 generate a hyperbolic subgroup. The dual space of the Lie algebra is given by s u ( 1 , 1 ) * = { ( z x + i y x + i y z ) / x , y , z R } with the basis ( u 1 * , u 2 * , u 3 * ) g * with u 1 * = ( 1 0 0 1 ) , u 2 * = ( 0 i i 0 )   and   u 3 * = ( 0 1 1 0 ) .
Let consider D = { z C / | z | < 1 } be the open unit disk of Poincaré. For each ρ > 0 , the pair ( D , ω ρ ) is a symplectic homogeneous manifold with ω ρ = 2 i ρ d z d z * ( 1 | z | 2 ) 2 , where ω ρ is invariant under the action: S U ( 1 , 1 ) × D D ( g , z ) g . z = a z + b b * z + a * .
This action is transitive and is globally and strongly Hamiltonian. Its generators are the hamiltonian vector fields associated to the functions:
J 1 ( z , z * ) = ρ 1 + | z | 2 1 | z | 2   ,   J 2 ( z , z * ) = ρ i z z * 1 | z | 2   ,   J 3 ( z , z * ) = ρ z + z * 1 | z | 2
The associated moment map J : D s u * ( 1 , 1 ) defined by J ( z ) . u i = J i ( z , z * ) , maps D into a coadjoint orbit in s u * ( 1 , 1 ) . Then, we can write the moment map as a matrix element of s u * ( 1 , 1 ) :
J ( z ) = J 1 ( z , z * ) u 1 * + J 2 ( z , z * ) u 2 * + J 3 ( z , z * ) u 3 * = ( ρ 1 + | z | 2 1 | z | 2 ρ z z * 1 | z | 2 ρ z + z * 1 | z | 2 ρ z z * 1 | z | 2 + ρ z + z * 1 | z | 2 ρ 1 + | z | 2 1 | z | 2 ) J ( z ) = J 1 ( z , z * ) u 1 * + J 2 ( z , z * ) u 2 * + J 3 ( z , z * ) u 3 * = ρ ( 1 + | z | 2 1 | z | 2 2 z * 1 | z | 2 2 z 1 | z | 2 1 + | z | 2 1 | z | 2 ) g *
The moment map J is a diffeomorphism of D onto one sheet of the two-sheeted hyperboloid in s u * ( 1 , 1 ) , determined by J 1 2 J 2 2 J 3 2 = ρ 2   ,   J 1 ρ   with   J 1 u 1 * + J 2 u 2 * + J 3 u 3 * s u * ( 1 , 1 ) . We note O ρ + the coadjoint orbit A d S U ( 1 , 1 ) * of S U ( 1 , 1 ) , given by the upper sheet of the two-sheeted hyperboloid given by previous equation. The orbit method of Kostant-Kirillov-Souriau associates to each of these coadjoint orbits a representation of the discrete series of S U ( 1 , 1 ) , provided that ρ is a half integer greater or equal than 1 ( ρ = k 2 , k N   and   ρ 1 ). When explicitly executing the Kostant-Kirillov construction, the representation Hilbert spaces H ρ are realized as closed reproducing kernel subspaces of L 2 ( D , ω ρ ) . The Kostant-Kirillov-Souriau orbit method shows that to each coadjoint orbit of a connected Lie group is associated a unitary irreducible representation of G acting in a Hilbert space H.
Souriau has oberved that action of the full Galilean group on the space of motions of an isolated mechanical system is not related to any equilibrium Gibbs state (the open subset of the Lie algebra, associated to this Gibbs state is empty). The main Souriau idea was to define the Gibbs states for one-parameter subgroups of the Galilean group. We will use the same approach, in this case We will consider action of the Lie group S U ( 1 , 1 ) on the symplectic manifold (M,ω) (Poincaré unit disk) and its momentum map J are such that the open subset Λ β = { β g / D e J ( z ) , β d λ ( z ) < + }   is not empty. This condition is not always satisfied when (M, ω) is a cotangent bundle, but of course it is satisfied when it is a compact manifold. The idea of Souriau is to consider a one parameter subgroup of S U ( 1 , 1 ) . To parametrize elements of S U ( 1 , 1 ) is through its Lie algebra. In the neighborhood of the identity element, the elements of g S U ( 1 , 1 ) can be written as the exponential of an element β of its Lie algebra:
g = exp ( ε β )   with   β g
The condition g + M g = M   for   M = ( 1 0 0 1 ) can be expanded for ε < < 1 and is equivalent to β + M + M β = 0 which then implies β = ( i r η η * i r ) , r R , η C . We can observe that r and η = η R + i η I contain 3 degrees of freedom, as required. Also because det g = 1 , we get T r ( β ) = 0 . We can then exponentiate β with exponential map to get:
g = exp ( ε β ) = k = 0 ( ε β ) k k ! = ( a ε ( β ) b ε ( β ) b ε * ( β ) a ε * ( β ) )
If we make the remark that β 2 = ( i r η η * i r ) ( i r η η * i r ) = ( | η | 2 r 2 ) I , we can developed the exponential map:
g = exp ( ε β ) = ( cosh ( ε R ) + i r sinh ( ε R ) R η sinh ( ε R ) R η * sinh ( ε R ) R cosh ( ε R ) i r sinh ( ε R ) R )   with   R 2 = | η | 2 r 2
We can observe that one condition is that | η | 2 r 2 > 0 then the subset to consider is Λ β = { β = ( i r η η * i r ) , r R , η C / | η | 2 r 2 > 0 }   such that D e J ( z ) , β d λ ( z ) < + . The generalized Gibbs states of the full S U ( 1 , 1 ) group do not exist. However, generalized Gibbs states for the one-parameter subgroups exp ( α β ) , β Λ β , of the S U ( 1 , 1 ) group do exist. The generalized Gibbs state associated to β remains invariant under the restriction of the action to the one-parameter subgroup of S U ( 1 , 1 ) generated by exp ( ε β ) .
To go futher, we will develop the Souriau Gibbs density from the Souriau moment map J ( z ) and the Souriau temperature β Λ β   . If we note b = 1 1 | z | 2 [ 1 z ] , we can write the moment map:
J ( z ) = ρ ( 1 + | z | 2 1 | z | 2 2 z * 1 | z | 2 2 z 1 | z | 2 1 + | z | 2 1 | z | 2 ) = ρ ( 2 M b b + T r ( M b b + ) I )   with   M = [ 1 0 0 1 ]
We can the write the covariant Gibbs density in the unit disk given by moment map of the Lie group S U ( 1 , 1 ) and geometric temperature in its Lie algebra β Λ β :
p G i b b s ( z ) = e J ( z ) , β D e J ( z ) , β d λ ( z )   where   d λ ( z ) = 2 i ρ d z d z * ( 1 | z | 2 ) 2
p G i b b s ( z ) = e ρ ( 2 b b + T r ( b b + ) I ) , β D e J ( z ) , β d λ ( z ) = e ρ ( 1 + | z | 2 ( 1 | z | 2 ) 2 z * ( 1 | z | 2 ) 2 z ( 1 | z | 2 ) 1 + | z | 2 ( 1 | z | 2 ) ) , ( i r η η * i r ) D e J ( z ) , β d λ ( z )
To write the Gibbs density with respect to its statistical moments, we have to express the density with respect to Q = E [ J ( z ) ] . Then, we have to invert the relation between Q and β , to replace this last variable β = ( i r η η * i r ) Λ β by β = Θ 1 ( Q ) g where Q = Φ ( β ) β = Θ ( β ) g * with Φ ( β ) = log D e J ( z ) , β d λ ( z ) , deduce from Legendre tranform. The mean moment map is given by:
Q = E [ J ( z ) ] = E [ ρ ( 1 + | w | 2 ( 1 | w | 2 ) 2 w * ( 1 | w | 2 ) 2 w ( 1 | w | 2 ) 1 + | w | 2 ( 1 | w | 2 ) ) ]   where   w D
This mean moment map can be obtained by Karcher mean computation on the one-sheet hyperboloid corresponding to the coadjoint orbit. For the dual pairing, we can observed that J ( z ) = J 1 ( z , z * ) u 1 * + J 2 ( z , z * ) u 2 * + J 3 ( z , z * ) u 3 * g * with J 1 ( z , z * ) = ρ 1 + | z | 2 1 | z | 2 ,   J 2 ( z , z * ) = ρ i z z * 1 | z | 2 ,   J 3 ( z , z * ) = ρ z + z * 1 | z | 2 and β = β 1 u 1 + β 2 u 2 + β 3 u 3 g * with β = 2 ( r , η R , η I )   ,   η = η R + i η I .
The integral of normalization in Gibbs density could be computed through Kirillov character formula by χ m ( exp ( ( x . . x ) ) ) = j ( x ) 1 O m 1 + e i ( x . . x ) , ( i r η η * i r ) ω O m 1 + where
j ( x ) = det 1 / 2 [ sinh ( a d ( x / 2 x / 2 ) ) / a d ( x / 2 x / 2 ) ] = sinh ( x ) x
with following relation e m x 1 e 2 x j ( x ) = D e ( m 1 ) x 1 + | w | 2 1 | w | 2 1 ( 1 | w | 2 ) 2 d w d w * .
Recently, Enrico De Micheli [69] has introduced a Laplace-type transform (the so-called Spherical Laplace Transform) with a connection to the Non-Euclidean Fourier Transform in the sense of Helgason, and the principal series of the unitary representation of SU(1,1).

6.3. Extension to SU (p,q) Unitary Group for Siegel Unit Disk

Mode details are given in Appendix B, on parameterization of SU(1,1) and extension to SU (p,q). To address computation of covariant Gibbs density for Siegel Unit Disk, we will consider in this section S U ( p , q ) Unitary Group:
G = S U ( p , q )   and   K = S ( U ( p ) × U ( q ) ) = { ( A 0 0 D ) / A U ( p ) , D U ( q ) , det ( A ) det ( D ) = 1 }
We can use the following decomposition for g G C :
g = ( A B C D ) G C , g = ( I p B D 1 0 I q ) ( A B D 1 C 0 0 D ) ( I p 0 D 1 C I q )
and consider the action of g G C on Siegel Unit Disk S D = { Z M p q ( C ) / I p Z Z + > 0 } given by:
g = ( A B C D ) G C , g = ( I p B D 1 0 I q ) ( A B D 1 C 0 0 D ) ( I p 0 D 1 C I q )
Benjamin Cahen has study this case and introduced the moment map by identifing G-equivariantly g * with g by means of the Killing form β on g C :
g *   G equivariant   with   g by   Killing   form   β ( X , Y ) = 2 ( p + q ) T r ( X Y )  
The set of all elements of g fixed by K is 𝖍 :
𝖍 = { element   of   G   fixed   by   K }   ,   ξ 0 𝖍 , ξ 0 = i λ ( q I p 0 0 p I q ) ξ 0 , [ Z , Z + ] = 2 i λ ( p + q ) 2 T r ( Z Z + ) , Z D
Then, we the equivatiant moment map is given by:
X g C   ,   Z D , ψ ( Z ) = A d * ( exp ( Z + ) ζ ( exp Z + exp Z ) ) ξ 0 g G , Z D   then   ψ ( g . Z ) = A d g * ψ ( Z ) ψ   is   a   diffeomorphism   from   S D   onto   orbit   O ( ξ 0 )
with:
ψ ( Z ) = i λ ( ( I p Z Z + ) 1 ( p Z Z + q I p ) ( p + q ) Z ( I q Z + Z ) 1 ( p + q ) ( I q Z + Z ) 1 Z + ( p I q + q Z + Z ) ( I q Z + Z ) 1 )
ζ ( exp Z + exp Z ) = ( I p Z ( I q Z + Z ) 1 0 I q )

7. Lie Groups Thermodynamics for SE(2) Lie Group

After S U ( 1 , 1 ) Lie group with null cohomology and then without Souriau one-cocycle, we will consider Souriau model for S E ( 2 ) Lie group with non-null cohomology and then with introduction of Souriau one-cocycle [70].
We will consider first S O ( 2 ) Lie group:
S O ( 2 ) = { R φ = [ cos φ sin φ sin φ cos φ ] / φ R }
A vector at the identity to S O ( 2 ) is given by:
d R t η d t | t = 0 = η   with   = [ 0 1 1 0 ] , T = 1 =
We consider the special Euclidean group S E ( 2 ) = S O ( 2 ) × R 2 .
S E ( 2 ) = { [ R φ τ 0 1 ] / R φ S O ( 2 ) , τ R 2 }
the group operation is given by:
[ R φ 1 τ 1 0 1 ] [ R φ 2 τ 2 0 1 ] = [ R φ 1 R φ 2 R φ 1 τ 2 + τ 1 0 1 ] = [ R φ 1 + φ 2 R φ 1 τ 2 + τ 1 0 1 ] ( R 1 , τ 1 ) . ( R φ 2 , τ 2 ) = ( R φ 1 + φ 2 , R φ 1 τ 2 + τ 1 )
[ R φ 1 τ 1 0 1 ] 1 = [ R φ 1 R φ 1 τ 1 0 1 ] ( R φ 1 , τ 1 ) 1 = ( R φ 1 , R φ 1 τ 1 )
The Lie algebra s e ( 2 ) of S E ( 2 ) has underlying vector space R 3 and Lie bracket:
( ξ , u ) s e ( 2 ) = R × R 2 [ ξ u 0 0 ] s e ( 2 )
Lie bracket is given by:
[ ( ξ , u ) , ( η , v ) ] = ( 0 , ξ v + η u )
Adjoint action of S E ( 2 ) is given by:
A d ( R φ , τ ) ( ξ , u ) = [ R φ τ 0 1 ] [ ξ u 0 0 ] [ R φ R φ τ 0 1 ] = [ ξ ξ τ + R φ u 0 0 ] A d ( R φ , τ ) ( ξ , u ) = ( ξ , R φ u + ξ τ )
Coadjoint action of S E ( 2 ) is given by:
A d ( R φ , τ ) * ( m , ρ ) = ( m + R φ ρ . τ , R φ ρ )
The moment map J : R 2 s e * ( 2 ) of S E ( 2 ) is defined by:
J ( ξ , u ) ( x ) = J ( x ) . ( ξ , u )
with the right action of S E ( 2 ) on R 2 :
x . ( R φ , τ ) = R φ ( x τ )
the infinitesimal generator of ( ξ , u ) s e ( 2 ) has the expression:
( ξ , u ) R 2 ( x ) = d [ x . ( R t ξ , t u ) ] d t | t = 0 = d [ R t ξ ( x t u ) ] d t | t = 0 = ξ x u
Let J ( ξ , u ) ( x ) : R 2 s e * ( 2 ) be the moment map of this action relative to the symplectic form, we can compute it from its definition:
d J ( ξ , u ) ( x ) . y = 2 ω ( ( ξ , u ) R 2 , y ) with   ω ( ( ξ , u ) R 2 , y ) = ω ( ξ x u , y ) = ( ξ x u ) . y = ( ξ x + u ) . y d J ( ξ , u ) ( x ) . y = 2 ( ξ x + u ) . y J ( ξ , u ) ( x ) = 2 ( 1 2 ξ x 2 + u . x ) = 2 ( 1 2 x 2 , x ) . ( ξ , u ) J ( ξ , u ) ( x ) = J ( x ) . ( ξ , u ) J ( x ) = 2 ( 1 2 x 2 , x )   ,   x R 2
We then compute the one-cocycle of S E ( 2 ) from the moment map:
θ ( ( R φ , τ ) ) = J ( 0 . ( R φ , τ ) ) A d ( R φ , τ ) * J ( 0 ) = J ( R φ τ ) θ ( ( R φ , τ ) ) = 2 ( 1 2 τ 2 , R φ τ ) = 2 ( 1 2 τ 2 , R φ π 2 τ )
Coadjoint orbit of S E ( 2 ) are generated by:
O ( m , ρ ) = { A ( R φ , τ ) * ( m , ρ ) + θ ( ( R φ , τ ) ) / ( R φ , τ ) S E ( 2 ) } O ( m , ρ ) = { ( x R π 2 ρ . τ τ 2 , R φ ρ 2 R φ π 2 τ ) / ( R φ , τ ) S E ( 2 ) }
The Souriau Symplectic form in this case of non-null cohomology is given by:
ω ( m , ρ ) ( m , ρ ) ( a d ( ξ , u ) * ( m , ρ ) ( 0 , 2 u ) , a d ( η , v ) * ( m , ρ ) ( 0 , 2 v ) ) = ρ . ( ξ v + η u ) + 2 u . v with   ( m , ρ ) = ( x R π 2 ρ . τ τ 2 , R φ ρ 2 R φ π 2 τ ) O ( m , ρ ) R 3
With the expression of moment map, we can compute Souriau covariant Gibbs density of Maximum Entropy.
Considering the symplectic form ω ( ζ , υ ) = ζ . υ   with   = [ 0 1 1 0 ] on R 2 , we have seen that the action of SE(2) is symplectic and admits the momentum map, J ( x ) = ( 1 2 x 2 , x )   ,   x R 2 .
Souriau Gibbs density is defined for generalized temperature β Ω = { ( b , Β ) s e ( 2 ) / b < 0 , Β R 2 } and given by:
p G i b b s ( x ) = e J ( x ) , β R 2 e J ( x ) , β d λ ( x ) = e 1 2 b x 2 Β . x R 2 e 1 2 b x 2 Β . x d λ ( x )
The Massieu Potential could be computed:
Φ ( β ) = log R 2 e 1 2 b x 2 Β . x d λ ( x ) = log ( 2 π b e 1 2 b B 2 )
By derivation of Massieu potential, we can deduce expression of Heat:
Q Ω * = { ( m , M ) s e * ( 2 ) / m + M 2 2 < 0 } Q = Φ ( β ) β = ( 1 b Β 2 2 b 2 , 1 b Β ) = Θ ( β )
We can the inverse this relation to express generalized temperature with respect to the heat:
β = Θ 1 ( Q ) = ( ( m + 1 2 M 2 ) 1 , ( m + 1 2 M 2 ) 1 M )
We can the express the Gibbs density with respect to the Heat Q which is the mean of moment map:
p G i b b s ( x ) = e 1 2 x 2 M . x ( m + 1 2 M 2 ) R 2 e 1 2 x 2 M . x ( m + 1 2 M 2 ) d λ ( x )   with   ( m , M ) = E ( J ( x ) ) = E [ 2 ( 1 2 x 2 , x ) ] = [ E ( x 2 ) , 2 E ( x ) ]
So we can rewrite the Gibbs density:
p G i b b s ( x ) = e 1 2 x 2 + 2 E ( x ) . I x ( E ( x 2 ) + 2 E ( x ) 2 ) R 2 e 1 2 x 2 + 2 E ( x ) . I x ( E ( x 2 ) + 2 E ( x ) 2 ) d λ ( x )
We can also provide a Fisher metric in dual Lie algebra as hessian of the Entropy:
S ( Q ) = Q , β Φ ( β ) = 1 + log ( 2 π ) + log ( m M 2 2 )
I F i s h e r ( Q ) = ( m + 1 2 M 2 ) 1 [ I M T M T 1 2 M 2 m ]
and as ( m , M ) = E ( J ( x ) ) = E [ 2 ( 1 2 x 2 , x ) ] = [ E ( x 2 ) , 2 E ( x ) ] , Fisher metric in dual space of Lie Algebra parameterization could be written:
I F i s h e r ( Q ) = ( 2 E ( x ) 2 E ( x 2 ) ) 1 [ I ( 2 E ( x ) ) T 2 E ( x ) 2 E ( x ) 2 + E ( x 2 ) ]

8. New Entropy Definition as Generalized Casimir Invariant Functions for Coadjoint and Adjoint Representation

In his paper written in 1974, Jean-Marie Souriau has observed that if we consider the heat expression Q = d Φ d β , that we can write δ Φ Q , δ β = 0 . For each δ β tangent to the orbit, and so generated by an element Z of the Lie algebra, if we consider the relation Φ ( A d g ( β ) ) = Φ ( β ) θ ( g 1 ) , β and we differentiate it at g = e using the property that Θ ˜ ( X , Y ) = d θ ( X ) , Y   ,   X , Y g , we obtain Q , [ β , Z ] + Θ ˜ ( β , Z ) = 0 . Souriau has stopped by this last equation, the characterization of Group action on Q = Φ β . Souriau has also observed that S [ Q ( A d g ( β ) ) ] = S [ A d g * ( Q ) + θ ( g ) ] = S ( Q ) . We propose to characterize more explicitly this invariance, by characterizing Entropy as an invariant Casimir function in coadjoint representation.
From last Souriau equation, if we use the identities β = S Q , a d β Z = [ β , Z ] and Θ ˜ ( β , Z ) = Θ ( β ) , Z , then we can deduce that a d S Q * Q + Θ ( S Q ) , Z = 0 , Z . So, Entropy S ( Q ) should verify a d S Q * Q + Θ ( S Q ) = 0 , characterizes an invariant Casimir function in case of non-null cohomology, that we propose to write with Poisson brackets, { S , H } Θ ˜ ( Q ) = 0 where { S , H } Θ ˜ ( Q ) = Q , [ S Q , H Q ] + Θ ˜ ( S Q , H Q ) = 0 , H : g * R , Q g * .
In a Poisson manifold, Casimir functions S C ( g * ) , in case of null cohomology, are functions whose Poisson brackets will all functions vanish, { S , H } ( Q ) = 0   , S C ( g * ) , Q g * . In the dual of the Lie algebra of a connected Lie group G , the Casimir functions are the A d * -invariant functions, because if S , H C ( g * ) and Q g * , then { S , H } ( Q ) = Q , [ S Q , H Q ] = Q , a d S Q H Q = a d S Q * Q , H Q vanishes for all H C ( g * ) if and only if a d S Q * Q = 0 . A function is S on g * is A d * -invariant if g . S = S ,   g G where Lie group G acts on functions on g * by ( g . S ) ( Q ) = S ( A d g * Q ) ,   Q g * , S C ( g * ) , g G , and where infinitesimal characterizations of A d * -invariant functions on g * , d d t S ( A d exp ( t x ) * Q ) | t = 0 = a d x * Q , S Q = a d S Q * Q , x . The symplectic leaves of a Poisson manifold are contained in the connected components of the level sets of the Casimir functions and Casimir function is constant on a symplectic leaf. Coadjoint orbits lie on level sets of the Casimir functions, which are conserved quantities. Casimir functions Level sets are symplectic manifolds. Coadjoint motion of the moment map Q ( t ) = A d g ( t ) * Q ( 0 ) for a solution curve g ( t ) C ( G ) take place on the intersections of levels sets of the Hamiltonian and the Casimir functions. Alexis Arnaudon has studied stochastic coadjoint processes whose solutions lie on coadjoint orbits.
We have observed that { S , H } Θ ˜ ( Q ) = Q , [ S Q , H Q ] + Θ ˜ ( S Q , H Q ) = 0 , H : g * R , Q g * , that shows that Souriau Entropy is a Casimir function in case with non-null cohomology when an additional cocycle should be taken into account. Indeed, infinitesimal variation is characterized by the following differentiation: d d t S ( Q ( A d exp ( t x ) β ) ) | t = 0 = d d t S ( A d exp ( t x ) * Q + θ ( exp ( t x ) ) ) | t = 0 = a d S Q * Q + Θ ( S Q ) , x . We recover extended Casimir equation in case of non-null cohomology verified by Entropy, a d S Q * Q + Θ ( S Q ) = 0 , and then the generalized Casimir condition { S , H } Θ ˜ ( Q ) = 0 . Hamiltonian motion on these affine coadjoint orbits is given by the solutions of the Lie-Poisson equations with cocycle.
The identification of Entropy as an Invariant Casimir Function in Coadjoint representation is also important in Information Theory, because classically Entropy is introduced axiomatically. With this new approach, we can build Entropy by constructing the Casimir Function associated to the Lie group and also in case of non-null cohomology. Igor V. Shirokov [71,72,73,74,75] has proposed a method for constructing invariants of the coadjoint representation of Lie groups with an arbitrary dimension and structure based on local symplectic coordinates on the coadjoint orbits. The idea of the method of constructing coadjoint invariants is to construct the canonical transition to the Darboux coordinates on the orbits of the dual Lie algebra g * of maximal dimension dual to the Lie algebra g of the Lie group G . These relations provide invariants of the coadjoint representation of the Lie group G .
This geometric framework unifies several earlier works on the subject, including Souriau’s symplectic model of statistical mechanics, and approaches developed in Information Geometry and Quantum Information Geometry. This approach helps to identify the common geometric structures appearing in various domains from statistical mechanics to statistical learning. The emphasis is put on the role of the affine equivariance with respect to Lie group actions, as extension of the Fisher metric in presence of equivariance and the associated Lie-Poisson equations with cocycle (affine Lie-Poisson equations). The entropy of the Souriau model as a Casimir function can be used to apply a geometric model for energy preserving entropy production on Lie algebras. We can exploit the geometric framework of this new equation to build geometric numerical integrator schemes for some of the equations associated to Souriau’s model and its polysymplectic extension. This new equation is important because it introduce new structure of differential equations in case of non-null cohomology and for an arbitrary Hamiltonian H : g * R : d Q d t = a d H Q * Q + Θ ( H Q ) .
The equation d Q d t = a d H Q * Q + Θ ( H Q ) is important because it allows extending stochastic perturbation of the Lie-Poisson equation with cocycle within the setting of stochastic Hamiltonian dynamics, which preserves the affine coadjoint orbits. We can extend model for stochastic geometric modeling in fluid dynamics via variational principles described in [32,76]. This extension results in the new Stratonovich differential equation for the stochastic process d Q + [ a d H Q * Q + Θ ( H Q ) ] d t + i = 1 N [ a d H i Q * Q + Θ ( H i Q ) ] d W i ( t ) = 0 .
This new equation is also very usefull for geometric symplectic Lie group integrator for Lie-Poisson equations with cocycle that preserves the affine coadjoint orbits for general Hamiltonian. This equation is also very relevant in the framework of dynamics with Casimir dissipation/production, to formulate a dynamical geometric model for dissipation/production of this Casimir. This allows to extend the general Lie algebraic approach developed in [77,78] for Casimir dissipation, to take into account of a cocycle, and to a wider class of dissipation. Paper [17] will exploit this new Casimir equation in case of non-null cohomology.
This equation d Q d t = a d H Q * Q + Θ ( H Q ) could be used also to make the link with 2nd principle of Thermodynamique, that will be deduced from positivity of Souriau-Fisher metric:
S ( Q ) = Q , β Φ ( β )   with   d Q d t = a d H Q * Q + Θ ( H Q ) d S d t = Q , d β d t + a d H Q * Q + Θ ( H Q ) , β d Φ d t = Q , d β d t + a d H Q * Q , β + + Θ ( H Q ) , β d Φ d t d S d t = Q , d β d t + Q , [ H Q , β ] + Θ ˜ ( H Q , β ) d Φ d t = Q , d β d t + Θ ˜ β ( H Q , β ) Φ β , d β d t d S d t = Q , d β d t + Θ ˜ β ( H Q , β ) Φ β , d β d t   with   Φ β = Q d S d t = Θ ˜ β ( H Q , β ) 0 , H   ( link   to   positivity   of   Fisher   metric ) if   H = S S Q = Q d S d t = Θ ˜ β ( β , β ) = 0   because   β K e r Θ ˜ β
Entropy production is then linked with Souriau-Fisher structure, d S = Θ ˜ β ( H Q , β ) d t with Θ ˜ β ( H Q , β ) = Θ ˜ ( H Q , β ) + Q , [ H Q , β ]   Souriau tensor related to Fisher metric.

8.1. Casimir Invariant and Generalized Casimir Invariant

Hendrik Brugt Gerhard Casimir, a Dutch physicist, studied what is called Casimir operators and Casimir invariants (H. Casimir and Van der Waerden studied the SU(2) group, the group of isospin/angular momentum, as the model of the algebraic approach to the study of the unitary representations of semi-simple compact Lie groups). Kirillov has explained that Casimir operators are in one-to-one correspondence with polynomial invariants characterizing orbits of the coadjoint representation. Solutions are not necessarily polynomials and the nonpolynomial solutions are called generalized Casimir invariants. For certain classes of Lie algebras, all invariants of the coadjoint representation are functions of polynomial ones. In physics, Hamiltonians and integrals of motion of classical integrable Hamiltonian systems are not polynomials in the momenta [71,72,73,74,75,79,80,81,82,83,84,85,86,87,88,89,90,91,92].

8.2. Souriau Entropy as Generalized Casimir Invariant in Coadjoint Representation

In Souriau Lie groups Thermodynamics, we will see that coadjoint orbits lie on level sets of the Entropy that could be considered as a Casimir invariant function:
S : g * R Q S ( Q )
We will consider first the case of null-cohomology, Entropy as Casimir invariant function is a conserved quantity, because Casimir function has null Lie Poisson brackets functions [93,94]:
{ S , H } ( Q ) = Q , [ S Q , H Q ] = 0 , H : g * R , Q g *   ,   A , B = B ( A , B )   Cartan-Killing   form with   S ( Q ) = d d ε S ( Q + δ Q ) | ε = 0 = δ Q , S Q
We can observe that β = S Q , then:
Q , [ β , H Q ] = Q , a d β H Q = 0 , H : g * R , Q g *   ,   a d a b = [ a , b ]
We can also write:
Q , [ S Q , H Q ] = Q , a d S Q H Q = a d S Q * Q , H Q = 0   ,   H : g * R
It means that a d S Q * Q = a d β * Q = 0   ,   β = S Q . We can remark that if we note ( a d S Q * Q ) j = C i j k a d ( S Q ) i * Q k = 0 with C i j k the structure tensor, we observe that this equation is in fact the Casimir condition for invariant function in coadjoint representation as we will see hereafter. The restriction of the Lie-Poisson bracket to an orbit generates a symplectic structure on the orbit, called the KKS (Kirillov-Kostant-Souriau) structure, or the canonical symplectic structure. Casimir function is characterized as a quantity which commutes with each linear functional on the Poisson manifold, and then it is conserved by dynamics of any Hamiltonian.
Given a Hamiltonian H : g * R , the equation of motion for Q g * is:
d Q d t = { Q , H } = a d H Q * Q   with   H = S d Q d t = { Q , S } = a d S Q * Q = 0
In case of non-null cohomology, the Lie Poisson brackets functions are given by:
{ S , H } Θ ˜ ( Q ) = Q , [ S Q , H Q ] + Θ ˜ ( S Q , H Q ) = 0 , H : g * R , Q g * with   Θ ˜ ( X , Y ) = J [ X , Y ] { J X , J Y }   where   J X ( x ) = J ( x ) , X Θ ˜ ( X , Y ) : g × g with   Θ ( X ) = T e θ ( X ( e ) ) X , Y Θ ( X ) , Y
That we can develop in the following:
{ S , H } Θ ˜ ( Q ) = Q , [ S Q , H Q ] + Θ ( S Q ) , H Q = 0 { S , H } Θ ˜ ( Q ) = Q , a d S Q H Q + Θ ( S Q ) , H Q = 0 { S , H } Θ ˜ ( Q ) = a d S Q * Q , H Q + Θ ( S Q ) , H Q = 0 H , { S , H } Θ ˜ ( Q ) = a d S Q * Q + Θ ( S Q ) + , H Q = 0 a d S Q * Q + Θ ( S Q ) = 0
We have found the generalized Casimir equation for Entropy in the non-null cohomology case:
{ S , H } Θ ˜ ( Q ) = 0
That could be also written:
a d S Q * Q + Θ ( S Q ) = 0
This equation was observed by Souriau in his paper of 1974, where he has written that geometric temperature β is a kernel of Θ ˜ β , that is written:
β K e r Θ ˜ β Q , [ β , Z ] + Θ ˜ ( β , Z ) = 0
That we can develop to recover the Casimir equation:
Q , a d β Z + Θ ˜ ( β , Z ) = 0 a d β * Q , Z + Θ ˜ ( β , Z ) = 0 β = S Q a d S Q * Q , Z + Θ ˜ ( S Q , Z ) = a d S Q * Q + Θ ( S Q ) , Z = 0 , Z a d S Q * Q + Θ ( S Q ) = 0
Then the generalized Casimir Equation in non-null cohomogy is given by:
( a d S Q * Q ) j + Θ ( S Q ) j = C i j k a d ( S Q ) i * Q k + Θ j = 0
Given a Hamiltonian H : g * R , the equation of motion for Q g * is:
d Q d t = a d H Q * Q + Θ ( H Q )   with   H = S d Q d t = a d S Q * Q + Θ ( S Q ) = 0
Level sets of the Casimir Entropy function, on which the coadjoint orbits lie, are symplectic manifolds.

8.3. Souriau Entropy Invariance in Coadjoint Representation

If we note 𝕬 𝖓 ( g * ) the space of analytic function on the dual space of the Lie agebra g * , a function F * 𝕬 𝖓 ( g * ) is a Casimir invariant if for any g G , X g * , we have F * ( A d g * X ) = F * ( X ) . We have observed previously that Souriau’s Entropy analytic function S ( Q ) defined on dual space of the Lie algebra g * by Legendre transform of Massieu Characteric analytic function Φ ( β ) (minus logarithm of Laplace transform) defined on Lie algebra g was an invariant function under the affine coadjoint action S [ Q ( A d g ( β ) ) ] = S [ A d g * ( Q ) + θ ( g ) ] = S ( Q ) . In case of null-cohomology, Souriau cocycle cancels θ ( g ) = 0 , and we recover Casimir invariant function in coadjoint representation S [ A d g * ( Q ) ] = S ( Q ) .
We can then observe that Souriau Entropy is an extended Casimir invariant function in case of non-null cohomogy. This characteristic of Souriau Entropy could be a new characterization of Entropy. In Souriau Lie groups Thermodynamics, Entropy S ( Q ) is a generalized Casimir invariant function for coadjoint representation in case of non-null cohomology, and Massieu Characteristic function by Legendre duality is a generalized Casimir function for adjoint representation.
We will explain how to prove that Souriau Entropy is invariant under the action of the group, starting from its definition:
S ( Q ) = Q , β Φ ( β )   with   Q = Φ ( β ) β g *   and   β = S ( Q ) Q g
with
Φ ( β ) = log M e U ( ξ ) , β d λ ω   and   U : M g *
Considering Souriau Entropy S ( Q ) where the heat Q = Φ ( β ) β g * an element of the dual space of the Lie algebra is parameterized by β g an element of the Lie algebra, the Lie group G acts through g G by adjoint operator A d g , the entropy is given by S [ Q ( A d g ( β ) ) ] with Q ( A d g ( β ) ) given by fundamental Souriau equation:
Q ( A d g ( β ) ) = A d g * ( Q ) + θ ( g )
The invariance of Souriau Entropy is deduced from the following developments:
β g A d g ( β ) Ψ ( A d g ( β ) ) = M e U , A d g ( β ) d λ ω Ψ ( A d g ( β ) ) = M e A d g 1 * U , β d λ ω = M e U ( A d g 1 β ) θ ( g 1 ) , β d λ ω Ψ ( A d g ( β ) ) = e θ ( g 1 ) , β Ψ ( β ) θ ( g 1 ) = A d g 1 * θ ( g ) Ψ ( A d g ( β ) ) = e A d g 1 * θ ( g ) , β Ψ ( β ) Φ ( β ) = log Ψ ( β ) Φ ( A d g ( β ) ) = Φ ( β ) θ ( g 1 ) , β = Φ ( β ) + A d g 1 * θ ( g ) , β
Based on this expression of Massieu Characteristic function transform by action of the group, we can use Legendre transform to study how Souriau Entropy is changed:
S ( Q ) = Q , β Φ ( β ) S ( Q ( A d g β ) ) = Q ( A d g β ) , A d g β Φ ( A d g β ) { Q ( A d g ( β ) ) = A d g * ( Q ) + θ ( g ) Φ ( A d g ( β ) ) = log Ψ ( A d g ( β ) ) = θ ( g 1 ) , β + Φ ( β ) S ( Q ( A d g β ) ) = A d g * ( Q ) + θ ( g ) , A d g β + θ ( g 1 ) , β Φ ( β ) S ( Q ( A d g β ) ) = A d g * ( Q ) + θ ( g ) , A d g β A d g 1 * θ ( g ) , β Φ ( β ) S ( Q ( A d g β ) ) = A d g 1 * A d g * ( Q ) + A d g 1 * θ ( g ) , β A d g 1 * θ ( g ) , β Φ ( β ) A d g 1 * A d g * ( Q ) = Q S ( Q ( A d g β ) ) = Q , β Φ ( β ) = S ( β )
We finally prove that Souriau Entropy is invariant in coadjoint representation S ( A d g * ( Q ) + θ ( g ) ) = S ( β ) in general case of non-null cohomology, that we could write S ( A d g # ( Q ) ) = S ( β ) , if we note affine coadjoint action A d g # ( Q ) = A d g * ( Q ) + θ ( g ) . This is also true in case of null-cohomology when the Souriau cocycle cancels θ ( g ) = 0 , and we recover classical generalized Casimir invariant function definition on coadjoint representation for Entropy S ( A d g * ( Q ) ) = S ( β ) generalized Casimir invariant function definition on adjoint representation for Massieu Characteristic function Φ ( A d g ( β ) ) = Φ ( β ) .

8.4. Souriau Entropy Given by Casimir Invariant Functions Equations

Based on development given in the following we can state that:
As the Entropy S is a generalized Casimir invariant function in the coadjoint representation, S ( A d e t ξ * h ) = S ( h ) , then S should be solution of the following differential equation:
C i j k Q k S ( Q ) Q j = 0   ,   i , j , k = dim g ,   with   { C i j k Q k = C i j ( Q ) = B i j B Q ( x , y ) = B i j x i y j = Q , [ x , y ]
where C i j k is the structure tensor of the Lie algebra g in the basis ( e 1 , e 2 , , e n ) , while X k are the coordinates in g * in the basis ( e 1 , e 2 , , e n ) defined by e j , e i = δ i j . The structure tensor s given by [ ϕ ( e i ) , ϕ ( e i ) ] = C i j k ϕ ( e k ) with ϕ ( e i ) = C i j k X k X j   ,   i = 1 , , n .

8.5. Characterization of Generalized Casimir Invariant Functions in Coadjoint Representation

We will describe recent characterization of generalized Casimir invariant functions by Oleg L. Kurnyavko and Igor V. Shirokov [72,73,75] who have proposed Algebraic method for construction of Casimir invariants of Lie groups coadjoint representations (see Appendix C). Modern invariant theory based on geometric methods, which was credited classically as non-constructive, has some exception admitting a constructive solution related to the constructing invariants of Lie groups representations.
Let T be a connected Lie group, T ( G ) a representation of the group G in the linear space V , T g the operators associated to the representation of the group G on the linear space V , then the invariants are given by the following equation:
F ( T g x ) = F ( x )   ,   x V , g G , T g T ( G ) , F ( x ) C ( V )
With the properties that:
T e = I   ,   T g a g b = T g a T g b   ,   T g 1 = ( T g ) 1
Solution is given by the following differential equation:
i , j dim V t k j i x j F ( x ) x i = 0   with   t k j i = ( T g ) j i g k | g = e   and   k = 1 , , dim G
t k j i are elements of the matrices of the Lie algebra representation basis of G .
That we can write t k = t k j i x j x i and t k F ( x ) = 0 .
If we consider the dual space V * , the co-tangent representation is given by:
T * ( g ) X , T ( g ) x = X , x
And co-represnetation invariants are given by:
t k * F * ( X ) = 0   with   t k * = t k j i X i X j
They have underlined the relationship between invariants of representations and conjugate representations, where the algebraic construction of Lie groups representations invariants are given by invariants of the conjugate representation with respect to the invariants of the original representation.
Shirokov Theorem 1. 
Let F ( x ) be a non-degenerate invariant of the representation T ( G ) , then conjugate representation invariant can be found by Legrendre tranform:
F * ( X ) = x i X i F ( x ) = x , X F ( x )   with   X = F ( x ) x   such   that   X i = F ( x ) x i
and also the converse problem:
F ( x ) = x i X i F * ( X ) = x , X F * ( X )   with   x = F * ( X ) X   such   that   x i = F * ( X ) X i
Shirokov has considered F ( x ) the representation invariant T ( G ) , and F * ( X ) the representation invariant T * ( G ) conjugate to T ( G ) , with the conditions:
t k j i x j F ( x ) x i = 0   and   t l j i X i F * ( X ) X j = 0
t l j i X i F * ( X ) X j = t l j i X i X j [ x k ( X ) X k F ( x ( X ) ) ] = t l j i X i x k X j X k + t l j i X i x k X k X j t l j i X i F ( x ) x k x k X j t l j i X i F * ( X ) X j = t l j i X i x k X j F ( x ) x k + t l j i F ( x ) x i x k δ k j t l j i F ( x ) x i F ( x ) x k x k X j t l j i X i F * ( X ) X j = t l j i x j F ( x ) x i = 0
Invariant Casimir Functions of the coadjoint representation has been studied for completely integrable Hamiltonian systems, as classical systems on the orbits of the coadjoint representation. Oleg L. Kurnyavko and Igor V. Shirokov have considered the relationship between invariants of representations of Lie groups and their conjugate dual representations.
Considering the coadjoint action given by:
A d g * X , x = X , A d g 1 x   ,   g G , X g * , x g
Invariants of a coadjoint representation are called Casimir functions, with the property:
F * ( A d g * X ) = F * ( X )