Abstract
Due to the principle of minimal information gain, the measurement of points in an affine space V determines a Legendrian submanifold of . Such Legendrian submanifolds are equipped with additional geometric structures that come from the central moments of the underlying probability distributions and are invariant under the action of the group of affine transformations on V. We investigate the action of this group of affine transformations on Legendrian submanifolds of by giving a detailed overview of the structure of the algebra of scalar differential invariants, and we show how the scalar differential invariants can be constructed from the central moments. In the end, we view the results in the context of equilibrium thermodynamics of gases, and notice that the heat capacity is one of the differential invariants.
1. Introduction
It has been known since the work of Gibbs [1,2] that thermodynamics can be formulated in the language of contact geometry. Because the fundamental thermodynamic relation takes the form of a contact structure on an odd-dimensional manifold, closed systems in thermal equilibrium correspond to Legendrian submanifolds with respect to this contact structure. More recently, some effort has been devoted towards studying an additional metric structure in the context of thermodynamics (see for example [3,4] and references therein).
Both the contact structure and the metric can be given statistical interpretations [5,6]. The principle of minimal information gain (similar to Jaynes’ principle of maximal entropy, [7]), applied to a random vector taking values in an affine space V, gives a contact structure on . The variance of the random vector determines a metric on Legendrian submanifolds. In fact, for each integer , the kth central moment gives a symmetric k-form on Legendrian submanifolds [6].
In this framework, the group of affine transformations on V plays an important role, as it acts on , and on the space of Legendrian submanifolds, while preserving the geometric structures mentioned. If we treat Legendrian submanifolds that are related by a transformation in this group as equivalent, it is clear that the important quantities are those that are invariant under this group action. Such quantities will be the main focus of this paper. In particular, we give a detailed description of the differential algebra of scalar differential invariants, building upon the work done in [8].
The paper is organized as follows. We start in Section 2 by recalling how the process of measuring points in an affine space leads to symplectic and contact geometry, following [6]. We explain how Legendrian submanifolds come equipped with symmetric k-forms corresponding to kth central moments. We end the section by discussing the action of the Lie group of affine transformations on V. In Section 3, we recall notions from the geometric theory of PDEs and explain the concept of scalar differential invariants. We show how the group acts on the jet spaces, and that its Lie algebra of vector fields is the largest Lie algebra of vector fields on V that preserves the central moments. In Section 4, we use the central moments to find generators for the field of rational scalar differential invariants. We compute the Hilbert and Poincaré functions for the field. We also find invariant derivations, and give a finite set of generators for the set of differential invariants, as a differential algebra. We compute differential syzygies for the case . In Section 5, we apply our results to the thermodynamics of gases in equilibrium. We find a generating set of differential invariants and invariant derivations with respect to , and we notice that the heat capacity (at constant pressure) is one of the fundamental scalar differential invariants. We also compute differential invariants with respect to two different subgroups of before we finish with a discussion of the significance of differential invariants.
2. From Random Vectors to Differential Geometry
We start by describing how the principle of minimal information gain applied to the measurement of points in an affine space leads to contact and symplectic geometry. We follow closely the approach of [6].
2.1. Measuring Points in Affine Spaces
A random vector is a map from a probability space to the affine space :
Here is the sample space, the -algebra of “events”, and a probability measure. We interpret X as a measurement of if
By choosing an affine frame in V, the integral above can be defined coordinate-wise.
The measurement of a different vector is made by changing the probability measure from to , using the probability measure as a control parameter, in a way so that . We now find conditions that determine .
Assume first that is absolutely continuous with respect to . Then the Radon-Nikodym theorem implies that . We require to satisfy the conditions
In addition we require to be the probability distribution “closest” to in the sense that it minimizes the information gain, or the Kullback-Leibler divergence:
This is the principle of minimal information gain. It is similar to Jaynes’ maximum-entropy principle [7]. As Jaynes put it, it is the only unbiased assignment we can make. Note that Jensen’s inequality gives . Moreover, we have if .
Minimizing under the constraints (1) gives
where is a covector of Lagrange multipliers corresponding to the constraints given by the first equation of (1), and is the partition function (see [6] for details). Thus is determined by , and implies .
Let be a simply connected domain, containing 0, on which is defined and smooth. Due to the first condition of (1), we have . The differential is an element in which can be identified with . In this identification, is identified with . By defining , we get
This defines an n-dimensional manifold
By choosing an affine frame on V with origin (and its dual frame on ), we get coordinates on V and on . In these coordinates, the linear basis of the vector space associated to V can be identified with , and L is then given by the n equations .
The space is a symplectic space equipped with the symplectic form
and L is a Lagrangian submanifold: . We use the Einstein summation convention, and sum over repeated indices.
Since the information gain depends on , and therefore on , it can be considered as a function on L. By using (2) we get
on L. If the point can be “measured”, then Equation (3) can be solved for . If is a (local) solution, we may write
Let u be a coordinate on . We have the relation , so the submanifold
is Legendrian with respect to the contact form on .
The Legendrian submanifold depends on the initial random vector. In this way, the statistical object we started with is translated into a geometric object . This shows that our model of physical measurements, based on random vectors and the principle of minimal information gain, is always accompanied by a form of thermodynamics, where the contact form plays the role of the fundamental thermodynamic relation.
We note that in some cases, it is convenient to hide the information gain from our consideration and consider the Lagrangian submanifold instead of . We will switch between these two viewpoints.
2.2. Central Moments
Let continue to denote the Legendrian manifold corresponding to the random vector X. It can be parametrized either by the measured quantity or by the parameter (we will mostly use the latter choice).
The kth moment of X (with respect to ) is defined by
It depends on and defines a symmetric tensor on . We continue to use coordinates on V and on , coming from the choice of affine frame on V with origin and linear basis . If we write , we get
Assuming that the function Z is smooth, the differentiation is symmetric, and thus is a symmetric k-form defined on . The equality
can be shown by induction, using (2).
The kth central moment of X is defined as the kth moment of , where
They are related through the formula
where the product is the symmetric tensor product. Note that the tensor is completely determined by , or even by the corresponding Lagrangian manifold .
The second central moment, or variance, determines a positive definite metric
on Legendrian submanifolds. It can also be given locally as . Or if we consider only the Lagrangian submanifold , defined by n functions , it can be given by . Notice that the positive definiteness of puts conditions on . Thus, the measurement process does not lead to arbitrary Legendrian manifolds, but only to ones where the symmetric k-forms can be properly interpreted as central moments.
The metric has been studied previously in different contexts. In particular it was treated by Ruppeiner [3,4] as a metric on V (in our notation). Ruppeiner was also interested in computing and giving a physical interpretation to its scalar curvature. However, as was pointed out in [6], and which we will discuss further in this paper, the group which appears naturally in this context is smaller than the full Lie pseudogroup of diffeomorphisms. As a result, the scalar curvature is not the most fundamental invariant. In [5], was treated as a metric on Legendrian submanifolds of in the same way as we do in this paper. There, they also showed that it can be extended to a nondegenerate metric on .
2.3. The Action of the Group of Affine Transformations
In the framework outlined above, we get quite naturally an action of the affine group . One can view it as coming from the arbitrary choice of affine frame on V. The action extends to in the obvious way:
This Lie group action preserves the contact structure on (or the symplectic structure on ). Thus, it also gives an action on the space of Legendrian (or Lagrangian) submanifolds. Moreover, the tensor is preserved by for every . This will be discussed further in the next section. In particular, we will show in Theorem 2 in Section 3.2 that the Lie algebra of the -action is the largest Lie algebra of vector fields on V that preserves the tensor . Because of this, plays an important role in the theory of measurements, and this Lie group will be the focus of our attention in most of this paper.
We also have the invariant 0-form and 1-form . The latter form vanishes when restricted to , so we consider instead which also is invariant since u, and therefore , is. Since on , we have . And thus
Remark 1.
Notice that there may be situations for which it is desirable to consider proper subgroups ofonly, perhaps because the space V has additional structure. For example, in the case where V is a vector space, it may be more appropriate to consider only the action of. In this case, the regular kth momentwill be invariant.
3. Jets and PDEs
The theory of jet spaces lets us treat functions, sections of bundles, submanifolds of a fixed dimension and, more generally, solutions of PDEs geometrically. In particular it gives a transparent picture of the algebra of scalar differential invariants. We will use most of this section to fix notation and definitions, sufficient for our use, and refer to the standard literature (for example [9,10,11]) for details. We recommend [12] for a comprehensive introduction the theory of jet spaces and differential invariants, and [13] for a more concise overview. The paper [14] can be added to either one of these as an updated treatment of the theory of scalar differential invariants.
3.1. Jets
We have seen that the Legendrian submanifolds in can be represented, locally, by a function on V. Let denote the space of k-jets of functions on V. It is a bundle over V. As coordinates on we will use
We have
By identifying with we get an identification of with , and the contact form on is identified with the contact form on . A Legendrian submanifold of can be prolonged canonically to an n-dimensional submanifold , by requiring that is an integral manifold of the Cartan distribution on .
Alternatively, we may remove information gain from the picture, and consider Lagrangian submanifolds of with the symplectic form . (The information gain may be recovered later, up to an additive constant, by solving the system .)
Let denote the space of k-jets of n-dimensional submanifolds of . Its dimension is given by
The symplectic form defines a PDE . A submanifold is a Lagrangian submanifold if and only if its 1-jets are contained in . Using coordinates on , a Lagrangian submanifold L is locally determined by n functions for . (For thermodynamics of gases this corresponds to writing the internal energy and volume as functions of temperature and pressure.) Since on L, the restriction of the symplectic form to L is given by
Thus, the manifold L is Lagrangian () if and only if
If we use the coordinates
on , it is clear that the Equation (5) define .
By differentiating the equations with respect to the variables we get additional equations of order two. We add these to the original set of first-order equations, and we denote the corresponding manifold in by . Similarly, we get submanifolds for every positive integer k by adding all derivatives (of appropriate order) of the first-order equations. We will also use the notation .
By counting the number of equations, we easily get the following statement.
Theorem 1.
The dimension of is given by
for .
Note that by throwing away the information gain from the picture, we get a natural projection , which is reflected in the counting above.
The central moment can be interpreted as a horizontal symmetric form on (or on ). Loosely speaking, this means that it is defined by the same formula as before, only that its coefficients is now considered as functions on (or on ). For example, in the formula for , the coefficient is viewed as a function on rather than a function on . Indeed, it is one of the coordinate functions on . If a Lagrangian manifold is given, can be restricted to the kth prolongation of L and get back its original meaning. In this way, the theory of jet spaces lets us work with functions and tensors on L that depend on and their derivatives without specifying L.
3.2. The Action of the Affine Group on and
In Section 2.3 we explained how acts on . The action on V induces uniquely an action on which preserves the contact form . This also gives a group action of on . More generally, a transformation on uniquely induces a transformation on for every positive integer k (it transforms functions on V and, as a consequence, their k-jets). In this way, the -action can be prolonged to . The same is true for the corresponding Lie algebra of vector fields defined on .
We illustrate this by prolonging a vector field on the base of to a vector field on , and refer to [10] for a more general treatment. Consider the vector field on . The unique vector field on that preserves the Cartan distribution and projects to X is given by
Note that when are affine functions, this corresponds exactly to the -action described in Section 2.3. The prolongation of X to is given by
In this formula is assumed in the summation, and is the total derivative operator. The total derivative is used in order to avoid expressions such as , which is not a function on . In the case , we have . For , we have .
Remark 2.
In a similar way we get an -action on for . Notice that since preserves the symplectic structure on , it preserves the subset , and we may consider the -action on .
When we say that is -invariant, we mean that it is invariant as a horizontal symmetric form with respect to the -action on . More precisely, we have for every , where is understood as a transformation .
Now, let us see which conditions we get on the functions if we require the vector field from above to preserve the variance (i.e. ). Here . We have
and
The terms of that depend on first-order derivatives of cancel, and we see that vanishes if and only if all second-order partial derivatives of vanish, implying that are affine functions on V. We get the following statement.
Theorem 2.
Let be a vector field on . Then X preserves (i.e. ) if and only if X is an affine vector field.
The fact that affine vector fields preserve the tensors and that, due to the above theorem, they are the only vector fields on V to do so, emphasizes the importance of the -action in the context of measurements.
3.3. Differential Invariants
Since acts on , we can look for functions on that are -invariant.
Definition 1.
A (scalar) differential invariant of order k is a function on which is constant on -orbits.
Notice that a differential invariant of order k is also a differential invariant of order . This determines a filtration on the algebra of differential invariants.
Due to the natural projection , any function on gives a function on . Essentially, the only thing we loose by considering functions on , is the zero-order invariant u corresponding to the information gain. In the remaining sections we will mostly consider Lagrangian submanifolds of , and therefore we will usually not mention the information gain explicitly in our description of differential invariants. Notice also that, by our definition, an invariant defined on will be of order and not, in general, of order k.
We follow [14] and consider the field of rational differential invariants. This is sufficient for separating orbits in general position as the group action, and the PDE on which it acts, is algebraic. More precisely, for every , there is a Zariski-open subset of on which the field of rational differential invariants of order k separate orbits. The set of rational differential invariants can also be considered as a differential algebra. By using invariant derivations, the differential algebra of differential invariants can be finitely generated.
4. Scalar Differential Invariants
In this section, we construct scalar differential invariants from the symmetric tensors and the 1-form which was discussed in Section 2.3. We end up with a transcendence basis for the field of differential invariants of order k, for every k, and we find a finite set of generators for the differential algebra of differential invariants. Differential syzygies are computed for the case .
As is common when looking for differential invariants, we will be satisfied with obtaining a number of independent differential invariants of order k equal to the codimension of an orbit in general position in . In other words, we will generate a differential algebra of differential invariants which, for each order k, contains a transcendence basis for the field of differential invariants of order k. In general, the field of rational differential invariants of order k will be an algebraic extension of the field generated by the transcendence basis. In this section, we will focus on Lagrangian submanifolds of and therefore, in most cases, disregard the differential invariant of order zero corresponding to the information gain.
Remark 3.
All results in this section may be useful also if we want to consider a subgroup , for example due to preservation of additional structure on V. All scalar and tensorial invariants will be invariant also with respect to G. The essential changes in the algebra of scalar differential invariants will occur on the level of third-order invariants, where new invariants will appear.
4.1. From Invariant Symmetric Forms to Scalar Invariants
We will now find the scalar differential invariants. We use , and to construct an invariant frame on .
- The symmetric 2-form is nondegenerate, and is used to construct a vector field .
- By using again, we turn the symmetric 2-form into a linear map .
- We use A to define the additional vector fields for .
We require that constitute a frame on L. This puts conditions on A, and therefore on the 2-jets of L. These conditions hold on a Zariski-open subset of .
Note that all the steps above can be taken without fixing the manifold L if we treat and for as horizontal symmetric forms on . The inverse of is then given by , where is the total derivative. Note that . Thus, and , where are (rational) functions on determined by the algorithm above. These are horizontal vector fields on .
- 4.
- The functions are rational, scalar differential invariants.
Remark 4.
Note that this idea has previously been used, with slight modifications, when considering subgroups of . Differential invariants of the actions of the Euclidean group and the general linear group on functions were considered in [15,16], respectively.
The next step is justifying that we can find, among these differential invariants that we have constructed, a transcendence basis for the field of differential invariants of order k, for each order k.
Lemma 1.
There is only one independent differential invariant of order 2. It can be given in coordinates by
Proof.
Consider the action of on the 2-jet of the information gain at the point . By using translations on V, we may set , so that the 2-jet is given by . The linear and quadratic terms can be encoded in terms of a one-form and a nondegenerate metric, respectively (in fact, by and ). The action by (the stabilizer of 0) preserves the degree of monomials, and we can use it to normalize the quadratic terms to get . Next, we use the stabilizer of the quadratic form, and rotate the expression into . We have used up all the freedom the group gives us, and the 2-jet is brought to normal form. There is only one free constant c, in addition to , so there is at most one invariant of second order in addition to the zero-order information gain, namely the length of the one-form (or its square). □
Before we consider the differential invariants of third order, we consider the invariants of higher order, which is an easier task. At points in general position in , the horizontal vector fields are linearly independent. Notice that the function is a sum of and lower-order terms. Since the coefficients of depend on , the invariants , with , are algebraically independent when restricted to a generic fiber of . This argument is easily extended to , and we obtain the following statement.
Lemma 2.
The differential invariants , with and , are algebraically independent. Moreover, for each , the number of independent functions among them, when restricted to a generic fiber of , is equal to the dimension of the fiber.
We deduce from the lemma that the -action is locally free on , meaning that the dimension of an orbit in general position in is equal to . Since we have , the codimension of an orbit in general position is given by
Therefore, in addition to the invariant of order two, there exist algebraically independent differential invariants of order three. We will verify that we have found all of them.
It is clear from the construction of the invariant frame that we have the relations
Note how this is consistent with the fact that there is only one second-order invariant . It also follows that there will be relations between :
In two dimensions this holds trivially, since is symmetric. In three dimensions the additional relation takes the form
In four dimensions we get two additional relations:
For we get relations between the components of . The difference is .
Lemma 3.
The algebraic relations between the differential invariants , with , are exactly the relations given by (6).
Proof.
It is sufficient to prove that there are algebraically independent functions in this set of differential invariants. We will do this by restricting to a suitable -dimensional submanifold in , and show that there are independent functions among the invariants, also after restricting to this submanifold. After the appropriate submanifold is found, the statement of the lemma follows quite easily.
First we restrict the invariants to the fiber of given by
On this fiber, the expressions for the tensors and are significantly simplified:
We restrict further to a submanifold of this fiber, of codimension , by setting
On this subset, we have
and, in general, the only nonzero components of are in the -directions for . Moreover, the coefficients for depend only on the variables and in addition to the variables of which depend. This allows for an induction argument, since the expression for is independent of . We have
For we get
These four functions are clearly independent, and thus the lemma is proved in the case . With an appropriate choice of ordering of the functions , each new function depends on the same variables as the previous function in addition to , for .
Due to the particular form of and , when restricted to our submanifold, it is clear that this type of pattern will continue. By extending the sequence of functions, with one at a time, each variable will be introduced with the function , where and and (this also indicates the appropriate ordering). There are two possible obstructions to this, and we have to justify that they are, in fact, not obstructions.
Firstly, the function will depend on the same variables as the four we found above, in addition to . This new variable is zero on our submanifold, and thus is not independent from the previous ones. However, this is exactly what we want, as we are already aware of the relation . The situation is similar for and , and so on.
Secondly, even though the derivations depend in nontrivial ways on the variables, this fact does not interfere with the pattern. The potential new variables coming from the derivations will always be introduced (to our sequence of functions) with , and these new variables would be and . The first one was introduced already by , and the second is exactly the one that should be introduced here, as it is equal to . The fact that is squared in is a consequence of this. □
These lemmas imply that we have in the following sense constructed a complete set of rational differential invariants.
Theorem 3.
For every order k, the field generated by and , where and generates a field of differential invariants with transcendence degree equal to the codimension of an -orbit in general position in .
Let denote the codimension of an -orbit in , in general position, for . This number is the same as the transcendence degree of the field of rational scalar differential invariants of order k. The Hilbert function for the filtered field of differential invariants is defined as for , and . From the discussion above we deduce the following statement.
Theorem 4.
The Hilbert function for the field of differential invariants is given by
Note that for , the formula for coincides with that for , as both gives 4.
We define the Poincaré function corresponding to the Hilbert function by the series which converges to a rational function for .
Theorem 5.
The Poincaré function for the field of differential invariants is given by
4.2. The Differential Algebra of Differential Invariants
The differential algebra of differential invariants is finitely generated (see [14] and references therein). The invariant horizontal vector fields act on the differential invariants as derivations, resulting in new differential invariants which, in general, are of higher order. It is clear that we get, by taking invariant derivatives of the kth-order invariants , a set of independent differential invariants of order , for (since the coefficients of the derivations are only of order 3). See for example [12] (Theorem 5.48). Thus, the algebra of differential invariants can be generated by the differential invariants of order 4, together with the invariant derivations.
Theorem 6.
The differential algebra of scalar differential invariants is generated by the invariant derivations and the scalar invariants , , and .
Note that this generating set of differential invariants is not minimal. As explained in in the previous section, some of the invariants are algebraically related to each other. Moreover, many of the invariants can be constructed by computing invariant derivatives of the third-order invariants. In particular, when the algebra of differential invariants can be generated by and the differential invariants of order three.
4.3. Differential Syzygies for
The algebra of differential invariants is not a freely generated differential algebra; there are differential syzygies among the generators. We will find the syzygies in the simplest case, when . This will also let us sharpen Theorem 6 for this particular case.
Let us use the notation
We have and . In order to write the differential syzygies in a relatively compact form, it will be useful to have the following definitions:
When , the third-order invariants are sufficient to generate the whole algebra. Furthermore, if we compute invariant derivatives of , we get
This implies that the invariants are sufficient for generating the algebra of differential invariants, together with and . Let us express the differential syzygies relating these generators.
The derivations satisfy the commutation relation
where and . The commutation relation determines some of the differential syzygies. When finding the rest of them, we consider only higher-order invariant derivatives of the form , where are non-negative integers.
Invariant derivatives of give differential invariants of order k for while the invariant derivatives of and give differential invariants of order k for . This implies that we get 4 differential invariants of order three, 7 of order four and, in general, of order k for . The Hilbert function is given by for , implying that there are syzygies for .
Since the differential invariants are rational, finding the differential syzygies becomes a completely algebraic problem. The first two differential syzygies (for ) are found with the help of Maple, and they are given by
Notice that the invariants can be expressed in terms of and .
Since only depends on invariant derivatives of , it is clear that the invariant derivatives of and will give additional independent syzygies involving differential invariants of order k for each . Thus, they generate the desired number of differential syzygies. The commutation relation can be used to ensure that the syzygies are written purely in terms of invariants of the form , where I is , or . We get the following theorem.
Theorem 7.
The differential algebra of scalar differential invariants is generated by the invariant derivations and , and the scalar differential invariants , and . The differential syzygies are generated by and .
Remark 5.
In [5] it was noted that there is a relation between the curvature of and higher moments. Indeed, when the scalar curvature of can be written as
5. Thermodynamics of Gases
We will now apply the ideas above to the context of gases in thermodynamic equilibrium. Consider the thermodynamic space with variables corresponding to pressure, temperature, internal energy, volume and entropy. The entropy is related to the information gain I by the formula . By aligning the one-form with the fundamental thermodynamic relation , we see that we can get the standard thermodynamics of gases from the measurement of a point , and the principle of minimal information gain. The relationship between and is
We will suppress the information gain, or entropy, from the picture, and consider a system in thermodynamic equilibrium as a Lagrangian submanifold of on which the symplectic form
vanishes. We will assume throughout.
Similarly as above, we let the Lagrangian submanifold be given by two functions . Restricting to such a submanifold gives
implying that L is Lagrangian if and only if
The differential equation determines the submanifold in .
5.1. The Group Action
In the (nonlinear) coordinates , the Lie algebra corresponding to the action on is spanned by the six vector fields
Orbits in general position in are six-dimensional. The subset on which the orbit dimension decreases is given by . Positive definiteness of implies positivity of the left-hand side of this equation (see the end of Section 5.2).
One can ask if it is appropriate to consider arbitrary affine transformation on points . We will consider the action by two different subgroups of , in addition to the full group of affine transformations, and write down the differential invariants for each of these three group actions. The choices of the two subgroups are based on recent results concerning the thermodynamics of fluids on two different manifolds.
In [17], the symmetries of compressible viscid fluids were found. Some of the symmetries are purely geometrical, such as translations, rotations and Galilean transformations. In addition there are some symmetries that act on thermodynamic variables. In particular there is a three-dimensional Lie algebra that is spanned by
It corresponds to the group action given by for , .
In [18], the symmetries of compressible viscid fluids on a spherical layer were found. In this case, the thermodynamic part of the symmetry Lie algebra is a Lie subalgebra of the one above. It is spanned by
It corresponds to the group action given by for , .
Remark 6.
To be precise, in both [17,18] there is also a vector field corresponding to translation in entropy. These transformations are not contained in , and since we consider here only differential invariants of Lagrangian submanifolds of , they will not affect our description of differential invariants. However, we swiftly note that if we extend the Lie algebras above with this vector field, the differential invariant of order zero that corresponds to entropy will not be invariant any more.
5.2. Differential Invariants with Respect to
The algebra of differential invariants with respect to was in principle completely described in Section 4. We found both generators and syzygies for the case . In the new coordinates, the tensors , and are given by
In this section we will use a slightly different set of generators for the algebra of differential invariants. With the help of the DifferentialGeometry and JetCalculus packages and the pdsolve procedure in Maple we find a generating set of invariants whose expressions are simpler in the current choice of coordinates. It is possible to find the second-order differential invariants by solving the system of linear partial differential equations given by for a basis of the Lie algebra of vector fields corresponding to the group action.
First of all, we have the following invariant derivations:
And we remember from the previous section that the Hilbert function for the field of rational differential invariants is given by and for .
The verification of the theorems in this section consists of two parts. Firstly, one must check that the functions and derivations given are invariant, something we recommend doing with the help of Maple. Secondly, one must verify that the differential invariants and invariant derivations allow us to generate a sufficient amount of independent differential invariants (compare with ). Due to the simple form of the differential invariants and invariant derivations, this is straightforward.
Theorem 8.
The differential invariants
constitute a transcendence basis for the field of third-order differential invariants.
Here we have recycled the notation that we used in Section 4.3. These are not exactly the generators we used above, but . We notice that this second-order differential invariant is what is known in thermodynamics as the heat capacity (at constant pressure). Thus the concept of heat capacity is given to us automatically if we consider the action of the affine group on (a subgroup of the affine group will lead to more scalar differential invariants).
Example 1.
Requiring to be constant singles out a special set of Lagrangian submanifolds. They are solutions to the system
for , and they are given by
By using the invariant derivations and we can generate the algebra of differential invariants. Notice that .
Theorem 9.
The differential algebra of scalar differential invariants is generated by the invariant derivations and , together with the differential invariants .
Notice also that the tensor takes diagonal form when written in terms of the frame :
Positive definiteness of implies and (the latter may be replaced by ).
5.3. Differential Invariants with Respect to a Three-Dimensional Subgroup
Now, let us consider the three-dimensional group from [17] which acts by for , .
The Hilbert function is in this case given by and for . Thus, we get 4 invariants of order two.
Theorem 10.
The differential invariants
generate the field of second-order differential invariants.
We notice that the last two invariants are the heat capacity at constant pressure and at constant volume, respectively. The derivations and are invariant, and by using them we can generate the algebra of differential invariants.
Theorem 11.
The differential algebra of differential invariants is generated by the following two differential invariants of order 1 and two invariant derivations:
The generators are related by the differential syzygy
The invariant can be written in terms of the invariants in Theorem 10. We use it here because it gives a simple differential syzygy.
Example 2.
Let us find the Lagrangian submanifolds for which is constant. By solving the system
for , we get
5.4. Differential Invariants with Respect to a Two-Dimensional Subgroup
Now, let us consider the two-dimensional Lie group from [18] which acts by for , .
The Hilbert function is in this case given by for . Thus, we get 5 invariants of order two.
Theorem 12.
The differential invariants
generate the field of second-order differential invariants.
The derivations and are invariant, and by using them we can generate the algebra of differential invariants. We reuse the notation from the previous subsection.
Theorem 13.
The differential algebra of differential invariants is generated by the following two differential invariants of order 1 and two invariant derivations:
The generators are related by the differential syzygy
5.5. The Significance of the Differential Invariants
We end the paper with a discussion on the significance of the differential invariants. In order to simplify the discussion, we will focus mainly on the group action of the two-dimensional Lie group considered in the previous subsection, but the ideas are general. We denote the Lie group by G.
A differential invariant I is a function on that is constant on G-orbits. It can be restricted to a Lagrangian submanifold L, resulting in a function on L. For example, if L is given by a particular pair of functions and , then the differential invariant restricts to a function . A transformation from the Lie group under consideration, such as , transforms the functions and into two new functions and , but the differential invariant remains the same:
This function gives the same value at the point in L as at the point in the transformed Lagrangian manifold . Thus, if we treat Lagrangian manifolds that are related by a transformation in G as equivalent, the differential invariants play an important role, as they are the scalar functions determined by and and their derivatives that only depend on the equivalence class of L.
The differential invariants can also be used to distinguish non-equivalent Lagrangian manifolds and classify them. Consider the differential invariants
If we restrict these invariants to a Lagrangian submanifold L, we get five functions that depend on T and p. We consider the subset of parametrized by . For generic Lagrangian manifolds this is a two-dimensional surface, and we call it the signature surface of L. Clearly, two equivalent Lagrangian manifolds give the same signature surface.
We will now generate the algebra of differential invariants in a different way than we did above. Let denote the Tresse derivatives with respect to the pair . They are derivations of the form , where a and b are functions on , that are uniquely determined by the condition . Here it is important that
where is the horizontal differential. This condition determines an Zariski-open subset in , and we consider only Lagrangian manifolds whose 1-jets lie in this open subset. Another way to say it is that we consider only Lagrangian manifolds L such that for every point in L. Note that this condition guarantees that the signature surfaces are two-dimensional.
It is not difficult to check that we can generate all differential invariants by using these five differential invariants and the Tresse derivatives. Then, since all differential invariants can be generated from and the Tresse derivatives, it becomes clear that the signature manifold of L contains all information about its invariants. Since the field of rational differential invariants separates orbits in general position in for every k [14], distinguishing non-equivalent Lagrangian manifolds (with jets in general position) comes down to comparing their signature surfaces.
Now we will find conditions that the signature surfaces must satisfy. By computing , we get six differential invariants of order three. Since , there are at least two differential syzygies. In fact, there are exactly two and they are given by
All higher-order syzygies are differential consequences of these. The Tresse derivatives play the role of partial derivatives with respect to and (their restriction to a Lagrangian submanifold L are partial derivatives with respect to and ), and these two differental syzygies are partial differential equations that the signature manifolds must satisfy. They are often called the quotient or factor equations. Their solutions can be interpreted as equivalence classes of Lagrangian submanifolds.
Funding
This research was supported by Grantová Agentura České Republiky (19-14466Y).
Acknowledgments
The author thanks Valentin Lychagin for fruitful discussions related to this project.
Conflicts of Interest
The author declares no conflict of interest.
References
- Gibbs, J.W. Graphical methods in the thermodynamics of fluids. Trans. Conn. Acad. 1873, II, 309–342. [Google Scholar]
- Gibbs, J.W. A method of geometrical representation of the thermodynamic properties of substances by means of surfaces. Trans. Conn. Acad. 1873, II, 382–404. [Google Scholar]
- Ruppeiner, G. Thermodynamics: A Riemannian geometric model. Phys. Rev. A 1979, 20, 1608–1613. [Google Scholar] [CrossRef]
- Ruppeiner, G. Riemannian geometry in thermodynamic fluctuation theory. Rev. Mod. Phys. 1995, 67, 605–659. [Google Scholar] [CrossRef]
- Mrugala, R.; Nulton, J.D.; Schön, J.C.; Salamon, P. Statistical approach to the geometric structure of thermodynamics. Phys. Rev. A 1990, 41, 3156–3160. [Google Scholar] [CrossRef]
- Lychagin, V. Contact Geometry, Measurement, and Thermodynamics. In Nonlinear PDEs, Their Geometry, and Applications; Kycia, R.A., Ulan, M., Schneider, E., Eds.; Birkhäuser: Cham, Switzerland, 2019; pp. 3–52. [Google Scholar]
- Jaynes, E.T. Information Theory and Statistical Mechanics. Phys. Rev. 1957, 106, 620–630. [Google Scholar]
- Schneider, E. Differential Invariants in Thermodynamics. In Nonlinear PDEs, Their Geometry, and Applications; Kycia, R.A., Ulan, M., Schneider, E., Eds.; Birkhäuser: Cham, Switzerland, 2019; pp. 223–232. [Google Scholar]
- Krasil’shchik, I.S.; Lychagin, V.V.; Vinogradov, A.M. Geometry of Jet Spaces and Nonlinear Partial Differential Equations; Gordon and Breach Science Publishers: London, UK, 1986. [Google Scholar]
- Krasil’shchik, I.S.; Vinogradov, A.M. Symmetries and Conservation Laws for Differential Equations of Mathematical Physics; American Mathematical Society: Providence, RI, USA, 1999. [Google Scholar]
- Olver, P. Applications of Lie Groups to Differential Equations; Springer: New York, NY, USA, 1986. [Google Scholar]
- Olver, P. Equivalence, Invariants, and Symmetry; Cambridge University Press: Cambridge, UK, 1995. [Google Scholar]
- Kruglikov, B.; Lychagin, V. Geometry of Differential equations. In Handbook of Global Analysis; Krupka, D., Saunders, D., Eds.; Elsevier: Amsterdam, The Netherlands, 2008; pp. 725–772. [Google Scholar]
- Kruglikov, B.; Lychagin, V. Global Lie-Tresse theorem. Sel. Math. 2016, 22, 1357–1411. [Google Scholar] [CrossRef][Green Version]
- Kruglikov, B.; Lychagin, V. Differential invariants of the motion group actions. arXiv 2007, arXiv:0712.3500. [Google Scholar]
- Bibikov, P.; Lychagin, V. Differential Contra Algebraic Invariants: Applications to Classical Algebraic Problems. Lobachevskii J. Math. 2016, 37, 36–49. [Google Scholar] [CrossRef]
- Duyunova, A.; Lychagin, V.; Tychkov, S. Differential invariants for flows of viscid fluids. J. Geom. Phys. 2017, 121, 309–316. [Google Scholar] [CrossRef]
- Duyunova, A.; Lychagin, V.; Tychkov, S. Differential invariants for spherical layer flows of viscid fluids. J. Geom. Phys. 2018, 130, 288–292. [Google Scholar] [CrossRef]
© 2020 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).