Abstract
This paper defines a large class of differentiable manifolds that house two distinct optimal problems called affine-quadratic and rolling problem. We show remarkable connections between these two problems manifested by the associated Hamiltonians obtained by the Maximum Principle of optimal control. We also show that each of these Hamiltonians is completely intergrable, in the sense of Liouville. Finally we demonstrate the significance of these results for the theory of mechanical systems.
Keywords:
Lie groups; Lie algebras; homogeneous manifolds; Hamiltonians; Poisson bracket; mechanical tops MSC:
49J15; 53A17; 53A35; 58A05; 58A30; 70B15
1. Introduction
This paper is a continuation of my long-standing interest in the role of Lie groups and Lie algebras in the theory of integrable systems and the equations of mathematical physics. The interest in this topic originated in two seemingly unrelated phenomena, the presence of elastica in the theory of rolling spheres ([1,2]), and the presence of the heavy top in the equations describing the equilibrium configurations of an elastic rod ([3,4]). My interest in these phenomena was further renewed by the subsequent studies ([5,6,7]) that showed intriguing connections between rolling problems, elastic curves and problems in mechanics. These studies also identified a class of variational problems on Lie groups, called affine-quadratic that not only played a pivotal role in this theory, but also made a significant impact on the theory of integrable systems ([8], Chapters 9, 10 and 11).
In this paper, we will shift emphasis to a new class of rolling problems associated with homogeneous Riemannian spaces rolling isometrically on their tangent planes (based on our recent study [9,10] ). We will show that each such isometric rolling has a well defined length which then leads to natural definition for a rolling geodesic. The rolling problem then consists of finding some necessary differential conditions that the rolling geodesics must satisfy.
We will show that each rolling problem can be recast as a left-invariant optimal control problem on a Lie group, and consequently, we will be able to regard the rolling geodesics as the projections of the extremal curves generated by a suitable Hamiltonian obtained through Pontryagin’s Maximum Principle. We will show several remarkable properties of the aforementioned Hamiltonian. First we will show that any such Hamiltonian is completely integrable, and secondly, we will show that the Hamiltonian system associated with an affine-quadratic system may be regarded as an invariant subsystem of the Hamiltonian differential system associated with the rolling problem. This discovery sheds new light on the geometric origins of the affine-quadratic systems and their connections to mechanical systems ([11,12]). These findings seem particularly remarkable considering the fact that the control functions that define these optimal problems lie in mutually orthogonal spaces of each other.
The general setting of the paper in which the above-mentioned problems will be analyzed is defined by a semi-simple Lie group G and a compact subgroup K with a finite centre. Any such pair is reductive in the sense that the Lie algebra of G admits a splitting where is a vector space complementary to the Lie algebra of K. In this paper, will be the orthogonal complement of relative to the Killing form . We recall that the Killing form is non-degenerate on G and also satisfies
for any elements in . This implies that . We shall make another assumption that and satisfy strong Cartan’s Lie algebraic conditions
Finally, we will assume that the Killing form is of definite sign on . This last condition is automatically satisfied when G is compact and is also satisfied by irreducible symmetric Riemannian pairs in the theory of symmetric spaces.
Let us now recall the definition of the affine-quadratic problem in this general setting ([8]).
1.1. Affine-Quadratic Problem
Any element A in generates an affine set in , and this set defines a left invariant differential system
where is a bounded and measurable curve in . We will think of (2) as a control system with playing the role of control functions. We will assume that A is regular , that is, that the set of elements in that commute with A forms an abelian subalgebra in . Our assumption implies that where each factor is a simple ideal of the form . It then follows that the projection of a regular element A on each factor in (4) is non-zero which, in turn, implies that (2) is controllable, in the sense that for any two points in G there is a solution on an interval that satisfies and (see [8], page 162 for a proof). Since any two Cartan subalgebras in are conjugate, so are the systems defined by any two regular elements and .
We will now let be the energy functional associated with any solution of (2) generated by a control , where . Note that the Killing form is negative semi-definite on the Lie algebra of K when K is compact, and is strictly negative when K has a finite centre (2). Therefore, our energy functional is positive for any non-zero control . This energy functional is called canonical relative to a more general one defined by any positive linear operator on .
The above data induce a natural optimal control problem: find the solutions of (2) that satisfy the given boundary conditions , for which the energy of transfer is minimal. The above optimal control problem will be referred to as the affine-quadratic problem (reminiscent of linear-quadratic problems in the control theory literature). In this paper we shall be interested only in the canonical case .
As we mentioned earlier, the pair is reductive. Any reductive semi-simple Lie algebra also carries along a “hidden” semi-direct product for the following reasons. Since , K acts linearly on by the adjoint action , and induces the semi-direct product with the group operation . Then the Lie algebra of is equal to with the Lie bracket given by
We will identify elements with the sums under the identification , in which case the Lie brackets in are identified with
Thus, as a vector space carries two Lie brackets:
defined by a single parameter s: in the semi-direct case, and in the semi-simple case.
It follows that every affine space that defines an affine left-invariant system on G also defines a corresponding left-invariant affine system on the semi-direct product . Thus, behind every affine quadratic optimal problem on G there is a corresponding affine-quadratic “shadow” problem on the semi-direct product .
When K is a compact group with finite centre, then the above optimal problems are well defined in the sense that for any set of boundary points and there exists an optimal trajectory that satisfies and for some .
Remarkably, the Hamiltonian associated with the shadow problem is particularly relevant in the theory of mechanical systems (see [8], Ch. 10 for the mechanical problem of Neumann on the sphere [13], Ch. 11 for Jacobi’s problem on the ellipsoid, and Ch. 13 for the elastic problem and the pendulum). This phenomenon raises a natural question: what is the geometric origin behind the affine-quadratic problem that properly accounts for its relevance for the above mentioned problems? This question was partly addressed in the literature on integrable systems where the drift vector was associated with a linear potential V associated to an abstract “rigid body” with a Hamiltonian on the tangent bundle of a Lie group G ([14]) but that association raised its own questions, and at the end proved to be more enigmatic than useful.
In this paper, we will show that the Poisson systems generated by the canonical affine-quadratic problem and the rolling problem provide new and original answers to the above query: we will show that the Poisson system associated with the affine-quadratic problem is an invariant subsystem of the Poisson system generated by the rolling problem on a coadjoint orbit where the drift element A appears as a constant of motion for the rolling problem (Propositions 5 and 6).
With this goal in mind, we will now turn our attention to the quotient space and the rolling problem.
1.2. Homogeneous Riemannian Manifolds
We will first need to introduce the Riemannian structure on the homogeneous manifold defined by G and K. To begin with we will regard G as a semi-Riemannian manifold (in the sense of O-Neill [15]) with the left-invariant metric induced by a scalar multiple of the Killing form that is positive definite on . Such a choice is possible by our assumption. On compact Lie groups G, this multiple will be a negative multiple of the Killing form and then the above metric on G coincides with the canonical bi-invariant metric. However, on non-compact Lir groups, the Killing form is indefinite and the above metric is semi-Riemannian. Here is a shorthand notation for the left-invariant vector field , where is the left translation . The same shorthand notation applies to the right-invariant vector fields with We also recall that the Killing form is invariant under any linear automorphism of and hence the quadratic form is invariant.
In order to make an easy passage to the techniques of optimal control, we will assume that all curves are absolutely continuous, and all differential equations involving such curves will be understood to be true only up to sets of measure zero without explicitly saying so. With that convention in mind any curve in G is a solution of for some bounded and measurable curve . When takes values in , is called horizontal, and when takes values in is called vertical. Correspondingly, the left-invariant distributions and will be called horizontal and vertical, respectively. Thus, horizontal curves are tangent to in the sense that . Likewise vertical curves are tangent to . It follows that
We shall assume that is endowed with a manifold structure so that the natural projection is a smooth surjection (such a structure exists ([15])). Then with this manifold structure will be denoted by M and o will denote the point in M such that , where e is the group identity in G.
A curve in G is called a lift of a curve if . Such a lift is said to be horizontal when is a horizontal curve. The projection of a vertical curve is a single point in M because any solution of is of the form .
If is any lift of a curve , then where and are the orthogonal projections of on and . Then, The above shows that , the solution of , is a horizontal curve that projects on , and secondly, it shows that for any horizontal lift of . The isomorphism can then be used to induce a metric on M
Let now denote the group of diffeomorphisms on M defined by the group action
Since G acts transitively on M, M can be represented by the orbit . It follows that for any . Note that is the flow generated by the right-invariant vector field . The above equality shows that the flow of is -related to the flow in M.
In what follows, will denote the infinitesimal generator of the flow , and will denote the family of vector fields . The correspondence is one to one and onto . Since the Lie brackets of vector fields related by a mapping F are also F-related ([16]) , the Lie brackets are -related to . Therefore the correspondence is a Lie algebra homomorphism, and hence is a finite dimensional Lie algebra of vector fields that satisfies for each . Elements of are generally known as the vector fields generated by the group action.
Note that and therefore . Then implies that
Furthermore,
Hence,
It follows that
for any and any tangent vectors and in .
Therefore, acts on M by isometries, and consequently each vector field in is a Killing vector field. Recall that the isometry group of M is a subgroup of Diff that leaves the metric invariant, also recall that a vector field is a Killing vector field if its flow acts on M by isometries (the flow of is given by ). See [15] for additional details.
A homogeneous manifold defined by the above data will be referred to as semi-simple (it is defined by a semi-simple Lie group G, a compact subgroup K, and the metric induced by the Killing form). It can be shown that any symmetric Riemannian space with no Euclidean factors can be reduced to a semi-simple manifold (so that holds). Conversely, if G is simply connected then every semi-simple manifold is symmetric (see [17], Proposition 6.27). In any event, the present exposition makes no use of geodesic symmetry so there is no need to get distracted with the theory of symmetric spaces.
On semi-simple manifolds, parallel transport and covariant derivative are given by nice formulas inherited from G. To elaborate, note that any semi-simple Lie group G with its left-invariant metric a scalar multiple of the Killing form is a semi-Riemannian group in the terminology of O’Neill ([15], p. 305) because the Killing form is invariant (it is only in the compact case that this semi-metric is Riemannian, i.e., equal to the canonical bi-invariant metric on G).
Relative to this left-invariant semi-metric, , X and Y left-invariant, is the (unique) bi-invariant affine connection that preserves the inner product and is torsion free ([15]). The associated covariant derivative of a vector field defined along a curve in G is given by
Since the metric on M is the pull-back of the metric on G, the covariant derivative and parallel transport in M can be described in terms of the lifted objects in via the following formulas ([9]): any curve of tangent vectors along a curve in M can be represented by in terms of a unique curve , where denotes a horizontal curve in G that projects onto . It follows that and . Then the covariant derivative of along is given by
where denotes the orthogonal projection of on (because of our assumption , the orthogonal projection of on is zero). Hence, is parallel along whenever is the projection of a curve with a constant in . With this background at our disposal we will now come to the rolling problem.
1.3. The Rolling Problem
The most direct route to the rolling problem is via the intrinsic definition of rolling, introduced by R. Bryant and L. Hsu in ( [18]), and later used by A. Agrachev in ( [19]), Y. Chitour in ( [20,21]) and Godoy Molina in ([22]). According to this definition a curve on a Riemannian manifold M rolls on a curve on another Riemannian manifold if there exists an isometry that satisfies:
and also satisfies the condition that is a parallel vector field in along for each parallel vector field along in M. The triple is called a rolling curve. It is clear that rolling is reflexive in the sense that if is rolled on by an isometry then is rolled on by the isometry , and therefore is also a rolling curve. We will take which we regard as a Euclidean space with its metric defined by (4) and we address the rolling of curves in M on curves in . Recall that in any semi-Euclidean vector space parallel transport along a curve in is done only by constant vector fields (translations).
Any curve in M is the projection of a horizontal curve , that is, and . Then,
where denotes the curve of Killing vector fields in defined by . If we now let be any solution in of and let then is an isometry that rolls on since the parallel transport condition is fulfilled (by Equation (10)). Of course, then rolls on .
It follows that each horizontal curve in G defines a family of curves in , each a solution of associated with , that roll on . Conversely, every solution of the differential system
defines a curve in M on which in is rolled by the isometry .
The rolling problem will be defined on the configuration space , which will be regarded as a Lie group with the group operation , for all and in . Then the Lie algebra of will be naturally identified with with the Lie bracket .
Let now denote the left invariant distribution defined by that is,
The distribution will be referred to as the rolling distribution and its integral curves will be called rolling motions. Any rolling motion is a solution of
and can be associated with the rolling curve , where . The reader may want to show that this intrinsic definition of rolling agrees with the extrinsic descriptions [23] based on the formalism in [24].
Since is a vector subspace in that satisfies
the Lie algebra generated by the left-invariant vector fields tangent to is equal to , and therefore, any two points in can be connected by a rolling motion, and each rolling motion inherits a natural length from G. To put the matter in a control theoretic context, let be an orthonormal basis in so that is an orthonormal basis in . Then an absolutely continuous curve is a rolling motion if and only if
for some bounded and measurable control functions , in which case the length of is given by . It then follows from (16) that the Lie algebra generated by the left-invariant vector fields is of full rank in . Since each left-invariant vector field is complete, any pair of points in can be connected by an integral curve of of minimal length ([19]). An integral curve of is called a rolling geodesic if for any and , sufficiently close to each other, the length of in the interval is minimal among all other integral curves of that connect to .
The rolling problem consists of characterizing the rolling geodesics in induced by . Since each rolling geodesic is also a sub-Riemannian geodesic on the configuration space relative to the above length, the rolling problem can be equivalently phrased as a sub-Riemannian problem in where one looks for the solutions on a fixed time interval that satisfy the given boundary conditions and along which the energy of transfer is minimal.
Return now briefly to the affine-quadratic problem introduced earlier with its dynamics
and the energy (sometimes called the cost in the literature on optimal control) , induced by a positive definite operator relative to the scalar product . Since can be diagonalized by an orthonormal basis in , the affine-quadratic problem can be restated as an optimal problem over the system
with , and the energy of transfer ( are the eigenvalues of ). In the canonical case .
Let us now single out some examples that are relevant for the results that follow.
1.4. Some Notable Examples
1. In this situation we will assume that the Lie algebra , that consists of matrices having zero trace, is endowed with the scalar product . Then is the Lie algebra of K, and is the space of symmetric matrices in . It is easy to verify that is positive on and negative on . Therefore G with its left-invariant metric induced by is a semi-Riemannian manifold.
Then the quotient space will be identified with , the space of positive-definite matrices of determinant one, through the action , where is the matrix transpose of g. Since any positive definite matrix P with can be written as for some the action is transitive, and can be identified with the orbit through the identity I. Since the identity matrix I is both an element of and the group identity in G, it is equal to the point o (). Horizontal curves are the solutions of . Any curve in is the projection of a horizontal curve and the length of is given by . Killing vector fields are given by , and . The rolling distribution is given by
The case is somewhat special, for then is isometrically diffeomorphic to the Poincaré upper half plane with its metric . To elaborate, note that every can be written as where P is upper triangular and R is a rotation matrix. In fact, if is an element of then
Let now
where We will now show that F is an isometry from with its Poincaré hyperbolic metric onto with its G-invariant metric. If then
and therefore, . If and , then an easy calculation shows that
and hence . It follows that and therefore F is an isometry.
It then follows that the rolling distribution has its isometric analogue on rolling on the tangent space at i. In this scenario acts on via the Moebius transformations , and is represented by the orbit . Horizontal curves are the solutions of and their projections on are given by . Then
Therefore, rolling motions are the solutions of
2. , where denotes the connected component of that contains the group identity when , and , when . Both cases can be treated in a uniform manner as follows.
Let denote with the scalar product (x,y)ϵ = x0y0 + ϵ xiyi. Each SOϵ(n + 1) acts on by matrix multiplications, and each group is defined as the matrix group whose elements have a positive determinant and preserve the bilinear form . It follows that each g ∈ SOϵ(n + 1) satisfies gTDg = D where D is a diagonal matrix with its diagonal entries equal to (1,ϵ,...,ϵ). Therefore, Det(gT)Det(g)Det(D) = Det(D) which implies that Det(g) = 1. This shows that each of SOϵ(n + 1) is a subgroup of SL(n + 1).
We will let denote the Euclidean unit sphere when ϵ = 1 and the hyperboloid {x ∈ Rn+1: , x0 > 0} when ϵ = −1. In each case, SOϵ(n + 1) acts on by the left matrix multiplications on the points of written as column vectors. It can be shown that this action is transitive. When is represented by the orbit through then the isotropy group is equal to . Therefore,
with the natural projection given by .
We will regard as a semi-Riemannian subgroup of with its left-invariant metric introduced through the bilinear form (this metric is indefinite on
when ϵ = −1 and is positive when ϵ = 1).
The following notations will be useful in describing the Cartan factors and . If a and b are any points in Rn+1 then a ⊗ϵ b will denote the matrix defined by (a ⊗ϵ b)x = (a,x)ϵb, x ∈ ℝn+1, and then a ∧ϵ b will denote the matrix a ⊗ϵ b − b ⊗ϵ a. Since ((a ∧ϵ b)x,y)ϵ + (x,(a ∧ϵ b)y)ϵ = 0, a ∧ϵ b belongs to 𝔰𝔬ϵ(n + 1) for any a, b in ℝn+1.
It is easy to show that the Lie algebra of K and its orthogonal complement are given by the following expressions:
The preceding matrices can be also written as
Horizontal curves are the solutions of
that satisfy
Then is the natural metric on . We then have
hence the metric is invariant, and with this metric is a semi-simple homogeneous manifold. It follows that the rolling distribution is given by
which agrees with 2.4 in ([6]).
2. Symplectic Background, Hamiltonian Systems
Let us now turn our attention to the extremal curves associated with our main problems. Because of the constraints present in these problems, the Maximum Principle of optimal control, rooted in the Hamiltonian formalism, is the only tool available for arriving to the appropriate extremal equations. However, in order to make an effective use of the Maximum Principle we will need to work with the symplectic form in a special system of coordinates that is well adapted for left-invariant optimal control problems (described in [3,8]) which calls for a brief review of symplectic geometry. Below is a brief summary of the symplectic material required for the main results.
Recall that a manifold M endowed with a non-degenerate and closed 2-form is called symplectic. The symplectic form induces a correspondence between functions and vector fields: every function f corresponds to a vector field defined by . In this context, is called the Hamiltonian vector field generated by f. Every symplectic manifold is even dimensional, and at each point of M there exists a neighbourhood with coordinates such that the Hamiltonian vector fields are given by
This choice of coordinates in which is represented by (27) is called symplectic.
Any cotangent bundle is a symplectic manifold endowed with its canonical symplectic form, usually written as relative to a choice of symplectic coordinates . As a symplectic manifold the cotangent bundle is somewhat special, it is a vector bundle at the same time. For that reason every vector field X on M can be lifted to a unique Hamiltonian vector field in via the function , . Vector field is called the Hamiltonian lift of X. The same procedure is applicable to any time varying vector field, and by extension to any differential system on M. Thus, any differential system in M can be lifted to a Hamiltonian system in . Then the Maximum Principle singles out the appropriate Hamiltonian lifts that govern the optimal solutions ([8]).
When the base manifold is a Lie group G, and when the underlying differential system is either left or right invariant, then there is privileged system of coordinates based on the realization of as , with the dual of , that preserves the left (or right) invariant symmetries and elucidates the conservation laws of the associated Hamiltonian system. The passage to these coordinates is explained below.
2.1. Left-Invariant Trivializations and the Symplectic Form
Having in mind applications involving left-invariant variational systems, the cotangent bundle and the tangent bundle will be represented as and via the left-translations. That is, tangent vectors will be identified with the pairs via the relation . Similarly, linear functions will be identified with pairs via , i.e., . Then is naturally identified with the product , with the understanding that an element denotes the tangent vector at the base point .
Note that is a Lie group in its own right since is an abelian Lie group with the group multiplication given by the vector addition. Then left-invariant vector fields in are the left-translates of the pairs in the Lie algebra of . In this formalism the flow associated with the left-invariant vector field in is given by . In terms of left-invariant vector fields and , the canonical symplectic form on is given by the following formula:
The above differential form is invariant under left-translations in , and is particularly revealing for the Hamiltonian vector fields generated by left-invariant functions on , that is, functions that satisfy for all and all . Evidently, left-invariant functions on are in exact correspondence with functions in .
Each left-invariant vector field , , lifts to a linear function on because
and each function H on generates a Hamiltonian vector field on whose integral curves are the solutions of
Equation (29) can be easily verified by the following argument: when H is a function on , then its differential at a point ℓ is a linear function on , hence an element of , because is a finite dimensional vector space. If for some vectors and , then
must hold for any tangent vector at . This implies that , and , where for all . Hence, (29) holds.
In a more general case where H is a function of both g and ℓ, the equations for are given by
as can be easily verified through the relations
This situation typically occurs in problems of mechanics in the presence of potential functions. For instance, the motion of a three-dimensional rigid body with a potential function is described by the Hamiltonian
where denote the columns of the matrix transpose of the rotation R in . If is a curve in defined by an element , then . Therefore,
where is the standard inner product in . Thus, is the external torque exerted by V. The corresponding equations of motion are given by
These equations extend to an “n-dimensional rigid body” ) with the external torque . This system of equations is usually written on the tangent bundle of , represented as the product , as
In this context, is the generalization of the angular momentum, is the generalization of the angular velocity, and is the generalization of the inertia tensor.
2.2. Poisson Manifolds, Coadjoint Orbits
We will now address the Poisson structure on inherited from the symplectic form given by (28). Recall that a manifold M together with a bilinear, skew-symmetric form
that satisfies
for all functions on M, is called a Poisson manifold.
Every symplectic manifold is a Poisson manifold with the Poisson bracket defined by . However, a Poisson manifold need not be symplectic, because it may happen that the Poisson bracket is degenerate at some points of M. Nevertheless, each function f on M induces a Poisson vector field through the formula . It is known that every Poisson manifold is foliated by the orbits of its family of Poisson vector fields, and that each orbit is a symplectic submanifold of M with its symplectic form . (This foliation is known as a the symplectic foliation of M).
Proposition 1.
The dual of a Lie algebra is a Poisson manifold with the Poisson bracket
Proof.
Functions on coincide with the left-invariant functions on . Hence,
It follows that the Poisson bracket on is the restriction of the canonical Poisson bracket on to the left-invariant functions. As such it automatically satisfies the properties of a Poisson manifold. □
In the literature on integrable systems, Poisson bracket is often referred as the Lie-Poisson bracket ([14]). We have taken its negative so that Poisson vector fields agree with the projections of the Hamiltonian vector fields generated by left-invariant functions (and also agree with the sign convention in [7,8]).
It follows that each function H on defines a Poisson vector field on through the formula . The integral curves of are the solutions of
That is, each function H on may be considered both as a Hamiltonian on , as well as a function on the Poisson space . It follows that the Poisson equations of the associated Poisson field are the projections of the Hamiltonian Equation (29) on .
Solutions of Equation (33) are intimately linked with the coadjoint orbits of G. We recall that the coadjoint orbit of G through a point is given by
The following proposition is a paraphrase of A.A. Kirillov’ fundamental contributions to the Poisson structure of ([25]).
Proposition 2.
Let denote the family of Poisson vector fields on and let denote the orbit of through a point . Then M is equal to the connected component of the coadjoint orbit of G that contains . Consequently each coadjoint orbit is a symplectic submanifold of .
The fact that the Poisson equations evolve on coadjoint orbits implies useful reductions in the theory of Hamiltonian systems with symmetries. Our main results will make use of this fact.
2.3. Representation of Coadjoint Orbits on Lie Algebras- Semi-Simple vs. Semi-Direct
On semi-simple Lie groups, the Killing form, or any scalar multiple of it , is non-degenerate, and can be used to identify linear functions ℓ on with points via the formula , . Then Poisson Equation (33) can be expressed dually on as
The argument is simple:
Since X is arbitrary, Equation (34) follows.
Under the above identification coadjoint orbits are identified with the adjoint orbits , and the Poisson vector fields are identified with vector fields . Each vector field is tangent to at L, and , in is the symplectic form on each orbit .
In a reductive semi-simple Lie group G with a subgroup K there is also the semi-direct product , described earlier in the introduction. Then Poisson equations on can be also represented on via the quadratic form as in the semi-simple case, but the resulting expression takes on a slightly different form. To see the difference, let and denote the decompositions of and L onto the factors and . On the semi-direct product,
Hence, the Poisson equations are given by
This equation can be combined with the equations for the semi-simple case in terms of the parameter s with
One can show that the coadjoint orbit through under the action of consists of pairs
when is identified with in , and when is identified with ([8]).
The adjoint orbits of a non-compact semi-simple Lie group G are often symplectomorphic with the cotangent bundles of manifolds ([26]). It appears that the same is true for coadjoint orbits under the action of semi-direct products. We will now single out two such situations which are relevant for the connections to mechanical tops.
Return now to and introduced in Example 2.
Proposition 3.
The coadjoint orbit through under the action of the semi-direct product is diffeomorphic to the tangent bundle of the connected component of the “sphere” that contains .
Proof.
Let , and . Then
where is the projection of x on the orthogonal complement of p in . Therefore,
is the desired diffeomorphism from the tangent bundle of the connected sphere onto the coadjoint orbit . □
The above diffeomorphism is actually a symplectomorphism from the cotangent bundle of either the Euclidean sphere when , or the hyperboloid of one sheet when , to the appropriate coadjoint orbit, but we will not go into these details. ([8]).
We will now turn our attention to the reductive pair (Example 1) and the coadjoint orbit through a symmetric matrix with distinctive non-zero eigenvalues under the action of . We recall that where is the space of symmetric matrices of trace zero. Every symmetric matrix S can be written as , . An easy inspection of (37) shows that the orbit through S differs by a constant factor from the orbit through . So the zero-trace requirement is inessential for the structure of coadjoint orbits.
Proposition 4.
The coadjoint orbit through given by
is diffeomorphic to the tangent bundle of the flag manifold consisting of subspaces with .
Sketch of the proof: Let denote a symmetric matrix with distinct non-zero eigenvalues . Then can be identified with a point in where each subspace is equal to the linear span of unit eigenvectors of . If is represented by the matrix , then is represented by the matrix that corresponds to the point in . The correspondence is a diffeomorphism from the orbit onto .
Let now denote the Stiefel manifold of k-orthonormal frames in . Points of can be represented by matrices M with columns that satisfy , where denotes the matrix transpose of M, and where is the k-dimensional identity matrix. Let be the embedding
Then , where D is a diagonal matrix with its diagonal entries equal to . Therefore, is a covering space for , and hence and are locally diffeomorphic, that is, every point admits an open neighbourhood U such that the restriction of to U is a diffeomorphism onto . It follows that tangent vectors at a point M can be identified with matrices that satisfy .
Let now U be an open set in such that restricted to U is a diffeomorphism onto . For every , is identified with . Then
with . Since X is symmetric, . Moreover, could be replaced by its orthogonal projection on without altering the value of Q. So we may assume that
It follows that , hence is a tangent vector at M. The pairs are parametrized by the entries of M and the entries of the matrix Y. The columns of Y satisfy constraints , and . This implies that the manifold of pairs of matrices subject to the constraints
is of the same dimension as the tangent bundle of . Therefore, the correspondence is one to one and onto the sub-bundle over U.
Corollary 1.
If is the orthogonal projection on a k-dimensional vector space, i.e., if , for some orthonormal vectors , then the coadjoint orbit through under the action of the semi-direct product is diffeomorphic to the tangent bundle of the oriented Grassmannian .
Here is identified with the flag consisting of a single k-dimensional vector space spanned by . Then is diffeomorphic to the oriented Grassmannians .
Note 1.
Proposition 4 is a correction to Proposition 10.2 on page 170 in [8] which incorrectly states that the coadjoint orbit through is the Steifel rather than the flag manifold .
3. Hamiltonian and Poisson Systems: Extremal Curves
We now come to the central part of the paper, the Hamiltonian systems associated with our optimal control problems,
3.1. Rolling Hamiltonians
Recall the rolling problem Equation (17),
and the associated optimal control problem of minimizing the energy function . Our immediate aim is to use the Maximum Principle to obtain the equations for the extremal curves in the cotangent bundle of the configuration space . To emphasize the structure of the problem, we will rewrite (17) as
where each a left-invariant vector field , . If is an optimal trajectory then, according to the Maximum Principle, is the projection of an extremal curve in along which the cost extended Hamiltonian
is maximal relative to all competing controls. In this notation, each is the Hamiltonian lift of , i.e., . In the abnormal case, , the Maximum principle results in the constraints
while in the normal case, , the maximality condition implies that the optimal controls are of the form , in which case the corresponding optimal solutions are the projections of the solution curves of a single Hamiltonian vector field generated by the Hamiltonian
This Hamiltonian is left-invariant in the representation and hence its Hamiltonian equations are given by the Equation (29), that is,
We will now concentrate on the solutions of the associated Poisson equation
Let us first expand on the structure of the coadjoint orbits in this situation. Since is a Euclidean vector space, its tangent space at the origin can be identified with . Then the Lie algebra can be identified with , and its dual can be identified with , where
It then follows that every can be written as with and . Since is a vector space, and therefore an abelian algebra, the projection on is constant on each coadjoint orbit of . The argument is straightforward: if , then
It follows that the coadjoint orbits in are of the form
This fact can be also verified directly from Equation (42): we have
where and . Therefore,
from which follows that
Since is arbitrary .
To uncover other constants of motion, identify with via the natural quadratic forms on each of the factors, and then recast the preceding equations on . More precisely, identify each in with a tangent vector via the formula . Similarly, identify with via the formula . Then decompose into the sum , and . Relative to the basis in , where . It follows that
and
Since X and are arbitrary,
Equation (43) constitutes the Poisson equations on generated by the Hamiltonian . Note that in this identification of the Lie algebras with their duals, coadjoint orbits are identified with the affine sets . Coupled with
Equation (43) constitutes the extremal equations for the rolling geodesics. Each extremal curve projects onto a geodesic , and each geodesic further projects onto the pair of curves in M and in that are rolled upon each other by .
3.2. Affine-Quadratic Hamiltonian
Similar to the rolling problem, the Maximum Principle reveals that the normal extremals of the affine-quadratic system (18) are the integral curves of the Hamiltonian vector field associated with the Hamiltonian function
where as before is the decomposition of onto the factors and . In the canonical case , and in the representation , the Hamiltonian equations generated by H are then given by
The Poisson equation can be written in expanded form as
The “shadow” problem generates an analogous Hamiltonian on the tangent bundle of the semi-direct product with its extremal equations given by:
Here , and is the same as .
The propositions below reveal a remarkable fact that the Poisson equations of a canonical affine-quadratic Hamiltonian can always be regarded as an invariant subsystem of the Poisson equations associated with a rolling Hamiltonian. We will use bold letters when referring to the variables in the rolling Hamiltonian in contrast to the variables in the affine-quadratic Hamiltonian.
Proposition 5.
Let be any integral curve of the rolling Hamiltonian , that is,
Then
is an integral curve of the affine Hamiltonian , where , and is the solution of with .
Moreover, in with and a solution of is the projection of an extremal curve
associated with the shadow Hamiltonian .
Proof.
If A is any element in then . Since , and are the solutions of the same differential equation they will be equal to each other whenever , that is, when .
Assume that . Then,
Additionally,
and
As to the proof of the second statement, note that and is a solution of as remarked in Equation (47). An argument identical to the one above shows that
□
The converse also holds as this proposition demonstrates.
Proposition 6.
Suppose that is an extremal curve of the affine Hamiltonian . Then
is an extremal curve of the rolling Hamiltonian .
However, if , and is an extremal curve of the shadow Hamiltonian H, then
is an extremal equation of the Hamiltonian with .
Proof.
The proof of the first part is essentially the same as in the previous proposition.
In the second part, we have
Then .
Let so that . It follows that and . Hence,
Additionally,
□
The above shows that the Poisson systems generated by any affine-quadratic Hamiltonian are invariant subsystems of the rolling Hamiltonians. To summarize, let denote an integral curve of the rolling Hamiltonian . If denotes the solution of , then define by . It follows from above that
are integral curves of the affine quadratic Hamiltonian . However, when
then are integral curves of the shadow Hamiltonian H.
3.3. Isospectral Representations and Integrability
An matrix equation is called a Lax equation, and is called Lax pair. If is a Lax pair, then the spectrum of is constant. The proof is simple: , where a constant matrix for any solution in the general linear group . Since the spectrum of is equal to the spectrum of , the spectrum of must be constant.
It follows that the Poisson equation of any left-invariant Hamiltonian H is a Lax equation on a semi-simple Lie algebra (Equation (34)) and therefore, the eigenvalues of are constants of motion for any left-invariant Hamiltonian on and hence may be regarded as the conservation laws on .
A function h on a Poisson space is said to be invariant if for any function f. On semi-simple Lie algebras any spectral function is invariant. In particular functions form a family of invariant functions.
In some situations, a Lax equation extends to a Lax equation with a spectral parameter . Then a discrete spectrum of L is replaced by a continuous spectrum of which results in additional constants of motion. In the case of rolling spheres J. Zimmerman in his PhD thesis (2002, University of Toronto) discovered an extension of the Lax equation which he called isospectral ([6]). Remarkably, Zimmerman’s extension exists for the rolling problem on any semi-simple homogeneous manifold, for the same reasons as in the rolling sphere problem. In fact, if , then the Poisson equations may be written as
This equation is invariant under a dilational change . It then follows that
satisfies the equation
Therefore, the spectrum of is constant. We will refer to as the spectral curve for . Of course, the above implies that the Poisson system associated with the affine-quadratic Hamiltonian also admits an isospectral representation. To be specific note that after the substitutions from Equation (49),
Then
Therefore,
To be consistent with my earlier publications, replace by to get
where , and . Equation (54) agrees with the isospectral representation in ([8]) (obtained by other means).
To get the spectral curve for the shadow Hamiltonian, use Equation (50). In such a case, , and yields
Then a calculation analogous to the one above gives . After the rescaling we get a modified Lax pair
Each spectral curve defines a family of functions
Proposition 7.
The family is involutive, that is, for each g and h in , and in the case that is regular, it is also complete, in the sense that it contains a subfamily that is Liouville integrable on each coadjoint orbit in ([8], pp. 164–165).
See also related papers also [27,28,29]).
Since belongs to , the rolling problem is completely integrable when is regular.
Corollary 2.
Each affine-quadratic Hamiltonian is completely integrable on when A is regular.
4. Symmetric Mechanical Tops
We will now relate the “top-like” equations
on the tangent bundle of , associated with the energy Hamiltonian , to the rolling equations. For simplicity of exposition, we will assume that the top is maximally symmetric, that is we will assume that all principal moments of inertia are equal, which is the same as . We will first consider the case of linear potentials.
Linear potentials: , where a is a vector in , and are constants. Then Equation (56) can be written as
where and . Our proposition below relates Equation (58) to the rolling equations
on , where .
To set the stage for this proposition, we will need to embed Equation (58) in via the following embeddings. To begin with, will denote the embedding for any . Then will be identified with and will be identified with in . In addition will be identified with , and Ω will be identified with so that is identified with . Then
is the same as . It follows that Equation (58) can be paraphrased as
However, then
and therefore is constant (same as ).
Proposition 8.
Top-like Equation (58) are isomorphic to the Equations (59) and (60) under the identification
Proof.
It follows that and . Thus, (60) is satisfied. We also have
and Equation (59) are also satisfied. □
Corollary 3.
An n-dimensional symmetric top with a linear potential is completely integrable.
Quadratic potentials. We will now show that the rolling geodesic equations on can be identified with movements of the symmetric top under a quadratic potential. For our purposes, an n-dimensional top with quadratic potential is synonymous with the Hamiltonian
with , , S a symmetric matrix, and arbitrary numbers. In accordance with (32) the Hamiltonian equations of are given by
. In the symmetric case and and the equations reduce to
To relate these equations to the rolling equations, let
Recall that is a rank one matrix defined by where is the standard Euclidean inner product in . Therefore each matrix is a symmetric matrix with its trace equal to one, and consequently is a symmetric matrix having zero trace. Along each solution of (63)
Additionally,
Now let
We then have
Proposition 9.
Equation (63) are isomorphic to the Poisson equations of the rolling problem on (Equation (43)) associated with the extremal
Proof.
By a straightforward calculation. □
Corollary 4.
Equations of a symmetric n-dimensional top with quadratic potential are completely integrable.
See also related results in [30,31,32]).
Funding
This research received no external funding.
Data Availability Statement
Not applicable.
Conflicts of Interest
No conflict of interest.
References
- Jurdjevic, V. The geometry of the Ball-Plate Problem. Arch. Ration. Mech. Anal. 1993, 124, 305–328. [Google Scholar] [CrossRef]
- Jurdjevic, V. Non-Euclidean Elasticae. Am. J. Math. 1995, 117, 93–125. [Google Scholar] [CrossRef]
- Jurdjevic, V. Geometric Control Theory; Cambridge Studies in Advanced Mathematics; Cambridge University Press: New York, NY, USA, 1997; Volume 52. [Google Scholar]
- Jurdjevic, V. Integrable Hamiltonian Systems on Lie groups: Kowalewski type. Ann. Math. 1999, 150, 605–644. [Google Scholar] [CrossRef]
- Zimmerman, J.A. Optimal control of the sphere Sn rolling on En. Math. Control Signals Syst. 2005, 17, 14–37. [Google Scholar] [CrossRef]
- Jurdjevic, V.; Zimmermann, J. Rolling sphere problems on spaces of constant curvature. Math. Proc. Camb. Phil. Soc. 2008, 144, 729–747. [Google Scholar] [CrossRef]
- Jurdjevic, V. Integrable Hamiltonian Systems on Complex Lie Groups; American Mathematical Society: Providence, RI, USA, 2005; Volume 178. [Google Scholar]
- Jurdjevic, V. Optimal Control and Geometry: Integrable Systems; Cambridge Studies in Advanced Mathematics; Cambridge University Press: Cambridge, UK, 2016. [Google Scholar]
- Jurdjevic, V.; Markina, I.; Silva Leite, F. Symmetric spaces rolling on flat spaces. arXiv 2022, arXiv:2205.14636. [Google Scholar]
- Jurdjevic, V. Rolling on Affine Tangent Planes: Parallel Transport and the Associated Sub-Riemannian Problems. In Proceedings of the 14th APCA International Conference on Automatic Control and Soft Computing, Bragança, Portugal, 1–3 July 2020; pp. 136–147. [Google Scholar]
- Fomenko, A.T.; Mischenko, A.S. Euler equation on finite-dimensional Lie groups. Izv. Ross. Akad. Nauk. Seriya Mat. 1978, 12, 371–389. [Google Scholar]
- Fomenko, A.T.; Trofimov, V.V. Integrability in the sense of Liouville of Hamiltonian systems on Lie algebras. Uspekhi Mat. Nauk 1984, 2, 3–56. (In Russian) [Google Scholar]
- Neumann, C. De problemate quodam mechanico, quod ad primam integralium ultraellipticorum classem revocatum. J. Reine Angew. Math. 1859, 56, 46–63. [Google Scholar]
- Reyman, A.G.; Semenov Tian-Shansky, M.A. Group theoretic Methods in the Theory of finite dimensional Integrable Systems. In Dynamical Systems VII; Chapter 2, Encyclopaedia of Mathematical Sciences; Springer: Berlin/Heidelberg, Germany, 1994; Volume 16. [Google Scholar]
- O’Neill, B. Semi-Riemannian Geometry; Academic Press: Cambridge, MA, USA; Elsevier: Amsterdam, The Netherlands, 1983. [Google Scholar]
- Sternberg, S. Lectures on Differential Geometry; Prentice- Hall, Inc.: Englewood Cliffs, NJ, USA, 1964. [Google Scholar]
- Ziller, W. Lie Groups. Representation Theory and Symmetric Spaces; University of Pennsylvania: Philadelphia, PA, USA, 2010. [Google Scholar]
- Bryant, R.; Hsu, L. Rigidity of integral curves of rank 2 distributions. Invent. Math. 1993, 114, 435–461. [Google Scholar] [CrossRef]
- Agrachev, A.; Sachkov, Y. Control Theory from the Geometric Viewpoint; Springer: Berlin/Heidelberg, Germany, 2004. [Google Scholar]
- Chitour, Y.; Kokkonen, P. Rolling Manifolds: Intrinsic Formulation and Controllability. arXiv 2011, arXiv:1011.2925v2. [Google Scholar]
- Chitour, Y.; Godoy Molina, M.; Kokkonen, P. The rolling problem: Overview and challenges. In Geometric Control theory and Sub-Riemannian Geometry; Springer INdAM Series 5; Springer: Berlin/Heidelberg, Germany, 2014; pp. 103–123. [Google Scholar]
- Godoy Molina, M.; Grong, E.; Markina, I.; Silva Leite, F. An intrinsic formulation of the problem on rolling manifolds. J. Dyn. Control Syst. 2012, 18, 181–214. [Google Scholar] [CrossRef]
- Krakowski, K.A.; Machado, L.; Silva Leite, F. Rolling Symmetric Spaces. In Proceedings of the Second International Conference on Geometric Science of Information, Palaiseau, France, 28–30 October 2015; Springer: Berlin/Heidelberg, Germany, 2015; pp. 550–557. [Google Scholar]
- Sharpe, R.W. Differential Geometry; GTM, 166; Springer: New York, NY, USA, 1997. [Google Scholar]
- Kirillov, A.A. Lectures on the Orbit Method; Graduate Studies in Mathematics, 64; American Mathematical Society: Providence, RI, USA, 2004. [Google Scholar]
- Gasparim, E.; Grama, L.; San Martin, L.A. Adjoint orbits of semi-simple Lie groups and Lagrangian submanifolds. Proc. Edinb. Math. Soc. 2017, 60, 361–385. [Google Scholar] [CrossRef]
- Bolsinov, A. A completeness criterion for a family of functions in involution obtained by the shift method. Soviet Math. Dokl. 1989, 38, 161–165. [Google Scholar]
- Reyman, A.G. Integrable Hamiltonian Systems connected with graded Lie algebras. J. Sov. Math. 1982, 19, 1507–1545. [Google Scholar] [CrossRef]
- Perelomov, A.M. Integrable Systems of Classical Mechanics and Lie Algebras; Birkhauser: Basel, Switzerland, 1990; Volume 1. [Google Scholar]
- Bogoyavlenski, O. New Integrable Problem of Classical Mechanics. Comm. Math. Phys. 1984, 94, 255–269. [Google Scholar] [CrossRef]
- Jurdjevic, V. Affine quadratic problem on Lie groups. J. Lie Theory 2020, 20, 425–444. [Google Scholar] [CrossRef]
- Manakov, S.V. Note on the integration of Euler’s equations of the dynamics of an n dimensional rigid body. Funct. Anal. Appl. 1976, 10, 328–329. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).