Abstract
The present tutorial paper constitutes the second of a series of tutorials on manifold calculus with applications in system theory and control. The aim of the present tutorial, in particular, is to explain and illustrate some key concepts in manifold calculus such as covariant derivation and manifold curvature. Such key concepts are then applied to the formulation, to the control, and to the analysis of non-linear dynamical systems whose state-space are smooth (Riemannian) manifolds. The main flow of exposition is enriched by a number of examples whose aim is to clarify the notation used and the main theoretical findings through practical calculations.
1. Introduction
The present paper appears as the second part of a long tutorial covering notions of calculus on manifolds embedded in larger flat spaces, dynamical systems whose state spaces are structured as smooth manifolds and non-linear control systems on manifolds.
Regulation of dynamical systems on state manifolds appears to be an appealing research field in non-linear control theory which is gaining increasing interest in the scientific community, especially in the field of mechanical systems control [1,2,3,4,5,6]. Depending on the application at hand, one may be facing two cases [7]: positional control, arising in the regulation of first-order as well as second-order systems, and velocity control, arising in the regulation of second-order systems on manifolds. Embedded manifold setting proves useful in modeling large-scale constrained systems, since a state manifold allows one to take into account non-linear constraints on a non-linear system’s variables seamlessly. A number of scientific papers, both of theoretic and applied nature, were published over the years to investigate such extensions. Since the early days of control theory, it was clear how geometric methods are of prime importance in the study of dynamical systems and their regulation [8,9,10,11]. For a short review of contributions in the theory of Lie-group as well as general manifold system regulation, interested readers might like to consult [7].
A challenging problem in feedback control on smooth manifolds is the synchronization in time of non-linear dynamical systems, an appealing application of which is in the synchronization of a follower oscillator, whose state-variables take the role of masking carrier, to a leader oscillator, to be employed in data encryption for secure transmission over digital networks [12]. In such application, it is assumed that the leader oscillator state variable is completely observable, that its output coincides with its state and that its dynamics is completely specified.
The present tutorial paper is devoted to describing and illustrating key concepts in manifold calculus such as covariant derivation and manifold curvature with applications to formulating, controlling and studying the properties of dynamical systems whose state-space is a smooth manifold. Its content may be summarized as follows:
- it provides a clear and well-motivated introduction to advanced concepts in manifold calculus, the basis of system and control theory on manifolds, with special emphasis to its computational and application aspects. The present contribution provides practical formulas to cope with those real-valued manifolds which, in the author’s experience, are most accessed in engineering and applied science problems. As a matter of fact, complex-valued manifolds are not treated at all;
- it clearly states and illustrates the idea that, when dealing with a computer-based implementation of dynamical systems, it is necessary to discretize the differential equations that describe the system in a suitable way. In order to perform such discretization, standard discretization methods (such as the ones based on Euler forward-backward discretization) are not advisable since these do not work as they stand on curved manifolds. It is therefore advisable to resort to more sophisticated integration techniques such as the one based on geodesics.
The present paper is organized as follows. Section 2 is dedicated to a recapitulation of the notation used in the first part [13] as well as the present part, along with the main concepts covered by the first part. The recapitulation offered in Section 2 is not very detailed as readers that are approaching the present part of the tutorial are supposed to either get themselves sufficiently acquainted with the introductory material presented in the first part or to be already familiar with it. Section 3 introduces the key notion of covariant derivation of a vector field, which generalizes to manifold the familiar notion of directional derivative from real calculus. Section 3 discusses such concept in detail, starting from motivations behind its definition. Section 4 takes off by some observations about iterated covariant derivation and lands at the key concept of manifold curvature, which is a quintessential notion in manifold calculus. Section 5 surveys the notion of calculus of variations on manifolds by the help of a family of smooth curves and shows how to derive a general kind of continuous-time dynamical systems described by second-order differential equations on manifold. Control systems to regulate such second-order dynamical systems are discussed in Section 6 with special emphasis to time-synchronization. Section 7 presents an overview of second-order discrete-time systems on manifolds with special emphasis on ARMA-type random path generators. Section 8 concludes this tutorial.
As a distinctive aspect of the present tutorial, the main flow of discussion is based on coordinate-free (or component-free) expressions, which facilitates the implementation of the main equations by a matrix-friendly computation language such as MATLAB©. Starred subsections, namely subsections marked by an asterisk (*), treat some specific arguments, some related to coordinate-prone manifold calculus and some of minor importance in the present context, which may be skipped by the uninterested readers without detriment to the comprehension of the main flow.
2. Notation and Recapitulation of Fundamentals
In the present section we recall some notation and the principal notions laid out in the first part [13]. For details on derivations, we point the reader to such introductory part.
In the present paper, a column array is denoted by a lower-case symbol, while a matrix is denoted by an upper-case symbol. An element of a manifold is however always denoted by a lower-case symbol.
It is convenient to recall a few manifolds of interest in application, defined in embedded (coordinate-free) terms:
- Hypercube: The simplest manifold of interest is the hypercube , which is essentially the set spanned by p real-valued variables (or p-tuples). A generalization of the hypercube is the set of real-valued matrices.
- Hypersphere: A hypersphere is represented as and is the subset of points of the hypercube with unit Euclidean distance from the point 0. This is a smooth manifold of dimension .
- General linear group and special linear group: The general linear group is defined as }. The special linear group is defined as }. It represents the set of invertible linear operators of given dimension.
- Special orthogonal group: The manifold of special orthogonal matrices is defined as . It represents the set of hyper-rotations in a hypercube of given dimension. In the definition, the symbol denotes a identity matrix.
- Stiefel manifold: The (compact) Stiefel manifold is defined as:where .
- Real symplectic group: The real symplectic group is defined aswhere the symbol denotes a whole-zero matrix.
- Manifold of symmetric, positive-definite (SPD) matrices: The space of symmetric, positive-definite matrices is defined aswhere the notation indicates a positive-definite matrix.
- Grassmann manifold: A Grassmann manifold is a set of subspaces of spanned by p independent vectors, namelywith being a p-tuple of arbitrary and linearly independent n-dimensional arrays.
A fundamental assumption of the present tutorial is that any manifold is embedded into an ambient space of suitable dimension. The ambient space is assumed to be metric, namely to be associated with a scalar product .
A smooth curve is generally denoted as , with . A curve may be thought of as a trajectory generated by a dynamical system whose state-space is a smooth manifold.
The tangent space to the manifold at a point x is denoted as . The tangent bundle is a collection of tangent spaces, namely . The concept of normal space complements that of tangent space, namely, the normal space of an embedded manifold in a given point under a chosen metric is defined as .
A vector field on a manifold is a function that assigns a tangent vector to every point . A vector field may even depend on the time parameter, in which case it will be denoted as with .
In order to take into account non-linear constraints on the state variables of a dynamical system, it is convenient to introduce the notion of curved state space under the form of smooth manifolds. This logical process leads to a type of first-order (or single-integrator) dynamical systems described by
where denotes the state-transition function of the system and denotes the state of the system at any given time t.
Given two Riemannian manifolds and , let : A pushforward map is defined such that for every smooth curve , such that , it holds:
In general, given a function that maps a point x from the manifold to the point on the manifold , the map associates to any tangent vector v belonging to the tangent space a tangent vector belonging to the tangent space by ‘pushing’ the former to the latter space.
Manifolds that also enjoy the distinguishing properties of an algebraic group are termed Lie groups. The tangent space of a Lie group at the identity is termed Lie algebra and is denoted as . Any Lie group admits an exponential map, denoted as .
A Riemannian manifold is defined as a smooth manifold whose tangent bundle is equipped with a smooth family of inner products termed ‘metric’. There exists a canonical metric related to the ambient space that a manifold is embedded in, given by , where denotes an inner product in . Such expression is based on a metric kernel . A derivative of the metric kernel with respect to a curve is defined by
where denotes any smooth curve.
A geodesic arc is defined as the curve satisfying the following differential inclusion
where denotes a fundamental form. A solution of such differential inclusion that joins two points , taking x as the departure point, is denoted as , while a solution with prescribed departure point and velocity at departure is denoted as . The length of a geodesic defines a distance between its endpoints.
To a geodesic arc is associated a map defined as . Such function is termed exponential map with pole . The inverse function associated to the exponential map is termed logarithmic map. On a manifold where it is defined an exponential map , a logarithmic map is denoted as . A logarithmic map satisfies the identity .
An orthogonal projector is a function that maps an element of the ambient space to a tangent vector in a specific tangent space, namely . A derivative of the orthogonal projector with respect to a curve is , which is defined as
where denotes any smooth curve.
Let us recall a property of the derivative that will turn useful in the following development. Whenever is applied to a pair of tangent vectors, it will return a normal vector, namely the restriction .
Example 1.
Let us show the behavior of the derivative just described in the case of the manifold endowed with its canonical metric. In this case , therefore taken and a curve such that , it is seen that
The matrix sum in parentheses is symmetric, hence is a normal vector at R.
The Riemannian gradient of a smooth function is denoted as and may be written by the explicit formula , where denotes the Gateaux derivative of f in . A golden calculation rule involving the notion of Riemannian gradient and that of Riemannian distance is .
Parallel transport of a vector along a geodesic joining points is indicated by . By this notation it is understood that x is the departure point. Its calculation is based on a Christoffel form of the second kind. It is convenient to define two Christoffel forms, namely the Christoffel form of the first kind as
and a Christoffel form of the second kind as . A property that involves parallel transport and manifold logarithm is .
3. Covariant Derivative of a Vector Field
Covariant derivation provides a kind of derivative to evaluate the rate of change of a vector field along a given direction starting from a given point. Covariant derivation is a generalization to manifolds of the familiar notion of directional derivative.
The present section is organized as follows. Section 3.1 presents a brief review of the standard directional derivative and of its properties. Section 3.2 presents a coordinate-free conceptualization of covariant derivation along a curve, which is the main topic of the present section. Section 3.3 introduces the more general notion of covariant derivation of a vector field along a given direction and that of connection. Section 3.4 discusses again the notion of Riemannian Hessian operator and shows how it can be obtain by the covariant derivative of the Riemannian gradient. Section 3.5 discusses a definition of covariant derivation through axioms and shows that the introduced Levi-Civita connection meets such axioms. Section 3.6 retraces the notion of covariant derivation in coordinates. Section 3.7 introduces two new concepts, namely the commutator of two vector fields and the torsion field, which are indeed closely related. Section 3.8 illustrates and explains the consequence of a quite remarkable relationship between the Christoffel form of the second kind and the linearization of the projection operator. Section 3.9 discusses the notion of Lie derivative, a primeval attempt to define differentiation on tangent bundles.
3.1. Brief Review of Directional Derivative and of Its Properties
Let us briefly review the notion of direction derivative to better understand its manifold counterpart. Let us consider a smooth scalar multivariable function . Given a point , a directional derivative essentially measures the rate of change of the function f when moving away from p along a given direction. Two points come immediately to mind while endeavoring a definition for a directional derivative:
- Since f is a multivariable function, because its domain is , there are plenty of ways to ‘move away’ from the foot-point p. The rate of change of the function f depends on the direction towards which one moves away from point x, hence a direction needs to be specified. For example, if one takes , the function f changes little along the x axis, changes more along the y axis, increases along the positive direction of the z axis and decreases along the negative direction of the z axis, not to mention the behavior corresponding to every possible combinations of the triple .
- Since f is arbitrary, its rate of change depends also on ‘how far’ one moves away from the foot-point p. Taking a large leap is, however, not very useful, as one may always replace the foot-point p with a new foot-point to know how the function behaves far from p. Therefore, it is understood that the rate of change is evaluated in close proximity of the point p.
Consequently, a possible definition of directional derivative is
From multivariable calculus it is known that
where and denotes the gradient (or Jacobian, or Gateaux derivative) of the function f with respect to its three scalar arguments, namely . The rightmost quantity is, according to the general definition (6), the differential of the function f, at point p, applied to the vector v. In this case, however, since we are working in , the ‘vectorial’ nature of v is not immediately apparent.
Let us now generalize this notion by taking a smooth function , which is an array-type multivariable function, which we can call a multivariable vector field. The function f assigns an array to any point of its domain. Therefore may be though of as a vector field (once again, its vectorial nature as understood in manifold calculus in not apparent). Like before, a notion of ‘rate of change’ of a vector field along a given direction may be expressed as
Although the definition just given is formally identical to (12), the space where the two directional derivatives live are different. From multivariable calculus we get:
where denotes the ith component of the m-dimensional array p, denotes the ith component of the m-dimensional array v and denotes the ith component of the array-valued function f (with respect to the canonical basis of the domain and of the codomain space ). Hence, in this generalized scenario, , where denotes the Jacobian matrix of the transformation . In this case, the rate of change itself is multivariable, hence this directional derivative is related to the notion of differential in the sense that .
To complete this brief survey of directional derivation, let us emphasize a few properties that the directional derivative enjoys:
- () Assume that the direction v along which the rate of change is sought is given by the superposition of two partial directions, namely , with being two scalar fields, namely and . How the rate of change relates to rates of change and along partial directions is easily understood:
- () Assume the field f be given as the superposition of two concurrent fields, namely . The way the rate of change relates to rates of change and is also easily understood:
- () In addition, assume the field f is given as the dilation of another field, namely , with being a scalar field. How the rate of change relates to the rate of change is readily understood:by the rule of partial derivation of compound functions.
Having recalled the notion of directional derivative of a vector field in , we are now going to examine and discuss its extension to manifolds.
3.2. Coordinate-Free Conceptualization of Covariant Derivation along a Curve
The notion of covariant derivation may be conceptualized in different manners. We recall here two relevant ways:
- Top-down from parallel transport: Assuming that a notion of parallel transport is available, which enables us to compare the values of a vector field at different points of a manifold, covariant derivation stems quite naturally by adapting the classical definition of directional derivative (14). In this sense, covariant derivation represents a ‘local’ version of parallel transport.
- Direct axiomatization: In this instance, covariant derivative is defined to be an operator that meets certain requirements, which basically retrace properties , , noted in Section 3.1. Such definition is quite general and leads to a family of covariant derivatives. Moreover, such definition is given independently from that of parallel transport, which may, in turn, be defined as the solution of a covariant-derivative-based differential equation on the tangent bundle . In this sense, parallel transport represents a ‘global’ version of covariant derivation.
In the present tutorial paper, we are going to follow the first way of defining covariant derivation (based on parallel transport), not only because one such flow of discussion would be coherent with the general setting of the present series of tutorials, but also because the top-down conceptualization is deemed to appear more clear to control engineers and applied scientist and well-suited to develop numerical methods to simulate the behavior of second-order dynamical systems on manifold.
As a first notion to be defined, we are going to deploy covariant derivation along a curve. Namely, we assume that a vector field is well-defined and regular along a smooth curve and we need to evaluate the rate of change of the vector field along the curve itself. Such rate of change will be denoted as , where denotes the value of the parameter that individuates univocally a point over the curve . Such rate of change is precisely the covariant derivative of the vector field w along the curve at the point t, and is defined as
Such definition appears as a natural extension of the directional derivative (14) of a vector field, properly corrected by parallel transport to align vectors belonging to different tangent spaces. It should be noticed that the curve does not necessarily need to be a geodesic. In fact, in system and control theory, it likely represents a trajectory generated by a dynamical system over its state manifold, which only occasionally coincides with a geodesic arc.
In order to derive an explicit expression of the covariant derivative, let us evaluate the time-derivative of the function defined as follows:
By definition of derivative with respect to the scalar parameter t, we have
By the property of conformal isometry of parallel transport (with respect to the inner product), it follows that
hence the derivative may be rewritten as
On the basis of the above development, it is possible to write an explicit expression for the covariant derivative of a vector field along a curve based on the (derivative of) the metric kernel. Let us rewrite the function as . Then, it holds that
where denotes, for brevity, the total naïve derivative . Notice that
where denotes the usual Gateaux derivative and appears as a linear operator (apparently, such operation is defined in the ambient space rather than in ). Let us rewrite the above expression by introducing a simple trick:
Let us examine each term separately. The term (A) in (26) may be rewritten as
by definition of orthogonal projection.
The term (B) may be rewritten as thanks to the commutativity of the derivative with respect to the ambient inner product. Moreover, by adding an identity gives , where cumbersome parentheses were (and will be) omitted for brevity. Invoking again the property of the metric kernel to be self-adjoint and the definition of orthogonal projection leads to the identity
where we have used quite liberally the ‘extendability’ property of the metric kernel to the ambient space.
The term (C) may be rewritten as and, introducing an orthogonal projector, we get , hence
It is now worth noticing that the term (D) may be rewritten equivalently as and, in addition, observing that , we may conclude that
Summing up the four terms in (26) written in the indicated ways, we obtain the expression
Comparing the above expression with the relationship (23) and recalling that the vector field is arbitrary, one gets the two identities
The two above expressions are coherent, in fact, setting in the relationship (32) yields the relationship (33).
The relationship (32) may be rewritten through the Christoffel form of the first kind. In fact from the expression (11) it follows that the term (B) in (26) may be rewritten as . As a consequence, the covariant derivative (32) may be rewritten equivalently as
a classical expression in manifold calculus. In terms of Gateaux derivative of the vector field w, we may also write
which evidences how is linear in .
Two special cases of the above relationship are particularly worth examining separately:
- Case of normal Christoffel form. Whenever the manifold is endowed with a normal Christoffel form , namely whenever , the general expression of the covariant derivative may be simplified asnamely, the covariant derivative of a vector field coincides with an orthogonal projection of the naïve derivative of the vector field.
- Case of geodesics. When , namely the vector field under derivation coincides with the velocity field of a curve, it holds that . If is a geodesic curve, since in every point of a geodesic, then it turns out that, on every geodesic line,In fact, in a ‘direct axiomatization’ setting, the above relationship is taken as one defining a geodesic!
We shall revise the expression (34) in Section 3.8 where we shall show that projection is not strictly necessary, although taking out the projection operator does not necessarily simplify the related calculations!
From the above calculations one might infer that, for all practical purposes, the operators and commute, namely .
Example 2.
In all instances in which , as in the case of the unit hypersphere endowed with its canonical metric, the property is trivially verified.
Let us consider the case of the manifold endowed with its canonical metric, for which and (uniform). In this case
Let us further consider the case of the Stiefel manifold endowed with its canonical metric, where and . In this case, it holds
hence the property of commutativity of the two operators is verified once again.
In fact, the commutation property just exemplified may be proved quite easily by observing that, for every and , the following equality chain holds:
where we used the shortened notation for . The above equality chain implies that .
Let us consider a few examples of calculation of covariant derivative.
Example 3.
As an example of calculation of covariant derivative, let us consider the case of the unit hypersphere endowed with its canonical metric. The orthogonal projector reads . According to the relationship (36), the covariant derivative would read
Let us verify the same result by applying the definition (19). Let us recall the expression of the parallel transport operator on the hyper-sphere that reads . By direct calculation we get
Setting leads to
Hence the two results coincide, as expected.
Let us now compute the covariant derivative of a vector field in . Let us recall that, for the manifold of symmetric, positive-definite matrices, we found (cfr. [13]) that . It is straightforward to verify that, for every , it holds that , as well as that, for every vector field and every smooth trajectory , it holds that the naïve derivative . As a consequence, in the present case the general relationship (34) simplifies into
which, by the expression of the Christoffel form, reads
To end with this series of examples, let us compute the covariant derivative of a vector field in . Let us recall that, whenever the manifold is endowed with its canonical metric, it holds , while the Christoffel form of the second kind takes the expression . Indeed, the exact expression of the Christoffel form, in this case, is irrelevant, since it maps to the normal bundle . Therefore, over a trajectory , the covariant derivative of a vector field reads
It is a useful exercise to verify by direct calculations from the definition that the above expression is correct.
3.3. Covariant Derivation along a Direction, Connections
Let denote the covariant derivative of a vector field at a point t of a curve . In some cases, one might want to indicate explicitly the direction along which the derivative is taken, which coincides with . Referring the foot-point and the tangent direction at , in these cases one may set and denote the covariant derivative by the shortened notation
In this case, the covariant derivative does not depend on the curve , except for and . (If it is necessary to specify the foot-point where all these computations take place, the elongated notation may be used). For example, the covariant derivative of a velocity field with respect to itself, namely the covariant acceleration, would read .
An extension of the covariant derivative of a vector field along a given direction leads to the notion of connection . Let us consider a vector field that is to be derivated and a further vector field that associates to each point of the manifold (or a region of the manifold) a direction along which derivate. In other terms, at a point , defines the direction along which the rate of change of the vector field w is sought. The corresponding covariant derivative, denoted by or , extends the notion of covariant derivation of a vector field along a curve and is based on the assumption that the vector field w is defined in every point of (or within a region of the manifold). The formal definition goes like
The connection described in the present tutorial is commonly referred to as Levi-Civita connection.
A metric connection is a covariant derivative that ‘agrees’ or is ‘coordinated’ with an inner product in the sense that the covariant derivative of an inner product of two vector fields depends only on the rate of change of such vector fields and does not depend on the rate of change of the inner product itself, namely, given two vector fields and a tangent direction , it holds that
The definition (19) ensures that the resulting connection is metric. Notice, in fact, that is a scalar field, a special case of vector field, that may be subjected to covariant derivation. In addition, notice that in a large number of references, the directional derivative is denoted simply as , leaving understood that a vector field acts as a directional derivative.
Example 4.
We can see this even in the case of a flat manifold endowed with an inner product . In this case, we have that the directional derivative of the inner product of two vector fields along a direction v is
Adding zero-sum terms, and rearranging them, gives:
By the linearity and continuity of the inner product one then gets
In conclusion, one gets
The first two addenda depend only on the directional derivatives of the vector fields, while the third term depends on the rate of change of the inner product along the line . Only when the last term is identically zero the directional derivative is said to be metric, in which case .
3.4. Relationship of Gradient to Hessian by Covariant Derivation
A noticeable result from manifold calculus is that Riemannian Hessian, covariant derivation and Riemannian gradient are related to one another in a neat way, namely
The gradient builds a vector field over a curve based on a scalar function and the Hessian represents the rate of change of such vector field along the curve. In order to prove such neat result, it is convenient to re-derive the expression of the Riemannian gradient in a different way compared to the one derived in [13].
Let us recall that, given a scalar function and a smooth curve with image in , it holds that
where denotes, as usual, the ambient space that the manifold is embedded in and denotes an Euclidean scalar product that the ambient space is endowed with. Deriving with respect to t the rightmost expression gives
Let us examine separately the above three addenda. The first addendum may be rewritten as
The second addendum may be rewritten as
Since the expression (56) is path-dependent, we choose the curve to be a geodesic line, hence the third addendum takes the equivalent expressions
Gluing pieces together, one obtains
By definition, the Hessian of a scalar function at a point , along a given tangent direction , is such that
where and , hence we may conclude that
Let us examine two examples of calculation of Hessian.
Example 5.
As a first example, let us consider the function , where denotes a Riemannian distance and denotes a fixed reference point (This function is sometimes referred to as Synge’s world function in relativistic physics [14]). As we have proven, in this case , where denotes a logarithmic map. In this example, therefore, it turns out that
The papers [15,16] further elaborate on the notion of Hessian of the Riemannian squared distance.
As a second example, let us consider the Hessian of Oja’s function (or Rayleigh quotient) in , with constant. Since is normal, we can conclude that , with denoting a geodesic arc such that and . Since
we have
Setting gives
Notice that the third and the fourth addenda are normal to , hence we conclude that
In order to judge about the positive-definiteness of the Hessian, let us evaluate the inner product
with (recall that, in the positive-definiteness test, ). The Hessian is positive-definite at a given point x if for every .
3.5. Axiomatization of Covariant Derivative and Relationship to Parallellism
A notion of covariant derivative may be defined arbitrarily as long as it meets generalized versions of the properties (), (), () recalled in Section 3.1, which qualify covariant derivative as a generalized directional derivative. Denoting the covariant derivative of a vector field in the direction at a point as , we may require it to satisfy the following properties:
with , and being scalar fields and where denotes a tangent map (pushforward). The properties and establish the linearity of the covariant derivative with respect to its vectorial arguments, while the property represents a sort of product rule.
We may verify that the connection defined in Section 3.2 satisfies the above axioms through the expression (35):
- Proof of property : The covariant derivative may be written explicitly asSince both the Gateaux derivative and the Christoffel form are linear in their vectorial arguments, the property follows.
- Proof of property : The vector field may be expressed explicitly asBy the linearity of the terms in parentheses with respect to their vectorial arguments, the proof follows.
- Proof of property : The covariant derivative may be re-expressed asNoticing that leads to the sought result.
Considering as primary the notion of covariant derivation, it is possible to recover the notion of parallelism from covariant derivation. Let us denote by a smooth curve on and a smooth vector field defined (at least) on the image of . Such vector field is said to be parallel along γ if
Conversely, setting up one such differential equation equipped with an initial condition , its solution generates a parallel transport field of the vector along the curve γ from 0 to t denoted as .
A metric connection makes parallel transport be a conformal isometry. To show this important property, let us consider again a curve such that and two vectors . Parallel transport defines two vector fields along such curve, namely
Define . Since the connection ∇ is metric, the function s stays constant for any . In fact, by definition of metric connection, it holds that
with no extra terms. Since the vector fields and are parallel along the curve , it holds that , which proves the assertion. A consequence of the above-proven property is that
for any . This equality has two important consequences:
- Parallel transport preserves the length of transported vectors: In fact, if we take , we get . This means that parallel transport realizes an isometry.
- Parallel transport preserves the angle between transported vectors: In fact, if we define the cosine of the angle between two tangent vectors as their inner product normalized by their length, we getwhich means that parallel transport realizes a conformal map.
The following example shows explicitly a property of parallel fields along geodesics.
Example 6.
Let us prove directly that a vector field parallel along a geodesic curve γ keeps the function constant with respect to the parameter t. Recall that , hence
On a geodesic line it holds that , hence
Since , we may write that
Since, by assumption, the vector field w is parallel along γ, its covariant derivative , therefore , hence keeps constant at its initial value .
3.6. Coordinate-Prone Covariant Derivation*
The fundamental requirement at the very core of a connection is that a covariant derivative of a basis vector field in a tangent direction given by another basis vector is a tangent vector itself, namely . Such a basic fact may be expressed by the fundamental relationship:
where are coefficients that describe the structure of a connection. The elementary covariant derivative quantifies the rate of change of the basis vector field along the direction specified by the further basis vector field .
By applying the properties (69)–(71), it is readily obtained that:
where the functions and the functions are the components of vector fields in expressed in the canonical basis . If , the basis vectors stay constant across the tangent bundle and the covariant derivative simply coincides with the directional derivative, namely
In general, one needs coefficient-functions to define a connection on a p-dimensional manifold, which may be specified arbitrarily and give rise to an independent definition of connection.
A special choice of connection coeffients leads to the Levi-Civita connection, where , namely the Christoffel symbols of the second kind associated to the metric tensor of components which read
Let us survey the rationale behind Levi-Civita connection through an example.
Example 7.
Let us revisit the notion of geodesic. On a manifold , replacing in the Equation (84), leads to the expression . Setting the covariant derivative to zero leads to the geodesic equations for the components :
Choosing to endow the manifold with a Levi-Civita connection, namely setting makes the geodesic Equation (87) coincide to the equation given in [13]. A different choice of connection would lead to a different geodesic equation and, ultimately, to a different calculus setting for the manifold .
3.7. Commutator of Vector Fields and Torsion Field
Let us take again two vector fields and a connection . Apparently, in general there need not to subsist any relation between the covariant derivatives and . In the case of a Levi-Civita connection, there exist however a precise relationship between such derivatives, which is usually expressed through the sentence a Levi-Civita connection is torsion-free.
In order to elucidate such concept, let us define two notions, namely, commutator of two vector fields and torsion. The commutator (or Lie brackets) of two vector fields is a further vector field denoted as
namely, as a projected version of a comparison between the rate of change of the vector field v along the direction and of the rate of change of the vector field w along the direction . (As usual, the Gateaux derivatives are taken in .)
The torsion field is defined as
Torsion measures the degree of non-commutativity of directional derivations as the difference between directional derivatives of two vector fields with respect to each other depurated from their own non-commutativity. In other words, torsion measures how closely the commutator of vector fields can be recovered from the connection. Notice that, in Euclidean spaces, it holds that and , hence torsion is certainly null. As we shall see later, null torsion induces a relationship between two otherwise unrelated concepts, namely covariant derivation and Lie derivation.
Now, let us prove that the torsion field is identically null. Let us recall that
hence
since the Christoffel form is symmetric in its vectorial arguments. The assertion that a Levi-Civita connection is torsion free hence follows. We underline how the result (91) is noticeable per se, because it implies that, while the covariant derivative depends on the Christoffel form, the difference between two covariant derivatives with swapped arguments does not. Another way to see this property is to write .
3.8. A Remarkable Relationship Linking the Operator to the Christoffel Form
Let us recall that the bilinear operator arises from the directional derivation of the orthogonal projection operator which, in turn, does not depend at all from the metric that a manifold is endowed with, as it depends only on the structure of the tangent spaces to the manifold and on the Euclidean ambient space ’s metric. In fact, by definition of orthogonal projector, it should hold that , for every , and .
Let us recall how orthogonal projectors are computed for a few manifolds of interest.
Example 8.
Let us recall how to determine the expression of an orthogonal projector . By definition it must hold that , hence . Since , it must also hold that , therefore and hence . The directional derivative of such projector along a tangent direction reads
Let us further recall how to derive the expression of an orthogonal projection operator . By definition, it must hold that , hence , with . Since , it must also hold that . Straightforward calculations show that and hence , where we have used the cancellation implied by . The directional derivative of such projector along a tangent direction reads
In addition, let us survey the case of the manifold of symmetric, positive definite matrices by determining . By definition of orthogonal projection, it must hold that , therefore , with . Since , it is necessary that , therefore , and hence . The directional derivative of this projector along the tangent direction is identically zero, namely
since does not actually depend on the foot-point P.
To end this example with, let us consider the case of the Stiefel manifold . In this case, . From the condition it follows that , with . From the tangency condition it follows that , hence that . Ultimately, we get . The directional derivative of such projector along a tangent direction reads
that the expression (93) is a special case of, when .
We further recall that the Christoffel form of the second kind does depend on the metric, through the metric kernel , and from the structure of the tangent bundle . Let us recall the expressions of some Christoffel forms of interest.
Example 9.
For the hyper-sphere endowed with the canonical metric, it holds
(We note, with apologies, that in the first part of this series of tutorial [13], the Christoffel form associated to the unit hyper-sphere was reported with an incorrect sign).
For the special orthogonal group, it holds
For the manifold of symmetric, positive definite matrices, the Christoffel form takes the expression
Instead, the expression
pertains to the Stiefel manifold endowed with the Euclidean metric.
Now we lay out a quite simple observation, that leads to a number of important consequences. Let us consider a smooth vector field , namely, a function that assigns a tangent vector to every point , and a smooth curve . In the previous sections we had familiarized ourselves with the naïve directional derivative , that actually stands for . Now the fundamental observation that we are underlining is that, although is an array in the ambient space , this does not mean that its entries are unconstrained. In other terms, the naïve directional derivative of a tangent vector field is not a tangent vector field, yet it must obey some constraints.
Let us express such constraints in an analytic way. For the sake of notation conciseness, we are going to make use of the following notation: For any , we shall denote by the quantity .
Let us observe that, by definition of tangent vector field, on a smooth curve it must hold that . Deriving with respect to the parameter t gives:
Setting , and , gives
The above result means that the normal component of the naïve velocity of a vector field is completely determined by the directional derivative (hence only its tangential component may be prescribed freely). (This finding further clarifies the meaning of the bilinear operator ). It is worth underlining that the relation (101) only involves the restriction in and is fully compliant with the previous finding that .
An important consequence of the above finding is as follows. We know that a tangent vector field is parallel along a curve if it satisfies the differential equation (in )
Splitting the members of the above equation into their tangential and normal components, and retaining the latter, we get, for a point :
where . Comparing the relationship (101) with the relation (103), we immediately get that
namely, the anti-normal component of the Christoffel form (of the second kind) equals a restriction of the directional derivative of the projection operator. Two seemingly unrelated operators are quite tightly bound to one another!
Let us verify the above important finding on a number of cases.
Example 10.
For the hyper-sphere endowed with the canonical metric, the restriction of the directional derivative to the tangent bundle reads
because . Notice that, in this example, , hence it coincides with the Christoffel form (96) up to a sign switch.
For the special orthogonal group endowed with its canonical metric, the restriction of the directional derivative to the tangent bundle reads
To conduct the above calculations, we have used twice the property that for every . Notice that, even in this case, , hence coincides, up to a sign, with the Christoffel form (97).
In the case of the manifold of symmetric, positive definite matrices endowed with its canonical metric, the directional derivative of the orthogonal projector, although restricted to the tangent bundle, is still identically null, namely . On the other hand, from the expression (98), it is immediately verified that , hence the two expressions coincide.
For the Stiefel manifold, the directional derivative of the orthogonal projector restricted to the tangent bundle reads
since the second term on the right-hand side cancels. Once again, this quantity, which is purely normal, coincides (up to sign) with the Christoffel form of the second kind (99) which, in turn, appears to be purely normal.
Notice that in all these examples, the expression of results to be symmetric in its vectorial arguments.
The property (104) prescribes specifically the normal component of the Christoffel form of the second kind to be independent of the metric. Such prescription implies several important consequences, that we are going to explore in the following.
A consequence to underline is that for every pair , namely, the restriction of the directional derivative of the orthogonal projector to the tangent bundle is symmetrical in its arguments.
A further consequence is that, although for a non-parallel vector field along a curve it holds that , from the relationships (101) and (104) it follows that
This latter relationship also implies that the normal component of the naïve velocity of variation of a (non-parallel) vector field may be written as a function of the vector field itself.
As a further consequence, let us examine the expression of the commutator of two vector fields as defined in Equation (88). Given two vector fields , their commutator at a point x is given by
Since for the two vector fields in must hold that
by the symmetry of the operator restricted to the tangent bundle it follows that
An even deeper consequence of the property (104) concerns the computation of the covariant derivative of a vector field. Let us recall that, given a vector field and a tangent vector , the covariant derivative of w in the direction v at a point x is given by
Since the second and the fourth terms in the above sum cancel out, we rest with
In other terms, the sum is already an element of the tangent bundle, without any need for projection. A similar relationship holds for the covariant derivative along a smooth curve, namely
The above two expressions of the covariant derivative may be used interchangeably with those given in Section 3.2 whenever they are deemed to be easier to apply, as in the case of iterated derivatives covered in Section 4.2.
Let us verify the above general relations by a number of examples.
Example 11.
Let us consider, as a first case of study, the computation of the covariant derivative on the unit hyper-sphere endowed with the canonical metric. In Example 3 we have evaluated the covariant derivative of a vector field in two different ways to prove their equivalency. Let us repeat such calculation by means of the relationship (114):
Now, since , deriving once gives , therefore
as in Example 3.
The case regarding the manifold of symmetric, positive-definite matrices endowed with the canonical metric is self-explanatory, since the Christoffel form of the second kind is purely tangential.
The case concerning the special orthogonal group is more interesting. In particular, in [17] (Theorem 1), the present author gave the following expression for the covariant derivative of a vector field along a curve :
Such expression was derived applying directly the definition in terms of parallel transport. We would like to show here that such expression is exactly the same as that found in Example 3, but applying the Equation (114). Let us first simplify the expression (117) for the sake of easier comparison. From the orthogonality condition it follows that , hence , therefore the expression (117) simplifies in
Further we notice that from the condition it follows that , hence
in fact , hence the first and the third terms in the second line cancel out. This is the expression found in Example 3. By applying the simplified Equation (114) we get:
On the other hand, from the tangency condition it follows that , hence
which, upon easy simplifications, come to coincide with expression (119).
The projectionless expression of the covariant derivative of a vector field along a curve impacts on a number of differential operators, including the Hessian, which is defined as the covariant derivative of a gradient of a function. In fact, let us recall that, given a smooth function , its Hessian operator computed along a curve is defined as , where now
hence an alternative expression for the Hessian is
The following examples studies in particular two interesting problems involving covariant differentiation and the logarithmic map.
Example 12.
As a first example of relationship involving covariant differentiation and logarithmic map, let us consider the covariant derivative of a logarithmic map field. In particular, given a point and a smooth curve γ, the quantity represents a vector field along the curve γ. One might wonder how the covariant derivative of such vector field along the curve γ looks like. Applying the Equation (114) to the vector field (where x is fixed), we get
Technically speaking, one might notice that and that it might be termed as logarithmic pushforward endomorphism. Such endomorphism was mentioned also in [18].
For the second example, let us consider a manifold of dimension 2 endowed with a distance function , take a point and define a curve γ implicitly such as for some . Notice that γ is a curve only if the manifold is of dimension two (otherwise it would be a ‘surface’ or, more generally, a submanifold) and that such curve is clearly a ‘circle’, centered in x, of radius r. In addition, let us choose a special parametrization of the curve γ known as parametrization by arclength, which essentially means that at any point of the curve it presents unit speed, namely . Now one might wonder as to whether the acceleration of the curve presents any special property (where ∇ denotes the Levi-Civita connection). This is indeed the case: let us examine the problem. As a first consideration, notice that the condition of unit speed implies that , in fact
on each tangent space . In addition, the condition implies that
Now, since at every point of the curve , one must conclude that
namely the covariant acceleration is directed radially. As a by-product, with reference to the first example, we may notice that the condition also implies that
which therefore implies
In fact we may notice that, for a Euclidean space, it holds that and hence it follows that .
To end this section, let us examine a further, perhaps counter-intuitive, example of calculation.
Example 13.
We have seen that an important property of the Levi-Civita connection is that is is ‘metric’, which may be expressed concisely as for any smooth curve γ. In intrinsic manifold calculus, such property is usually expressed through the sentence ‘the covariant derivative of the metric tensor is equal to zero’. One might thus wonder as to whether the same holds for the metric kernel G, namely if . We are going to show now that this is not, indeed, the case.
Recalling that, for a smooth vector field it holds that , we may compute
Therefore, we can conclude that the covariant derivative of the metric kernel reads , where 🟉 denotes an empty slot again. The expression on the right-hand side of the expression (130) is equal to zero trivially only when the metric kernel coincides with the identity.
3.9. Lie Derivation*
Given two vector fields , besides of covariant derivation , there exists a further way to compute the rate of change of v in the direction u, which constitutes the first attempt to define directional derivatives on manifolds: the Lie derivative. Lie derivation does not require any notion of connection, parallel transport nor metrization. We shall however see that Lie derivation and Levi-Civita covariant derivation are related to one another. Although the notion of Lie derivative is presented in this section in extrinsic terms, it is a non-essential topic in this tutorial paper. However, besides of its historical value, we would like to point out that it has important applications in control theory, as summarized in [19].
Let denote two smooth tangent vector fields. The notation indicates the value of u at x. The Lie derivative of a vector field is defined in an analogous way to the covariant derivative, except that the role of parallel transport is played by a pullback map of a flow. We define a flow associated to the vector field u as the solution of the following differential equation:
The function takes a point in the manifold to another point in the manifold, hence its first-order approximation at a point x, given by its tangent (pushforward) map, may be defined as
where denotes any smooth curve such that and . Now, the pushforward map , hence the associated pullback . Namely, the pullback takes a vector from the tangent space to the tangent space . Such operation resembles parallel transport but is however referred to as Lie dragging. Lie derivation is based on Lie dragging in place of parallel transport and is defined as
We notice that the Lie derivative is also related to the differential of a scalar-valued function , since .
In order to find a rule to calculate the Lie derivative of a vector field, we can proceed as follows. By definition we have
where we used the fact that and the last equality is due to the definition (132). By Taylor-expanding the function in t we get
Hence, it follows that
and, consequently, that
Setting results in
Likewise, Taylor-expanding the function with respect to the parameter t gives
Gluing pieces together gives
Namely, the Lie derivative of a vector field along a given direction coincides with a commutator.
Next we discuss, as an example, the meaning of vanishing Lie derivative.
Example 14.
Given two vector fields , it may happen that . Intuitively, such condition means that the two vector fields are related to one another. In particular, recalling the meaning of the flow σ at the basis of the definition (133), mathematicians say that σ is a ‘symmetry transformation’ of the vector field v. In order to make such notion explicit, let us rewrite the expression (133) for a generic point of the flow, namely
If the Lie derivative of the vector field v vanishes along the whole flow of the vector field u, the above expression implies that the function is constant, hence that . Ultimately, a vanishing Lie derivatives implies that
namely that the vector field v along the flow takes a value obtainable by Lie-dragging backward its source value and is completely ‘generated’ by it. This is reminiscent of the construction of a parallel vector field, from an initial seed, by parallel transport.
A consequence of the above result is that a torsion-free connection is nicely related to the Lie derivative, in fact, the condition implies that
while, in general, connections need not be related to Lie derivatives. Notice that, from the above expression, it directly follows that . But there is more to this. In fact, upon introducing a metric and a notion of Lie derivation, a covariant derivative may be computed on the basis of Lie derivatives through the following formula due to Koszul:
where are smooth vector fields and we have omitted for the ease of notation the point where all derivatives are taken. Proving the above identity is a matter of patience. First, let us recall that given a smooth curve such that and , and given two smooth vector fields , for a metric connection it holds that
Setting leads to
Likewise, we may write that
Summing up side by side yields:
which proves the assertion.
To conclude this stub, we may examine an example that discusses the notion of Killing vector field.
Example 15.
A Killing vector field is one along with the Lie derivative of the metric vanishes. Let us make this notion more substantial. Let denote three vector fields and let us recall that , where ∇ denotes a Levi-Civita connection. Now, from the relationship (143) it follows that
therefore, it holds that
The left-hand side of such equation represents the Lie derivative of the metric, namely . Vanishing of the Lie derivative of the metric implies that
that is a condition on the vector field v referred to as Killing equation. Here, the empty slots remind us that the equation must hold for every choice of , hence the Killing equation involves only the differential structure of the manifold.
4. Iterated Derivatives and Riemannian Curvature
The notion of covariant derivative arises as a generalization of the directional derivative of a vector field. However, while directional derivation commutes, covariant derivation does not. Such observation leads to the notion of Riemannian curvature endomorphism.
The present section is organized as follows. Section 4.1 remarks some well-known properties of the iterated directional derivatives in the flat space . The Section 4.2 lays out the notion of iterated covariant derivatives and defines the notion of second covariant derivative, which is shown to be well-defined in contrast to a double iterated derivative. Section 4.3 describes the parallel transport of a tangent vector along a tiny loop and shows that the final result differs from the starting vector, and attributes such discrepancy to the curvedness of a manifold. In Section 4.4, we describe the key notion of Riemannian curvature endomorphism, which quantifies the non-commutativity of second covariant derivative and is taken as a fundamental operator in describing the effects of curvature of a manifold. To end this section, the Section 4.5 describes some curvature-related concepts, such as tidal effects, geodesic deviation and sectional curvature.
4.1. Commutativity of Directional Derivatives in
Commutativity of directional derivatives is a consequence of Schwarz’s theorem. Recall that, given a vector field , and a vector , its directional derivative at a point p is defined as
Given another direction , one may compute the directional derivative of the vector field along u, namely
In other terms, we have that
hence directional derivatives commute.
The above-discussed commutativity test for directional derivatives is based on the notion of iterated directional derivation. To extend such test to manifolds, it is necessary to discuss what it is meant by iterated covariant derivation and through which formulas such iterated derivatives may be computed.
4.2. Iterated Covariant Derivatives and Second Covariant Derivative
Covariant derivation inputs a vector field and outputs a further vector field. Hence, it is possible to compute the covariant derivative of a covariant derivative (and so fort) of a given vector field.
In particular, given a smooth curve with image in , which represents the trajectory generated by a dynamical system, it is conceivable to define the following vector fields along such trajectory:
(It is an interesting research exercise to look for a translation of these terms in one’s local language. For the Italian readers, derivatives of a positional variable from the third order on are translated as strappo, sbalzo, crepitìo, schiocco).
In general, iterated covariant derivatives are better computed through the expression (114) which, applied to speed, reads
Let us consider an example of evaluation of covariant acceleration on the special orthogonal group.
Example 16.
On the special orthogonal group endowed with its canonical metric, the covariant acceleration associated to a trajectory reads, according to Equation (156):
Recalling that is a Lie group and writing the velocity as , with , one may write , hence substituting:
Whenever the group is used to model the rotational dynamics of a rigid body, the above relation is interpreted as follows: The covariant acceleration equals the angular acceleration in the body-fixed reference frame brought back to an inertial reference frame.
Applying again the Equation (114) to the vector field yields
The same procedure may be repeated to get the expressions of the higher-order iterated covariant derivatives. It is worth observing that, for the first time, we encounter a derivative of the Christoffel form, , as well as nested Christoffel forms . Such observation turns out to hold in the more general setting of nested connections applied to vector fields and plays a fundamental role in the computation of Riemannian curvature.
It is worth underlying two special cases.
- A special case arises in the presence of a connection whose Christoffel form is normal, namely , in which case we may access the simplified relation (36) to compute covariant derivative, namely . In such a case, the covariant jolt may be computed as
- A even more particular case is given by a geodesic curve, for which . In this case, the acceleration , hence all covariant derivatives (from acceleration to pounce) are identically zero (no matter what is the structure of the Cristoffel form of the second kind).
Let us examine a few examples of computation of covariant jolt associated to a manifold-valued system trajectory.
Example 17.
The first case treated in the present example concerns the hypersphere endowed with its canonical metric. In this case, the Christoffel form of the second kind reads hence the covariant acceleration reads
In addition, according to Equation (159), the covariant jolt of a trajectory in reads
Let us recall that from the condition , deriving thrice, one gets , hence the last expression is equivalent to
We might have obtained the same expression more easily recalling that, in this case, the Christoffel form is purely normal, hence the jolt is given by the Equation (160). Let us recall that , hence . Therefore, the jolt is computed as
Notice however that, deriving both sides of the condition twice yields , hence the expression (164) is equivalent to the expression (163).
A second case-of-study arises from the special orthogonal group endowed with its canonical metric. In this case, the jolt of a trajectory in computed through the general Equation (159), taking into account that , reads
Even in this case, the Christoffel form is purely normal, hence we may apply the relationship (160), upon recalling that and hence , obtaining
In this case, since the manifold is a Lie group, the velocity over the trajectory may be factored as with . Hence
Therefore one may verify that the above expressions are equivalent to one another and to
Notice that the result of the calculation is a tangent vector in . The so-called rotational jolt (and, in particular, a function of its norm ) was used in several contexts to measure the fluidity of rotational movements. An example may be found in [20,21], which dealt with hips sway fluency assessment in the context of movement analysis.
The last example considered concerns the space of symmetric, positive-definite matrices endowed with its canonical metric. In this case, , hence
and, consequently,
a (perhaps unexpectedly) convoluted expression. Notice that, in this case, the Christoffel form is not purely normal, hence the special relationship (160) does not hold.
We mention that, if a curve has unitary speed, namely , then the norm of its covariant acceleration is taken as a measure of its curvature. Namely, the scalar field
is defined geodesic curvature and quantifies how much a curve differs from a geodesic. Hence a geodesic curve has zero geodesic curvature.
In a similar manner as we have done for the covariant derivatives along a curve, we may compute iterated derivatives of a given vector fields along different directions. Notationwise, given a vector field whose covariant derivative is sought and two directions , a nested derivative will be denoted as . Let us make such abstract notation become substantial. The vector field w needs to be defined in a neighborhood of the point x. Likewise, it is necessary to set up a vector field , well-defined in a neighborhood of x, such that . In this way, the connection yields a vector field, well-defined in a neighborhood of x, which may be further covariantly-derivated along the direction u.
In a even more natural manner, given three vector fields , one may define an iterated connection , whose result may be expressed as
Notice that the term involves the directional derivative of the Christoffel form, as well as the directional derivative of the vector fields w and v, in fact
where denotes the second directional derivative of the vector field w and denotes the derivative of the Christoffel form with respect to the argument x only. Therefore, the nested connection may effectively be computed through the (a little convoluted) expression
where we have made use repeatedly of the linearity of the Christoffel form of the second kind with respect to its vectorial arguments to extend it to the whole ambient space.
An unavoidable characteristic of the iterated derivative (174) is that it includes not only the rate of change of the vector field w and of the Christoffel form (which describes the underlying manifold) but also the rate of change of the vector field v, which may be an unwanted effect. Therefore, on the basis of the nested covariant derivative (174), one may define a second covariant derivative of the vector field w in the directions as follows:
To verify the effects of such definition, it pays to compute first
so that the second covariant derivative may be expressed as
The second derivative looks ‘purified’ from the rate of change of the vector field v.
Observation 1.
For the mathematically-minded, we underline that there exists a technical reason why the iterated covariant derivative is, in some sense, not well-defined while the double covariant derivative is well defined. Let us recall that the iterated covariant derivative is meant to capture the rate of change, along the direction u, of the rate of change of the vector field w along the direction v, at a given point x: in principle, hence is a single tangent vector to which there need not to be associated any vector field. Nevertheless, in order to evaluate the covariant derivative along the direction u, it is necessary to ‘extend’ the direction v locally to a smooth vector field, say : such vector field needs only to obey the constraint , being elsewhere unconstrained. Now, since the nested covariant derivative (174) depends explicitly on , its value depends on the way we have extended the vector v by the field , which makes its expression ‘user-dependent’. As opposed to this, the second covariant derivative (177) does not exhibit such kind of dependence, namely the way one chooses the extension is irrelevant to the value of the second covariant derivative, which results henceforth well-defined (‘user-independent’).
Now, we may wonder whether the second covariant derivative, which depends on the second directional derivative, is independent of the order of derivation, namely if the result changes if one swaps the vectors u and v. From the last expression, we may immediately find that
By exploiting the known symmetry of the Christoffel form along with evident cancellations, one immediately finds that
The net result is that the second covariant derivative does not commute and the degree of non-commutativity depends solely on the behavior of the Christoffel form of the second kind while it does not depend at all by the rate of change of the vector field being derivated (w). A remarkable consequence of the latter observation is that the quantity is an intrinsic property of manifolds.
Let us underline that, for a torsion-free connection as the Levi-Civita one, the non-commutativity measure may be written in terms of commutators. In fact, from the definition of second covariant derivative (175) we get that
Now, recalling that and defining the following covariant differential commutator , the above relationship may be rewritten compactly and meaningfully as
which suggests that the lack of commutativity of the iterated covariant connection is due to the lack of commutativity of covariant connection depurated from the lack of commutativity of the involved vector fields.
To conclude this section, let us examine, as an example, how to generalize the relationship (159) to a generic tangent vector field.
Example 18.
Given a smooth vector field , its covariant derivative along a smooth curve γ may be expressed through the relation (114) as . Applying again the Equation (114) to the vector field yields
where the following abbreviation has been made use of
with being arbitrary constants. As we shall see in the next section of this paper, such linear operator is not just a convenient placeholder but plays indeed a fundamental role in quantifying the Riemannian notion of curvature.
4.3. Manifestation of Manifold Curvature in Parallel Transport along a Loop
Let ℓ denote a simple loop, namely a simple curve on a manifold that starts and ends at the same point x and let denote a test tangent vector. Let us parallely-transport the tangent vector along the closed loop ℓ and let us denote the final result of transport as .
If a manifold is flat, the result of parallel transport along a loop is just the initial vector, namely . On a curved manifold, this is not necessarily the case, as a manifestation of curvedness. Let us examine such phenomenon, referred to as anholonomy, in detail, in the hypothesis that the closed loop ℓ be very small.
To effect parallel transport along the loop ℓ parameterized by , with , we should set-up the differential equation
in the transport field , with initial condition given. Let us observe that, according to the property of the Christoffel form underlined in Section 3.8, the solution of the above differential equation in is indeed a tangent vector field along , hence we may write
Setting leads to
since the loop is closed.
Under the hypothesis that the loop is very small, it makes sense to approximate the loop integral above by means of first-order approximations of the Christoffel form as well as the transport field around the foot-point x, namely
with , where we have made use of the definition (183) and the last equation derives from the relationship (184). Also, we have made use of the approximation . On the basis of the above approximations, we see that
therefore, making use of the approximation , we get
Plugging the above relation into the loop integral (186), we get
Recalling that the Christoffel form is (bi)linear in its vectorial arguments, it is immediate to see that the first addendum is null, namely
therefore ultimately we get that the discrepancy between the transported vector after and before transport is quantified by
up to order . The above relationship may be written in a more ‘symmetric’ way by the following arguments. Let us first notice that
therefore it holds that
In addition, notice that, by linearity and closedness of the integration path, it holds that
therefore, the relationship (195) may be rewritten as
Through this equation, the discrepancy (193) may be rewritten as
One may hence infer that the integrand is hence responsible for the discrepancy between and . In addition, we notice informally that the integrand appears to have exactly the same structure of the right-hand side of the relation (179). (In addition, from the above relation it is easily guessed that the amount of discrepancy increases with the size of the loop.)
4.4. Riemannian Curvature Endomorphism
As we have seen, given a closed loop ℓ with foot-point and a tangent vector , the parallel transport of the latter along the closed loop results in a tangent vector that differs from . Since, however, both vectors and belong to the same tangent space (irrespective of how ample is the loop), one may postulate the existence of a function, call it , that transforms to . Such function is called curvature endomorphism and we write .
As it is customary, we are going to determine the structure of the curvature endomorphism for a special loop, namely a parallelogram. In order to define a parallelogram on a smooth manifolds’ tangent bundle, let us define the following elements
- four points that denote the vertexes of the parallelogram in around the point x,
- two tangent vectors that define the ‘sides’ of the parallelogram,
- a smooth curve that joins the point x to the point a, such that and ; the curve is parametrized through the parameter s, namely a point on such curve is denoted by ,
- a smooth curve that joins the point a to the point c, such that and ; the curve is parametrized through the parameter t, namely a point on such curve is denoted by ,
- a smooth curve that joins the point x to the point b, such that and ; the curve is parametrized through the parameter t, namely a point on such curve is denoted by ,
- a smooth curve that joins the point b to the point b, such that and ; the curve is parametrized through the parameter s, namely a point on such curve is denoted by .
Notice that, by consistency, the point on the parallelogram coincides to the point which, in turn, coincides to the point c, which lays at the ‘opposite’ side to the point x in such parallelogram. Also, notice that the extension of the parallelogram depends on the values of the parameters s and t, hence such parallelogram may be made as small as one wishes by reducing the values of such parameters.
It will be necessary to make use of local extensions of the vectors and of the test tangent vector being parallely-transported. In order not to burden the notation with extra symbols, we shall denote such (arbitrary) extensions by , respectively. These vector fields may be chosen arbitrarily as long as they are sufficiently regular to afford derivation and , , .
As a further introductory element, let us recall a couple of very useful first-order approximations. Let denote a smooth curve in and denote a smooth vector field. From the definition of covariant derivative of a vector field along a given direction we get the following first-order approximation of the vector field F at :
This relationship may be rewritten equivalently as , which appears as a sort of Taylor expansion of a vector field along a curve departing from a given point. Such expression leads to the further approximation
The approximations and will prove instrumental in writing down the following necessary relations seamlessly.
The main idea to define the curvature tensor is that transporting a tangent vector along a parallelogram and comparing the starting vector with the resulting vector is equivalent to transporting the same vectors along two paths, namely along and along and then comparing the two results at c. Formally, we hence define the Riemannian curvature endomorphism as
For the first branch, we have
where the first equality follows from the relation and the third equality follows from the relations and . As a consequence, it holds that
Analogously, for the second branch we get
Hence, after canceling out equal terms and grouping, we get
Dividing both sides by the product and taking the limit for that vanish to zero, we get
In conclusion, upon computing the above limits, we get the expression of the curvature endomorphism applied to a tangent vector as follows
At a first glance, it is not apparent that the combination of differential operators on the right-hand side of Equation (207) should equal a linear endomorphism as in the left-hand side. However, from Section 4.2 we know that not only this is indeed the case, but also that the right-hand side is completely independent from the way we choose the extensions . In addition, from Section 4.3, we know that the expression on the right-hand side is exactly the combination of operators and that quantifies the curvedness of a manifold in the parallel transport around a tiny loop. As a matter of fact, the Riemannian curvature endomorphism, applied to a test tangent vector , may be written explicitly (and solely) in terms of the Christoffel form of the second kind, as
Let us apply the above formula to compute the curvature of a few manifolds of interest.
Example 19.
The first case examined is that of a unit hyper-sphere endowed with its canonical metric. Let us recall that , hence
Applying the Equation (208) yields
Since the last two addenda are null because , we rest with
Notice that the result does not depend explicitly on the point x, hence such Riemannian curvature endomorphism is uniform (in the sense that it takes the same expression in all tangent spaces). It is interesting to observe that the vector belongs to the subspace .
Next we tackle the case of the special orthogonal group endowed with its canonical metric. Let us recall again that the Christoffel form of the second kind reads , therefore
Applying the Equation (208) gives
The addenda in the last expression do not look like they are going to add up and cancel nicely. Easy but lengthy calculations show that, indeed, they do. We are going to show only a few of the many transformations that are necessary:
The net result is
By some matrix manipulations similar to the above and by introducing the -commutator based on the matrix commutator , the above relationship may be rewritten as
Such expression is precisely the one reported in [22].
As a last example, let us consider the space of symmetric, positive-definite matrices endowed with its canonical metric, for which we had found that . The directional derivative of the Christoffel form reads
By applying the Equation (208) we obtain an expression that consists of 16 terms, which however either cancel or add up nicely to one another, resulting in the final expression
that coincides to the one reported in [22].
The Riemannian curvature endomorphism enjoys, together with linearity in its vectorial argument, a number of symmetries. We are going to recall here only those that we are going to actually utilize in Section 4.5, namely
In particular, one might notice that and .
4.5. Tidal Effects, Geodesic Deviation, Jacobi Fields, Sectional Curvature
A massive body subjected to a spatially-varying gravitation field is subjected to tidal effects. In fact, since different parts of the body are subjected to a gravitation pull that varies from point to point, the whole body is subjected to internal mechanical stress.
Formally, such physical effect has an analogous in manifold calculus in the separation of nearby geodesics that may accelerate or decelerate according to the local curvature of a manifold. To quantify such effect, let us define a smooth family of geodesics where
- for a fixed value of the index , the curve represents a geodesic parametrized by the parameter ; the geodesic curve represents the fiducial geodesic; each tangent vector denotes the speed of the geodesic (we shall omit the values of the parameters for the sake of notation conciseness); all geodesics depart from the same point , namely for any s;
- for a fixed value of the parameter , the function represents a smooth curve (but not a geodesic); each tangent vector represents the separation between two nearby geodesics (even in this case, we shall omit the values of the parameters for the sake of notation conciseness); more intuitively, the infinitesimal separation between nearby geodesics whose index s differ by an infinitesimal is . Since all geodesics depart from the same point, it holds that for every value of the index s.
Let us observe that the vector fields v and commute, in fact
by the Schwarz-Clairaut’s theorem.
The vector may be referred to as geodesic deviation. It turns out that such deviations obeys a very neat equation involving the curvature endomorphism. The velocity with which the separation between two nearby geodesic changes is quantified by the covariant derivation , while the acceleration of the geodesic deviation is quantified by . Let us recall that, for a torsion-free connection is holds that . Since , it follows that , hence
From the Equation (207) it follows that
where we omitted the base-point x. Since and because v is the velocity vector of a geodesic, it ultimately follows that
that confirms that the acceleration with which two nearby geodesic originating from the same point deviate from one another depends on the local curvature of the underlying manifold.
The Equation (222) is called a Jacobi equation and its solution is termed Jacobi field. In some circumstances, the Jacobi equation may assume a particularly simple form. To survey one of such cases, let us introduce the notion of sectional curvature.
The curvature endomorphism describes completely the curvature of a manifold in a given point, however the local curvature of a manifold is more easily described by a scalar index, the sectional curvature , defined as
Although the sectional curvature returns a scalar value, it by no means carries any diminished information about the curvature of a manifold, in fact it is possible to recover the curvature endomorphism by the sectional curvature. A special case that serves the purpose of illustrating such property is that of spaces of constant sectional curvature. A manifold possesses a constant section curvature if, for every and every , it holds that . In this case, the curvature endomorphism is given by
Let us illustrate the notion of constant curvature by a brief example.
Example 20.
Let us consider the hyper-sphere endowed with its canonical metric. We already know that its curvature may be expressed as hence, from the property (224), we conclude immediately that the hyper-sphere is indeed a manifold of constant sectional curvature .
For a manifold of constant sectional curvature , the Jacobi equation reads
which is clearly a linear equation in the unknown Jacobi field .
In any case, if one chooses u and v orthogonal to each other and with unit norm, it is immediately seen that , hence the sectional curvature and the Riemannian curvature essentially coincide.
The sectional curvature plays an important role in evaluating as to whether the acceleration of the geodesic deviation will increase or decrease (locally). To formalize such observation, let us define the quantity and let us evaluate its Taylor series expansion up to the fourth order. Here denotes the solution of the Jacobi equation with initial conditions for , with together with for . Notice that
Setting gives
To what concerns the last term, we observe that, due to the linearity of the curvature endomorphism in its vectorial arguments, we have
Setting and recalling that initially , we see that all terms on the right-hand side are null except for the third one, that reads , hence
therefore the Taylor expansion of the function around reads
where the dots stand for higher-order terms that we are not going to pay attention to.
Under the additional hypotheses that and , the above relationship may be rewritten in terms of the sectional curvature. In fact, by the symmetry (218) we know that which, in turn, equals , therefore we may write that
therefore, locally, if the acceleration of geodesic deviation decreases along the fiducial geodesic, while if the acceleration increases. Hence the sectional curvature determines the speed of spreading of nearby geodesics. (Therefore, in some sense, curvature acts as a force, which is the fundamental observation that motivated Einstein to formulate his General Theory of Relativity).
A further interpretation of the sectional curvature in terms of separation of nearby geodesics is worth examining. Let us define again a family of geodesics and let us assume, in particular, that these geodesic lines were shot in a peculiar way such that, if we denote again by v the tangent velocity along the fiducial geodesic and by the separation vector along the same geodesic line, then it holds that , namely the separation vector is orthogonal to the velocity vector along the fiducial geodesic (that is to say, the geodesic curves in this family are parallel to one another). By appropriate parametrization, one can make sure that . Since denotes the tidal acceleration, we may interpret the effect of the curvature on the spreading of the geodesics as follows:
- whenever , the tidal force is directed in the same direction of the current separation, hence nearby geodesics tend to spread even further from the fiducial one;
- whenever , the tidal force is directed in the opposite direction of the current separation vector, hence nearby geodesics tend to get closer to the fiducial one;
- as opposed to the previous cases, whenever , the tide does not produce any effects and the geodesics get neither closer nor farther apart (in fact, this case corresponds to zero curvature).
Since, by the hypotheses done, , it follows that
- in a region of positive sectional curvature, two nearby parallel geodesics tend to converge towards each other,
- in a region of negative curvature, two nearby parallel geodesics tend to deviate from one another,
- in a region of zero curvature, two nearby parallel geodesic curves neither converge nor diverge.
Once again, the spreading of nearby geodesics is evaluated in terms of relative acceleration, because velocity spreading is not peculiar of curvature, since even in a flat space two curves may deviate with non-zero velocity.
A neat result about sectional curvature is that the number does not actually depend on two vectors but rather on the subspace spanned by such vectors, namely if we define a plane , the sectional curvature is better written as . This property is easily proven by defining any linear combination
where such that , and calculating sectional curvature at , that takes the value
Given the linearity of the curvature endomorphism and of the inner product in their vectorial arguments and the symmetry properties (218), it is a little tedious but not hard to find that
and that
hence the assertion.
We conclude this section by an example that justifies the property that, in a space with constant sectional curvature, the curvature endomorphism is parallel (which is often expressed through the dry statement ).
Example 21.
Let us indicate by four smooth vector fields and let us form the function . The application defines a further vector field, that may be covariantly-derivated along z to give . Since is non-linear in x but trilinear in its vectorial arguments, it holds that
therefore, in particular
On a space with constant sectional curvature κ, it holds that for every choice of . In fact, recalling that, in this case, , we have:
The terms on the right-hand side cancel two by two, hence the stated property holds.
5. Continuous-Time Dynamical Systems
This section presents a survey on continuous-time dynamical systems on manifolds with particular emphasis on Lagrangian systems and non-linear damped oscillators. This survey of second-order (or double-integrator) dynamical system is presented in extrinsic terms, as per the main flow of this paper as well as, for the interested readers, in coordinates and through a special formulation based on cotangent bundles.
The present section is organized in the following way. Section 5.1 retraces calculus of variation on manifold and Section 5.2 applies such technique to derive the general equation of motion on a smooth manifold by a Lagrangian approach. The aim of Section 5.3 is to illustrate the same derivation by using the coordinate formalism, while the purpose of Section 5.4 is to retrace the same derivation by using a cotangent-bundle formalism, which is shown to be as valid as the usually-invoked tangent-bundle one. In particular, Section 5.4 digs into non-linear oscillatory phenomena by first revising classical non-linear oscillators and then showing how to extend such dynamical systems to state manifolds.
5.1. Calculus of Variations on Manifold
In the first part [13] of this multi-part tutorial, we studied a problem related to the evaluation of the first-order variation of the fundamental form , defined as , on a manifold and we invoked its ‘extendability’ to a neighborhood of the tangent bundle in the ambient bundle , namely, it is not necessary to assume the well-definedness of expressions like nor . In that occasion, we mentioned that extendability provides a convenient shortcut but that it is not actually necessary to arrive at the desired result. In the present section, we aim at clarifying this point.
Variational methods constitute a core tool in applied mathematics, and with a specific weight in system theory and control, as it affords the formulation of complex dynamics through and elegant formalism and first principles. It is extremely interesting to examine variational methods on smooth manifolds from an extrinsic perspective. We shall briefly survey this topic as a preparation to the next sections were we shall use it extensively.
Let denote an integrable function on a smooth manifold and let denote a smooth curve on a Riemannian manifold . We are going to prove that the first variation of the integral (action) functional
takes the expression:
where denotes the Euclidean inner product that the ambient space is endowed with.
To understand the present development and the properties of the symbol in a coordinate-free setting, let us define a smoothly-parametrized family of smooth curves . The notation may be interpreted as follows:
- the index s determines which curve in the family is referred to; keeping the index s fixed and varying the affine parameter t, namely considering , corresponds to tracing the curve in the family;
- the affine parameter t determines which point on a curve is referred to; keeping t fixed and letting s vary, namely considering , corresponds to traversing the family of curves transversally in correspondence of homologous points; notice that still traces out a curve, although not necessarily endowed with the properties of (for example, while curves may be geodesics, transversal curves might not be geodesics).
Such family of smooth curves will be chosen such that
namely, the family member corresponding to the index value coincides to the curve , while the other members are free except that they share the same endpoints as . As mentioned, for a fixed s, the map traces a smooth curve on the manifold . Likewise, for a fixed t, the map traces a smooth ‘transversal’ curve on . Smoothness implies by the Schwarz-Clairaut theorem. Now, it pays to define the following partial derivatives:
The function is the tangent vector to the transversal curve at the point t, hence . In addition, from the properties (247), it follows that .
On the basis of the above settings, the variation may be written as:
The partial derivative within the integral in Equation (249) may be written as:
Notice that the above expression does not require extendability to a neighbor of but merely the existence of the partial derivatives and . The variation of the action functional in a neighbor of the fiducial curve reads, therefore:
where, in the second integral, we have used the interchangeability of partial derivations permitted by the Schwartz-Clairaut theorem. In addition, integration by parts yields:
The first addendum on the right-hand side equals zero since the transversal variation vanishes at the endpoints of the fiducial curve , hence the variation of the integral functional assumes the final expression (246).
A noticeable consequence of this development is that the variation depends only on the tangential components of the perturbation.
5.2. Second-Order Dynamical Systems on Manifold
In order to formulate second-order dynamical systems on manifolds we shall recall the Euler-Lagrange-d’Alembert modified minimal action principle. According to such principle, a dynamical system with which it is associated a kinetic energy function , a potential energy function and a non-conservative force field f, generates a trajectory on a state manifold such that:
Let us survey in details how the above constituent functions are defined, what is their meaning, and how to evaluate the variation of the integral. We shall assume to be a smooth Riemannian manifold.
The kinetic energy associated to a dynamical system is defined as
with . Except for the factor , the kinetic energy coincides with the fundamental form . As such, the kinetic energy depends on the state of the system as well as on its velocity. (Notice that the ‘mass’ term m has been introduced to retain familiarity with well-known physics concepts. In general, in the present context it is not necessary at all because the ‘mass’ may be more naturally thought of as part of the metric , hence of the metric tensor G.)
The potential energy function associated to the dynamical system under construction is indicated by . The potential energy is generally a function of the system’s state only and, most frequently, of the Riemannian distance between the state and a reference point on the state-manifold.
The function is referred to as Lagrangian function of the system. The integral of the lagrangian function along the trajectory generated by the system, namely , is referred to as total action of the system. (In the international unit system, the Lagrangian is measured in Joule, while the action is measured in Joule · second.)
According to the Euler-Lagrange principle that a dynamical system is supposed to bey, in the absence of non-conservative forces acting on the system, the system generates a trajectory on the state-manifold that makes the total action stationary, namely
There exists a further function, referred to as Hamiltonian function of the system, defined as
that represents the total energy content of the system (at any given time instant). In a system that obeys the Euler-Lagrange principle, the Hamiltonian is an invariant, namely it holds that .
In the presence of non-conservative forces, the formed principle needs to be modified to accommodate the work of such forces, hence it gets extended as in (253), that we shall refer to as Euler-Lagrange-d’Alembert principle.
In order to calculate the variation of the total action, according to the method surveyed in Section 5.1, let us define a smooth family of smooth curves denotes as
where the index s individuates a member of the family, while the affine parameter t individuates a point on that curve. Let us underline that, for a fixed value of the affine parameter, the map traces a smooth curve.
Let us observe that the partial derivative represents a transversal tangent vector, computed as
The above-defined family of curves possesses, in particular, two fixed endpoints, namely the endpoint and the endpoint . In formal terms, this is expressed by
In other terms, the maps and are constants with respect to the index s. As a last definition, we set
A consequence of the assumption (259) is that . The function represents the ensemble of the variations, namely all transversal tangent vectors to all point of the fiducial curve . (The fiducial curve represents, in fact, the trajectory actually followed by the system.)
The variation of the total action may be defined as:
where we have used the shorthand to denote . In particular, the variation of the kinetic energy may be made explicit as follows:
The first and the third integrals on the right-hand side take the same value. In the second integral, it is possible to swap two tangential arguments, therefore
The first integral may be re-expressed by swapping the order of partial derivation and by applying integration by parts, which give:
where, in particular, we may recall from the definition of the ‘bullet’ derivative , that
Continuing with the evaluation of the variation of the integral of the kinetic energy, we get:
Let us now evaluate the variation of the potential energy function, that reads
To what concerns the work of the external forcing term, ultimately it turns out that
Upon summing up all the above terms and setting (wherever the variable is present) we obtain:
Let us observe that, in the first integral on the right-hand side, the scalar product does not change if one adds any arbitrary normal term in , therefore such integral is equivalent to:
In conclusion, the variational principle of Euler, Lagrange and d’Alembert leads to the integral equation
Since is an arbitrary (tangential) perturbation, the above equation implies that the integrand must be zero. Ultimately, the calculus of the variation leads to a class of second-order dynamical systems on manifold of the type
At a glance, it appears to generalize the II equation of dynamics of Newton to any smooth Riemannian manifold. In fact, let us observe that if , then , hence the above equation collapses to the familiar physics law “mass × acceleration = resultant of forces”.
As an example, let us compute the energy balance of a non-conservative system (and, as a by-product, let us verify the energy conservation of a conservative system).
Example 22.
By definition, the total energy of the system is , hence the power law for the system (272) is
From the expression (272) it follows that , and hence
The dissipative components of the external force fields are those that cause a decrease of the energy content of the system, namely those focing terms such that (such as, for example, the viscous damping , with ). The active components are, on the other hand, those which inject energy into the system, such as those generated by actuators and due to active damping effects.
5.3. Coordinate-Prone Lagrangian Formulation of Dynamical Systems*
As recalled, on a Riemannian manifold whose tangent spaces are endowed with an inner product , the non-conservative Euler-Lagrange-d’Alembert principle may be stated as:
where denotes a vector field of external forces which, in the context of mathematical system modeling and control, may include energy-dissipation terms and energy-injection terms that include control fields. On each point of the fiducial trajectory , the quantity denotes a variation with respect to the fiducial trajectory itself. A possible generalization of this principle would arise by replacing the potential function by a generalized potential [23], which would give rise to gyroscopic force fields.
On a smooth manifold of dimension p, where a point is described by coordinates via a coordinate chart, working with intrinsic coordinates eases the expression of the solution of the variational problem (275) in explicit form. In order to express the derivation of the general law of dynamics arising from the extended Lagrangian formulation (275), the following facts are worth summarizing:
- the standard notation for covariant and contravariant tensors indexes as well as Einstein’s convention on summation indexes are in force;
- for any pair of tangent vectors , it holds that , where denote the components of the metric tensor associated to the inner product , namely, based on the canonical basis , ; the functions depend on the coordinates and the symmetry property holds;
- in coordinates, the Riemannian gradient of a regular function at a point is given by:the quantities denote the components of the Euclidean gradient, while the spatial functions denote the contravariant components of the metric tensor;
- the Christoffel symbols of the first kind associated to the Levi-Civita connection corresponding to the metric tensor of components are computed as:the Christoffel symbols of the second kind are computed as ; the Christoffel symbols of the second kind are symmetric in the covariant indices, namely, ; the Christoffel form (a way to express the Christoffel symbol applied to two tangent directions as an array) has components ;
- the kinetic energy functional is expressed by the symmetric bilinear form , namely , where the constant plays the role of a mass term; on a Riemannian manifold, the metric tensor is positive-definite, hence on every trajectory ; in the context of abstract manifold calculus, the mass term is quite immaterial since it does not represent any actual physical quantity, nevertheless, we shall leave it in the equation to help readers although, as already mentioned, it might well be incorporated into the metric tensor g;
- the potential energy function depends on the coordinates only;
- the variations may be chosen arbitrarily except at the boundaries of the trajectory where they vanish, namely, .
In coordinates, the principle (275) may be rewritten as:
where functions denote the covariant components of a cotangent dissipation force field . By calculating the variation in the leftmost integral, the extended Lagrangian formulation may be expressed in the form:
through integration by parts. As the variations may be chosen arbitrarily for the parameter t ranging in the open interval , the equations of dynamics on the manifold are such that , namely:
Thanks to the Christoffel symbols of the first kind , the above equation may be recast as:
Upon multiplying both sides of the above equation by the contravariant tensor and upon summation with respect to the index k, the above equation takes the form:
The above expression may be further condensed by noticing that:
- the terms equal the Christoffel symbols of the second kind ;
- the terms denote the components of the Riemannian gradient ;
- the terms denote the components of the contravariant (tangent) dissipation force field .
In conclusion, by introducing the coordinate array and the velocity array , the Equation (281) may be recast compactly as a system of first-order equations
The following example treats in details the case of viscous damping as external force field and determines the rate of energy dissipation.
Example 23.
The dissipation term is often drawn from viscosity theory, namely, , with denoting a viscous damping term. The system dynamics assumes then the expression:
The dynamics (283) is governed by a driving force from a potential energy plus a viscosity term.
The law of energy balance for the dissipative system (283) may be found by the equation:
Plugging in the expression (278) gives:
The first integral may be rewritten, again upon integration by parts, as:
The second term may be written, upon integration by parts, as:
while the remaining terms may be written as
The energy balance equation may thus be written as:
where stands for and for . The above equation shows that the designed second-order dynamical system looses energy at a rate proportional to its kinetic energy.
A non-dissipative system () devoid of a potential function, hence completely governed by the kinetic energy/metric, produces a trajectory that keeps the quantity that undergoes variation, namely the only kinetic energy, constant over the whole trajectory. As we already knew, therefore, the Christoffel form may thus be computed on the basis of the variational principle:
invoking which is often the preferred path for computing the Christoffel form.
5.4. Cotangent Bundle Derivation of Non-Linear Oscillators on Manifolds*
Nonlinear oscillators arise from the modeling of complex physical structures [24] and make up the basis for a number of modern applications [25]. The state of nonlinear oscillators may form complex, non-repeating (although deterministic) patterns. Contributions from the scientific literature suggest how the simplest harmonic-oscillator models may be generalized to complex single or coupled oscillators [26,27,28,29].
The cotangent bundle, namely the dual of the tangent bundle associated to a manifold, is as valid as the tangent bundle to formulate and study dynamical systems. We mention that some authors in manifold calculus explain the whole manifold calculus itself by starting from cotangent bundles and only later introducing tangent spaces. In the present section, we shall rather be showing how non-linear oscillators may be defined by means of tangent-bundle calculus. We shall start off by revising a number of classical oscillator models very well known in the scientific literature and shall then continue by showing how such mathematical models may be seamlessly be extended to manifolds by means of cotangent-bundle calculus.
The present section makes frequent reference to the work [30] that it is based on.
5.4.1. Classical Oscillators on Euclidean Spaces
A number of nonlinear oscillator models appear in the scientific literature as dynamical systems in a single real descriptive variable. The simplest model known in the scientific literature is perhaps the harmonic oscillator described by the second-order dynamical system:
where denotes a descriptive variable and denotes the natural oscillation frequency of the harmonic oscillator. The state is a function of time, with . As no damping is considered, the harmonic oscillator conserves its initial energy indefinitely. The same consideration applies to the simple pendulum model , which appears as a second-order differential equation that may be easily recast as a system of two first-order differential equations:
where the forcing term on the right-hand side has been written as the gradient of a potential energy function which, in this case, may be defined as . (Notice that we have therewith abandoned the notion of ‘mass’ which, besides of being immaterial in the present context, may always be factored out through appropriate normalization.)
A further well-studied nonlinear damped oscillator that arises from the analysis of vacuum tubes is the van der Pol oscillator [24], described by the second-order differential equation:
where again denotes a descriptive variable and denotes a damping parameter. When , the Equation (290) collapses into the equation of an harmonic oscillator (288). When , the system will eventually enter a limit cycle. Near the origin of the phase space the system is unstable, while far from the origin the system is heavily damped and hence stable. The single Equation (290) may be recast as the paired equations:
where now two variables, , are used to describe its dynamics and a potential energy function has been introduced to quantify the forcing term, which is defined as , where denotes a constant parameter. The van der Pol oscillator model may be regarded as a special case of the FitzHugh-Nagumo model [31] which, in turn, represents a simplified version of the Hodgkin-Huxley mathematical model that explains in a detailed manner the phenomena of activation and deactivation of spiking neurons. A related example arises from the examination of the driven van der Pol oscillator, described by the differential system:
where and denote two constant parameters. In the present case, the non-linear damping term comprises a quantity (the driving term) that depends sinusoidally from the temporal variable t. The particular form of the damping causes a dehancement of the magnitude of strong oscillations and an amplification of the magnitude of weak oscillations. The original van der Pol oscillator has been generalized in different ways, as in the paper [32] that considers the introduction of fractional derivatives.
Another well-studied nonlinear dynamical system that exhibits a complex behavior is the Duffing oscillator [33]. Duffing’s dynamical system reads:
where and denote constant parameters and the potential energy function is defined according to one of the three known models summarized below:
- Case of a hard Duffing oscillator: In this case, the potential function is defined as ,
- Case of a double-well Duffing oscillator: In this instance, the potential is defined as ,
- Case of a soft Duffing oscillator: In this case, the potential is defined as ,
where and denote constants of the models. The dynamical system (293) describes the motion of a damped oscillator with a more complicated potential than in the simple harmonic oscillator. It is able to model, for example, a spring pendulum whose spring’s stiffness does not exactly obey Hooke’s law. A direct comparison between the Duffing oscillator (293) and the driven van der Pol oscillator (292) reveals that the linear damping term in the Duffing system is replaced by the non-linear damping term in the van der Pol system. A number of generalized Duffing models have been proposed in the scientific literature. As an example, we mention the contribution [34] that describes a generalized double-well Duffing oscillator endowed with non-linear damping described by:
where the constant is termed damping exponent and determines the degree of non-linearity of the damping term. The case corresponds to linear damping in the system (293). A similar analysis was performed in [35] about the universal escape oscillator endowed with the same nonlinear damping term.
Several two-variable oscillator models are known from the literature besides the one examined in details, such as the Lotka-Volterra model of prey-predator interaction and the Wilberforce torsional pendulum. A common factor of all two-dimensional models is that their dynamics is entirely described by two variables, a positional variable x and a velocity variable v, in such a way that the phase space of these systems is formed by all pairs , hence is the Euclidean space . The state space, in particular, coincides with the real line , and the velocity space coincides with the real line . This appears to be a coincidence that does not copy to the manifold case, in which the state manifold and the velocity space are entirely different spaces. Clearly the tangent bundle plays a fundamental role in the description of a dynamical system on manifold, and so does its dual, namely the cotangent bundle .
Dynamical systems involving more than one variable are known in the scientific literature. The best known example is perhaps the Lorenz oscillator [36], which appears as a nonlinear, three-dimensional dynamical system that generates a complex flow. A Lorenz oscillator is described by the mathematical model:
which includes three constant parameters and three variables . For certain values of the three parameters, the Lorenz system exhibits complex behavior. The Lorenz equations were derived from the simplified relationships that govern convection rolls arising in the study of the atmosphere and were applied to achieve climate and weather predictions. A further noteworthy example is brought to our attention by the authors of [37], which studied the emergence of oscillations in a SIR epidemic model.
The Lorenz model (295) is of the first order, but it may be recast as a second-order model by deriving both sides of each equation with respect to the temporal parameter t:
By associating to the Lorenz system the state array , the dynamical system (296) may be cast in the compact form:
The recast Lorenz system takes the form of a set of purely non-linearly damped oscillator on a flat space which may be thought of being endowed with a constant potential energy function.
A general expression for coupled scalar oscillators in that include, as a special instance, the Lorenz model (297), may be drawn from the study [38]:
where denotes the state array, denotes a potential energy function, denotes the Euclidean gradient of the potential energy function, indicates a linear operator in the variable v (possibly nonlinear in the variable x) that represents damping and other internal sinks/sources of energy, while the term represents a forcing term in the fixed direction .
5.4.2. Cotangent Bundle Notation
In the present section we shall denote the inner product of two tangent vectors through a special symbol, namely . The metric ‘flat’ operator, denoted as , turns a tangent vector into a cotangent vector, namely, . Its dual is termed ‘sharp’ operator and is denoted by . The differential of a differentiable function at a point is denoted by . The Riemannian gradient of the function with respect to a metric , evaluated at a point is expressed though the metric operator as . The Christoffel form associated to the metric is denoted as . The Levi-Civita connection associated to this metric is denoted as and the parallel transport operator associated to the same metric is denoted as .
Let us recall that a vector field that depends on a parameter, namely , is positive-definite if for every and every .
Let us further recall that, given two points connectable by a geodesic line , their Riemannian distance is defined by
A fundamental result of the calculus on manifolds states that the Riemannian gradient of a squared distance function reads:
as long as the logarithm is defined. The gradient of the squared distance is known in closed form only if the expression of the inverse exponential map is known explicitly.
5.4.3. Second-Order Dynamical Systems Formulation by Cotangent Bundles
A classical dynamical system model on the ordinary space is represented by Newton’s second law of motion of a particle of mass m subjected to an external force , where denotes a time interval. The trajectory followed by such particle results as the solution of Newton’s equation:
The second-order time-derivative quantifies the instantaneous translational acceleration of the test particle as a function of the absolute time t and the mass constant m quantifies the translational inertia of the particle. The resultant of ‘external’ forces f depends on the instantaneous position of the particle , on its instantaneous translational velocity and may even depend explicitly on the time.
The current section recalls a formulation of second-order dynamical systems on manifold that retraces the above-recalled dynamics of an virtual point-wise particle whose instantaneous position is bound to belong to a Riemannian manifold —endowed with metric —rather than the ordinary space . As in the previous sections, the Hamilton’s stationary-action principle, extended through the d’Alembert virtual work notion, shall be invoked to formulate the dynamics of such virtual particle. For this reason, it pays to revisit the notion notions of kinetic energy, potential energy and external excitation, mainly with the aim of clarifying the customized notation introduced at the beginning of the present section.
The kinetic energy function for a point-wise particle associated with the metric is denoted by and is defined by for . On a Riemannian manifold, the metric tensor is positive-definite, hence, on every trajectory , it holds that . The potential energy function is assumed to depend solely on the positional variable ; whenever any exogenous forcing is lacking, the trajectory follows the landscape of the potential energy function. As opposed to what stated in the previous sections, in the present context an exogenous force at a point appears as a cotangent vector, namely .
On the basis of the above-recalled quantities and notation, the extended stationary-action principle that determines the evolution of the state on a Riemannian manifold endowed with a metric takes the form:
where symbol again denotes variation. The integrand in the leftmost integral represents the classical Lagrangian function of the particle and its integral represents the total action of the particle. The rightmost integral represents the virtual work effected by the resultant of the external driving force and represents an extension of the classical Hamilton’s stationary-action principle that may be traced back to d’Alembert [39]. In the interior of the interval , the variation is arbitrary, while at the boundaries of the trajectory it vanishes to zero. Computing the variation leads to the dynamical system formulation:
By comparing the Equation (303) to Newton’s law (301), it is readily seen that the term represents the covariant acceleration of the particle sliding on the manifold , the operator plays the role of inertia tensor and the term is the total force that generates the motion of the particle. Of particular interest the role played by the inertia tensor that replaces (and extends) the notion of inertial mass.
The covariant acceleration may be written in terms of the Christoffel form as , therefore, the Equation (303) may be rewritten as the system of first-order differential equations:
in the tangent-bundle arrays .
The total energy (or Hamiltonian function) of the particle sliding on the manifold, and hence of the dynamical system (304), is defined by:
Over a trajectory of the system (304), it holds that:
which represents a power law for the dynamical system at hand. In the case that the external forcing is absent, the system (304) is conservative as the total energy keeps constant over the trajectory, otherwise the system is non-conservative and its energy varies along its trajectory.
In the present context, the force essentially represents damping effects and external forcing terms. In particular:
- Friction-type damping: This kind of damping generalizes the Rayleigh damping and is expressed by the forcing term , with being a damping coefficient and being a viscosity coefficient. By computing the derivatives, it is found that the friction-type damping force equals .
- Non-linear damping: It generalizes the nonlinear damping term that appears in the van der Pol system and assumes the expression , with being a vector field that depends on a parameter, such that, for each , is a linear endomorphism.
- Sinusoidal driving force: It generalizes the notion of external sinusoidal forcing term from mono-dimensional dynamical systems. It takes the expression , with and .
With the above assumptions, the dynamical system (304) takes the expression:
The rate of change of the total energy due to the above forcing terms reads:
If , the first term on the right-hand side is purely dissipative and is proportional to the kinetic energy of the system (to , in fact), while the second term on the right-hand side is not necessarily dissipative and may bring energy into the system. (If the vector field is positive-definite, then the term in the system (307) is purely dissipative.)
Let us consider two examples that aim at clarifying the notation used and to show that the considered non-linear oscillators on manifold are indeed general enough to encompass the original classical oscillators.
Example 24.
In the special case that , hence , and , it holds that , , , where symbol denotes the column array . Hence, the dynamical system (307) becomes:
where and . The dynamical system (309) accounts for the exemplary systems (289), (291)–(294) and (298).
To further exemplify the above theoretical development, the present subsection discusses the special case of Riemannian manifolds of dimension 1. The main interest on such kinds of manifolds is analytic tractability. Examples of 1-dimensional manifolds are the unit sphere (namely, the unit circle) and the special orthogonal group .
In order to particularize the general Equation (304) to the case of a 1-dimensional Riemannian manifold, the following preliminary observations are in order. On a Riemannian manifold of dimension 1, the tangent space is a linear space of dimension 1 whose basis vector is denoted by , and the metric tensor collapses into a scalar function , . Given two tangent vectors , their inner product . Consequently, the squared norm equals . Likewise, the cotangent space has dimension 1 and its basis vector is denoted by . Given a cotangent vector , it holds that . Hence, in the present example, the only Christoffel symbol reads and the only non-zero value of the Christoffel form is . Given a potential energy function , the only non-zero component of its gradient equals . The general Equation (304), particularized to a 1-dimensional Riemannian manifold, read:
An interesting special case is the one where the metric is independent of the position, namely , that implies . In this case, the dynamical system (310) particularizes to:
The above dynamical system is a prototype for the classical nonlinear damped oscillators, such as the van der Pol oscillator and the simple pendulum.
The sphere embedded in may be described as the set of arrays of the form , with . Then, it holds that:
where and . Choosing the Euclidean inner product on the tangent spaces , it holds that . Hence, the dynamical equations on endowed with the Euclidean metric are as in (311).
The exponential map corresponding to the above geometric setting may be expressed as:
Given two points , solving the equation in the unknown v gives , hence:
and the geodesic distance between the points x and z is given by:
In the above expressions the symbol ‘arccos’ denotes the inverse cosine function. Parametrizing the point y by and the point z by , the squared geodesic distance may be written as .
5.4.4. Potential Energy Functions
The present subsection retraces classical potential energy functions summarized in Section 5.4.1 and extends such potentials to a general Riemannian manifold.
A well-documented potential energy function is the one arising in the study of the simple pendulum oscillating around the reference angle . In the classical case that , it reads . Such potential energy function may be extended to a general Riemannian manifold , endowed with a Riemannian distance function , as:
with being a constant parameter and denoting a reference point. The potential presents one of its minima in . Rewriting the pendulum-type potential as it is immediate to see that, according to the calculation rule (300), its Riemannian gradient reads:
The potential energy function introduced to state the van der Pol dynamical system in the mono-dimensional state space is a quadratic function centered in the reference point , namely . Such potential energy function may be extended to a general Riemannian manifold by first generalizing the squared distance-to-zero by a squared distance-to-reference and successively by replacing the Euclidean distance by the geodesic distance, namely as
with being a constant parameter and denoting a reference point. The potential presents a single minimum in . Moreover, according to the calculation rule (300), its Riemannian gradient reads:
The contribution [40] studied an extension of the (hard) Duffing oscillator to two dimensions, namely, to the state space . The key point is to extend the original potential function through vector norm as:
where denotes the Euclidean norm, while and are free parameters. The quantity coincides with the Euclidean distance of the state x to the reference point . Such observation paves the way to an extension to a general Riemannian manifold , that is achieved by replacing the Euclidean distance with the Riemannian distance:
where again the quantity denotes the Riemannian distance between two points on the manifold , the constant denotes a free parameter that influences the strength of the conservative forcing term corresponding to the potential, and represents a reference point on the state manifold around which the trajectory of the dynamical system develops. The signs ± may be chosen arbitrarily and their choice give rise to the soft and the double-well Duffing oscillator, as well as to the hard Duffing oscillator discussed in the paper [40]. According to the calculation rule (300), the Riemannian gradient of the extended Duffing potential , in the case of a hard Duffing oscillator, reads:
Likewise, the Keplerian system mentioned in [40] may be extended to a general Riemannian manifold through the introduction of the Riemannian distance , by re-defining the Keplerian potential as:
with denoting parameters of the potential energy function. By rewriting such Keplerian-like potential as and invoking again the calculation rule (300), it is immediate to see that its Riemannian gradient takes the expression:
For exemplification purpose, let us consider two instances of non-linear damped oscillators on matrix manifolds.
Example 25.
As a first example, let us take as state space for non-linear manifold-type oscillators the manifold of symmetric positive-definite matrices, endowed with its canonical metric. If the potential energy function is written in terms of geodesic distance, its Riemannian gradient may be written by invoking the property (300). If the potential energy function is written as an explicit function of the matrix-variable , then the Riemannian gradient, in terms of its Gateaux derivative , may be written as:
By gathering the expressions of the Christoffel operator and of the Riemannian gradient of the potential energy function, the following dynamical system equations are obtained:
In the present case, the function .
As a second example, let us assume a Stiefel manifold is endowed with its canonical metric. In some applications, the potential energy function is written explicitly in terms of the matrix-state (rather than on Riemannian distance). In the principal/minor component analysis case, for instance, the potential energy function reads , with , and its Gateaux derivative reads . By the expressions of the Riemannian gradient of the potential energy function and of the Christoffel form, it is obtained the dynamical system:
where it must hold that .
6. Control Systems on Manifolds and Numerical Implementation
This section presents a survey on error feedback control systems on manifold and their numerical implementation, with special emphasis to system synchronization.
Synchronization in a group of dynamical systems may happen forcibly or spontaneously. Spontaneous synchronization may be observed in synchronous flashing of fireflies, in flocking of fish swarms, in neuronal synchronization in central nervous system as well as in arrays of semiconductor devices. As an example of spontaneous synchronization model, we cite the Cucker-Smale system [41] that insists on the state space and that may be expressed as
where N denotes the number of individuals in the swarm, denotes the ith individual whose state is denoted by , denotes an interaction strength constant and denotes communication weight function which quantifies the degree of communications between each individual.
The classical Cucker-Smale model has been extended to manifolds, and its emergent properties have been studied, in the research paper [42]. The studied extension of the classical model (327) to a Riemannian manifold reads
where the Euclidean distance in was replaced by a Riemannian distance in , the state acceleration was replaced by covariant acceleration and the difference between velocities was replaced by a difference between parallel-transported velocities, which ensures that the trajectories develop entirely on the state manifold .
An application surveyed in this section is the synchronization of the motion of two quadrotor drones (although the achieved results can be extended even to a fleet of drones). In particular, we shall survey a Lie-group control theory to allow the synchronization of the motion of two twin quadrotor drones, whose model is of second-order and insists on a Lie group. A leader drone is flying independently, while a follower should conform its dynamics in such a way that its state syncs with the state of the master. In the present context, we only focus on the rotational component of motion, which engages Lie-group control.
In order to state the quadcopter’s equations of motion, since its state space is a Lie group , let us recall some relevant notation:
- A left translation on a Lie group is denoted by and is defined as . Since for the quadcopter model we deal with a matrix Lie group, we may take , obtaining .
- The Lie algebra associated to a Lie group is a vector space endowed with Lie brackets and an adjoint endomorphism . The pushforward map will be denoted as for brevity.
- Given a smooth function , for a matrix Lie group we can define the fiber derivative of ℓ, , with , as the unique element on the algebra such that for each , where the symbol denotes an inner product on the vector space and the symbol denotes the Gateaux derivative (or Jacobian matrix) of the function ℓ with respect to the matrix-variable .
- On a matrix Lie algebra, it holds that . In order to ease the notation in the following sections, let us define a matrix anti-commutator, which is a symmetric form, namely , no matter what is the structure of the arguments. Conversely, the matrix commutator in the algebra is an anti-symmetric bilinear form, namely .
The present section is organized as follows. Section 6.1 presents a design and an analysis of a Lie-group synchronization theory, which is subsequently applied to synchronizing the motion of quadcopters in Section 6.2. The Section 6.3 treats an important topic, namely synchronization with vanishing control effort. To end with, Section 6.4 describes, in general terms, the notion of feedback error velocity and sheds some light between principal pushforward map and curvature endomorphism by means of homotopic nets.
6.1. Design and Analysis of a Lie-Group Synchronization Theory
The material reported in the present section is largely based on the previous contribution [43].
The Lie group considered for the quadcopter mathematical model is the special orthogonal group whose Lie algebra is . The rotational dynamics of the leader system is governed by the equations
with denoting the leader system’s state-pair, while the rotational dynamics of the controlled follower is governed by the equations
where denotes the follower system’s state-pair and denotes a control field to be designed to make the two twin systems synchronize in time. The function denotes the state-transition operator that ultimately describes the internal nature of the leader-follower systems twins. The mathematical model of the controlled follower system (330) will be further analyzed in Section 6.2.2 to verify the physical feasibility of the designed control strategy in relation to the effort demanded by to the actuators by the control algorithm.
The control algorithm may be designed either to synchronize the two drones’ angular velocities or to synchronize their attitude. Such control goals are examined separately in what follows.
6.1.1. Velocity Synchronization
The aim of angular velocities synchronization is to ensure that
For this purpose, a proportional-type Lie-algebra controller of the following form may be designed:
where denotes the synchronization angular-velocity mismatch and denotes a proportional action gain coefficient. By defining the control field can be rewritten as
The function has the purpose of deleting the dynamics of the follower and to replace it with the dynamics of the leader, hence it realizes a control design principle that we refer to as dynamics replacement. The leader-follower pair systems will result to be velocity-synchronized as .
The convergence of the synchronization error to zero is confirmed by the following result, proven in [44]: The control law (332), with , synchronizes in angular velocity the follower to the leader exponentially fast. This kind of controller does not ensure the full state synchronization for the leader-follower pair.
6.1.2. Attitude Synchronization
The problem of attitude synchronization is more involved than the problem of velocity synchronization discussed above, since a purely proportional control field would not be effective. Attitude synchronization may be achieved by a proportional-derivative-type control. Attitude synchronization is achieved when
where denotes a Riemannian distance in .
The first step in the development of a synchronizing control algorithm is to define an error field that associates to each pair of leader/follower states a mismatch measure. We define an attitude synchronization error measure by the following expression
This error expression is equivalent to a ‘difference’ between the states and , written for the group and belongs to the Lie algebra . In order to compute the time-derivative , it is convenient to make use of the following workaround:
where the symbol ‘’ denotes the matrix exponential. Replacing the expressions of the time-derivatives of the states and by the expressions given in (329) and (330) leads to the following relationship relating the error velocity to the attitudes and the angular velocities of the two constituent subsystems
Deriving once more with respect to the time parameter t, and noticing that , gives
The Lie-algebra control field U is designed so that the error dynamics obeys the law
In order to express the Lie-algebra control field U in an explicit form, it is necessary to equate the expression for found in (339) to the desired expression of the error acceleration prescribed by the relation (340), which leads to
The error system (340) is asymptotically convergent to a zero synchronization error. In fact, under the assumption that and , the function
is Lyapunov for the error system (340). Notice that, since , from the condition , it follows that , with equality only when . Upon derivation with respect to the time parameter, one gets
Since it holds that , from the condition , it follows that with equality only when .
6.2. Application to Quadcopters Synchronization
The material reported in the present section is largely based on the previous contribution [45]. In the present section, we shall apply the above-recalled synchronization theory to mathematical models of quadcopters. For an introductory review, interested readers might want to consult, e.g., [46]. In particular, we are going to retrace mathematical modeling of a quadcopter drone based on the standard approach of Lagrange-Euler-d’Alembert where the exogenous generalized forces are computed on the basis of the specific structure of a quadrotor drone. The main difference between the general case considered earlier in this tutorial paper is that the Lagrange-Euler-d’Alembert principle is applied to a system whose state-space is a Lie group, which leads to the so-called Euler-Poicaré equations.
6.2.1. Lie-Group Model Formulation of a Quadrotor Drone
A quadrotor drone (also called quadcopter) is made of a body and of four rotors , with , termed propellers, directed upwards and placed in a square formation with equal distance from the center. Among the four propellers, rotors 2 and 4 spin clockwise, while rotors 1 and 3 spin counterclockwise [45]. Translation along a horizontal plane may be achieved by tilting the quadrotor body by modulating the rotation speed of every rotor, while a vertical displacement is achieved by modulating the total thrust exerted by the rotors on the drone’s body. It is customary to attach a reference frame to the quadrotor’s body, whose three orthogonal axes are denoted by x, y and z. A rotation of the body around the x axis is termed roll, a rotation around the y axis is termed pitch, while a rotation around the z axis is termed yaw.
The mathematical model of a quadcopter recalled here holds under a number of assumptions, namely: (1) the structure of the quadrotor is rigid and axis-symmetrical, (2) the center of the body-fixed reference-frame axes coincides with the center of mass of the quadrotor’s structure, (3) the propellers are rigid, (4) the thrust and drag forces are proportional to the square of rotors’ angular speed.
A mathematical model of a composite roto-translating body may be written through the well-known formalism of the Lagrange-Euler-d’Alembert principle. For a dynamical system whose state space is a Lie group, it affirms that a dynamical system generates a trajectory such that
where the quantity denotes a Lagrangian function for the dynamical system, the symbol denotes a metric for the state space, the vector field denotes a generalized force field, the quantity denotes the state (or configuration) velocity and the symbol represents a variation across two infinitely-close trajectories.
It pays now to recall the notion of left-invariance with special emphasis on the terms that appear in the expression (344):
- Left-invariance of the kinetic energy: The kinetic energy associated to a trajectory g is defined as . We shall assume the inner product representing the metric to be left-invariant, which may be expressed analytically by the condition thatnamely, the inner product is invariant under left-translation. In practice, this means that the inner product does not depends explicitly on the configuration g and on the velocity , but only on their composition . Hence, the kinetic energy may be written as , where denotes a ‘reduced’ kinetic energy function.
- Left-invariance of the potential energy: There is no notion of left-invariance for the potential function, since it does not depend on the velocity . We shall henceforth assume (and we shall see that it is, in fact, the case) that the potential energy function for the system is simply constant with respect to the configuration g.
- Left-invariance of the generalized force field: In general, the product depends explicitly on the quantities g and . Under the assumption that the inner product is left-invariant, we may however write thatTo this, we add the assumption that the force field F be left-invariant, namely that , where denotes a ‘reduced force field’.
In a left-invariant setting, the principle (344) can be simplified as shown in [45] to
where represents a reduced Lagrangian function.
Now, the following result is instrumental in order to cast the integral principle (347) in a differential form [47]: Let denote a continuous family of trajectories indexed by the variable s and define and , where the variation δ refers to the variable s. The solution of the integral Euler-Lagrange-d’Alembert Equation (344) under perturbations of the form satisfies the Euler-Poincaré equation
where the superscript denotes a dual operator.
Thanks to such fundamental result, the equations that describe the motion of a dynamical system on a state Lie group cast as
The exogenous forcing f term accounts for several contributions to the dynamics, such as energy dissipation due to friction and energy-injecting active control fields.
The Euler-Poincaré Equation (348) describes a dynamical system subjected to holonomic constraints only, subsumed by the underlying Lie group. A system subjected to non-holonomic constrains (such as a sphere rolling on a plane without slipping), may be described by a modified Euler-Poincaré equation that takes a substantially more convoluted expression [48]. The point is that the set of velocities attainable by the system is restricted to an area of the tangent bundle termed distribution, which makes the equations of motion more convoluted.
The rotational component of motion of a rigid body, such as a drone, takes as descriptor an element in the Lie group and in its associated Lie algebra . Let us now define a basis of the vector space as follows:
Furthermore, by taking the canonical metric , it is readily shown that the fiber derivative of the reduced Lagrangian reads
where denotes the identity element in the Lie group and symbol denotes, once more, a Gateaux derivative.
A quadcopter model includes two specific constants, namely its total mass and its non-standard tensor of inertia, defined as
where and denote respectively the body mass of the quadcopter and its non-standard tensor of inertia, while and , with , denote respectively the mass of the rotors and their non-standard tensors of inertia. Let us recall that the non-standard tensor of inertia of a rigid body relates to its standard tensor of inertia J by the relationship (we mention, with apologies, that the relation reported in [45] was incorrect of a factor ). It follows that the quadcopter’s non-standard tensor of inertia can be deduced from the standard tensor of inertia, which is assumed to take the form of a diagonal matrix , as follows:
Let us proceed on recalling the equations that describe the rotational component of motion. The angular velocities of the propellers will hereafter be denoted as . In order to define the equations for the rotational component of motion for a quadcopter, we shall assume that all rotors are characterized by the same coefficient of rotational inertia and that such rotors develop an angular velocity around the z axis , where the sign-factors are meant to take into account that the rotors and spin counterclockwise, while rotors and spin clockwise.
Each rotor exerts a thrust, denoted as , on the body of the quadcopter, whose magnitude is taken to be proportional to the square of the corresponding angular velocity via a lift constant , namely
(The factor has been introduced for convenience). The thrust produced by each rotor also exerts a mechanical torque on the quadcopter’s body via a moment arm, denoted as , given by
where denotes the distance between each propellers’ centers and the center of mass of the quadcopter body, supposed to be the same for each propeller by symmetry.
The body of the quadcopter is subjected to further mechanical torques due to the rotation of each propeller. In fact, by virtue of the principle of angular momentum conservation, the spinning of each propeller produces a counter-torque that acts on the whole body of the quadcopter and tends to make it rotate along its z axis in the opposite direction of the resultant of the torques produced by the propellers. These torques are termed drag torques and their magnitude is proportional to the square of their angular velocity via a drag constant . The resulting drag torques take the expression
(Again the factor has been introduced solely for convenience).
The resultant mechanical torque acting on the body of the quadcopter hence takes the expression
By splitting such total mechanical torque with respect to the basis (350), namely as , one finds the components of the torques along the axes of the body-fixed reference system
The components and of the resultant torque, which quantify the degree of rolling and pitching, are controlled by the unbalance of the spinning velocity of two rotors (in particular, notice that the torque along the x axis is produced by the propellers located along the y axis and vice-versa). The component of the total torque, responsible of the yawing of the quadrotor body, is produced by the unbalance between the clockwise and counterclockwise spins. Yawing is generally an unwanted effect, unless the torque is so tiny to produce a slow yawing, therefore such component is carefully kept near zero by the onboard control system.
Let us now denote by the attitude of a quadcopter and by the angular speed of this quadcopter, both thought as functions of time t. The time-evolution of the state pair is described by the following systems of tangent-bundle differential equations:
where denotes the Poisson bracket and denotes the Lie bracket (or simply commutator, in this case) of two matrices . To confirm the above result notice that, since on the Lie algebra it holds that and on the group it is verified that , the system of Euler-Poincaré Equation (349) may be rewritten as
where denotes the resultant of all external mechanical torques. In the present case, the reduced Lagrangian of the quadrotor coincides with its kinetic energy, since there is no potential associated with the rotational component of motion. The fiber derivative of the reduced Lagrangian reads
where we have defined, for convenience,
The resultant of the external torques acting on the body of a dquadcopter drone is expressed by the relationship (358).
Whenever a drone is neither taking off nor landing, it is customary to assume the residual rotoric angular velocity to be constant. Therefore, the mathematical model (359) of the rotational motion of a quadcopter may be written in the following simplified form:
In order to describe the translational motion of a quadrotor, let us denote by the coordinate of the center of mass of the drone expressed in the inertial reference frame. The translational component of motion in a quadcopter obeys Newton’s second law of dynamics written in an inertial reference frame. The linear acceleration is proportional to the total thrust , to the gravity acceleration g and to an aerodynamic drag term. Since all the rotor thrusts are directed along the direction, the Newton law governing the translational motion can be written as:
where denotes a positive-definite (and often diagonal) aerodynamic drag tensor. The first term on the right-hand side of the above equation tells that whenever a quadcopter is horizontal to the ground (namely ), its center of mass can only move vertically, while whenever its body is tilted away (namely ) its center of mass can even move horizontally.
In order to write the set of equations that drive the drone’s flight, it is necessary to express explicitly the Euler-Poincaré equation to cast it into the form , where denotes the state-transition function associated to this particular dynamical system. To this aim, it is useful to define a new inertia matrix
Straightforward calculations show that and that , from which it follows that the inertia matrix D is invertible. Therefore, the Euler-Poincaré equation may be rewritten explicitly as
6.2.2. Physical Realizability of the Synchronizing Controller
The follower drone described by Equation (330) differs from the free leader drone by the application of a control field, which must be physically realizable, namely the control field U prescribed by the control algorithm should be effectively realizable by the propellers. A quadcopter is solely operated by changing its rotors’ angular velocities, therefore it is natural to require that the values assumed by the Lie-algebra control field U must be such that it can effectively be translated into achievable angular velocities.
As a first step in a realizability analysis, it is necessary to discern, from the Equation (363) that governs the evolution of the angular velocities of the follower, those terms that represent externally-controllable torques from the terms that represent unoperable torques. Such classification follows from the physical meaning of the mathematical terms and may be written as
where it is assumed that the residual rotor speed is constant, hence the terms in may be neglected. By defining and by equating the control field U to the operable component , the mathematical system (330) can be recast as
In order to compute the angular velocities of the four propelling rotors corresponding to the control field U, it is possible to derive three conditions from the Equation (357), namely:
where , e are scalar control fields defined by . The system (370) is under-determined, since it totals three equations in four unknowns, therefore a fourth constraint needs to be added by considering the follower in a hovering condition. The added constraint is:
The above system of equations is linear in the unknowns , , and and admits as solutions the angular velocities
The solutions (372) are real-valued, hence physically realizable, in principle, whenever the following two conditions are met:
- the values taken by the scalar control component , and do not differ too much from one another and are absolutely bounded,
- the control field components are small compared to the mass-related term .
In practice, the spinning velocity of the propellers differs little from the spinning velocity corresponding to a hovering condition, termed steady-state velocity. This is defined as the spinning velocity that keeps a quadrotor still in mid-air. From the Equation (371), it is not hard to determine such propellers’ spinning velocity as:
In order to exemplify the values of the parameters mentioned in this section, an OS4 Mini-VTOL (Vertical Take-Off and Landing) drone has been taken as reference model, as reported in [49]. The numerical values of the parameters corresponding to such quadrotor model are shown in the Table 1.
Table 1.
Table recapitulating typical parameter values of a OS4 Mini-VTOL quadcopter.
The value of the gravitational acceleration constant g was approximated at . The calculated steady-state rotor velocity corresponding to such set of parameters is rad/s (roughly corresponding to 3046 revolutions per minute (RPM), quite a fast spinning). Numerical simulations show that even a fractional deviation of one RPM with respect to the steady-state velocity produces a dramatic impact on the motion of a quadrotor.
6.2.3. Numerical Recipes to Implement Synchronization of Quadrotors
We shall now illustrates a numerical technique to implement the recalled mathematical model as well as the synchronization algorithm on a computing platform.
A pseudo-code corresponding to the numerical solution of the differential equations describing the rotational component of the motion of a single drone is illustrated in the Algorithm 1. The invoked numerical integration method represents a counterpart of the well-known Euler method adapted to manifolds.
| Algorithm 1 Pseudo-code to implement the (uncontrolled) mathematical model of a quadrotor on a computing platform |
| Initialize the attitude and the rotation velocity |
| Set the step-size h and the number of numerical steps K to complete a simulation |
| for to do |
| Update the attitude |
| Update the angular velocity |
| end for |
A pseudo-code corresponding to the numerical counterpart of the differential equations describing the rotational component of the motion of a leader-follower pair of drones along with the application of a velocity-synchronization control field is illustrated in the Algorithm 2.
| Algorithm 2 Pseudo-code to implement numerically a leader-follower pair and velocity synchronization by a proportional-type control algorithm |
| Initialize the attitudes , and the rotation velocities , |
| Set the step-size h, the number of numerical steps K and the proportional control gain |
| for to do |
| Update |
| Update |
| Evaluate the control field |
| Update |
| Update |
| end for |
Moreover, a pseudo-code corresponding to the numerical implementation of attitude synchronization by a proportional-derivative-type controller is illustrated in the Algorithm 3.
| Algorithm 3 Pseudo-code to implement numerically a leader-follower pair and an attitude synchronization control by a proportioanal-derivative-type controller |
| Initialize the attitudes , and the rotation velocities , |
| Set the discrete step-size h, the number of numerical steps K, the proportional control |
| gain and the derivative control gain |
| for to do |
| Update |
| Update |
| Evaluate the synchronization error |
| Evaluate the synchronization error velocity |
| Evaluate the control field (from Equation (341)) |
| Update |
| Update |
| end for |
If the leader quadrotor is subjected to a difference in its rotor velocities that is too large (the maximum value of the difference for which the controller is effective depends on the value of the gains set), the control algorithm may loose effectiveness might hence be no longer able to synchronize the two subsystems. Such observation is in accordance with the practice: as mentioned, a quadrotor usually keeps a low difference among its propellers spinning velocities in order to be able to trace a stable trajectory.
6.3. Synchronization of Second-Order Systems on Manifolds with Vanishing Control Effort
In the previous section we surveyed synchronization of two dynamical systems on Lie groups without regard to control effort, which quantifies the effort demanded to the actuators by the control algorithm as well as their energy consumption. In the present section we are going to recall from [50] a control theory that ensures the control effort to vanish asymptotically as synchronization takes place.
In the present section we are studying a category of dynamical systems on a manifold expressed by
where denotes a system’s state-transition function and denotes the covariant acceleration of the state. The first equation denotes a state-transition law, in the manifold-valued variable x, while the second equation denotes a velocity-transition law in the vector field v. Since the pair is an element of the tangent bundle, the tangent bundle represents a phase-space for the system (374).
Given a system trajectory and a control vector field , one may introduce a control effort index associated to the vector control field u as the scalar field defined by
The scalar field is defined as the squared amplitude of the control field and hence quantifies the ‘effort’ demanded by the control algorithm to the actuators [51].
An undesirable effect of the control schemes discussed in the previous sections is that the control effort does not necessarily vanish after synchronization is achieved, which entails unnecessary consumption of energy (and may therefor cause premature discharge of battery). In the present section, we shall describe a control scheme, valid for a generic Riemannian manifold, capable of making the control effort vanish to zero while synchronization is being achieved. By denoting as the state of the follower system, one such control scheme reads
where denotes an effort-limitation gain (not to be confused with the proportional control gain) and denotes the follower state-transition function. Notice that the follower system is described by a couple of first-order differential equations on the tangent bundle through the paired variables and that the control field u is present not only in the equation governing the evolution of the velocity but even in the one governing the evolution of the state, as opposed to the previous synchronization scheme described by Equation (330). It is hence legitimate to wonder as to whether one such system of equations represents a physical object. To answer such question, let us recast the above two equations as a single, second-order, equation by applying the covariant differentiation operator to both sides of the first equation, which gives
The controlled system hence is described by the equation
that represents a physical (second-order or double-integrator) dynamical system controlled by an effective control field that appears to depend on a primary control field u. As an instance, if the primary control u is proportional to an error feedback and the state-transition function is independent of the velocity field , then the effective control field is of proportional-derivative type.
It is worth emphasizing that the constant k plays a fundamental role in ensuring the asymptotic vanishing to zero of the control efforts. Before getting to the main result of this section, it is surely instructive to take a look at a couple of simpler cases.
Example 26.
Assume that and that the function f is continuous in the first argument and Lipschitz-continuous in the second argument, with Lipschitz constant . In this case, we shall consider the control system
where is a constant (that determines the synchronization speed). The control field ‘u’ makes the follower synchronize to the leader exponentially fast. Moreover, a main finding is that, provided and , the control effort asymptotically vanishes to zero.
In order to show that the above control scheme is synchronizing, let us adopt again a Lyapunov-function perspective and define
It is not hard to show that is indeed a Lyapunov function for the controller-follower subsystem, in fact:
The last equation implies that , where denotes the Euclidean vector norm and . Therefore, the inter-state distance tends asymptotically to zero with an exponential speed (namely, quickly at the beginning and slowly while approaching synchronization).
In order to ascertain the dynamics of the control effort, it pays to rewrite the expression of the control field in two equally meaningful ways, namely
The above two equations imply, respectively, that
The dynamics of the control effort is hence found to be governed by the following relation:
with initial condition and with . Since the function f was assumed to be continuous with respect to its x-argument and Lipschitz-continuous with respect to its v-argument, for all constants , vectors and , there exist two constants such that, for every such that , it results that . By taking profit of such inequality and of the Cauchy-Schwarz inequality about the inner product of two vectors, the right-hand terms in the differential Equation (386) may be approximated to give the following differential inequation:
For the sake of notation conciseness, it is convenient to define the following two quantities
in such a way that the above differential inequation may be rewritten compactly as
In view of applying a Grönwall argument [52], it is convenient to study the associated differential equation which, upon the change of variable , may be cast as
This is a first-order, inhomogeneous linear differential equation with initial condition whose closed-form solution reads
where depends from the initial condition and is given by
Note that, by hypothesis, and , hence the above quantities are well-written. Therefore, by the Grönwall’s inequality the solution of the differential inequation (390) satisfies
The first and the second term within the square brackets converge to zero. Since the quantity ε is arbitrary, it may be assumed as small as desired, from which the assertion follows.
Since the two exponential functions within the square brackets peter out with different speed, the vanishing speed of the control effort to zero is dominated by the value .
The second simpler result examines the case that is a general manifold but under the assumption that the leader subsystem and the follower subsystem are perfectly synchronized.
Example 27.
Consider the coupled dynamical systems
evolving on a Riemannian manifold and assume that the velocity-transition function f is Lipschitz continuous in the second argument, uniformly in the first argument, with Lipschitz constant . The state manifold is supposed to be equipped with a Levi-Civita connection ∇. At perfect synchronization, the control scheme (395) with makes the control effort vanish to zero at least exponentially fast.
In order to justify such statement, assume that the leader subsystem and the follower subsystem are perfectly synchronized, namely, that , . Notice that, even at perfect synchronization, one must assume that , in general, although the pair belongs to the same tangent space . At perfect synchronization, the control vector field simplifies to , therefore it holds that
By the hypothesis made, the function f satisfies the Lipschitz inequality in the space
with the constant being independent from the state x; therefore, from the relation (396), by applying the Cauchy-Schwarz inequality for the inner product and by recalling the above Lipschitz continuity condition, we may obtain the following differential inequation:
with initial condition and where it is understood that . Set as effective decay constant. By the Grönwall’s inequality, the above relation implies that the actual control effort magnitude is always bounded from above by the exponential curve , hence the control effort vanishes to zero at least exponentially fast.
In the equations that describe second-order dynamical systems discussed in the paper [30], the velocity-transition function appears as the summation of a gradient term (that depends only on the system’s state) and of a number of non-linear damping terms that depend on the system’s state velocity and on the system’s state itself. We should underline that the assumption that the velocity-transition function appears Lipschitz in its whole domain is rather strict, although in some circumstances such requirement is indeed met, for instance:
- in undamped systems, the function f depends only on the system’s state, therefore, the Lipschitz continuity condition is automatically verified and the differential inequation (398) becomes a differential equation, hence the control effort magnitude peters out as fast as , as long as .
- in linearly damped systems, the transition function is of the type + a term that depends on the state ‘x’ only. In this case, it is trivial to verify that the function f is globally -Lipschitz with .
In a number of applications, the design of the leader-follower systems pair is up to the user and does not have to obey any particular physical constraints (this is the case, for example, in secure transmission of information by signal masking). Therefore, linear damping may be safely assumed (which includes the case of an undamped oscillators wherever ).
On the basis of the above considerations, in the context of systems on manifolds it seems reasonable to prescribe that the momentum-transition function satisfies a mixed continuity condition in the first argument and Lipschitz continuity in the second argument. To express such requirement more precisely, let us define the following auxiliary function
where , and . Note that the following identities hold true
for every . We shall say that the function (399) is uniformly continuous in the argument y and Lipschitz continuous in the argument v if for every , for every and for every there exist a and a such that, for each pair with , it holds that
where denotes a neighborhood of the point x. Such condition, ultimately, ensures that
for every , and , at every point .
The above-defined mixed continuity condition is discussed in the following example concerning a specific instance of momentum-transition function.
Example 28.
Let us consider the transition function defined by
with S being a symmetric, positive-definite matrix and μ denoting a real-valued parameter. It is straightforward to verify that represents a tangent vector field in , in fact . The above function is continuous in the variable x, while it is obviously Lipschitz-continuous in the variable v.
In the following, the effects of the control term in the dynamical system (376) on the control effort magnitude in the case of a full dynamics will be analyzed in details. In order to state the main findings, it is convenient to present and comment some preliminary results.
A first instrumental result concerns the computation of the covariant derivative of a vector field obtained from the logarithmic map applied to a pair of (nearby) curves. Let and denote two given smooth curves on the Riemannian manifold endowed with the Levi-Civita connection ∇. Consider the parametrized smooth vector field on given by , with . The covariant derivative of the vector field along the velocity field associated to the curve x may be proven to take the expression
where and denote the partial pushforward maps of the exponential map computed with respect to its positional argument x and its tangential argument v, respectively. In order to explain such result, let a function be defined as , with and . Let us denote the pushforward map of the function q with respect to the first argument by and the pushforward map of q with respect to the second argument by . Since the function q is a local diffeomorphism in around , in a neighborhood of zero the linear map is full-rank, which ensure that there exists its inverse . In addition, notice that the identity holds. Differentiating both sides with respect to the parameter t gives , therefore, by inversion, the result relationship (405) is immediately obtained.
In order to get the reader acquainted with the notion of partial pushforwards of the exponential map, detailed computations in the case of a unit hyper-sphere are carried out in the following example.
Example 29.
Let us recall that the exponential map for the unit hypersphere corresponding to a Euclidean metric is given by
where denotes the standard 2-norm and ‘’ denotes the cardinal sine function. Its partial pushforward maps may be represented as linear operators mapping to itself, that is to say, by square matrices.
The computation of the sought tangent maps may be worked out by defining the curve
where , , and . Taking the derivative of both sides of the expression (407) with respect to the parameter t yields
where denotes the naïve derivative of the vector field and denotes the derivative of the cardinal sine function.
Recall, from Equation (115), that , therefore the above expression may be recast as
because . A comparison of the above expression with leads to a matrix-type representation of the sought partial pushforward maps as
where the symbol denotes a matrix representative of a linear operator.
In correspondence of the null vector , the pushforward map equals the identity and in a neighborhood of zero such pushforward map is invertible. For , since the matrix is written as the sum of two terms, one may utilize Woodbury’s matrix inversion lemma [53] to compute such inverse pushforward map. Notice that an application of the matrix inversion lemma gives . Applying such lemma a second time yields the sought result
Notice that a singularity occurs in correspondence of , where it holds that , whose rank is 2. A second singularity occurs when , where , whose rank is 1. Because of rank deficiency, in correspondence of these balls in such partial pushforward map is singular.
The inverse map plays a central role in the computation of the covariant derivative of the function . Likewise, the map will play a central role in the analysis of the control effort dynamics. Notice that whenever , the partial pushforward map coincides to the identity , while the map coincides to the identity . We are going to assume some sort of continuity of the dicussed partial pushforward maps with respect to their positional and vectorial arguments. In order to make such assumptions precise, let us define two further auxiliary maps that play a role in the computation of the covariant derivative of the logarithmic-map-based vector field, namely:
where and . We chose the above notation to underline that the functions and exhibit an asymmetric dependence from the variables x and y; in addition, one such notation makes it easy to remember that, for every given pair , the operators and are linear endomorphisms of . The following identities are immediate to verify:
In the following we shall assume the function defined in (413) to be continuous with respect to the variable y, namely that for every , for every and for every , there exists a constant such that, for any that satisfies , it holds that , where denotes a neighborhood of the point . Such assumption tells that the linear map , thought of as a function of y, behaves like an approximate identity when x gets close to y. Note that, because of the linearity in the argument r, it is not necessary to make any restriction on its value. The continuity assumption carries over an useful consequence. By virtue of the triangular inequality, it holds that , therefore the assumption implies that
Likewise, the map will be assumed to be continuous with respect to the variable y, in a neighborhood of a point x for every value . That is to say, for every point , every , and every scalar there exists a scalar such that, for any with , it holds that .
By virtue of the triangular inequality of the norm, taking any tangent vector , we may write that . As a consequence, the assumption implies that
The implications (418) and (419), together with the identity (417), tell that irrespective of how the positive values and are chosen small, there certainly exists a point such that
where . The inequality (420) appears as a sort of mixed continuity/Lipschitz-continuity condition on the composition of the partial pushforward maps associated to the exponential map.
Let us evaluate the continuity of the expressions and for the unit-hyperpshere, as an instructive case of study, in the following example.
Example 30.
The expression of , when , may be obtained by composing the expression (412) and the expression of the parallel transport on the unit hyper-sphere, that we recall to take the expression
We shall avoid writing the expression of the auxiliary function μ explicitly because it is quite cumbersome and not particularly illuminating. It suffices to note that, as we have seen in the Example 29, the map (412) is continuous as long as , while the map (421) is continuous as along as . Therefore, the function is continuous in . Likewise, the expression of may be obtained by composing the expression (410) and the expression (421), to give:
Such function is continuous in . Gluing the two above results, one gets .
Observation 2.
The inverse map has been studied in details in the unpublished contribution [54]. (In particular, to make the notation uniform, the map corresponds to the map in [54]). The contribution [54] shows that this map may be written explicitly as a Taylor series of higher-order covariant derivatives of the Riemannian curvature endomorphism.
As a further instrumental result in the present analysis, we may express the covariant derivative of the parallel transport operator on a manifold endowed with a Levi-Civita connection ∇. Consider the parametrized smooth vector field and define the new parametrized vector field . The covariant derivative of such vector field along the velocity field of the curve takes the expression
where for every and , and for every and . In addition, the ‘diagonal’ part of the linear map is identically zero, namely , for every point and tangent vector .
The above statements may be motivated as follows. Let be defined as for any two points x and y in and for any tangent vector . Note, in particular, that the function h is linear in the third argument. Now, set and note that v is a vector field in , namely . By definition of covariant derivative along a curve, we have that
The covariant derivative may thus be expressed as
where , and . Since the notation might lead to some confusion, let us specify that the symbol denotes the pushforward map of the function h with respect to its first argument, while the symbol denotes the pushforward map of the function h evaluated with respect to its second argument. Such partial pushforward maps are defined as
Moreover, the notation indicates that the pushforward map, that is a function of three variables (besides of its argument), is evaluated at a point where the first variable is instantiated to the value x, the second variable is instantiated to the value y and the third variable is instantiated to the value w.
Now, since , the identity holds. Applying to both sides the covariant derivative ∇ yields the identity
Notice that is an operator that maps to itself, while the operator maps to . Since the velocity vector fields and are arbitrary, the above identity, rewritten as
yields two independent operator identities, one of which is , namely
By using the Equation (425) and the definition of the function h leads to the sought result.
The statement about the diagonal part of the tangent map follows from the definitions (426), in fact
no matter how the point and the tangent vector are chosen on the tangent bundle. But since is arbitrary, the statement follows.
The following example clarifies the above findings.
Example 31.
Consider the parallel translation map for the unit hyper-sphere corresponding to a Euclidean metric. By applying the definitions of partial pushforward maps in terms of time-derivatives, it is not hard to compute the maps and . Let us initiate by recalling that, in the present example,
(Notice that this instance of parallel transport is from y to x! In fact, ). The naïve derivative of such instance of the function h reads
By making use of the expression of the covariant derivative of the vector fields and w given in (115), the relation (425) may be recast as:
Upon splitting the terms in and , after some tedious but not hard calculations, we obtain a matrix representation of the sought partial pushforward maps given by
It is not hard to verify that by using the above matrix representations. In addition, some lengthy–yet straightforward–calculations allow one to verify the identity (429), that may be rewritten explicitly as
Such identity establishes that the partial pushforward map may be written in terms of the partial pushforward map , which justifies why the latter is termed principal pushforward map (of the parallel transport operator). It is also straightforward to verify directly that for the diagonal part of the partial pushforward map .
Some remarks on the partial pushforward maps of the parallel transport operator are in order. The map is linear in the variable w (as well as in its argument), therefore, the only way it could possibly be bounded over the whole domain is that it is identically zero. Moreover, since the linear map gets close to the null map when the points x and y come close to one another, it is natural to express the continuity of the term by requiring that the function is continuous within its domain of definition if, for every , there exists a scalar such that, for any pair that satisfies the inequality , it results that
for every choice of the tangent vectors and .
We have by now set all the analytic tools that are indeed necessary to justify the main result of the present section. We shall in fact show that the control scheme (376) ensures that the control effort vanishes to zero under conditions of continuity.
Let us assume that the parallel-transport operator and the manifold exponential map are locally continuous together with their tangent maps in the tangent bundle . Let us further assume that the velocity-transition function is continuous in its first argument and Lipschitz continuous in its second argument with Lipschitz constant . If the constant k meets the conditions and , then the control effort presents a convergence to zero that results to be almost exponentially fast.
In order to justify such statement, it is convenient for easier tractability to define a dual control field as
It is worth noticing that, since , it holds that
Moreover, since the manifold was endowed with a metric connection ∇, the control effort possesses a time-dynamics described by the covariant derivative of the dual control field through the expression
The covariant derivative of the dual control field, in turn, consists of three terms:
According to the expression (405), the covariant derivative of the logarithmic map applied to the trajectories of the leader and of the follower subsystems reads
In addition, according to the expression (423), the covariant derivative of the parallel-transport-related field reads
Putting all those terms together in a single expression gives, for the covariant derivative of the dual control field along the trajectory of the leader subsystem, that reads
It is convenient to recall from the very definition of dual control field that
from which, by the triangular property of the norm, it follows directly that
Let us also recall that the distance takes on the expression , where .
For the inner product in Equation (441), it holds that
Let us examine the four terms in the right-hand side of the above lengthy expression:
- the term clearly equals because of the identity (440);
- the term may be majorized by means of the Cauchy-Schwarz inequality, by the assumption that the function f is continuous in the first argument and Lipschitz-continuous in the second argument, and by the properties (447). Namely, , such that with , it holds that
- the further scalar component may be majorized by invoking the Cauchy-Schwarz inequality and the continuity of the parallel transport and of its tangent maps. Namely, , such that with , it holds that
- to end with, the term may be majorized by the help of the Cauchy-Schwarz inequality, by the assumptions on the continuity of the pushforward maps of the exponential map and by the inequalities (447), namely, , such that for which then
6.4. Feedback Error Velocity, Principal Pushforward Map and Relation to Curvature
The cornerstone of control theory is error feedback control, based on the definition of a suitable error figure between the actual state of a system to control and a desired state, to be fed to a control algorithm. The control algorithm, in turn, will produce the appropriate commands to the system aimed at reducing the value of the error, hence forming a closed loop. Denoting by e such control error, it pays to study the evolution law of the error, often determined by a differential equation in the terms and , in order to make sure that effectively the error converges to zero (asymptotically). Such differential equation describes an error system, as it was exemplified in Section 6.1. The above-mentioned time-derivatives of the error also intervene in designing the control algorithm itself, which in industrial applications is often chosen to include proportional, integral and derivative (PID) terms. It is, in fact, known that each of these term may contribute in shaping the error system and in compensating each others’ drawbacks in order to attain a desired dynamics for the error field e.
When considering two dynamical systems whose common state space is a curved Riemannian manifold , whose states are described by the manifold-valued variables (where let us say that the reference system has state-variable x), there is not a straightforward definition of error. According to the prevailing literature (see, e.g., [55,56]), and on the present author’s personal experience, the error term should be defined as a tangent field along the trajectory of one of the systems as, for example, . A problem with this definition often comes when formulating the derivative and integral control terms. What is most relevant to the present discussion is the computation of a derivative term which, for consistency, cannot be the naïve derivative of the error but needs to be, for example, the covariant derivative . Such calculation was already detailed in Section 6.3 and we may recall from Equation (405) that
In practice, such expression, which depends on the partial pushforwards of the exponential map, is deemed to be overly convoluted to employ and hence is replaced by a simpler expression that retains the meaning of error velocity and the property of being a tangent vector field yet being much simpler to utilize, namely
It is interesting to recall that such an error term appears in the Cucker-Smale model on manifold recalled in the introductory discussion of the present Section.
Now, the dynamics of the error depends on the (covariant) time-derivative of the velocity of the error, namely on . The second term coincides with the state-acceleration of one of the systems, while the term needs to be studied aside. The present section is devoted to the analysis of the quantity . The analysis carried out in the present section elucidates some interesting algebraic properties of the principal pushforward map associated to the parallel transport operator and the connection between such principal pushforward map and the curvature endomorphism of the state manifold.
The scientific content recalled in the present section is largely based on the earlier contribution [18].
6.4.1. Algebraic Properties of the Principal Pushforward Map of Parallel Transport
We have already shown in Section 6.3 and, in particular, by Equation (423) that, given two smooth curves and in and a smooth vector field , the covariant derivative of the vector field along the velocity field of the curve reads
where for any . Such differential map is termed principal pushforward map since the covariant derivative may be expressed entirely on the basis of such map without any need to refer to the further partial pushforward map .
One might notice that the covariant derivative of the transported field equals the sum of an ‘obvious’ term, given by the transported covariant derivative , and of further terms purely related to the dynamics of parallel transport and, hence, on the curvature of the state manifold . (Such terms appear to be somehow reminiscent of the kinematic corrections due to a change in reference systems in physics.)
The principal pushforward map enjoys some interesting algebraic properties. Since these are not directly instrumental to the present analysis, let us recall them under the form of an example.
Example 32.
Denoting by the adjoint (or dual) of the operator with respect to the Riemannian metric of , it holds that
for every and . In order to explain such statement, let us recall that, since the parallel transport is an isometry, the vector fields and enjoy the property
Taking the derivative of both sides with respect to the parameter t yields the equation
By way of the isometry property of parallel transport, it is immediate to see that the last term on the left-hand side equals the term on the right-hand side, therefore the above property is equivalent to
It pays to recall that, given a linear operator on a vector space endowed with a positive-definite inner product , its adjoint with respect to the metric satisfies the relation for every . The left-hand side of the Equation (459) may thus be rewritten as . Since x, y, and are arbitrary, the Equation (459) may hold only if the identity (456) holds.
Let us further recall the following two properties of linear operators on vector spaces:
- Equivalence of matrix representations: Given a linear operator on a vector space , two of its matrix representations and are equivalent on the vector space if for any , although the matrices and may appear different.
- Representation of the adjoint: If is a matrix representation of a linear operator on a vector space with respect to an orthonormal basis of , then the matrix representation of its adjoint, namely coincides with the transpose (this is a reason why the adjoint is sometimes also referred to as transpose operator).
It is immediate to recognize that the matrix-representation (435) of the operator is not unique. For example, if we add to the matrix the term , we get an equivalent representation of the map given by:
A further equivalent representation is
that serves to exemplify the property (456). In fact, in the present context, it holds that and it is not hard to verify that , no matter how one chooses the points , and the tangent vector v in .
As a further property of the principal pushforward map, more related to system dynamics, let us consider two trajectories and that intersect at a point . Since , when the vector fields and align to one another at p, namely . On the other hand, the two curves are arbitrary and we assume that . An interesting finding is that at the point p, . In fact, at any points and for every , it holds that
Setting gives
namely
The right-hand side is identically zero because it is an element of the diagonal of the pushforward map, whence the conclusion follows.
The above statement, written explicitly for the unit hyper-sphere, leads indeed to non-trivial constraints on the naïve derivatives . By using the Equation (115) for the covariant derivative, the above property reads
Since , the above equation may be rewritten, for example, as
Whenever , it happens trivially that , otherwise, the two naïve derivatives differ by the amount along the radial direction p.
To conclude this review of the abstract properties of the principal pushforward map, let us suppose that one of the two trajectories and is constant, for example, suppose that
in such a way that its image reduces to a point-set . As a consequence, we have that and therefore and . Nevertheless, the vector field changes over time and its covariant derivative along the velocity field is still given by the relation (455) that, by the hypothesis made, reads:
The above relation supplies a meaning to the principal pushforward map as the covariant derivative of the transported ‘constant’ vector field .
6.4.2. Relationship between the Principal Pushforward Map and the Curvature Endomorphism
Parallel transport does not behave trivially under composition. For instance, given three points not aligned along the same geodesic arc, in general it happens that . Such fundamental observations leads to the holonomy theory which is closely related to the theory of curvature manifold by the Ambrose–Singer theorem [57]. The aim of the current section is to explore and exemplify such relationship with some details.
To start with, let us recall that to any point is associated an algebraic structure termed holonomy group formed by the set of linear operators that realize a parallel transport along any piecewise smooth closed loops originating and ending at the base-point x. An instance of piecewise smooth closed loop easy to visualize is a geodesic triangle determined by three non-aligned points and by the three geodesic arcs that connect them. Then, given a tangent vector , one may start by transporting the vector from the point y to the point z along the geodesic arc originating at the point y and ending in the point z by means of the parallel transport operator , then one may transport the resulting tangent vector from the point z to the point x along the geodesic line connecting such points by means of the parallel transport operator , and then one may transport the resulting tangent vector from the point x back to the point y along the geodesic line connecting such pair of points by means of the parallel transport operator . The resulting tangent vector, that we denote as
will, in general, differ from the vector that one started with, because of the curvature of the manifold. The compound linear map is an element of the holonomy group, namely, . Notice that, since each element of the parallel transport chain in the definition of the operator realizes an isometry, the map represents an isometry as well, therefore .
An interesting relationship between the notion of principal pushforward map and that of manifold holonomy descends directly from the definition (426). In fact, such definition may be rewritten equivalently as
The last line of the above relationship reveals that the argument of the limit involves a comparison between the vector and the result of its transformation by the operator belonging to the holonomy group and, in particular, the relationship
holds true. In turn, as we have already observed, the above-defined operator represents parallel transport along a geodesic triangle, which is an instance of closed loop. Since the curvature endomorphism arises from parallel transport along an infinitesimal closed loop, it might be guessed how parallel transport along a non-infinitesimal loop may be expressed as a path-integral of the curvature.
In order to clarify the relationship that relates the principal pushforward map to the curvature endomorphism, it is instrumental to recall a few results from manifold calculus.
As a first result to recall, which stands by itself as a fundamental finding in manifold calculus, let us survey the notion of anti-covariant-derivative, an homologous of the notion of anti-derivative in standard calculus. Such notion may be expressed in the following way: Let denote a curve and a smooth vector field defined at least on the image of and such that ; then one might wonder how to recover the vector field w from its covariant derivative , supposed to be known. The answer is given by the anti-covariant-derivative formula
Let us verify the correctness of such assertion. By definition of covariant derivative, it holds that
Moreover, let us recall that parallel transport enjoys composition over the same curve (no matter as to whether such curve is a geodesic or a non-geodesic line), therefore . As a consequence, the first integral in the last line of the above multiline equation is identically zero, while the second integral equals
by a continuity argument, hence the assertion (472) follows.
The second result that is convenient to recall is based on the notion of geodesic homotopy. Let us define the function to be a geodesic homotopy, namely, a family of curves such that for any fixed , traces a geodesic curve on connecting the point to the point . The image of an homotopy is sometimes referred to as homotopic net. In addition, let us define to be a smooth vector field parallel along the homotopy, namely, such that and
for every value of the index and for every value of the affine parameter . Namely, the vector field is parallel along the back-end curve and along each geodesic of the homotopic net. Notice that one such vector field is entirely determined by its value given and by the homotopy itself. Then, the following relationship holds:
where denotes the Riemannian curvature endomorphism associated to the manifold and to its Levi-Civita connection. Namely, the transversal covariant derivative of the vector field along the front-end curve is given by a linear operator, which depends on the curvature of the manifold along the homotopic net, applied to the field. In the following, to ease notation we shall denote as the transversal velocity and as the velocity along each geodesic belonging to the homotopic net. By definition of Riemannian curvature endomorphism, we have that
As we had already observed earlier, the vector fields v and commute, moreover, by hypothesis, the vector field is parallel along the homotopy, therefore . As a consequence, the expression (477) simplifies to
By the assumption of parallelism, for any given index s, the vector may be recovered from its front-end value through parallel transport, hence
Applying the notion of anti-covariant-derivative to the left-hand side of the expression (479) gives
Setting gives the sought result.
On the basis of the Equation (476), it is possible to express the principal pushforward map in terms of the Riemannian curvature endomorphism. Such relationship may be made explicit by observing that the principal pushforward map may be expressed in closed form as
Such statement may be explained by choosing an appropriate homotopy h and a vector field . In particular, let us choose the homotopy as
where the back-end curve is such that and the front-end curve is such that . In addition, we have denoted, for simplicity, by the tangent vector . The vector field along the corresponding homotopic net is defined as
with fixed. Apparently, the homotopy (482) traces a smooth geodesic net, in fact, for s fixed, traces a geodesic line that departs from the point y and ends at the point . Moreover, the vector field (483) satisfies both conditions (475), in fact, for any fixed value of the index s, the vector field arises as the result of parallel translating the vector along the corresponding geodesic of the homotopy net, hence is parallel along each geodesic; in addition, the tangent value is obtained by parallel transporting the vector along the curve , hence the vector field is parallel along the back-end arc of which represents the velocity. Therefore, from the relationship (476), it follows that
where for every . The relationship between the covariant derivative and the principal pushforward map is given by the observation that
Therefore, on the fiducial geodesic corresponding to the value , it holds that
where, in particular, . The partial derivative of the homotopy (482) with respect to the parameter t, namely the velocity along each geodesic of the homotopic net, reads
where denotes the pushforward map of the exponential map with respect to the x argument, therefore the velocity of the fiducial geodesic reads
Instead, the partial derivative of the homotopy (482) computed with respect to the index s reads
where denotes the pushforward map of the logarithmic map with respect to the x argument. Such partial derivative at reads
Since and the pushforward map is linear, the above relation may be simplified to
Such expression emphasizes the fact that the transversal velocity is linear in the vector . In conclusion, replacing the Equations (488) and (491) into the expression (486), we get:
which justifies the assertion (481). Notice that the integral in the second equation of (481) is well defined because the integrand is a linear endomorphism in .
Let us apply the above formula to the unit hypersphere as an example of calculation.
Example 33.
For the unit hypersphere endowed with the canonical metric we have determined that the curvature endomorphism reads
for every and . Since the curvature endomorphism does not depend explicitly on the point x, we omitted its indication.
A matrix representation of the curvature endomorphism is the curvature tensor , from which the bilinearity and antisymmetry properties are easily verified. For the ordinary sphere , denoting by the outer product in , the skew-symmetric matrix coincides with the matrix representation of the 3-vector , namely .
Let us recall that the exponential map is expressed as
therefore, its inverse, the logarithmic map, is expressed by the relation:
where it is assumed that the points are not antipodal. In the above formula, the symbol denotes the cardinal sine function defined as:
and the geodesic distance between two points associated to the canonical metrics reads
where the inverse cosine function ranges in .
Matrix representations of the derivatives of the exponential and the logarithmic maps are computed to be
where the prime denotes derivation. The corresponding matrix representation of the endomorphism takes the form:
which denotes an anti-symmetric matrix.
7. Discrete-Time Dynamical Systems on Manifolds
Dynamical systems on manifolds may appear under the form of discrete-time systems as well as continuous-time systems. In turn, discrete-time systems may be natively discrete-time or may be derived from continuous-time systems through time sampling. Discrete-time dynamical systems are nowadays conceived as algorithms and implemented as pieces of codes running on computing platforms.
The aim of the present last section of this tutorial paper is to recall an auto-regressive, moving average (ARMA) mathematical model from [58] that generates a discrete-time temporal sequence on a smooth manifold. In such setting, not only the algorithmic structure of the ARMA model reflect the mathematical structure of the state manifold but the input and the output of the ARMA dynamical system are structured sequences that belong to the tangent bundle of same smooth manifold .
The present section is organized in the following way. The Section 7.1 describes second-order discrete-time auto-regressive moving-average dynamical systems on Riemannian manifolds. To end with, Section 7.2 illustrates examples of ARMA-type dynamical systems on manifold.
7.1. Auto-Regressive Moving-Average (ARMA) Systems on Riemannian Manifolds
An ARMA model on a manifold is described in terms of a pair of sequences , where denotes the output signal of the model, denotes the state of the model and denotes a discrete-time index. In the familiar case that , choosing the standard identification , for any , an ARMA() model on the tangent bundle appears under the form of a state-evolution equation and of a state-to-output transformation equation. In particular, the equation that governs the evolution of the internal state reads:
where the transition matrices and denote the parameters of the auto-regressive and of the moving-average subsystems, respectively, and the sequence denotes a white noise. Let us recall the salient features of the two subsystem of an ARMA model:
- Moving average (MA) subsystem: The moving-average subsystem consists of a linear combination of the current value and of previous samples of the input signal;
- Auto-regressive (AR) subsystem: The auto-regressive subsystem consists of a linear combination of previous values of the internal state of the system.
The integer denotes the memory depth of the AR subsystem, while the integer denotes the memory depth of the MA subsystem. (Notice that speaking of ‘memory depth’ for the AR subsystem is purely conventional since an AR subsystem has an infinite memory span). In addition, the equation that describes how the internal state transforms into the output reads:
with denoting a constant bias (or drift term). It is worth remarking that, since the tangent bundle is trivial (in fact, is isomorphic to ), the Equation (501) is based on vector addition. On a general Riemannian manifold , however, vector addition in the tangent bundle (that is, in the AR part and in the MA part) may be hardly defined without invoking the concept of parallel transport. Likewise, Equation (502) is based on vector addition on , which is not defined on a general (curved) Riemannian manifold and needs rather to be replaced by the exponential map.
The proposed extension of the above ARMA system to a Riemannian manifold is based on the following considerations:
- At each step k, it is necessary to compute a linear combination of several tangent vectors in via the transition operators for the auto-regressive subsystem and for the moving-average subsystem, where the operators and are linear in the second (namely, the vectorial) argument.
- In order to compute the linear combination of previous and current values of the state variable, at each step k it is necessary to parallel transport each tangent vector , …, , , …, to the tangent space .
- In order to propagate the constant bias, at each step k it is necessary to parallel transport it to the tangent space .
On the basis of the above considerations, a manifold-type auto-regressive, moving average M-ARMA() dynamical system on the tangent bundle of the manifold may be cast as:
for . At , it is necessary to know the initial seed and to put into effect an extension of standard zero-padding to account for lagged vector-samples that are not available at :
with denoting the memory depth of the whole M-ARMA system. Notice that the memory depth on the state variable v has been chosen to be 2 since this tutorial paper cover mostly second-order systems and for illustrative purposes. The tangent state-vectors are computed as the output of an auto-regressive moving-average system on the tangent bundle . In a M-ARMA() system whose dynamics manifests through a state-sequence as a response to an input sequence , the current state variable value depends from the past state-variable values and and from the current and past input-variable values . The additional term in the second equation of the mathematical model (503) represents a drift term that causes a slight change in the current state even when the scalar velocity is close to zero.
The M-ARMA model (503), may be cast in a different way by replacing parallel transport with vector transport. The corresponding alternative version of such M-ARMA system reads
for . In some occasions, even if a parallel transport formula is available, vector transport is invoked due to its alleged lighter computational burden.
| Algorithm 4 Pseudocode to implement the M-ARMA() systems (503) and (505). | |
| 1: | Choose a metric for the manifold and make sure the exponential map and the parallel transport operator or the vector transport operator , with , are available |
| 2: | Choose the initial seed and the drift constant |
| 3: | Choose 2 transition operators and q transition operators |
| 4: | Set and set up the zero-padding , , |
| 5: | for to K do |
| 6: | Get a tangent vector from an input source |
| 7: | Compute tangent vectors and , or and , for and and the actualized drift term or |
| 8: | Compute the next state-velocity term |
| 9: | Compute the next term in the output sequence |
| 10: | end for |
7.2. Examples of M-ARMA-Type Dynamical Systems
The aim of the present section is to illustrate some examples of the discussed M-ARMA model. The general M-ARMA() system and the notion of autocorrelation function on manifold will be illustrated through examples on the unit hypersphere, the manifold of symmetric positive-definite matrices and the compact Stiefel manifold. In particular, examples about how to choose the auto-regressive and the moving-average transition operators and will be given. In particular, in the following we shall assume that the purpose of running a M-ARMA algorithm is to generate a non-white pseudo-random sequence on a given manifold, on the basis of a random number generator that is used to produce a pseudo-random input sequence .
7.2.1. Generation of Pseudo-Random Paths on the Unit Hyper-Sphere
In order to generate a pseudo-random vector , given a point , the following algorithm may be employed:
- Generate a random vector by stacking the outputs of a pseudo-random scalar generator;
- Project the vector into the tangent space by .
The generation of a pseudo-random path over the hyper-sphere requires to choose the transition operators that take part in the M-ARMA() model (503). In particular, a possible choice for the transition operators and is:
where the square matrices may be chosen arbitrarily.
Example 34.
To give an example of the above setting, let us choose and . Moreover, let us opt for the version based on parallel transport. The resulting pseudo-random path-generation algorithm M-ARMA on the hypersphere may be particularized from the general expression (503) as follows:
with , for and generated randomly by projection over the tangent space . The value of the integer K may be chosen as large as one needs.
7.2.2. Generation of Pseudo-Random Paths on the Manifold of Symmetric, Positive-Definite Matrices
The starting point is to generate a pseudo-random tangent matrix , given a symmetric, positive-definite matrix . Such result may be achieved by putting into effect the following scheme:
- Generate a pseudo-random matrix by means of a pseudo-random scalar generator;
- Project the matrix into the tangent space by the rule .
In addition, we may recognize that a way to select the transition operators and is:
where and denote constant parameters chosen arbitrarily.
7.2.3. Generation of Pseudo-Random Paths on the Compact Stiefel Manifold
The canonical metric is the metric of choice in the present section.
With the aim to generate a pseudo-random tangent matrix , given a point , the following scheme may be used:
- Generate a pseudo-random matrix by a pseudo-random scalar generator;
- Project the matrix into the tangent space by the rule .
The generation of a pseudo-random path over the manifold requires to select transition operators and . A possible way to effect such choice is:
where and denote arbitrarily chosen constant parameters.
Example 35.
For exemplification purposes, one may select , and . The initial seed needs to be chosen randomly in and one may choose, for simplicity, not to include any drift term in the equations. The pseudo-random path generation rule (505) particularizes, in this case, to:
for . The exponential is implemented as:
where Q and R denote the factors of the compact QR decomposition of the matrix .
8. Conclusions
The present tutorial paper, the second of a series of tutorials on manifold calculus with application to non-linear control systems, was devoted to topics from manifold calculus apt to describe properties of non-linear dynamical systems subject to holonomic constraints in its state variable as well as to control non-linear dynamical systems whose state belong to curved manifolds. As an interesting cases of study for illustrative purpose, special emphasis was put on designing algorithms to achieve synchronization of manifold-type systems based on feedback control and on developing numerical methods tailored to curved manifolds to implement such systems and algorithms.
Aside of practical applications to system theory and control, the present tutorial focuses even on purely theoretical topics such as curvature and covariant derivation. The present tutorial paper does not cover further topic of great interest, such as integration on manifold and systems of order larger than two. Such important arguments will be treated in a forthcoming part of this multi-part tutorial.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Conflicts of Interest
The author declares no conflict of interest.
References
- Bloch, A. An introduction to aspects of geometric control theory. In Nonholonomic Mechanics and Control; Krishnaprasad, P., Murray, R., Eds.; Interdisciplinary Applied Mathematics; Springer: New York, NY, USA, 2015; Volume 24. [Google Scholar]
- Bullo, F.; Lewis, A. Geometric control of mechanical systems. In Texts in Applied Mathematics; Springer: New York, NY, USA; Heidelberg/Berlin, Germany, 2004; Volume 49. [Google Scholar]
- Návrat, A.; Vašík, P. On geometric control models of a robotic snake. Note Mat. 2017, 37, 120–129. [Google Scholar]
- Nijmeijer, H.; van der Schaft, A. Nonlinear Dynamical Control Systems; Springer: New York, NY, USA, 1990. [Google Scholar]
- Sreenath, K.; Lee, T.; Kumar, V. Geometric control and differential flatness of a quadrotor UAV with a cable-suspended load. In Proceedings of the 52nd IEEE Conference on Decision and Control, Firenze, Italy, 10–13 December 2013; pp. 2269–2274. [Google Scholar] [CrossRef]
- Agrachev, A.; Sachkov, Y. Control Theory from the Geometric Viewpoint; Springer: Berlin/Heidelberg, Germnay, 2004; Volume 87. [Google Scholar]
- Fiori, S. Extension of PID regulators to dynamical systems on smooth manifolds (M-PID). SIAM J. Control Optim. 2021, 59, 78–102. [Google Scholar] [CrossRef]
- Brockett, R. Lie algebras and lie groups in control theory. In Geometric Methods in System Theory; Mayne, D., Brockett, R., Eds.; Springer Netherlands: Dordrecht, The Netherlands, 1973; pp. 43–82. [Google Scholar]
- Jakubzyk, B. Introduction to geometric nonlinear control; controllability and Lie bracket. In Mathematical Control Theory, Lectures Notes of a Minicourse; Polish Academy of Sciences: Warsaw, Poland, 2002; pp. 107–168. [Google Scholar]
- Jurdjevic, V. Optimal control on Lie groups and integrable Hamiltonian systems. Regul. Chaotic Dyn. 2011, 16, 514–535. [Google Scholar] [CrossRef]
- Sastry, S. Geometric nonlinear control. In Nonlinear Systems—Analysis, Stability, and Control; Sastry, S., Ed.; Springer Science + Business Media: New York, NY, USA, 1999; pp. 510–573. [Google Scholar]
- Fiori, S. Non-delayed synchronization of non-autonomous dynamical systems on Riemannian manifolds and its applications. Nonlinear Dyn. 2018, 94, 3077–3100. [Google Scholar] [CrossRef]
- Fiori, S. Manifold calculus in system theory and control—Fundamentals and first-order systems. Symmetry 2021, 13, 2092. [Google Scholar] [CrossRef]
- Poisson, E.; Pound, A.; Vega, I. The motion of point particles in curved spacetime. Living Rev. Relativ. 2011, 14, 7. [Google Scholar] [CrossRef]
- Ferreira, R.; Xavier, J. Hessian of the Riemannian squared distance function on connected locally symmetric spaces with applications. In Proceedings of the 7th Portuguese conference on automatic control (Controlo 2006), Lisbon, Portugal, 11–13 September 2006. [Google Scholar]
- Pennec, X. Hessian of the Riemannian Squared Distance; Technical report; Université Côte d’Azur and Inria Sophia-Antipolis Méditerranée: Valbonne, France, 2017. [Google Scholar]
- Fiori, S. Gyroscopic signals smoothness assessment by geometric jolt estimation. Math. Methods Appl. Sci. 2017, 40, 5893–5905. [Google Scholar] [CrossRef]
- Fiori, S. Error-based control systems on Riemannian state manifolds: Properties of the principal pushforward map associated to parallel transport. Math. Control Relat. Fields 2021, 11, 143–167. [Google Scholar] [CrossRef]
- Röbenack, K. Computation of multiple Lie derivatives by algorithmic differentiation. J. Comput. Appl. Math. 2008, 213, 454–464. [Google Scholar] [CrossRef][Green Version]
- Civita, A.; Fiori, S.; Romani, G. A mobile acquisition system and a method for hips sway fluency assessment. Information 2018, 9, 321. [Google Scholar] [CrossRef]
- Fiori, S. A closed-form expression of the instantaneous rotational lurch index to evaluate its numerical approximation. Symmetry 2019, 11, 1208. [Google Scholar] [CrossRef]
- Ferreira, R.; Xavier, J.; Costeira, J.; Barroso, V. Newton method for Riemannian centroid computation in naturally reductive homogeneous spaces. In Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, Toulouse, France, 14–19 May 2006; Volume 3. [Google Scholar] [CrossRef]
- Isvoranu, D.; Udrişte, C. Fluid flow versus geometric dynamics. In Proceedings of the 5th Conference on Differential Geometry, Mangalia, Romania, 29 August–2 September 2005; Geometry Balkan Press: Bucharest, Romania, 2006; pp. 70–82. [Google Scholar]
- van der Pol, B. On “relaxation-oscillations”. Lond. Edinb. Dublin Philos. Mag. J. Sci. 1926, 2, 978–992. [Google Scholar] [CrossRef]
- Tôrres, L.; Aguirre, L. Transmitting information by controlling nonlinear oscillators. Physica D 2004, 196, 387–406. [Google Scholar] [CrossRef]
- Cao, T.; Yi, H. On the complex oscillation of higher order linear differential equations with meromorphic coefficients. J. Syst. Sci. Complex. 2007, 20, 135–148. [Google Scholar] [CrossRef]
- Chen, J.; Lu, J.; Wu, X. Bidirectionally coupled synchronization of the generalized Lorenz systems. J. Syst. Sci. Complex. 2011, 24, 433–448. [Google Scholar] [CrossRef]
- Huang, J.; Zhang, H. Bifurcations of periodic orbits in three-well Duffing system with a phase shift. J. Syst. Sci. Complex. 2011, 24, 519–531. [Google Scholar] [CrossRef]
- Mo, J.; Lin, W. Generalized variation iteration solution of an atmosphere-ocean oscillator model for global climate. J. Syst. Sci. Complex. 2011, 24, 271–276. [Google Scholar] [CrossRef]
- Fiori, S. Nonlinear damped oscillators on Riemannian manifolds: Fundamentals. J. Syst. Sci. Complex. 2016, 29, 22–40. [Google Scholar] [CrossRef]
- FitzHugh, R. Mathematical models of excitation and propagation in nerve. In Biological Engineering; H.P. Schwan, Ed.; McGraw-Hill: New York, NY, USA, 1969. [Google Scholar]
- Barbosa, R.; Tenreiro Machado, J.; Vinagre, B.; Calderón, A. Analysis of the van der Pol oscillator containing derivatives of fractional order. J. Vib. Control 2007, 13, 1291–1301. [Google Scholar] [CrossRef]
- Duffing, G. Erzwungene Schwingungen bei Veränderlicher Eigenfrequenz und Ihre Technische Bedeutung; Braunschweig, F., Ed.; Vieweg & Sohn: Berlin, Germany, 1918. [Google Scholar]
- Trueba, J.; Rams, J.; Sanjuán, M. Analytical estimates of the effect of nonlinear damping in some nonlinear oscillators. Int. J. Bifurc. Chaos 2000, 10, 2257–2267. [Google Scholar] [CrossRef]
- Sanjuán, M. The effect of nonlinear damping on the universal escape oscillator. Int. J. Bifurc. Chaos 1999, 9, 735–744. [Google Scholar] [CrossRef]
- Lorenz, E. Deterministic nonperiodic flow. J. Atmos. Sci. 1963, 20, 130–141. [Google Scholar] [CrossRef]
- Greer, M.; Saha, R.; Gogliettino, A.; Yu, C.; Zollo-Venecek, K. Emergence of oscillations in a simple epidemic model with demographic data. R. Soc. Open Sci. 2020, 7, 191187. [Google Scholar] [CrossRef]
- Georgiou, I.; Corless, M.; Bajaj, A. Dynamics of nonlinear structures with multiple equilibria: A singular perturbation-invariant manifold approach. Z. Angew. Math. Phys. 1999, 50, 892–924. [Google Scholar] [CrossRef]
- Shorek, S. A stationarity principle for non-conservative systems. Adv. Water Resour. 1984, 7, 85–88. [Google Scholar] [CrossRef]
- Molero, F.; Lara, M.; Ferrer, S.; Cèspedes, F. 2-D Duffing oscillator: Elliptic functions from a dynamical systems point of view. Qual. Theory Dyn. Syst. 2013, 12, 115–139. [Google Scholar] [CrossRef]
- Cucker, F.; Smale, S. Emergent behavior in flocks. IEEE Trans. Autom. Control 2007, 52, 852–862. [Google Scholar] [CrossRef]
- Ha, S.Y.; Kim, D.; Schlöder, F.W. Emergent behaviors of Cucker–Smale flocks on Riemannian manifolds. IEEE Trans. Autom. Control 2021, 66, 3020–3035. [Google Scholar] [CrossRef]
- Fiori, S.; Del Rossi, L. Minimal control effort and time Lie-group synchronisation design based on proportional-derivative control. Int. J. Control 2022, 95, 138–150. [Google Scholar] [CrossRef]
- Fiori, S. Synchronization of first-order autonomous oscillators on Riemannian manifolds. Discret. Contin. Dyn. Syst. Ser. B 2019, 24, 1725–1741. [Google Scholar] [CrossRef]
- Fiori, S. Model formulation over Lie groups and numerical methods to simulate the motion of gyrostats and quadrotors. Mathematics 2019, 7, 935. [Google Scholar] [CrossRef]
- Gaponov, I.; Razinkova, A. Quadcopter design and implementation as a multidisciplinary engineering course. In Proceedings of the IEEE International Conference on Teaching, Assessment, and Learning for Engineering (TALE) 2012, Hong Kong, China, 20–23 August 2012; pp. H2B-16–H2B-19. [Google Scholar] [CrossRef]
- Bloch, A.; Krishnaprasad, P.; Marsden, J.; Ratiu, T. The Euler-Poincaré equations and double bracket dissipation. Commun. Math. Phys. 1996, 175, 1–42. [Google Scholar] [CrossRef]
- Gajbhiye, S.; Banavar, R. The Euler-Poincaré equations for a spherical robot actuated by a pendulum. IFAC Proc. Vol. 2012, 45, 72–77. [Google Scholar] [CrossRef]
- Becker, M.; Sampaio, R.; Bouabdallah, S.; de Perrot, V.; Siegwart, R. In-flight collision avoidance controller based only on OS4 embedded sensors. J. Braz. Soc. Mech. Sci. Eng. 2012, 34, 295–297. [Google Scholar] [CrossRef]
- Fiori, S. A control-theoretic approach to the synchronization of second-order continuous-time dynamical systems on real connected Riemannian manifolds. SIAM J. Control Optim. 2020, 58, 787–813. [Google Scholar] [CrossRef]
- Magdy, M.; Ng, T. Regulation and control effort in self-tuning controllers. IEE Proc. D Control Theory Appl. 1986, 133, 289–292. [Google Scholar] [CrossRef]
- Grönwall, T. Note on the derivatives with respect to a parameter of the solutions of a system of differential equations. Ann. Math. 1919, 20, 292–296. [Google Scholar] [CrossRef]
- Henderson, H.; Searle, S. On deriving the inverse of a sum of matrices. SIAM Rev. 1981, 23, 53–60. [Google Scholar] [CrossRef]
- Gavrilov, A. The Taylor series related to the differential of the exponential map. arXiv 2012, arXiv:1205:2868v1. [Google Scholar]
- Bullo, F.; Lewis, A. Geometric Control of Mechanical Systems: Modeling, Analysis, and Design for Mechanical Control Systems; Springer: Berlin/Heidelberg, Germany, 2005. [Google Scholar]
- Osborne, J.; Hicks, G. The geodesic spring on the Euclidean sphere with parallel-transport-based damping. Not. AMS 2013, 60, 544–556. [Google Scholar] [CrossRef]
- Ambrose, W.; Singer, I.M. A theorem on holonomy. Trans. Am. Math. Soc. 1953, 75, 428–443. [Google Scholar] [CrossRef]
- Fiori, S. Auto-regressive moving-average discrete-time dynamical systems and autocorrelation functions on real-valued Riemannian matrix manifolds. Discret. Contin. Dyn. Syst. Ser. B 2014, 19, 2785–2808. [Google Scholar] [CrossRef]
- Escobar, J.; Poznyak, A. Robust parametric identification for ARMAX models with non-Gaussian and coloured noise: A survey. Mathematics 2022, 10, 1291. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).