Article

Geometric Numerical Methods for Lie Systems and Their Application in Optimal Control

by Luis Blanco Díaz 1, Cristina Sardón 1,*, Fernando Jiménez Alburquerque 1 and Javier de Lucas 2
1 Department of Applied Mathematics, Universidad Politécnica de Madrid (UPM), c. José Gutiérrez Abascal 2, 28006 Madrid, Spain
2 Department of Mathematical Methods in Physics, University of Warsaw, ul. Pasteura 5, 02-093 Warsaw, Poland
* Author to whom correspondence should be addressed.
Symmetry 2023, 15(6), 1285; https://doi.org/10.3390/sym15061285
Submission received: 19 April 2023 / Revised: 23 May 2023 / Accepted: 24 May 2023 / Published: 19 June 2023

Abstract:
A Lie system is a nonautonomous system of first-order ordinary differential equations whose general solution can be written, via an autonomous function (the so-called nonlinear superposition rule), in terms of a finite number of particular solutions and some parameters related to the initial conditions. This superposition rule can be obtained using the geometric features of the Lie system, its symmetries, and the symmetric properties of certain morphisms involved. Even if a superposition rule for a Lie system is known, the explicit analytic expression of its solutions frequently is not. This is why this article focuses on a novel geometric approach to integrating Lie systems analytically and numerically. We focus on two families of methods, based on Magnus expansions and on Runge–Kutta–Munthe–Kaas methods, which are here adapted, in a geometric manner, to Lie systems. To illustrate the accuracy of our techniques, we analyze Lie systems related to Lie groups of the form $SL(n,\mathbb{R})$, which play a very relevant role in mechanics. In particular, we depict an optimal control problem for a vehicle with a quadratic cost function. Particular numerical solutions of the studied examples are given.
MSC:
34A26; 53A70 (primary); 37M15; 49M25 (secondary)

1. Introduction

The analytic integration of differential equations can be achieved on many relevant occasions, but it is not the usual case. Sometimes the geometric and symmetry properties of a Lie system are not enough to completely integrate the system, and this is why numerical methods are so important in the study of solutions of differential equations. In particular, this paper devises geometric numerical methods adapted to a particular class of nonautonomous first-order systems of ordinary differential equations (ODEs): the so-called Lie systems [1,2,3].
A Lie system is a nonautonomous first-order system of ODEs that admits a general solution in terms of an autonomous function, the so-called superposition rule, a family of generic particular solutions and certain constants of integration related to the initial conditions [4,5,6]. It is worth noting that a superposition rule for a Lie system may be explicitly known even when the explicit expression of its analytic solution is not [5]. Although obtaining a superposition rule reduces the integration of Lie systems to obtaining some particular solutions, such particular solutions are not easy to describe explicitly [3,5]. This is why we consider that geometric numerical methods for Lie systems should be developed. One could find an extensive list of works devoted to numerical algorithms in geometric mechanics [7,8,9,10,11,12] and references therein, but, as far as we know, just a few methods have been specifically designed for Lie systems [13,14]. This manuscript, therefore, provides a novel application of geometric analytical and numerical methods to Lie systems, leading to some interesting consequences.
Our interest in Lie systems is two-fold. On the one hand, it is rooted in their geometric background. In short, the origin of Lie systems goes back to the XIX century, when Sophus Lie proved that a nonautonomous system of first-order ODEs admits a superposition rule if and only if it describes the integral curves of a t-dependent vector field taking values in a finite-dimensional Lie algebra of vector fields, known as a Vessiot–Guldberg Lie algebra (VG henceforth) of the Lie system. The symmetries of a Lie system are in direct correlation with the underlying VG Lie algebra. The theory of Lie systems has been widely studied in the last two decades and its research involves projective foliations, generalized distributions, Lie group theory, Poisson coalgebras, etc. (see [1,2,15] and references therein). In particular, the coalgebra method is based on symmetry properties of certain operators that allow us to obtain superposition rules with the aid of a finite-dimensional Poisson algebra of functions. On the other hand, Lie systems have many remarkable applications in relevant scientific fields (see [2] and references therein). For instance, Lie systems are used in the study of the integrability of Riccati equations [16], quantum mechanics [17], stochastic mechanics [18], superequations [19], and in biology and cosmology [2]. Recently, the theory of Lie systems has been generalized to higher-order ordinary differential equations, such as higher-order Riccati equations [20], second- and third-order Kummer–Schwarz equations [5], and Milne–Pinney equations [21], among others. Additionally, the theory of Lie systems is also extensible to systems of partial differential equations [15,22].
In the past few decades, discrete methods have made great progress in faithfully describing reality. For instance, the interest of numerical analysis in the research on Lie systems was already stressed by Winternitz [3], who remarked that superposition rules allow us to study all solutions of a Lie system from the knowledge of some of them, which can be derived numerically. This is why the discretization of Lie systems and their numerical integration has caught our attention. Since Lie systems are geometrically described in terms of an underlying VG Lie algebra, a Lie system can be solved by studying a Lie system of a specific type, a so-called automorphic Lie system [2], on a Lie group associated with the VG Lie algebra. Two automorphic Lie systems are equivalent when they are related by an automorphic transformation; in this way, automorphic Lie systems can be claimed to be symmetric Lie systems. One can then propose a numerical method for the automorphic Lie system, giving rise to numerical methods for a plethora of Lie systems that are related to the initial one through an automorphic map preserving the properties of the Lie group, also known as a symmetry group transformation. Our perspective here on numerical methods specifically designed for Lie systems proposes numerical schemes on the Lie group. There already exist some numerical methods designed to work on Lie groups, but our aim is to adapt them to Lie systems. In particular, we will focus on two classes of methods: the so-called Magnus methods [8,23,24] and the Runge–Kutta–Munthe–Kaas (RKMK) methods [25,26], the latter being based on the classical Runge–Kutta (RK) schemes.
Summarizing, this manuscript presents a novel procedure for the integration of Lie systems by applying geometric numerical methods on one of its associated automorphic Lie systems, which is defined on a Lie group (we may refer to it as a VG Lie group). We aim at providing a quantitative and qualitative analysis of our numerical methods on the Lie group and compare them with the results obtained from numerical integration of the system of ODEs that defines the Lie system. This would resolve at the same time all Lie systems that are related to the same automorphic Lie system, i.e., those Lie systems that have isomorphic VG Lie algebras and that are determined by an equivalent curve within them (see [1] for details). We apply our numerical methods to automorphic Lie systems defined on Lie groups SL ( n , R ) , which appear in many physical applications (cf. [27]). We are particularly interested in control theory, which involves matrix Riccati equations [28]. We depict an application of matrix Riccati equations in optimal control with quadratic cost functions and solve it numerically with our adapted Magnus and RKMK methods.
The structure of the paper goes as follows. Section 2 surveys the basic theory of Lie systems and develops their analytical resolution, constructed upon the geometric structure they are built on; this analytical solution is enclosed in the procedure summarized in Section 2.3.1. Meanwhile, Section 3 is concerned with the novel discretization we are proposing for Lie systems, enclosed in Definition 3. An application of our methods to $SL(2,\mathbb{R})$ and $SL(3,\mathbb{R})$ is provided in Section 4. Finally, an optimal control problem for a vehicle with a quadratic cost function is presented in Section 5 and resolved using the novel analytical techniques we are delivering.

2. Geometric Fundamentals and Lie Systems

This section establishes the notation and geometric fundamentals on Lie systems and related concepts that we will be using throughout the manuscript. Unless otherwise stated, we hereafter assume all structures to be smooth, real, and globally defined. This will simplify the presentation, while stressing its main points. From now on, $\mathbb{K}$ stands for a field that is either $\mathbb{R}$ or $\mathbb{C}$.

2.1. Geometric Fundamentals

A key concept in the theory of Lie systems is that of t-dependent vector fields. Let us describe this geometric concept. Consider an n-dimensional manifold N and its natural tangent bundle projection $\pi_N : TN \to N$. Let us define the projection $\pi_2 : (t,x) \in \mathbb{R} \times N \mapsto x \in N$, where t is the natural coordinate on $\mathbb{R}$. A t-dependent vector field on N is a map $X : (t,x) \in \mathbb{R} \times N \mapsto X(t,x) \in TN$ so that the following diagram becomes commutative:
[Commutative diagram: $\pi_N \circ X = \pi_2$]
That is, $\pi_N \circ X = \pi_2$. In other words, a t-dependent vector field X on N amounts to a t-parametrized family of standard vector fields $\{X_t : x \in N \mapsto X(t,x) \in TN\}_{t \in \mathbb{R}}$ on N (see [5] for details). We write $\mathfrak{X}_t(N)$ for the space of t-dependent vector fields on N, while $\mathfrak{X}(N)$ stands for the space of vector fields on N.
An integral curve of a t-dependent vector field X on N is a curve $\gamma : \mathbb{R} \to N$ of the form $\gamma = \pi_2 \circ \hat{\gamma}$, where $\hat{\gamma} : \mathbb{R} \to \mathbb{R} \times N$ is an integral curve of the so-called autonomization of X, namely the vector field $\bar{X} = \partial/\partial t + X$ on $\mathbb{R} \times N$, which is also a section of the natural projection $\pi : (t,x) \in \mathbb{R} \times N \mapsto t \in \mathbb{R}$. More precisely, if $X = \sum_{i=1}^{n} \eta^i(t,x)\,\partial/\partial x^i$ in a local coordinate system $\{x^1, \ldots, x^n\}$ on N, then
$$\bar{X} = \frac{\partial}{\partial t} + \sum_{i=1}^{n} \eta^i(t,x)\,\frac{\partial}{\partial x^i},$$
and $\hat{\gamma} : s \in \mathbb{R} \mapsto (s, \gamma(s)) \in \mathbb{R} \times N$ is a solution of the system of differential equations
$$\frac{dx^i}{ds} = \eta^i(t,x), \qquad \frac{dt}{ds} = 1, \qquad i = 1, \ldots, n.$$
The reparametrization $t = t(s)$ shows that $\gamma(t)$ is a solution to
$$\frac{dx^i}{dt} = \eta^i(t,x), \qquad i = 1, \ldots, n. \tag{1}$$
System (1) is called the associated system of X. Conversely, a first-order system of ODEs in normal form (1) gives rise to a t-dependent vector field on N of the form
$$X(t,x) = \sum_{i=1}^{n} \eta^i(t,x)\,\frac{\partial}{\partial x^i},$$
whose integral curves are of the form $t \mapsto (t, \gamma(t))$, where $\gamma(t)$ is a particular solution to (1). This fact justifies identifying X with the t-dependent first-order system of ordinary differential equations (1).
For our purposes, it is important to relate t-dependent vector fields to Lie algebras. A Lie algebra is a pair $(V, [\cdot,\cdot])$, where V is a vector space and $[\cdot,\cdot] : V \times V \to V$ is a bilinear and antisymmetric map that satisfies the Jacobi identity. The minimal Lie algebra, $\mathrm{Lie}(\mathcal{B}, V, [\cdot,\cdot])$, of a subset $\mathcal{B} \subset V$ of a Lie algebra $(V,[\cdot,\cdot])$ is the smallest Lie subalgebra (in the sense of inclusion) of V that contains $\mathcal{B}$. If it does not lead to misunderstanding, $\mathrm{Lie}(\mathcal{B}, V, [\cdot,\cdot])$ will simply be denoted by $\mathrm{Lie}(\mathcal{B})$. Given a t-dependent vector field X on N, we call the minimal Lie algebra of X the smallest Lie algebra, $V^X$, of vector fields on N that contains all the vector fields $\{X_t\}_{t \in \mathbb{R}}$.

2.2. Lie Groups and Matrix Lie Groups

Let G be a Lie group and let e be its neutral element. Every $g \in G$ defines a right-translation $R_g : h \in G \mapsto hg \in G$ and a left-translation $L_g : h \in G \mapsto gh \in G$ on G. A vector field, $X^R$, on G is right-invariant if $X^R(hg) = R_{g*,h}X^R(h)$ for every $h, g \in G$, where $R_{g*,h}$ is the tangent map to $R_g$ at $h \in G$. The value of a right-invariant vector field, $X^R$, at every point of G is determined by its value at e, since, by definition, $X^R(g) = R_{g*,e}X^R(e)$ for every $g \in G$. Hence, each right-invariant vector field $X^R$ on G gives rise to a unique $X^R(e) \in T_eG$ and vice versa. Then, the space of right-invariant vector fields on G is a finite-dimensional Lie algebra. Similarly, one may define left-invariant vector fields on G, establish a Lie algebra structure on the space of left-invariant vector fields, and set an isomorphism between the space $\mathfrak{g}$ of left-invariant vector fields on G and $T_eG$. The Lie algebra of left-invariant vector fields on G, with Lie bracket $[\cdot,\cdot] : \mathfrak{g} \times \mathfrak{g} \to \mathfrak{g}$, induces in $T_eG$ a Lie algebra via the identification of left-invariant vector fields and their values at e. Note that we will frequently identify $\mathfrak{g}$ with $T_eG$ to simplify the terminology.
There is a natural mapping from $\mathfrak{g}$ to G, the so-called exponential map, of the form $\exp : a \in \mathfrak{g} \mapsto \gamma_a(1) \in G$, where $\gamma_a : \mathbb{R} \to G$ is the integral curve of the right-invariant vector field $X_a^R$ on G satisfying $X_a^R(e) = a$ and $\gamma_a(0) = e$. If $\mathfrak{g} = \mathfrak{gl}(n,\mathbb{K})$, where $\mathfrak{gl}(n,\mathbb{K})$ is the Lie algebra of $n \times n$ square matrices with entries in a field $\mathbb{K}$ relative to the Lie bracket given by the commutator of matrices, then $\mathfrak{gl}(n,\mathbb{K})$ can be considered as the Lie algebra of the Lie group $GL(n,\mathbb{K})$ of $n \times n$ invertible matrices with entries in $\mathbb{K}$. It can be proved that, in this case, $\exp : X \in \mathfrak{gl}(n,\mathbb{K}) \mapsto \exp(X) \in GL(n,\mathbb{K})$ retrieves the standard expression of the exponential of a matrix [29], namely
$$\exp(X) = I_n + X + \frac{X^2}{2} + \frac{X^3}{6} + \cdots = \sum_{k=0}^{\infty} \frac{X^k}{k!},$$
where $I_n$ stands for the $n \times n$ identity matrix.
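The matrix exponential series above is easy to check numerically. The following minimal sketch (ours, not from the paper) compares a truncated sum of the series against SciPy's built-in matrix exponential for a traceless matrix, i.e., an element of $\mathfrak{sl}(2,\mathbb{R})$; the matrix chosen is an arbitrary illustration:

```python
import numpy as np
from scipy.linalg import expm

def exp_series(X, terms=30):
    """Truncated matrix exponential: sum_{k=0}^{terms-1} X^k / k!."""
    n = X.shape[0]
    result = np.eye(n)
    term = np.eye(n)
    for k in range(1, terms):
        term = term @ X / k          # builds X^k / k! incrementally
        result = result + term
    return result

# An arbitrary traceless matrix, i.e., an element of sl(2, R)
X = np.array([[0.3, 1.0],
              [0.5, -0.3]])
assert np.allclose(exp_series(X), expm(X))
# det(exp X) = exp(tr X) = 1 for traceless X, so exp maps sl(2,R) into SL(2,R)
assert np.isclose(np.linalg.det(exp_series(X)), 1.0)
```

The second assertion illustrates why the exponential map sends the Lie algebra $\mathfrak{sl}(n,\mathbb{R})$ into the Lie group $SL(n,\mathbb{R})$ mentioned at the end of this section.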
From the definition of the exponential map $\exp : T_eG \to G$, it follows that $\exp(sa) = \gamma_a(s)$ for each $s \in \mathbb{R}$ and $a \in T_eG$. Let us show this. Indeed, given the right-invariant vector field $X_{sa}^R$, where $sa \in T_eG$, then
$$X_{sa}^R(g) = R_{g*,e}X_{sa}^R(e) = R_{g*,e}(sa) = s\,R_{g*,e}(a), \qquad \forall g \in G.$$
In particular, for $s = 1$, it follows that $X_a^R(g) = R_{g*,e}(a)$ and, for general s, that $X_{sa}^R = sX_a^R$. Hence, if $\gamma_a, \gamma_{sa} : \mathbb{R} \to G$ are the integral curves of $X_a^R$ and $X_{sa}^R$ with initial condition e, respectively, then, setting $u = ts$, one has
$$\frac{d}{dt}\gamma_a(ts) = s\,\frac{d}{du}\gamma_a(u) = s\,X_a^R(\gamma_a(ts)),$$
so $t \mapsto \gamma_a(st)$ is the integral curve of $X_{sa}^R$ with initial condition e. Hence, $\gamma_a(st) = \gamma_{sa}(t)$ and, therefore, $\exp(sa) = \gamma_{sa}(1) = \gamma_a(s)$. It is worth stressing that Ado's theorem [30] shows that every Lie group admits a matrix representation close to its neutral element.
The exponential map establishes a diffeomorphism between an open neighborhood $U_\mathfrak{g}$ of 0 in $T_eG$ and $\exp(U_\mathfrak{g})$. In more detail, every basis $V = \{v_1, \ldots, v_r\}$ of $T_eG$ gives rise to the so-called canonical coordinates of the second kind related to V, defined by the local diffeomorphism
$$(\lambda_1, \ldots, \lambda_r) \in U_\mathfrak{g} \subset T_eG \ \mapsto\ \prod_{\alpha=1}^{r} \exp(\lambda_\alpha v_\alpha) \in \exp(U_\mathfrak{g}) \subset G,$$
for an appropriate open neighborhood $U_\mathfrak{g}$ of 0 in $T_eG \simeq \mathfrak{g}$.
In matrix Lie groups, right-invariant vector fields take a simple, useful form. In fact, let G be a matrix Lie group; it can then be considered as a Lie subgroup of $GL(n,\mathbb{K})$. Moreover, it can be proved that $T_AG$, for any $A \in G$, can be identified with a subspace of the space of $n \times n$ square matrices $M_n(\mathbb{K})$.
Since $R_A : B \in G \mapsto BA \in G$, then $R_{A*,e}(M) = MA \in T_AG$ for all $M \in T_eG$ and $A \in GL(n,\mathbb{K})$. As a consequence, if $X^R(e) = M$ at the neutral element e, namely the identity I of the matrix Lie group G, then $X^R(A) = R_{A*,I}(X^R(I)) = R_{A*,I}(M) = MA$. It follows that, at any $A \in G$, every tangent vector $B \in T_AG$ can be written as $B = CA$ for a unique $C \in T_IG$ [31,32].
Let us describe some basic facts on Lie group actions on manifolds induced by Lie algebras of vector fields. It is known that every finite-dimensional Lie algebra, V, of vector fields on a manifold N gives rise to a (local) Lie group action
$$\varphi : G \times N \to N, \tag{2}$$
whose fundamental vector fields are given by the elements of V, where G is a connected and simply connected Lie group whose Lie algebra is isomorphic to V. If the vector fields of V are complete, then the Lie group action (2) is globally defined on $G \times N$. Let us show how to obtain φ from V, which will be of crucial importance in this work.
Let us restrict ourselves to an open neighborhood $U_G$ of the neutral element of G, where we can use canonical coordinates of the second kind related to a basis $\{v_1, \ldots, v_r\}$ of $\mathfrak{g}$. Then, each $g \in U_G$ can be expressed as
$$g = \prod_{\alpha=1}^{r} \exp(\lambda_\alpha v_\alpha), \tag{3}$$
for certain uniquely defined parameters $\lambda_1, \ldots, \lambda_r \in \mathbb{R}$. To determine φ, we determine the curves
$$\gamma_x^\alpha : t \in \mathbb{R} \mapsto \varphi(\exp(t v_\alpha), x) \in N, \qquad \alpha = 1, \ldots, r,$$
where $\gamma_x^\alpha$ must be the integral curve of $X_\alpha$ for $\alpha = 1, \ldots, r$. Indeed, for any element $g \in U_G \subset G$ expressed as in (3), using the intrinsic properties of a Lie group action,
$$\varphi(g, x) = \varphi\left(\prod_{\alpha=1}^{r} \exp(\lambda_\alpha v_\alpha), x\right) = \varphi\Big(\exp(\lambda_1 v_1), \varphi\big(\exp(\lambda_2 v_2), \ldots \varphi(\exp(\lambda_r v_r), x) \ldots\big)\Big), \tag{4}$$
and the action is completely defined for any $g \in U_G \subset G$.
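As an illustration of this construction (our sketch, not taken verbatim from the paper), consider the vector fields $X_1 = \partial/\partial x$, $X_2 = x\,\partial/\partial x$, $X_3 = x^2\,\partial/\partial x$ on $\mathbb{R}$, whose flows are elementary, together with one choice of matrix basis of $\mathfrak{sl}(2,\mathbb{R})$ (an assumption on our part) for which the Möbius action of $SL(2,\mathbb{R})$ on $\mathbb{R}$ has exactly these fundamental vector fields. Composing the flows in canonical coordinates of the second kind reproduces the group action:

```python
import numpy as np
from scipy.linalg import expm

# Flows of X1 = d/dx, X2 = x d/dx, X3 = x^2 d/dx on R (obtained by integration):
flow1 = lambda lam, x: x + lam
flow2 = lambda lam, x: np.exp(lam) * x
flow3 = lambda lam, x: x / (1 - lam * x)

# One choice of matrix basis of sl(2, R) whose Mobius action on R has
# X1, X2, X3 as fundamental vector fields (assumed here for illustration):
M1 = np.array([[0.0, 1.0], [0.0, 0.0]])
M2 = np.array([[0.5, 0.0], [0.0, -0.5]])
M3 = np.array([[0.0, 0.0], [-1.0, 0.0]])

def mobius(Y, x):
    """Action of a 2x2 matrix on R by fractional linear transformations."""
    return (Y[0, 0] * x + Y[0, 1]) / (Y[1, 0] * x + Y[1, 1])

# For g = exp(l1 M1) exp(l2 M2) exp(l3 M3), the action phi(g, x) equals the
# composition of the three flows, exactly as prescribed by the construction:
l1, l2, l3, x = 0.3, -0.2, 0.1, 0.7
g = expm(l1 * M1) @ expm(l2 * M2) @ expm(l3 * M3)
assert np.isclose(mobius(g, x), flow1(l1, flow2(l2, flow3(l3, x))))
```

The three one-parameter subgroups act as translations, dilations, and special conformal maps, respectively, matching the three flows.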
In this work we will deal with some particular matrix Lie groups, starting from the general linear matrix group $GL(n,\mathbb{K})$, where we recall that $\mathbb{K}$ may be $\mathbb{R}$ or $\mathbb{C}$. As is well known, any closed subgroup of $GL(n,\mathbb{K})$ is also a matrix Lie group ([29], Theorem 15.29, p. 392). In the forthcoming pages we will work with some of those subgroups, such as $SL(n,\mathbb{R})$, the Lie group formed by $n \times n$ real matrices with unit determinant. Moreover, for future reference, we recall that the Lie algebra of $SL(n,\mathbb{R})$, i.e., $\mathfrak{sl}(n,\mathbb{R})$, is the space of traceless $n \times n$ real matrices [32,33].

2.3. Lie Systems

The Lie Theorem [15] states that a Lie system is a t-dependent system of (first-order) ordinary differential equations that describes the integral curves of a t-dependent vector field that takes values in a finite-dimensional Lie algebra of vector fields, namely the aforementioned Vessiot–Guldberg Lie algebra (VG) [1,5]. As we also mentioned previously, one of the most important characteristics of Lie systems is that they admit (generally nonlinear) superposition rules and a plethora of mathematical properties mediated by the Lie theorem [15]. Furthermore, some Lie systems can be studied via a Hamiltonian formulation [2,20].
In this section we introduce some of these fundamental concepts in the theory of Lie systems. In this way, we start by introducing solutions of Lie systems in terms of superposition rules.
On a first approximation, a Lie system is a first-order system of ODEs that admits a superposition rule.
Definition 1.
A superposition rule for a system X on N is a map $\Phi : N^m \times N \to N$ such that the general solution x(t) of X can be written as $x(t) = \Phi(x_{(1)}(t), \ldots, x_{(m)}(t); \rho)$, where $x_{(1)}(t), \ldots, x_{(m)}(t)$ is a generic family of particular solutions and ρ is a point in N related to the initial conditions of X.
A classic example of a Lie system is the Riccati equation ([2], Example 3.3), that is,
$$\frac{dx}{dt} = b_1(t) + b_2(t)\,x + b_3(t)\,x^2, \qquad x \in \mathbb{R}, \tag{5}$$
with $b_1(t), b_2(t), b_3(t)$ being arbitrary functions of t. It is known that the general solution, x(t), of the Riccati equation can be written as
$$x(t) = \frac{x_{(2)}(t)\left(x_{(3)}(t) - x_{(1)}(t)\right) + \rho\, x_{(3)}(t)\left(x_{(1)}(t) - x_{(2)}(t)\right)}{\left(x_{(3)}(t) - x_{(1)}(t)\right) + \rho\left(x_{(1)}(t) - x_{(2)}(t)\right)},$$
where $x_{(1)}(t), x_{(2)}(t), x_{(3)}(t)$ are three different particular solutions of (5) and $\rho \in \mathbb{R}$ is an arbitrary constant. This implies that the Riccati equation admits a superposition rule $\Phi : \mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}$ such that
$$\Phi(x_{(1)}, x_{(2)}, x_{(3)}; \rho) = \frac{x_{(2)}(x_{(3)} - x_{(1)}) + \rho\, x_{(3)}(x_{(1)} - x_{(2)})}{(x_{(3)} - x_{(1)}) + \rho(x_{(1)} - x_{(2)})}. \tag{6}$$
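The superposition rule (6) can be verified numerically. In the sketch below (our check, with an arbitrarily chosen equation), we take the Riccati equation $dx/dt = 1 + x^2$ (i.e., $b_1 = b_3 = 1$, $b_2 = 0$), whose particular solutions are $\tan(t+c)$, combine three of them through (6), and confirm by finite differences that the combination solves the same equation:

```python
import numpy as np

def superposition(x1, x2, x3, rho):
    """Superposition rule (6) for the Riccati equation."""
    return (x2 * (x3 - x1) + rho * x3 * (x1 - x2)) / ((x3 - x1) + rho * (x1 - x2))

def y(t, rho=0.5):
    # Three particular solutions of dx/dt = 1 + x^2 (b1 = b3 = 1, b2 = 0),
    # namely tan(t + c) for c = 0, 1, 2, combined via the superposition rule.
    return superposition(np.tan(t), np.tan(t + 1.0), np.tan(t + 2.0), rho)

h = 1e-6
for t in (0.1, 0.2, 0.3):
    dy = (y(t + h) - y(t - h)) / (2 * h)      # central finite difference
    assert abs(dy - (1 + y(t) ** 2)) < 1e-4   # y solves the same Riccati equation
```

Changing the constant ρ sweeps out the whole family of solutions, as Definition 1 states.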
The conditions that guarantee the existence of a superposition rule are gathered in the Lie theorem ([34], Theorem 44).
Theorem 1.
(Lie theorem). A first-order system X on N,
$$\frac{dx}{dt} = X(t,x), \qquad x \in N, \quad X \in \mathfrak{X}_t(N),$$
admits a superposition rule if and only if X can be written as
$$X(t,x) = \sum_{\alpha=1}^{r} b_\alpha(t)\, X_\alpha(x), \qquad t \in \mathbb{R}, \quad x \in N,$$
for a certain family $b_1(t), \ldots, b_r(t)$ of t-dependent functions and a family of vector fields $X_1, \ldots, X_r$ on N that generate an r-dimensional Lie algebra of vector fields.
The Lie theorem yields that every Lie system X is related to (at least) one VG Lie algebra, V, that satisfies $\mathrm{Lie}(\{X_t\}_{t \in \mathbb{R}}) \subseteq V$. This implies that the minimal Lie algebra has to be finite-dimensional, and vice versa [5].
Example 1.
The t-dependent vector field on the real line associated with (5) is $X = b_1(t)X_1 + b_2(t)X_2 + b_3(t)X_3$, where $X_1, X_2, X_3$ are the vector fields on $\mathbb{R}$ given by
$$X_1 = \frac{\partial}{\partial x}, \qquad X_2 = x\,\frac{\partial}{\partial x}, \qquad X_3 = x^2\,\frac{\partial}{\partial x}.$$
Since the commutation relations are
$$[X_1, X_2] = X_1, \qquad [X_1, X_3] = 2X_2, \qquad [X_2, X_3] = X_3,$$
the vector fields $X_1, X_2, X_3$ generate a VG Lie algebra isomorphic to $\mathfrak{sl}(2,\mathbb{R})$. Then, the Lie theorem guarantees that (5) admits a superposition rule, which is precisely the one shown in (6).
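The commutation relations above can be checked symbolically. The following sketch (ours, for illustration) realizes each vector field as a first-order differential operator acting on a generic test function and computes the brackets with SymPy:

```python
import sympy as sp

x = sp.symbols('x')
f = sp.Function('f')(x)

X1 = lambda g: sp.diff(g, x)            # X1 = d/dx
X2 = lambda g: x * sp.diff(g, x)        # X2 = x d/dx
X3 = lambda g: x**2 * sp.diff(g, x)     # X3 = x^2 d/dx

def bracket(A, B, g):
    """Lie bracket of vector fields applied to g: [A, B]g = A(Bg) - B(Ag)."""
    return sp.expand(A(B(g)) - B(A(g)))

assert sp.simplify(bracket(X1, X2, f) - X1(f)) == 0        # [X1, X2] = X1
assert sp.simplify(bracket(X1, X3, f) - 2 * X2(f)) == 0    # [X1, X3] = 2 X2
assert sp.simplify(bracket(X2, X3, f) - X3(f)) == 0        # [X2, X3] = X3
```

These are the defining relations of a Lie algebra isomorphic to $\mathfrak{sl}(2,\mathbb{R})$, confirming the claim in the example.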

2.3.1. Automorphic Lie Systems

The general solution of a Lie system on N with a VG Lie algebra, V, can be obtained from a single particular solution of a Lie system on a Lie group G whose Lie algebra is isomorphic to V, a so-called automorphic Lie system ([5], §1.4). As the automorphic Lie system notion is going to be central in our paper, let us study it in some detail (see [5] for details).
Definition 2.
An automorphic Lie system is a t-dependent system of first-order differential equations on a Lie group G of the form
$$\frac{dg}{dt} = \sum_{\alpha=1}^{r} b_\alpha(t)\, X_\alpha^R(g), \qquad g \in G, \quad t \in \mathbb{R}, \tag{10}$$
where $\{X_1^R, \ldots, X_r^R\}$ is a basis of the space of right-invariant vector fields on G and $b_1(t), \ldots, b_r(t)$ are arbitrary t-dependent functions. Furthermore, we shall refer to the right-hand side of Equation (10) as $\hat{X}_G^R(t,g)$, i.e., $\hat{X}_G^R(t,g) = \sum_{\alpha=1}^{r} b_\alpha(t) X_\alpha^R(g)$.
Due to the right-invariance of the vector fields $X_\alpha^R$, systems of the form (10) have the following important property.
Proposition 1.
(See [5], §1.3). Given a Lie group G and a particular solution g(t) of the Lie system defined on G by
$$\frac{dg}{dt} = \sum_{\alpha=1}^{r} b_\alpha(t)\, X_\alpha^R(g) = \hat{X}_G^R(t,g), \tag{11}$$
where $b_1(t), \ldots, b_r(t)$ are arbitrary t-dependent functions and $X_1^R, \ldots, X_r^R$ are right-invariant vector fields, we have that $g(t)h$ is also a solution of (11) for each $h \in G$.
An immediate consequence of Proposition 1 is that, once we know a particular solution of $\hat{X}_G^R$, any other solution can be obtained simply by multiplying the known solution on the right by any element in G. More concretely, if we know a solution g(t) of (11), then the solution h(t) of (11) with initial condition $h(0) = g(0)h_0$ can be expressed as $h(t) = g(t)h_0$. This justifies that henceforth we only worry about finding one particular solution g(t) of $\hat{X}_G^R$, e.g., the one that fulfills $g(0) = e$. The previous result can be understood in terms of the Lie theorem or via superposition rules. In fact, since (11) admits a superposition rule $\Phi : (g,h) \in G \times G \mapsto gh \in G$, the system (11) must be a Lie system. Alternatively, the same result follows from the Lie theorem and the fact that the right-invariant vector fields on G span a finite-dimensional Lie algebra of vector fields.
There are several reasons to study automorphic Lie systems. One is that they can be locally written around the neutral element of their Lie group in the form
$$\frac{dA}{dt} = B(t)A, \qquad A \in GL(n,\mathbb{K}), \quad B(t) \in M_n(\mathbb{K}),$$
where $M_n(\mathbb{K})$ is the set of $n \times n$ matrices with coefficients in $\mathbb{K}$, for every $t \in \mathbb{R}$.
The main reason to study automorphic Lie systems is given by the following results, which show how they can be used to solve any Lie system on a manifold. Let us start with a Lie system X defined on N. Hence, X can be written as
$$\frac{dx}{dt} = \sum_{\alpha=1}^{r} b_\alpha(t)\, X_\alpha,$$
for certain t-dependent functions $b_1(t), \ldots, b_r(t)$ and vector fields $X_1, \ldots, X_r \in \mathfrak{X}(N)$ that generate an r-dimensional VG Lie algebra V. The VG Lie algebra V is always isomorphic to the Lie algebra $\mathfrak{g}$ of a certain Lie group G. The VG Lie algebra spanned by $X_1, \ldots, X_r$ gives rise to a (local) Lie group action $\varphi : G \times N \to N$ whose fundamental vector fields are those of V. In particular, there exists a basis $\{v_1, \ldots, v_r\}$ of $\mathfrak{g}$ so that
$$\left.\frac{d}{dt}\right|_{t=0} \varphi(\exp(t v_\alpha), x) = X_\alpha(x), \qquad \alpha = 1, \ldots, r.$$
In other words, $\varphi_\alpha : (t,x) \in \mathbb{R} \times N \mapsto \varphi(\exp(t v_\alpha), x) \in N$ is the flow of the vector field $X_\alpha$ for $\alpha = 1, \ldots, r$. Note that if $[X_\alpha, X_\beta] = \sum_{\gamma=1}^{r} c_{\alpha\beta}^{\gamma} X_\gamma$ for $\alpha, \beta = 1, \ldots, r$, then $[v_\alpha, v_\beta] = -\sum_{\gamma=1}^{r} c_{\alpha\beta}^{\gamma} v_\gamma$ for $\alpha, \beta = 1, \ldots, r$ (cf. [1]).
To determine the exact form of the Lie group action $\varphi : G \times N \to N$ as in (4), we impose
$$\varphi(\exp(\lambda_\alpha v_\alpha), x) = \varphi_\alpha(\lambda_\alpha, x), \qquad \alpha = 1, \ldots, r, \quad x \in N, \tag{13}$$
where $\lambda_1, \ldots, \lambda_r \in \mathbb{R}$. If we stay in a neighborhood U of the origin of G, where every element $g \in U$ can be written in the form
$$g = \exp(\lambda_1 v_1) \cdots \exp(\lambda_r v_r),$$
then the relations (13) and the properties of φ allow us to determine φ on U. If we fix $x \in N$, the right-hand side of the equality turns into an integral curve of the vector field $X_\alpha$; this is why (13) holds.
Proposition 2.
(See [1,5] for details). Let g(t) be a solution to the system
$$\frac{dg}{dt} = \sum_{\alpha=1}^{r} b_\alpha(t)\, X_\alpha^R(g), \qquad t \in \mathbb{R}, \quad g \in G.$$
Then, $x(t) = \varphi(g(t), x_0)$ is a solution of $X = \sum_{\alpha=1}^{r} b_\alpha(t) X_\alpha$, where $x_0 \in N$. In particular, if one takes the solution g(t) that satisfies the initial condition $g(0) = e$, then x(t) is the solution of X such that $x(0) = x_0$.
Let us study a particularly relevant form of automorphic Lie systems that will be used hereafter. If $\mathfrak{g}$ is a finite-dimensional Lie algebra, then Ado's theorem [30] guarantees that $\mathfrak{g}$ is isomorphic to a matrix Lie algebra $\mathfrak{g}_M$. Let $V = \{M_1, \ldots, M_r\}$ be a basis of $\mathfrak{g}_M \subset M_n(\mathbb{R})$. As reviewed in Section 2.2, each $M_\alpha$ gives rise to a right-invariant vector field $X_\alpha^R(g) = M_\alpha g$, with $g \in G$, on G. These vector fields have the opposite commutation relations to the (matrix) elements of the basis.
In the case of matrix Lie groups, the system (11) takes a simpler form. Let Y(t) be the matrix associated with the element $g(t) \in G$. Using the right-invariance property of each $X_\alpha^R$, we have
$$\frac{dY}{dt} = \sum_{\alpha=1}^{r} b_\alpha(t)\, X_\alpha^R(Y(t)) = \sum_{\alpha=1}^{r} b_\alpha(t)\, R_{Y(t)*,e} X_\alpha^R(e) = \sum_{\alpha=1}^{r} b_\alpha(t)\, R_{Y(t)*,e}(M_\alpha).$$
We can write the last term as
$$\sum_{\alpha=1}^{r} b_\alpha(t)\, R_{Y(t)*,e}(M_\alpha) = \sum_{\alpha=1}^{r} b_\alpha(t)\, M_\alpha Y(t),$$
in such a way that, for matrix Lie groups, the system on the Lie group is
$$\frac{dY}{dt} = A(t)\,Y(t), \qquad Y(0) = I, \qquad \text{with} \quad A(t) = \sum_{\alpha=1}^{r} b_\alpha(t)\, M_\alpha, \tag{14}$$
where I is the identity matrix (which corresponds to the neutral element of the matrix Lie group) and the matrices $M_\alpha$ span a finite-dimensional Lie algebra that is anti-isomorphic to the VG Lie algebra of the system (by anti-isomorphic we mean that the two Lie algebras have the same structure constants up to a sign).
There exist various methods to solve system (11) analytically ([6], §2.2), such as the Levi decomposition [35] or the theory of reduction of Lie systems ([4], Theorem 2). In some cases it is relatively easy to solve, for instance when $b_1, \ldots, b_r$ are constants; we will depict an example of this particular case in Section 4. Nonetheless, we are interested in a numerical approach, since we will try to solve the automorphic Lie system with adapted geometric integrators. The solutions on the Lie group can be straightforwardly translated into solutions of the Lie system defined on N via the Lie group action (2).
To finish this section, we will employ the previous developments in order to define our novel procedure to (geometrically) construct a continuous solution of a given Lie system.
The 7-step method: reduction procedure to an automorphic Lie system
The method can be itemized in the following seven steps:
1. We identify the VG Lie algebra of vector fields $X_1, \ldots, X_r$ that defines the Lie system on N.
2. We look for a matrix Lie algebra $\mathfrak{g}$ isomorphic to the VG Lie algebra, with a basis $\{M_1, \ldots, M_r\} \subset M_n(\mathbb{R})$ whose structure constants are those of $X_1, \ldots, X_r$ in absolute value, but with a negative sign.
3. We integrate the vector fields $X_1, \ldots, X_r$ to obtain their respective flows $\Phi_\alpha : \mathbb{R} \times N \to N$, with $\alpha = 1, \ldots, r$.
4. Using canonical coordinates of the second kind and the previous flows, we construct the Lie group action $\varphi : G \times N \to N$ using the expressions in (13).
5. We define an automorphic Lie system $\hat{X}_G^R$ on the Lie group G associated with $\mathfrak{g}$, as in (11).
6. We compute the solution of the system $\hat{X}_G^R$ that fulfills $g(0) = e$.
7. Finally, we retrieve the solution of X on N through the expression $x(t) = \varphi(g(t), x_0)$.
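The seven steps above can be sketched end-to-end on a concrete case. The following is a hypothetical worked example of ours (not a computation from the paper): the Riccati equation $dx/dt = 1 + x^2$, whose VG Lie algebra is $\mathfrak{sl}(2,\mathbb{R})$, its action on $\mathbb{R}$ taken to be the Möbius action (our choice of matrix basis), and, since the coefficients are constant, the solution on the group obtained with a single matrix exponential:

```python
import numpy as np
from scipy.linalg import expm

# Steps 1-2: matrix basis of sl(2, R) paired with X1, X2, X3 (our choice)
M1 = np.array([[0.0, 1.0], [0.0, 0.0]])    # paired with X1 = d/dx
M2 = np.array([[0.5, 0.0], [0.0, -0.5]])   # paired with X2 = x d/dx
M3 = np.array([[0.0, 0.0], [-1.0, 0.0]])   # paired with X3 = x^2 d/dx

# Steps 3-4: the resulting action of SL(2, R) on R is by Mobius maps
def action(Y, x0):
    return (Y[0, 0] * x0 + Y[0, 1]) / (Y[1, 0] * x0 + Y[1, 1])

# Step 5: automorphic Lie system dY/dt = A Y; for dx/dt = 1 + x^2 we have
# b1 = b3 = 1, b2 = 0, so A is the constant matrix M1 + M3.
A = M1 + M3
# Step 6: with constant coefficients, the solution with Y(0) = I is exp(tA).
# Step 7: x(t) = action(Y(t), x0); the exact solution is tan(t + arctan(x0)).
x0 = 0.3
for t in (0.2, 0.5, 1.0):
    Y = expm(t * A)
    assert abs(action(Y, x0) - np.tan(t + np.arctan(x0))) < 1e-10
```

With t-dependent coefficients $b_\alpha(t)$, step 6 no longer reduces to a single exponential, which is precisely where the Magnus and RKMK methods of Section 3 enter.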

3. Discretization of Lie Systems

This section adapts known numerical methods on Lie groups to automorphic Lie systems. For this purpose, we start by reviewing briefly some fundamentals on numerical methods for ordinary differential equations and Lie groups [36,37,38], and later focus on two specific numerical methods on Lie groups, the Magnus expansion and RKMK methods [8,23,24,25,26].
Recall that, in this paper, we focus on ordinary differential equations of the form
$$\frac{dx}{dt} = f(t,x), \qquad t \in [a,b], \quad x(t) \in N, \quad f \in \mathfrak{X}_t(N). \tag{15}$$
When N is a Euclidean space (or diffeomorphic to one), there is a plethora of numerical schemes approximating the analytic solution x(t) of (15) [36,37]. We will focus on one-step methods with fixed time step. By that we mean that solutions are approximated by a sequence of numbers $x_k \approx x(t_k) \in N$, with $t_k = a + kh$, $h = (b-a)/N$, $b > a$, and
$$\frac{x_{k+1} - x_k}{h} = f_h(t_k, x_k, x_{k+1}), \tag{16}$$
where N is the number of steps into which our time interval is divided. We call h the time step, which is fixed, while $f_h : \mathbb{R} \times N \times N \to TN$ is a discrete vector field, which (recall that, for now, we set N to be a Euclidean space with norm $\|\cdot\|$) is a given approximation of f in (15). As usual, we shall denote the local truncation error by $E_h$, where
$$E_h = \|x_{k+1} - x(t_{k+1})\|,$$
and say that the method is of order r if $E_h = O(h^{r+1})$ for $h \to 0$, i.e., $\lim_{h \to 0} |E_h / h^{r+1}| < \infty$. Regarding the global error
$$E_N = \|x_N - x(b)\|,$$
we shall say that the method is convergent of order r if $E_N = O(h^r)$ when $h \to 0$. As for the simulations, we pick the following norm in order to define the global error:
$$E_N = \max_{k=1,\ldots,N} \|x(t_k) - x_k\|.$$
Given the relevant examples in this paper, e.g., Riccati equations, where $N = \mathbb{R}^n$, we will employ classical methods to approximate (15), particularly the Heun method (convergent of order 2) and RK4 (convergent of order 4), and compare them with our novel discretization proposal.
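The convergence orders quoted above can be estimated empirically. The sketch below (our illustration; the test equation $dx/dt = 1 + x^2$ with exact solution $\tan t$ is our choice, not the paper's benchmark) measures how the global error of Heun and RK4 scales when the step size is halved:

```python
import numpy as np

def heun_step(f, t, x, h):
    k1 = f(t, x)
    k2 = f(t + h, x + h * k1)
    return x + h * (k1 + k2) / 2

def rk4_step(f, t, x, h):
    k1 = f(t, x)
    k2 = f(t + h / 2, x + h * k1 / 2)
    k3 = f(t + h / 2, x + h * k2 / 2)
    k4 = f(t + h, x + h * k3)
    return x + h * (k1 + 2 * k2 + 2 * k3 + k4) / 6

def global_error(step, n_steps, T=1.0):
    """Max-norm global error against the exact solution tan(t) on [0, T]."""
    f = lambda t, x: 1 + x ** 2
    h, x, t, err = T / n_steps, 0.0, 0.0, 0.0
    for _ in range(n_steps):
        x, t = step(f, t, x, h), t + h
        err = max(err, abs(x - np.tan(t)))
    return err

for step, order in ((heun_step, 2), (rk4_step, 4)):
    # halving h should divide the global error by about 2**order
    rate = np.log2(global_error(step, 100) / global_error(step, 200))
    assert abs(rate - order) < 0.3
```

The measured rates approach 2 and 4, matching the convergence orders stated above.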

3.1. Numerical Methods on Matrix Lie Groups

Our purpose is to numerically solve the initial value problem for system (14), defined on a matrix Lie group G, of the form
$$\frac{dY}{dt} = A(t)\,Y, \qquad Y(0) = I, \tag{18}$$
where $Y \in G$, $A(t) \in \mathfrak{g} \simeq T_eG$ is a given t-dependent matrix, and I is the identity matrix in G. That is, we are searching for a discrete sequence $\{Y_k\}_{k=0,\ldots,N}$ such that $Y_k \in G$. In a neighborhood of zero in $T_eG$, the exponential map defines a diffeomorphism onto an open neighborhood of the neutral element of G, and the problem is equivalent to searching for a curve $\Omega(t)$ in $\mathfrak{g}$ such that
$$Y(t) = \exp(\Omega(t)).$$
This ansatz helps us to transform (18), which is defined in a nonlinear space, into a new problem in a linear space, namely the Lie algebra g T e G . This is expressed in the classical result by Magnus [39].
Theorem 2.
(Magnus, 1954). The solution of the system (18) on the matrix Lie group G can be written, for values of t close enough to zero, as Y ( t ) = exp ( Ω ( t ) ) , where Ω ( t ) is the solution of the initial value problem
d Ω d t = dexp Ω ( t ) 1 ( A ( t ) ) , Ω ( 0 ) = 0 ,
where 0 is the zero element in T e G .
When dealing with matrix Lie groups and Lie algebras, the operator dexp Ω − 1 is given by
dexp Ω 1 ( H ) = j = 0 B j j ! ad Ω j ( H ) ,
where the { B j } j = 0 , , are the Bernoulli numbers and ad Ω ( H ) = [ Ω , H ] = Ω H H Ω . The convergence of the series (21) is ensured as long as a certain convergence condition is satisfied [39].
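The truncated dexp⁻¹ series can be sketched as follows for 2 × 2 matrices, using exact rational arithmetic and the Bernoulli numbers B₀, …, B₄ with the convention B₁ = −1/2 as they enter this series (an illustrative helper with hypothetical names, not the implementation used in the paper):

```python
from fractions import Fraction

def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def mat_add(A, B, s=1):
    return [[A[i][j] + s * B[i][j] for j in range(2)] for i in range(2)]

def comm(A, B):
    # ad_A(B) = [A, B] = AB - BA
    return mat_add(mat_mul(A, B), mat_mul(B, A), s=-1)

# Bernoulli numbers B_0..B_4 with B_1 = -1/2, the convention used in the series.
BERNOULLI = [Fraction(1), Fraction(-1, 2), Fraction(1, 6), Fraction(0), Fraction(-1, 30)]

def dexpinv(Omega, H, order):
    # Truncation of dexp^{-1}_Omega(H) = sum_{j>=0} (B_j / j!) ad_Omega^j (H); order <= 4.
    result = [[Fraction(0)] * 2 for _ in range(2)]
    ad = [row[:] for row in H]  # ad_Omega^0 (H) = H
    fact = 1
    for j in range(order + 1):
        if j > 0:
            fact *= j
            ad = comm(Omega, ad)
        coeff = BERNOULLI[j] / fact
        result = mat_add(result, [[coeff * a for a in row] for row in ad])
    return result
```

For nilpotent arguments the iterated commutators vanish quickly, so low truncation orders are already exact in such cases.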
If we try to integrate (20) by applying a numerical method directly (note that, now, we could employ one-step methods (16) safely), Ω ( t ) might sometimes drift too far away from the origin and the exponential map would no longer work. This would be a problem, since we are assuming that Ω ( t ) stays in a neighborhood of the origin of g where the exponential map defines a local diffeomorphism with the Lie group. Since we still do not know how to characterize this neighborhood, it is necessary to adopt a strategy that allows us to solve (20) sufficiently close to the origin. The remedy is to change the coordinate system at each iteration of the numerical method. In the next lines we explain how this is achieved.
Consider now the restriction of the exponential map given by
exp : U g g exp ( U g ) G , A exp ( A )
so that this map establishes a diffeomorphism between an open neighborhood U g of the origin in g and its image. Since the elements of the matrix Lie group are invertible matrices, the map A ∈ U g ↦ exp ( A ) Y 0 from U g ⊂ g to the set
exp ( U g ) Y 0 = { Y ∈ G : Y = exp ( A ) Y 0 for some A ∈ U g }
is also a diffeomorphism. This map gives rise to the so-called first-order canonical coordinates centered at Y 0 .
As is well-known, the solutions of (20) are curves in g whose images by the exponential map are solutions to (18). In particular, the solution Ω^(0) ( t ) of system (20) such that Ω^(0) ( 0 ) is the zero matrix in T I G , namely 0 , corresponds with the solution Y^(e) ( t ) of the system on G such that Y^(e) ( 0 ) = I . Now, for a certain t = t k , the solution Ω^(t k) ( t ) in g such that Ω^(t k) ( t k ) = 0 corresponds with Y^(e) ( t ) via first-order canonical coordinates centered at Y^(e) ( t k ) ∈ G , since
exp ( Ω ( t k ) ( t k ) ) Y ( e ) ( t k ) = exp ( 0 ) Y ( e ) ( t k ) = Y ( e ) ( t k ) ,
and the existence and uniqueness theorem guarantees exp ( Ω ( 0 ) ( t ) ) = exp ( Ω ( t k ) ( t ) ) Y ( e ) ( t k ) around t k . In this way, we can use the curve Ω ( t k ) ( t ) and the canonical coordinates centered on Y ( e ) ( t k ) to obtain values for the solution of (18) in the proximity of t = t k , instead of using Ω ( 0 ) ( t ) . Whilst the curve Ω ( 0 ) ( t ) could be far from the origin of coordinates for t k , we know that Ω ( t k ) ( t ) will be close, by definition. Applying this idea in each iteration of the numerical method, we are changing the curve in g to obtain the approximate solution of (18) while we stay near the origin (as long as the time step is small enough).
Thus, what is left is to define proper numerical methods for (20) whose solution, i.e., { Ω k } k = 0 , … , N , provides us, via the exponential map, with a numerical solution of (18) remaining in G. In other words, the general Lie group method defined this way [8,23] can be set by the recursion
Y k + 1 = e Ω k Y k .
Next, we introduce two relevant families of numerical methods providing { Ω k } k = 0 , , N .

3.1.1. The Magnus Method

Based on the work by Magnus, the Magnus method was introduced in [23,40]. The starting point of this method is to solve Equation (20) by means of the Picard procedure, which ensures that a certain sequence of functions converges to the solution of (20) in a small enough neighborhood. Operating, one obtains the Magnus expansion
Ω ( t ) = k = 0 H k ( t ) ,
where each H k ( t ) is a linear combination of iterated commutators. The first three terms are given by
H 0 ( t ) = ∫ 0 t A ( ξ 1 ) d ξ 1 , H 1 ( t ) = (1/2) ∫ 0 t [ ∫ 0 ξ 1 A ( ξ 2 ) d ξ 2 , A ( ξ 1 ) ] d ξ 1 , H 2 ( t ) = (1/12) ∫ 0 t [ ∫ 0 ξ 1 A ( ξ 2 ) d ξ 2 , [ ∫ 0 ξ 1 A ( ξ 2 ) d ξ 2 , A ( ξ 1 ) ] ] d ξ 1 + (1/4) ∫ 0 t [ ∫ 0 ξ 1 [ ∫ 0 ξ 2 A ( ξ 3 ) d ξ 3 , A ( ξ 2 ) ] d ξ 2 , A ( ξ 1 ) ] d ξ 1 .
Note that the Magnus expansion (23) converges absolutely in a given norm for every t ≥ 0 such that ([8], p. 48)
∫ 0 t ‖ A ( ξ ) ‖ d ξ ≤ ∫ 0 2 π d ξ / ( 4 + ξ [ 1 − cot ( ξ / 2 ) ] ) ≈ 1.086868702 .
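The numerical value of this bound can be checked by quadrature. The sketch below approximates the integral with a composite midpoint rule, which conveniently avoids evaluating the cotangent at the endpoints (where the integrand has finite limits 1/2 and 0):

```python
import math

def integrand(x):
    # 1 / (4 + x (1 - cot(x/2))): finite limits 1/2 at x -> 0 and 0 at x -> 2*pi.
    return 1.0 / (4.0 + x * (1.0 - 1.0 / math.tan(x / 2.0)))

# Composite midpoint rule on (0, 2*pi); midpoints avoid the endpoint evaluations.
n = 200000
h = 2.0 * math.pi / n
radius = h * sum(integrand((k + 0.5) * h) for k in range(n))
```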
In practice, if we work with the Magnus expansion, we need a way to handle the infinite series and calculate the iterated integrals. Iserles and Nørsett proposed a method based on binary trees [23,40]. In ([8], §4.3), one can find a way to truncate the series so as to obtain the desired order of convergence. Similarly, ([8], §5) discusses in detail how the iterated integrals can be computed numerically. In our case, for practical reasons, we will implement the Magnus method following the guidelines of Blanes, Casas, and Ros [41], which are based on a Taylor expansion of A ( t ) in (18) around the point t = h / 2 (recall that, in the Lie group and Lie algebra equations, we set the initial time t 0 = a = 0 ). With this technique, one is able to achieve different orders of convergence. In particular, we will use the second- and fourth-order convergent methods ([41], §3.2), although one can build methods up to order eight.
The second-order approximation is
exp ( Ω ( h ) ) = exp ( h a 0 ) + O ( h 3 )
and the fourth-order one reads
exp ( Ω ( h ) ) = exp h a 0 + 1 12 h 3 a 2 1 12 h 3 [ a 0 , a 1 ] + O ( h 5 ) ,
where Ω ( 0 ) = 0 and
a i = (1 / i !) ( d i A / d t i ) | t = h / 2 , i = 0 , 1 , 2 .
As we see from the definition, the fourth-order method requires the first and second derivatives of the matrix A ( t ) . Applying the coordinate change (22) in each iteration, we can implement the methods through the following equations:
Y k + 1 = exp ( h A ( t k + h / 2 ) ) Y k . [Order 2]
Y k + 1 = exp ( h a 0 + h 3 ( a 2 − [ a 0 , a 1 ] ) ) Y k , t 1 / 2 = t k + h / 2 , a 0 = A ( t 1 / 2 ) , a 1 = Ȧ ( t 1 / 2 ) / 12 , a 2 = Ä ( t 1 / 2 ) / 24 , [Order 4]
where Ȧ ( t 0 ) , Ä ( t 0 ) stand for the first and second derivatives of A ( t ) with respect to t at t 0 . Note that the convergence order is defined for the Lie group dynamics (18). That is, when we say that the above methods are convergent of order 2, for instance, we mean E N = ‖ Y N − Y ( b ) ‖ = O ( h 2 ) , with h → 0 , for a proper matrix norm.
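A minimal implementation of the Magnus 2 and Magnus 4 steps might look as follows; here the derivatives of A(t) entering a₁ and a₂ are approximated by central finite differences, although they could equally be supplied analytically (helper names are ours, and the small-matrix routines are illustration only). The demo uses A(t) = [ 0 t ; −t 0 ], a family that commutes at different times, so the exact solution of (18) is the rotation by angle t²/2 and both schemes reproduce it:

```python
def mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def add(A, B, s=1.0):
    return [[A[i][j] + s * B[i][j] for j in range(len(A))] for i in range(len(A))]

def scale(A, s):
    return [[s * a for a in row] for row in A]

def expm(A, terms=20):
    # Plain Taylor series; adequate here because the arguments h*A are small.
    n = len(A)
    R = [[float(i == j) for j in range(n)] for i in range(n)]
    T = [row[:] for row in R]
    for k in range(1, terms):
        T = scale(mul(T, A), 1.0 / k)
        R = add(R, T)
    return R

def magnus2_step(A, t, Y, h):
    # Y_{k+1} = exp(h A(t_k + h/2)) Y_k   (order 2)
    return mul(expm(scale(A(t + h / 2), h)), Y)

def magnus4_step(A, t, Y, h, d=1e-5):
    # Order-4 step; a1 and a2 are built from central finite differences of A.
    tm = t + h / 2
    a0 = A(tm)
    a1 = scale(add(A(tm + d), A(tm - d), -1.0), 1.0 / (2 * d * 12))
    a2 = scale(add(add(A(tm + d), A(tm - d)), scale(A(tm), 2.0), -1.0), 1.0 / (d * d * 24))
    bracket = add(mul(a0, a1), mul(a1, a0), -1.0)   # [a0, a1]
    Omega = add(scale(a0, h), scale(add(a2, bracket, -1.0), h ** 3))
    return mul(expm(Omega), Y)

# Demo: the exact solution at t = 1 is the rotation by angle 1/2.
A_demo = lambda t: [[0.0, t], [-t, 0.0]]
Y = [[1.0, 0.0], [0.0, 1.0]]
t = 0.0
for _ in range(10):
    Y = magnus4_step(A_demo, t, Y, 0.1)
    t += 0.1
Y2 = magnus2_step(A_demo, 0.0, [[1.0, 0.0], [0.0, 1.0]], 1.0)  # single Magnus 2 step
```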

3.1.2. The Runge–Kutta–Munthe–Kaas Method

Changing the coordinate system in each step, as explained in previous sections, the classical RK methods applied to Lie groups give rise to the so-called Runge–Kutta–Munthe–Kaas (RKMK) methods [25,26]. The equations that implement the method are
Θ j = h ∑ l = 1 s a j l F l , F j = dexp Θ j − 1 ( A ( t k + c j h ) ) , j = 1 , … , s , Θ = h ∑ l = 1 s b l F l , Y k + 1 = exp ( Θ ) Y k ,
where the constants { a j l } j , l = 1 s , { b l } l = 1 s , { c j } j = 1 s can be obtained from a Butcher's table ([38], §11.8) (note that s is the number of stages of the underlying RK method). Apart from this, we have the consistency condition ∑ l = 1 s b l = 1 . As the equation that we want to solve comes in the shape of an infinite series, it is necessary to study how to evaluate the function dexp Ω ( t ) − 1 . For this, we need to truncate the series at a certain order in such a way that the order of convergence of the underlying classical RK method is preserved. If the classical RK method is of order p and the series of (20) is truncated at order j ≥ p − 2 , then the RKMK method is of order p (see [25,26] and ([42], Theorem 8.5, p. 124)). Again, this convergence order refers to the equation (18) in the Lie group.
Let us now determine the RKMK method associated with the explicit Runge–Kutta whose Butcher’s table is
0    |
1/2  | 1/2
1/2  | 0     1/2
1    | 0     0     1
     | 1/6   1/3   1/3   1/6
that is, a Runge–Kutta of order 4 (RK4). This implies that we need to truncate the series dexp Ω ( t ) 1 at j = 2 :
dexp Ω 1 ( A ) A 1 2 [ Ω , A ] + 1 12 [ Ω , [ Ω , A ] ] .
Then, the RKMK implementation for the given Butcher’s table is
F 1 = dexp 0 − 1 ( A ( t k ) ) , F 2 = dexp ( h / 2 ) F 1 − 1 ( A ( t k + h / 2 ) ) , F 3 = dexp ( h / 2 ) F 2 − 1 ( A ( t k + h / 2 ) ) , F 4 = dexp h F 3 − 1 ( A ( t k + h ) ) , Θ = ( h / 6 ) ( F 1 + 2 F 2 + 2 F 3 + F 4 ) , Y k + 1 = exp ( Θ ) Y k ,
where dexp 1 is (26).
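The stages above can be sketched in code; dexpinv2 below is the truncation (26) at j = 2, and dexp⁻¹ at the zero matrix reduces to the identity, so F₁ = A(t_k). This is an illustrative sketch with our own helper names, not the authors' code; the demo reuses the commuting family A(t) = [ 0 t ; −t 0 ], whose exact flow is the rotation by t²/2:

```python
def mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def add(A, B, s=1.0):
    return [[A[i][j] + s * B[i][j] for j in range(len(A))] for i in range(len(A))]

def scale(A, s):
    return [[s * a for a in row] for row in A]

def comm(A, B):
    return add(mul(A, B), mul(B, A), -1.0)

def expm(A, terms=20):
    n = len(A)
    R = [[float(i == j) for j in range(n)] for i in range(n)]
    T = [row[:] for row in R]
    for k in range(1, terms):
        T = scale(mul(T, A), 1.0 / k)
        R = add(R, T)
    return R

def dexpinv2(Omega, H):
    # Truncation at j = 2: H - 1/2 [Omega, H] + 1/12 [Omega, [Omega, H]].
    c1 = comm(Omega, H)
    return add(add(H, scale(c1, 0.5), -1.0), scale(comm(Omega, c1), 1.0 / 12.0))

def rkmk4_step(A, t, Y, h):
    # One step of the RKMK method built on the classical RK4 Butcher table.
    zero = scale(Y, 0.0)
    F1 = dexpinv2(zero, A(t))
    F2 = dexpinv2(scale(F1, h / 2), A(t + h / 2))
    F3 = dexpinv2(scale(F2, h / 2), A(t + h / 2))
    F4 = dexpinv2(scale(F3, h), A(t + h))
    Theta = scale(add(add(F1, F4), scale(add(F2, F3), 2.0)), h / 6.0)
    return mul(expm(Theta), Y)

# Demo: ten steps of size 0.1 should reproduce the rotation by angle 1/2.
A_demo = lambda t: [[0.0, t], [-t, 0.0]]
Y = [[1.0, 0.0], [0.0, 1.0]]
t = 0.0
for _ in range(10):
    Y = rkmk4_step(A_demo, t, Y, 0.1)
    t += 0.1
```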
It is interesting to note that the method obtained in the previous section using the Magnus expansion (24) can be retrieved by an RKMK method associated with the following Butcher’s table:
0    |
1/2  | 1/2
     | 0     1
Since it is an order 2 method, for the computation of dexp 1 , one can use dexp Ω 1 ( A ) A .

3.2. Numerical Methods for Lie Systems

So far, we have established in the procedure of Section 2.3.1 how to construct an analytical solution of a Lie system on a manifold N via a Lie group action on N, which is obtained by means of the integration of the VG Lie algebra of the Lie system. On the other hand, in Section 3.1 we have reviewed some methods in the literature providing a numerical approximation of the solution of (18) remaining in the Lie group G (which accounts for their most remarkable geometrical property).
Now, let us explain how we combine these two elements to construct our new numerical methods, so we retrieve the solution of (12) on N. Let φ be the Lie group action (13) and consider the solution of the system (18) such that Y ( 0 ) = I . This solution permits us to retrieve the solution on N of (12) for small values of t, i.e., when a solution Y ( t ) of (18) stays close to the neutral element and hence the Lie group action φ is properly defined. Numerically, we have shown that the solutions of (18) can be provided through the approximations of (21), say { Ω k } k = 0 , , N , and (22), as long as we stay close enough to the origin. As particular examples, we have picked the Magnus and RKMK methods in order to obtain { Ω k } k = 0 , , N and, furthermore, the sequence { Y k } k = 0 , , N . Next, we establish the scheme providing the numerical solution to Lie systems.
Definition 3.
Let us consider a Lie system evolving on a manifold N of the form
d x d t = α = 1 r b α ( t ) X α ( x ) , x ( a ) = x 0 ,
and let
d Y d t = A ( t ) Y , A ( t ) = α = 1 r b α ( t ) M α ,
be its associated automorphic Lie system. We define the numerical solution to the Lie system, i.e., { x k } k = 0 , , N , via Algorithm 1.
Algorithm 1 Lie systems method
1: Initial data: N , h , x 0 , A ( t ) , Y 0 = I , Ω 0 = 0 .
2: Numerically solve d Ω / d t = dexp Ω − 1 ( A ( t ) )
3: Output: { Ω k } k = 1 , … , N
4: for k = 1 , … , N − 1 do
     Y k + 1 = e Ω k Y k , x k + 1 = φ ( Y k + 1 , x k )
5: end for
6: Output: ( x 1 , x 2 , … , x N ) .
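Anticipating the constant-coefficient Riccati example of Section 4.1.1 (dx/dt = 1 + 2x + x², automorphic system with A = [ 1 1 ; −1 −1 ], action by homographies), the algorithm can be sketched as follows. Note that, by the action property, pushing the step factor exp(Ω_k) to x_k is equivalent to acting with the accumulated Y_{k+1} on x₀; since A is nilpotent here, exp(hA) = I + hA and every step is exact up to rounding. A sketch under these assumptions, with hypothetical function names:

```python
def expm2(A, terms=20):
    # Taylor-series exponential for 2x2 matrices (sufficient for small h*A).
    R = [[1.0, 0.0], [0.0, 1.0]]
    T = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(1, terms):
        T = [[sum(T[i][l] * A[l][j] for l in range(2)) / k for j in range(2)] for i in range(2)]
        R = [[R[i][j] + T[i][j] for j in range(2)] for i in range(2)]
    return R

def phi(Y, x):
    # Lie group action of SL(2, R) on R by homographies.
    return (Y[0][0] * x + Y[0][1]) / (Y[1][0] * x + Y[1][1])

def lie_system_method(A, x0, t0, h, n):
    # Magnus 2 on the group: Omega_k = h A(t_k + h/2); the step factor
    # exp(Omega_k) is pushed down to the manifold through the action.
    x, t = x0, t0
    for _ in range(n):
        S = expm2([[h * a for a in row] for row in A(t + h / 2)])
        x = phi(S, x)
        t += h
    return x

# dx/dt = 1 + 2x + x^2: the automorphic system has the constant nilpotent
# matrix A = [1 1; -1 -1], so exp(hA) = I + hA and the scheme is exact here.
A_const = lambda t: [[1.0, 1.0], [-1.0, -1.0]]
x_half = lie_system_method(A_const, 0.0, 0.0, 0.005, 100)  # exact value: x(0.5) = 1
```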
At this point, we would like to highlight an interesting geometric feature of this method. On the one hand, the discretization is based on the numerical solution of the automorphic Lie system underlying the Lie system, which is itself founded upon the geometric structure of the latter. This numerical solution remains on G, i.e., Y k ∈ G for all k, due to the particular design of the Lie group methods (as long as h is small). Given this, our construction also respects the geometric structure of the Lie system, which evolves on the manifold N. We observe that the iteration
x k + 1 = φ ( Y k + 1 , x k )
leads to this preservation, since x k + 1 N as long as Y k + 1 G and x k N (we recall that φ : G × N N ). Note as well that the direct application of a one-step method (16) on a general Lie system (12) would destroy this structure.
For future reference, with regard to the Lie group methods (22), we shall refer to (24) as Magnus 2, to (25) as Magnus 4, and to (27) simply as RKMK (we recall that this method is convergent of order 4).

4. Application to SL ( n , R )

4.1. SL( 2 , R ) and the Riccati Equation

Let us recall the first-order Riccati equation over the real line R . A comprehensive description of the physical applications of this equation can be found in [2]. The Riccati equation reads
d x d t = b 0 ( t ) + b 1 ( t ) x + b 2 ( t ) x 2 ,
where b 0 ( t ) , b 1 ( t ) , b 2 ( t ) are arbitrary t-dependent functions. The associated t-dependent vector field is X = b 0 ( t ) X 0 + b 1 ( t ) X 1 + b 2 ( t ) X 2 , where
X 0 = x , X 1 = x x , X 2 = x 2 x
and whose commutators are
[ X 0 , X 1 ] = X 0 , [ X 0 , X 2 ] = 2 X 1 , [ X 1 , X 2 ] = X 2 .
This proves that the Riccati equation is a Lie system related to a VG Lie algebra isomorphic to sl ( 2 , R ) . Thus, we employ the 7-step method in Section 2.3.1 to study its solutions. We choose the basis V = { M 0 , M 1 , M 2 } of sl ( 2 , R ) to integrate the VG Lie algebra to a Lie group action of SL ( 2 , R ) on R . In more detail,
M 0 = [ 0 1 ; 0 0 ] , M 1 = (1/2) [ 1 0 ; 0 −1 ] , M 2 = [ 0 0 ; −1 0 ] .
Note that
[ M 0 , M 1 ] = − M 0 , [ M 0 , M 2 ] = − 2 M 1 , [ M 1 , M 2 ] = − M 2 ,
that is, the structure constants are opposite to those of X 0 , X 1 , X 2 , as corresponds to the fundamental vector fields of a Lie group action.
We obtain the flows of the vector fields X 0 , X 1 , and X 2 by integrating them in terms of the real parameters λ 0 , λ 1 , and λ 2 , respectively. Indeed, these flows read
Φ 0 ( λ 0 , x 0 ) = λ 0 + x 0 , Φ 1 ( λ 1 , x 0 ) = x 0 e λ 1 , Φ 2 ( λ 2 , x 0 ) = x 0 1 λ 2 x 0 ,
respectively. Using canonical coordinates of the second kind, we can write Y ∈ SL ( 2 , R ) near the neutral element as
Y = exp ( λ 0 M 0 ) exp ( λ 1 M 1 ) exp ( λ 2 M 2 ) .
We define the Lie group action φ : SL ( 2 , R ) × R R through the equations
φ ( exp ( λ i M i ) , x ) = Φ i ( λ i , x ) i = 0 , 1 , 2 .
Calculating the three exponentials in (30) and comparing the result with an arbitrary element Y ∈ SL ( 2 , R ) whose entries satisfy α δ − β γ = 1 , we have
Y = [ α β ; γ δ ] = [ e λ 1 / 2 − λ 0 λ 2 e − λ 1 / 2 , λ 0 e − λ 1 / 2 ; − λ 2 e − λ 1 / 2 , e − λ 1 / 2 ] ,
from where the parameters ( λ 0 , λ 1 , λ 2 ) read
λ 0 = β / δ , λ 1 = − 2 log δ , λ 2 = − γ / δ .
The action is obtained as
φ ( Y , x 0 ) = φ ( exp ( λ 0 M 0 ) · exp ( λ 1 M 1 ) · exp ( λ 2 M 2 ) , x 0 ) = Φ 0 ( λ 0 , Φ 1 ( λ 1 , Φ 2 ( λ 2 , x 0 ) ) ) ,
and substituting the flows,
φ ( Y , x 0 ) = λ 0 + ( x 0 / ( 1 − λ 2 x 0 ) ) e λ 1 .
Now, substituting the parameters (31) and bearing in mind that any Y ∈ SL ( 2 , R ) fulfills α δ − γ β = 1 , we reach the expression of the action, which results in a homography [43]
φ ( Y , x 0 ) = α x 0 + β γ x 0 + δ .
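The decomposition (30) and the parameters (31) can be verified numerically: starting from entries (α, β, γ) with δ fixed by the determinant condition, the three one-parameter exponentials rebuild Y. A short sketch using the basis matrices M₀, M₁, M₂ above (the sample values are illustrative; δ must be positive so that log δ exists):

```python
import math

def mul2(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

# An element of SL(2, R) near the identity with delta > 0 (illustrative values).
alpha, beta, gamma = 1.2, 0.3, -0.4
delta = (1.0 + beta * gamma) / alpha   # enforces alpha*delta - beta*gamma = 1

# Parameters (31) of the second-kind canonical coordinates.
l0 = beta / delta
l1 = -2.0 * math.log(delta)
l2 = -gamma / delta

E0 = [[1.0, l0], [0.0, 1.0]]                               # exp(l0 M0)
E1 = [[math.exp(l1 / 2), 0.0], [0.0, math.exp(-l1 / 2)]]   # exp(l1 M1)
E2 = [[1.0, 0.0], [-l2, 1.0]]                              # exp(l2 M2)

Y = mul2(mul2(E0, E1), E2)   # should reproduce [alpha, beta; gamma, delta]
```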

4.1.1. Exact Solution

It is interesting to note that if the t-dependent coefficients of the Lie system are constant, the matrix A associated with the linear system on the Lie group is t-independent and the solution of the automorphic Lie system can be easily retrieved.
For example, consider the Riccati equation with constant coefficients
d x d t = 1 + 2 x + x 2 ,
obtained by assuming b 0 ( t ) = 1 , b 1 ( t ) = 2 and b 2 ( t ) = 1 in (28). The system on the group (14) associated with this Riccati equation reads
d Y d t = A Y , Y ( t ) = y 11 ( t ) y 12 ( t ) y 21 ( t ) y 22 ( t ) SL ( 2 , R ) , Y ( 0 ) = I 2 ,
where I 2 is the identity 2 × 2 matrix and A ( t ) is
A = ∑ i = 0 2 b i M i = [ 1 1 ; −1 −1 ] .
If we write (33) in the canonical form
d y 11 / d t = y 11 + y 21 , d y 12 / d t = y 12 + y 22 , d y 21 / d t = − y 11 − y 21 , d y 22 / d t = − y 12 − y 22 ,
or equivalently, d y / d t = Σ y , where
y = ( y 11 , y 12 , y 21 , y 22 ) T , Σ = [ 1 0 1 0 ; 0 1 0 1 ; −1 0 −1 0 ; 0 −1 0 −1 ] , y ( 0 ) = ( 1 , 0 , 0 , 1 ) T ,
the solution of the system reads
y ( t ) = exp 0 t Σ ( τ ) d τ y ( 0 ) = exp t Σ y ( 0 ) .
Observe that the matrix Σ is constant, so the integration is trivial. Furthermore, since Σ is nilpotent ( Σ 2 = 0 ), the exponential series truncates after the linear term. In this way, we obtain the solution:
y ( t ) = ( t + 1 , t , − t , 1 − t ) T ⟹ Y ( t ) = [ t + 1 , t ; − t , 1 − t ] .
Applying the Lie group action, we retrieve the solution of the original system:
x ( t ) = φ ( Y ( t ) , x 0 ) = ( ( t + 1 ) x 0 + t ) / ( 1 − t − t x 0 ) ,
where x 0 is the initial condition.

4.1.2. Numerical Example

Let us now put into practice the numerical methods proposed in Definition 3. For this matter, we consider
d x / d t = 2 t − x / t + x 2 / t 3 , t ≥ 1 .
This is another Riccati equation, now with t-dependent coefficients b 0 ( t ) = 2 t , b 1 ( t ) = − 1 / t , and b 2 ( t ) = 1 / t 3 . Its solution is
x ( t ) = ( 2 t 3 − 2 t 2 ) / ( 2 t − 1 )
for the initial condition x ( 1 ) = 0 .
In Figure 1, we show how the described numerical methods approximate the exact solution (35) in the interval [ 1 , 10 ] taking different time steps and employing Magnus 2, Magnus 4, and RKMK as underlying methods in the Lie group.
In Figure 2, we show convergence plots. To make a proper comparison, we include two classical numerical schemes, Heun (order 2) and RK4 (order 4), applied directly to (34). As is apparent, the slopes of the convergence lines are two and four, which shows that the order of convergence of the numerical methods on the underlying Lie group is transmitted to the manifold in this particular example. This transmission can be easily understood in terms of the local truncation error of the underlying Lie group method and the particular form of the analytical solution we obtain, i.e., (32). Namely, applying an order p Lie group method in this particular example means that α k + 1 = α ( t k + 1 ) + O ( h p 1 + 1 ) , β k + 1 = β ( t k + 1 ) + O ( h p 2 + 1 ) , γ k + 1 = γ ( t k + 1 ) + O ( h p 3 + 1 ) , δ k + 1 = δ ( t k + 1 ) + O ( h p 4 + 1 ) , where p = min { p 1 , p 2 , p 3 , p 4 } . Naturally, α , β , γ , δ are the components of the SL ( 2 , R ) matrix we are dealing with. Taking this into account, together with the analytical expression (32) and the definition of the local truncation error introduced in (17), it is straightforward to see that E h = O ( h p + 1 ) and, consequently, it is expected that the convergence order of the Lie group method is transmitted to the manifold.
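This example can be reproduced with a short script. With the basis M₀, M₁, M₂ above, the automorphic system has A(t) = b₀M₀ + b₁M₁ + b₂M₂ = [ b₁/2, b₀ ; −b₂, −b₁/2 ], and each Magnus 2 step is pushed to the manifold through the homography action. The sketch below (our own helper names, not the paper's code) checks the observed second-order convergence at t = 2:

```python
def expm2(A, terms=25):
    # Taylor-series matrix exponential, enough for small arguments h*A.
    R = [[1.0, 0.0], [0.0, 1.0]]
    T = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(1, terms):
        T = [[sum(T[i][l] * A[l][j] for l in range(2)) / k for j in range(2)] for i in range(2)]
        R = [[R[i][j] + T[i][j] for j in range(2)] for i in range(2)]
    return R

def A(t):
    # A(t) = b0 M0 + b1 M1 + b2 M2 = [b1/2, b0; -b2, -b1/2] for
    # b0 = 2t, b1 = -1/t, b2 = 1/t^3 (a traceless curve in sl(2, R)).
    return [[-0.5 / t, 2.0 * t], [-1.0 / t ** 3, 0.5 / t]]

def magnus2(x0, t0, t1, n):
    # Lie systems method: Magnus 2 on the group + homography action on R.
    h = (t1 - t0) / n
    x, t = x0, t0
    for _ in range(n):
        S = expm2([[h * a for a in row] for row in A(t + h / 2)])
        x = (S[0][0] * x + S[0][1]) / (S[1][0] * x + S[1][1])
        t += h
    return x

x_exact = 8.0 / 3.0   # x(2) = (2*8 - 2*4)/(2*2 - 1)
err50 = abs(magnus2(0.0, 1.0, 2.0, 50) - x_exact)
err100 = abs(magnus2(0.0, 1.0, 2.0, 100) - x_exact)
ratio = err50 / err100   # close to 4: order 2 is transmitted to the manifold
```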

4.2. SL( 3 , R ) and Matrix Riccati Equations

A general matrix Riccati equation [44] has the following form:
d Γ d t = G 1 ( t ) + G 2 ( t ) Γ + Γ G 3 ( t ) + Γ G 4 ( t ) Γ ,
where Γ , G 1 ( t ) M n × m ( R ) , G 2 ( t ) M n × n ( R ) , G 3 ( t ) M m × m ( R ) , G 4 ( t ) M m × n ( R ) . The case that matters to us is n = 2 , m = 1 , for which the matrix Riccati equation has a VG Lie algebra isomorphic to sl ( 3 , R ) . Then, Equation (36) takes the form
x ˙ y ˙ = g 1 ( t ) g 2 ( t ) + g 3 ( t ) g 4 ( t ) g 5 ( t ) g 6 ( t ) x y + x y g 7 ( t ) + x y g 8 ( t ) g 9 ( t ) x y ,
where g 1 ( t ) , , g 9 ( t ) are arbitrary functions of time. Equivalently, we can write the previous matrix equation as
d x d t = g 1 ( t ) + g 3 ( t ) + g 7 ( t ) x + g 4 ( t ) y + g 8 ( t ) x 2 + g 9 ( t ) x y , d y d t = g 2 ( t ) + g 5 ( t ) x + g 6 ( t ) + g 7 ( t ) y + g 8 ( t ) x y + g 9 ( t ) y 2 .
The t-dependent vector field associated with this system can be written as
X = g 1 ( t ) X 1 + g 2 ( t ) X 2 + g 3 ( t ) + g 7 ( t ) X 3 + g 6 ( t ) + g 7 ( t ) X 4 + g 4 ( t ) X 5 + g 5 ( t ) X 6 + g 8 ( t ) X 7 + g 9 ( t ) X 8 ,
where
X 1 = x , X 2 = y , X 3 = x x , X 4 = y y , X 5 = y x , X 6 = x y , X 7 = x 2 x + x y y , X 8 = x y x + y 2 y .
Note that X only really depends on eight t-dependent functions, since g 3 ( t ) , g 6 ( t ) , and g 7 ( t ) appear as linear combinations g 3 ( t ) + g 7 ( t ) and g 6 ( t ) + g 7 ( t ) . Let us list only the non-vanishing commutators for these vector fields:
[ X 1 , X 3 ] = X 1 , [ X 1 , X 6 ] = X 2 , [ X 1 , X 7 ] = 2 X 3 + X 4 , [ X 1 , X 8 ] = X 5 , [ X 2 , X 4 ] = X 2 , [ X 2 , X 5 ] = X 1 , [ X 2 , X 7 ] = X 6 , [ X 2 , X 8 ] = X 3 + 2 X 4 , [ X 3 , X 5 ] = − X 5 , [ X 3 , X 6 ] = X 6 , [ X 3 , X 7 ] = X 7 , [ X 4 , X 5 ] = X 5 , [ X 4 , X 6 ] = − X 6 , [ X 4 , X 8 ] = X 8 , [ X 5 , X 6 ] = X 4 − X 3 , [ X 5 , X 7 ] = X 8 , [ X 6 , X 8 ] = X 7 .
From this we conclude that (37) is a Lie system. Now, we choose a matrix basis for sl ( 3 , R ) :
M 1 = [ 0 0 0 ; 0 0 0 ; −1 0 0 ] , M 2 = [ 0 0 0 ; 0 0 0 ; 0 −1 0 ] , M 3 = (1/3) [ 2 0 0 ; 0 −1 0 ; 0 0 −1 ] , M 4 = (1/3) [ −1 0 0 ; 0 2 0 ; 0 0 −1 ] , M 5 = [ 0 0 0 ; 1 0 0 ; 0 0 0 ] , M 6 = [ 0 1 0 ; 0 0 0 ; 0 0 0 ] , M 7 = [ 0 0 1 ; 0 0 0 ; 0 0 0 ] , M 8 = [ 0 0 0 ; 0 0 1 ; 0 0 0 ] .
To integrate the VG Lie algebra of (38) to a Lie group action, we express the elements of the Lie group SL ( 3 , R ) in terms of canonical coordinates of the second-kind in the following way:
Y = y 11 y 12 y 13 y 21 y 22 y 23 y 31 y 32 y 33 = i = 1 8 exp ( λ i M i ) SL ( 3 , R ) ,
where λ 1 , , λ 8 R are real parameters univocally determined for each Y in an open neighborhood of the neutral element of SL ( 3 , R ) . The exponentials in the above expression can be calculated very easily and, by using their values in (40), it turns out that
y 11 = k 1 , y 12 = λ 6 k 1 , y 13 = ( λ 6 λ 8 + λ 7 ) k 1 , y 21 = λ 5 k 2 , y 22 = ( 1 + λ 5 λ 6 ) k 2 , y 23 = ( λ 5 λ 7 + λ 5 λ 6 λ 8 + λ 8 ) k 2 , y 31 = − λ 1 k 1 − λ 2 λ 5 k 2 , y 32 = − λ 2 ( 1 + λ 5 λ 6 ) k 2 − λ 1 λ 6 k 1 , y 33 = − λ 1 ( λ 6 λ 8 + λ 7 ) k 1 − λ 2 ( λ 5 λ 7 + λ 5 λ 6 λ 8 + λ 8 ) k 2 + e − ( λ 3 + λ 4 ) / 3 ,
where k 1 = e ( 2 λ 3 − λ 4 ) / 3 and k 2 = e ( 2 λ 4 − λ 3 ) / 3 . Rewriting some equalities in terms of others, i.e., y 31 = − λ 1 y 11 − λ 2 y 21 and y 32 = − λ 1 y 12 − λ 2 y 22 , we obtain a linear system from which we extract λ 1 and λ 2 . Operating with the remaining ones, we calculate the rest of the parameters:
λ 1 = ( y 22 y 31 − y 21 y 32 ) / ( y 12 y 21 − y 11 y 22 ) , λ 2 = ( y 11 y 32 − y 12 y 31 ) / ( y 12 y 21 − y 11 y 22 ) , e λ 3 = − y 11 ( y 12 y 21 − y 11 y 22 ) , e λ 4 = ( y 12 y 21 − y 11 y 22 ) 2 / y 11 , λ 5 = − y 11 y 21 / ( y 12 y 21 − y 11 y 22 ) , λ 6 = y 12 / y 11 , λ 7 = ( y 12 y 23 − y 13 y 22 ) / ( y 12 y 21 − y 11 y 22 ) , λ 8 = ( y 13 y 21 − y 11 y 23 ) / ( y 12 y 21 − y 11 y 22 ) .
Integrating the vector fields X 1 , , X 8 , we obtain their flows, Φ 1 , , Φ 8 , which in turn give us the action
Φ 1 ( λ 1 , ( x 0 , y 0 ) ) = ( λ 1 + x 0 , y 0 ) , Φ 2 ( λ 2 , ( x 0 , y 0 ) ) = ( x 0 , λ 2 + y 0 ) , Φ 3 ( λ 3 , ( x 0 , y 0 ) ) = ( x 0 e λ 3 , y 0 ) , Φ 4 ( λ 4 , ( x 0 , y 0 ) ) = ( x 0 , y 0 e λ 4 ) , Φ 5 ( λ 5 , ( x 0 , y 0 ) ) = ( x 0 + y 0 λ 5 , y 0 ) , Φ 6 ( λ 6 , ( x 0 , y 0 ) ) = ( x 0 , y 0 + x 0 λ 6 ) , Φ 7 ( λ 7 , ( x 0 , y 0 ) ) = x 0 1 x 0 λ 7 , y 0 1 x 0 λ 7 , Φ 8 ( λ 8 , ( x 0 , y 0 ) ) = x 0 1 y 0 λ 8 , y 0 1 y 0 λ 8 .
In view of (40), the composition of the flows Φ 1 Φ 2 Φ 8 allows us to obtain the complete action ( x , y ) = φ ( Y , ( x 0 , y 0 ) ) , with
x = [ ( x 0 ( 1 + λ 5 λ 6 ) + y 0 λ 5 ) / ( 1 − x 0 λ 7 − y 0 λ 8 ) ] e λ 3 + λ 1 , y = [ ( x 0 λ 6 + y 0 ) / ( 1 − x 0 λ 7 − y 0 λ 8 ) ] e λ 4 + λ 2 .
Operating with these expressions, we can rewrite the action through homographies as follows:
( x 0 , y 0 ) ↦ ( ( a 21 + a 22 x 0 + a 23 y 0 ) / ( a 11 + a 12 x 0 + a 13 y 0 ) , ( a 31 + a 32 x 0 + a 33 y 0 ) / ( a 11 + a 12 x 0 + a 13 y 0 ) ) ,
with coefficients
a 11 = 1 , a 12 = − λ 7 , a 13 = − λ 8 , a 21 = λ 1 , a 22 = ( 1 + λ 5 λ 6 ) e λ 3 − λ 1 λ 7 , a 23 = λ 5 e λ 3 − λ 1 λ 8 , a 31 = λ 2 , a 32 = λ 6 e λ 4 − λ 2 λ 7 , a 33 = e λ 4 − λ 2 λ 8 .

Numerical example

To illustrate our numerical methods once more, we take the following equation as an example:
d x / d t = 5 sin 10 t − x + y , d y / d t = 5 cos 10 t + x + y ,
which is a matrix Riccati Equation (37) with t-dependent functions
g 1 ( t ) = 5 sin 10 t , g 2 ( t ) = 5 cos 10 t , g 3 ( t ) + g 7 ( t ) = − 1 , g 4 ( t ) = 1 , g 5 ( t ) = 1 , g 6 ( t ) + g 7 ( t ) = 1 , g 8 ( t ) = g 9 ( t ) = 0 .
More exactly, it is an affine system of first-order differential equations. For the initial condition ( x ( 0 ) , y ( 0 ) ) = ( 1 , 0 ) , the solution of (44) is
x ( t ) = ( 157 / 102 ) cosh ( √2 t ) − ( 38 √2 / 51 ) sinh ( √2 t ) + ( 5 / 102 ) sin 10 t − ( 55 / 102 ) cos 10 t , y ( t ) = ( 27 √2 / 34 ) sinh ( √2 t ) + ( 5 / 102 ) cosh ( √2 t ) + ( 15 / 34 ) sin 10 t − ( 5 / 102 ) cos 10 t .
Figure 3 shows convergence plots.
In this case, one can see that, although our method is still convergent, the order in the Lie group is not transmitted to the manifold N (in both cases the slope of the convergence lines is about 1). Here, our method is not compared to Heun and RK4 applied directly to (44), but to an alternative scheme given by
x k + 1 = φ ( Y ˜ k + 1 , x k ) ,
where { Y ˜ k } k = 0 , … , N is the numerical solution of (18) when Heun and RK4 are applied to it (in Figure 3, they are referred to as Heun and RK4). Naturally, this implies that Y ˜ k ∉ G in general.
Our conjecture is that, in this case, the construction of the action changes the convergence of the method, which may be explained by the high nonlinearity appearing in the definition of the parameters (41)–(43). An interesting open question is whether there is a way to modify the methods according to the Lie group action so that the convergence is transmitted correctly. Another clue pointing in that direction is that, as can be easily seen in the plots, although the convergence rate is about the same for our method and (45), quantitatively the error of the former is lower. We consider this another (positive) geometric symptom since, apparently, the error worsens when the underlying Lie group structure is not preserved.

4.3. Generalization to SL( n , R )

The special linear Lie group plays an essential role in mechanical systems and integrable systems (see [19,27,45] and references therein). This is why we briefly detail a possible generalization of our proposed methods to SL( n , R ).
Recall that the Lie algebra sl ( n , R ) associated with the Lie group SL ( n , R ) has dimension n 2 − 1 . In fact, a matrix representation of sl ( n , R ) is given by the Lie algebra of n × n traceless matrices. For simplicity, we can choose a basis of sl ( n , R ) given by the n 2 − n matrices with a single nontrivial off-diagonal entry equal to one, together with n − 1 diagonal traceless matrices of the form
diag ( 1 , − 1 , 0 , … , 0 ) , diag ( 1 , 0 , − 1 , 0 , … , 0 ) , … , diag ( 1 , 0 , … , 0 , − 1 ) .
The total n 2 1 matrices are traceless and linearly independent. A Lie group action φ : SL ( n , R ) × R n 1 R n 1 can then be constructed via homographies as follows (cf. [44]):
x i = ( a i 0 + a i 1 x 1 0 + ⋯ + a i , n − 1 x n − 1 0 ) / ( a 00 + a 01 x 1 0 + ⋯ + a 0 , n − 1 x n − 1 0 ) , i = 1 , … , n − 1 ,
where ( x 1 , , x n 1 ) = φ ( Y , ( x 1 0 , , x n 1 0 ) ) , where Y SL ( n , R ) is
Y = a 00 a 01 a 02 a 0 , n 1 a 10 a 11 a 12 a 1 , n 1 a 20 a 21 a 22 a 2 , n 1 a n 1 , 0 a n 1 , 1 a n 1 , 2 a n 1 , n 1 .
Note that if ⟨ · , · ⟩ is the standard scalar product in R n and we call a i , with i = 0 , … , n − 1 , the rows of Y, and x ¯ 0 stands for the point ( 1 , x 1 0 , … , x n − 1 0 ) in R n , then (46) can be rewritten as x i = ⟨ a i , x ¯ 0 ⟩ / ⟨ a 0 , x ¯ 0 ⟩ for i = 1 , … , n − 1 .
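This inner-product form is straightforward to implement. The sketch below (with hypothetical names and illustrative matrices) also checks numerically the action property φ(Y₁Y₂, p) = φ(Y₁, φ(Y₂, p)) for n = 3, which holds for any invertible matrices as long as the denominators do not vanish:

```python
def homography(Y, p):
    # x_i = <a_i, pbar> / <a_0, pbar> with pbar = (1, p_1, ..., p_{n-1}).
    pbar = [1.0] + list(p)
    denom = sum(Y[0][j] * pbar[j] for j in range(len(pbar)))
    return [sum(Y[i][j] * pbar[j] for j in range(len(pbar))) / denom
            for i in range(1, len(Y))]

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

# Action property phi(Y1 Y2, p) = phi(Y1, phi(Y2, p)), here for n = 3.
Y1 = [[1.0, 0.2, 0.0], [0.1, 1.0, 0.3], [0.0, 0.4, 1.0]]
Y2 = [[1.0, 0.0, 0.1], [0.2, 1.0, 0.0], [0.3, 0.0, 1.0]]
p = [0.5, -0.25]
lhs = homography(matmul(Y1, Y2), p)
rhs = homography(Y1, homography(Y2, p))
```

Both sides agree to machine precision, since composing the two homographies only rescales the homogeneous vector Y₁Y₂ p̄.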
It is worth noting that if two VG Lie algebras V 1 , V 2 on two manifolds N 1 , N 2 are diffeomorphic, i.e., there exists a diffeomorphism ϕ : N 1 N 2 such that ϕ * V 1 = V 2 , then V 1 , V 2 can be integrated to two ϕ -equivariant Lie group actions φ 1 : G × N 1 N 1 and φ 2 : G × N 2 N 2 , i.e., ϕ ( φ 1 ( g , x ) ) = φ 2 ( g , ϕ ( x ) ) for every x N 1 and g G . In particular, if V 1 is the VG Lie algebra of matrix Riccati equations studied in this section and V 2 is another VG Lie algebra on N 2 = N 1 diffeomorphic to V 1 , then the Lie group action φ 2 is ϕ -equivariant to φ 1 . Since every diffeomorphism in N 1 can be understood as a change in variables, the ϕ -equivariance of φ 1 and φ 2 entails that a change in variables in N 2 allows us to write the action of every g SL ( n , R ) via φ 2 as a homography. Note that it is simple to prove that (46) gives rise to a Lie group action of SL ( n , R ) and its fundamental vector fields are those related to matrix Riccati equations.

Increase in Numerical Cost as n Increases

We can indirectly measure the numerical cost of our schemes by the time they need to compute the solution. Let us consider the following equation:
d x / d t = 2 t − x / t + x 2 / t 3 , t ≥ 1 , x ( 1 ) = 0 ,
whose analytical solution is
x ( t ) = ( 2 t 3 − 2 t 2 ) / ( 2 t − 1 ) .
Now, we apply our five numerical schemes to the equation above and plot the step size (which determines the number of steps n) versus the time consumed in the resolution of the equation.
In the diagrams displayed in Figure 4, we can observe that, on logarithmic axes, the relation between the variables is close to linear. As expected, the fourth-order schemes (RKMK, Magnus 4, and RK4) show a larger increase in numerical cost as n increases.
Now, we repeat the same process for the following differential system:
d x d t = 5 sin 10 t x + y , d y d t = 5 cos 10 t + x + y , x ( 0 ) = 1 , y ( 0 ) = 0 ,
whose solution can be written as
x ( t ) = ( 157 / 102 ) cosh ( √2 t ) − ( 38 √2 / 51 ) sinh ( √2 t ) + ( 5 / 102 ) sin 10 t − ( 55 / 102 ) cos 10 t , y ( t ) = ( 27 √2 / 34 ) sinh ( √2 t ) + ( 5 / 102 ) cosh ( √2 t ) + ( 15 / 34 ) sin 10 t − ( 5 / 102 ) cos 10 t ,
and we obtain the following figures.
In Figure 5, when h is small (and, therefore, n is big), we observe again a linear relation between the numerical cost and the index n.

5. Applications in Linear Quadratic Control

Now, we provide an interesting application to optimal control of the method for solving Lie systems given in the procedure of Section 2.3.1. A very useful model for the control of dynamical systems is the representation in the space of states. The most general representation in such a space is
x ˙ ( t ) = f ( x ( t ) , u ( t ) , t ) , y ( t ) = h ( x ( t ) , u ( t ) , t ) ,
where x : [ t 0 , t f ] → R n is a vector containing the state variables of the system, x ˙ : [ t 0 , t f ] → R n is its time derivative, u : [ t 0 , t f ] → R m is the vector containing the input variables, y : [ t 0 , t f ] → R p is the vector with the output variables, and f : R n × R m × R → R n and h : R n × R m × R → R p are two arbitrary t-dependent maps. We can manipulate the inputs to modify the state of the system.
A very important and common model is that of linear systems, given their simplicity [46]. Indeed, it is quite usual to search for a linearization of nonlinear problems. The most general representation of a linear system is
x ˙ ( t ) = A ( t ) x ( t ) + B ( t ) u ( t ) , y ( t ) = C ( t ) x ( t ) + D ( t ) u ( t ) ,
where the t-dependent matrices A ( t ) , B ( t ) , C ( t ) , and D ( t ) are the state (or system) matrix, the input matrix, the output matrix, and the feedthrough (or feedforward) matrix, respectively. In order for the system to be defined, the dimensions of the matrices must be A ( t ) M n , B ( t ) M n × m , C ( t ) M p × n , and D ( t ) M p × m for every t R .
In particular, we are interested in the problem of optimal control with a quadratic cost function, which, as we are going to show, can be transformed into a matrix Riccati equation. That is, given a linear system (49), the state x 0 , and the time interval [ t 0 , t f ] , we need to find an input u ( t ) , with initial condition x ( t 0 ) = x 0 , that minimizes the quadratic cost function, i.e.,
J ( x , u ) = def . x ( t f ) T S x ( t f ) + t 0 t f x ( t ) T Q ( t ) x ( t ) d t + t 0 t f u ( t ) T R ( t ) u ( t ) d t ,
where S is a positive semi-definite matrix and for all t [ t 0 , t f ] the matrices Q ( t ) and R ( t ) are, respectively, positive semi-definite and positive definite. Obviously, S , Q ( t ) M n × n , and R ( t ) M m × m for every t R .
Since the matrices involved are positive (semi-)definite, the terms appearing in J measure the size of the vectors x and u . Each of them “penalizes” a different aspect of the control. The first one measures how far the system is from the null state x = 0 at the end of the time interval. Analogously, the second term measures the distance between the state and the null state along time. In this way, the faster the system approaches the null state, the smaller the cost function is and the closer the system is to the null state at the end of the time interval. On the other hand, the third term measures the size of the input along time in such a way that the smaller it is (with respect to the measure defined by the matrix R ( t ) ), the smaller the value of the function J will be.
By adjusting the matrices S, Q(t), and R(t), we choose which aspects are more important. If we choose the matrix S in such a way that staying far from the null state at the end of the interval is heavily penalized, the optimal control will drive the system towards this state at the end of the time interval, at the cost of a larger input. If Q(t) dominates the other two matrices, the control will lead the system to the null state as fast as possible. On the contrary, if the dominant matrix is R(t), the input u will be small, but the other two aspects will probably be adversely affected. This is interesting when the size of the input is related to some other quantity that we would like to minimize.
In this formulation, the cost function leads the system towards the null state. Nonetheless, it is easy to modify the problem so that the system drifts towards a different state. If we aim to stabilize the system at a certain state x c and we can find an input u c such that
$$0 = A x_c + B u_c,$$
then, performing the change of variables
$$\hat{x}(t) = x(t) - x_c, \qquad \hat{u}(t) = u(t) - u_c,$$
we obtain a new system
$$\frac{d\hat{x}}{dt} = \frac{dx}{dt} = A(\hat{x} + x_c) + B(\hat{u} + u_c) = A\hat{x} + B\hat{u} + \underbrace{A x_c + B u_c}_{=\,0} = A\hat{x} + B\hat{u}$$
in which we can apply the quadratic cost function to obtain an optimal control problem that conducts the system towards $\hat{x} = x - x_c = 0$. In this way, the original system will tend to $x_c$.
The solution of the linear quadratic control problem is given as a state-feedback controller, i.e., the optimal input u o ( t ) that minimizes J ( x , u ) is a function of the state of the system. In particular, we can write $u_o(t) = -K(t)x(t)$, where K ( t ) is the feedback matrix, calculated as
$$K(t) = R(t)^{-1} B(t)^T P(t),$$
where P ( t ) is the solution of the following matrix differential Riccati equation:
$$\frac{dP}{dt} = P B R(t)^{-1} B^T P - P A - A^T P - Q(t), \qquad P(t_f) = S.$$
The initial condition is given at the end of the time interval because one needs to integrate the equation in reverse ([47], §8.2). Equation (50) is the matrix Riccati equation introduced in Section 4.
Now, we are going to solve an example involving linear quadratic control by applying our analytical resolution of Lie systems.

Example: Velocity of a Vehicle

We propose a model for controlling the velocity of a vehicle. There is a single input variable, corresponding to the engine force that accelerates the vehicle. Let us assume that the only force that could decelerate the vehicle is air friction and that it is proportional to the square of the velocity [48]. For simplicity, our model only describes motions with positive velocity. Under these hypotheses, it is enough to take the velocity of the vehicle as the state variable to completely characterize the system. Applying Newton’s second law, we obtain the equation describing the system:
$$F - kv^2 = m\frac{dv}{dt},$$
where F is the engine force, v is the velocity, k is a proportionality constant, and m is the mass of the vehicle. For simplicity, we take m = k = 1. We change the notation and use u instead of F, this being the input of the system, so the system now reads
$$\frac{dv}{dt} = -v^2 + u.$$
This system is nonlinear but, since we want to design a control that keeps the velocity constant around a certain value, we can linearize the system in a neighborhood of that value to compute the optimal control with quadratic cost function that keeps the vehicle at cruising speed. Again, to simplify the computations, we take v c = 1. Under these circumstances, d v / d t = 0, so we obtain u c = 1. The linearized system around the point ( v c , u c ) reads
$$\frac{d\Delta v}{dt} = -2\,\Delta v + \Delta u,$$
where $\Delta v = v - 1$ and $\Delta u = u - 1$ are the incremental variables around $(v_c, u_c)$.
To simplify further, we take all the matrices in the quadratic cost function to be constant and equal to one on the time interval [ 0 , 1 ]. The cost function is then
$$J(v,u) = \Delta v(1)^2 + \int_0^1 \Delta v(s)^2\,ds + \int_0^1 \Delta u(s)^2\,ds.$$
The function $\Delta u_o(t)$ that minimizes (51) is $\Delta u_o = -K(t)\Delta v(t)$, where $K = R^{-1}B^T P$. In our case, $K(t) = P(t)$, with $P(t)$ the solution of the Riccati equation
$$\frac{dP}{dt} = P B R^{-1} B^T P - P A - A^T P - Q = P^2 + 4P - 1,$$
with (final) condition P ( 1 ) = S = 1 .
Now, we solve (52) analytically, applying the procedure explained in Section 2.3.1. Since (52) is a Riccati equation with constant coefficients, given its simplicity, we can compute its analytical solution by solving its associated linear system on the group SL ( 2 , R ). In this case, we have to solve d Y ( t ) / d t = A Y ( t ), with Y ( 1 ) = I, where the matrix A is (according to the notation in Section 4.1)
$$A = -M_0 + 4M_1 + M_2 = \begin{pmatrix} 2 & -1 \\ -1 & -2 \end{pmatrix}.$$
The exact solution of this system will be expressed in its canonical (vectorized) form $dy/dt = \Sigma y$, where
$$\Sigma = \begin{pmatrix} 2 & 0 & -1 & 0 \\ 0 & 2 & 0 & -1 \\ -1 & 0 & -2 & 0 \\ 0 & -1 & 0 & -2 \end{pmatrix}, \qquad y(1) = \begin{pmatrix} 1 \\ 0 \\ 0 \\ 1 \end{pmatrix}.$$
Its solution is
$$y(t) = \exp\!\left(\int_1^t \Sigma\,d\tau\right) y(1) = \exp\big((t-1)\Sigma\big)\,y(1),$$
that is,
$$y(t) = \frac{e^{-\sqrt{5}(t+1)}}{10} \begin{pmatrix} (5+2\sqrt{5})\,e^{2\sqrt{5}\,t} + (5-2\sqrt{5})\,e^{2\sqrt{5}} \\ \sqrt{5}\,e^{2\sqrt{5}} - \sqrt{5}\,e^{2\sqrt{5}\,t} \\ \sqrt{5}\,e^{2\sqrt{5}} - \sqrt{5}\,e^{2\sqrt{5}\,t} \\ (5-2\sqrt{5})\,e^{2\sqrt{5}\,t} + (5+2\sqrt{5})\,e^{2\sqrt{5}} \end{pmatrix}.$$
Finally, we can retrieve the solution to (52) by means of the Lie group action of SL ( 2 , R ) on R as
$$P(t) = \varphi\big(Y(t), P(1)\big) = \frac{(5+\sqrt{5})\,e^{2\sqrt{5}\,t} + (5-\sqrt{5})\,e^{2\sqrt{5}}}{(5-3\sqrt{5})\,e^{2\sqrt{5}\,t} + (5+3\sqrt{5})\,e^{2\sqrt{5}}}.$$
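As a sanity check (our own, not part of the paper's procedure), the closed-form P(t) can be verified numerically against its final condition P(1) = 1 and the Riccati equation (52):

```python
import math

r5 = math.sqrt(5.0)

def P(t):
    """Closed-form solution of dP/dt = P^2 + 4P - 1 with P(1) = 1."""
    num = (5 + r5) * math.exp(2 * r5 * t) + (5 - r5) * math.exp(2 * r5)
    den = (5 - 3 * r5) * math.exp(2 * r5 * t) + (5 + 3 * r5) * math.exp(2 * r5)
    return num / den

P0 = P(0.0)   # feedback gain at the start of the interval, approx. 0.2435
```

A finite-difference check confirms that this expression satisfies the equation along [0, 1].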
The optimal control is thus $\Delta u_o = -P(t)\Delta v$. For comparison, we introduce a constant input $\Delta u_c$ that carries the system from an initial perturbation to the operating point v = 1. If we start from a point $v(0) = \bar{v}$, the equation determining the constant value $\Delta u_c$ that takes the system back to the cruising speed is
$$\frac{d\Delta v}{dt} = -2\,\Delta v + \Delta u_c,$$
with boundary conditions $\Delta v(0) = \bar{v} - 1$ and $\Delta v(1) = 0$. The solution can be computed trivially:
$$\Delta u_c = \frac{2\bar{v} - 2}{1 - e^2}.$$
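The two control laws can be compared numerically. The sketch below (our own explicit-Euler simulation, for the initial condition $\bar{v} = 1.2$) accumulates the cost (51) for the optimal feedback $\Delta u = -P(t)\Delta v$, with the closed-form P(t) obtained above, and for the constant input $\Delta u_c$:

```python
import math

r5 = math.sqrt(5.0)

def P(t):
    """Closed-form solution of the Riccati equation with P(1) = 1."""
    num = (5 + r5) * math.exp(2 * r5 * t) + (5 - r5) * math.exp(2 * r5)
    den = (5 - 3 * r5) * math.exp(2 * r5 * t) + (5 + 3 * r5) * math.exp(2 * r5)
    return num / den

def cost(vbar, optimal, steps=20000):
    """Euler simulation of d(Dv)/dt = -2 Dv + Du on [0, 1], accumulating
    J = Dv(1)^2 + int Dv^2 dt + int Du^2 dt."""
    duc = (2 * vbar - 2) / (1 - math.e ** 2)   # constant control Du_c
    h = 1.0 / steps
    dv, J = vbar - 1.0, 0.0
    for i in range(steps):
        t = i * h
        du = -P(t) * dv if optimal else duc
        J += h * (dv * dv + du * du)
        dv += h * (-2.0 * dv + du)
    return J + dv * dv

J_opt = cost(1.2, optimal=True)
J_const = cost(1.2, optimal=False)
```

Consistently with Table 1, the optimal feedback yields the smaller of the two costs.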
In Figure 6, we depict the evolution of the system for different initial conditions around v = 1. The solid line represents the evolution of the system under the optimal control, and the dashed line corresponds to the constant input u.
The chosen values of S, Q, R for the optimal control do not bring the vehicle to cruising speed within the considered time interval. This makes sense if we think of the quadratic cost function as a compromise that also reduces the size of the input, so that the system approaches the operating point quickly and efficiently. If we want to ensure that the vehicle reaches the cruising speed, we need to reflect this in the cost function by giving more weight to S and Q.
If we now calculate the cost function for different initial conditions, we see that, although the constant control makes the system reach the cruising speed faster and with less final error, the optimal control attains a smaller cost. We list some values in Table 1.
The input represents the engine force accelerating the vehicle, and it is reasonable to assume that the fuel consumption is proportional to this force. In this way, we can derive the optimal control that keeps the vehicle at a constant cruising speed while minimizing the amount of fuel.

6. Conclusions

This paper is concerned with the integration of Lie systems, both from the analytical and numerical perspectives, using particular techniques adapted to their geometric features. This work is rooted in the field of numerical and discrete methods specifically adapted for Lie systems, which is still a largely unexplored branch of research [2,13,14,49].
One major result of this paper is that we are able to solve Lie systems on Lie groups. This permits us to simultaneously solve all Lie systems related to the same automorphic Lie system (equivalently, all Lie systems with isomorphic Vessiot–Guldberg Lie algebras) [1,5]. Automorphic Lie systems admit a simple superposition rule that depends on only a single particular solution. This is an advantage in comparison with superposition rules for general Lie systems, which usually depend on a larger number of particular solutions. The second most important advantage is that, since Lie groups admit a local matrix representation, automorphic Lie systems can be written as first-order systems of linear homogeneous ODEs in normal form.
Employing the geometric structure of Lie systems, we propose a particular geometric integrator for Lie systems that exploits the properties of this structure. In particular, we employ the Lie group action obtained by integrating the Vessiot–Guldberg Lie algebra of a Lie system to obtain its analytical solution. We use the automorphic Lie system related to a Lie system, along with geometric schemes, namely Lie group integrators, to preserve the group structure. Specifically, we use two families of numerical schemes: the first is based on the Magnus expansion, whereas the second is based on RKMK methods. We have compared both methods in different situations; generally, the fourth-order RKMK method is slightly more precise than the Magnus expansion of the same order. Regarding the transmission of the convergence order from the Lie group method to the Lie system method, our conjecture, rooted in the results obtained for different Lie groups, is that the way the Lie group action is constructed plays a central role. Whilst the numerical methods work very satisfactorily at the Lie group level, when we translate the properties to the manifold, the convergence and precision of the numerical method can change (as in the SL ( 3 , R ) case). Nonetheless, since our methods are based on geometric integrators, they inherit all the geometric properties we wish to preserve, and the solutions always belong to the manifold where the Lie system is defined (something that is not guaranteed if one uses classical numerical schemes).
From the results obtained for SL ( 2 , R ) and SL ( 3 , R ), we have been able to provide a generalization to SL ( n , R ), and we have discussed the form of the Lie group action. As has been evidenced, SL ( n , R ) is a relevant Lie group, appearing recurrently in Smorodinsky–Winternitz nonlinear oscillators, Milne–Pinney equations, and Ermakov systems, as well as in higher-order Riccati equations.
The last important result is that solving higher-order Riccati equations has allowed us to address important examples arising in engineering. In particular, we have proposed an optimal control problem in which matrix Riccati equations appear naturally from quadratic cost functions.
In the future, we will analyze the transmission of convergence from automorphic Lie systems to related Lie systems. In addition, since the exponential is a local diffeomorphism, the topological study of matrix Lie groups would allow us to establish the optimal time-step for Lie group methods, which is a long-standing problem whose solution would also help optimize Lie system methods. Another endeavor is to study Lie systems on more general manifolds, not necessarily diffeomorphic to R n , and to depict how some geometric and topological invariants are preserved [50,51]. Right now, we are working on examples on anti-de Sitter spaces, so we can show how the curvature is preserved under the numerical method; this could be generalized to all kinds of systems in all types of curved spaces. This will in fact prove the interest of our 7-step method: although nongeometric approximation methods may seem to perform better than our proposal, in forthcoming publications we will show that, when invariants come into play, the 7-step method is the best choice to preserve certain geometric and topological invariants.

Author Contributions

Conceptualization, J.d.L., C.S. and F.J.A.; methodology, L.B.D.; software, L.B.D.; validation, F.J.A. and C.S.; formal analysis, L.B.D.; investigation, L.B.D.; data curation, L.B.D.; writing—original draft preparation, C.S.; writing—review and editing, L.B.D. and F.J.A.; visualization, J.d.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

Acknowledgments

J. de Lucas acknowledges partial financial support from MINIATURA-5 Nr 2021/05/X/ST1/01797, funded by the National Science Centre (Poland). C. Sardón and F. Jiménez acknowledge project “Teoría de aproximación constructiva y aplicaciones” (TACA-ETSII), UPM, Madrid.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cariñena, J.F.; Grabowski, J.; Marmo, G. Lie-Scheffers Systems: A Geometric Approach; Napoli Series in Physics and Astrophysics; Bibliopolis: Napoli, Italy, 2000.
  2. de Lucas, J.; Sardón, C. A Guide to Lie Systems with Compatible Geometric Structures; World Scientific: Singapore, 2020.
  3. Winternitz, P. Nonlinear action of Lie groups and superposition rules for nonlinear differential equations. Phys. A 1982, 114, 105–113.
  4. Cariñena, J.F.; Grabowski, J.; Ramos, A. Reduction of t-dependent systems admitting a superposition principle. Acta Appl. Math. 2001, 66, 67–87.
  5. Cariñena, J.F.; de Lucas, J. Lie Systems: Theory, Generalisations, and Applications. Diss. Math. 2011, 479.
  6. Sardón, C. Lie Systems, Lie Symmetries and Reciprocal Transformations. Ph.D. Thesis, Universidad de Salamanca, Salamanca, Spain, 2015.
  7. Cortés, J.; Martínez, S. Non-holonomic integrators. Nonlinearity 2001, 14, 1365–1392.
  8. Iserles, A.; Munthe-Kaas, H.; Nørsett, S.; Zanna, A. Lie-group methods. Acta Numer. 2000, 9, 215–365.
  9. Marrero, J.C.; Martín de Diego, D.; Martínez, E. Discrete Lagrangian and Hamiltonian mechanics on Lie groupoids. Nonlinearity 2006, 19, 1313–1348.
  10. Marsden, J.E.; West, M. Discrete mechanics and variational integrators. Acta Numer. 2001, 10, 357–514.
  11. McLachlan, R.; Quispel, G.R.W. Splitting methods. Acta Numer. 2002, 11, 341–434.
  12. Sanz-Serna, J.M. Symplectic integrators for Hamiltonian problems: An overview. Acta Numer. 1992, 1, 243–286.
  13. Pietrzkowski, G. Explicit solutions of the a1-type Lie-Scheffers system and a general Riccati equation. J. Dyn. Control Syst. 2012, 18, 551–571.
  14. Rand, D.W.; Winternitz, P. Nonlinear superposition principles: A new numerical method for solving matrix Riccati equations. Comput. Phys. Commun. 1984, 33, 305–328.
  15. Cariñena, J.F.; Grabowski, J.; Marmo, G. Superposition rules, Lie theorem and partial differential equations. Rep. Math. Phys. 2007, 60, 237–258.
  16. Cariñena, J.F.; Ramos, A. Integrability of the Riccati equation from a group theoretical viewpoint. Int. J. Mod. Phys. A 1999, 14, 1935–1951.
  17. Angelo, R.M.; Wreszinski, W.F. Two-level quantum dynamics, integrability and unitary NOT gates. Phys. Rev. A 2005, 72, 034105.
  18. Lázaro-Camí, J.A.; Ortega, J.P. Superposition rules and stochastic Lie-Scheffers systems. Ann. Inst. H. Poincaré Probab. Stat. 2009, 45, 910–931.
  19. Hussin, V.; Beckers, J.; Gagnon, L.; Winternitz, P. Superposition formulas for nonlinear superequations. J. Math. Phys. 1990, 31, 2528–2534.
  20. Cariñena, J.F.; de Lucas, J.; Sardón, C. A new Lie systems approach to second-order Riccati equations. Int. J. Geom. Meth. Mod. Phys. 2012, 9, 1260007.
  21. Cariñena, J.F.; de Lucas, J. Applications of Lie systems in dissipative Milne-Pinney equations. Int. J. Geom. Meth. Mod. Phys. 2009, 6, 683–699.
  22. Odzijewicz, A.; Grundland, A.M. The superposition principle for the Lie type first-order PDEs. Rep. Math. Phys. 2000, 45, 293–306.
  23. Iserles, A.; Nørsett, S.P. On the solution of linear differential equations in Lie groups. Philos. Trans. R. Soc. A 1999, 357, 983–1020.
  24. Zanna, A. Collocation and relaxed collocation for the Fer and Magnus expansions. SIAM J. Numer. Anal. 1999, 36, 1145–1182.
  25. Munthe-Kaas, H. Runge-Kutta methods on Lie groups. BIT Numer. Math. 1998, 38, 92–111.
  26. Munthe-Kaas, H. High order Runge-Kutta methods on manifolds. Appl. Numer. Math. 1999, 29, 115–127.
  27. de Lucas, J.; Grundland, A.M. A Lie systems approach to the Riccati hierarchy and partial differential equations. J. Differ. Equ. 2017, 263, 299–337.
  28. Kučera, V. A review of the matrix Riccati equation. Kybernetika 1973, 9, 42–61.
  29. Lee, J.M. Introduction to Smooth Manifolds; Graduate Texts in Mathematics 218; Springer: New York, NY, USA, 2003.
  30. Ado, I.D. The representation of Lie algebras by matrices. Uspekhi Mat. Nauk 1947, 2, 159–173.
  31. Curtis, M.L. Matrix Groups, 2nd ed.; Springer: New York, NY, USA, 1984.
  32. Hall, B. Matrix Lie Groups. In Lie Groups, Lie Algebras, and Representations: An Elementary Introduction; Springer International Publishing: Berlin/Heidelberg, Germany, 2015; pp. 3–30.
  33. Sattinger, D.H.; Weaver, O.L. Lie Groups and Algebras with Applications to Physics, Geometry and Mechanics; Springer: Berlin/Heidelberg, Germany, 1986.
  34. Lie, S.; Scheffers, G. Vorlesungen über continuierliche Gruppen mit geometrischen und anderen Anwendungen; Teubner: Leipzig, Germany, 1893.
  35. Levi, E.E. Sulla struttura dei gruppi finiti e continui. Atti della R. Accad. delle Sci. Torino 1905, 40, 551–565.
  36. Hairer, E.; Nørsett, S.P.; Wanner, G. Solving Ordinary Differential Equations I: Nonstiff Problems; Springer: Berlin/Heidelberg, Germany, 1993.
  37. Isaacson, E.; Keller, H.B. Analysis of Numerical Methods; John Wiley & Sons: New York, NY, USA, 1966.
  38. Quarteroni, A.; Sacco, R.; Saleri, F. Numerical Mathematics; Springer: New York, NY, USA, 2007.
  39. Magnus, W. On the exponential solution of differential equations for a linear operator. Commun. Pure Appl. Math. 1954, 7, 649–673.
  40. Iserles, A.; Nørsett, S.P.; Rasmussen, A.F. Time-Symmetry and High-Order Magnus Methods; Technical Report 1998/NA06, DAMTP; University of Cambridge: Cambridge, UK, 1998.
  41. Blanes, S.; Casas, F.; Ros, J. Improved high order integrators based on the Magnus expansion. BIT Numer. Math. 2000, 40, 434–450.
  42. Hairer, E.; Lubich, C.; Wanner, G. Geometric Numerical Integration; Springer: Berlin/Heidelberg, Germany, 2006.
  43. Hartshorne, R. Foundations of Projective Geometry; W.A. Benjamin, Inc.: New York, NY, USA, 1967.
  44. Harnad, J.; Winternitz, P.; Anderson, R.L. Superposition principles for matrix Riccati equations. J. Math. Phys. 1983, 24, 1062.
  45. Reid, W.T. Riccati Differential Equations; Academic Press: New York, NY, USA, 1972.
  46. Domínguez, S.; Campoy, P.; Sebastián, J.M.; Jiménez, A. Control en el Espacio de Estado; Pearson: London, UK, 2006.
  47. Sontag, E.D. Mathematical Control Theory: Deterministic Finite Dimensional Systems; Springer: New York, NY, USA, 1998.
  48. Pandey, A.; Ghose-Choudhury, A.; Guha, P. Chiellini integrability and quadratically damped oscillators. Int. J. Non-Linear Mech. 2017, 92, 153–159.
  49. Penskoi, A.V.; Winternitz, P. Discrete matrix Riccati equations with superposition formulas. J. Math. Anal. Appl. 2004, 294, 533–547.
  50. Herranz, F.J.; de Lucas, J.; Tobolski, M. Lie-Hamilton systems on curved spaces: A geometrical approach. J. Phys. A 2017, 50, 495201.
  51. Lange, J.; de Lucas, J. Geometric models for Lie–Hamilton systems on R 2. Mathematics 2019, 7, 1053.
Figure 1. Exact vs. numerical solutions of (34) with x ( 1 ) = 0 . In the left plot, we observe the naturally better approximation of higher-order methods for large time steps ( h = 3 ). In the right plot, we observe a closer approximation to the exact dynamics when h decreases ( h = 0.5 ).
Figure 2. Convergence for the Riccati Equation (34).
Figure 3. Convergence for the affine system of first-order differential Equation (44).
Figure 4. Numerical integration of Riccati Equation (47).
Figure 5. Numerical integration of Equation (48).
Figure 6. Evolution of the system with optimal control and constant control.
Table 1. Summary of results in the example.

J(v,u) × 1000    | v̄ = 1.2 | v̄ = 1.15 | v̄ = 1.1 | v̄ = 1.05 | v̄ = 1
Optimal Control  | 10.340  | 5.816    | 2.585   | 0.646    | 0
Constant Control | 11.771  | 6.621    | 2.943   | 0.736    | 0