Discrete-Time Observations of Brownian Motion on Lie Groups and Homogeneous Spaces: Sampling and Metric Estimation

Jensen, Mathias Højgaard; Joshi, Sarang; Sommer, Stefan

doi:10.3390/a15080290

Open AccessArticle

Discrete-Time Observations of Brownian Motion on Lie Groups and Homogeneous Spaces: Sampling and Metric Estimation

by

Mathias Højgaard Jensen

^1,†,

Sarang Joshi

² and

Stefan Sommer

^1,*,†

¹

Department of Computer Science, University of Copenhagen, 2100 Copenhagen, Denmark

²

Department of Biomedical Engineering, University of Utah, Salt Lake City, UT 84112, USA

^*

Author to whom correspondence should be addressed.

^†

Current address: Universitetsparken 5, 2100 Copenhagen, Denmark.

Algorithms 2022, 15(8), 290; https://doi.org/10.3390/a15080290

Submission received: 8 July 2022 / Revised: 9 August 2022 / Accepted: 9 August 2022 / Published: 17 August 2022

Download

Browse Figures

Versions Notes

Abstract

:

We present schemes for simulating Brownian bridges on complete and connected Lie groups and homogeneous spaces. We use this to construct an estimation scheme for recovering an unknown left- or right-invariant Riemannian metric on the Lie group from samples. We subsequently show how pushing forward the distributions generated by Brownian motions on the group results in distributions on homogeneous spaces that exhibit a non-trivial covariance structure. The pushforward measure gives rise to new non-parametric families of distributions on commonly occurring spaces such as spheres and symmetric positive tensors. We extend the estimation scheme to fit these distributions to homogeneous space-valued data. We demonstrate both the simulation schemes and estimation procedures on Lie groups and homogenous spaces, including

SPD (3) = {GL}_{+} (3) / SO (3)

and

S^{2} = SO (3) / SO (2)

.

Keywords:

bridge simulation; Brownian motion; Lie groups; homogeneous spaces; metric estimation; directional statistics

1. Introduction

Bridge simulation is a data augmentation technique for generating missing trajectories of continuous diffusion processes. We consider bridge simulation on Lie groups and homogeneous spaces. As an important example, we investigate the case of an i.i.d. Lie group or homogeneous space-valued samples that are considered discrete-time observations of a continuous diffusion process. Assuming the stochastic dynamics to be Brownian motion, we wish to estimate the underlying Riemannian metric of the Lie group or homogeneous space from the samples. To evaluate and maximize the likelihood of the data, we need to account for the diffusion process being unobserved at most time points. This requires bridge sampling, and the sampling techniques are thus the key enabler for metric estimation in this setting.

The simulation of conditioned diffusion processes is a highly non-trivial problem, even in Euclidean spaces. Transition densities of diffusion processes are tractable in closed form only for a small class of processes, and hence, simulating directly from the true bridge distribution is generally infeasible. The data augmentation used in inference for diffusion processes dates back to the seminal paper by Pedersen [1] almost three decades ago. Since then, several papers have studied diffusion bridge simulation methods; see, e.g., [2,3,4,5,6,7,8,9,10,11]. The method by Delyon and Hu [5] exchanged the intractable drift term in the conditioned diffusion with a tractable drift originating from the drift of a standard Brownian bridge.

In this paper, we further extend the original idea of Delyon and Hu [5] to Lie groups and homogenous spaces. Several papers have built on the ideas of Delyon and Hu. For example, a manifold equivalent drift term analogous to the drift term of a Brownian bridge in Euclidean space was used in [6] to describe the simulation of Brownian bridges on the flat torus, whereas [7] generalized this method to general finite-dimensional Riemannian manifolds. Reference [11] used the drift to model Brownian bridges on the space of landmarks. Bui et al. [4,12] used a similar drift term on the space of symmetric positive definite (covariance) matrices, exploiting the exponential map, which in the space of covariance matrices is a global diffeomorphism. The present paper extends these ideas to general Lie groups and homogeneous spaces.

As an application, we consider discrete-time observations in Lie groups and homogeneous spaces regarded as incomplete observations of sample paths of Brownian motions arising from left- (or right-) invariant Riemannian metrics. The bridge simulation schemes allow interpolating between the discrete-time observations. Furthermore, we observe how varying the metric on Lie groups affects pushforwards of the Brownian motion to homogeneous spaces being quotients of the group. These distributions encode the covariance of the data resulting from the metric structure of the Lie group. We define this family of distributions and derive estimation schemes for recovering the metric structure of the group both with Lie group samples and with homogeneous space samples. One particular example is the two-sphere,

S^{2} ≅ SO (3) / SO (2)

. Changing the metric structure on

SO (3)

results in anisotropic distributions on

S^{2}

, arising as the pushforward measure from

SO (3)

. Figure 1 illustrates the isotropic and anisotropic distributions on

S^{2}

induced by a bi-invariant and left-invariant (not bi-invariant) metric on

SO (3)

, respectively. The resulting distributions are analogous to the von Mises–Fisher and Fisher–Bingham–Kent distributions [13,14]. However, the approach is independent of the chosen embedding and uses only the geometric relation between the group and quotient space.

For the simulation on homogeneous spaces, we use bridges to submanifolds as developed by Thompson [15] to condition on the fibers in the Lie group G over the target point

v \in M = G / H

for some closed subgroup

H \subset G

. The resulting guiding term guides in the direction closest to the fiber.

Statistics on Lie groups and homogeneous spaces find applications in many diverse fields, including bioinformatics, medical imaging, shape analysis, computer vision, and information geometry; see, e.g., [16,17,18,19,20]. Statistics in Euclidean spaces often rely on the distributional properties of the normal distribution. Here, we use Brownian motions and the heat equation to generalize the normal distribution to Lie groups and homogeneous spaces as introduced by Grenander [21]. The solution to the heat equation is the transition density of a Brownian motion. Through Monte Carlo simulations of bridges, we can estimate the transition density and maximize the likelihood with respect to the Riemannian metric.

Contribution and Overview

We present simulation schemes on Lie groups and homogeneous spaces with application to parameter estimation. We outline the necessary theoretical background for the construction of bridge simulation on Lie groups and homogeneous spaces before demonstrating how the simulation scheme leads to estimates of means and the underlying metric structure using maximum likelihood estimation on certain Lie groups and homogeneous spaces. The paper builds on and significantly extends the conference paper [22], which introduced bridge simulation in the Lie group setting.

The paper is organized as follows: In Section 2, we describe the relevant background theory of Lie groups, Brownian motions, and Brownian bridges in Riemannian manifolds. Section 3 presents the theory and results of bridge sampling in Lie groups, and Section 4 introduces bridge sampling on homogeneous spaces. Section 5 covers maximum likelihood estimation of the starting point and Riemannian metric. Numerical experiments on selected Lie groups and homogeneous spaces are presented in Section 6.

2. Notation and Background

We here briefly describe simulating conditioned diffusions in Euclidean space as developed in [5] before reviewing the theory on conditioned diffusions on Riemannian manifolds.

2.1. Euclidean Diffusion Bridges and Simulation

Suppose a strong solution exists to an SDE of the form

\begin{matrix} d x_{t} = b (t, x_{t}) d t + σ (t, x_{t}) d w_{t}, \end{matrix}

where b and

σ

satisfy certain regularity conditions and where w denotes an

R^{n}

-valued Brownian motion. In this case, x is a Markov process, and its transition density exists. Suppose we define the function

\begin{matrix} h (t, x) = \frac{p_{T - t} (x_{t}, v)}{p_{T} (x_{0}, v)}, \end{matrix}

for some

x_{0}, v \in R^{n}

. Then, it is easily derived that h is a martingale on

[0, T)

with

h (0, x_{0}) = 1

, and Doob’s h-transform implies that the SDE of the conditioned diffusion

x | x_{T} = v

is given by

\begin{matrix} d y_{t} = \tilde{b} (t, y_{t}) d t + σ (t, y_{t}) d w_{t} \end{matrix}

where

\tilde{b} (t, y) = b (t, y) + (σ σ^{T}) (t, y) \nabla_{y} log p_{T - t} (y, v)

. In the case that the transition density is intractable, simulation from the exact distribution is infeasible. Delyon and Hu [5] suggested substituting the latter term in

\tilde{b}

with a drift term of the form

- (y_{t} - v) / (T - t)

, which equals the drift term in a Brownian bridge. The guided process obtained by making the above substitution yields a conditioning, and one obtains

E [f (x) | x_{T} = v] = C E [f (y) φ_{T} (y)],

(1)

where

φ_{T}

is a likelihood function that is tractable and numerically computable, y is the guided process, and the constant

C > 0

depends on

x_{0}

, v, and T.

2.2. Riemannian Manifolds and Lie Groups

Let M be a finite-dimensional smooth manifold of dimension d. M can be endowed with a Riemannian metric tensor, i.e., a family of inner products

{{〈 \cdot, \cdot 〉}_{x}}_{x \in M}

defined on each tangent space

T_{x} M

. The Riemannian metric tensor gives rise to a distance function between points in M. The tangent space is locally diffeomorphic with an open subset of M. The Riemannian exponential map

{Exp}_{x} : T_{x} M \to M

provides this local diffeomorphism. On the subset of M where

{Exp}_{x}

is a diffeomorphism, the inverse Riemannian exponential map, also called the Riemannian logarithm map,

{Log}_{x} : M \to T_{x} M

is defined. The Riemannian distance function can then be defined in terms of the Riemannian inner product as

d (x, y) = {∥ {Log}_{x} (y) ∥}_{x}

. The Riemannian logarithm map plays an important role when defining guided bridges on manifolds.

Let X be a vector field on M assigning to each point

X \in M

a tangent vector

X (x) \in T_{x} M

. A connection ∇ on a manifold is an operation that allows us to compare neighboring tangent spaces and define derivatives of vector fields along other vector fields, that is, if Y is another vector field, then

\nabla_{X} Y

is the derivative of Y along X (also known as the covariant derivative of Y along X). A connection also gives a notion of “straight lines” in manifolds, also known as geodesics. A curve

γ

is a geodesic if the vector field along

γ

is parallel to itself, i.e., if

\nabla_{{\dot{γ}}_{t}} {\dot{γ}}_{t} = 0

. The geodesic curves are locally length minimizing.

Generalizing the Euclidean Laplacian operator, the Laplace–Beltrami operator is defined as the divergence of the gradient,

Δ_{M} f = div grad f

. In terms of local coordinates

(x_{1}, \dots, x_{d})

, the expression for the Laplace–Beltrami operator becomes

\begin{matrix} Δ_{M} f = det {(g)}^{- 1 / 2} (\frac{\partial}{\partial x_{j}} g^{j i} det {(g)}^{1 / 2} \frac{\partial}{\partial x_{i}}) f, \end{matrix}

(2)

where

det (g)

denotes the determinant of the Riemannian metric g and

g^{i j}

are the coefficients of the inverse of g. (2) can be written as

\begin{matrix} Δ_{M} f & = a^{i j} \frac{\partial}{\partial x_{i}} \frac{\partial}{\partial x_{j}} f + b^{j} \frac{\partial}{\partial x_{j}} f, \end{matrix}

(3)

where

a^{i j} = g^{i j}

,

b^{k} = - g^{i j} Γ_{i j}^{k}

, and

Γ

denote the Christoffel symbols of the Riemannian metric.

2.3. Lie Groups

Let G denote a connected Lie group of dimension d, i.e., a smooth manifold with a group structure such that the group operations

G \times G ∋ (x, y) \overset{μ}{\mapsto} x y \in G

and

G ∋ x \overset{ι}{\mapsto} x^{- 1} \in G

are smooth maps. If

x \in G

, the left-multiplication map,

L_{x} y

, defined by

y \mapsto μ (x, y)

, is a diffeomorphism from G to itself. Similarly, the right-multiplication map

R_{x} y

defines a diffeomorphism from G to itself by

y \mapsto μ (y, x)

. Let

d L_{x} : T G \to T G

denote the pushforward map given by

{(d L_{x})}_{y} : T_{y} G \to T_{x y} G

. A vector field V on G is said to be left-invariant if

{(d L_{x})}_{y} V (y) = V (x y)

. The space of left-invariant vector fields is linearly isomorphic to

T_{e} G

, the tangent space at the identity element

e \in G

. By equipping the tangent space

T_{e} G

with the Lie bracket, we can identify the Lie algebra

g

with

T_{e} G

. The group structure of G makes it possible to define an action of G on its Lie algebra

g

. The conjugation map

C_{x} : = L_{x} \circ R_{x}^{- 1} : y \mapsto x y x^{- 1}

, for

x \in G

, fixes the identity e. Its pushforward map at e,

{(d C_{x})}_{e}

, is then a linear automorphism of

g

. Define

Ad (x) : = {(d C_{x})}_{e}

, then

Ad : x \mapsto Ad (x)

is the adjoint representation of G in

g

. The map

G \times g ∋ (x, v) \mapsto Ad (x) v \in g

is the adjoint action of G on

g

. We denote by

〈 \cdot, \cdot 〉

a Riemannian metric on G. The metric is said to be left-invariant if

{〈 u, v 〉}_{y} = {〈{(d L_{x})}_{y} u, {(d L_{x})}_{y} v〉}_{L_{x} (y)}

, for every

u, v \in T_{y} G

, i.e., the left-multiplication maps are isometries, for every

x \in G

. The metric is

Ad (G)

-invariant if

{〈 u, v 〉}_{e} = {〈Ad (x) u, Ad (x) v〉}_{e}

, for every

u, v \in g

. Note that an

Ad (G)

-invariant metric on G is equivalent to a bi-invariant (left- and right-invariant) inner product on

g

. The differential of the Ad map at the identity yields a linear map

Ad (x) = \frac{d}{d t} {Ad (exp (t x)) |}_{0}

. This linear map is equal to the Lie bracket

[v, w] = Ad (v) w

,

v, w \in g

.

A one-parameter subgroup of G is a continuous homomorphism

γ : (R, +) \to G

. The Lie group exponential map

exp : g \to G

is defined as

exp (v) = γ_{v} (1)

, for

v \in g

, where

γ_{v}

is the unique one-parameter subgroup of G whose tangent vector at e is v. For matrix Lie groups, the exponential map has the particular form:

exp (A) = \sum_{k = 0}^{\infty} A^{k} / k!

, for a square matrix A. The resulting matrix

exp (A)

is an invertible matrix. Given an invertible matrix B, if there exists a square matrix A such that

B = exp (A)

, then A is said to be the logarithm of B. In general, the logarithm might not exist, and if it does, it may fail to be unique. In a neighborhood sufficiently close to the identity, the Lie group logarithm exists and is unique. By means of left-translation (or right-translation), the Lie group exponential map can be extended to a map

{exp}_{g} : T_{g} G \to G

, for all

g \in G

, defined by

{exp}_{g} (v) = g exp (d L_{g^{- 1}} v)

. Similarly, the Lie group logarithm at g becomes

{log}_{g} (v) = d L_{g} log (g^{- 1} v)

. The matrix exponential and logarithms can be computed numerically efficiently (see [23], Chapter 5, and the references therein).

Example 1.

A few examples of Lie groups include the Euclidean space

(R^{n}, +)

with the additive group structure,

(R_{+}, \cdot)

the positive real line with a multiplicative group structure, the space of invertible real matrices

GL (n)

equipped with a multiplication of matrices forming a Lie group, and the rotation group

O (n)

, consisting of real orthogonal matrices with determinant one or minus one forming a subgroup of

GL (n)

.

The identification of the space of left-invariant vector fields with the Lie algebra

g

allows for a global description of

Δ_{G}

. Indeed, let

{v_{1}, \dots v_{d}}

be an orthonormal basis of

T_{e} G

. Then,

V_{i} (g) = {(d L_{g})}_{e} v_{i}

defines left-invariant vector fields on G and the Laplace–Beltrami operator can be written as (cf. [24], Proposition 2.5)

Δ_{G} f (e) = \sum_{i = 1}^{d} V_{i}^{2} f (e) - V_{0} f (e),

where

V_{0} = \sum_{i, j = 1}^{d} C_{i j}^{j} V_{j}

and

C_{i j}^{k}

denote the structure coefficients given by

[V_{i}, V_{j}] = C_{i j}^{k} V_{k} .

(4)

By left-invariance,

Δ_{G} f (g) = Δ_{G} f \circ L_{g} (e) = {(d L_{g})}_{e} Δ_{G} f (e)

.

2.4. Homogeneous Spaces

A homogeneous space is a particular type of quotient manifold that arises as a smooth manifold endowed with a transitive smooth action by a Lie group G. The homogeneous space is called a G-homogeneous space to indicate the Lie group action. All G-homogeneous spaces arise as a quotient manifold

G / H

, for some closed subgroup

H \subseteq G

. H is a closed subgroup of the Lie group G, which makes H a Lie group. Any homogeneous space is diffeomorphic to the quotient space

G / G_{x}

, where

G_{x}

is the stabilizer for the point x. The dimension of the G-homogeneous space is equal to

dim G - dim H

; the quotient map

π : G \to G / H

is a smooth submersion, i.e., the differential of

π

is surjective at every point. This implies that the fibers

π^{- 1} (x)

,

x \in M

, are embedded submanifolds of G. We assume throughout that G acts on itself by left-multiplication.

Example 2.

The rotation group

SO (n)

acts transitively on

S^{n - 1}

; therefore,

S^{n - 1}

is an

SO (n)

-homogeneous space. Consider a point in

S^{- 1}

as a vector in

R^{n}

. Rotations that fix the point occur precisely in the subspace orthogonal to the vector. Thus, the stabilizer or isotropy group is the rotation group

SO (n - 1)

and

S^{n - 1} = SO (n) / SO (n - 1)

. The set of invertible matrices with positive determinant

{GL}_{+} (n)

acts on symmetric positive definite matrices

SPD (n)

. The isotropy group is the rotation group

SO (n)

, and thus,

SPD (n) = {GL}_{+} (n) / SO (n)

. A particular type of homogeneous space arises when the subgroup is a discrete subgroup of G. For example, the space

T^{n} = R^{n} / Z^{n}

defines the n-torus as a homogeneous space.

2.5. Brownian Motion on Riemannian Manifolds

The Laplacian defines Brownian motion on M as a

\frac{1}{2} Δ_{M}

-diffusion process up to its explosion time

τ

. The stochastic differential equation (SDE) for a Brownian motion

X_{t}

in local coordinates is

d X_{t}^{k} = - \frac{1}{2} g^{i j} (X_{t}) Γ_{i j}^{k} (X_{t}) d t + σ_{j}^{k} (X_{t}) d B_{t}^{j},

(5)

where

σ = \sqrt{g^{- 1}}

is the matrix square root of

g^{- 1}

.

On Lie groups, an SDE for a Brownian motion on G in terms of left-invariant vector fields takes the form

d g_{t} = - \frac{1}{2} V_{0} (g_{t}) d t + V_{i} (g_{t}) \circ d B_{t}^{i}, g_{0} = e,

(6)

where ∘ denotes integration in the Stratonovich sense. By [24] (Proposition 2.6), if the inner product is Ad(G)-invariant, then

V_{0} = 0

. The solution of (6) is conservative or non-explosive and is called the left-Brownian motion on G (see [25] and the references therein).

2.6. Brownian Bridges

In this section, we briefly review some facts on Brownian bridges on Riemannian manifolds, including Lie groups. On Lie groups, the existence of left-invariant (respectively right-invariant) vector fields allows identification of the Lie algebra with the vector space of left-invariant vector fields making the Lie group parallelizable. This allows constructing general semimartingales directly on the Lie groups.

Let

P_{x}^{t} = P_{x} |_{F_{t}}

be the measure of a Riemannian Brownian motion,

X_{t}

, at some time t started at point x. Let

p_{t}

denote the transition density of

X_{t}

so that

d P_{x}^{t} = p_{t} (x, y) d Vol (y)

with

d Vol (y)

the Riemannian volume measure. Conditioning the Riemannian Brownian motion to hit some point

v \in M

at time

T > 0

defines a Riemannian Brownian bridge. We let

P_{x, v}^{T}

denote the corresponding probability measure. The two measures are absolutely continuous (equivalent) over the time interval

[0, T)

, however mutually singular at time

t = T

. This consequence is obvious because

P_{x} (X_{T} = v) = 0

, whereas

P_{x, v}^{T} (X_{T} = v) = 1

. The corresponding Radon–Nikodym derivative is

\frac{d P_{x, v}^{T}}{d P_{x}} |_{F_{s}} = \frac{p_{T - s} (X_{s}, v)}{p_{T} (x, v)} for 0 \leq s < T

(7)

which is a martingale for

s < T

. The Radon–Nikodym derivative defines the density for the change of measure, and it provides the conditional expectation

E [F (X_{t}) | X_{T} = v] = \frac{E [p_{T - t} (X_{t}, v) F (X_{t})]}{p_{T} (x, v)},

(8)

for any bounded and

F_{s}

-measurable random variable

F (X_{s})

. The Brownian bridge is a non-homogeneous diffusion on M with infinitesimal generator

\begin{matrix} L_{s} f (z) = \frac{t}{2} Δ_{M} f (z) + t \nabla_{z} log p_{t (1 - s)} (z, v) \cdot \nabla f (z) . \end{matrix}

The bridge can be described by an SDE in the frame bundle

F M

of M. Let

U_{t}

be a lift of

X_{t} = π_{F M} (U_{t})

, and using the horizontal vector fields

H_{i}, \dots, H_{d} \in X (F M)

, we have

d U_{t} = H_{i} (U_{t}) \circ (d B_{t}^{i} + {(U_{t}^{- 1} (π_{*} (\nabla_{u | u = U_{t}}^{H} log {\tilde{p}}_{T - t} (u, v))))}^{i} d t), U_{0} = u_{0},

(9)

where

{\tilde{p}}_{t} (u, v) = p_{t} (π (u), v)

denotes the lift of the transition density, B is an

R^{d}

-valued Brownian motion, and

{(π_{F M})}_{*} : T FM \to T M

is the pushforward of the projection

π_{F M} : FM \to M

. Here,

u_{0} \in F M

is an orthonormal frame such that

π_{F M} (u_{0}) = x_{0}

.

Further types of Riemannian bridges can be found in Thompson [15]. Brownian bridges to submanifolds are here introduced by considering the transition density evaluated at a submanifold

N \subset M

by

p_{t} (x, N) : = \int_{N} p_{t} (x, y) d {Vol}_{N} (y),

(10)

where

{Vol}_{N}

denotes the volume measure on N. Conditioning on

X_{T} \in N

gives

E [f (X_{t}) | X_{T} \in N] = \frac{E [p_{T - t} (X_{t}, N) f (X_{t})]}{p_{T} (x, N)},

(11)

which holds for all bounded

F_{t}

-measurable random variables

f (X_{t})

. Fibers of homogeneous spaces are embedded submanifolds of a Lie group. We will later use this to derive a simulation scheme on homogeneous spaces by conditioning on the fibers.

The notion of Fermi bridges was also introduced in [15]. Fermi bridges have infinitesimal generator

\frac{1}{2} Δ - \frac{r_{N}}{T - t} \frac{\partial}{\partial r_{N}},

(12)

where

r_{N} (\cdot) : = d (\cdot, N) = {inf}_{y \in N} d (\cdot, y)

and

\frac{\partial}{\partial r_{N}} = \nabla d (\cdot, N)

.

2.7. One-Point Motions

Consider the homogeneous space

M = G / H

, where H is a Lie subgroup of the Lie group G, and let

π : G \to M

denote the canonical projection. Suppose that G acts on M on the left and that

g_{t}

is a process in G. This induces a process in M. For any

x \in M

, the induced process

x_{t} = g_{t} x

defines the one-point motion of

g_{t}

in M, with initial value x. The one-point motion,

X_{t} = g_{t} x

, of a Brownian motion

g_{t}

in G, started at

g_{0} = e

, is only a Brownian motion in M under certain conditions (see, e.g., [24], Proposition 2.7). In the case of a bi-invariant metric, a Brownian motion on G maps to a Brownian motion in M through its one-point motion, which, in general, is not the case. For example, the one-point processes of a G-valued Markov process might not preserve the Markov property if the metric is not bi-invariant.

In this paper, we explicitly chose metrics whose Brownian motions do not descend to Brownian motions in

G / H

. The non-invariant metrics result in processes in

G / H

with an anisotropic covariance structure. The anisotropic distributions in

G / H

will arise from non-invariant metrics, and the induced processes will, in general, not inherit the Markov property.

2.8. Pushforward Measures

Let

π : G \to M

be the projection to the homogeneous space

M = G / H

. Then,

π

is a measurable map, and if

μ

is a measure on G, the pushforward of

μ

by

π

, defined by

π_{*} μ (B) = μ (π^{- 1} (B))

, for all measurable subsets

B \subseteq M

, is a measure on M. A numerical example is provided in Figure 1, showing anisotropic distributions on the homogeneous space

S^{2}

obtained from pushing forward Brownian motions of a non-invariant metric on the top space

SO (3)

.

The Riemannian volume measure

{Vol}_{G}

on G decomposes into a product measure consisting of the volume measure on fibers in G, e.g.,

π^{- 1} (z)

, and the volume measure on their horizontal complement, i.e.,

d {Vol}_{G} = d {Vol}_{π^{- 1} (z)} d {Vol}_{| H} (z)

, where

d {Vol}_{| H}

is the horizontal restriction of the volume measure in G. The measure of a process

g_{t}

on G pushes forward to M, and we denote the corresponding density with respect to the volume measure on M for

p_{t}^{M}

. Then,

p_{t}^{M} (x) = \int_{π^{- 1} (x)} p_{t}^{G} (g_{0}, y) d {Vol}_{π^{- 1} (z)} (y)

.

Lemma 1.

Let

g_{t}

be a Markov process on G, started at

g_{0} \in G

, with density

p_{t}^{G} (g_{0}, \cdot)

, and let

X_{t} = π (g_{t})

. The conditional expectation on M satisfies

E [f (X) | X_{T} = v] = E [f (X) \frac{p_{T - t}^{M} (X_{t}, v)}{p_{T}^{M} (x_{0}, v)}],

for all bounded, continuous, and non-negative

F_{t}

-measurable f on M. Furthermore,

E [\tilde{f} (g) | g_{T} \in N] = E [f (X) | X_{T} = v],

where

\tilde{f} = f \circ π

.

Proof.

Let f be a bounded, continuous, and non-negative measurable function on M, and let

\tilde{f} = f \circ π

. Then, it follows directly from (7) and (11) that

\begin{matrix} E [\tilde{f} (g) | g_{T} \in N] = & E [f (π (g_{t})) \frac{p_{T - t}^{G} (g_{t}, N)}{p_{T}^{G} (g_{0}, N)}] = E [f (π (g_{t})) \frac{p_{T - t}^{G} (g_{t}, π^{- 1} (v))}{p_{T}^{G} (g_{0}, π^{- 1} (v))}] \\ = & E [f (π (g_{t})) \frac{π_{*} p_{T - t}^{G} (g_{t}, v)}{π_{*} p_{T}^{G} (g_{0}, v)}] = E [f (X_{t}) \frac{p_{T - t}^{M} (X_{t}, v)}{p_{T}^{M} (x_{0}, v)}] . \end{matrix}

□

3. Simulation of Bridges on Lie Groups

In this section, we consider the task of simulating (6) conditioned to hit

v \in G

, at time

T > 0

. The potentially intractable transition density for the solution of (6) inhibits simulation directly from the bridge SDE (9). Instead, we propose to add a guiding term mimicking that of Delyon and Hu [5], i.e., the guiding term becomes the gradient of the distance to v divided by the time to arrival. The SDE for the guided diffusion becomes

d Y_{t} = - \frac{1}{2} V_{0} (Y_{t}) d t + V_{i} (Y_{t}) \circ (d B_{t}^{i} - \frac{{(\nabla_{{y |}_{y = Y_{t}}} d {(y, v)}^{2})}^{i}}{2 (T - t)} d t), Y_{0} = e,

(13)

where

d (\cdot, v)

denotes the Riemannian distance to v. Note that we can always, for convenience, take the initial value to be the identity e. Equation (13) can equivalently be written as

d Y_{t} = - \frac{1}{2} V_{0} (Y_{t}) d t + V_{i} (Y_{t}) \circ (d B_{t}^{i} - \frac{{Log}_{Y_{t}} {(v)}^{i}}{T - t} d t), Y_{0} = e,

where

{Log}_{p}

is the inverse of the Riemannian exponential map

{Exp}_{p}

. Figure 2 illustrates one sample path of the simulation scheme in (13) on the Lie group

SO (3)

. The corresponding axis-angle representation is visualized in Figure 3.

The guiding term in (13) is identical to the guiding term described in [4,7]. In [7], the guided processes used the frame bundle of M. In the Lie group setting, since Lie groups are parallelizable, the use of the frame bundle is not needed: the invariant vector fields

V_{i}

provide a frame of reference at all points of G.

Numerical computations of the Lie group exponential map are often computationally efficient; see [23] and the references therein for efficient algorithms. Therefore, by a change of measures argument, the equation above can be expressed in terms of the inverse of the Lie group exponential:

d Y_{t} = - \frac{1}{2} V_{0} (Y_{t}) d t + V_{i} (Y_{t}) \circ (d {\bar{B}}_{t}^{i} - \frac{{log}_{Y_{t}} {(v)}^{i}}{T - t} d t)

(14)

Y_{0} = e

, where

\bar{B}

is a Brownian motion under a new measure, say

\bar{P}

. The measure

\bar{P}

can explicitly be expressed as

\frac{d \bar{P}}{d P} |_{F_{t}} = exp [- \int_{0}^{t} H_{v} (s, Y_{s}) - \frac{1}{2} \int_{0}^{t} \frac{{∥ {log}_{Y_{s}} (v) - {Log}_{Y_{s}} (v) ∥}_{Y_{s}}^{2}}{{(T - s)}^{2}} d s],

where

P

denotes the law of the SDE in (13) and

H_{v} (t, Y_{t}) = {〈\frac{({log}_{Y_{t}} (v) - {Log}_{Y_{t}} (v))}{T - t}, V (Y_{t}) d B_{t}〉}_{Y_{t}} .

Note that when the metric is bi-invariant, the group logarithm and the Riemannian logarithm coincide.

3.1. Radial Process

Below, we investigate the relation between the bridge measure and the above simulation schemes. Let

r_{v} (\cdot) : = d (\cdot, v)

be the distance to v such that

r_{v} (g_{t})

is the radial process. Due to the singularities of the radial process on

Cut (v) \cup {v}

, the usual Itô’s formula only applies on subsets away from the cut-locus. The extension beyond the cut-locus of a Brownian motion’s radial process was due to Kendall [26]. Barden and Le [27,28] generalized the result to M-valued semimartingales. The radial process of the Brownian motion (6) is given by

r_{v} (g_{t}) = r_{v} {(g_{0})}^{2} + \int_{0}^{t} {〈\nabla_{g_{s}} r_{v} (g_{s}), V (g_{s}) d B_{s}〉}_{g_{s}} + \frac{1}{2} \int_{0}^{t} Δ_{G} r_{v} (g_{s}) d s - L_{s}^{v} (g),

(15)

where

L^{v}

is the geometric local time of the cut-locus

Cut (v)

, which is a non-decreasing continuous random functional increasing only when g is in

Cut (v)

(see [26,27,28]). Let

W_{t} : = \int_{0}^{t} 〈\frac{\partial}{\partial r}, V_{i} (g_{s})〉 d B_{s}^{i}

, which is the local-martingale part in the above equation. The quadratic variation of

W_{t}

satisfies

d {[W, W]}_{t} = d t

by the orthonormality of

{V_{1}, \dots, V_{d}}

; thus,

W_{t}

is a Brownian motion by Levy’s characterization theorem. From the stochastic integration by parts formula and (15), the squared radial process of g satisfies

r_{v} {(g_{t})}^{2} = r_{v} {(g_{0})}^{2} + 2 \int_{0}^{t} r_{v} (g_{s}) d W_{s} + \int_{0}^{t} r_{v} (g_{s}) Δ_{G} r_{v} (g_{s}) d s - 2 \int_{0}^{t} r (g_{s}) d L_{s}^{v},

(16)

where

d L_{s}^{v}

is the random measure associated with

L_{s}^{v} (X)

.

Similarly, we obtain an expression for the squared radial process of Y. The radial process becomes

r_{v}^{2} (g_{t}) = r_{v} {(g_{0})}^{2} + 2 \int_{0}^{t} r_{v} (g_{s}) d W_{s} + \int_{0}^{t} \frac{1}{2} Δ_{G} r_{v} {(g_{s})}^{2} d s - \int_{0}^{t} \frac{r_{v} {(g_{s})}^{2}}{T - s} d s - 2 \int_{0}^{t} r_{v} (g_{s}) d L_{s}^{v} .

(17)

Imposing a growth condition on the radial process yields an

L^{2}

-bound on the radial process of the guided diffusion, [15]. Therefore, assume there exist constants

ν \geq 1

and

λ \in R

such that

\frac{1}{2} Δ_{G} r_{v}^{2} \leq ν + λ r_{v}^{2}

on

D \ Cut (v)

, for every regular domain

D \subseteq G

. Then, (17) satisfies

E [1_{t < τ_{D}} r_{v} {(Y_{t})}^{2}] \leq (r_{v}^{2} (e) + ν t (\frac{t}{T - t})) {(\frac{T - t}{t})}^{2} e^{λ t},

(18)

where

τ_{D}

is the first exit time of Y from the domain D.

3.2. Girsanov Change of Measure

Let

B_{t}

be a d-dimensional Brownian motion defined on a filtered probability space

(Ω, F, {(F_{s})}_{s \geq 0}, P)

, and let

g_{t}

be a solution of (6). The process

\frac{\nabla r_{v} {(g_{t})}^{2}}{2 (T - t)}

is an adapted process. As

g_{t}

is non-explosive, we see that

\int_{0}^{t} {∥\frac{\nabla r_{v} {(g_{s})}^{2}}{2 (T - s)}∥}^{2} d s = \int_{0}^{t} \frac{r_{v} {(g_{s})}^{2}}{{(T - s)}^{2}} d s \leq C,

(19)

for every

0 \leq t < T

, almost surely, and for some fixed constant

C > 0

. Define a new measure

Q

by

Z_{t} : = \frac{d Q}{d P} |_{F_{t}} (g) = exp [- \int_{0}^{t} 〈\frac{\nabla r_{v} {(g_{s})}^{2}}{2 (T - s)}, V (g_{t}) d B_{s}〉 - \frac{1}{2} \int_{0}^{t} \frac{r_{v} {(g_{s})}^{2}}{{(T - s)}^{2}} d s] .

(20)

From (19), the process

Z_{t}

is a martingale, for

t \in [0, T)

, and

Q_{t}

defines a probability measure on each

F_{t}

absolutely continuous with respect to

P

. By Girsanov’s theorem (see, e.g., ([29], Theorem 8.1.2)), we obtain a new process

b_{s}

, which is a Brownian motion under the probability measure

Q

. Moreover, under the probability

Q

, Equation (6) becomes

d Y_{t} = - \frac{1}{2} V_{0} (Y_{t}) d t + V_{i} (Y_{t}) \circ (d b_{t}^{i} - \frac{r_{v} (Y_{t})}{T - t} {(\frac{\partial}{\partial r_{v}})}^{i} d t),

(21)

where

{(\frac{\partial}{\partial r})}^{i}

is the i’th component of the unit radial vector field in the direction of v. The squared radial vector field is smooth away from

Cut (v)

, and we set it to zero on

Cut (v)

. Away from

Cut (v)

, the squared radial vector field is

2 {Log}_{v}

. The added drift term acts as a guiding term, which pulls the process toward v at time

T > 0

.

From (20), we see that

E [f (Y_{t})] = E [f (g_{t}) Z_{t}]

. Using (16), we equivalently write

E [f (Y_{t}) φ_{t}] = E [f (X_{t}) ψ_{t}]

, with

ψ_{t, v} : = exp [\frac{- r_{v}^{2} (g_{t})}{2 (T - t)}] φ_{t, v} : = exp [\int_{0}^{t} \frac{r_{v} (Y_{s})}{T - s} (d A_{s}^{v} + d L_{s}^{v})],

(22)

where

θ_{v}

denotes the Jacobian determinant of

{Exp}_{v}

(see, e.g., [30]),

d A_{s}^{v} = \frac{\partial}{\partial r_{v}} log θ_{v}^{- 1 / 2} (Y_{s}) d s

is a random measure supported on

G \ Cut (v)

, and

d L_{s}^{v}

is the geometric local time at

Cut (v)

.

3.3. Delyon and Hu in Lie Groups

We can now generalize the result of Delyon and Hu ([5], Theorem 5) to the Lie group setting. The result here for Lie groups is analogous to the Riemannian setting as covered in [7].

Theorem 1.

Let

g_{t}

be a solution of (6). The SDE (13) yields a strong solution on

[0, T)

and satisfies

{lim}_{t ↑ T} Y_{t} = v

, almost surely. Moreover, the conditional expectation of g given

g_{T} = v

is

E [f (g) | g_{T} = v] = lim_{t ↑ T} \frac{E [f (Y) φ_{t, v}]}{E [φ_{t, v}]},

(23)

for every

F_{t}

-measurable non-negative function f on G, for

t \in [0, T)

, where

φ_{t}

is given in (22).

When the geometry of G is particularly simple, the equivalence of measures hold on

[0, T]

; see [7]. For example, in the case of G being simply connected:

Corollary 1.

When G is simply connected, (23) becomes

E [f (g) | g_{T} = v] = C E [f (Y) φ_{T, v}],

(24)

where

C > 0

is a constant, which depends on the initial point, the time

T > 0

, and the curvature in the radial direction.

4. Simulation of Bridges in Homogeneous Spaces

We now turn to bridge simulation in homogeneous spaces by sampling bridges in G conditioned on the fiber over

v \in M = G / H

. We simulated in the top space the Lie group G and, subsequently, projected to the homogeneous space M. Inspired by Fermi bridges to submanifolds, we guided toward the closest point in the fiber.

Guiding to the Closest Point

Recall that the projection

π : G \to G / H

is a submersion; hence, the fibers

π^{- 1} (x)

are embedded submanifolds of G. From Lemma 1, we obtain a conditional expectation in M by conditioning on the fiber in the Lie group. The corresponding SDE for the Fermi bridge in the Lie group setting is given by

d Y_{t} = - \frac{1}{2} V_{0} (Y_{t}) d t + V_{i} (Y_{t}) \circ (d B_{t}^{i} - \frac{{(\nabla_{y | y = Y_{t}} d {(y, N)}^{2})}^{i}}{2 (T - t)} d t), Y_{0} = e,

(25)

where

d (x, N) : = {inf}_{z \in N} d (x, z)

and

N : = π^{- 1} (v)

, for some

v \in M

.

The one-point motion conditioned on

v \in M

corresponds to conditioning

g_{t}

on the fiber

N : = π^{- 1} (v)

, and we can use Fermi bridges directly. Because N is an embedded submanifold of G, we obtain from Thompson [15] that

φ_{t, N}

is of the form

φ_{t, N} : = exp [\int_{0}^{t} \frac{r_{N} (Y_{s})}{T - s} (d A_{s}^{N} + d L_{s}^{N})],

(26)

where

d A_{s}^{N} = \frac{\partial}{\partial r_{N}} log Θ_{N}^{- 1 / 2} (Y_{s}) d s

and

Θ_{N} = θ_{N} \circ {({Exp |}_{Log (M ∖ Cut (N))})}^{- 1}

. Similar to the single-point case, we obtain

E [f (X) | X_{T} \in N] = lim_{t ↑ T} \frac{E [f (Y) φ_{t, N}]}{E [φ_{t, N}]},

for any bounded measurable function f. Again, there are various situations where it can be justified to take the limit inside. See the discussion in [30], Appendix C.

5. Maximum Likelihood Estimation

A Brownian motion depends both on its starting point and the underlying Riemannian metric. We can consider both parameters of the model and, given the data, seek to estimate the parameters by the maximum likelihood (MLE). The resulting optimal starting point will in this case be a diffusion mean [31]. As visualized in Figure 1, the pushforward measure of a Brownian motion generated by a non-invariant metric induces distributions on the quotient space, and these distributions will be anisotropic if the metric on the top space is not bi-invariant. Here, we describe a setting for estimation of the underlying metric by the maximum likelihood.

Consider i.i.d. observations

y^{1}, \dots, y^{n}

on G or

G / H

. Let

p_{t} (\cdot | θ)

and

π_{*} p_{t} (\cdot | θ)

be the densities of Brownian motions with parameters

θ = (g, A)

, where g represents the starting point and A the metric tensor at g. The inverse

A^{- 1} = Σ

can be thought of as the covariance of the model. We obtain a likelihood as

L (θ | y^{1}, \dots, y^{n}; T) = \prod_{i = 1}^{n} p_{T} (y^{i} | θ),

(27)

and, similarly,

π_{*} L = \prod π_{*} p

.

The bridge sampling scheme introduced above yields approximations of the intractable transition densities in (27). In the d-dimensional Euclidean case, importance sampling yields the estimate [9]

p_{T} (u, v) = \sqrt{\frac{det (A (T, v))}{{(2 π T)}^{d}}} e^{- \frac{{∥ u - v ∥}_{A}^{2}}{2 T}} E [φ_{T, v}],

where

{∥ x ∥}_{A} = x^{T} A (0, u) x

. Thus, from the output of the importance sampling, we obtain an estimate of the transition density. Similar to the Euclidean case, we here obtain an expression for the heat kernel

p_{T} (e, v)

as

p_{T} (e, v) = q (T, e) E [φ_{T, v}]

, where

q (T, g) = \sqrt{\frac{det A (v)}{{(2 π T)}^{d}}} exp (- \frac{d {(g, v)}^{2}}{2 T}) = \sqrt{\frac{det A (T, v)}{{(2 π T)}^{d}}} exp (- \frac{{∥ {Log}_{g} (v) ∥}_{A}^{2}}{2 T}),

(28)

where the equality holds almost everywhere and

A \in {Sym}^{+} (g)

denotes the metric

A (e) : = A (0, e)

. The

{Log}_{g}

map in (28) is the Riemannian inverse exponential map

{({Exp}_{g})}^{- 1}

.

Algorithm 1 provides a detailed description of the iterative MLE approach. Visual examples of the iterative MLE can be found in Figures 4 and 6.

Algorithm 1: Parameter estimation: iterative MLE.

6. Experiments

In this section, we present numerical results of bridge sampling on specific Lie groups and homogeneous spaces: the three-dimensional rotation group

SO (3)

and the general linear group of invertible matrices with positive determinant

{GL}_{+} (3)

. Exploiting the bridge sampling scheme described above, we show below how to estimate the true underlying metric on

SO (3)

with iterative maximum likelihood estimation. This estimation, in turn, allows finding the parameters of the anisotropic pushforward distributions as displayed in Figure 1.

The space of the symmetric positive definite matrices

SPD (n)

is an example of a non-linear space in which geometric data appear in many applications. The space

SPD (3)

can be obtained as the homogeneous space

{GL}_{+} (3) / SO (3)

, where

{GL}_{+}

is the space of invertible matrices with a positive determinant.

6.1. Discretization

We numerically approximate the Stratonovich integrals by the Euler–Heun scheme. With a time discretization

t_{1}, \dots, t_{k}

,

t_{k} - t_{k - 1} = Δ t

and corresponding noise

Δ B_{t_{i}} \sim N (0, Δ t)

, the numerical approximation of the Brownian motion (6) takes the form

x_{t_{k + 1}} = x_{t_{k}} - \frac{1}{2} \sum_{j, i} C_{i j}^{j} V_{i} (x_{t_{k}}) Δ t + \frac{v_{t_{k + 1}} + V_{i} (v_{t_{k + 1}} + x_{t_{k}}) Δ B_{t_{k}}^{i}}{2}

(29)

where

v_{t_{k + 1}} = V_{i} (x_{t_{k}}) Δ B_{t_{k}}^{i}

is used only as an intermediate value in integration. For the guided bridge simulations, we add the corresponding drift to (29) to obtain the numerical scheme.

6.2. Importance Sampling and Metric Estimation on SO(3)

This section takes G as the special orthogonal group

SO (3)

, the space of three-dimensional rotation matrices. The special orthogonal group is a compact connected matrix Lie group. The rotation group

SO (3)

is a semi-simple Lie group, and bi-invariant inner products exist. In the case of a bi-invariant metric, the Riemannian exponential map Exp coincides with the Lie group exponential map exp, and thus, the Riemannian distance function

d {(R, I)}^{2} = {∥ {Log}_{I} (R) ∥}^{2}

, from the rotation R to the identity I, satisfies

\nabla_{R} d {(R, I)}^{2} = 2 log (R)

.

Figure 2 illustrates the numerical approximation with a sample path from the guided diffusion conditioned to hit the rotation represented by the black vectors. Another way of visualizing the guided bridge on the rotation group

SO (3)

is through the angle–axis representation. Figure 3 represents a guided process on

SO (3)

by presenting the axis representation on

S^{2}

and its corresponding angle of rotation.

Figure 4 illustrates how importance sampling on

SO (3)

leads to a metric estimation of the underlying unknown metric, which generated the Brownian motion. We sampled 128 points as endpoints of a Brownian motion from the metric

diag (0.2, 0.2, 0.8)

, and used 20 time steps to sample four bridges per observation. An iterative MLE method using gradient descent with a learning rate of

0.2

and initial guesses of the metric

diag (1, 1, 1)

and

diag (0.5, 0.5, 0.5)

yielded convergence to the true metric.

6.3. Diffusion Mean Estimation on the Space of Symmetric Positive Definite Matrices

The space of symmetric positive definite (SPD) matrices is used in a range of applications, one example being diffusion tensor imaging where the element of

SPD (3)

models the anisotropic diffusion of water molecules in each position of the imaged domain. The SPD matrices constitute a homogeneous space

{GL}_{+} (n) / SO (n)

of invertible matrices with the positive determinants’ quotient the rotation group.

Figure 5 illustrates discrete-time observations of three sample paths of a guided bridge in

SPD (3)

.

In Figure 6, the bridge sampling scheme derived above is used to obtain an estimate of the diffusion mean [31,32] on

SPD (3)

by sampling guided bridge processes in the space of invertible matrices with positive determinants

{GL}_{+} (3)

. This sampling method provides an estimate of the density on

{GL}_{+} (3)

, which projects to a density in

SPD (3)

. Exploiting the resulting density in

SPD (3)

, the iterative MLE yields convergence to the diffusion mean.

6.4. Density Estimation on the Two-Sphere

The two-sphere

S^{2}

can be considered the homogeneous space

SO (3) / SO (2)

of three-dimensional rotations, identifying the subgroup of two-dimensional rotations as a single point. Conditioning on the fiber

SO (2)

in

SO (3)

, we obtain guided bridges on

S^{2}

. In the case of a bi-invariant metric on G, the G-valued Brownian motion pushes forward to an M-valued Brownian motion. Figure 1b illustrates the estimated transition density on

S^{2}

from sampling bridges in the Lie group conditioned on the fiber

SO (2)

, when the underlying metric is bi-invariant. When altering the metric to a non-invariant variant one, the G-Brownian motion does not in general push forward to an M-Brownian motion. The non-invariant metrics result in a covariance structure exhibiting anisotropy, which is illustrated by Figure 1c,d.

7. Conclusions

In this paper, we presented algorithms for estimating the parameters of a class of densities that are the generalization of the Euclidean normal density to Lie groups and homogeneous spaces. We used the heat equation to generalize the normal distribution to Lie groups and homogeneous spaces, where the left- (or right-) invariant metric generalizes the notion of the covariance of a normal density. We presented algorithms for bridge simulation and for estimating the metric given the i.i.d. Lie group or homogeneous space-valued samples. The estimation algorithm was based on Monte Carlo simulations of Brownian bridges. These algorithms are expected to impact many diverse fields, including bioinformatics, medical imaging, shape analysis, computer vision, and information geometry, where Lie groups or homogeneous spaces are the natural model spaces for data samples.

8. Code

The code used for the experiments is available in the Theano Geometry http://bitbucket.org/stefansommer/theanogeometry and Jax Geometry http://bitbucket.org/stefansommer/jaxgeometry software packages (accessed on 7 July 2022). The implementation uses automatic differentiation libraries extensively for the geometry computations, as is further described in [33].

Author Contributions

Conceptualization, S.J. and S.S.; methodology, all authors; software, M.H.J. and S.S.; writing, all authors. All authors have read and agreed to the published version of the manuscript.

Funding

The work presented is supported by the CSGB Centre for Stochastic Geometry and Advanced Bioimaging funded by a grant from the Villum Foundation, the Villum Foundation Grants 22924 and 40582, the Novo Nordisk Foundation Grant NNF18OC0052000, and the National Science Foundation Grant DMS-1912030.

Conflicts of Interest

The authors declare no conflict of interest.

References

Pedersen, A.R. Consistency and asymptotic normality of an approximate maximum likelihood estimator for discretely observed diffusion processes. Bernoulli 1995, 1, 257–279. [Google Scholar] [CrossRef]
Bladt, M.; Sørensen, M. Simple simulation of diffusion bridges with application to likelihood inference for diffusions. Bernoulli 2014, 20, 645–675. [Google Scholar] [CrossRef]
Bladt, M.; Finch, S.; Sørensen, M. Simulation of multivariate diffusion bridges. J. R. Stat. Soc. Ser. B Stat. Methodol. 2016, 78, 343–369. [Google Scholar] [CrossRef]
Bui, M.N.; Pokern, Y.; Dellaportas, P. Inference for partially observed Riemannian Ornstein–Uhlenbeck diffusions of covariance matrices. arXiv 2021, arXiv:2104.03193. [Google Scholar]
Delyon, B.; Hu, Y. Simulation of Conditioned Diffusion and Application to Parameter Estimation. Stoch. Process. Their Appl. 2006, 116, 1660–1675. [Google Scholar] [CrossRef]
Jensen, M.H.; Mallasto, A.; Sommer, S. Simulation of Conditioned Diffusions on the Flat Torus. In Proceedings of the International Conference on Geometric Science of Information, Toulouse, France, 27–29 August 2019; Springer: Berlin/Heidelberg, Germany, 2019; pp. 685–694. [Google Scholar]
Jensen, M.H.; Sommer, S. Simulation of Conditioned Semimartingales on Riemannian Manifolds. arXiv 2021, arXiv:2105.13190. [Google Scholar]
Van der Meulen, F.; Schauer, M. Bayesian estimation of discretely observed multi-dimensional diffusion processes using guided proposals. Electron. J. Stat. 2017, 11, 2358–2396. [Google Scholar] [CrossRef]
Papaspiliopoulos, O.; Roberts, G. Importance sampling techniques for estimation of diffusion models. Stat. Methods Stoch. Differ. Equ. 2012, 311–340. [Google Scholar]
Schauer, M.; Van Der Meulen, F.; Van Zanten, H. Guided proposals for simulating multi-dimensional diffusion bridges. Bernoulli 2017, 23, 2917–2950. [Google Scholar] [CrossRef]
Sommer, S.; Arnaudon, A.; Kuhnel, L.; Joshi, S. Bridge Simulation and Metric Estimation on Landmark Manifolds. In Proceedings of the Graphs in Biomedical Image Analysis, Computational Anatomy and Imaging Genetics, Lecture Notes in Computer Science, Quebec, QC, Canada, 10–14 September 2017; Springer: Berlin/Heidelberg, Germany, 2017; pp. 79–91. [Google Scholar]
Bui, M.N. Inference on Riemannian Manifolds: Regression and Stochastic Differential Equations. Ph.D. Thesis, UCL (University College London), London, UK, 2022. [Google Scholar]
Fisher, R. Dispersion on a Sphere. Proc. R. Soc. Lond. A Math. Phys. Eng. Sci. 1953, 217, 295–305. [Google Scholar] [CrossRef]
Kent, J.T. The Fisher-Bingham Distribution on the Sphere. J. R. Stat. Soc. Ser. B (Methodol.) 1982, 44, 71–80. [Google Scholar] [CrossRef]
Thompson, J. Brownian bridges to submanifolds. Potential Anal. 2018, 49, 555–581. [Google Scholar] [CrossRef]
García-Portugués, E.; Sørensen, M.; Mardia, K.V.; Hamelryck, T. Langevin Diffusions on the Torus: Estimation and Applications. Stat. Comput. 2017, 29, 1–22. [Google Scholar] [CrossRef]
Hamelryck, T.; Kent, J.T.; Krogh, A. Sampling Realistic Protein Conformations Using Local Structural Bias. PLoS Comput. Biol. 2006, 2, e131. [Google Scholar] [CrossRef] [PubMed]
Pennec, X.; Fillard, P.; Ayache, N. A Riemannian Framework for Tensor Computing. Int. J. Comput. Vis. 2006, 66, 41–66. [Google Scholar] [CrossRef]
Vaillant, M.; Miller, M.; Younes, L.; Trouvé, A. Statistics on Diffeomorphisms via Tangent Space Representations. NeuroImage 2004, 23, S161–S169. [Google Scholar] [CrossRef]
Yang, L. Means of Probability Measures in Riemannian Manifolds and Applications to Radar Target Detection. Ph.D. Thesis, Poitiers University, Poitiers, France, 2011. [Google Scholar]
Grenander, U. Probabilities on Algebraic Structures; Wiley: New York, NY, USA/London, UK, 1963. [Google Scholar]
Jensen, M.H.; Joshi, S.; Sommer, S. Bridge Simulation and Metric Estimation on Lie Groups. In Proceedings of the Geometric Science of Information; Lecture Notes in Computer Science; Nielsen, F., Barbaresco, F., Eds.; Springer International Publishing: Cham, Switzerland, 2021; pp. 430–438. [Google Scholar] [CrossRef]
Pennec, X.; Sommer, S.; Fletcher, T. Riemannian Geometric Statistics in Medical Image Analysis; Elsevier: Amsterdam, The Netherlands, 2020. [Google Scholar]
Liao, M. Lévy Processes in Lie Groups; Cambridge University Press: Cambridge, UK, 2004; Volume 162. [Google Scholar]
Shigekawa, I. Transformations of the Brownian motion on a Riemannian symmetric space. In Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete; 1984; pp. 493–522. [Google Scholar]
Kendall, W.S. The radial part of Brownian motion on a manifold: A semimartingale property. Ann. Probab. 1987, 15, 1491–1500. [Google Scholar] [CrossRef]
Barden, D.; Le, H. Some consequences of the nature of the distance function on the cut locus in a Riemannian manifold. J. LMS 1997, 56, 369–383. [Google Scholar] [CrossRef]
Le, H.; Barden, D. Itô correction terms for the radial parts of semimartingales on manifolds. Probab. Theory Relat. Fields 1995, 101, 133–146. [Google Scholar] [CrossRef]
Hsu, E.P. Stochastic Analysis on Manifolds; AMS: Providence, RI, USA, 2002; Volume 38. [Google Scholar]
Thompson, J. Submanifold Bridge Processes. Ph.D. Thesis, University of Warwick, Coventry, UK, 2015. [Google Scholar]
Hansen, P.; Eltzner, B.; Huckemann, S.F.; Sommer, S. Diffusion Means in Geometric Spaces. arXiv 2021, arXiv:2105.12061. [Google Scholar]
Hansen, P.; Eltzner, B.; Sommer, S. Diffusion Means and Heat Kernel on Manifolds. arXiv 2021, arXiv:2103.00588. [Google Scholar]
Kühnel, L.; Sommer, S.; Arnaudon, A. Differential Geometry and Stochastic Dynamics with Deep Learning Numerics. Appl. Math. Comput. 2019, 356, 411–437. [Google Scholar] [CrossRef]

Figure 1. The two leftmost plots visualize the transition densities of (a) a Fisher–Bingham–Kent distribution and (b) the pushforward density of a Brownian motion to

S^{2}

with a bi-invariant metric. The pushforward measure of a Brownian motion on

SO (3)

to the sphere

S^{2}

results in anisotropic distributions on

S^{2}

when the metric on

SO (3)

is not bi-invariant, for (c)

T = 0.5

and (d)

T = 1.0

. The coloring indicates the density of the pushforward (different color scheme for each subfigure).

Figure 1. The two leftmost plots visualize the transition densities of (a) a Fisher–Bingham–Kent distribution and (b) the pushforward density of a Brownian motion to

S^{2}

with a bi-invariant metric. The pushforward measure of a Brownian motion on

SO (3)

to the sphere

S^{2}

results in anisotropic distributions on

S^{2}

when the metric on

SO (3)

is not bi-invariant, for (c)

T = 0.5

and (d)

T = 1.0

. The coloring indicates the density of the pushforward (different color scheme for each subfigure).

Figure 2. One sample path of the guided bridge process (13) on

SO (3)

visualized by its action on the basis vectors (red, blue, green) of

R^{3}

. The bridge is conditioned on the rotation indicated by the black arrows.

Figure 2. One sample path of the guided bridge process (13) on

SO (3)

visualized by its action on the basis vectors (red, blue, green) of

R^{3}

. The bridge is conditioned on the rotation indicated by the black arrows.

Figure 3. Angle–axis representation of the guided bridge defined by (13). (Left) The projection of the path in

SO (3)

to

S^{2}

. The trajectory on

S^{2}

corresponds to the motion of the tip of the blue vector, as seen in Figure 2. (Right) The angle representation of the guided bridge in

SO (3)

.

Figure 3. Angle–axis representation of the guided bridge defined by (13). (Left) The projection of the path in

SO (3)

to

S^{2}

. The trajectory on

S^{2}

corresponds to the motion of the tip of the blue vector, as seen in Figure 2. (Right) The angle representation of the guided bridge in

SO (3)

.

Figure 4. The importance sampling technique applies to estimate the metric on the Lie group

SO (3)

. Sampling a Brownian motion from an underlying unknown metric, we obtain convergence to the true underlying metric using an iterative MLE method. Here, we sampled four guided bridges per observation, providing a relatively smooth iterative likelihood. (Top left) Estimation of the unknown underlying metric using bridge sampling, starting from the metric

diag (1, 1, 1)

. Here, the true metric is the diagonal matrix

diag (0.2, 0.2, 0.8)

represented by the red lines. The diagonal is represented by the colors diag (purple, blue, yellow). (Top right) The corresponding log-likelihood evolution through the iterations. (Bottom left) Estimation of the unknown underlying metric using bridge sampling, starting from the metric

diag (0.5, 0.5, 0.5)

. (Bottom right) The corresponding iterative log-likelihood.

Figure 4. The importance sampling technique applies to estimate the metric on the Lie group

SO (3)

. Sampling a Brownian motion from an underlying unknown metric, we obtain convergence to the true underlying metric using an iterative MLE method. Here, we sampled four guided bridges per observation, providing a relatively smooth iterative likelihood. (Top left) Estimation of the unknown underlying metric using bridge sampling, starting from the metric

diag (1, 1, 1)

. Here, the true metric is the diagonal matrix

diag (0.2, 0.2, 0.8)

represented by the red lines. The diagonal is represented by the colors diag (purple, blue, yellow). (Top right) The corresponding log-likelihood evolution through the iterations. (Bottom left) Estimation of the unknown underlying metric using bridge sampling, starting from the metric

diag (0.5, 0.5, 0.5)

. (Bottom right) The corresponding iterative log-likelihood.

Figure 5. Discrete-time observations from three sample paths on

SPD (3)

. The sample paths are obtained as the pushforward of the Fermi bridge in

{GL}_{+} (3)

. The start and endpoint are the left- and rightmost figures, where the SPD matrices are indicated by the bold face arrows.

Figure 5. Discrete-time observations from three sample paths on

SPD (3)

. The sample paths are obtained as the pushforward of the Fermi bridge in

{GL}_{+} (3)

. The start and endpoint are the left- and rightmost figures, where the SPD matrices are indicated by the bold face arrows.

Figure 6. Given 256 data points in

SPD (3)

, we estimated the diffusion mean on the homogeneous space by sampling bridges in the top space conditioned on the fibers. The iterative MLE in Algorithm 1 yielded the convergence of the diffusion mean parameter, using a learning rate of

0.005

and one bridge sample per observation. (Left) The purple, blue, and yellow lines correspond to the diagonal of the metric matrix, whereas the remaining colors represent the off-diagonal. The true mean value is the identity matrix indicated by the red lines. (Right) The corresponding log-likelihood evolution through the iterations.

Figure 6. Given 256 data points in

SPD (3)

, we estimated the diffusion mean on the homogeneous space by sampling bridges in the top space conditioned on the fibers. The iterative MLE in Algorithm 1 yielded the convergence of the diffusion mean parameter, using a learning rate of

0.005

and one bridge sample per observation. (Left) The purple, blue, and yellow lines correspond to the diagonal of the metric matrix, whereas the remaining colors represent the off-diagonal. The true mean value is the identity matrix indicated by the red lines. (Right) The corresponding log-likelihood evolution through the iterations.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jensen, M.H.; Joshi, S.; Sommer, S. Discrete-Time Observations of Brownian Motion on Lie Groups and Homogeneous Spaces: Sampling and Metric Estimation. Algorithms 2022, 15, 290. https://doi.org/10.3390/a15080290

AMA Style

Jensen MH, Joshi S, Sommer S. Discrete-Time Observations of Brownian Motion on Lie Groups and Homogeneous Spaces: Sampling and Metric Estimation. Algorithms. 2022; 15(8):290. https://doi.org/10.3390/a15080290

Chicago/Turabian Style

Jensen, Mathias Højgaard, Sarang Joshi, and Stefan Sommer. 2022. "Discrete-Time Observations of Brownian Motion on Lie Groups and Homogeneous Spaces: Sampling and Metric Estimation" Algorithms 15, no. 8: 290. https://doi.org/10.3390/a15080290

APA Style

Jensen, M. H., Joshi, S., & Sommer, S. (2022). Discrete-Time Observations of Brownian Motion on Lie Groups and Homogeneous Spaces: Sampling and Metric Estimation. Algorithms, 15(8), 290. https://doi.org/10.3390/a15080290

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Discrete-Time Observations of Brownian Motion on Lie Groups and Homogeneous Spaces: Sampling and Metric Estimation

Abstract

1. Introduction

Contribution and Overview

2. Notation and Background

2.1. Euclidean Diffusion Bridges and Simulation

2.2. Riemannian Manifolds and Lie Groups

2.3. Lie Groups

2.4. Homogeneous Spaces

2.5. Brownian Motion on Riemannian Manifolds

2.6. Brownian Bridges

2.7. One-Point Motions

2.8. Pushforward Measures

3. Simulation of Bridges on Lie Groups

3.1. Radial Process

3.2. Girsanov Change of Measure

3.3. Delyon and Hu in Lie Groups

4. Simulation of Bridges in Homogeneous Spaces

Guiding to the Closest Point

5. Maximum Likelihood Estimation

6. Experiments

6.1. Discretization

6.2. Importance Sampling and Metric Estimation on SO(3)

6.3. Diffusion Mean Estimation on the Space of Symmetric Positive Definite Matrices

6.4. Density Estimation on the Two-Sphere

7. Conclusions

8. Code

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI