Abstract
We present schemes for simulating Brownian bridges on complete and connected Lie groups and homogeneous spaces. We use this to construct an estimation scheme for recovering an unknown left- or right-invariant Riemannian metric on the Lie group from samples. We subsequently show how pushing forward the distributions generated by Brownian motions on the group results in distributions on homogeneous spaces that exhibit a non-trivial covariance structure. The pushforward measure gives rise to new non-parametric families of distributions on commonly occurring spaces such as spheres and symmetric positive tensors. We extend the estimation scheme to fit these distributions to homogeneous space-valued data. We demonstrate both the simulation schemes and estimation procedures on Lie groups and homogenous spaces, including and .
1. Introduction
Bridge simulation is a data augmentation technique for generating missing trajectories of continuous diffusion processes. We consider bridge simulation on Lie groups and homogeneous spaces. As an important example, we investigate the case of an i.i.d. Lie group or homogeneous space-valued samples that are considered discrete-time observations of a continuous diffusion process. Assuming the stochastic dynamics to be Brownian motion, we wish to estimate the underlying Riemannian metric of the Lie group or homogeneous space from the samples. To evaluate and maximize the likelihood of the data, we need to account for the diffusion process being unobserved at most time points. This requires bridge sampling, and the sampling techniques are thus the key enabler for metric estimation in this setting.
The simulation of conditioned diffusion processes is a highly non-trivial problem, even in Euclidean spaces. Transition densities of diffusion processes are tractable in closed form only for a small class of processes, and hence, simulating directly from the true bridge distribution is generally infeasible. The data augmentation used in inference for diffusion processes dates back to the seminal paper by Pedersen [] almost three decades ago. Since then, several papers have studied diffusion bridge simulation methods; see, e.g., [,,,,,,,,,]. The method by Delyon and Hu [] exchanged the intractable drift term in the conditioned diffusion with a tractable drift originating from the drift of a standard Brownian bridge.
In this paper, we further extend the original idea of Delyon and Hu [] to Lie groups and homogenous spaces. Several papers have built on the ideas of Delyon and Hu. For example, a manifold equivalent drift term analogous to the drift term of a Brownian bridge in Euclidean space was used in [] to describe the simulation of Brownian bridges on the flat torus, whereas [] generalized this method to general finite-dimensional Riemannian manifolds. Reference [] used the drift to model Brownian bridges on the space of landmarks. Bui et al. [,] used a similar drift term on the space of symmetric positive definite (covariance) matrices, exploiting the exponential map, which in the space of covariance matrices is a global diffeomorphism. The present paper extends these ideas to general Lie groups and homogeneous spaces.
As an application, we consider discrete-time observations in Lie groups and homogeneous spaces regarded as incomplete observations of sample paths of Brownian motions arising from left- (or right-) invariant Riemannian metrics. The bridge simulation schemes allow interpolating between the discrete-time observations. Furthermore, we observe how varying the metric on Lie groups affects pushforwards of the Brownian motion to homogeneous spaces being quotients of the group. These distributions encode the covariance of the data resulting from the metric structure of the Lie group. We define this family of distributions and derive estimation schemes for recovering the metric structure of the group both with Lie group samples and with homogeneous space samples. One particular example is the two-sphere, . Changing the metric structure on results in anisotropic distributions on , arising as the pushforward measure from . Figure 1 illustrates the isotropic and anisotropic distributions on induced by a bi-invariant and left-invariant (not bi-invariant) metric on , respectively. The resulting distributions are analogous to the von Mises–Fisher and Fisher–Bingham–Kent distributions [,]. However, the approach is independent of the chosen embedding and uses only the geometric relation between the group and quotient space.
Figure 1.
The two leftmost plots visualize the transition densities of (a) a Fisher–Bingham–Kent distribution and (b) the pushforward density of a Brownian motion to with a bi-invariant metric. The pushforward measure of a Brownian motion on to the sphere results in anisotropic distributions on when the metric on is not bi-invariant, for (c) and (d) . The coloring indicates the density of the pushforward (different color scheme for each subfigure).
For the simulation on homogeneous spaces, we use bridges to submanifolds as developed by Thompson [] to condition on the fibers in the Lie group G over the target point for some closed subgroup . The resulting guiding term guides in the direction closest to the fiber.
Statistics on Lie groups and homogeneous spaces find applications in many diverse fields, including bioinformatics, medical imaging, shape analysis, computer vision, and information geometry; see, e.g., [,,,,]. Statistics in Euclidean spaces often rely on the distributional properties of the normal distribution. Here, we use Brownian motions and the heat equation to generalize the normal distribution to Lie groups and homogeneous spaces as introduced by Grenander []. The solution to the heat equation is the transition density of a Brownian motion. Through Monte Carlo simulations of bridges, we can estimate the transition density and maximize the likelihood with respect to the Riemannian metric.
Contribution and Overview
We present simulation schemes on Lie groups and homogeneous spaces with application to parameter estimation. We outline the necessary theoretical background for the construction of bridge simulation on Lie groups and homogeneous spaces before demonstrating how the simulation scheme leads to estimates of means and the underlying metric structure using maximum likelihood estimation on certain Lie groups and homogeneous spaces. The paper builds on and significantly extends the conference paper [], which introduced bridge simulation in the Lie group setting.
The paper is organized as follows: In Section 2, we describe the relevant background theory of Lie groups, Brownian motions, and Brownian bridges in Riemannian manifolds. Section 3 presents the theory and results of bridge sampling in Lie groups, and Section 4 introduces bridge sampling on homogeneous spaces. Section 5 covers maximum likelihood estimation of the starting point and Riemannian metric. Numerical experiments on selected Lie groups and homogeneous spaces are presented in Section 6.
2. Notation and Background
We here briefly describe simulating conditioned diffusions in Euclidean space as developed in [] before reviewing the theory on conditioned diffusions on Riemannian manifolds.
2.1. Euclidean Diffusion Bridges and Simulation
Suppose a strong solution exists to an SDE of the form
where b and satisfy certain regularity conditions and where w denotes an -valued Brownian motion. In this case, x is a Markov process, and its transition density exists. Suppose we define the function
for some . Then, it is easily derived that h is a martingale on with , and Doob’s h-transform implies that the SDE of the conditioned diffusion is given by
where . In the case that the transition density is intractable, simulation from the exact distribution is infeasible. Delyon and Hu [] suggested substituting the latter term in with a drift term of the form , which equals the drift term in a Brownian bridge. The guided process obtained by making the above substitution yields a conditioning, and one obtains
where is a likelihood function that is tractable and numerically computable, y is the guided process, and the constant depends on , v, and T.
2.2. Riemannian Manifolds and Lie Groups
Let M be a finite-dimensional smooth manifold of dimension d. M can be endowed with a Riemannian metric tensor, i.e., a family of inner products defined on each tangent space . The Riemannian metric tensor gives rise to a distance function between points in M. The tangent space is locally diffeomorphic with an open subset of M. The Riemannian exponential map provides this local diffeomorphism. On the subset of M where is a diffeomorphism, the inverse Riemannian exponential map, also called the Riemannian logarithm map, is defined. The Riemannian distance function can then be defined in terms of the Riemannian inner product as . The Riemannian logarithm map plays an important role when defining guided bridges on manifolds.
Let X be a vector field on M assigning to each point a tangent vector . A connection ∇ on a manifold is an operation that allows us to compare neighboring tangent spaces and define derivatives of vector fields along other vector fields, that is, if Y is another vector field, then is the derivative of Y along X (also known as the covariant derivative of Y along X). A connection also gives a notion of “straight lines” in manifolds, also known as geodesics. A curve is a geodesic if the vector field along is parallel to itself, i.e., if . The geodesic curves are locally length minimizing.
Generalizing the Euclidean Laplacian operator, the Laplace–Beltrami operator is defined as the divergence of the gradient, . In terms of local coordinates , the expression for the Laplace–Beltrami operator becomes
where denotes the determinant of the Riemannian metric g and are the coefficients of the inverse of g. (2) can be written as
where , , and denote the Christoffel symbols of the Riemannian metric.
2.3. Lie Groups
Let G denote a connected Lie group of dimension d, i.e., a smooth manifold with a group structure such that the group operations and are smooth maps. If , the left-multiplication map, , defined by , is a diffeomorphism from G to itself. Similarly, the right-multiplication map defines a diffeomorphism from G to itself by . Let denote the pushforward map given by . A vector field V on G is said to be left-invariant if . The space of left-invariant vector fields is linearly isomorphic to , the tangent space at the identity element . By equipping the tangent space with the Lie bracket, we can identify the Lie algebra with . The group structure of G makes it possible to define an action of G on its Lie algebra . The conjugation map , for , fixes the identity e. Its pushforward map at e, , is then a linear automorphism of . Define , then is the adjoint representation of G in . The map is the adjoint action of G on . We denote by a Riemannian metric on G. The metric is said to be left-invariant if , for every , i.e., the left-multiplication maps are isometries, for every . The metric is -invariant if , for every . Note that an -invariant metric on G is equivalent to a bi-invariant (left- and right-invariant) inner product on . The differential of the Ad map at the identity yields a linear map . This linear map is equal to the Lie bracket , .
A one-parameter subgroup of G is a continuous homomorphism . The Lie group exponential map is defined as , for , where is the unique one-parameter subgroup of G whose tangent vector at e is v. For matrix Lie groups, the exponential map has the particular form: , for a square matrix A. The resulting matrix is an invertible matrix. Given an invertible matrix B, if there exists a square matrix A such that , then A is said to be the logarithm of B. In general, the logarithm might not exist, and if it does, it may fail to be unique. In a neighborhood sufficiently close to the identity, the Lie group logarithm exists and is unique. By means of left-translation (or right-translation), the Lie group exponential map can be extended to a map , for all , defined by . Similarly, the Lie group logarithm at g becomes . The matrix exponential and logarithms can be computed numerically efficiently (see [], Chapter 5, and the references therein).
Example 1.
A few examples of Lie groups include the Euclidean space with the additive group structure, the positive real line with a multiplicative group structure, the space of invertible real matrices equipped with a multiplication of matrices forming a Lie group, and the rotation group , consisting of real orthogonal matrices with determinant one or minus one forming a subgroup of .
The identification of the space of left-invariant vector fields with the Lie algebra allows for a global description of . Indeed, let be an orthonormal basis of . Then, defines left-invariant vector fields on G and the Laplace–Beltrami operator can be written as (cf. [], Proposition 2.5)
where and denote the structure coefficients given by
By left-invariance, .
2.4. Homogeneous Spaces
A homogeneous space is a particular type of quotient manifold that arises as a smooth manifold endowed with a transitive smooth action by a Lie group G. The homogeneous space is called a G-homogeneous space to indicate the Lie group action. All G-homogeneous spaces arise as a quotient manifold , for some closed subgroup . H is a closed subgroup of the Lie group G, which makes H a Lie group. Any homogeneous space is diffeomorphic to the quotient space , where is the stabilizer for the point x. The dimension of the G-homogeneous space is equal to ; the quotient map is a smooth submersion, i.e., the differential of is surjective at every point. This implies that the fibers , , are embedded submanifolds of G. We assume throughout that G acts on itself by left-multiplication.
Example 2.
The rotation group acts transitively on ; therefore, is an -homogeneous space. Consider a point in as a vector in . Rotations that fix the point occur precisely in the subspace orthogonal to the vector. Thus, the stabilizer or isotropy group is the rotation group and . The set of invertible matrices with positive determinant acts on symmetric positive definite matrices . The isotropy group is the rotation group , and thus, . A particular type of homogeneous space arises when the subgroup is a discrete subgroup of G. For example, the space defines the n-torus as a homogeneous space.
2.5. Brownian Motion on Riemannian Manifolds
The Laplacian defines Brownian motion on M as a -diffusion process up to its explosion time . The stochastic differential equation (SDE) for a Brownian motion in local coordinates is
where is the matrix square root of .
On Lie groups, an SDE for a Brownian motion on G in terms of left-invariant vector fields takes the form
where ∘ denotes integration in the Stratonovich sense. By [] (Proposition 2.6), if the inner product is Ad(G)-invariant, then . The solution of (6) is conservative or non-explosive and is called the left-Brownian motion on G (see [] and the references therein).
2.6. Brownian Bridges
In this section, we briefly review some facts on Brownian bridges on Riemannian manifolds, including Lie groups. On Lie groups, the existence of left-invariant (respectively right-invariant) vector fields allows identification of the Lie algebra with the vector space of left-invariant vector fields making the Lie group parallelizable. This allows constructing general semimartingales directly on the Lie groups.
Let be the measure of a Riemannian Brownian motion, , at some time t started at point x. Let denote the transition density of so that with the Riemannian volume measure. Conditioning the Riemannian Brownian motion to hit some point at time defines a Riemannian Brownian bridge. We let denote the corresponding probability measure. The two measures are absolutely continuous (equivalent) over the time interval , however mutually singular at time . This consequence is obvious because , whereas . The corresponding Radon–Nikodym derivative is
which is a martingale for . The Radon–Nikodym derivative defines the density for the change of measure, and it provides the conditional expectation
for any bounded and -measurable random variable . The Brownian bridge is a non-homogeneous diffusion on M with infinitesimal generator
The bridge can be described by an SDE in the frame bundle of M. Let be a lift of , and using the horizontal vector fields , we have
where denotes the lift of the transition density, B is an -valued Brownian motion, and is the pushforward of the projection . Here, is an orthonormal frame such that .
Further types of Riemannian bridges can be found in Thompson []. Brownian bridges to submanifolds are here introduced by considering the transition density evaluated at a submanifold by
where denotes the volume measure on N. Conditioning on gives
which holds for all bounded -measurable random variables . Fibers of homogeneous spaces are embedded submanifolds of a Lie group. We will later use this to derive a simulation scheme on homogeneous spaces by conditioning on the fibers.
The notion of Fermi bridges was also introduced in []. Fermi bridges have infinitesimal generator
where and .
2.7. One-Point Motions
Consider the homogeneous space , where H is a Lie subgroup of the Lie group G, and let denote the canonical projection. Suppose that G acts on M on the left and that is a process in G. This induces a process in M. For any , the induced process defines the one-point motion of in M, with initial value x. The one-point motion, , of a Brownian motion in G, started at , is only a Brownian motion in M under certain conditions (see, e.g., [], Proposition 2.7). In the case of a bi-invariant metric, a Brownian motion on G maps to a Brownian motion in M through its one-point motion, which, in general, is not the case. For example, the one-point processes of a G-valued Markov process might not preserve the Markov property if the metric is not bi-invariant.
In this paper, we explicitly chose metrics whose Brownian motions do not descend to Brownian motions in . The non-invariant metrics result in processes in with an anisotropic covariance structure. The anisotropic distributions in will arise from non-invariant metrics, and the induced processes will, in general, not inherit the Markov property.
2.8. Pushforward Measures
Let be the projection to the homogeneous space . Then, is a measurable map, and if is a measure on G, the pushforward of by , defined by , for all measurable subsets , is a measure on M. A numerical example is provided in Figure 1, showing anisotropic distributions on the homogeneous space obtained from pushing forward Brownian motions of a non-invariant metric on the top space .
The Riemannian volume measure on G decomposes into a product measure consisting of the volume measure on fibers in G, e.g., , and the volume measure on their horizontal complement, i.e., , where is the horizontal restriction of the volume measure in G. The measure of a process on G pushes forward to M, and we denote the corresponding density with respect to the volume measure on M for . Then, .
Lemma 1.
Let be a Markov process on G, started at , with density , and let . The conditional expectation on M satisfies
for all bounded, continuous, and non-negative -measurable f on M. Furthermore,
where .
3. Simulation of Bridges on Lie Groups
In this section, we consider the task of simulating (6) conditioned to hit , at time . The potentially intractable transition density for the solution of (6) inhibits simulation directly from the bridge SDE (9). Instead, we propose to add a guiding term mimicking that of Delyon and Hu [], i.e., the guiding term becomes the gradient of the distance to v divided by the time to arrival. The SDE for the guided diffusion becomes
where denotes the Riemannian distance to v. Note that we can always, for convenience, take the initial value to be the identity e. Equation (13) can equivalently be written as
where is the inverse of the Riemannian exponential map . Figure 2 illustrates one sample path of the simulation scheme in (13) on the Lie group . The corresponding axis-angle representation is visualized in Figure 3.
Figure 2.
One sample path of the guided bridge process (13) on visualized by its action on the basis vectors (red, blue, green) of . The bridge is conditioned on the rotation indicated by the black arrows.
The guiding term in (13) is identical to the guiding term described in [,]. In [], the guided processes used the frame bundle of M. In the Lie group setting, since Lie groups are parallelizable, the use of the frame bundle is not needed: the invariant vector fields provide a frame of reference at all points of G.
Numerical computations of the Lie group exponential map are often computationally efficient; see [] and the references therein for efficient algorithms. Therefore, by a change of measures argument, the equation above can be expressed in terms of the inverse of the Lie group exponential:
, where is a Brownian motion under a new measure, say . The measure can explicitly be expressed as
where denotes the law of the SDE in (13) and
Note that when the metric is bi-invariant, the group logarithm and the Riemannian logarithm coincide.
3.1. Radial Process
Below, we investigate the relation between the bridge measure and the above simulation schemes. Let be the distance to v such that is the radial process. Due to the singularities of the radial process on , the usual Itô’s formula only applies on subsets away from the cut-locus. The extension beyond the cut-locus of a Brownian motion’s radial process was due to Kendall []. Barden and Le [,] generalized the result to M-valued semimartingales. The radial process of the Brownian motion (6) is given by
where is the geometric local time of the cut-locus , which is a non-decreasing continuous random functional increasing only when g is in (see [,,]). Let , which is the local-martingale part in the above equation. The quadratic variation of satisfies by the orthonormality of ; thus, is a Brownian motion by Levy’s characterization theorem. From the stochastic integration by parts formula and (15), the squared radial process of g satisfies
where is the random measure associated with .
Similarly, we obtain an expression for the squared radial process of Y. The radial process becomes
Imposing a growth condition on the radial process yields an -bound on the radial process of the guided diffusion, []. Therefore, assume there exist constants and such that on , for every regular domain . Then, (17) satisfies
where is the first exit time of Y from the domain D.
3.2. Girsanov Change of Measure
Let be a d-dimensional Brownian motion defined on a filtered probability space , and let be a solution of (6). The process is an adapted process. As is non-explosive, we see that
for every , almost surely, and for some fixed constant . Define a new measure by
From (19), the process is a martingale, for , and defines a probability measure on each absolutely continuous with respect to . By Girsanov’s theorem (see, e.g., ([], Theorem 8.1.2)), we obtain a new process , which is a Brownian motion under the probability measure . Moreover, under the probability , Equation (6) becomes
where is the i’th component of the unit radial vector field in the direction of v. The squared radial vector field is smooth away from , and we set it to zero on . Away from , the squared radial vector field is . The added drift term acts as a guiding term, which pulls the process toward v at time .
3.3. Delyon and Hu in Lie Groups
We can now generalize the result of Delyon and Hu ([], Theorem 5) to the Lie group setting. The result here for Lie groups is analogous to the Riemannian setting as covered in [].
Theorem 1.
Let be a solution of (6). The SDE (13) yields a strong solution on and satisfies , almost surely. Moreover, the conditional expectation of g given is
for every -measurable non-negative function f on G, for , where is given in (22).
When the geometry of G is particularly simple, the equivalence of measures hold on ; see []. For example, in the case of G being simply connected:
Corollary 1.
When G is simply connected, (23) becomes
where is a constant, which depends on the initial point, the time , and the curvature in the radial direction.
4. Simulation of Bridges in Homogeneous Spaces
We now turn to bridge simulation in homogeneous spaces by sampling bridges in G conditioned on the fiber over . We simulated in the top space the Lie group G and, subsequently, projected to the homogeneous space M. Inspired by Fermi bridges to submanifolds, we guided toward the closest point in the fiber.
Guiding to the Closest Point
Recall that the projection is a submersion; hence, the fibers are embedded submanifolds of G. From Lemma 1, we obtain a conditional expectation in M by conditioning on the fiber in the Lie group. The corresponding SDE for the Fermi bridge in the Lie group setting is given by
where and , for some .
The one-point motion conditioned on corresponds to conditioning on the fiber , and we can use Fermi bridges directly. Because N is an embedded submanifold of G, we obtain from Thompson [] that is of the form
where and . Similar to the single-point case, we obtain
for any bounded measurable function f. Again, there are various situations where it can be justified to take the limit inside. See the discussion in [], Appendix C.
5. Maximum Likelihood Estimation
A Brownian motion depends both on its starting point and the underlying Riemannian metric. We can consider both parameters of the model and, given the data, seek to estimate the parameters by the maximum likelihood (MLE). The resulting optimal starting point will in this case be a diffusion mean []. As visualized in Figure 1, the pushforward measure of a Brownian motion generated by a non-invariant metric induces distributions on the quotient space, and these distributions will be anisotropic if the metric on the top space is not bi-invariant. Here, we describe a setting for estimation of the underlying metric by the maximum likelihood.
Consider i.i.d. observations on G or . Let and be the densities of Brownian motions with parameters , where g represents the starting point and A the metric tensor at g. The inverse can be thought of as the covariance of the model. We obtain a likelihood as
and, similarly, .
The bridge sampling scheme introduced above yields approximations of the intractable transition densities in (27). In the d-dimensional Euclidean case, importance sampling yields the estimate []
where . Thus, from the output of the importance sampling, we obtain an estimate of the transition density. Similar to the Euclidean case, we here obtain an expression for the heat kernel as , where
where the equality holds almost everywhere and denotes the metric . The map in (28) is the Riemannian inverse exponential map .
Algorithm 1 provides a detailed description of the iterative MLE approach. Visual examples of the iterative MLE can be found in Figures 4 and 6.
| Algorithm 1: Parameter estimation: iterative MLE. |
![]() |
6. Experiments
In this section, we present numerical results of bridge sampling on specific Lie groups and homogeneous spaces: the three-dimensional rotation group and the general linear group of invertible matrices with positive determinant . Exploiting the bridge sampling scheme described above, we show below how to estimate the true underlying metric on with iterative maximum likelihood estimation. This estimation, in turn, allows finding the parameters of the anisotropic pushforward distributions as displayed in Figure 1.
The space of the symmetric positive definite matrices is an example of a non-linear space in which geometric data appear in many applications. The space can be obtained as the homogeneous space , where is the space of invertible matrices with a positive determinant.
6.1. Discretization
We numerically approximate the Stratonovich integrals by the Euler–Heun scheme. With a time discretization , and corresponding noise , the numerical approximation of the Brownian motion (6) takes the form
where is used only as an intermediate value in integration. For the guided bridge simulations, we add the corresponding drift to (29) to obtain the numerical scheme.
6.2. Importance Sampling and Metric Estimation on SO(3)
This section takes G as the special orthogonal group , the space of three-dimensional rotation matrices. The special orthogonal group is a compact connected matrix Lie group. The rotation group is a semi-simple Lie group, and bi-invariant inner products exist. In the case of a bi-invariant metric, the Riemannian exponential map Exp coincides with the Lie group exponential map exp, and thus, the Riemannian distance function , from the rotation R to the identity I, satisfies .
Figure 2 illustrates the numerical approximation with a sample path from the guided diffusion conditioned to hit the rotation represented by the black vectors. Another way of visualizing the guided bridge on the rotation group is through the angle–axis representation. Figure 3 represents a guided process on by presenting the axis representation on and its corresponding angle of rotation.
Figure 4 illustrates how importance sampling on leads to a metric estimation of the underlying unknown metric, which generated the Brownian motion. We sampled 128 points as endpoints of a Brownian motion from the metric , and used 20 time steps to sample four bridges per observation. An iterative MLE method using gradient descent with a learning rate of and initial guesses of the metric and yielded convergence to the true metric.
Figure 4.
The importance sampling technique applies to estimate the metric on the Lie group . Sampling a Brownian motion from an underlying unknown metric, we obtain convergence to the true underlying metric using an iterative MLE method. Here, we sampled four guided bridges per observation, providing a relatively smooth iterative likelihood. (Top left) Estimation of the unknown underlying metric using bridge sampling, starting from the metric . Here, the true metric is the diagonal matrix represented by the red lines. The diagonal is represented by the colors diag (purple, blue, yellow). (Top right) The corresponding log-likelihood evolution through the iterations. (Bottom left) Estimation of the unknown underlying metric using bridge sampling, starting from the metric . (Bottom right) The corresponding iterative log-likelihood.
6.3. Diffusion Mean Estimation on the Space of Symmetric Positive Definite Matrices
The space of symmetric positive definite (SPD) matrices is used in a range of applications, one example being diffusion tensor imaging where the element of models the anisotropic diffusion of water molecules in each position of the imaged domain. The SPD matrices constitute a homogeneous space of invertible matrices with the positive determinants’ quotient the rotation group.
Figure 5 illustrates discrete-time observations of three sample paths of a guided bridge in .
Figure 5.
Discrete-time observations from three sample paths on . The sample paths are obtained as the pushforward of the Fermi bridge in . The start and endpoint are the left- and rightmost figures, where the SPD matrices are indicated by the bold face arrows.
In Figure 6, the bridge sampling scheme derived above is used to obtain an estimate of the diffusion mean [,] on by sampling guided bridge processes in the space of invertible matrices with positive determinants . This sampling method provides an estimate of the density on , which projects to a density in . Exploiting the resulting density in , the iterative MLE yields convergence to the diffusion mean.
Figure 6.
Given 256 data points in , we estimated the diffusion mean on the homogeneous space by sampling bridges in the top space conditioned on the fibers. The iterative MLE in Algorithm 1 yielded the convergence of the diffusion mean parameter, using a learning rate of and one bridge sample per observation. (Left) The purple, blue, and yellow lines correspond to the diagonal of the metric matrix, whereas the remaining colors represent the off-diagonal. The true mean value is the identity matrix indicated by the red lines. (Right) The corresponding log-likelihood evolution through the iterations.
6.4. Density Estimation on the Two-Sphere
The two-sphere can be considered the homogeneous space of three-dimensional rotations, identifying the subgroup of two-dimensional rotations as a single point. Conditioning on the fiber in , we obtain guided bridges on . In the case of a bi-invariant metric on G, the G-valued Brownian motion pushes forward to an M-valued Brownian motion. Figure 1b illustrates the estimated transition density on from sampling bridges in the Lie group conditioned on the fiber , when the underlying metric is bi-invariant. When altering the metric to a non-invariant variant one, the G-Brownian motion does not in general push forward to an M-Brownian motion. The non-invariant metrics result in a covariance structure exhibiting anisotropy, which is illustrated by Figure 1c,d.
7. Conclusions
In this paper, we presented algorithms for estimating the parameters of a class of densities that are the generalization of the Euclidean normal density to Lie groups and homogeneous spaces. We used the heat equation to generalize the normal distribution to Lie groups and homogeneous spaces, where the left- (or right-) invariant metric generalizes the notion of the covariance of a normal density. We presented algorithms for bridge simulation and for estimating the metric given the i.i.d. Lie group or homogeneous space-valued samples. The estimation algorithm was based on Monte Carlo simulations of Brownian bridges. These algorithms are expected to impact many diverse fields, including bioinformatics, medical imaging, shape analysis, computer vision, and information geometry, where Lie groups or homogeneous spaces are the natural model spaces for data samples.
8. Code
The code used for the experiments is available in the Theano Geometry http://bitbucket.org/stefansommer/theanogeometry and Jax Geometry http://bitbucket.org/stefansommer/jaxgeometry software packages (accessed on 7 July 2022). The implementation uses automatic differentiation libraries extensively for the geometry computations, as is further described in [].
Author Contributions
Conceptualization, S.J. and S.S.; methodology, all authors; software, M.H.J. and S.S.; writing, all authors. All authors have read and agreed to the published version of the manuscript.
Funding
The work presented is supported by the CSGB Centre for Stochastic Geometry and Advanced Bioimaging funded by a grant from the Villum Foundation, the Villum Foundation Grants 22924 and 40582, the Novo Nordisk Foundation Grant NNF18OC0052000, and the National Science Foundation Grant DMS-1912030.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Pedersen, A.R. Consistency and asymptotic normality of an approximate maximum likelihood estimator for discretely observed diffusion processes. Bernoulli 1995, 1, 257–279. [Google Scholar] [CrossRef]
- Bladt, M.; Sørensen, M. Simple simulation of diffusion bridges with application to likelihood inference for diffusions. Bernoulli 2014, 20, 645–675. [Google Scholar] [CrossRef]
- Bladt, M.; Finch, S.; Sørensen, M. Simulation of multivariate diffusion bridges. J. R. Stat. Soc. Ser. B Stat. Methodol. 2016, 78, 343–369. [Google Scholar] [CrossRef]
- Bui, M.N.; Pokern, Y.; Dellaportas, P. Inference for partially observed Riemannian Ornstein–Uhlenbeck diffusions of covariance matrices. arXiv 2021, arXiv:2104.03193. [Google Scholar]
- Delyon, B.; Hu, Y. Simulation of Conditioned Diffusion and Application to Parameter Estimation. Stoch. Process. Their Appl. 2006, 116, 1660–1675. [Google Scholar] [CrossRef]
- Jensen, M.H.; Mallasto, A.; Sommer, S. Simulation of Conditioned Diffusions on the Flat Torus. In Proceedings of the International Conference on Geometric Science of Information, Toulouse, France, 27–29 August 2019; Springer: Berlin/Heidelberg, Germany, 2019; pp. 685–694. [Google Scholar]
- Jensen, M.H.; Sommer, S. Simulation of Conditioned Semimartingales on Riemannian Manifolds. arXiv 2021, arXiv:2105.13190. [Google Scholar]
- Van der Meulen, F.; Schauer, M. Bayesian estimation of discretely observed multi-dimensional diffusion processes using guided proposals. Electron. J. Stat. 2017, 11, 2358–2396. [Google Scholar] [CrossRef]
- Papaspiliopoulos, O.; Roberts, G. Importance sampling techniques for estimation of diffusion models. Stat. Methods Stoch. Differ. Equ. 2012, 311–340. [Google Scholar]
- Schauer, M.; Van Der Meulen, F.; Van Zanten, H. Guided proposals for simulating multi-dimensional diffusion bridges. Bernoulli 2017, 23, 2917–2950. [Google Scholar] [CrossRef]
- Sommer, S.; Arnaudon, A.; Kuhnel, L.; Joshi, S. Bridge Simulation and Metric Estimation on Landmark Manifolds. In Proceedings of the Graphs in Biomedical Image Analysis, Computational Anatomy and Imaging Genetics, Lecture Notes in Computer Science, Quebec, QC, Canada, 10–14 September 2017; Springer: Berlin/Heidelberg, Germany, 2017; pp. 79–91. [Google Scholar]
- Bui, M.N. Inference on Riemannian Manifolds: Regression and Stochastic Differential Equations. Ph.D. Thesis, UCL (University College London), London, UK, 2022. [Google Scholar]
- Fisher, R. Dispersion on a Sphere. Proc. R. Soc. Lond. A Math. Phys. Eng. Sci. 1953, 217, 295–305. [Google Scholar] [CrossRef]
- Kent, J.T. The Fisher-Bingham Distribution on the Sphere. J. R. Stat. Soc. Ser. B (Methodol.) 1982, 44, 71–80. [Google Scholar] [CrossRef]
- Thompson, J. Brownian bridges to submanifolds. Potential Anal. 2018, 49, 555–581. [Google Scholar] [CrossRef]
- García-Portugués, E.; Sørensen, M.; Mardia, K.V.; Hamelryck, T. Langevin Diffusions on the Torus: Estimation and Applications. Stat. Comput. 2017, 29, 1–22. [Google Scholar] [CrossRef]
- Hamelryck, T.; Kent, J.T.; Krogh, A. Sampling Realistic Protein Conformations Using Local Structural Bias. PLoS Comput. Biol. 2006, 2, e131. [Google Scholar] [CrossRef] [PubMed]
- Pennec, X.; Fillard, P.; Ayache, N. A Riemannian Framework for Tensor Computing. Int. J. Comput. Vis. 2006, 66, 41–66. [Google Scholar] [CrossRef]
- Vaillant, M.; Miller, M.; Younes, L.; Trouvé, A. Statistics on Diffeomorphisms via Tangent Space Representations. NeuroImage 2004, 23, S161–S169. [Google Scholar] [CrossRef]
- Yang, L. Means of Probability Measures in Riemannian Manifolds and Applications to Radar Target Detection. Ph.D. Thesis, Poitiers University, Poitiers, France, 2011. [Google Scholar]
- Grenander, U. Probabilities on Algebraic Structures; Wiley: New York, NY, USA/London, UK, 1963. [Google Scholar]
- Jensen, M.H.; Joshi, S.; Sommer, S. Bridge Simulation and Metric Estimation on Lie Groups. In Proceedings of the Geometric Science of Information; Lecture Notes in Computer Science; Nielsen, F., Barbaresco, F., Eds.; Springer International Publishing: Cham, Switzerland, 2021; pp. 430–438. [Google Scholar] [CrossRef]
- Pennec, X.; Sommer, S.; Fletcher, T. Riemannian Geometric Statistics in Medical Image Analysis; Elsevier: Amsterdam, The Netherlands, 2020. [Google Scholar]
- Liao, M. Lévy Processes in Lie Groups; Cambridge University Press: Cambridge, UK, 2004; Volume 162. [Google Scholar]
- Shigekawa, I. Transformations of the Brownian motion on a Riemannian symmetric space. In Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete; 1984; pp. 493–522. [Google Scholar]
- Kendall, W.S. The radial part of Brownian motion on a manifold: A semimartingale property. Ann. Probab. 1987, 15, 1491–1500. [Google Scholar] [CrossRef]
- Barden, D.; Le, H. Some consequences of the nature of the distance function on the cut locus in a Riemannian manifold. J. LMS 1997, 56, 369–383. [Google Scholar] [CrossRef]
- Le, H.; Barden, D. Itô correction terms for the radial parts of semimartingales on manifolds. Probab. Theory Relat. Fields 1995, 101, 133–146. [Google Scholar] [CrossRef]
- Hsu, E.P. Stochastic Analysis on Manifolds; AMS: Providence, RI, USA, 2002; Volume 38. [Google Scholar]
- Thompson, J. Submanifold Bridge Processes. Ph.D. Thesis, University of Warwick, Coventry, UK, 2015. [Google Scholar]
- Hansen, P.; Eltzner, B.; Huckemann, S.F.; Sommer, S. Diffusion Means in Geometric Spaces. arXiv 2021, arXiv:2105.12061. [Google Scholar]
- Hansen, P.; Eltzner, B.; Sommer, S. Diffusion Means and Heat Kernel on Manifolds. arXiv 2021, arXiv:2103.00588. [Google Scholar]
- Kühnel, L.; Sommer, S.; Arnaudon, A. Differential Geometry and Stochastic Dynamics with Deep Learning Numerics. Appl. Math. Comput. 2019, 356, 411–437. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
