1. Introduction
Spherical distributions are of special importance when working with directional data [1]. They are used to represent the orientation of objects or individuals and to identify movement directions. Recently, spherical distributions have become particularly important in the modelling of directed motion in biology. Examples in ecology that have been modelled include the directed motion of wolves along seismic lines [2], the orientation of sea turtles along geomagnetic lines [3], hilltopping of butterflies [4], and the migration of whales [5]. Examples of directed cell movement include the movement of melanoma cells [6], the spread of glioma in the brain [7], the oriented motion of cells on microfabricated structures [8], and the directed motion of immune cells [9].
Mathematically, a spherical distribution is a probability distribution on the unit sphere $\mathbb{S}^{n-1}$ in $\mathbb{R}^n$, where $n$ denotes the space dimension. We usually denote it as $q(\theta)$, $\theta \in \mathbb{S}^{n-1}$, with the basic properties
$$q(\theta) \ge 0, \qquad \int_{\mathbb{S}^{n-1}} q(\theta)\, d\theta = 1. \tag{1}$$
For application to glioma, three spherical distributions have been proposed (we give explicit definitions later): the von Mises distribution (5) (also called the von Mises–Fisher distribution in higher dimensions), the peanut distribution (6), and the orientation distribution function (ODF) (7) [7,10,11,12]. The von Mises distribution has been used in a model that was fit to data for 10 patients diagnosed with glioma in [7], while the ODF has been used in a theoretical model for glioma growth that incorporates the effect of endothelial cells as well as the acidity of the environment in [12]. Here, we focus on the peanut and von Mises distributions. We generalize these distributions to arbitrary space dimension $n$ and compute new formulas for their expectations and variance–covariance matrices. We also applied the method to the ODF, but, unfortunately, we did not obtain an explicit formula for the second moment of the ODF. Other important spherical distributions are the Bingham, the Fisher–Bingham, and the Kent distributions. These have not been used extensively in biological modelling; hence, we discuss them only briefly in the conclusion section.
1.1. Anisotropic Transport Equations
The popular use of spherical distributions in ecology and cell biology relies on the modelling of movement data with transport equations. In this short subsection, we recall some of the essential steps of this framework, to set the stage for the following calculations. Among the above applications, let us choose the example of wolf movement along seismic lines. The arguments for the other applications (glioma, immune cells, sea turtles, whales) are very similar.
Seismic lines are clear-cut lines in the northern Canadian forests, created by oil exploration companies to search for oil and gas [13]. Wolves in these forests can use these lines to travel farther than normal and increase their hunting success. Needless to say, this has a severe ecological impact. Using GPS data on wolf movement, it was shown by McKenzie et al. [2,14] that the directional orientation of wolves near seismic lines follows a von Mises distribution. Let us call it $q(x,\theta)$, $\theta \in \mathbb{S}^{n-1}$.
To formulate a mathematical model for the wolf movement, we introduce a wolf density $p(t,x,\theta)$ that depends on time $t$, location $x$, and orientation $\theta$, and consider the anisotropic transport equation
$$p_t + s\,\theta\cdot\nabla_x p = -\mu\, p + \mu\, q(x,\theta)\,\bar p(t,x), \qquad \bar p(t,x) = \int_{\mathbb{S}^{n-1}} p(t,x,\theta')\, d\theta'. \tag{2}$$
Here $\mu$ is a rate of change of direction and $s$ is an average speed. The arguments $(t,x,\theta)$ of $p$ have been suppressed in three of the terms.
The analysis of these types of models is well established and many papers are available [15,16,17]; we will not recall the details here. However, we will discuss the important parabolic scaling. Parabolic scaling is a scaling transformation of the transport Equation (2) into macroscopic time and space scales. It can be shown that, to leading order, the integrated particle density $\bar p(t,x)$ satisfies an advection–diffusion equation
$$\bar p_t + \nabla\cdot\bigl(a(x)\,\bar p\bigr) = \nabla\nabla : \bigl(D(x)\,\bar p\bigr), \tag{3}$$
where the drift velocity and the diffusion coefficient are proportional to the first and second moments of $q$:
$$a(x) = s\,\mathbb{E}_q(x), \qquad D(x) = \frac{s^2}{\mu}\, V_q(x). \tag{4}$$
Hence, very clearly, knowledge of the first and second moments of $q$ is essential here.
In the application to wolf data, the distribution $q$ encodes the directional network of seismic lines in the environment [2,14]. In the case of sea turtles, $q$ describes the orientation towards the target site [3]. Applied to glioma data, $q$ is informed by an MRI technique called diffusion tensor imaging (DTI) [7,12,18,19,20] (see also Section 5 for a simplified example).
Hillen et al. [10] calculated the explicit forms of the expectation and the variance–covariance matrix for the von Mises–Fisher distribution in two and three dimensions. It appears that the use of the divergence theorem in suitable domains was the key to obtaining the explicit formulas for the moments of these distributions.
1.2. Outline of the Paper
Here, we extend that methodology to higher dimensions. The advantage of knowing the explicit form of the moments of a spherical distribution is that one can easily calculate the expectation and variance–covariance matrix without having to evaluate the integrals numerically, which significantly reduces the computational cost in large dimensions. We find explicit forms for the n-dimensional von Mises–Fisher and peanut distributions which, to our knowledge, are new formulas.
In addition, we consider the level of anisotropy that arises from these different formulations (peanut, ODF, von Mises–Fisher). The anisotropy is a measure of the distortion of the distribution in certain directions. It can be easily visualized by plotting level sets of the corresponding distributions (see Figure 1). To measure the anisotropy, we first compute the eigenvalues of the corresponding variance–covariance matrices, and then compute the fractional anisotropy ($FA$) and the maximal eigenvalue ratio $R$. We find that the peanut distribution has a limit to how much anisotropy it can account for, whereas the von Mises–Fisher distribution has no such limitation, showing that the von Mises–Fisher distribution is the better choice for capturing anisotropy. Please note that, here, we focus on theoretical results; specific applications are treated in the cited literature [2,3,7,14,17].
1.3. Definitions of Some Spherical Distributions
In this section, we give the explicit definitions of the spherical distributions that we will work with. We use given parameters $k$, $u$, and $A$, where $k > 0$ is a concentration parameter, $u \in \mathbb{S}^{n-1}$ is a given vector, and $A \in \mathbb{R}^{n\times n}$ is a given symmetric positive definite matrix. In the case of glioma modelling, $A$ is estimated from the diffusion tensor data (see Section 5).
The $n$-dimensional von Mises–Fisher distribution is defined as [10]
$$q_{vM}(\theta) = \frac{k^{\,n/2-1}}{(2\pi)^{n/2}\, I_{n/2-1}(k)}\; e^{\,k\,\theta\cdot u}, \qquad \theta\in\mathbb{S}^{n-1}, \tag{5}$$
where $I_p(k)$ denotes the modified Bessel function of the first kind of order $p$, $k$ is the concentration parameter, and $u$ defines the likeliest direction of the distribution. Here, we employ a formulation that uses Bessel functions. An alternative definition uses the Gamma function and can be found in Appendix A.1.
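As a quick sanity check of the normalization in (5), the following sketch (our own code, not from the paper; the helper name and parameter values are illustrative) evaluates the density with SciPy's modified Bessel functions and verifies by Monte Carlo that it integrates to one over the sphere.

```python
# Monte Carlo check that the von Mises-Fisher density (5) integrates to 1.
import numpy as np
from scipy.special import ive, gamma

def vmf_density(theta, u, k, n):
    """Evaluate q_vM(theta) = k^{n/2-1} / ((2 pi)^{n/2} I_{n/2-1}(k)) exp(k theta.u)."""
    # ive(p, k) = exp(-k) I_p(k) is the scaled Bessel function; the factor exp(-k)
    # avoids overflow for large k and is compensated in the exponent below.
    log_c = (n / 2 - 1) * np.log(k) - (n / 2) * np.log(2 * np.pi) \
            - np.log(ive(n / 2 - 1, k)) - k
    return np.exp(log_c + k * (theta @ u))

n, k = 4, 5.0                                   # illustrative dimension and concentration
u = np.zeros(n); u[0] = 1.0                     # preferred direction
rng = np.random.default_rng(0)
x = rng.standard_normal((200_000, n))
x /= np.linalg.norm(x, axis=1, keepdims=True)   # uniform samples on S^{n-1}
surface = 2 * np.pi ** (n / 2) / gamma(n / 2)   # surface area |S^{n-1}|
print(surface * vmf_density(x, u, k, n).mean()) # should be close to 1
```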
The $n$-dimensional peanut distribution is defined as [11]
$$q_p(\theta) = \frac{n}{\operatorname{tr}(A)\,|\mathbb{S}^{n-1}|}\;\theta^T A\,\theta, \tag{6}$$
where $\operatorname{tr}(A)$ denotes the trace of $A$, $\theta\in\mathbb{S}^{n-1}$, and $|\mathbb{S}^{n-1}|$ is the surface area of the $(n-1)$-dimensional sphere.
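The same kind of Monte Carlo check applies to the peanut density (6). Below is a short sketch with our own helper name and a hypothetical tensor $A$.

```python
# Monte Carlo check that the peanut density (6) integrates to 1 on the sphere.
import numpy as np
from scipy.special import gamma

def peanut_density(theta, A):
    """q(theta) = n / (tr(A) |S^{n-1}|) * theta^T A theta for theta of shape (N, n)."""
    n = A.shape[0]
    surface = 2 * np.pi ** (n / 2) / gamma(n / 2)
    return n / (np.trace(A) * surface) * np.einsum('ij,jk,ik->i', theta, A, theta)

A = np.diag([3.0, 1.0, 0.5])                    # hypothetical anisotropy tensor
n = A.shape[0]
rng = np.random.default_rng(1)
x = rng.standard_normal((200_000, n))
x /= np.linalg.norm(x, axis=1, keepdims=True)   # uniform samples on S^{n-1}
surface = 2 * np.pi ** (n / 2) / gamma(n / 2)
print(surface * peanut_density(x, A).mean())    # should be close to 1
```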
The $n$-dimensional ODF distribution is defined as [12,21]
$$q_{ODF}(\theta) = \frac{1}{|\mathbb{S}^{n-1}|\,\sqrt{\det A}}\;\bigl(\theta^T A^{-1}\theta\bigr)^{-n/2}, \tag{7}$$
where $\theta$ and $A$ hold the same definitions as in the peanut distribution defined above. In (7), $A^{-1}$ denotes the inverse of $A$ and $\det A$ its determinant.
We only briefly discuss the Bingham distribution [1,22], which is defined as
where $\theta$ and $A$ hold the same definitions as above and $t$ is the diffusion time.
The Fisher–Bingham distribution [1] is defined as
$$q_{FB}(\theta) = \frac{1}{Z}\,\exp\!\bigl(k\, u\cdot\theta + \theta^T A\,\theta\bigr), \tag{9}$$
where $k$ is a concentration parameter, $u$ is the mean direction, and $Z$ is a normalization constant. In the case of a diagonal matrix $A$, the Fisher–Bingham distribution is also known as the Kent distribution [1,23].
We show the 2D and 3D versions of the peanut, ODF, Bingham, and von Mises–Fisher distributions in
Figure 1.
1.4. Definitions of First and Second Moments
Since we will be calculating the first and second moments of the spherical distributions in $n$ dimensions to obtain the expectation and the variance–covariance matrix, we provide the explicit definitions here. In general, the expectation [10] is defined as
$$\mathbb{E}_q = \int_{\mathbb{S}^{n-1}} \theta\; q(\theta)\, d\theta, \tag{10}$$
and the variance–covariance matrix [10] is defined as
$$V_q = \int_{\mathbb{S}^{n-1}} \theta\otimes\theta\; q(\theta)\, d\theta \;-\; \mathbb{E}_q\otimes\mathbb{E}_q. \tag{11}$$
Here, $a\otimes b = a\,b^T$ for $a, b \in \mathbb{R}^n$ denotes the tensor product of two vectors. For simplicity of notation, we give names to the zeroth, first, and second moments. The zeroth moment is defined by
$$M_0 = \int_{\mathbb{S}^{n-1}} q(\theta)\, d\theta, \tag{12}$$
the first moment is defined by
$$M_1 = \int_{\mathbb{S}^{n-1}} \theta\; q(\theta)\, d\theta, \tag{13}$$
and the second moment is defined by
$$M_2 = \int_{\mathbb{S}^{n-1}} \theta\otimes\theta\; q(\theta)\, d\theta. \tag{14}$$
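The moments (12)–(14) can also be estimated numerically, which is convenient for spot-checking the closed-form results derived below. The following generic Monte Carlo sketch (our own helper, not part of the paper) treats the uniform distribution on the sphere as an importance proposal.

```python
# Generic Monte Carlo estimates of the zeroth, first and second moments (12)-(14)
# of a spherical density q; q must accept an array of sample points of shape (N, n).
import numpy as np
from scipy.special import gamma

def sphere_moments(q, n, num=400_000, seed=0):
    """Return Monte Carlo estimates of M0, M1 and M2 for the density q on S^{n-1}."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((num, n))
    x /= np.linalg.norm(x, axis=1, keepdims=True)       # uniform points on S^{n-1}
    surface = 2 * np.pi ** (n / 2) / gamma(n / 2)        # |S^{n-1}|
    w = surface * q(x)                                   # importance weights q / uniform
    M0 = w.mean()
    M1 = (w[:, None] * x).mean(axis=0)
    M2 = (w[:, None, None] * x[:, :, None] * x[:, None, :]).mean(axis=0)
    return M0, M1, M2
```

For example, passing the peanut density from the previous sketch should return $M_0 \approx 1$, $M_1 \approx 0$, and an $M_2$ close to the closed form derived in Section 3.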
2. First and Second Moments of von Mises–Fisher Distributions
In this section, we calculate the first and second moments of the general $n$-dimensional von Mises–Fisher distribution (5) in order to obtain general formulas for the expectation and the variance–covariance matrix of this distribution.
For simplicity of notation, sums over repeated indices are understood, as in tensor calculus [10],
$$a_i b_i := \sum_{i=1}^{n} a_i b_i. \tag{15}$$
We also recommend referring to Appendices A.2, A.4 and A.5 for the basic properties of Bessel functions, the definitions of $\mathbb{S}^{n-1}$ and the unit ball $B_1(0)$, as well as formulas that are used in the following calculations.
Theorem 1. Consider the von Mises–Fisher distribution (5) in any dimension $n$. The expectation and variance–covariance matrix are
$$\mathbb{E}_{q} = \frac{I_{n/2}(k)}{I_{n/2-1}(k)}\; u,$$
$$V_{q} = \frac{I_{n/2}(k)}{k\, I_{n/2-1}(k)}\,\mathbb{I}_n + \left(1 - \frac{n\, I_{n/2}(k)}{k\, I_{n/2-1}(k)} - \frac{I_{n/2}(k)^2}{I_{n/2-1}(k)^2}\right) u\otimes u.$$
Proof. Expectation: We abbreviate the normalization constant in (5) as
$$c := \frac{k^{\,n/2-1}}{(2\pi)^{n/2}\, I_{n/2-1}(k)}. \tag{18}$$
Following a similar procedure as in [10], we calculate the expectation. Let $b\in\mathbb{R}^n$ be an arbitrary vector; then, using (5), we obtain
where we use the summation convention (15). Note that $\theta$ is a unit vector and also the outside normal vector at $\theta$ on $\mathbb{S}^{n-1}$. Hence, we apply the divergence theorem on the unit ball $B_1(0)$ and continue the calculation as follows
where, in the third equality, we split the integral over the ball into radial and angular components, and in the last step we applied (1).
Making use of the Bessel identities (
A5) and (
A6), we rewrite the term within the integral, that is
Using (
21), we simplify further and apply the fundamental theorem of calculus to obtain
Therefore, we find that
After simplifying, and noting that
b is an arbitrary vector, it follows that
Note that, from this calculation, we also obtain the following useful formula
by equating the right-hand side of the fourth equality in (20) with the right-hand side of the last equality in (22).
Second moment: We now calculate the second moment as well as the variance–covariance matrix for the von Mises–Fisher distribution (
5).
Since the expectation for (
5) is nonzero, we directly calculate the second term in the variance–covariance matrix formula (
11). Using (
23), it is simply
Now, we calculate the first term in the variance–covariance matrix formula (
11), which is the second moment, by following the approach in [
10].
Let $c$ be defined by (18) and let $b, d \in\mathbb{R}^n$ be arbitrary vectors. Again, we use the fact that $\theta$ is the outside normal vector at $\theta$ on $\mathbb{S}^{n-1}$ to apply the divergence theorem. Thus, we obtain
The first integral is solved in (24). We calculate the integral in the second term as
by applying (1) at the second equality.
By using the Bessel identities (
A5) and (
A6), we can rewrite the term in the integral as
Using (
26) and (
A5), we obtain
Thus,
Using (
22) and (
28), we find that
since $b$ and $d$ are arbitrary.
Substituting (
25) and (
29) into the variance formula (
11), we obtain
□
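To make Theorem 1 concrete, here is a small sketch (our own code, assuming the Bessel-ratio form of the expectation and variance–covariance matrix stated above); it spot-checks the closed forms against a self-normalized Monte Carlo estimate with illustrative parameter values.

```python
# Closed-form E_q and V_q of the n-dimensional von Mises-Fisher distribution
# (Bessel-ratio form of Theorem 1), checked against importance sampling.
import numpy as np
from scipy.special import ive

def vmf_moments(u, k):
    """Return (E_q, V_q) for the vMF density with direction u and concentration k."""
    n = u.size
    a = ive(n / 2, k) / ive(n / 2 - 1, k)        # Bessel ratio I_{n/2}(k) / I_{n/2-1}(k)
    E = a * u
    V = (a / k) * np.eye(n) + (1 - n * a / k - a**2) * np.outer(u, u)
    return E, V

n, k = 3, 4.0
u = np.array([1.0, 0.0, 0.0])
E, V = vmf_moments(u, k)

# Self-normalized Monte Carlo with uniform sphere samples as the proposal.
rng = np.random.default_rng(2)
x = rng.standard_normal((400_000, n))
x /= np.linalg.norm(x, axis=1, keepdims=True)
w = np.exp(k * (x @ u) - k)                      # unnormalized vMF weights (times e^{-k})
E_mc = (w[:, None] * x).sum(axis=0) / w.sum()
V_mc = (w[:, None, None] * x[:, :, None] * x[:, None, :]).sum(axis=0) / w.sum() \
       - np.outer(E_mc, E_mc)
print(np.abs(E - E_mc).max(), np.abs(V - V_mc).max())   # both should be small
```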
Corollary 1. Consider the bimodal von Mises–Fisher distribution
$$q_b(\theta) = \frac{k^{\,n/2-1}}{2\,(2\pi)^{n/2}\, I_{n/2-1}(k)}\left(e^{\,k\,\theta\cdot u} + e^{-k\,\theta\cdot u}\right). \tag{31}$$
The expectation and the variance–covariance matrix for (31) are given by
$$\mathbb{E}_{q_b} = 0, \qquad V_{q_b} = \frac{I_{n/2}(k)}{k\, I_{n/2-1}(k)}\,\mathbb{I}_n + \left(1 - \frac{n\, I_{n/2}(k)}{k\, I_{n/2-1}(k)}\right) u\otimes u.$$
Proof. We can rewrite (31) as
$$q_b(\theta) = \frac{1}{2}\bigl(q_u(\theta) + q_{-u}(\theta)\bigr),$$
where $q_u$ is defined by (5) and $q_{-u}$ is also defined by (5), but with $u$ replaced by $-u$. Using this, we can calculate the expectation and the variance–covariance matrix using the results in Theorem 1. The expectation is calculated as follows
Since the expectation is 0, the variance–covariance matrix is given by its second moment. Therefore, using (
29), we obtain
□
Example for $n=2$: The von Mises–Fisher distribution in two dimensions is called the von Mises distribution. It has been used in the various applications mentioned above. Hence, here, we recall the result for the bimodal von Mises distribution in two dimensions [10]:
$$\mathbb{E}_{q_b} = 0, \qquad V_{q_b} = \frac{1}{2}\left(1 - \frac{I_2(k)}{I_0(k)}\right)\mathbb{I}_2 + \frac{I_2(k)}{I_0(k)}\; u\otimes u.$$
Example for $n=3$: It is interesting to consider the important case of the von Mises–Fisher distribution (5) for $n=3$. For $n=3$, we find
In [10], these terms were found as
Hence, we can compare the coefficients and find three new identities for Bessel functions:
Note that (37) is equivalent to (38). These formulas can also be derived by using the trigonometric form of the Bessel function identities for $I_{1/2}$, $I_{3/2}$, and $I_{5/2}$ defined in [24], where
We confirm that the above identities hold numerically in
Figure 2.
In this section, we computed the first and second moments of the von Mises–Fisher distribution (
5) in the proof of Theorem 1 to obtain the explicit forms of the expectation and variance–covariance matrix for
n dimensions. To our knowledge, these formulas for the expectation and the variance–covariance matrix in the general dimension
n are new. Corollary 1 provides a method for generalizing the explicit formulas in Theorem 1 to obtain the analytical form of the expectation and variance–covariance matrix for mixed-modal von Mises–Fisher distributions, where we used the bimodal von Mises–Fisher distribution as an example. As a side result, we also found new versions of Bessel function identities.
3. First and Second Moments of the Peanut Distribution
We first note that the peanut distribution (6), the ODF (7), and the Bingham distribution (8) are all of the form
$$q(\theta) = F(\theta^T A\,\theta)$$
with an appropriate function $F$, where $A$ is a given matrix. In the case of the ODF, we actually use $A^{-1}$, but the structure is the same. Due to the symmetry in $\theta$, the first moment equals zero:
Lemma 1. If $A$ is a given matrix and $q$ is of the form $q(\theta) = F(\theta^T A\,\theta)$, then $\mathbb{E}_q = 0$. Moreover, all odd moments are zero too.
Proof. Let $\theta_1$ denote the first coordinate of $\theta$ in $\mathbb{R}^n$. We split the sphere $\mathbb{S}^{n-1}$ according to the positive and negative $\theta_1$ component, obtaining two hemispheres:
This allows us to exploit symmetry to get the following result
□
3.1. Second Moment of the Peanut Distribution
Here, we calculate the variance–covariance matrix for the peanut distribution (6). Note that, since $\mathbb{E}_q = 0$, the variance–covariance matrix is simply the second moment of (6), that is
$$V_q = \int_{\mathbb{S}^{n-1}} \theta\otimes\theta\; q(\theta)\, d\theta.$$
With this, we can formulate our result, which, to our knowledge, is a new formula for the variance–covariance matrix of the peanut distribution (6) in $n$ dimensions.
Theorem 2. Consider the peanut distribution (6). Then
$$\mathbb{E}_q = 0 \qquad\text{and}\qquad V_q = \frac{1}{(n+2)\operatorname{tr}(A)}\Bigl(\operatorname{tr}(A)\,\mathbb{I}_n + 2A\Bigr).$$
Proof. We have seen in Lemma 1 that $\mathbb{E}_q = 0$. For simplicity, we define the coefficient in (6) as
$$\alpha := \frac{n}{\operatorname{tr}(A)\,|\mathbb{S}^{n-1}|}. \tag{44}$$
To compute the second moment, we multiply it by two arbitrary vectors $b, d\in\mathbb{R}^n$ in order to obtain a scalar value. Then, we use the fact that $\theta$ is the outside normal vector at $\theta$ on $\mathbb{S}^{n-1}$ to apply the divergence theorem. Using (6) and (44), we obtain the following result
where, in the third equality, the divergence theorem was applied and, in the last equality, the product rule was used.
Next, we simplify the final two terms in the above calculation. The first term simplifies as
since
, implying that
For the second term, we set
and simplify
In the above calculation, the divergence theorem was applied at the second equality and the last equality made use of (A10).
Therefore, since $b$ and $d$ are arbitrary, it follows that
□
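A direct numerical check of this result is straightforward; the sketch below (our own code, assuming the form $V_q = (\operatorname{tr}(A)\,\mathbb{I}_n + 2A)/((n+2)\operatorname{tr}(A))$ stated above) compares the closed form with a Monte Carlo integration of $\theta\otimes\theta\, q(\theta)$ for a hypothetical tensor $A$.

```python
# Peanut variance-covariance matrix: closed form vs. Monte Carlo integration.
import numpy as np
from scipy.special import gamma

def peanut_variance(A):
    """V_q = (tr(A) I + 2A) / ((n + 2) tr(A)) for the peanut distribution."""
    n = A.shape[0]
    return (np.trace(A) * np.eye(n) + 2 * A) / ((n + 2) * np.trace(A))

A = np.diag([4.0, 1.0, 0.25])                    # hypothetical anisotropy tensor
n = A.shape[0]
rng = np.random.default_rng(3)
x = rng.standard_normal((400_000, n))
x /= np.linalg.norm(x, axis=1, keepdims=True)    # uniform samples on S^{n-1}
surface = 2 * np.pi ** (n / 2) / gamma(n / 2)
q = n / (np.trace(A) * surface) * np.einsum('ij,jk,ik->i', x, A, x)   # density (6)
V_mc = surface * np.einsum('i,ij,ik->jk', q, x, x) / x.shape[0]
print(np.abs(peanut_variance(A) - V_mc).max())   # should be small
```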
In this section, we computed the first and second moments of the peanut distribution (6) to obtain explicit forms for the expectation and variance–covariance matrix in Theorem 2. To our knowledge, the analytical formulas for the expectation and variance–covariance matrix of the peanut distribution are new. We also showed that spherical distributions of the form $q(\theta) = F(\theta^T A\,\theta)$ have $\mathbb{E}_q = 0$. We applied the same method to the ODF and the Bingham distributions, but, unfortunately, no closed-form expressions could be derived.
4. Eigenvalues and Anisotropy
When spherical distributions are used in diffusion models, the variance–covariance matrix describes the anisotropic movement of the random walkers [17]. The level of anisotropy can be measured by the eigenvalues of the diffusion tensor and by the fractional anisotropy ($FA$) [7]. The fractional anisotropy is an index between 0 and 1, where 0 corresponds to full radial symmetry (isotropy) and 1 to extreme anisotropic alignment with a particular direction. Another indicator of anisotropy is the ratio $R$ of the largest to the smallest eigenvalue of the diffusion tensor. This quantity ranges from 1 (isotropic) to $\infty$ (maximally anisotropic).
Note that the fractional anisotropy and the anisotropy ratio are only used for symmetric distributions of the form $q(\theta) = q(-\theta)$. In this case, $\mathbb{E}_q = 0$ and $V_q = M_2$. Hence, the following applies to the peanut distribution and to the bimodal von Mises–Fisher distribution (31). The ODF and the Bingham distributions are also symmetric, but, since we do not have explicit forms of their second moments, we cannot compute their anisotropy values for general dimensions.
Recall that, in (4), we identified the diffusion tensor as
$$D(x) = \frac{s^2}{\mu}\, V_q(x),$$
where $s$ is the speed and $\mu$ is the turning rate of a cell or individual. Hence, the eigenvalues of $D$ are scaled versions of the eigenvalues of $V_q$.
The fractional anisotropy (FA) formulas are defined separately for two and three dimensions, and we summarize them in the following definition.
Definition 1. - 1. The fractional anisotropy in two dimensions is given by
$$FA_2 = \sqrt{2}\;\frac{\sqrt{(\lambda_1-\bar\lambda)^2 + (\lambda_2-\bar\lambda)^2}}{\sqrt{\lambda_1^2+\lambda_2^2}}, \tag{48}$$
where $\lambda_1, \lambda_2$ are the eigenvalues of a two-dimensional tensor and $\bar\lambda = \frac{\lambda_1+\lambda_2}{2}$ is the average of the two eigenvalues [7]. - 2. The fractional anisotropy in three dimensions is given by
$$FA_3 = \sqrt{\frac{3}{2}}\;\frac{\sqrt{(\lambda_1-\bar\lambda)^2 + (\lambda_2-\bar\lambda)^2 + (\lambda_3-\bar\lambda)^2}}{\sqrt{\lambda_1^2+\lambda_2^2+\lambda_3^2}}, \tag{49}$$
where $\lambda_1, \lambda_2, \lambda_3$ are the eigenvalues of a three-dimensional tensor and $\bar\lambda = \frac{\lambda_1+\lambda_2+\lambda_3}{3}$ is the average of the three eigenvalues [7]. - 3. The anisotropy ratio $R$ is defined as
$$R = \frac{\lambda_{\max}}{\lambda_{\min}}. \tag{50}$$
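For later use, the fractional anisotropy formulas and the ratio $R$ can be collected in a small utility; the sketch below (our own code) uses the dimension-dependent prefactor $\sqrt{n/(n-1)}$, which reproduces (48) for $n=2$ and (49) for $n=3$.

```python
# Fractional anisotropy and eigenvalue ratio of a symmetric diffusion tensor.
import numpy as np

def fractional_anisotropy(D):
    """FA = sqrt(n/(n-1)) * ||lambda - mean(lambda)|| / ||lambda|| for symmetric D."""
    lam = np.linalg.eigvalsh(D)
    n = lam.size
    return np.sqrt(n / (n - 1)) * np.linalg.norm(lam - lam.mean()) / np.linalg.norm(lam)

def eigenvalue_ratio(D):
    """R = largest eigenvalue of D divided by its smallest eigenvalue."""
    lam = np.linalg.eigvalsh(D)
    return lam[-1] / lam[0]

D = np.diag([0.9, 0.1])                          # hypothetical, strongly aligned 2D tensor
print(fractional_anisotropy(D), eigenvalue_ratio(D))
```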
Theorem 3. Given a symmetric, positive definite matrix $A$ with eigenvalue and eigenvector pairs $(a_i, v_i)$, $i = 1,\dots, n$, the diffusion tensor arising from the peanut distribution (6) is
$$D_p = \frac{s^2}{\mu}\,\frac{1}{(n+2)\operatorname{tr}(A)}\Bigl(\operatorname{tr}(A)\,\mathbb{I}_n + 2A\Bigr). \tag{51}$$
The diffusion tensor (51) has the eigenvalues
$$\lambda_i = \frac{s^2}{\mu}\,\frac{\operatorname{tr}(A) + 2a_i}{(n+2)\operatorname{tr}(A)}, \qquad i = 1,\dots, n.$$
The fractional anisotropies and the ratio of eigenvalues are bounded,
$$FA_2 \le \sqrt{\frac{2}{5}}, \qquad FA_3 \le \frac{2}{\sqrt{11}}, \qquad R \le 3.$$
Proof. To compute the eigenvalues, we multiply both sides of (51) by the eigenvector $v_i$ of $A$ and obtain
The formulas for the fractional anisotropy follow directly by substituting the eigenvalues of (51) into (48) and (49), respectively. The upper bounds follow from taking all eigenvalues of $A$ except the largest to zero. For the ratio $R$, we write
$$R = \frac{\lambda_{\max}}{\lambda_{\min}} = \frac{\operatorname{tr}(A) + 2a_{\max}}{\operatorname{tr}(A) + 2a_{\min}} = \frac{3a_{\max} + a_{\min} + \sum_{j} a_j}{a_{\max} + 3a_{\min} + \sum_{j} a_j} \le 3,$$
where the maximum and minimum eigenvalues of $A$ are denoted by $a_{\max}$ and $a_{\min}$, respectively, all other eigenvalues are called $a_j$, and the trace becomes the sum of the eigenvalues. □
It is very surprising to see that the anisotropy ratio is bounded by a finite value of 3 in any space dimension, and the fractional anisotropies are uniformly bounded by a value less than 1. This means the peanut distribution does not allow for arbitrary anisotropies, and for that reason it is of limited use in the modelling of biological movement data.
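This bound is easy to probe numerically. The sketch below (our own code, assuming the peanut tensor of Theorem 3) draws random symmetric positive definite matrices $A$ in several dimensions and confirms that the eigenvalue ratio $R$ of the resulting peanut diffusion tensor stays below 3.

```python
# Numerical probe of the bound R <= 3 for the peanut diffusion tensor.
import numpy as np

rng = np.random.default_rng(4)
worst = 0.0
for _ in range(10_000):
    n = int(rng.integers(2, 6))                  # random dimension between 2 and 5
    B = rng.standard_normal((n, n))
    A = B @ B.T + 1e-6 * np.eye(n)               # random symmetric positive definite A
    D = (np.trace(A) * np.eye(n) + 2 * A) / ((n + 2) * np.trace(A))
    lam = np.linalg.eigvalsh(D)
    worst = max(worst, lam[-1] / lam[0])
print(worst)                                      # stays below 3 in every trial
```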
Next, we consider the anisotropy of the bimodal von Mises–Fisher distribution (
31).
Theorem 4. Let $u\in\mathbb{S}^{n-1}$ be a given direction and $k$ the concentration parameter. The diffusion tensor arising from the bimodal von Mises–Fisher distribution (31) is of the form
$$D_{vM} = \frac{s^2}{\mu}\left[\frac{I_{n/2}(k)}{k\, I_{n/2-1}(k)}\,\mathbb{I}_n + \left(1 - \frac{n\, I_{n/2}(k)}{k\, I_{n/2-1}(k)}\right) u\otimes u\right]. \tag{55}$$
The diffusion tensor (55) has the eigenvalues
$$\lambda_1 = \frac{s^2}{\mu}\left(1 - (n-1)\,\frac{I_{n/2}(k)}{k\, I_{n/2-1}(k)}\right), \qquad \lambda_2 = \dots = \lambda_n = \frac{s^2}{\mu}\,\frac{I_{n/2}(k)}{k\, I_{n/2-1}(k)},$$
with $\lambda_1$ corresponding to the eigenvector $u$. The anisotropies and the ratio of the eigenvalues satisfy
$$FA_2 \to 0, \quad FA_3 \to 0, \quad R \to 1 \ \text{ as } k\to 0, \qquad FA_2 \to 1, \quad FA_3 \to 1, \quad R \to \infty \ \text{ as } k\to\infty.$$
Proof. To compute the eigenvalues for (55), we first take the vector $u$ and show that it is indeed an eigenvector of (55), since
Hence, $u$ is an eigenvector with the eigenvalue $\lambda_1$. Now, we take any $v\in\mathbb{S}^{n-1}$ (where $v \perp u$) and show that it is an eigenvector of (55) as well, since
Thus, $v$ is an eigenvector with the eigenvalue $\lambda_2$. Since we can define $n-1$ such perpendicular vectors, we conclude that
The formulas for the anisotropies and for $R$ follow from substituting the determined eigenvalues into the definitions. To determine the upper bounds for the formulas, we compute the limits of $\lambda_1$ and $\lambda_2$ as $k\to 0$ and as $k\to\infty$. We start with the isotropic case of $k\to 0$. We apply the expansion of the Bessel functions as shown in (A4)
and
To evaluate the limit as $k\to\infty$, we apply the expansion (A7) for large $k$ to obtain
and
Now, we compute the bounds for the ratio $R$. We see that, if $k\to 0$, then $R\to 1$
by the above calculations. Further, $R\to\infty$ as $k\to\infty$, since
And finally,
So, $FA_2 \to 0$ and $FA_3 \to 0$ as $k\to 0$. Further, $FA_2 \to 1$ and $FA_3 \to 1$ as $k\to\infty$. □
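The limits in Theorem 4 can be observed directly by evaluating the eigenvalues, the fractional anisotropy, and the ratio $R$ as functions of $k$; the sketch below (our own code, with $s = \mu = 1$ and the Bessel-ratio eigenvalues stated above) mirrors the numerical confirmation shown in Figure 3.

```python
# Eigenvalues, FA and R of the bimodal von Mises-Fisher diffusion tensor vs. k.
import numpy as np
from scipy.special import ive

def vmf_tensor_spectrum(n, k):
    """Return (lambda_parallel, lambda_perp, FA, R) for the bimodal vMF tensor (s = mu = 1)."""
    ratio = ive(n / 2, k) / (k * ive(n / 2 - 1, k))   # I_{n/2}(k) / (k I_{n/2-1}(k))
    lam_par = 1 - (n - 1) * ratio                      # eigenvalue along u
    lam_perp = ratio                                   # (n-1)-fold perpendicular eigenvalue
    lam = np.array([lam_par] + [lam_perp] * (n - 1))
    fa = np.sqrt(n / (n - 1)) * np.linalg.norm(lam - lam.mean()) / np.linalg.norm(lam)
    return lam_par, lam_perp, fa, lam_par / lam_perp

for k in [0.01, 1.0, 10.0, 100.0]:
    print(k, vmf_tensor_spectrum(3, k))  # FA -> 0, R -> 1 as k -> 0; FA -> 1, R grows with k
```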
In Figure 3, we confirm our calculations numerically. We plot $\lambda_1$, $\lambda_2$, $FA_2$, $FA_3$, and $R$ with respect to $k$ and illustrate that they converge to the limits computed above. Note that, for some images in Figure 3, we truncated the plotting intervals to show the functions more clearly.
Summary
In this section, we computed the eigenvalues of the diffusion tensors corresponding to the peanut distribution (6) and the bimodal von Mises–Fisher distribution (31) in Theorems 3 and 4. We also found explicit versions of the $FA_2$ and $FA_3$ formulas, as well as of the ratio $R$, in Theorems 3 and 4. The $FA_2$, $FA_3$, and $R$ values for the peanut distribution have been shown to be bounded, unlike those for the von Mises–Fisher distribution.
5. An Illustrative Example of Glioma Spread
To illustrate the main results, we consider a glioma spread model in a simplified scenario. Gliomas are highly aggressive brain tumors that invade healthy tissue guided by white matter fibre tracts [25]. This process has been successfully modelled using Fisher–KPP-type equations (see model (66) below) [7,26,27]. The underlying anisotropy of white matter tracts can be measured through diffusion tensor imaging (DTI) [18,28,29], and this is where we start our illustrative example. We consider a two-dimensional domain and assume that a diffusion tensor is measured as
This indicates a strong bias in the $x$ direction and much less movement in the $y$ direction. DTI measurements record the movement of water molecules [18,29], which we need to translate into a movement model for cancer cells. We do this using the transport equation approach mentioned in Section 1.1 together with the peanut distribution and the bimodal von Mises distribution, respectively. Both methods give us an effective diffusion tensor for the glioma cells.
We also assume, for simplification, that the movement speed and turning rate are normalized such that $s = \mu = 1$.
Taking $A$ from above, with $n = 2$, we compute the two-dimensional diffusion tensor of the peanut distribution (51) as
To compute the corresponding diffusion tensor of the bimodal von Mises distribution (55) for $n = 2$, we need the dominant eigenvector $u$ and the concentration parameter $k$. In [7], the model was fit to glioma patient data, and it was found that the concentration parameter $k$ is about 3–5 times the fractional anisotropy of the patients' DTI data. Hence, here, we assume
The dominant direction $u$ is the leading eigenvector of $A$. Then, from (36) we find
We see immediately that the von Mises tensor is more anisotropic than the peanut tensor above.
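Since the measured tensor and the fitted value of $k$ are given in the example above and not reproduced here, the sketch below uses purely hypothetical numbers to illustrate the same comparison: it builds the two-dimensional peanut and bimodal von Mises diffusion tensors from a generic $x$-dominant tensor $A$ with $s = \mu = 1$ and compares their anisotropy ratios.

```python
# Comparison of the 2D peanut and bimodal von Mises diffusion tensors for a
# hypothetical DTI tensor A (all numbers are illustrative, not from the paper).
import numpy as np
from scipy.special import ive

A = np.diag([0.9, 0.1])                           # hypothetical, x-dominant tensor
n = 2
D_peanut = (np.trace(A) * np.eye(n) + 2 * A) / ((n + 2) * np.trace(A))

a_vals, a_vecs = np.linalg.eigh(A)
u = a_vecs[:, -1]                                  # dominant eigenvector of A
k = 4.0                                            # hypothetical concentration parameter
ratio = ive(n / 2, k) / (k * ive(n / 2 - 1, k))    # I_1(k) / (k I_0(k))
D_vm = ratio * np.eye(n) + (1 - n * ratio) * np.outer(u, u)

for name, D in [("peanut", D_peanut), ("von Mises", D_vm)]:
    lam = np.linalg.eigvalsh(D)
    print(name, lam, "R =", lam[-1] / lam[0])      # the von Mises tensor is more anisotropic
```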
We now take these two diffusion tensors, (64) and (65), together with an isotropic tensor for comparison, and solve the corresponding anisotropic glioma model [7]
$$c_t = \nabla\nabla : \bigl(D\, c\bigr) + r\, c\,(1 - c) \tag{66}$$
on a square domain with no-flux boundary conditions. Here, $c(t,x)$ describes the glioma cell density as a function of space and time, and $r$ is the maximal cancer growth rate. We choose a fixed growth rate and initialize the simulation with a small cancer seed in the middle of the domain. In Figure 4, we show the initial condition and the model results after 40 time units. We see that the von Mises distribution is able to describe a stronger directional bias than the peanut distribution. Note that, for this example, $A$ does not change with space, meaning that this example is applicable to a small region of tissue, such as a small section of a white matter tract in the brain. For larger domains, such as the entire brain, $A$ depends on space, and more detailed modelling is needed. This was done, for example, in [7,11,20].
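For completeness, here is a minimal solver sketch for the constant-tensor case of model (66) used in this example; it is our own code, not the authors' implementation, and the domain size, grid, growth rate, and initial seed are illustrative assumptions. With a constant $D$, the anisotropic term reduces to $\nabla\cdot(D\nabla c)$.

```python
# Explicit finite-difference sketch for c_t = div(D grad c) + r c (1 - c) with a
# constant diffusion tensor D on a square, and approximate no-flux boundaries.
import numpy as np

def simulate(D, r=0.1, L=10.0, N=81, T=40.0):
    h = L / (N - 1)
    dt = 0.05 * h**2 / np.abs(D).max()                 # conservative explicit time step
    c = np.zeros((N, N))
    c[N // 2 - 2:N // 2 + 3, N // 2 - 2:N // 2 + 3] = 1.0   # small cancer seed in the middle
    for _ in range(int(T / dt)):
        p = np.pad(c, 1, mode='edge')                  # zero normal derivative at the boundary
        cxx = (p[2:, 1:-1] - 2 * c + p[:-2, 1:-1]) / h**2
        cyy = (p[1:-1, 2:] - 2 * c + p[1:-1, :-2]) / h**2
        cxy = (p[2:, 2:] - p[2:, :-2] - p[:-2, 2:] + p[:-2, :-2]) / (4 * h**2)
        diffusion = D[0, 0] * cxx + 2 * D[0, 1] * cxy + D[1, 1] * cyy
        c = c + dt * (diffusion + r * c * (1 - c))
    return c

D_vm = np.array([[0.78, 0.0], [0.0, 0.22]])            # e.g., the hypothetical tensor above
print(simulate(D_vm).max())                             # the density approaches 1 in the invaded region
```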
6. Discussion
The standard way to compute the moments of a spherical distribution would be to use polar coordinates and engage in a series of trigonometric integrations. While this is often possible in two and three dimensions, it becomes tedious, or impossible, in higher dimensions. Here, we present an alternative method that can be used in any space dimension. This method was first used in [10] in two and three dimensions. It uses the divergence theorem on the unit ball to transform and simplify the integrals as much as possible. As a result, we obtain explicit formulas for the variance–covariance matrices of the $n$-dimensional von Mises–Fisher distribution (Theorem 1) and the $n$-dimensional peanut distribution (Theorem 2) as new results. The formulas derived in Theorems 1 and 2 involve nothing more complicated than Bessel functions, which are standard functions in any software package. Hence, the numerical implementation of these formulas is straightforward. We tried to apply the method to compute the first and second moments of other spherical distributions, such as the orientation distribution function (ODF) (7) and the Bingham distribution (8), but, unfortunately, this did not lead to closed-form expressions. However, because the Bingham and ODF distributions are symmetric, we do show that their expectation is zero. We also note that, based on our experience, the Bingham and Kent distributions have not been used in the biological context yet. They seem to be more relevant in geophysical applications [30]. Hence, they are not the main focus of our paper.
Once the variance–covariance matrices were established, we considered the levels of anisotropy as measured by the fractional anisotropies and the ratio of the maximal to the minimal eigenvalue. Surprisingly, the anisotropy of the peanut distribution is bounded away from 1 (Theorem 3), making it problematic to use in modelling. The anisotropy of the von Mises–Fisher distribution covers the entire range from isotropic to fully anisotropic (Theorem 4) and poses no such restriction when used for modelling.
Using the explicit formulas derived here will significantly decrease the computation time in problems with large amounts of data that require a higher-dimensional von Mises–Fisher distribution [31], as well as in simulations of the transport equations (partial differential equations) that utilize biological movement data [3,4,7]. Moreover, the formulas derived for the generalized von Mises–Fisher distribution can easily be extended to compute the expectation and variance–covariance matrices of mixed von Mises–Fisher distributions by substituting the appropriate directional vectors and adding up the terms.
As we compute the higher-dimensional integrals, we showcase a geometric interpretation of these integrals and the use of the divergence theorem. Hence, we combine the theory of spherical distributions with multivariable calculus in a new way. Beyond our explicit formulas, we believe our approach can open the door to further computations of explicit formulas for the expectation, variance, and even higher moments of spherical distributions and enable further applications in biology.