Abstract
A diffusion taking values in probability measures on a graph with vertex set V is studied. The masses on the vertices satisfy a stochastic differential equation on the simplex, driven by independent standard Brownian motions with skew symmetry, one for each pair of adjacent vertices. A Markov chain on integer partitions, dual to the Markov semigroup associated with the diffusion, is used to show that the support of an extremal stationary state of the adjoint semigroup is an independent set of the graph. We also investigate the diffusion with a linear drift, which gives a killing of the dual Markov chain on a finite integer lattice. The killed Markov chain is used to study the unique stationary state of the diffusion, which generalizes the Dirichlet distribution. Two applications of the diffusions are discussed: analysis of an algorithm to find an independent set of a graph, and Bayesian graph selection based on computation of the probability of a sample by using coupling from the past.
Keywords:
Bayesian graph selection; coupling from the past; integer partition; interacting particle system; independent set finding; measure-valued diffusion MSC:
60K35; 05C81; 60J70; 60J90; 65C05
1. Introduction
Consider a finite graph consisting of vertices , and edges E. Throughout this paper, a graph is undirected and connected. The neighbour of the vertex is denoted by , where means that i and j are adjacent. The degree of the vertex i is denoted by , which is the cardinality of the set . An independent set of is a subset of V, no two vertices of which are adjacent. In other words, a set of vertices is independent if and only if it is a clique in the graph complement of .
If two vertices of a graph have precisely the same neighbour, throughout this paper we call the graph obtained by identifying these two vertices while keeping the adjacency a reduced graph of .
Let be the totality of probability measures on the simplex
equipped with the topology of weak convergence. Itoh et al. [1] discussed a diffusion taking values in probability measures on a graph , whose state is identified with a probability measure through the masses on each vertex
which starts from and satisfies the stochastic differential equation of the form
where are independent standard Brownian motions with skew symmetry . The generator of the diffusion L operates on a function as
where , , , and otherwise, and is the totality of functions with a continuous derivative up to the second order and compact support in .
We say a face of the simplex corresponds to a set of vertices if the face is the interior of the convex hull of , denoted by , where is the standard basis of the vector space . If U consists of a single vertex, say , should be read as . An observation on the diffusion is as follows.
Proposition 1.
Every point in a face of the simplex corresponding to an independent set of a graph is a fixed point of the stochastic differential Equation (1). Namely,
is a fixed point.
Proof.
Itoh et al. [1] discussed the diffusion as an approximation of the following discrete stochastic model. Consider a system of N particles, where each particle is assigned to one vertex in V, and there is a continuous-time Markov chain
Here, is the number of particles assigned to the vertex at time s. At each instant of time, two of the N particles are chosen uniformly at random. If the chosen particles are at adjacent vertices, one of the two is chosen with equal probability and reassigned to the other particle's vertex. This causes a transition from to or with equal probability. This stochastic model seems to have various applications. Tainaka et al. [2] discussed this stochastic model as a model of a speciation process caused by geography. Ben-Haim et al. [3] considered a slightly modified version, in which a transition from to occurs on the one-dimensional graph, where , . They called it a “compromise process” because if we regard the vertices as political positions, then a transition is a compromise. The process converges weakly to the diffusion : if , then as in the space of right continuous functions with left limits (see Theorem 10.3.5 of [4]).
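For concreteness, the collision dynamics just described can be sketched in a few lines of Python. The adjacency-dictionary representation, the fixed number of steps, and the function name `simulate_collision_process` are illustrative choices, not part of the model in [1].

```python
import random

def simulate_collision_process(adj, counts, steps, seed=0):
    """Sketch of the discrete collision model: two particles are drawn
    uniformly at random, and if they sit on adjacent vertices, one of them
    (chosen with equal probability) jumps to the other particle's vertex.
    Returns the final particle counts per vertex."""
    rng = random.Random(seed)
    particles = [v for v, c in counts.items() for _ in range(c)]
    for _ in range(steps):
        i, j = rng.sample(range(len(particles)), 2)
        u, v = particles[i], particles[j]
        if v in adj[u]:           # only particles on adjacent vertices collide
            if rng.random() < 0.5:
                particles[i] = v  # (n_u, n_v) -> (n_u - 1, n_v + 1)
            else:
                particles[j] = u  # (n_u, n_v) -> (n_u + 1, n_v - 1)
    final = {v: 0 for v in adj}
    for p in particles:
        final[p] += 1
    return final

# Cycle graph C4 with one particle per vertex.
adj_c4 = {0: {1, 3}, 1: {0, 2}, 2: {1, 3}, 3: {0, 2}}
out = simulate_collision_process(adj_c4, {0: 1, 1: 1, 2: 1, 3: 1}, steps=200)
```

The total number of particles is conserved by every transition, which is the key structural property shared with the diffusion limit.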
Apart from an approximation of the discrete stochastic model discussed above, the diffusion seems to appear in various contexts. The diffusion on a complete graph appears as an approximation of a quite different discrete stochastic model, called the Wright–Fisher model in population genetics, which evolves by repetition of multinomial sampling of a fixed number of particles (see, e.g., Chapter 10 of [4], for details). The class of measure-valued diffusions is called Fleming–Viot processes. Such diffusions appear as prior distributions in Bayesian statistics.
As we will see below, the support of an extremal stationary state of the semigroup associated with the diffusion is a face of which corresponds to an independent set of . In this sense, the diffusion can be regarded as finding independent sets in graph theory. Some problems related to independent set finding, such as the maximum independent set problem, are known to be NP-hard, so it is believed that there is no efficient algorithm to solve them. Therefore, algorithms to find maximal independent sets are useful for obtaining practical solutions. For example, Luby [5] discussed a parallel algorithm for finding maximal independent sets, derived from a stochastic algorithm. His algorithm is based on a step that finds an independent set from a random permutation, executed by processors in time for large .
Let us assume there exists a strongly continuous Markov semigroup associated with the diffusion governed by the generator (2) such that
where is the totality of continuous functions on , and
The existence of such a semigroup for complete graphs was proven by Ethier [6]. For the solution of the stochastic differential Equation (1), we have
Let us denote by the adjoint semigroup on induced by . Consider the totality of all fixed points of :
We call each element of a stationary state of . A stationary state satisfies
The set is non-empty and convex. The totality of the extremal elements of stationary states is denoted by . Namely, a stationary state is uniquely represented as
for some , . In Theorem 1, we see that support of an extremal stationary state of the diffusion is a face of corresponding to an independent set of .
In this paper, we use the term support in a loose sense; namely, positivity of a stationary state is not assumed unless otherwise stated. In fact, Proposition 1 implies that if the diffusion starts from any point x in , the stationary state is , that is, an atom at x. In this situation, we say the support is . In other words, if a stationary state has no probability mass anywhere in an open set, then we say the set is not part of the support of the stationary state. Some examples are as follows.
Example 1.
Let , which is a complete graph consisting of r vertices. Since each single vertex constitutes a maximal independent set, a stationary state is represented as , where . For the solution of the stochastic differential Equation (1), is the absorption probability for the vertex . Since is a martingale, we know .
Example 2.
Let for an even positive integer r, which is a cycle graph consisting of r vertices, i.e., . The maximal independent sets are the set of all even integer vertices and that of all odd integer vertices. When , we have independent sets , , , , , and . The supports of extremal stationary states are the faces , , , and . The totality of the extremal stationary states is , where and are densities (not necessarily strictly positive) on and , respectively. Therefore, a stationary state is represented as .
Example 3.
Let for positive integers r and s, which is a complete bipartite graph consisting of two disjoint maximal independent sets of r and s vertices. For a graph whose maximal independent sets are and , a stationary state may be represented as .
Obtaining an explicit expression for the stationary states is a challenging problem. Itoh et al. [1] successfully obtained an explicit expression for the stationary states for a star graph , where a star graph is a complete bipartite graph , and the vertices of are numbered such that , . A stationary state may be represented as . If we identify vertices , the star graph is reduced to a complete graph . Using the arguments for a complete graph in Example 1, we know and . Itoh et al. [1] obtained an explicit expression for the diffusion starting from :
by using martingales introduced in Section 2. This result is for a specific graph but can be applied to other graphs reducible to . For example, the four-cycle graph discussed in Example 2 can be reduced to . Explicit expressions for , are immediately obtained.
This paper is organized as follows. In Section 2, the martingales used by Itoh et al. [1] are revisited in a slightly generalized form. An interpretation of the martingales is presented in Section 3. A duality relation between the Markov semigroup associated with the diffusion and a Markov chain on the set of ordered integer partitions is established. The dual Markov chain is studied and used to show that the support of an extremal stationary state of the adjoint semigroup is an independent set of the graph. In Section 4, we investigate the diffusion with a linear drift, which gives a killing of the dual Markov chain on a finite integer lattice. The Markov chain is studied and used to study the unique stationary state of the diffusion, which generalizes the Dirichlet distribution. In Section 5, two applications of the diffusions are discussed: analysis of an algorithm to find an independent set of a graph and Bayesian graph selection based on computation of the probability of a sample by using coupling from the past. Section 6 is devoted to discussion of open problems.
2. Invariants among Moments
For a graph , , an element with is denoted by . We use multi-index notation; a monomial is simply written as .
For star graphs , Itoh et al. [1] noticed the following homogeneous polynomials of arbitrary order :
are martingales, where the sum is taken over all ordered positive integer partitions of n satisfying with , and with , . This result generalizes to a generic graph; an example is a reducible graph, for which vertices in an independent set can be identified (a reduced graph is defined in Section 1).
Proposition 2.
Let be an independent set of a graph sharing an adjacent vertex. The homogeneous polynomial of any order :
is a martingale with respect to the natural filtration generated by the solution of the stochastic differential Equation (1), where with , .
Proof.
Applying Itô’s formula to monomials , we have
where the vertex j is adjacent to all vertices of . Then,
where
The right side of Equation (7) is proportional to
and it vanishes because the second summation does not depend on the index i and . □
Proposition 2 gives invariants among n-th order moments of the marginal distribution of the solution of the stochastic differential Equation (1) at a given time. More precisely, such a moment is represented as
Itoh et al. [1] used the invariants to derive the expression (6) for masses on atoms in the star graph .
Corollary 1.
Let be an independent set of a graph sharing an adjacent vertex. For moments of each order , we have
where with , .
A small example follows.
Example 4.
Let , which is the cycle graph consisting of four vertices discussed in Example 2. A maximal independent set of shares 1 or 3 of the adjacent vertices. The ordered positive integer partitions of four are , , and , which correspond to the fourth-order moments , , and , respectively. They constitute an invariant:
The existence of invariants among same-order moments is interesting, but we are also interested in computation of each moment. They can be computed by simple algebra, since the moments of each order satisfy a system of differential equations:
for each , where is the totality of the ordered positive integer partitions of an integer n with r positive integers:
However, it is obvious that solving the system (9) becomes prohibitive as the cardinality of the set grows. Computation of moments via stochastic simulation is discussed in Section 5.
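As a rough alternative to solving System (9), the moments can be estimated by simulating the stochastic differential Equation (1) directly. The sketch below uses an Euler–Maruyama scheme with one Brownian increment per undirected edge, which realizes the skew symmetry; the step size, the clamping of negative round-off, and the renormalization onto the simplex are numerical expedients, not part of the theory.

```python
import math
import random

def euler_maruyama_step(x, edges, dt, rng):
    """One Euler-Maruyama step for dx_i = sum_{j~i} sqrt(x_i x_j) dB_ij,
    with the skew symmetry B_ij = -B_ji realized by one increment per edge."""
    dx = [0.0] * len(x)
    for i, j in edges:
        dw = rng.gauss(0.0, math.sqrt(dt))
        a = math.sqrt(x[i] * x[j])
        dx[i] += a * dw   # x_j receives the opposite increment
        dx[j] -= a * dw
    y = [max(xi + di, 0.0) for xi, di in zip(x, dx)]  # clamp round-off
    s = sum(y)
    return [yi / s for yi in y]  # renormalize onto the simplex

def estimate_moment(x0, edges, a, t=1.0, dt=2e-3, paths=100, seed=1):
    """Crude Monte Carlo estimate of the moment E[prod_i x_i(t)^{a_i}]."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(paths):
        x = list(x0)
        for _ in range(int(t / dt)):
            x = euler_maruyama_step(x, edges, dt, rng)
        total += math.prod(xi ** ai for xi, ai in zip(x, a))
    return total / paths

# Fourth-order moment on the cycle graph C4, starting from the barycentre.
edges_c4 = [(0, 1), (1, 2), (2, 3), (3, 0)]
moment = estimate_moment([0.25] * 4, edges_c4, a=(2, 0, 2, 0))
```

The scheme preserves the simplex constraint exactly before clamping, since every edge increment cancels pairwise.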
3. Dual Process on Integer Partitions
To study diffusions governed by the generator (2), we employ a tool called duality, which is a familiar tool in the study of interacting particle systems (see, e.g., Chapter 2 of [7]).
Consider a graph and let , be a continuous-time Markov chain on the set of ordered non-negative integer partitions of n with , which is denoted by , by the rate matrix :
where
The backward equation for the transition probability is
We have the following duality relation between the Markov semigroup and the Markov chain .
Lemma 1.
Proof.
Noting that
and the operation (4), we see that satisfies the differential equation
for each . This is uniquely solved by means of the Feynman–Kac formula, and the assertion follows. □
Since the total number of particles is conserved, i.e., , , the killing rate (13) is bounded. The killing rate is not of definite sign; however, a key observation is that if the support of a monomial a, denoted by , is an independent set of , then the killing rate is non-positive: . The converse is not always true.
To illustrate the Markov chain , let us ask specific questions. What is the moment of for the cycle graph discussed in Example 2? For a chain that starts from , there are two possible transitions: the one is absorbed into the state and the other is absorbed into the state , where the rates are unities (see Figure 1).
Figure 1.
Possible transitions of the chain on the cycle graph starting from .
The waiting time for the occurrence of either of these two transitions follows the exponential distribution of rate two. Since and , the right side of the duality relation (12) can be computed as
where s is the time that one of the two possible transitions occurs. The first term corresponds to the case that no transition occurs until time t. Let us call a transition event a collision.
Remark 1.
An analogous dual Markov chain of the diffusion approximation of a kind of Wright–Fisher model was effectively used by Shiga [8,9], where a transition occurs with the same rate as in (10). Such a transition event is called a “coalescent”. In contrast to a collision, the total number of particles decreases at a coalescent.
Here, we have a simple observation about the invariants among moments discussed in Section 2. If a chain starts from a state a such that is a maximal independent set of the graph , then the killing rate is non-positive, and the duality relation (12) is reduced to
Corollary 1 implies cancellation of the second term among moments. Moreover, considering a case that the diffusion starts from a point x in the face corresponding to , by Proposition 1, we know that after a collision, must contain a vertex that is not contained in .
Let us ask another question. What is the moment of for the star graph discussed in Section 1? Some consideration reveals that a chain starting from a will never be absorbed, and transitions occur among the three states: a, , and . Since and , the duality relation (12) gives
where is 1 if the argument is true and zero otherwise. Solving the backward Equation (11), we immediately obtain
However, computation of the right side of Equation (14) does not seem easy because the expectation depends on a sample path of the chain . Nevertheless, the moments can be obtained easily by solving the system of differential equations (9). In fact, we have
where
The observations above lead to the following proposition on the fate of the Markov chain .
Proposition 3.
Consider the Markov chain on the set of the ordered non-negative integer partitions .
- If a chain starts from a state a satisfying , then it is absorbed into an element of ;
- If a chain starts from a state a satisfying , then the transition probability converges to the uniform distribution on the set of ordered positive integer partitions
Proof.
- (i)
- A state a is absorbing if and only if the row vector of the rate matrix (10) is zero, which implies . Consider the set of vertices . Then, , is a death process and is absorbed into the state at Markov time with respect to the Markov chain , where for . Let us show that the process eventually decreases if . If , at least one vertex, say , satisfies . If the vertex j is connected with a vertex in , say k, the transition occurs with positive probability, and it makes . Otherwise, the vertex j should be connected with at least one vertex in , say l. The transition makes . If the vertex l is connected with a vertex in , the assertion is shown. Otherwise, the vertex l should be connected with at least one vertex in . The proof is completed by repeating this procedure until we reach a vertex in that is connected with a vertex in .
- (ii)
- If a chain starts from a state in the set , the argument for , i.e., that is a death process, shows that in a finite time the chain reaches a state, say a, in the set and never exits from . For such a case, consider restarting the chain from the state a. For simplicity, we consider the case of . Then, a state can be identified with the unique vertex satisfying . Since we are considering a connected graph, there exists a path for any with some , where , . Since all of the transitions occur with rate unity, the sample path has a positive probability. Hence, the Markov chain is irreducible and ergodic, and there exists a unique stationary distribution on the set . Since the uniform distribution on satisfies the backward Equation (11), it is the stationary distribution. Cases of can be shown in a similar manner by showing that up to particles can be moved from one vertex to another.
□
Proposition 3 shows that if the Markov chain starts from a state a satisfying , convergence of the chain can be divided into two phases: (1) exit from the set and (2) convergence to the uniform distribution on the set . The largest eigenvalue of the rate matrix is zero. To consider the mixing, we have to know the second-largest eigenvalue, say . By spectral decomposition of the transition probability, the mixing time of Phase 2, or the infimum of t satisfying
for is less than for some constant , where is the total variation distance between probability measures and .
Example 5.
Consider the case that (see the proof of Proposition 3 ()). It can be shown that the rate matrix reduces to the negative of the Laplacian matrix of ; the second-smallest eigenvalue of the Laplacian is called the algebraic connectivity of . For a connected graph, it is known that the smallest eigenvalue of the Laplacian is zero and the algebraic connectivity is bounded below by [10], where is the diameter of , or the maximum of the shortest path lengths between any pair of vertices in . For the star graph and the fourth-order monomials discussed above, the rate matrix is the negative of the Laplacian matrix:
and the eigenvalues are 0, , and , where , . The two eigenvalues appear in the transition probabilities (15). For a generic graph with large r, the mixing time is .
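The spectral statements of this example can be checked numerically for a small star graph. In the sketch below, the lower bound tested at the end, 4 divided by the number of vertices times the diameter, is our reading of the bound in [10] and should be treated as an assumption.

```python
import numpy as np

def laplacian(adj):
    """Graph Laplacian L = D - A from an adjacency dict {vertex: neighbours}."""
    n = len(adj)
    L = np.zeros((n, n))
    for i, nbrs in adj.items():
        L[i, i] = len(nbrs)
        for j in nbrs:
            L[i, j] = -1.0
    return L

# Star graph K_{1,4} (r = 5 vertices); vertex 0 is the centre, diameter 2.
adj_star = {0: {1, 2, 3, 4}, 1: {0}, 2: {0}, 3: {0}, 4: {0}}
eig = np.linalg.eigvalsh(laplacian(adj_star))  # eigenvalues in ascending order
algebraic_connectivity = eig[1]
# Known Laplacian spectrum of the star K_{1,r-1}: 0, 1 (multiplicity r-2), r.
```

Here `np.linalg.eigvalsh` exploits the symmetry of the Laplacian and returns the eigenvalues sorted, so the algebraic connectivity is simply the second entry.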
Assessment of Phase 1 seems harder. The death process , with is not a Markov process. The death rate is bounded below:
To obtain a rough estimate of the right side, let us suppose the state follows the uniform distribution on the set of ordered positive integer partitions , where
Then, the expectation of the lower bound is , where . When n is large, the dominant contribution to the expectation of the waiting time for the exit comes from the period , and it is . Hence, the expectation of the waiting time for the exit would be .
Shiga [8,9] and Shiga and Uchiyama [11] studied structures of extremal stationary states of the diffusion approximation of a kind of Wright–Fisher model in for a countable set S by using its dual Markov chain. Extremal states of the adjoint Markov semigroup on induced by associated with the diffusion can be studied by using the dual Markov chain . Note that positivity of a stationary state is not assumed, as explained in Section 1.
Theorem 1.
The support of an extremal stationary state of the adjoint Markov semigroup is a face of the simplex corresponding to an independent set of the graph , namely, .
Proof.
Consider a Markov chain with rate matrix (10) starting from a state . According to Proposition 3 , such an a is an absorbing state and the chain stays there. Lemma 1 gives
A stationary state satisfies for any . If , this condition reduces to
Therefore, if is not an independent set, then . Since we are considering a connected graph , V is not an independent set of . The condition reduces to . Since
the condition excludes from the support of . Suppose there exists a vertex such that is still not an independent set. Take a such that if and otherwise for each . Since
the condition excludes from the support of . Repeating this procedure yields an independent set for some , and the face is not excluded from the support of . □
The steps in the above proof appear in the following example.
Example 6.
Let , which is a cycle graph consisting of four vertices. The support of an extremal stationary state appearing in Example 2 is confirmed as follows. Remove the vertex from the vertex set V. Since is not an independent set, the face is excluded from the support of extremal stationary states. Then, remove from . Since is an independent set, the face is the support of an extremal stationary state.
A direct consequence of Theorem 1 on the moments is as follows.
Corollary 2.
For each , if the limit of an n-th order moment of the diffusion on a graph is positive, namely,
then is an independent set of .
4. Diffusion with Linear Drift
In this section, we consider the diffusion taking value in probability measures on a graph , satisfying the following stochastic differential equation with linear drift:
for . The drift term, , gives a killing of the dual process with a linear rate. As shown below, behaviours of the diffusion and the dual Markov chain are significantly different from those without drift discussed in previous sections.
In Itoh et al.’s discrete stochastic model described in Section 1, this drift corresponds to adding the following dynamics: at each instant of time, one of N particles is chosen uniformly randomly and assigned to another vertex chosen uniformly randomly with rate . In the Wright–Fisher model, this drift corresponds to a mutation mechanism [4].
Let be a continuous-time Markov chain on a finite integer lattice, or the set of non-negative integers
by the rate matrix :
where
The backward equation for the transition probability is
The following duality relation between the Markov semigroup associated with the diffusion denoted by and the Markov chain can be shown in the same manner as Lemma 1.
Lemma 2.
The Markov semigroup and the Markov chain with the rate matrix (17) satisfy
for each , where the killing rate is
In contrast to the rate matrix in (10), particles are erased, so the total number of particles decreases. It is clear from the rate matrix (17) that 0 is the unique absorbing state.
Proposition 4.
Let , which is a Markov time with respect to the Markov chain with the rate matrix (17). Then,
Proof.
Since if and only if , we consider the Markov chain of the cardinality . According to the rate matrix (17), it is a linear death process with rate . Noting that is the convolution of exponential random variables of rates , , we have the assertion. □
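A quick numerical illustration of Proposition 4, assuming the death rate from a state with k surviving particles is kα (our reading of the linear death process above): the expected absorption time is then the sum of the means of the successive exponential holding times, and Monte Carlo simulation reproduces this convolution formula.

```python
import random

def death_time(n, alpha, rng):
    """Sample the absorption time of a linear death process jumping
    k -> k - 1 at rate k * alpha (an assumed reading of the rates in (17))."""
    t, k = 0.0, n
    while k > 0:
        t += rng.expovariate(k * alpha)  # exponential holding time at level k
        k -= 1
    return t

def expected_death_time(n, alpha):
    """E[tau] for the convolution of exponentials of rates n*alpha, ..., alpha."""
    return sum(1.0 / (k * alpha) for k in range(1, n + 1))

rng = random.Random(2)
n, alpha = 5, 0.7
mc = sum(death_time(n, alpha, rng) for _ in range(20000)) / 20000
exact = expected_death_time(n, alpha)
```

With 20,000 replicates the Monte Carlo mean agrees with the harmonic-sum formula to well within its standard error.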
To illustrate the Markov chain , let us ask a specific question. What is the moment of for the cycle graph ? For a chain that starts from a, there are four possible sample paths (Figure 2):
Figure 2.
Possible sample paths of the chain on the cycle graph starting from .
- (i)
- No particles are erased;
- (ii)
- Particle 1 is erased, but Particle 2 survives;
- (iii)
- Particle 2 is erased, but Particle 1 survives;
- (iv)
- Both particles are erased.
The waiting time for either of the two particles to be erased follows the exponential distribution of rate . Since and , the right side of the duality relation (18) can be computed as
where and are the times that a particle is erased.
The set of stationary states of the adjoint Markov semigroup on induced by consists of a unique probability measure.
Theorem 2.
For the adjoint Markov semigroup , there exists the unique stationary state satisfying
for every .
Proof.
Since the Markov chain with the rate matrix (17) is absorbed into 0, Lemma 2 and Proposition 4 give
for all . Since for each , there exists a unique probability measure satisfying for all . □
The stationary state converges weakly to a common limit as irrespective of the graph.
Corollary 3.
The stationary state of the adjoint Markov semigroup satisfies
Proof.
Since the killing rate (19) is bounded and for large , the leading contribution to the expression (20) can be evaluated by the death process considered in Proposition 4, whose waiting time follows the exponential distribution of rate . Let . We have
where is a constant satisfying for all b satisfying . In the same way, is bounded below. The assertion follows by taking the limit of these bounds. □
Moreover, the stationary state has a continuous and strictly positive density.
Theorem 3.
For the adjoint Markov semigroup , the unique stationary state is absolutely continuous with respect to the Lebesgue measure on and admits a probability density that is strictly positive in and is of -class.
Proof.
We first show that has a density of -class. By Theorem 2, we have
for each . Therefore, has a -density represented as
We next show that the density is strictly positive in . Consider an approximation of by polynomials:
satisfying . Suppose there exists a point satisfying . Since the polynomials are strictly positive, for any small positive constants and , there exists an N such that
and is covered by open balls:
for all . Since is smooth, for every point , we can find a ball containing x and for some constant c. This implies , , but this contradicts the fact that , which follows from the expression (20) because the killing rate is bounded and the Markov time satisfies by Proposition 4. □
An immediate consequence is the following corollary, which is an analogous result to Corollary 2.
Corollary 4.
The moments of the stationary state of the adjoint Markov semigroup are positive, namely,
The moments of the stationary state can be obtained by the condition for the stationary state (5). It gives a system of recurrence relations:
for each with the boundary condition . In contrast to the system of ordinary differential equations (9), this system is not closed among moments of the same order; prior to solving the system for the moment of a given monomial a, we have to solve the systems for the moments of all lower orders. Therefore, solving System (22) seems a formidable task. The diffusion on a complete graph is an exception.
Example 7.
Let , which is the complete graph consisting of r vertices discussed in Example 1. The unique solution of the system of recurrence relations (22) is
Moreover, since this expression gives the moments of the symmetric Dirichlet distribution of parameter α, the stationary state is the Dirichlet distribution:
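Example 7 can be verified numerically: the moment formula for the symmetric Dirichlet distribution, written here with rising factorials, is standard, and sampling from the Dirichlet distribution reproduces it. The parameter values below are arbitrary illustrative choices.

```python
import numpy as np

def rising(x, k):
    """Rising factorial (x)_k = x (x + 1) ... (x + k - 1)."""
    out = 1.0
    for i in range(k):
        out *= x + i
    return out

def dirichlet_moment(alpha, a):
    """E[prod_i x_i^{a_i}] under the symmetric Dirichlet(alpha) on len(a) vertices:
    the standard formula prod_i (alpha)_{a_i} / (r * alpha)_n with n = sum(a)."""
    r, n = len(a), sum(a)
    num = 1.0
    for ai in a:
        num *= rising(alpha, ai)
    return num / rising(r * alpha, n)

rng = np.random.default_rng(3)
alpha, a = 0.5, (2, 1, 0)
exact = dirichlet_moment(alpha, a)  # = (0.5)_2 (0.5)_1 / (1.5)_3
samples = rng.dirichlet([alpha] * len(a), size=100000)
empirical = float(np.mean(np.prod(samples ** np.array(a), axis=1)))
```

With 100,000 samples the empirical moment matches the closed form to three decimal places.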
Remark 2.
The limit of the moments (23) is known as the Dirichlet-multinomial distribution up to multiplication of the multinomial coefficient. Renumbering the set by and taking the limit and with , the expression (23) reduces to the form
which is known as the Ewens sampling formula, or the exchangeable partition probability function of the Dirichlet prior process in Bayesian statistics (see, e.g., [12] for an introduction). Karlin and McGregor [13] derived this formula by using a system of recurrence relations based on the coalescents mentioned in Remark 1. In this sense, we have found an alternative system of recurrence relations (22), based on collisions, that Formula (24) satisfies.
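The Ewens sampling formula is easy to evaluate. Since Formula (24) is not reproduced here, the sketch below uses the standard form of the formula for an unordered partition and checks that the masses sum to one over all partitions of a small n.

```python
from math import factorial

def partitions(n, max_part=None):
    """Yield the integer partitions of n as non-increasing tuples."""
    if max_part is None:
        max_part = n
    if n == 0:
        yield ()
        return
    for k in range(min(n, max_part), 0, -1):
        for rest in partitions(n - k, k):
            yield (k,) + rest

def rising(x, k):
    out = 1.0
    for i in range(k):
        out *= x + i
    return out

def esf(theta, part):
    """Ewens sampling formula (standard form) for an unordered partition:
    P = n! / (theta)_n * prod_j (theta / j)^{a_j} / a_j!,
    where a_j is the number of parts of size j."""
    n = sum(part)
    p = factorial(n) / rising(theta, n)
    for j in set(part):
        aj = part.count(j)
        p *= (theta / j) ** aj / factorial(aj)
    return p

total = sum(esf(1.2, p) for p in partitions(5))
```

For theta = 1 the probability of the single-block partition of n is 1/n, a convenient spot check.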
5. Applications
In this section, we present applications of the results developed in previous sections.
5.1. Finding Independent Sets of Graphs
Itoh et al.’s discrete stochastic model described in Section 1 stops when the set of vertices occupied by at least one particle constitutes an independent set of a graph. The model is summarized as the following procedure.
The cardinality of the set of vertices to which at least one particle is assigned decreases, and the set eventually reduces to an independent set of . The integer M is needed to confirm that we can no longer choose particles from neighbouring vertices. If M is sufficiently large, Algorithm 1 provides an independent set with high probability.
| Algorithm 1 Finding an independent set of a graph |
|
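Since the pseudocode of Algorithm 1 is not reproduced here, the following Python sketch implements the procedure as described in the text: collision steps are repeated until M consecutive draws fail to find two particles on adjacent vertices. The value of M, the seed, and the round-robin initial placement are our illustrative choices.

```python
import random

def find_independent_set(adj, n_particles, M=1000, seed=0):
    """Sketch of Algorithm 1: run the collision dynamics until M consecutive
    draws fail to produce two particles on adjacent vertices, then return
    the occupied vertex set."""
    rng = random.Random(seed)
    verts = list(adj)
    particles = [verts[k % len(verts)] for k in range(n_particles)]
    failures = 0
    while failures < M:
        i, j = rng.sample(range(n_particles), 2)
        u, v = particles[i], particles[j]
        if v in adj[u]:            # a collision: one particle jumps
            failures = 0
            if rng.random() < 0.5:
                particles[i] = v
            else:
                particles[j] = u
        else:
            failures += 1
    return set(particles)

def is_independent(adj, s):
    """Check that no two vertices of s are adjacent."""
    return all(v not in adj[u] for u in s for v in s)

adj_c4 = {0: {1, 3}, 1: {0, 2}, 2: {1, 3}, 3: {0, 2}}
ind = find_independent_set(adj_c4, 4)
```

On the cycle graph C4, the returned set is one of the independent sets listed in Example 2, a singleton or one of the two diagonal pairs.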
A natural question is how many steps are needed to find an independent set. Answering this question seems hard, but regarding the diffusion satisfying the stochastic differential Equation (1) as an approximation of the procedure of Algorithm 1, we can deduce a rough idea. Because of the scaling in the diffusion limit, the unit time in the diffusion corresponds to iterations of Steps 3 and 4 of Algorithm 1.
According to the argument of Proposition 1, a sample path of the diffusion starting from a point is absorbed into lower dimensional faces and is eventually absorbed into a face corresponding to an independent set.
Proposition 5.
Let be a set of vertices that is not an independent set of a graph . For a sample path of the diffusion starting from a point , the Markov time
satisfies
where is the edge set of the induced subgraph of consisting of U, is the boundary of , and is a constant depending on x.
Proof.
By the argument in the proof of Theorem 1, we have
while . Choose . □
The author has not found any other property of the Markov time for generic graphs, but the diffusion on a complete graph is an exception; the probability distribution function can be obtained exactly.
Proposition 6.
Let be a complete graph . For a face , , the distribution of the Markov time (25) is represented as
Proof.
The inclusion–exclusion argument shows the following expression:
where is the total mass on and the summations are taken over the totality of distinct indices chosen from . For , an explicit expression can be obtained by solving a backward equation for the diffusion (Equation (4.15) in [14]):
A complete graph can be reduced to by any partition of the vertex set into two vertex sets (the reduction is defined in Section 1). For example, is reducible to consisting of Vertices , where the Vertex is obtained by identifying Vertices 1 and 2. In the same way, is also reducible to and . Therefore, an expression of is obtained from the right side of (27) by replacing with . The inclusion–exclusion argument gives
where both sides are zero if . Substituting the expression obtained by the expression (27) into (26) and collecting terms by using the equality (28), the assertion follows. □
According to Proposition 6, the probability distribution function of the exit time from is asymptotically for large t, where . Let a sequence of vertex sets occupied by at least one particle in Algorithm 1 be denoted by V, , , …, for a vertex . If the exit time for followed the exponential distribution of mean (of course, this is not exactly true), the expected waiting time until a sample path is absorbed into the vertex would be for large r. This rough argument suggests that the expected computation cost of Algorithm 1 would be for a complete graph because an iteration of Steps 3 and 4 can be executed in . Luby's algorithm for finding an independent set described in Section 1 demands using processors.
5.2. Bayesian Graph Selection
Consider a sample of n particles from the unique stationary state of the adjoint Markov semigroup on induced by the Markov semigroup associated with the diffusion that appeared in Theorem 2, such that particles of a graph are taken from the vertex . We assume the probability of taking a sample does not depend on the order of the particles; namely, they are exchangeable. Such probabilities constitute the multinomial distribution, namely, a probability measure on ordered non-negative integer partitions of n as
satisfying . The moment defined by (21) is the expectation of the sample probability up to the multinomial coefficient:
Before proceeding to discuss computational issues, we present a motivating problem in Bayesian statistics. The expected sample probability (29) is a mixture of multinomial distributions of parameters x over the stationary state of the adjoint Markov semigroup . In statistical terminology, the sample probability and the expectation (29) are called the likelihood and the marginal likelihood, respectively, and is called the prior distribution for the parameter .
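In the special case where the prior distribution is a Dirichlet distribution (as holds for the complete graph; see Example 7), the marginal likelihood has a closed form, the Dirichlet-multinomial distribution. The following is a minimal sketch; the parameter vector `alpha` and its correspondence with the notation of (21) are assumptions made for illustration only.

```python
from math import lgamma, exp

def dirichlet_multinomial(a, alpha):
    """Marginal likelihood of the counts `a` under a Dirichlet(alpha) prior,
    i.e. the Dirichlet-multinomial probability of the sample."""
    n = sum(a)
    A = sum(alpha)
    # multinomial coefficient n! / (a_1! ... a_K!), in log space
    log_p = lgamma(n + 1) - sum(lgamma(ai + 1) for ai in a)
    # ratio of multivariate beta functions B(alpha + a) / B(alpha)
    log_p += lgamma(A) - lgamma(A + n)
    log_p += sum(lgamma(al + ai) - lgamma(al) for al, ai in zip(alpha, a))
    return exp(log_p)
```

For a uniform prior (all components of `alpha` equal to one), this distribution is uniform over the ordered non-negative integer partitions of n, which gives a quick sanity check.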
Suppose we are interested in selecting a graphical model consisting of four vertices from three candidate models: a star graph , a cycle graph , and a complete graph (Figure 3).
Figure 3.
Three candidate graphical models: a star graph , a cycle graph , and a complete graph .
For this purpose, we employ stationary states as the prior distributions. As we saw in Example 7, the prior distribution for is the Dirichlet distribution, but closed-form expressions of the distribution functions are not known for the prior distributions of the other graphs. Suppose we have a sample consisting of two particles. If it is , by solving the recurrence relation (22), we obtain the expected sample probabilities under , , and as
respectively. On the other hand, if the sample is , they are
If is small, supports , while does not support . This is reasonable because the vertex set is an independent set of but not an independent set of and . On the other hand, the set is not an independent set of , but it is an independent set of and . The ratio of marginal likelihoods is called the Bayes factor, a standard model-selection criterion in Bayesian statistics (see, e.g., Section 6.1 of [15]). If a sample is , the Bayes factor of to or is . Therefore, is supported if is small, while all graphs are equivalent if is large. We do not discuss details of statistical aspects, including how to choose , but it is worth mentioning that a positive improves the stability of model selection, especially for small samples. In fact, if is small, the Bayes factor can change drastically when a single sample unit is added. Suppose we have a sample and take an additional sample unit. If it is , the expected sample probabilities for the sample under , , and are
respectively. The Bayes factor of to is unity, which means that the graphs and are equivalent. This conclusion is quite different from that deduced from the sample . In the limit of large , by Corollary 3, the expected sample probability of any graph follows the unique limiting distribution, namely, the multinomial distribution.
Now let us discuss computation of expected sample probabilities. A closed-form expression of the stationary state of the adjoint Markov semigroup is not known for generic graphs; nevertheless, we can compute the expected sample probabilities of any graph by solving the system of recurrence relations (22). Solving the system becomes prohibitive as the sample size n grows, but the following algorithm, which is a byproduct of Theorem 2, provides an unbiased estimator of the expected sample probability.
By Corollary 4, the output of Algorithm 2 is an unbiased estimator of , which gives the expected sample probability (29) by multiplying the multinomial coefficient.
Algorithm 2: Estimating the sample probability, or the marginal likelihood
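The averaging and rescaling implied by Corollary 4 can be sketched as follows. Here `run_algorithm2` is a hypothetical stand-in for a single run of Algorithm 2 on the counts `a`; the replicate averaging, multinomial-coefficient rescaling, and standard-error report are illustrative assumptions.

```python
from math import lgamma, exp, sqrt

def multinomial_coefficient(a):
    """n! / (a_1! ... a_K!) for counts a with n = sum(a)."""
    n = sum(a)
    return round(exp(lgamma(n + 1) - sum(lgamma(ai + 1) for ai in a)))

def estimate_marginal_likelihood(run_algorithm2, a, replicates=1000):
    """Average independent replicates of the unbiased estimator and
    multiply by the multinomial coefficient; also report a standard error."""
    xs = [run_algorithm2(a) for _ in range(replicates)]
    mean = sum(xs) / replicates
    var = sum((x - mean) ** 2 for x in xs) / (replicates - 1)
    c = multinomial_coefficient(a)
    return c * mean, c * sqrt(var / replicates)
```

Because each replicate is unbiased, the average remains unbiased, and the standard error shrinks at the usual Monte Carlo rate of one over the square root of the number of replicates.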
An attractive property of Algorithm 2 as a Markov chain Monte Carlo method is that it is a direct sampler; namely, it generates random variables that are independent and exactly follow the target distribution. In fact, this algorithm can be regarded as a variant of a direct sampling algorithm called coupling from the past (see, e.g., Chapter 25 of [16] for a concise summary). Regard the sample as having been generated from the infinite past, with time running backward; the time at which all particles have been erased is the time from which the sample path can be regarded as having come from the infinite past, because the path does not depend on any events older than that time. We have the following estimate of the number of steps needed to complete the procedure.
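For readers unfamiliar with coupling from the past, the following generic sketch illustrates the principle on an arbitrary finite-state chain. This is the textbook Propp–Wilson construction, not Algorithm 2 itself; the inverse-CDF grand coupling used here is one possible choice of random map.

```python
import random

def cftp_sample(P, rng):
    """Draw one exact sample from the stationary distribution of the
    finite-state chain with transition matrix P, by coupling from the past."""
    K = len(P)

    def step(state, u):
        # Advance `state` using the single uniform `u` (inverse-CDF coupling);
        # sharing u across all states turns the update into a random map.
        acc = 0.0
        for j, p in enumerate(P[state]):
            acc += p
            if u < acc:
                return j
        return K - 1

    us = []  # shared randomness; us[k] drives the step from time -(k+1) to -k
    T = 1
    while True:
        while len(us) < T:
            us.append(rng.random())
        # Run every possible starting state from time -T to 0
        # with the *same* randomness.
        states = list(range(K))
        for k in range(T - 1, -1, -1):
            states = [step(s, us[k]) for s in states]
        if len(set(states)) == 1:  # coalescence: output ignores the start
            return states[0]
        T *= 2  # go further into the past, reusing the old randomness
```

The key point, mirroring the backward-in-time argument above, is that once all trajectories have coalesced, the state at time zero no longer depends on anything older, so it is distributed exactly as a path from the infinite past, i.e., stationarily.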
Proposition 7.
For a sample of size n, the steps needed to complete Algorithm 2 to obtain an unbiased estimator of the expected sample probability (29) are for large .
Proof.
For a state a satisfying , the probability that a particle is erased at the next step is bounded below:
Therefore, the waiting time to erase a particle is stochastically smaller than the waiting time of an event following the geometric distribution of mean . The sum of the waiting times from to is . Steps 4 and 6 demand steps for large . Therefore, the assertion follows. □
We have focused on the diffusion with drift, but moments of the marginal distribution of the diffusion without drift at a given time, i.e., (8), can be computed by an analogue of Algorithm 2. The problem reduces to solving the system of differential equations (9). We omit the discussion, but a similar problem for a complete graph was discussed in [17].
6. Discussion
We have studied diffusions taking values in probability measures on a graph whose masses on each vertex satisfy the stochastic differential equations of the forms (1) and (16), by using their dual Markov chains on integer partitions and on finite integer lattices, respectively. Many problems remain to be solved, especially for (1). First of all, a formal proof of the existence of the semigroup associated with the generator (2) should be established, which demands pathwise uniqueness of the solution of (1). As emphasized in the text, some arguments, especially those after Propositions 3 and 6, are rough and restrictive; they could be improved. Stationary states of the Markov semigroup need further study. A counterpart of Theorem 1.5 of [11] or of Theorem 3 on the regularity of stationary states could be established by a detailed analysis of the diffusion. Further properties of the diffusion, such as absorption probabilities into a stationary state and the associated waiting times, are interesting. Obtaining explicit expressions for them is challenging, but such expressions would be helpful for further understanding the diffusion.
Two applications of the diffusions have been discussed: analysis of an algorithm to find an independent set of a graph, and Bayesian graph selection based on computation of the expected sample probability by using coupling from the past. Further applications and modelling targets may exist.
For the coalescents mentioned in Remark 1, the properties of a “genealogy” of a sample, which is a sample path of the dual Markov chain, are intensively studied because a genealogy itself is used as a stochastic model of DNA sequence variation (see, e.g., [18]). Random graphs such as Figure 1 and Figure 2 are counterparts of such genealogies. Study of the properties of such random graphs would be interesting.
Funding
The author is supported in part by JSPS KAKENHI grant 20K03742 and was supported in part by JST Presto Grant, Japan.
Data Availability Statement
Not applicable.
Acknowledgments
The author thanks Yoshiaki Itoh for introducing their work [1] to him. An earlier version of this work was presented at a workshop on coalescent theory at the Centre de Recherches Mathématiques, University of Montreal, in October 2013.
Conflicts of Interest
The author declares no conflict of interest.
References
- Itoh, Y.; Mallows, C.; Shepp, L. Explicit sufficient invariants for an interacting particle system. J. Appl. Probab. 1998, 35, 633–641.
- Tainaka, K.; Itoh, Y.; Yoshimura, J.; Asami, T. A geographical model of high species diversity. Popul. Ecol. 2006, 48, 113–119.
- Ben-Naim, E.; Krapivsky, P.L.; Redner, S. Bifurcations and patterns in compromise processes. Physica D 2003, 183, 190–204.
- Ethier, S.N.; Kurtz, T.G. Markov Processes: Characterization and Convergence; Wiley: Hoboken, NJ, USA, 1986.
- Luby, M. A simple parallel algorithm for the maximal independent set problem. SIAM J. Comput. 1986, 15, 1036–1053.
- Ethier, S.N. A class of degenerate diffusion processes occurring in population genetics. Comm. Pure Appl. Math. 1976, 29, 483–493.
- Liggett, T.M. Interacting Particle Systems; Springer: Berlin/Heidelberg, Germany, 1985.
- Shiga, T. An interacting system in population genetics. J. Math. Kyoto Univ. 1980, 20, 213–242.
- Shiga, T. An interacting system in population genetics, II. J. Math. Kyoto Univ. 1980, 20, 723–733.
- Mohar, B. The Laplacian Spectrum of Graphs. In Graph Theory, Combinatorics, and Applications; Alavi, Y., Chartrand, G., Oellermann, O.R., Schwenk, A.J., Eds.; Wiley: Hoboken, NJ, USA, 1991; Volume 2, pp. 871–898.
- Shiga, T.; Uchiyama, K. Stationary states and their stability of the stepping stone model involving mutation and selection. Probab. Theory Relat. Fields 1986, 73, 87–117.
- Mano, S. Partitions, Hypergeometric Systems, and Dirichlet Processes in Statistics; Springer: Tokyo, Japan, 2018.
- Karlin, S.; McGregor, J. Addendum to a paper of W. Ewens. Theor. Popul. Biol. 1972, 3, 113–116.
- Kimura, M. Diffusion models in population genetics. J. Appl. Probab. 1964, 1, 177–232.
- Bernardo, J.M.; Smith, A.F.M. Bayesian Theory; Wiley: Chichester, UK, 1994.
- Levin, D.A.; Peres, Y. Markov Chains and Mixing Times, 2nd ed.; American Mathematical Society: Providence, RI, USA, 2017.
- Mano, S. Duality between the two-locus Wright–Fisher diffusion model and the ancestral process with recombination. J. Appl. Probab. 2013, 50, 256–271.
- Durrett, R. Probability Models for DNA Sequence Evolution, 2nd ed.; Springer: New York, NY, USA, 2008.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).