Limit Distributions of Products of Independent and Identically Distributed Random 2 × 2 Stochastic Matrices: A Treatment with the Reciprocal of the Golden Ratio

Chakraborty, Santanu

doi:10.3390/math11244993

Open AccessArticle

Limit Distributions of Products of Independent and Identically Distributed Random 2 × 2 Stochastic Matrices: A Treatment with the Reciprocal of the Golden Ratio

by

Santanu Chakraborty

School of Mathematical and Statistical Sciences, University of Texas Rio Grande Valley, 1201 West University Drive, Edinburg, TX 78539-2999, USA

Mathematics 2023, 11(24), 4993; https://doi.org/10.3390/math11244993

Submission received: 27 September 2023 / Revised: 4 December 2023 / Accepted: 6 December 2023 / Published: 18 December 2023

(This article belongs to the Special Issue Analytical Methods and Convergence in Probability with Applications, 2nd Edition)

Download Review Reports Versions Notes

Abstract

:

Consider a sequence

{(X_{n})}_{n \geq 1}

of i.i.d.

2 \times 2

stochastic matrices with each

X_{n}

distributed as

μ

. This

μ

is described as follows. Let

{(C_{n}, D_{n})}^{T}

denote the first column of

X_{n}

and for a given real r with

0 < r < 1

, let

r^{- 1} C_{n}

and

r^{- 1} D_{n}

each be Bernoulli distributions with parameters

p_{1}

and

p_{2}

, respectively, and

0 < p_{1}, p_{2} < 1

. Clearly, the weak limit of the sequence

μ^{n}

, namely

λ

, is known to exist, whose support is contained in the set of all

2 \times 2

rank one stochastic matrices. In a previous paper, we considered

0 < r \leq \frac{1}{2}

and obtained

λ

explicitly. We showed that

λ

is supported countably on many points, each with positive

λ

-mass. Of course, the case

0 < r \leq \frac{1}{2}

is tractable, but the case

r > \frac{1}{2}

is very challenging. Considering the extreme nontriviality of this case, we stick to a very special such r, namely,

r = \frac{\sqrt{5} - 1}{2}

(the reciprocal of the golden ratio), briefly mention the challenges in this nontrivial case, and completely identify

λ

for a very special situation.

Keywords:

random walk; stochastic matrices; limiting measure; golden ratio

MSC:

60B15

1. Introduction

As the title of the paper suggests, the reader can understand that this paper deals with a situation where one considers products of independent and identically distributed random

2 \times 2

stochastic matrices and their limiting behavior. In other words, here we are considering a probability measure

μ

on a collection of

2 \times 2

stochastic matrices and studying the limiting behavior of the convolution sequence

μ^{n}

. To a reader new to this area, the author would like to refer the reader to the book by Hognas and Mukherjea [1]. This book starts from the very basic concepts, such as the definition of a semigroup, topological semigroups, semigroups of matrices, etc., in chapter 1 and then moves forward to more complex concepts, such as probability measures of semigroups, convolution products of probabilities and convergence, random walks on semigroups, random walks on semigroups of nonnegative matrices (and in particular stochastic matrices), etc. The current author collaborated on a few papers in this area [2,3,4,5,6].

For complete understanding of this article, we will go over a few details about convergence of convolution products of probability measures on semigroups of matrices. If

B

denotes the collection of Borel subsets of a set S, then

P (S)

can be the set of all regular probability measures

μ

on

B

. Then, denoting the collection of continuous functions on S as

C (S)

, for

μ, ν \in P (S)

, and

f \in C (S)

, one defines the following iterated integral:

I (f) = \int \int f (x y) μ (d x) ν (d y)

By the Riesz representation theorem, there exists a unique regular probability measure

λ

such that for any function

f \in C (S)

with compact support, we have

I (f) = \int f d λ

Then,

λ

is called the convolution of the probability measures

μ

and

ν

. There is a proposition in [1] that shows that for

μ, ν \in P (S)

, and

B \in B

,

μ * ν (B) = \int μ (B x^{- 1}) ν (d x) = \int ν (x^{- 1} B) μ (d x)

Having defined the convolution product of regular probability measures on semigroups, one can consider a sequence of regular probability measures

μ_{1}, μ_{2}, μ_{3}, \dots

, construct a sequence of convolution products of these regular probability measures

μ_{1}, μ_{1} ★ μ_{2}, μ_{1} ★ μ_{2} ★ μ_{3}, \dots

, and talk about conditions when such convolution sequences will converge. Then, one can specialize to the independent identically distributed situation where for each i, we have,

μ_{i} = μ

for

i = 1, 2, 3, \dots

. Then, the convolution sequence looks like

μ^{n}

for

n = 1, 2, 3, \dots

. In all these situations, [1] assumes that S is a locally compact, second countable Hausdorff topological semigroup.

Then, if someone further specializes to the situation when S is a semigroup of nonnegative matrices or say, stochatic matrices of a fixed order d, then one considers the usual matrix topology. There have been quite a few papers that study the conditions when the convolution sequence

μ^{n}

converges. Mukherjea [7] first gave conditions when such a sequence converges for i.i.d.

2 \times 2

stochastic matrices. Then, subsequently such conditions for higher order stochastic matrices were obtained [5,6]. But none of these papers performed detailed study on the nature of the corresponding limiting measures. But motivated by a paper by Chamayou and Letac [8], we have investigated the nature of the limiting measure

λ

for a very special

μ

on

2 \times 2

i.i.d. stochastic matrices.

Before proceeding further, let us denote the probability measure on stochastic matrices of a fixed order d by

μ

and its support by

S (μ)

. So,

S (μ)

is a subcollection of stochastic matrices of a fixed order d. Thus, for any convolution product

μ^{n}

, we will denote its support by

S (μ^{n})

and the support of the limiting measure

λ

(if it exists) by

S (λ)

.

If we denote the closure of an arbirary set E by

\bar{E}

, then

S (μ^{n}) = \bar{\{A_{1} A_{2} \dots A_{n} | for each i, A_{i} \in S (μ), 1 \leq i \leq n\}}

where n is a positive integer and

S = \bar{\cup_{n = 1}^{\infty} S (μ^{n})}

Also, denote

P

to be the set of

d \times d

strictly positive stochastic matrices in

S

.

Chamayou and Letac [8] proved that if

{(X_{n})}_{n \geq 1}

is a sequence of

d \times d

i.i.d. stochastic matrices such that

P ({min}_{i, j} {(X_{1})}_{i j} = 0) < 1

, then

Y = {lim}_{n \to \infty} X_{n} X_{n - 1} \dots X_{1}

exists almost surely and

P (Y has rank 1) = 1

; furthermore, if for any Borel B of

d \times d

stochastic matrices (with usual

R^{d^{2}}

-topology), we denote

μ (B) = P (X_{1} \in B)

and

λ (B) = P (Y \in B)

, and then

λ

is the unique solution of the convolution equation

λ ★ μ = λ

.

Then, in [2], we noted that this wonderful result of Chamayou and Letac also holds under the (slightly weaker) condition that

μ^{m} (P) > 0

for some positive integer m (as opposed to just 1, instead of m, taken in [8]) where

μ^{m}

is the distribution of the product

X_{m} \dots X_{1}

and

P

is the set of

d \times d

strictly positive stochastic matrices. The reason is as follows: the Chamayou and Letac result shows that under the weaker condition, the subsequence

Y_{n m} = X_{n m} X_{n m - 1} \dots X_{1}

converges almost surely to some

d \times d

rank one stochastic matrix,

Y_{0}

, and consequently, any subsequence

X_{n_{k}} X_{n_{k} - 1} \dots X_{1}

with

n_{k} > s_{k} m

(for some

s_{k}

) will also converge almost surely to a

d \times d

stochastc matrix

V Y_{0} (= Y_{0}

, as

Y_{0}

has rank one), where V is a limit point of the product subsequence

X_{n_{k}} X_{n_{k} - 1} \dots X_{s_{k} m + 1}

. This establishes our observation.

Next we mention below some situations when

S (λ)

consists of all rank one matrices:

Situation 1: If

{(X_{i})}_{i \geq 1}

, as before, is i.i.d.

d \times d

stochastic matrices such that for some positive integer

m \geq 1

,

μ^{m} (P) > 0

(1)

then the sequence

μ^{n}

, where

μ (B) = P (X_{1} \in B)

for Borel sets B of

d \times d

stochastic matrices, converges weakly to a probability measure

λ

and

S (λ)

consists of all rank one stochastic matrices in

S

such that

λ (P) > 0

.

Situation 2: When

λ

is the weak limit of

{(μ^{n})}_{n \geq 1}

and

S

contains a rank one matrix, then the support of

λ

,

S (λ)

consists of all rank one stochastic matrices in

S

. This is an algebraic fact for the support of an idempotent probability measure (note that

λ = λ ★ λ

; see [1]).

In the same paper, Chamayou and Letac (see also [9]) tried to identify

λ

in the case when the rows of

X_{1}

above are independent, and for

1 \leq i \leq d

, the i-th row of

X_{1}

has Dirichlet distribution with positive parameters

α_{i 1}, α_{i 2}, \dots, α_{i d}

, and they were successful in the case when

\sum_{j = 1}^{d} α_{i j} = \sum_{j = 1}^{d} α_{j i}, 1 \leq i \leq d

. Indeed, there are only very few (other than those given in [8,9,10]) examples in the literature even for

2 \times 2

stochastic matrices when the limit distribution

λ

has been identified completely in the above context. Our paper [2] is an example.

In [2], we considered

2 \times 2

i.i.d. stochastic matrices

{(X_{n})}_{n \geq 1}

with

X_{n} = (\begin{matrix} C_{n} & 1 - C_{n} \\ D_{n} & 1 - D_{n} \end{matrix})

, each

X_{n}

is distributed as

μ

and

r^{- 1} C_{n}

and

r^{- 1} D_{n}

are each Bernoulli distributions (with possibly different parameters

p_{1} and p_{2}, 0 < p_{1}, p_{2} < 1

) for a real r satisfying

0 < r \leq 1

. Our goal was to identify

λ

, the distribution of

\lim_{n \to \infty} X_{n} X_{n - 1} \dots X_{1}

. Clearly, there are exactly four matrices in the support of

μ

, each with positive mass. It is well known that that

μ^{n}

converges weakly to a limiting measure

λ

and the support of

λ

consists of rank one matrices. In particular, if r equals 1, the support of

λ

has exactly two matrices, namely,

(\begin{matrix} 0 & 1 \\ 0 & 1 \end{matrix})

and

(\begin{matrix} 1 & 0 \\ 1 & 0 \end{matrix})

. In [2], a complete solution is given to the problem for

0 < r \leq \frac{1}{2}

and also for

r = 1

.

The situation

\frac{1}{2} < r < 1

is much more challenging. Before explaining where the challenge lies, let us make the following convention:

From now on, we will often denote the matrix

(\begin{matrix} x & 1 - x \\ x & 1 - x \end{matrix})

by simply x when there is no fear of confusion. Thus, for the limiting measure

λ

,

λ (x)

will mean

λ (\begin{matrix} x & 1 - x \\ x & 1 - x \end{matrix})

and if we write that the support of

λ

,

S (λ)

is contained in

[0, 1]

, then this means the following:

S (λ) \subset \{(\begin{matrix} x & 1 - x \\ x & 1 - x \end{matrix}) : 0 \leq x \leq 1\}

Now, we are going to explain why the case

\frac{1}{2} < r < 1

is more challenging. Although we find it quite easy to observe that

λ (0)

and

λ (r)

have the same expressions as in the previous case, it is indeed hard to exhibit a point in

(0, r)

with positive

λ

-mass.

However, there is a special situation when things are more tractable, namely,

r = \frac{\sqrt{5} - 1}{2}

(the reciprocal of the golden ratio). We denote this special r as

r_{g}

. Notice that

r_{g}

satisfies the equation

r_{g}^{2} + r_{g} - 1 = 0

. Using this equation extensively, we completely solve for

λ

in this particular situation. It can be seen that although this is just one case, the proof is highly nontrivial. According to the author, the reason why

r_{g}

works for us is because of the fact that

λ (1 - r_{g})

could be found out easily and so this technique of proof worked.

It may be mentioned here that there have been numerous studies in the literature involving the golden ratio. One very recent study invloving golden ratio is in the context of machine learning [11].

As in the case of

0 < r \leq \frac{1}{2}

, here also

λ

is discrete with masses at countably many points. Our main theorem appears in Section 4.

One gets a feeling that for any other r satisfying

\frac{1}{2} < r < 1

, finding the value of

λ (1 - r)

itself will be a challenge, making it quite nontrivial. Thus, for a general

\frac{1}{2} < r < 1

, a different technique of proof might be needed to obtain a complete solution.

In the next section (Section 2), we describe our set up, state the results proved in [2] for

0 < r \leq \frac{1}{2}

, and briefly discuss the more challenging situation

\frac{1}{2} < r < 1

. In Section 3, we focus on

r = r_{g} = \frac{\sqrt{5} - 1}{2}

(reciprocal of the golden ratio) and prove two important propostions. We prove our main theorem and a series of lemmas leading to it in Section 4. We have some concluding remarks and comments in Section 5.

2. Preliminaries

In our case, we are considering the case of a probability measure

μ

on

2 \times 2

stochastic matrices.

S (μ)

denotes its support, which is a subcollection of

2 \times 2

stochastic matrices.

S (μ^{n})

denotes the support of

μ^{n}

where

μ^{n}

is the convolution sequence. As pointed out in [7],

μ^{n}

converges if and only if

S (μ)

is not a singleton:

S (μ) \neq \{(\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix})\}

And in case there is a strictly positive matrix in

S (μ)

, then the support

S (λ)

of the limiting measure

λ

consists of rank one matrices. Our special case satisfies that condition:

We consider

2 \times 2

i.i.d. stochastic matrices

{(X_{n})}_{n \geq 1}

with

X_{n} = (\begin{matrix} C_{n} & 1 - C_{n} \\ D_{n} & 1 - D_{n} \end{matrix})

, such that each

X_{n}

is distributed as

μ

. Also, assume that for a given r with

0 < r \leq 1

, both

r^{- 1} C_{n}

and

r^{- 1} D_{n}

are Bernoulli distributions with parameters

p_{1}

and

p_{2}

respectively.

Then, the support of

μ

has exactly four matrices as

S (μ)

is given by:

S (μ) = \{(\begin{matrix} 0 & 1 \\ 0 & 1 \end{matrix}), (\begin{matrix} 0 & 1 \\ r & 1 - r \end{matrix}), (\begin{matrix} r & 1 - r \\ 0 & 1 \end{matrix}), (\begin{matrix} r & 1 - r \\ r & 1 - r \end{matrix})\}

Let the

μ

-masses at these points be denoted by

p_{00}, p_{01}, p_{10}, p_{11}

respectively so that

p_{00} + p_{01} = 1 - p_{1}

,

p_{00} + p_{10} = 1 - p_{2}

,

p_{10} + p_{11} = p_{1}

and

p_{01} + p_{11} = p_{2}

.

Let

λ

be the distribution of

{lim}_{n \to \infty} X_{n} X_{n - 1} \dots X_{1}

.

In case r equals 1, one can easily observe that

λ

is a Bernoulli distribution with parameters entirely dependent on the probability mass function of

μ

, namely,

λ (0) = \frac{p_{00} + p_{01}}{1 - p_{10} + p_{01}}

This follows by solving for

λ (0)

and

λ (1)

in the convolution equation

λ ★ μ = λ

.

For

0 < r < 1

, the support of

μ^{n}

,

S (μ^{n})

and consequently

S

is contained in the set

\{(\begin{matrix} x & 1 - x \\ y & 1 - y \end{matrix}) : 0 \leq x \leq r, 0 \leq y \leq r\}

This can be proved using induction on n. One assumes up to some positive integer l and proves for

l + 1

by noticing that when one multiplies a matrix in

S (μ^{l})

by a matrix in

S (μ)

, the entiries in the product matrix satisfies the condition that each entry in the first column is between 0 and r because each entry in the first column of the matrices from

S (μ^{l})

and

S (μ)

is so.

Also, since the relation

λ ★ μ = λ

holds, the support of

λ

, namely,

S (λ)

consists of all rank one matrices in

S

. As a result,

S (λ) \subset \{x : 0 \leq x \leq r\}

, where x stands for

(\begin{matrix} x & 1 - x \\ x & 1 - x \end{matrix})

. Moreover, exploiting the identity

λ ★ μ = λ

, we have

λ (0) = \frac{p_{00}}{1 - p_{10}}, λ (r) = p_{11} + λ (0) p_{01} = \frac{p_{11} (1 - p_{10}) + p_{00} p_{01}}{1 - p_{10}}

and for other points x with

0 < x < r

with positive

λ

-masses, we have

λ (x) = λ (r^{- 1} x) p_{10} + λ (1 - r^{- 1} x) p_{01}

(2)

Next, we state the results proved in [2] for

0 < r \leq \frac{1}{2}

:

2.1. Case: $0 < r \leq \frac{1}{2}$

First of all, we introduce some notations. For each

i \geq 1

, define

A_{i} = \{\sum_{j = 1}^{k} {(- 1)}^{j - 1} r^{i_{j}} : 1 \leq i_{1} < i_{2} < i_{3} < \dots < i_{k} = i, k \leq i\}, A = \cup_{i = 1}^{\infty} A_{i}

We have two propositions for taking care of the cases

0 < r < \frac{1}{2}

and

r = \frac{1}{2}

:

Proposition 1.

For

0 < r < \frac{1}{2}

, we have the following:

(i): For every positive integer $i \geq 1$ , $| A_{i} | = 2^{i - 1}$ and each point in $A_{i}$ has positive λ-mass. These are the only points of degree i in the support of λ with positive λ-mass.
(ii): Each such point has λ-measure equal to $λ (r) p_{10}^{i - 1 - k} p_{01}^{k}$ . For every $i > 1$ , $λ (A_{i}) = λ (r) {[p_{10} + p_{01}]}^{i - 1}$ .
(iii): $λ (0) + \sum_{i = 1}^{\infty} λ (A_{i}) = λ (0) + λ (r) \cdot [\sum_{i = 1}^{\infty} {(p_{10} + p_{01})}^{i - 1}] = 1$ .

Proposition 2.

For

r = \frac{1}{2}

, we have the following:

(i): The only points that have positive λ-masses are the dyadic rationals in $[0, \frac{1}{2}]$ . Thus, for every i, there are exactly $2^{i - 2}$ dyadic rationals of the form $\frac{k}{2^{i}}$ with $k \leq 2^{i - 1}$ and k odd with positive λ-mass. $A_{i}$ consists of exactly these points. Also, $| A_{i} | = 2^{i - 2}$ .
(ii): A typical point in $A_{i}$ has λ-measure equal to $λ (\frac{1}{2}) (p_{10} + p_{01}) p_{10}^{i - 1 - k} p_{01}^{k - 1}$ for some positive integer k. For every $i > 1$ , $λ (A_{i}) = λ (\frac{1}{2}) {[p_{10} + p_{01}]}^{i - 1}$ .
(iii): The sum of the λ-masses of all dyadic rationals in $[0, \frac{1}{2}]$ along with the λ-mass at zero equals 1. Equivalently, $λ (0) + \sum_{i = 1}^{\infty} λ (A_{i}) = λ (0) + λ (\frac{1}{2}) \cdot [\sum_{i = 1}^{\infty} {(p_{10} + p_{01})}^{i - 1}] = 1$

The case

\frac{1}{2} < r < 1

turns out to be quite nontrivial. We briefly introduce that case below:

2.2. Case: $\frac{1}{2} < r < 1$

The case

\frac{1}{2} < r < 1

is distinctly different from the case

r < \frac{1}{2}

because now we have

1 - r < r

. Since for each r,

λ

has masses at 0 and r, it is not absolutely continuous for any r. Now, suppose we continue with the same notation of A introduced in the case

0 < r \leq \frac{1}{2}

. Thus,

A = \cup_{i = 1}^{\infty} A_{i}

where, for every positive integer i,

A_{i} = \{\sum_{j = 1}^{k} {(- 1)}^{j - 1} r^{i_{j}} : 1 \leq i_{1} < i_{2} < i_{3} < \dots < i_{k} = i, k \leq i\}

It then easily follows that each of these points in A also has positive mass even in the case

\frac{1}{2} < r < 1

. However, it is indeed a challenge to calculate

λ

-masses at these points.

Also, since

1 - r \in (0, r)

, it is natural to have points of the form

1 + \sum_{j = 1}^{k} {(- 1)}^{j} r^{i_{j}},

1 \leq i_{1} < i_{2} < i_{3} < \dots < i_{k} = i, k \leq i

for any positive integer i in the interval

(0, r)

(to see this, notice that

r^{i_{1}} > \sum_{j = 2}^{k} {(- 1)}^{j} r^{i_{j}}

). Accordingly, define

A^{*} = \cup_{i = 1}^{\infty} A_{i}^{*}

, where

A_{i}^{*} = \{1 + \sum_{j = 1}^{k} {(- 1)}^{j} r^{i_{j}} : 1 \leq i_{1} < i_{2} < i_{3} < \dots < i_{k} = i, k \leq i\}

Recall that, for

0 < r \leq \frac{1}{2}

, each point in A has positive

λ

-mass and each point in

A^{*}

is outside

(0, r)

and has zero

λ

-mass.

For

\frac{1}{2} < r < 1

, of course, each polynomial in A is in

(0, r)

. But, although some polynomials in

A^{*}

are numerically less than r, it is not easy to see which of these points have positive

λ

-masses. Clearly, some polynomials in

A_{i}^{*}

are outside

(0, r)

and have zero

λ

-measure if i is large enough. For example, for a fixed r, it is possible to get a positive integer

m > 1

such that

1 - r^{m} \geq r > 1 - r^{m - 1}

. Next, consider

i_{1} = l

with

l \geq m

for a polynomial

1 + \sum_{1}^{k} {(- 1)}^{j} r^{i_{j}}

in

A_{i}^{*}

with

1 \leq i_{1} < i_{2} < i_{3} < \dots < i_{k} = i, k \leq i

. Then, this polynomial is greater than or equal to

1 - r^{m} + \sum_{2}^{k} {(- 1)}^{j} r^{i_{j}}

, which is obviously greater than r and has

λ

-measure zero. But, it is a possibility that some points in

A^{*}

could have positive

λ

-masses too.

Recall the very special r,

r = r_{g}

, the reciprocal of the golden ratio. We know

r_{g}

satisfies the equation

r_{g}^{2} + r_{g} - 1 = 0

and

1 - r_{g}

actually equals

r_{g}^{2}

, whose

λ

-measure can be found out easily. The next two sections deal with this special case.

3. $r = r_{g}$ : Main Results

In this section and also in the next section, we deal with

r = r_{g} = \frac{\sqrt{5} - 1}{2}

unless stated otherwise. This is a very special case of

\frac{1}{2} < r < 1

. Note that

r_{g}

is the reciprocal of the golden ratio and is the positive solution of the equation

r^{2} + r - 1 = 0

. To avoid dealing with too many radical signs and complicating matters, we will continue to use

r_{g}

in these two sections for this particluar choice of r.

Remark 1.

A polynomial

1 + \sum_{1}^{k} {(- 1)}^{j} r_{g}^{i_{j}}

in

A_{i}^{*}

with

1 \leq i_{1} < i_{2} < i_{3} < \dots < i_{k} = i, k \leq i

and

i_{1} \geq 2

has zero λ-measure.

This is because,

1 - r_{g}^{2} = r_{g}

implies that such a polynomial is greater than

r_{g}

in magnitude. However, for

i_{1} = 1

, such a polynomial may have positive

λ

-measure as well.

In order to notice this, first observe that,

λ (1 - r_{g}) > 0

. This is because, using (2), we have

λ (1 - r_{g}) = λ (r_{g}^{2}) = λ (r_{g}) p_{10} + λ (1 - r_{g}) p_{01}

implying that

λ (1 - r_{g}) = \frac{p_{10}}{1 - p_{01}} λ (r_{g})

where

λ (r_{g})

is already known.

Next, consider a nontrivial example, say, the polynomial

1 - r_{g} + r_{g}^{2} - r_{g}^{3}

. Using (2) repeatedly and Remark 2, we find that its

λ

- measure equals

λ (r_{g}^{2}) p_{10}^{2} p_{01} + λ (1 - r_{g}^{2}) p_{10} p_{01}^{2} = λ (1 - r_{g}) p_{10}^{2} p_{01} + λ (r_{g}) p_{10} p_{01}^{2}

implying that the polynomial under consideration has non-zero

λ

-measure. Since we know

λ (r_{g})

and

λ (1 - r_{g})

, it is possible to find out

λ (1 - r_{g} + r_{g}^{2} - r_{g}^{3})

explicitly.

But, this is only a particular example. Can we make a general observation? Yes. Look at the following result.

Proposition 3.

Any polynomial in

A^{*}

either has λ-measure 0 or can be written as a polynomial in A.

Proof.

To fix ideas, we assume that our polynomial in

A^{*}

is

1 + \sum_{j = 1}^{k} {(- 1)}^{j} r_{g}^{i_{j}}

with

1 \leq i_{1} < i_{2} < i_{3} < \dots < i_{k} = i

and

k \leq i

. Because of Remark 3.0, we can assume that

i_{1} = 1

. Then, we consider the following cases:

Case 1:

i_{j} = j

for

j = 2, 3, \dots, k

.

Then, the given polynomial equals

1 - r_{g} + r_{g}^{2} - \dots + {(- 1)}^{k} r_{g}^{k}

Subcase 1: k is even, say,

k = 2 m

. Then, the above polynomial equals

1 - r_{g} + r_{g}^{2} - \dots + r_{g}^{2 m}

. Notice that

r_{g}^{j} - r_{g}^{j + 1} = r_{g}^{j + 1} - r_{g}^{j + 3}

for

j = 0, 1, 2, \dots

. Thus, the given polynomial equals

r_{g} - r_{g}^{3} + r_{g}^{3} - r_{g}^{5} + \dots + r_{g}^{2 m - 1} - r_{g}^{2 m + 1} + r_{g}^{2 m}

which equals

r_{g} - r_{g}^{2 m + 1} + r_{g}^{2 m} > r_{g}

. So, it has

λ

-measure 0.

Subcase 2: k is odd,

k = 2 m + 1

. Then, the above polynomial equals

1 - r_{g} + r_{g}^{2} - \dots + r_{g}^{2 m} - r_{g}^{2 m + 1}

. Once again recall that

r_{g}^{j} - r_{g}^{j + 1} = r_{g}^{j + 1} - r_{g}^{j + 3}

for

j = 0, 1, 2, \dots

. So, the given polynoimal equals

r_{g} - r_{g}^{3} + r_{g}^{3} - r_{g}^{5} + \dots + r_{g}^{2 m - 1} - r_{g}^{2 m + 1} + r_{g}^{2 m + 1} - r_{g}^{2 m + 3} = r_{g} - r_{g}^{2 m + 3}

. And it is a polynomial in A.

Case 2: There exists an l such that

i_{l} > l

and

i_{j} = j

for

j < l

. Then, the given polynomial equals

1 - r_{g} + r_{g}^{2} - \dots + {(- 1)}^{l - 1} r_{g}^{l - 1} + \sum_{j = l}^{k} {(- 1)}^{j} r_{g}^{i_{j}}

.

Subcase 1: l is even, say,

l = 2 m

. Then, the polynomial equals

1 - r_{g} + r_{g}^{2} - \dots - r_{g}^{2 m - 1} + \sum_{j = 2 m}^{k} {(- 1)}^{j} r_{g}^{i_{j}}

. Again, we use

r_{g}^{j} - r_{g}^{j + 1} = r_{g}^{j + 1} - r_{g}^{j + 3}

for

j = 0, 1, 2, \dots

so that the given polynomial equals

r_{g} - r_{g}^{3} + r_{g}^{3} - r_{g}^{5} + \dots + r_{g}^{2 m - 1} - r_{g}^{2 m + 1} + \sum_{j = 2 m}^{k} {(- 1)}^{j} r_{g}^{i_{j}}

. If

i_{2 m} = 2 m + 1

, then this polynomial equals

r_{g} - r_{g}^{3} + r_{g}^{3} - r_{g}^{5} + \dots + r_{g}^{2 m - 1} - r_{g}^{2 m + 1} + r_{g}^{2 m + 1} + \sum_{j = 2 m + 1}^{k} {(- 1)}^{j} r_{g}^{i_{j}}

, which equals

r_{g} + \sum_{j = 2 m + 1}^{k} {(- 1)}^{j} r_{g}^{i_{j}}

. This is, of course, a polynomial in A. On the other hand, if

i_{2 m} > 2 m + 1

, then the above polynomial equals

r_{g} - r_{g}^{2 m + 1} + \sum_{j = 2 m}^{k} {(- 1)}^{j} r_{g}^{i_{j}}

. Once again, it is a polynomial in A.

Subcase 2: l is odd, say,

l = 2 m + 1

. Then, the given polynomial equals

1 - r_{g} + r_{g}^{2} - \dots - r_{g}^{2 m - 1} + r_{g}^{2 m} + \sum_{j = 2 m + 1}^{k} {(- 1)}^{j} r_{g}^{i_{j}}

. Applying once again

r_{g}^{j} - r_{g}^{j + 1} = r_{g}^{j + 1} - r_{g}^{j + 3}

for

j = 0, 1, 2, \dots

, this polynomial equals

r_{g} - r_{g}^{3} + r_{g}^{3} - r_{g}^{5} + \dots + r_{g}^{2 m - 1} - r_{g}^{2 m + 1} + r_{g}^{2 m} + \sum_{j = 2 m + 1}^{k} {(- 1)}^{j} r_{g}^{i_{j}}

. This simplifies to

r_{g} - r_{g}^{2 m + 1} + r_{g}^{2 m} + \sum_{j = 2 m + 1}^{k} {(- 1)}^{j} r_{g}^{i_{j}} = r_{g} + r_{g}^{2 m + 2} + \sum_{j = 2 m + 1}^{k} {(- 1)}^{j} r_{g}^{i_{j}}

. If

i_{2 m + 1} = 2 m + 2

, then the above equals

r_{g} + \sum_{j = 2 m + 2}^{k} {(- 1)}^{j} r_{g}^{i_{j}} > r_{g}

. So, it has

λ

-measure zero. If

i_{2 m + 1} > 2 m + 2

, then the given polynomial equals

r_{g} + r_{g}^{2 m + 2} + \sum_{j = 2 m + 1}^{k} {(- 1)}^{j} r_{g}^{i_{j}}

which is same as

r_{g} + r_{g}^{2 m + 2} - r_{g}^{i_{2 m + 1}} + \sum_{j = 2 m + 2}^{k} {(- 1)}^{j} r_{g}^{i_{j}} > r_{g}

. So, it has

λ

-measure equal to zero. □

Remark 2.

Because of the above proposition, it is good enough to consider only polynomials in A. We will rather consider the same polynomials as in the case

0 < r < \frac{1}{2}

and will try to work out their λ-measures.

We have seen in Section 2 that the number of elements in

A_{n}

equals

2^{n - 1}

. But, because of the relationship

1 - r_{g} = r_{g}^{2}

in the current situation, there will be redundancy and all polynomials are not distinct. So, we will see that we need to consider at most

2^{n - 2}

elements from

A_{n}

for each

n \geq 3

:

Proposition 4.

There are at most

2^{n - 2}

distinct elements in

A_{n}

for each

n \geq 3

.

Proof.

Once again, the identity

1 - r_{g} = r_{g}^{2}

has a big role to play. For

n = 1, 2, 3

or 4, it is trivial to observe. For general n, first notice that

r_{g}^{n} = r_{g}^{n - 2} - r_{g}^{n - 1}

, and so

r_{g}^{n}

can be considered to be in

A_{n - 1}

. More generally, define

Q_{n} = \{r_{g}^{n}\} \cup \{\sum_{j = 1}^{k} {(- 1)}^{j - 1} r_{g}^{i_{j}} : 1 \leq i_{1} < \dots < i_{k - 1} < i_{k}; i_{k - 1} < n, i_{k} = n; k < n; n - i_{k - 1} \geq 2\}

and

R_{n} = \{\sum_{j = 1}^{k} {(- 1)}^{j - 1} r_{g}^{i_{j}} : 1 \leq i_{1} < \dots < i_{k - 1} < i_{k}; i_{k - 1} = n - 1, i_{k} = n, k \leq n\}

Let

Q = \cup_{n = 3}^{\infty} Q_{n}

and

R = \cup_{n = 3}^{\infty} R_{n}

. Then, observe that each polynomial in Q is numerically equal to a polynomial in R of less degree.

We see this as follows:

Consider an

n > 2

. Take a polynomial in

Q_{n}

. If it is

r_{g}^{n}

, we have already provided the argument, that is,

r_{g}^{n} = r_{g}^{n - 2} - r_{g}^{n - 1} \in R_{n - 1}

. Otherwise, consider a typical element from

Q_{n}

, say,

r_{g}^{i_{1}} - r_{g}^{i_{2}} + \dots + {(- 1)}^{k - 1} r_{g}^{i_{k - 1}} + {(- 1)}^{k} r_{g}^{n}

with

1 \leq i_{1} < i_{2} < \dots < n

and for some

k < n

. If

n - i_{k - 1} = 2

, then

r_{g}^{i_{k - 1}} - r_{g}^{n} = r_{g}^{n - 2} - r_{g}^{n} = r_{g}^{n - 1} = r_{g}^{n - 3} - r_{g}^{n - 2}

. As a result, the given polynomial equals

r_{g}^{i_{1}} - r_{g}^{i_{2}} + \dots + + {(- 1)}^{k - 1} r_{g}^{n - 3} + {(- 1)}^{k} r_{g}^{n - 2}

. So, it is a polynomial in R of less degree (

n - 2

). On the other hand, if

n - i_{k - 1} > 2

, then

r_{g}^{i_{k - 1}} - r_{g}^{n} = r_{g}^{i_{k - 1}} - r_{g}^{n - 2} + r_{g}^{n - 1}

. So, the given polynomial equals

r_{g}^{i_{1}} - r_{g}^{i_{2}} + \dots + {(- 1)}^{k - 1} r_{g}^{i_{k - 1}} + {(- 1)}^{k} r_{g}^{n - 2} + {(- 1)}^{k + 1} r_{g}^{n - 1}

. Once again, this is a polynomial in R of less degree (

n - 1

).

It is clear that for each n,

A_{n} = Q_{n} \cup R_{n}

and hence

A = Q \cup R

. So, because of this observation, the only polynomials in A that can be considered for

λ

-mass calculation are the ones in R. Also, it follows that for

n \geq 3

,

R_{n}

has at most

2^{n - 2}

distinct polynomials. Consequently,

A_{n}

also has at the most

2^{n - 2}

distinct elements and the proposition follows. □

Remark 3.

Thus, for each n, we have fewer polynomials of degree n compared to the situation

0 < r < \frac{1}{2}

.

Now, it is time we prove our main theorem. We prove it in the next section.

4. $r = r_{g}$ : Proof of the Main Theorem

Here is our main theorem:

Theorem 1.

Consider

r = r_{g} = \frac{\sqrt{5} - 1}{2}

. Then

λ (0) + λ (r_{g}) + λ (r_{g}^{2}) + λ (r_{g} - r_{g}^{2}) + λ (R) = 1

where

R = \cup_{n = 3}^{\infty} R_{n}

with

R_{n} = \{\sum_{j = 1}^{k} {(- 1)}^{j - 1} r_{g}^{i_{j}} : 1 \leq i_{1} < i_{2} < \dots < i_{k - 2} < i_{k - 1} < i_{k}; i_{k - 1} = n - 1, i_{k} = n, k \leq n\}

First, notice that, using (2), it follows that

λ (r_{g}^{2}) = \frac{p_{10}}{1 - p_{01}} λ (r_{g})

and

λ (r_{g} - r_{g}^{2}) = (\frac{p_{10}^{2}}{1 - p_{01}} + p_{01}) λ (r_{g})

. Thus, in order to prove the theorem, it is enough to prove:

λ (R) = \frac{p_{10} + p_{01}}{1 - p_{10} - p_{01}} λ (r_{g} - r_{g}^{2})

(3)

because then,

λ (r_{g} - r_{g}^{2}) + λ (R) = (1 + \frac{p_{10} + p_{01}}{1 - p_{10} - p_{01}}) λ (r_{g} - r_{g}^{2})

As a result,

λ (0) + λ (r_{g}) + λ (r_{g}^{2}) + λ (r_{g} - r_{g}^{2}) + λ (R) = λ (0) + \frac{1}{1 - p_{10} - p_{01}} λ (r_{g})

But, recall from Section 3:

λ (0) = \frac{p_{00}}{1 - p_{10}}, λ (r_{g}) = p_{11} + λ (0) p_{01} = \frac{p_{11} (1 - p_{10}) + p_{00} p_{01}}{1 - p_{10}}

This implies that

λ (0) + \frac{1}{1 - p_{10} - p_{01}} λ (r_{g}) = 1

This is the reason that it is good enough to prove (3). For this, we proceed as follows.

First of all, notice that

R_{3} = {r_{g}^{2} - r_{g}^{3}, r_{g} - r_{g}^{2} + r_{g}^{3}}

,

R_{4} = {r_{g}^{3} - r_{g}^{4}, r_{g}^{2} - r_{g}^{3} + r_{g}^{4}, r_{g} - r_{g}^{3} + r_{g}^{4}, r_{g} - r_{g}^{2} + r_{g}^{3} - r_{g}^{4}}

etc. and in general

R_{n} = {r_{g}^{n - 1} - r_{g}^{n}, r_{g}^{n - 2} - r_{g}^{n - 1} + r_{g}^{n}, \dots, r_{g} - r_{g}^{n - 1} + r_{g}^{n}, \dots, r_{g} - r_{g}^{2} + \dots + r_{g}^{n - 1} - r_{g}^{n}}

Next, we introduce some notations for any

0 < r < 1

.

Define

g : R \to R

and

f_{j} : R \to R

for every positive integer j as follows:

g (p) = r p

and

f_{j} (p) = r^{j} - p

. Thus,

R_{3} = {f_{2} (r^{3}), f_{1} f_{2} (r^{3})}

,

R_{4} = {f_{3} (r^{4}), f_{2} f_{3} (r^{4}), f_{1} f_{3} (r^{4}), f_{1} f_{2} f_{3} (r^{4})}

etc., and in general,

R_{n} = {f_{n - 1} (r^{n}), f_{n - 2} f_{n - 1} (r^{n}), \dots, f_{1} f_{n - 1} (r^{n}), \dots, f_{1} f_{2} \dots f_{n - 1} (r^{n})}

We further define operators

F_{j}

for

j \geq 2

on R as follows:

F_{2} = {f_{2}, f_{1} f_{2}}

,

F_{3} = {f_{3}, f_{2} f_{3}, f_{1} f_{3}, f_{1} f_{2} f_{3}}

etc., and in general,

F_{n - 1} = {f_{n - 1}, f_{n - 2} f_{n - 1}, \dots, f_{1} f_{n - 1}, \dots, f_{1} f_{2} \dots f_{n - 1}}

Thus,

R_{j} = F_{j - 1} (r^{j})

for

j = 3, 4, \dots

and

F_{j - 1} (p) = {f_{j - 1} (p), f_{j - 2} f_{j - 1} (p), \dots, f_{1} f_{j - 1} (p), \dots, f_{1} f_{2} \dots f_{j - 1} (p)}

In general, one would anticipate

| F_{2} (p) | = 2, | F_{3} (p) | = 4, \dots, | F_{j - 1} (p) | = 2^{j - 2}

. But, for

r = r_{g}

, equality is replaced by ≤ for some ps.

Now, in order to prove (3), we will use a series of Lemmas 1–5. Lemma 1 identifies that connsecutive

R_{i}

s have nonempty overlaps for

i \geq 3

, Lemma 2 evaluates the cardinality of the consecutive overlaps, Lemma 3 evaluates the cardinality of the consecutive differences, Lemma 4 calculates the

λ

- measures of these differences, and, finally, Lemma 5 puts them together to evaluate the

λ

-measure of R thereby proving (3). Thus, once Lemmas 1–5 are proved, (3) is proved and the proof of the theorem is complete.

Lemma 1.

Consecutive

R_{i}

s (

R_{i}

and

R_{i + 1}

) have nonempty intersections for

i \geq 3

. In fact,

R_{4} \cap R_{3} = \emptyset

but

R_{j + 1} \cap R_{j} \neq \emptyset

for

j > 3

Proof.

It is trivial to observe that

R_{4} \cap R_{3} = ϕ

. Now, notice that

r_{g}^{2} - r_{g}^{4} + r_{g}^{5}, r_{g} - r_{g}^{2} + r_{g}^{4} - r_{g}^{5} \in R_{5} \cap R_{4}

because

r_{g}^{2} - r_{g}^{4} + r_{g}^{5} = r_{g} - r_{g}^{2} + r_{g}^{5} = r_{g} - r_{g}^{2} + r_{g}^{3} - r_{g}^{4} \in R_{4}

and automatically,

r_{g} - r_{g}^{2} + r_{g}^{4} - r_{g}^{5} = r_{g} - (r_{g} - r_{g}^{2} + r_{g}^{3} - r_{g}^{4}) = r_{g}^{2} - r_{g}^{3} + r_{g}^{4} \in R_{4}

. Thus,

R_{5} \cap R_{4} = F_{2} (r_{g}^{4} - r_{g}^{5})

and

| R_{5} \cap R_{4} | = 2

. In general,

R_{j + 1} \cap R_{j} \supseteq F_{j - 2} (r_{g}^{j} - r_{g}^{j + 1}) \cup F_{j - 4} (r_{g}^{j} - r_{g}^{j + 1})

for

j \geq 6

. In fact, we can show that for positive integers

k \geq 3

R_{2 k - 1} \cap R_{2 k - 2} = F_{2 k - 4} (r_{g}^{2 k - 2} - r_{g}^{2 k - 1}) \cup F_{2 k - 6} (r_{g}^{2 k - 2} - r_{g}^{2 k - 1}) \cup \dots \cup F_{2} (r_{g}^{2 k - 2} - r_{g}^{2 k - 1})

R_{2 k} \cap R_{2 k - 1} = F_{2 k - 3} (r_{g}^{2 k - 1} - r_{g}^{2 k}) \cup F_{2 k - 5} (r_{g}^{2 k - 1} - r_{g}^{2 k}) \cup \dots \cup F_{3} (r_{g}^{2 k - 1} - r_{g}^{2 k})

So, Lemma 1 is proved. □

Lemma 2.

For

i \geq 4

,

| R_{i} \cap R_{i + 1} |

s are evaluated upper bounds for

| R_{i + 1} - R_{i} |

are determined as follows:

For

k \geq 3

, we have,

| R_{2 k - 1} \cap R_{2 k - 2} | = \frac{2}{3} (2^{2 k - 4} - 1), | R_{2 k} \cap R_{2 k - 1} | = \frac{4}{3} (2^{2 k - 4} - 1)

so that

| R_{2 k - 1} - R_{2 k - 2} | \leq \frac{2^{2 k - 2} + 2}{3}, | R_{2 k} - R_{2 k - 1} | \leq \frac{2^{2 k - 1} + 4}{3}

Proof.

From Lemma 1, it follows that

| R_{5} \cap R_{4} | = 2

implying

| R_{5} - R_{4} | \leq 2^{3} - 2 = 6

,

| R_{6} \cap R_{5} | = 4

implying

| R_{6} - R_{5} | \leq 2^{4} - 4 = 12

.

In general, notice that for

k \geq 4

,

| F_{2 k - 2 l} (r_{g}^{2 k - 2 l + 2} - r_{g}^{2 k - 2 l + 3}) | = 2^{2 k - 2 l - 1}

and

| F_{2 k - 2 l + 1} (r_{g}^{2 k - 2 l + 3} - r_{g}^{2 k - 2 l + 4}) | = 2^{2 k - 2 l}

for

2 \leq l \leq k - 1

. Also,

| R_{2 k - 1} \cap R_{2 k - 2} | = 2^{2 k - 5} + 2^{2 k - 7} + \dots + 2 = \frac{2}{3} (2^{2 k - 4} - 1)

implying that

| R_{2 k - 1} - R_{2 k - 2} | \leq 2^{2 k - 3} - \frac{2}{3} (2^{2 k - 4} - 1) = \frac{2^{2 k - 2} + 2}{3}

and

| R_{2 k} \cap R_{2 k - 1} | = 2^{2 k - 4} + 2^{2 k - 6} + \dots + 2^{2} = \frac{4}{3} (2^{2 k - 4} - 1)

implying that

| R_{2 k} - R_{2 k - 1} | \leq 2^{2 k - 2} - \frac{4}{3} (2^{2 k - 4} - 1) = \frac{2^{2 k - 1} + 4}{3}

.

Thus, Lemma 2 is proved. □

Lemma 3.

For

i \geq 4

,

| R_{i + 1} - R_{i} |

s are evaluated exactly by getting rid of the redundancies:

More explicitly, for

j \geq 6

, not all elements in

R_{j}

are distinct. In fact, for

k \geq 3

,

R_{2 k}

has

2^{2 k - 4} - 2

and

R_{2 k + 1}

has

2^{2 k - 3} - 2

pairs of elements which are numerically equal so that

| R_{2 k} - R_{2 k - 1} | = \frac{5 \cdot 2^{2 k - 4} + 10}{3}, | R_{2 k + 1} - R_{2 k} | = \frac{5 \cdot 2^{2 k - 3} + 8}{3}

Proof.

From now on, we refer to duplicates as those pairs of polynomials or elements in R which have different algebraic expressions, but becaue of our choice of r, they are numerically equal. In order to exactly evaluate

| R_{i + 1} - R_{i} |

for

i \geq 4

, we need to identify such pairs.

Thus,

R_{6} - R_{5}

has two pairs of duplicates, namely,

r_{g}^{2} - r_{g}^{5} + r_{g}^{6}

&

r_{g} - r_{g}^{2} + r_{g}^{4} - r_{g}^{5} + r_{g}^{6}

;

r_{g} - r_{g}^{2} + r_{g}^{5} - r_{g}^{6}

and

r_{g}^{2} - r_{g}^{4} + r_{g}^{5} - r_{g}^{6}

because

r_{g} - r_{g}^{2} + r_{g}^{4} - r_{g}^{5} + r_{g}^{6} = r_{g}^{2} - r_{g}^{4} + r_{g}^{4} - r_{g}^{5} + r_{g}^{6} = r_{g}^{2} - r_{g}^{5} + r_{g}^{6}

r_{g} - r_{g}^{2} + r_{g}^{5} - r_{g}^{6} = r_{g}^{2} - r_{g}^{4} + r_{g}^{5} - r_{g}^{6}

In general,

R_{2 k} - R_{2 k - 1}

has

2 + 4 + \dots + 2^{2 k - 5}

pairs of duplicates implying that

| R_{2 k} - R_{2 k - 1} | = \frac{2^{2 k - 1} + 4}{3} - (2^{2 k - 4} - 2) = \frac{5 \cdot 2^{2 k - 4} + 10}{3}

. Here, each pair in the union are disjoint sets.

Also,

R_{2 k + 1} - R_{2 k}

has

2 + 4 + \dots + 2^{2 k - 5}

pairs of duplicates implying that

| R_{2 k + 1} - R_{2 k} | = \frac{2^{2 k} + 2}{3} - (2^{2 k - 3} - 2) = \frac{5 \cdot 2^{2 k - 3} + 8}{3}

. Again, each pair in the union are disjoint sets.

Thus, Lemma 3 is proved. □

Lemma 4.

λ-measures of

R_{i + 1} - R_{i}

for

i \geq 3

are calculated as:

First of all,

λ (R_{3}) = λ (r_{g} - r_{g}^{2}) (p_{10} + p_{01})

and for

k \geq 2

,

λ (R_{2 k} - R_{2 k - 1}) = λ (R_{2 k - 1} - R_{2 k - 2}) (p_{10} + p_{01}) + λ (r_{g} - r_{g}^{2}) p_{10}^{2 k - 3} p_{01} (p_{10} + p_{01})

which equals

λ (R_{2 k} - R_{2 k - 1}) = λ (r_{g} - r_{g}^{2}) {(p_{10} + p_{01})}^{2 k - 2} [1 + \sum_{l = 0}^{k - 2} p_{10}^{2 l + 1} p_{01} {(p_{10} + p_{01})}^{- 2 l - 1} - \sum_{l = 1}^{k - 3} p_{10}^{2 l + 1} p_{01} {(p_{10} + p_{01})}^{- 2 l - 2}]

(4)

where for

k = 2

, the last sum in the above equation is absent. Also, we have,

λ (R_{2 k + 1} - R_{2 k}) = λ (R_{2 k} - R_{2 k - 1}) (p_{10} + p_{01}) - λ (r_{g} - r_{g}^{2}) p_{10}^{2 k - 3} p_{01} (p_{10} + p_{01})

which equals

λ (R_{2 k + 1} - R_{2 k}) = λ (r_{g} - r_{g}^{2}) {(p_{10} + p_{01})}^{2 k - 1} [1 + \sum_{l = 0}^{k - 2} p_{10}^{2 l + 1} p_{01} [{(p_{10} + p_{01})}^{- 2 l - 1} - {(p_{10} + p_{01})}^{- 2 l - 2}]]

(5)

where for

k = 2

,

R_{2 k} - R_{2 k - 1} = R_{4} - R_{3} = R_{4}

and

R_{2 k - 1} - R_{2 k - 2} = R_{3} - R_{2} = R_{3}

.

Proof.

Recall that

R_{3} = F_{2} (r_{g}^{3}) = {r_{g}^{2} - r_{g}^{3}, r_{g} - r_{g}^{2} + r_{g}^{3}}

. Then, using (2) and Proposition 3, we have

λ (R_{3}) = λ (r_{g}^{2} - r_{g}^{3}) + λ (r_{g} - r_{g}^{2} + r_{g}^{3}) = λ (r_{g} - r_{g}^{2}) (p_{10} + p_{01})

Next, we have

R_{4} = F_{3} (r_{g}^{4}) = {r_{g}^{3} - r_{g}^{4}, r_{g}^{2} - r_{g}^{3} + r_{g}^{4}, r_{g} - r_{g}^{3} + r_{g}^{4}, r_{g} - r_{g}^{2} + r_{g}^{3} - r_{g}^{4}}

. We find

λ

-measures of these points by making use of (2) and Remark 2. Thus, we notice that

λ (1 - r_{g} + r_{g}^{2} - r_{g}^{3}) = λ (r_{g} - r_{g}^{3} + r_{g}^{4}) = λ (r_{g} - r_{g}^{2}) p_{10} p_{01}

Putting all these together,

λ (R_{4} - R_{3})

equals

λ (r_{g} - r_{g}^{2}) {(p_{10} + p_{01})}^{2} + λ (1 - r_{g} + r_{g}^{2} - r_{g}^{3}) (p_{10} + p_{01}) = λ (r_{g} - r_{g}^{2}) {(p_{10} + p_{01})}^{2} + λ (r_{g} - r_{g}^{2}) p_{10} p_{01} (p_{10} + p_{01})

In other words,

λ (R_{4} - R_{3}) = λ (R_{3}) (p_{10} + p_{01}) + λ (r_{g} - r_{g}^{2}) p_{10} p_{01} (p_{10} + p_{01})

(6)

It is to be noted that

R_{4} \cap R_{3} = \emptyset

, and so

R_{4} - R_{3} = R_{4}

which implies

λ (R_{4} - R_{3}) = λ (R_{4})

.

Before proceeding further, we notice that

F_{j} (r_{g}^{j + 1}) = g (F_{j - 1} (r_{g}^{j})) \cup f_{1} \circ g (F_{j - 1} (r_{g}^{j}))

for

j \geq 4

and

R_{j + 1} - R_{j}

equals

g (R_{j} - R_{j - 1}) \cup f_{1} \circ g (R_{j} - R_{j - 1})

for

j \geq 5

.

However, at the next stage, we have already noticed that

R_{5} \cap R_{4} \neq \emptyset

, and so

R_{5} - R_{4} \neq R_{5}

. In fact,

R_{5} = F_{4} (r_{g}^{5})

and

R_{5} - R_{4} = F_{4} (r_{g}^{5}) - F_{2} (r_{g}^{4} - r_{g}^{5})

Now, notice that

F_{4} (r_{g}^{5}) = g (R_{4} - R_{3}) \cup f_{1} \circ g (R_{4} - R_{3})

So,

R_{5} - R_{4} = g (R_{4} - R_{3}) \cup f_{1} \circ g (R_{4} - R_{3}) - F_{2} (r_{g}^{4} - r_{g}^{5})

This is the same as

[g (R_{4} - R_{3}) - g (r_{g} - r_{g}^{3} + r_{g}^{4})] \cup [f_{1} \circ g (R_{4} - R_{3}) - f_{1} \circ g (r_{g} - r_{g}^{3} + r_{g}^{4})]

Since

g (R_{4} - R_{3}) - g (r_{g} - r_{g}^{3} + r_{g}^{4})

and

f_{1} \circ g (R_{4} - R_{3}) - f_{1} \circ g (r_{g} - r_{g}^{3} + r_{g}^{4})

do not have overlaps, we deduce that

λ (R_{5} - R_{4}) = λ (R_{4} - R_{3}) (p_{10} + p_{01}) - λ (r_{g} - r_{g}^{2}) p_{10} p_{01} (p_{10} + p_{01})

(7)

which equals

λ (R_{5} - R_{4}) = λ (r_{g} - r_{g}^{2}) {(p_{10} + p_{01})}^{3} + λ (r_{g} - r_{g}^{2}) p_{10} p_{01} {(p_{10} + p_{01})}^{2} - λ (r_{g} - r_{g}^{2}) p_{10} p_{01} (p_{10} + p_{01})

(8)

Thus, from Equations (6) and (8), we observe that Lemma 4 is proved for

k = 2

. For general k, one can use induction on k and carefully sort out the issues with the duplicates to complete the proof of the lemma. □

Lemma 5.

Finally, we calculate λ-measure of R:

λ (R) = \sum_{j = 3}^{\infty} λ (R_{j} - R_{j - 1}) = \sum_{j = 1}^{\infty} λ (r_{g} - r_{g}^{2}) {(p_{10} + p_{01})}^{j} = λ (r_{g} - r_{g}^{2}) \cdot \frac{p_{10} + p_{01}}{1 - p_{10} - p_{01}}

where we put

R_{2} = \emptyset

.

Proof.

Using (4) and (5) for

k \geq 2

, Lemma 5 follows trivially and the proof of the theorem is complete. □

5. Concluding Remarks

In the present context, it is interesting to recall an older problem, first introduced in [7]. It is as follows: consider the very simple situation of a

μ

that is supported on exactly two

2 \times 2

stochastic matrices, namely,

(\begin{matrix} a_{1} & 1 - a_{1} \\ b_{1} & 1 - b_{1} \end{matrix})

and

(\begin{matrix} a_{2} & 1 - a_{2} \\ b_{2} & 1 - b_{2} \end{matrix})

with

a_{i} > b_{i}

for

i = 1, 2

. Let the

μ

-masses at these two points be p and

1 - p

, respectively, where

0 < p < 1

. Let

λ

be the weak limit of the convolution sequence

μ^{n}

. What is the nature of

λ

? If we denote

a_{1} - b_{1} = s

and

a_{2} - b_{2} = t

, then, in [12], some partial solution to this problem was mentioned. In the special case scenario when

s = t

and

p = \frac{1}{2}

, it was observed in [13] that it is precisely the case of Bernoulli convolutions. In fact, the following proposition is stated in [13]:

Proposition 5.

Let μ be a probability measure giving equal mass to the matrices

(\begin{matrix} a_{1} & 1 - a_{1} \\ b_{1} & 1 - b_{1} \end{matrix})

and

(\begin{matrix} a_{2} & 1 - a_{2} \\ b_{2} & 1 - b_{2} \end{matrix})

with

a_{i} > b_{i}

for

i = 1, 2

. Let, say,

a_{1} - b_{1} = a_{2} - b_{2} = t

. Then, the limiting measure λ of the convolution sequence

μ^{n}

is absolutely continuous (where the limt λ is identified as a probability on

[0, 1]

) iff the law of

\sum_{n = 0}^{\infty} t^{n} ϵ_{n}

is absolutely continuous where

ϵ_{n}

’s are i.i.d.

+ 1

and

- 1

with equal probabilities.

Although the century old problem of Bernoulli convolutions was finally solved in [14], there had been a lot of previous studies at various times in different directions in spite of it being apparently a simple problem with

μ

concentrated on two points only. Thus, it is quite possible that under our current set up of

μ

being concentrated on four matrices with

\frac{1}{2} < r < 1

, the problem may be at least as challenging as the Bernoulli convolution problem.

We bring in the context of Bernoulli convolutions here to make readers aware that for a nontrivial

\frac{1}{2} < r < 1

, one needs to explore a number of ideas to proceed towards a complete solution for our problem.

Funding

This research received no external funding.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

I acknowledge my affiliation, the University of Texas Rio Grande Valley, for allowing me to use the university office, university library, and university computers for conducting my research.

Conflicts of Interest

The author declares no conflict of interest.

References

Hognas, G.; Mukherjea, A. Probability Measures on Semigroups, 2nd ed.; Springer: New York, NY, USA, 2011. [Google Scholar]
Chakraborty, S.; Mukherjea, A. Limit distributions of random walks on stochastic matrices. Proc. Indian Acad. Sci. (Math Sci.) 2014, 124, 603–612. [Google Scholar] [CrossRef]
Chakraborty, S. Cyclicity and weak convergence for convolution of measures on non-negative matrices. Sankhya Indian J. Stat. 2007, 69, 304–313. [Google Scholar]
Chakraborty, S.; Mukherjea, A. Completely simple semigroups of real d×d matrices and real random walks. Contemp. Math. 2010, 516, 99–108. [Google Scholar]
Chakraborty, S.; Rao, B.V. Convolution powers of probabilities on stochastic matrices of order 3. Sankhya A 1998, 60, 151–170. [Google Scholar]
Chakraborty, S.; Rao, B.V. Convolution powers of probabilities on stochastic matrices. J. Theor. Probab. 2001, 14, 599–603. [Google Scholar] [CrossRef]
Mukherjea, A. Limit Theorems: Stochastic matrices, ergodic markov chains and measures on semigroups. In Probability Analysis and Related Topics; Bharucha Reid, A.T., Ed.; Academic Press: New York, NY, USA, 1979; Volume 2, pp. 143–203. [Google Scholar]
Chamayou, J.F.; Letac, G. A transient random walk on stochastic matrices with Dirichlet distributions. Ann. Prob. 1994, 22, 424–430. [Google Scholar] [CrossRef]
Van Assche, W. Products of 2 × 2 stochastic matrices with random entries. J. Appl. Prob. 1986, 23, 1019–1024. [Google Scholar] [CrossRef]
Chassaing, P.; Letac, G.; Mora, M. Brocot Sequences and Random Walks in SL(2,R); Springer: Berlin/Heidelberg, Germany, 1985; Volume 1034, pp. 37–50. [Google Scholar]
Jaeger, S. The Golden Ratio in Machine Learning. In Proceedings of the IEEE Applied Imagery Pattern Recognition Workshop (AIPR), Washington, DC, USA, 12–14 October 2021; pp. 1–7. [Google Scholar]
Mukherjea, A.; Tserpes, N.A. Measures on Topological Semigroups; Lecture Notes in Mathematics; Springer: Berlin/Heidelberg, Germany, 1976; Volume 547. [Google Scholar]
Chakraborty, S.; Rao, B.V. Bernoulli Convolutions. In Mathematical Models for Bioengineering and Probabilistic Systems; Mishra, J.C., Ed.; Narosa Publishing House: New Delhi, India, 2005; pp. 380–404. [Google Scholar]
Solomyak, B. On the random series ∑±λⁿ (an Erdos problem). Ann. Math. 1995, 142, 611–625. [Google Scholar] [CrossRef]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chakraborty, S. Limit Distributions of Products of Independent and Identically Distributed Random 2 × 2 Stochastic Matrices: A Treatment with the Reciprocal of the Golden Ratio. Mathematics 2023, 11, 4993. https://doi.org/10.3390/math11244993

AMA Style

Chakraborty S. Limit Distributions of Products of Independent and Identically Distributed Random 2 × 2 Stochastic Matrices: A Treatment with the Reciprocal of the Golden Ratio. Mathematics. 2023; 11(24):4993. https://doi.org/10.3390/math11244993

Chicago/Turabian Style

Chakraborty, Santanu. 2023. "Limit Distributions of Products of Independent and Identically Distributed Random 2 × 2 Stochastic Matrices: A Treatment with the Reciprocal of the Golden Ratio" Mathematics 11, no. 24: 4993. https://doi.org/10.3390/math11244993

APA Style

Chakraborty, S. (2023). Limit Distributions of Products of Independent and Identically Distributed Random 2 × 2 Stochastic Matrices: A Treatment with the Reciprocal of the Golden Ratio. Mathematics, 11(24), 4993. https://doi.org/10.3390/math11244993

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Limit Distributions of Products of Independent and Identically Distributed Random 2 × 2 Stochastic Matrices: A Treatment with the Reciprocal of the Golden Ratio

Abstract

1. Introduction

2. Preliminaries

2.1. Case: $0 < r \leq \frac{1}{2}$

2.2. Case: $\frac{1}{2} < r < 1$

3. $r = r_{g}$ : Main Results

4. $r = r_{g}$ : Proof of the Main Theorem

5. Concluding Remarks

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Limit Distributions of Products of Independent and Identically Distributed Random 2 × 2 Stochastic Matrices: A Treatment with the Reciprocal of the Golden Ratio

Abstract

1. Introduction

2. Preliminaries

2.1. Case: 0 < r ≤ 1 2

2.2. Case: 1 2 < r < 1

3. r = r g : Main Results

4. r = r g : Proof of the Main Theorem

5. Concluding Remarks

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2.1. Case: $0 < r \leq \frac{1}{2}$

2.2. Case: $\frac{1}{2} < r < 1$

3. $r = r_{g}$ : Main Results

4. $r = r_{g}$ : Proof of the Main Theorem