Abstract
In this paper, we investigate waiting time problems for a finite collection of patterns in a sequence of independent multi-state trials. By constructing a finite GI/M/1-type Markov chain with a disaster and then using the matrix analytic method, we obtain the probability generating function of the waiting time. From this, we obtain the stopping probabilities and the mean waiting time; it also enables us to compute the waiting time distribution by numerical inversion.
1. Introduction
Waiting time problems for runs and patterns in a random sequence of trials are of theoretical interest and have practical applications in various areas of statistics and applied probability, such as reliability, sampling inspection, quality control, DNA/RNA sequence analysis, and hypothesis testing [1]. For comprehensive surveys and applications of related waiting time problems, refer to the books of Balakrishnan and Koutras [2] and Fu and Lou [3].
Let $\{X_n\}_{n \ge 1}$ be a sequence of random variables taking values in a finite set $A$. A finite sequence of elements of $A$ is called a pattern. We consider a finite collection of patterns $\mathcal{C} = \{\Lambda_1, \ldots, \Lambda_K\}$, possibly of different lengths. For $i = 1, \ldots, K$, let $W_i$ be the waiting time until the first occurrence of pattern $\Lambda_i$ as a run in the sequence $X_1, X_2, \ldots$. Let $W$ be the waiting time until one of the $K$ patterns appears, i.e., $W = \min(W_1, \ldots, W_K)$.
Many researchers have studied waiting time problems for general and specific choices of patterns in a random sequence of trials. When $\{X_n\}$ is a sequence of independent and identically distributed (i.i.d.) Bernoulli trials, Fu and Koutras [4] developed a finite Markov chain embedding method, first employed by Fu [5], to study the exact distributions of the numbers of specified runs and patterns. Fu [6] extended the finite Markov chain embedding method to study the exact distributions of the numbers of runs and patterns in a sequence of i.i.d. multi-state trials. In addition, he obtained the waiting time distribution of a specified pattern.
In this paper, we are mainly interested in computing the waiting time distribution, as well as the stopping probabilities $P(W = W_i)$, $i = 1, \ldots, K$. Li [7], Gerber and Li [8], Guibas and Odlyzko [9], Blom and Thorburn [10] and Antzoulakos [11] considered the case where $\{X_n\}$ is a sequence of i.i.d. multi-state trials. Li [7] and Gerber and Li [8] used the martingale approach to obtain the mean waiting time $E[W]$ and the stopping probabilities $P(W = W_i)$, $i = 1, \ldots, K$, for a finite collection of patterns. Guibas and Odlyzko [9] used a combinatorial method to obtain the probability generating function of the waiting time. Blom and Thorburn [10] also used a combinatorial method to obtain the mean waiting time and the stopping probabilities for a finite collection of patterns of the same length. Antzoulakos [11] used the finite Markov chain embedding method to study waiting time problems for a single pattern as well as for a finite collection of patterns.
Han and Hirano [12], Fu and Chang [13], Glaz et al. [14], Pozdnyakov [15], Gava and Salotti [16], Zhao et al. [17] and Kerimov and Öner [18] considered the case where $\{X_n\}$ is a discrete time homogeneous Markov chain with a finite state space, i.e., a sequence of Markov dependent multi-state trials. Han and Hirano [12] studied waiting time problems for two different patterns. Fu and Chang [13] studied waiting time problems for a finite collection of patterns by using the finite Markov chain embedding method. Glaz et al. [14] obtained the mean waiting time and the probability generating function of the waiting time for a finite collection of patterns in a two-state Markov chain by using the method of gambling teams and the martingale approach. Pozdnyakov [15] investigated the same problems as Glaz et al. [14] for multi-state Markovian trials. Gava and Salotti [16] obtained a system of linear equations for the stopping probabilities $P(W = W_i)$, $i = 1, \ldots, K$, by using the methods developed for gambling teams in [14,15]. Recently, Zhao et al. [17] found a method, based on that of [9], to calculate $E[W]$ and $P(W = W_i)$, $i = 1, \ldots, K$. Even more recently, Kerimov and Öner [18] found oscillation properties of the expected stopping times and stopping probabilities for patterns consisting of two consecutive states. For useful reviews of different approaches to waiting time problems of patterns for both i.i.d. and Markov dependent trials, refer to Fu and Lou [3].
Antzoulakos [11] and Fu and Chang [13] obtained the probability generating function of the waiting time for a finite collection of patterns in a sequence of i.i.d. and Markov dependent multi-state trials, respectively. They used a Markov chain with absorbing states corresponding to the patterns and considered the waiting time as the first entrance time into the set of absorbing states. The Markov chain has a transition probability matrix $P$ of the form
$$P = \begin{pmatrix} N & M \\ O & I \end{pmatrix}, \qquad (1)$$
where $N$ is the submatrix of $P$ whose entries are transition probabilities from a transient state to a transient state, $M$ is the submatrix of $P$ whose entries are transition probabilities from a transient state to an absorbing state, $O$ is the zero matrix and $I$ is the identity matrix. By using the general formula for the probability generating function of the first entrance time into an absorbing state, they obtained the probability generating function of the waiting time. Their results are expressed in terms of the submatrices $N$ and $M$, as well as variants of them. Chang [19] also studied waiting time problems for a finite collection of patterns. He investigated the distribution of the waiting time until the rth occurrence of any pattern in the collection. He also used the expression (1) for his analysis.
In this paper, we consider a sequence of i.i.d. multi-state trials. We also use a Markov chain with a transition probability matrix of the form (1). However, we investigate in detail the structure of the submatrices $N$ and $M$. This enables us to construct a finite GI/M/1-type Markov chain with a disaster and to regard the waiting time as the time until the occurrence of the disaster. Based on this construction and the matrix analytic method, we obtain the probability generating function of the waiting time $W$ on the events $\{W = W_i\}$, $i = 1, \ldots, K$. From this, we can obtain the stopping probabilities $P(W = W_i)$ as well as the conditional/unconditional mean waiting times $E[W \mid W = W_i]$ and $E[W]$; it also enables us to compute the waiting time distribution by numerical inversion. A benefit of our method is that it remains efficient even when the lengths of the patterns are large. Our method can also be extended to Markov dependent multi-state trials.
The paper is organized as follows. In Section 2, we formulate our waiting time problems. In Section 3, we construct a GI/M/1-type Markov chain with a disaster. From this we can obtain our results, which are given in Section 4. In Section 5, numerical examples are presented to illustrate our results. Conclusions are given in Section 6.
2. Problem Formulation
Let $\{X_n\}_{n \ge 1}$ be a sequence of i.i.d. trials taking values in a finite set $A$. Assume that for $x \in A$,
$$P(X_n = x) = p_x,$$
where $p_x > 0$ and $\sum_{x \in A} p_x = 1$. For a finite collection of patterns $\mathcal{C} = \{\Lambda_1, \ldots, \Lambda_K\}$, suppose that pattern $\Lambda_i$ is of the form
$$\Lambda_i = (\Lambda_{i,1}, \ldots, \Lambda_{i,l_i}),$$
where $\Lambda_{i,j} \in A$, i.e., $\Lambda_i$ is a pattern of length $l_i$. Here, $l_1, \ldots, l_K$ are fixed positive integers, and the patterns are labeled in order of their lengths. Recall that $W$ is the waiting time until one of the $K$ patterns appears, i.e.,
$$W = \min(W_1, \ldots, W_K).$$
We will call W the sooner waiting time.
Our main interest is to derive the probability generating function of the sooner waiting time $W$ on the events $\{W = W_i\}$, $i = 1, \ldots, K$, i.e.,
$$E\!\left[z^{W} \mathbf{1}_{\{W = W_i\}}\right], \qquad i = 1, \ldots, K. \qquad (2)$$
From this, we can obtain the stopping probabilities, the conditional/unconditional probability mass functions of W, and the conditional/unconditional means of W as follows:
- The stopping probabilities $P(W = W_i)$, $i = 1, \ldots, K$, are given by $P(W = W_i) = E\big[z^{W} \mathbf{1}_{\{W = W_i\}}\big]\big|_{z = 1}$.
- The conditional probability mass function of $W$ given $W = W_i$, i.e., $P(W = n \mid W = W_i)$, $n = 1, 2, \ldots$, can be computed from the conditional probability generating function of $W$ given $W = W_i$, $E\big[z^{W} \mid W = W_i\big] = E\big[z^{W} \mathbf{1}_{\{W = W_i\}}\big] / P(W = W_i)$, by numerical inversion.
- The probability mass function of $W$, $P(W = n)$, $n = 1, 2, \ldots$, can be computed from $E[z^{W}] = \sum_{i=1}^{K} E\big[z^{W} \mathbf{1}_{\{W = W_i\}}\big]$ by numerical inversion.
- The conditional mean of $W$, $E[W \mid W = W_i]$, $i = 1, \ldots, K$, can be obtained from $E[W \mid W = W_i] = E\big[W \mathbf{1}_{\{W = W_i\}}\big] / P(W = W_i)$, where $E\big[W \mathbf{1}_{\{W = W_i\}}\big] = \frac{d}{dz} E\big[z^{W} \mathbf{1}_{\{W = W_i\}}\big]\big|_{z = 1}$. In addition, the unconditional mean of $W$, $E[W]$, can be obtained from $E[W] = \sum_{i=1}^{K} E\big[W \mathbf{1}_{\{W = W_i\}}\big]$.
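As a quick numerical illustration of the relations in the list above, the following sketch (not from the paper) applies them to hypothetical generating functions $\phi_i(z) = E[z^W \mathbf{1}_{\{W = W_i\}}]$; the derivative at $z = 1$ is taken by a central difference. The two $\phi_i$ used below are toy functions of the required form (a stopping probability times a geometric conditional law), standing in only for the generating functions that Theorem 1 produces.

```python
# Hypothetical phi_i(z) = P(W = W_i) * E[z^W | W = W_i]; the geometric conditional
# laws below are placeholders for illustration only.
phi = [lambda z: 0.6 * (0.5 * z) / (1 - 0.5 * z),
       lambda z: 0.4 * (0.2 * z) / (1 - 0.8 * z)]

def d_dz(f, z=1.0, h=1e-6):
    """Central-difference approximation of f'(z)."""
    return (f(z + h) - f(z - h)) / (2 * h)

stop_probs = [f(1.0) for f in phi]              # P(W = W_i) = phi_i(1)
cond_means = [d_dz(f) / f(1.0) for f in phi]    # E[W | W = W_i] = phi_i'(1) / phi_i(1)
mean_w = sum(d_dz(f) for f in phi)              # E[W] = sum_i phi_i'(1)
print(stop_probs, cond_means, mean_w)           # ~[0.6, 0.4], ~[2.0, 5.0], ~3.2
```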
3. GI/M/1-Type Markov Chain with a Disaster
In this section, we construct a GI/M/1-type Markov chain with a disaster to obtain an expression for (2). We define the following three terms:
- A pattern $(a_1, \ldots, a_k)$ of length $k \ge 0$ is a subpattern of pattern $\Lambda_i$ if $(a_1, \ldots, a_k) = (\Lambda_{i,j+1}, \ldots, \Lambda_{i,j+k})$ for some $j$ with $0 \le j \le l_i - k$; when $k = 0$, $(a_1, \ldots, a_k)$ means the null pattern (i.e., the pattern of length 0).
- A subpattern $(a_1, \ldots, a_k)$ of pattern $\Lambda_i$ is proper if $k < l_i$ (i.e., if it is not $\Lambda_i$ itself).
- A subpattern of pattern $\Lambda_i$ is a leading subpattern of $\Lambda_i$ if it equals $(\Lambda_{i,1}, \ldots, \Lambda_{i,j})$ for some $j$ with $0 \le j \le l_i$, i.e., if it is a prefix of $\Lambda_i$.
Assume that for $i \ne j$, $\Lambda_i$ is not a proper subpattern of $\Lambda_j$.
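The subpattern relations above are straightforward to check mechanically. The following sketch (not part of the paper; the function names and the three-pattern collection are hypothetical) verifies the standing assumption that no pattern in the collection is a proper subpattern of another.

```python
def is_proper_subpattern(p, q):
    """True if pattern p occurs inside pattern q and is strictly shorter than q."""
    if len(p) >= len(q):
        return False
    return any(q[k:k + len(p)] == p for k in range(len(q) - len(p) + 1))

def check_collection(patterns):
    """Raise an error if some pattern is a proper subpattern of another pattern."""
    for i, p in enumerate(patterns):
        for j, q in enumerate(patterns):
            if i != j and is_proper_subpattern(p, q):
                raise ValueError(f"pattern {i + 1} is a proper subpattern of pattern {j + 1}")
    return True

# Hypothetical collection over the alphabet {0, 1, 2}:
print(check_collection([(0, 1, 2), (1, 1), (2, 0, 2, 1)]))   # True
```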
We now introduce a two-dimensional process $(J_n, M_n)$, $n \ge 0$, where $J_n$ and $M_n$ are defined as follows:
(i) $J_0 = 0$, and for $n \ge 1$,
- $J_n$ is the largest $k \ge 0$ such that $(X_{n-k+1}, \ldots, X_n)$ is a proper leading subpattern of a pattern in $\mathcal{C}$, if none of the $K$ patterns has occurred by time $n$.
- $J_n = \Delta$, where $\Delta$ is an extra point, if one of the $K$ patterns has occurred by time $n$ (i.e., if $W \le n$).
(ii) $M_0 = 1$, and for $n \ge 1$,
- $M_n$ is the smallest $i$ such that $(X_{n-J_n+1}, \ldots, X_n)$ is a proper leading subpattern of pattern $\Lambda_i$, if $J_n \ge 1$.
- $M_n = 1$, if $J_n = 0$.
- $M_n = i$, if $J_n = \Delta$ and $W = W_i$.
To clarify the definitions of $J_n$ and $M_n$, we provide the following example. Let $\{X_n\}_{n \ge 1}$ be a sequence of i.i.d. trials taking values in a finite set $A$, and suppose $\mathcal{C} = \{\Lambda_1, \Lambda_2\}$ for two specified patterns $\Lambda_1$ and $\Lambda_2$. For one sample sequence of trials, the corresponding values of $(J_n, M_n)$ are given in Table 1; for another sample sequence of trials, they are given in Table 2. Note that $\{(J_n, M_n)\}_{n \ge 0}$ is a discrete time Markov chain.
Table 1.
Sample paths of $(J_n, M_n)$ corresponding to the first sample sequence of $\{X_n\}$.
Table 2.
Sample paths of $(J_n, M_n)$ corresponding to the second sample sequence of $\{X_n\}$.
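Tables of this kind can be reproduced mechanically. The sketch below (not the authors' code) computes $(J_n, M_n)$ along a given trial sequence, following the reconstruction of the definitions above; the symbol `DELTA`, the function name and the two-pattern collection at the bottom are assumptions made for illustration.

```python
DELTA = "Delta"   # extra point: some pattern has already occurred

def track(trials, patterns):
    """Yield (J_n, M_n) for n = 1, 2, ... along a trial sequence."""
    occurred, history = None, []
    max_len = max(len(p) for p in patterns)
    for x in trials:
        history.append(x)
        if occurred is None:
            # has one of the patterns just been completed?
            for i, p in enumerate(patterns, start=1):
                if len(history) >= len(p) and tuple(history[-len(p):]) == tuple(p):
                    occurred = i
                    break
        if occurred is not None:
            yield (DELTA, occurred)
            continue
        # J_n: longest suffix of the history that is a proper leading subpattern of some pattern
        J = 0
        for k in range(min(len(history), max_len - 1), 0, -1):
            suffix = tuple(history[-k:])
            if any(len(p) > k and tuple(p[:k]) == suffix for p in patterns):
                J = k
                break
        # M_n: smallest pattern index admitting that suffix as a proper leading subpattern
        suffix = tuple(history[-J:]) if J > 0 else ()
        M = min(i for i, p in enumerate(patterns, start=1)
                if len(p) > J and tuple(p[:J]) == suffix)
        yield (J, M)

# Hypothetical example over the alphabet {a, b}:
print(list(track("abbaba", [("a", "b", "a"), ("b", "b")])))
```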
Define $l_{\max} = \max(l_1, \ldots, l_K)$ and, for $k = 1, \ldots, l_{\max} - 1$, let $r_k$ be the number of patterns in $\mathcal{C}$ whose lengths are larger than $k$, i.e.,
$$r_k = \#\{\, i : l_i > k \,\}.$$
We also define $r_{l_{\max}} = 0$. Note that $K \ge r_1 \ge r_2 \ge \cdots \ge r_{l_{\max} - 1} \ge 1$. If $J_n = k$ with $1 \le k \le l_{\max} - 1$, then $(X_{n-k+1}, \ldots, X_n)$ is a proper leading subpattern of one of the $r_k$ patterns whose lengths exceed $k$. Furthermore, the set of all possible values of $M_n$ when $J_n = k$ is given by the set of indices of these $r_k$ patterns.
Therefore, the state space of the discrete time Markov chain $\{(J_n, M_n)\}_{n \ge 0}$ is
$$S = \{(0, 1)\} \,\cup\, \{(k, i) : 1 \le k \le l_{\max} - 1,\ l_i > k\} \,\cup\, \{(\Delta, i) : 1 \le i \le K\}.$$
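As a hypothetical illustration of the quantities $r_k$ (the lengths below are not those used in the examples of Section 5), suppose $K = 3$ and the pattern lengths are 4, 3 and 2. Then

```latex
r_1 = 3, \qquad r_2 = 2, \qquad r_3 = 1,
```

so the chain has one state at level 0, three states at level 1, two states at level 2, one state at level 3, and $K = 3$ states at the disaster level $\Delta$.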
For each state $(k, i) \in S$, the first component $k$ is called the level. The one-step transition probability matrix $P$ of $\{(J_n, M_n)\}_{n \ge 0}$ is given, in lexicographic order with $\Delta$ being the last element in the set of levels, as follows:
where the submatrices are described below. A matrix whose rows are indexed by the states of one level and whose columns are indexed by the states of another level will be referred to by the corresponding dimensions; for example, an $r_k \times r_m$ matrix has rows indexed by the $r_k$ states of level $k$ and columns indexed by the $r_m$ states of level $m$.
- For each level $k$ ($0 \le k \le l_{\max} - 1$), the block describing transitions from level $k$ to level $0$ is the matrix whose $(i, 1)$-component is $\sum_{x \in A_{k,i}} p_x$, where $A_{k,i}$ is the subset of $A$ consisting of $x$ such that $(\Lambda_{i, m+1}, \ldots, \Lambda_{i, k}, x)$ is not a leading subpattern of a pattern in $\mathcal{C}$ for any $m = 0, 1, \ldots, k$.
- For each level $k$ ($0 \le k \le l_{\max} - 1$), the block describing transitions from level $k$ to the disaster level $\Delta$ is a matrix with one column for each pattern. Its $(i, j)$-component is $p_x$ with $x = \Lambda_{j, l_j}$ if appending $x$ completes pattern $\Lambda_j$, i.e., if $l_j \le k + 1$ and $(\Lambda_{j,1}, \ldots, \Lambda_{j, l_j - 1}) = (\Lambda_{i, k - l_j + 2}, \ldots, \Lambda_{i, k})$. Otherwise, the $(i, j)$-component is $0$.
- For each pair of levels $k$ and $m$ with $0 \le k \le l_{\max} - 1$ and $1 \le m \le \min(k + 1, l_{\max} - 1)$, the block describing transitions from level $k$ to level $m$ is a matrix with one column for each of the $r_m$ patterns whose lengths exceed $m$. Its $(i, j)$-component is $p_x$, with $x = \Lambda_{j, m}$, if the following three conditions hold:
- (i)
- $(\Lambda_{i, k - m + 2}, \ldots, \Lambda_{i, k}, x)$ is a proper leading subpattern of pattern $\Lambda_j$;
- (ii)
- $(\Lambda_{i, k - m + 2}, \ldots, \Lambda_{i, k}, x)$ is not a proper leading subpattern of pattern $\Lambda_{j'}$ for $1 \le j' \le j - 1$;
- (iii)
- $(\Lambda_{i, k - m' + 2}, \ldots, \Lambda_{i, k}, x)$ is not a leading subpattern of a pattern in $\mathcal{C}$ for $m < m' \le k + 1$.
Otherwise, the $(i, j)$-component is $0$.
- O’s are zero matrices (possibly of different sizes).
- $I$ is the $K \times K$ identity matrix corresponding to the absorbing level $\Delta$.
To make it easier to understand how the matrix $P$ in (4) is constructed, consider again the example with $\mathcal{C} = \{\Lambda_1, \Lambda_2\}$ described above. The quantities $l_{\max}$ and $r_1, \ldots, r_{l_{\max}-1}$ follow directly from the two pattern lengths, each submatrix of $P$ is obtained from the rules above, and $P$ is then written out explicitly as a block matrix.
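To complement this example, the following sketch (not the authors' code) assembles the transition matrix of $\{(J_n, M_n)\}$ by brute force from the one-step update of the chain, under the reconstruction of the definitions in this section; the state encoding, the function names and the two-pattern collection at the bottom are assumptions made for illustration. Sorting the states by level makes the block structure of (4) visible in the printed matrix.

```python
import numpy as np

def next_state(state, x, patterns):
    """One-step update of (J, M) when the next trial equals x."""
    J, M = state
    if J == "Delta":
        return state                              # the disaster level is absorbing
    window = patterns[M - 1][:J] + (x,)           # the only symbols that can still matter
    for i, p in enumerate(patterns, start=1):     # has some pattern just been completed?
        if len(p) <= len(window) and window[len(window) - len(p):] == p:
            return ("Delta", i)
    for k in range(len(window), 0, -1):           # longest suffix that is a proper prefix
        suffix = window[len(window) - k:]
        hits = [i for i, p in enumerate(patterns, start=1) if len(p) > k and p[:k] == suffix]
        if hits:
            return (k, min(hits))
    return (0, 1)

def transition_matrix(patterns, probs):
    """Reachable states and transition matrix P; probs maps each symbol to its probability."""
    states, frontier = [(0, 1)], [(0, 1)]
    while frontier:                               # enumerate all states reachable from (0, 1)
        s = frontier.pop()
        for x in probs:
            t = next_state(s, x, patterns)
            if t not in states:
                states.append(t)
                frontier.append(t)
    l_max = max(len(p) for p in patterns)
    states.sort(key=lambda s: (l_max if s[0] == "Delta" else s[0], s[1]))   # levels 0, 1, ..., Delta
    index = {s: r for r, s in enumerate(states)}
    P = np.zeros((len(states), len(states)))
    for s in states:
        for x, px in probs.items():
            P[index[s], index[next_state(s, x, patterns)]] += px
    return states, P

# Hypothetical two-pattern collection over {a, b} with P(a) = 0.6, P(b) = 0.4:
states, P = transition_matrix([("a", "b", "a"), ("b", "b")], {"a": 0.6, "b": 0.4})
print(states)
print(P.round(3))
```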
4. Probability Generating Function of the Waiting Time
In this section, we derive an expression for (2). The analysis is based on the matrix analytic method. For more details, refer to Neuts [20,21] and Latouche and Ramaswami [22]. Let
For $0 \le n \le l_{\max} - 2$, we define
which means the probability generating function for the time of the first visit to state $(n + 1, j)$, starting from state $(n, i)$ at time 0, before the first visit to level $\Delta$. Let $\widetilde{R}_n(z)$ denote the matrix of these probability generating functions, whose $(i, j)$-component is the function just defined. Conditioning on the first transition, we have
where the $(i, j)$-component of the matrix appearing in (5) is
Since
we have
Substituting (6) into (5), we obtain
Equation (7) can be interpreted as follows. Starting from level $n$, the Markov chain may visit level $n + 1$ (while avoiding level $\Delta$) in two ways: it may move up to level $n + 1$ at the very next transition (contributing the first term on the right-hand side of (7)), or it may move to level $k$ ($k \le n$) at the first transition, move up from level $k$ to level $k + 1$, then from level $k + 1$ to level $k + 2$, and so on, until finally moving from level $n$ to level $n + 1$ (contributing the second term on the right-hand side of (7)). From (7), we obtain
where $I$ denotes an identity matrix of the appropriate size.
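For readability, the decomposition just described can be written schematically as follows. The block symbols $B_{n,k}$ are introduced here only for illustration, denoting the one-step transition block from level $n$ to level $k$ with the disaster level excluded; this is a sketch of the structure of (7), not a reproduction of the paper's display.

```latex
\widetilde{R}_n(z) \;=\; z\,B_{n,\,n+1}
  \;+\; \sum_{k=0}^{n} z\,B_{n,\,k}\,
  \widetilde{R}_k(z)\,\widetilde{R}_{k+1}(z)\cdots\widetilde{R}_n(z),
  \qquad 0 \le n \le l_{\max}-2 .
```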
For $0 \le n \le l_{\max} - 1$, we define
which means that (for $0 \le n \le l_{\max} - 2$) it is the probability generating function for the time of the first visit to state $(\Delta, j)$, starting from state $(n, i)$, before the first visit to level $n + 1$, and that (for $n = l_{\max} - 1$) it is the probability generating function for the time of the first visit to state $(\Delta, j)$, starting from state $(n, i)$. Let $\widetilde{G}_n(z)$ denote the matrix of these probability generating functions, whose $(i, j)$-component is the function just defined. Conditioning on the first transition, we have
where the $(i, j)$-component of the matrix appearing in (9) is
Since
we have
Substituting (11) into (9), we obtain
which can be written as
From this equation, we obtain
In summary, we obtain the following theorem.
Theorem 1.
From Theorem 1, we can obtain the following results.
Corollary 1.
- (i)
- The stopping probabilities $P(W = W_i)$, $i = 1, \ldots, K$, are given by
- (ii)
- The conditional probability generating functions of $W$ given $W = W_i$, $i = 1, \ldots, K$, are given by
- (iii)
- The marginal probability generating function of W is given by
Remark. As mentioned in Section 2, the conditional probability mass functions $P(W = n \mid W = W_i)$, $i = 1, \ldots, K$, can be computed from (14) by numerical inversion. In addition, the probability mass function $P(W = n)$ can be computed from (15) by numerical inversion. For the numerical inversion of probability generating functions, refer to Abate and Whitt [23].
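For completeness, the following sketch implements the Fourier-series inversion formula of Abate and Whitt [23] for a probability generating function; choosing the radius $r = 10^{-\gamma/(2n)}$ keeps the aliasing error near $10^{-\gamma}$. The geometric generating function used in the demo is only a stand-in for the generating functions (14) and (15).

```python
import cmath

def invert_pgf(G, n, gamma=8):
    """Approximate p_n from a probability generating function G(z) = sum_n p_n z^n
    by the Abate-Whitt Fourier-series method."""
    if n == 0:
        return complex(G(0.0)).real
    r = 10.0 ** (-gamma / (2.0 * n))          # aliasing error is about 10**(-gamma)
    s = complex(G(r)) + (-1) ** n * complex(G(-r))
    s += 2.0 * sum((-1) ** j * complex(G(r * cmath.exp(1j * cmath.pi * j / n))).real
                   for j in range(1, n))
    return s.real / (2.0 * n * r ** n)

# Demo with a geometric PGF (a stand-in for the generating functions of Corollary 1):
p = 0.3
G = lambda z: p * z / (1 - (1 - p) * z)       # P(N = n) = p (1 - p)**(n - 1), n >= 1
print(invert_pgf(G, 4), p * (1 - p) ** 3)     # the two values should agree closely
```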
By Theorem 1, we can also obtain the conditional/unconditional means of the sooner waiting time. To this end, we introduce
Recall that $P(W = W_i) = E\big[z^{W} \mathbf{1}_{\{W = W_i\}}\big]\big|_{z = 1}$, $i = 1, \ldots, K$. By differentiating (13) with respect to $z$ and evaluating at $z = 1$, we have
Therefore, to obtain an expression for $E\big[W \mathbf{1}_{\{W = W_i\}}\big]$, $i = 1, \ldots, K$, we need to determine $\widetilde{R}_n(1)$ and $\widetilde{R}_n'(1)$, as well as $\widetilde{G}_n(1)$ and $\widetilde{G}_n'(1)$. Equation (8) may be written as
from which
Therefore, $\widetilde{R}_n(1)$ and $\widetilde{R}_n'(1)$, $0 \le n \le l_{\max} - 2$, are obtained as follows:
Similarly, we can obtain $\widetilde{G}_n(1)$ and $\widetilde{G}_n'(1)$ by using Equation (12). Equation (12) may be written as
from which
Therefore, $\widetilde{G}_n(1)$ and $\widetilde{G}_n'(1)$, $0 \le n \le l_{\max} - 1$, are obtained as follows:
Since $E\big[W \mathbf{1}_{\{W = W_i\}}\big] = \frac{d}{dz} E\big[z^{W} \mathbf{1}_{\{W = W_i\}}\big]\big|_{z = 1}$, $i = 1, \ldots, K$, we can obtain the conditional mean waiting times $E[W \mid W = W_i]$, $i = 1, \ldots, K$, from $E[W \mid W = W_i] = E\big[W \mathbf{1}_{\{W = W_i\}}\big] / P(W = W_i)$. We can also obtain the unconditional mean waiting time from $E[W] = \sum_{i=1}^{K} E\big[W \mathbf{1}_{\{W = W_i\}}\big]$. From these two formulas and (16), we obtain the following theorem (Theorem 2).
5. Numerical Examples
In this section, we present numerical results for the computations of the stopping probabilities, the probability mass functions (along with the tail probabilities) of the sooner waiting time, and the conditional/unconditional means of the sooner waiting time. To illustrate our results, we provide two examples.
Example 1.
Let $\{X_n\}_{n \ge 1}$ be a sequence of i.i.d. trials taking values in a finite set $A$. Assume that for $x \in A$,
Suppose that $K = 10$, i.e., the collection $\mathcal{C}$ consists of 10 patterns, $\Lambda_1, \ldots, \Lambda_{10}$. We select the collection of patterns as shown in Table 3, where the lengths of the patterns, $l_1, \ldots, l_{10}$, are chosen as the order statistics of i.i.d. random variables with mean 5. The set of patterns given in Table 3 is an example of a randomly selected pattern set in which no pattern is a subpattern of another. The procedure for randomly selecting such a pattern set is omitted here.
Table 3.
The patterns used in Example 1.
By Theorem 1, we can compute $E\big[z^{W} \mathbf{1}_{\{W = W_i\}}\big]$, $i = 1, \ldots, 10$. Table 4 shows the stopping probabilities $P(W = W_i)$, $i = 1, \ldots, 10$.
Table 4.
The stopping probabilities $P(W = W_i)$, $i = 1, \ldots, 10$, for Example 1.
In Figure 1, we plot the joint probabilities $P(W = n, W = W_i)$, $i = 1, \ldots, 10$, with $n$ varying. These can be computed by numerical inversion of the generating function $E\big[z^{W} \mathbf{1}_{\{W = W_i\}}\big]$.
Figure 1.
Plots of the joint probabilities $P(W = n, W = W_i)$, $i = 1, \ldots, 10$, for Example 1.
In Table 5, we present the probability mass function of $W$, $P(W = n)$, and the tail probability of $W$, $P(W > n)$, with $n$ varying. Here, $P(W = n)$ can be computed by numerical inversion of its generating function $E[z^{W}] = \sum_{i=1}^{10} E\big[z^{W} \mathbf{1}_{\{W = W_i\}}\big]$.
Table 5.
The probability mass function $P(W = n)$ and tail probability $P(W > n)$ for Example 1.
By Theorem 2, we can compute the conditional means of the sooner waiting time $W$, $E[W \mid W = W_i]$, $i = 1, \ldots, 10$, and the unconditional mean of $W$, $E[W]$. Table 6 shows the conditional and unconditional mean waiting times for Example 1.
Table 6.
The conditional means $E[W \mid W = W_i]$, $i = 1, \ldots, 10$, and the unconditional mean $E[W]$ for Example 1.
The next example concerns Bernoulli trials.
Example 2.
Let $\{X_n\}_{n \ge 1}$ be a sequence of i.i.d. Bernoulli trials, i.e., each $X_n$ takes values in the two-element set $A = \{0, 1\}$. Assume that for $n = 1, 2, \ldots$,
Suppose that the collection $\mathcal{C}$ consists of 5 patterns, $\Lambda_1, \ldots, \Lambda_5$, where
For Example 2, the joint probabilities $P(W = n, W = W_i)$, $i = 1, \ldots, 5$, are shown in Figure 2. Also, the stopping probabilities $P(W = W_i)$, $i = 1, \ldots, 5$, the probability mass function of $W$ (along with the tail probability) and the conditional/unconditional means of $W$ are shown in Table 7, Table 8 and Table 9, respectively.
Figure 2.
Plots of the joint probabilities $P(W = n, W = W_i)$, $i = 1, \ldots, 5$, for Example 2.
Table 7.
The stopping probabilities $P(W = W_i)$, $i = 1, \ldots, 5$, for Example 2.
Table 8.
The probability mass function $P(W = n)$ and tail probability $P(W > n)$ for Example 2.
Table 9.
The conditional means $E[W \mid W = W_i]$, $i = 1, \ldots, 5$, and the unconditional mean $E[W]$ for Example 2.
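Results such as those in Tables 7–9 can be cross-checked by simulation. The sketch below (not from the paper) estimates the stopping probabilities $P(W = W_i)$ and the mean $E[W]$ by Monte Carlo for a sequence of Bernoulli trials; the success probability and the three-pattern collection used here are hypothetical, not the ones of Example 2.

```python
import random

def simulate_once(patterns, p, rng):
    """Run Bernoulli(p) trials until one pattern occurs; return (pattern index, waiting time)."""
    max_len = max(len(pat) for pat in patterns)
    recent, n = [], 0
    while True:
        n += 1
        recent.append(1 if rng.random() < p else 0)
        recent = recent[-max_len:]               # only the most recent symbols can matter
        for i, pat in enumerate(patterns, start=1):
            if len(recent) >= len(pat) and tuple(recent[-len(pat):]) == pat:
                return i, n

def estimate(patterns, p, runs=100_000, seed=1):
    """Monte Carlo estimates of the stopping probabilities and of E[W]."""
    rng = random.Random(seed)
    counts = {i: 0 for i in range(1, len(patterns) + 1)}
    total = 0
    for _ in range(runs):
        i, w = simulate_once(patterns, p, rng)
        counts[i] += 1
        total += w
    return {i: c / runs for i, c in counts.items()}, total / runs

# Hypothetical collection of 0/1 patterns, success probability 0.5:
stop_probs, mean_w = estimate([(1, 1, 1), (0, 1, 0), (0, 0)], p=0.5)
print(stop_probs, mean_w)
```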
6. Conclusions
We have derived the probability generating function of the sooner waiting time for a finite collection of patterns in a sequence of i.i.d. multi-state trials. From this probability generating function, we have obtained the stopping probabilities and the mean waiting time; it has also enabled us to compute the waiting time distribution by numerical inversion. As mentioned in the Introduction, our method can be extended to Markov dependent multi-state trials.
For further research, we will investigate the tail asymptotics of the sooner waiting time $W$. From Figure 1 and Figure 2, we expect that the distribution of $W$ has a geometric tail. This is true under a certain aperiodicity condition, because $W$ is the first passage time to a subset of the state space of a discrete time Markov chain with a finite state space. Under such an assumption, the distribution of $W$ exhibits geometric tail behavior, i.e.,
$$P(W > n) \sim c\,\rho^{\,n} \quad \text{as } n \to \infty,$$
for some $c > 0$ and $\rho \in (0, 1)$. Here “$\sim$” means that the ratio of the two sides converges to 1. It would be of interest to find explicit expressions for $c$ and $\rho$. We also have the following geometric tail behavior:
$$P(W > n,\, W = W_i) \sim c_i\,\rho^{\,n} \quad \text{as } n \to \infty,$$
for some $c_i > 0$, $i = 1, \ldots, K$. Here $\rho$ is independent of $i$ and is the same as that described above. It would also be of interest to find explicit expressions for $c_i$, $i = 1, \ldots, K$.
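One standard way to make the geometric-tail claim precise (a sketch, under an additional primitivity assumption that is not verified here) is via Perron–Frobenius theory. Write $T$ for the substochastic matrix of one-step transitions among the non-absorbing states (levels $0, 1, \ldots, l_{\max} - 1$) of the chain in Section 3, and $\boldsymbol{\alpha}$ for the initial distribution on those states; the symbols $T$, $\boldsymbol{\alpha}$, $\mathbf{u}$, $\mathbf{v}$ are introduced here only for this sketch.

```latex
% If T is primitive with Perron root rho and right/left Perron vectors v, u
% normalized so that u^T v = 1, then
\begin{aligned}
P(W > n) &= \boldsymbol{\alpha}\, T^{\,n}\, \mathbf{1},\\
T^{\,n} &\sim \rho^{\,n}\, \mathbf{v}\, \mathbf{u}^{\top} \quad (n \to \infty),\\
\text{hence}\quad P(W > n) &\sim \big(\boldsymbol{\alpha}\,\mathbf{v}\big)\,\big(\mathbf{u}^{\top}\mathbf{1}\big)\, \rho^{\,n},
\end{aligned}
```

which identifies $\rho$ with the spectral radius of $T$ and gives one candidate expression for the constant $c$.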
Author Contributions
Conceptualization, B.K. and J.K. (Jeongsim Kim); investigation, B.K. and J.K. (Jeongsim Kim); methodology, B.K. and J.K. (Jeongsim Kim); numerical investigation, J.K. (Jerim Kim); writing–original draft preparation, J.K. (Jeongsim Kim); writing–review and editing, B.K., J.K. (Jeongsim Kim) and J.K. (Jerim Kim). All authors have read and agreed to the published version of the manuscript.
Funding
The first author’s research was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2020R1A2B5B01001864). The second author’s research was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2020R1F1A1A01065568).
Acknowledgments
We are grateful to the reviewers for their valuable comments and suggestions, which greatly improved this paper.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Kim, B.; Kim, J. Sooner waiting time problems in a sequence of multi-state trials with random rewards. Stat. Probab. Lett. 2019, 153, 171–179.
- Balakrishnan, N.; Koutras, M.V. Runs and Scans with Applications; Wiley: New York, NY, USA, 2002.
- Fu, J.C.; Lou, W.Y.W. Distribution Theory of Runs and Patterns and Its Applications: A Finite Markov Chain Imbedding Approach; World Scientific: Singapore, 2003.
- Fu, J.C.; Koutras, M.V. Distribution theory of runs: A Markov chain approach. J. Am. Stat. Assoc. 1994, 89, 1050–1058.
- Fu, J.C. Reliability of large consecutive-k-out-of-n: F systems with (k-1)-step Markov dependence. IEEE Trans. Reliab. 1986, R-35, 602–606.
- Fu, J.C. Distribution theory of runs and patterns associated with a sequence of multi-state trials. Stat. Sin. 1996, 6, 957–974.
- Li, S.-Y.R. A martingale approach to the study of occurrence of sequence patterns in repeated experiments. Ann. Probab. 1980, 8, 1171–1176.
- Gerber, H.U.; Li, S.-Y.R. The occurrence of sequence patterns in repeated experiments and hitting times in a Markov chain. Stoch. Process. Their Appl. 1981, 11, 101–108.
- Guibas, L.J.; Odlyzko, A.M. String overlaps, pattern matching, and nontransitive games. J. Comb. Theory Ser. A 1981, 30, 183–208.
- Blom, G.; Thorburn, D. How many random digits are required until given sequences are obtained? J. Appl. Probab. 1982, 19, 518–531.
- Antzoulakos, D.L. Waiting times for patterns in a sequence of multistate trials. J. Appl. Probab. 2001, 38, 508–518.
- Han, Q.; Hirano, K. Sooner and later waiting time problems for patterns in Markov dependent trials. J. Appl. Probab. 2003, 40, 73–86.
- Fu, J.C.; Chang, Y.M. On probability generating functions for waiting time distributions of compound patterns in a sequence of multistate trials. J. Appl. Probab. 2002, 39, 70–80.
- Glaz, J.; Kulldorff, M.; Pozdnyakov, V.; Steele, J.M. Gambling teams and waiting times for patterns in two-state Markov chains. J. Appl. Probab. 2006, 43, 127–140.
- Pozdnyakov, V. On occurrence of patterns in Markov chains: Method of gambling teams. Stat. Probab. Lett. 2008, 78, 2762–2767.
- Gava, R.J.; Salotti, D. Stopping probabilities for patterns in Markov chains. J. Appl. Probab. 2014, 51, 287–292.
- Zhao, M.-Z.; Xu, D.; Zhang, H.-Z. Waiting times and stopping probabilities for patterns in Markov chains. Appl. Math. J. Chin. Univ. 2018, 33, 25–34.
- Kerimov, A.; Öner, A. Oscillation properties of expected stopping times and stopping probabilities for patterns consisting of consecutive states in Markov chains. Rocky Mt. J. Math. Available online: https://projecteuclid.org/euclid.rmjm/1588298550 (accessed on 1 May 2020).
- Chang, Y.M. Distribution of waiting time until the rth occurrence of a compound pattern. Stat. Probab. Lett. 2005, 75, 29–38.
- Neuts, M.F. Matrix-Geometric Solutions in Stochastic Models: An Algorithmic Approach; Johns Hopkins University Press: Baltimore, MD, USA, 1981.
- Neuts, M.F. Structured Stochastic Matrices of M/G/1 Type and Their Applications; Marcel Dekker, Inc.: New York, NY, USA, 1989.
- Latouche, G.; Ramaswami, V. Introduction to Matrix Analytic Methods in Stochastic Modeling; ASA-SIAM Series on Statistics and Applied Probability; SIAM: Philadelphia, PA, USA, 1999.
- Abate, J.; Whitt, W. Numerical inversion of probability generating functions. Oper. Res. Lett. 1992, 12, 245–251.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).