1. Introduction
Outstanding achievements and world records in athletics events such as the 100 m sprint always make headlines and arouse widespread admiration. Similarly, considerable media attention and public concern are attached to record figures (often bad) relating to the economy, the weather or healthcare systems. Crucial social questions arise when we are faced with a steady flow of records, which are presented as ominous signs of dramatic underlying phenomena. It is therefore unsurprising that the term “record” has become such a constant in our modern everyday life and in a wide range of specialist domains. The probabilistic theory and statistical analysis of record breaking data can be helpful in assessing the seriousness of these issues.
The mathematical theory of records is well developed, especially for data generated by independent and identically distributed (i.i.d.) random variables (r.v.) with a continuous underlying distribution function. As is well known, in this setting one can only expect about log n record values among n observations, which means that records are rare. The reader interested in the theory of records can consult the monographs [1,2,3]. For statistical inference from record data, see [4].
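As a quick numerical illustration of this logarithmic growth, the following minimal Python sketch simulates i.i.d. continuous observations and compares the average record count with log n; the uniform parent, the sample sizes and the function names are illustrative choices of ours, not taken from the references.

```python
import math
import random

def count_records(xs):
    """Count records in xs: an observation is a record if it exceeds all previous ones."""
    count, current_max = 0, float("-inf")
    for x in xs:
        if x > current_max:
            count += 1
            current_max = x
    return count

random.seed(1)
for n in (10**2, 10**3, 10**4):
    # Average number of records over 200 i.i.d. uniform samples of size n.
    avg = sum(count_records([random.random() for _ in range(n)]) for _ in range(200)) / 200
    print(f"n = {n:>6}: mean records = {avg:.2f}, log n = {math.log(n):.2f}")
```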
Concepts of “quasi-records” emerged as natural extensions of records and have proven to be worthwhile from a mathematical as well as an applied perspective. The general idea of values close to records was translated into a variety of definitions, which were theoretically analysed and applied in widely different contexts. Near-records were introduced in [5] for applications in finance, and their properties were analysed in [6,7]. In addition, the related concept of the δ-record was introduced in [8] and later studied in [9,10,11,12]. These objects have practical applications in the case of negative δ, since δ-records are more numerous than records. Thus, by considering samples of δ-records for statistical inference, we address the problem of the scarcity of records while keeping the extremal nature of the data. Indeed, it has been shown in [9] and in references therein that inferences based on δ-records outperform those based on records only.
The main objects of interest in this work are near-records. An observation is a near-record if it is not a record but falls short of being a record by less than a units; that is, it lies within a distance a of the current record. While the number of records in an i.i.d. sequence of continuous r.v. grows with the logarithm of the number of observations, the number of near-records grows at a speed depending on the distribution of the observations. In fact, for heavy-tailed distributions there are fewer near-records than records (in extreme cases, only a finite number of near-records can be observed along the whole sequence), while for light-tailed distributions near-records outnumber records; see [13] for details.
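The contrast between heavy and light tails described above can be illustrated with a small simulation. The sketch below counts records and near-records for two discrete parent distributions of our own choosing (a discretised Pareto-type variable and a geometric variable); the near-record rule m − a < x ≤ m anticipates the reconstruction of Definition 1 given below and is used here only for illustration.

```python
import math
import random

def records_and_near_records(xs, a):
    """Count records and near-records: x is a near-record if it is not a record
    but m - a < x <= m, where m is the running maximum before observing x."""
    rec, near, m = 0, 0, float("-inf")
    for x in xs:
        if x > m:
            rec, m = rec + 1, x
        elif x > m - a:
            near += 1
    return rec, near

random.seed(0)
n, a = 200_000, 2
# Discretised Pareto-type observations (heavy tail) and geometric(p = 0.5) observations (lighter tail).
heavy = [int(1.0 / (1.0 - random.random())) for _ in range(n)]
light = [int(math.log(1.0 - random.random()) / math.log(0.5)) for _ in range(n)]
for name, xs in (("heavy-tailed", heavy), ("geometric", light)):
    print(name, records_and_near_records(xs, a))
```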
Another interesting aspect of near-records is related to their values. It is well known that the record values of an i.i.d. sequence behave as the so-called Shorrock process [14], which is a mixture of a non-homogeneous Poisson process and a Bernoulli process. In the particular case of non-negative integer-valued r.v., every integer k is a record value with probability equal to the hazard rate of the underlying distribution at k, and the events {k is a record value} are independent.
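A minimal Monte Carlo check of this Bernoulli structure, using a geometric parent distribution (our own choice), whose hazard rates are constant and equal to p, so that every value k should be a record value with probability approximately p:

```python
import math
import random

def record_values(xs):
    """Return the set of values appearing as record values in xs."""
    vals, m = set(), float("-inf")
    for x in xs:
        if x > m:
            vals.add(x)
            m = x
    return vals

random.seed(2)
p, reps, length = 0.4, 2000, 1500
hits = [0] * 9                      # track values k = 0, ..., 8
for _ in range(reps):
    xs = [int(math.log(1.0 - random.random()) / math.log(1.0 - p)) for _ in range(length)]
    rv = record_values(xs)
    for k in range(9):
        hits[k] += k in rv
for k in range(9):
    print(f"k = {k}: empirical {hits[k] / reps:.3f} vs hazard rate {p:.3f}")
```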
In this paper, we focus on the process of near-record values for i.i.d. sequences of r.v. taking non-negative integer values. The case of continuous r.v., analysed in [13], showed that near-record values follow a Poisson cluster point process, where records are the centres of the clusters and near-records are the points in each cluster. The main characteristics of this process, including its asymptotic behaviour, were derived from the properties of the Poisson cluster process, which has been thoroughly studied in the literature. In the discrete setting of the present paper, we prove that near-record values also behave as a cluster process, with centres following a Bernoulli process. We fully characterise the process by giving an expression for its probability generating functional. In particular, we find the exact distribution of the number of times that k is a near-record, which turns out to be a mixture of a point mass at 0 and a geometric distribution. We also characterise the distribution of the total number of near-records for heavy-tailed distributions. Moreover, we study the limiting behaviour of the number of near-records with values less than n, as n goes to infinity, by giving laws of large numbers and central limit theorems. Rather than relying on properties of cluster processes, as done in [13], here we use a more direct approach that consists of approximating the sequences under study by a sum of independent r.v. We give several examples of applications of our results to particular families, ranging from heavy-tailed distributions, with a finite number of near-records, to light-tailed ones, such as the Poisson distribution.
The paper is organised as follows: notation and first definitions are presented in Section 2. The process of near-record values is studied in Section 3, while in Section 4 we consider the eventual finiteness of the total number of near-records, followed by asymptotic results in Section 5. Finally, illustrative examples are shown in Section 6, and Appendix A is devoted to technical results.
2. Notation and Preliminary Definitions
The sets of real and positive real numbers are denoted by ℝ and ℝ+, respectively, and the sets of positive and non-negative integers by ℕ and ℤ+, respectively. Sequences in ℝ are indexed by ℕ and are written in lower-case letters, between parentheses, such as (x_n), etc. All r.v. are assumed to be defined on a common probability space. The indicator r.v. of an event B, taking the value 1 on B and 0 otherwise, is denoted by 1_B. The indicator function of a set A, equal to 1 on A and 0 on its complement, is denoted by 1_A.
When referring to a geometric r.v. or distribution throughout the paper, we take 0 as the starting value. The probability generating function (p.g.f.) of an r.v. X, taking values in ℤ+, is defined as E(s^X), for all s such that the series is absolutely convergent.
Sequences of r.v. are also indexed by ℕ and are written in upper-case letters, such as (X_n), etc. The convergence of a deterministic sequence to a limit L is denoted by x_n → L and is implicitly understood as n → ∞, unless otherwise stated. The same notation applies to random sequences, where the mode of convergence (almost sure or in distribution) is indicated over the arrow. The σ-algebra of Borel subsets of ℝ is denoted by B(ℝ).
Definition 1. Let (X_n) be a sequence of r.v. and let a be a positive parameter. Then, for n ≥ 1, X_n is a record if X_n > M_{n−1}, and X_n is a near-record if M_{n−1} − a < X_n ≤ M_{n−1}, where M_n = max{X_1, …, X_n}, with M_0 = −∞ by convention.
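The following sketch implements Definition 1 as reconstructed above (the convention M_0 = −∞ and the near-record condition M_{n−1} − a < X_n ≤ M_{n−1} reflect our reading of the definition); the test sequence is hypothetical and is not the one used in Example 1 below.

```python
def classify(xs, a):
    """Split a sequence into records and near-records (cf. Definition 1):
    x_n is a record if x_n > M_{n-1}; a near-record if M_{n-1} - a < x_n <= M_{n-1}."""
    records, near_records = [], []
    m = float("-inf")                     # M_0 = -infinity by convention
    for n, x in enumerate(xs, start=1):
        if x > m:
            records.append((n, x))
            m = x
        elif x > m - a:
            near_records.append((n, x))
    return records, near_records

# Hypothetical sequence, a = 2 (not the data of Example 1 below).
recs, nears = classify([2, 5, 4, 1, 5, 7, 6, 3, 8], a=2)
print("records:", recs)           # [(1, 2), (2, 5), (6, 7), (9, 8)]
print("near-records:", nears)     # [(3, 4), (5, 5), (7, 6)]
```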
From the above definitions, it is clear that a near-record is not a record, but it can take the value of the current record. Other random sequences of interest related to records are the record times (T_n), given by T_1 = 1 and T_{n+1} = min{k > T_n : X_k > X_{T_n}}, for n ≥ 1, and the record values (R_n), given by R_n = X_{T_n}, for n ≥ 1. Additionally, we consider the set of record values as a point process on ℤ+, described by the random counting measure ξ, with ξ(A) the number of record values in A. We also define I_n = ξ({n}), the indicator of the event that some record takes the value n.
Observe that record times are the jump times of the sequence of partial maxima (M_n) and that record values are the (strictly increasing) subsequence of partial maxima sampled at those jump times. However, without further probabilistic assumptions on (X_n), it may happen that M_n remains constant from some value of n on, which is equivalent to the existence of a final record. Furthermore, we have to ensure that the counting measure ξ is boundedly finite, in the sense of being finite on bounded sets.
Similarly, the sequence of near-record times is defined as the increasing sequence of indices at which a near-record occurs, and the near-record values are the observations at those times. We define the counting measure η of near-record values by letting η(A) be the number of near-record values falling in A, and we consider in particular the related r.v. η({k}) and η([0, k)), for k ≥ 0.
As for records, assumptions are needed in order to ensure that near-record times and values are well defined. Additionally, in order to characterise η as a cluster point process, we consider a classification of near-records in terms of their proximity to records.
Definition 2. (a) For n, m ≥ 1, the n-th near-record value is said to be associated to the m-th record value R_m if it is observed after T_m and before the next record time.
(b) For m ≥ 1, the point process η_m of near-record values associated to R_m is defined by the corresponding random counting measure.
We state here the probabilistic assumptions regarding (X_n), which hold throughout the paper. We assume that (X_n) is a sequence of i.i.d. r.v., taking non-negative integer values, with p_k = P(X_1 = k), for k ≥ 0. For convenience, we also define q_k = P(X_1 ≥ k), for k ≥ 0, and q_k = 1, for k < 0.
In addition, let r_k = p_k/q_k, k ≥ 0, be the hazard or failure rates. Note that q_k = ∏_{j=0}^{k−1} (1 − r_j), for k ≥ 1.
In order to ensure that no final record exists, and thus that all record times are well defined, we assume that X_1 has unbounded support, that is, q_k > 0 for every k. This, in particular, implies r_k < 1 for every k. In addition, to avoid unnecessary complications, we assume that the near-record parameter a is a positive integer.
Example 1 (Records and near-records). Let us consider a near-record parameter a and the sequence of 17 observations shown in Figure 1, from which the sequence of partial maxima and the record value sequence can be read directly. According to Definition 2, there are no near-records associated to the first record value, there is one near-record (with value 3) associated to the second, one near-record (with value 6) associated to the third, one near-record (with value 7) associated to the fourth, and two near-records (with values 6 and 7) associated to the fifth. Note also that, given the value of the partial maximum at observation 17 and the parameter a, there will be no near-records with value smaller than 10 after observation 17. Thus, η([0, 10)), the number of near-records with value in the interval [0, 10), is equal to 5.
Figure 1. Representation of the sequence given in Example 1. Red dots represent record observations, while blue dots are near-record observations with parameter a.
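A small illustration of Definition 2: the sketch below groups near-records by the record in force when they occur, using hypothetical data (the original sequence of Example 1 is not reproduced in the text, so the numbers here are our own).

```python
def clusters(xs, a):
    """Group near-records by the record they are associated to (cf. Definition 2):
    a near-record observed while the m-th record is the current maximum is associated to R_m."""
    assoc, m_idx, m = {}, 0, float("-inf")
    for x in xs:
        if x > m:
            m_idx += 1                    # a new record R_{m_idx} = x starts a new cluster
            m = x
            assoc[m_idx] = (x, [])
        elif x > m - a:
            assoc[m_idx][1].append(x)     # near-record associated to R_{m_idx}
    return assoc

# Hypothetical data with a = 2.
for m, (r, nr) in clusters([2, 5, 4, 1, 5, 7, 6, 3, 8, 7, 10], a=2).items():
    print(f"R_{m} = {r}: associated near-records {nr}")
```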
3. The Point Process of Near-Record Values
We recall that a point process N on ℤ+ can be seen as a random measure and has a probability generating functional (p.g.fl.) defined by G_N[h] = E[exp(∫ log h dN)], under appropriate conventions regarding the logarithm of 0, where h is a measurable function equal to 1 outside some bounded subset of ℤ+ (such functions are referred to as “suitable”). Alternative formulas for the p.g.fl., in the form of a product–integral or a product, are given in (6).
In this section, we show that the near-record process η is a discrete cluster process. Indeed, η can be seen as a superposition of a denumerable family of point processes η_m which, by Proposition 1 (c) below, are conditionally independent. Moreover, since near-record values are non-negative integers not exceeding the current record, the number of points of η in any bounded set A is bounded in terms of an upper bound of A, so the process is boundedly finite. We characterise η by means of its p.g.fl. and compute its first moments and other quantities of interest. To that end, we first present some useful results about records and near-records.
Lemma 1. (a) The point process ξ of record values has its atoms in ℤ+, and the r.v. I_k = ξ({k}), k ≥ 0, are independent Bernoulli r.v., with P(I_k = 1) = r_k.
(b) For any suitable function h, G_ξ[h] = ∏_{k ≥ 0} (1 − r_k + r_k h(k)).
Proof. For a proof of (a), see, for instance, Theorem 16.1 in [3]. To prove (b), from (a) and the second formula in (6), noting that h equals 1 outside a bounded set and using the convention 0^0 = 1, we obtain G_ξ[h] = E[∏_{k ≥ 0} h(k)^{I_k}] = ∏_{k ≥ 0} E[h(k)^{I_k}] = ∏_{k ≥ 0} (1 − r_k + r_k h(k)). □
Proposition 1.
- (a) Let N_m be the number of near-records associated to the m-th record value R_m, according to Definition 2. Then, conditionally on R_m, N_m is geometrically distributed.
- (b) Conditionally on R_m and N_m, the near-record values associated to R_m are i.i.d., taking values in (R_m − a, R_m]. Moreover, conditionally on R_m and N_m, the numbers of associated near-records taking each value in (R_m − a, R_m] are multinomially distributed.
- (c) The σ-algebras generated by the component processes η_m, m ≥ 1, are independent, conditionally on ξ.
Proof.
- (a) Note that the r.v. observed after the record time T_m are independent and identically distributed as X_1. Consider the subsequence of those observations exceeding R_m − a. Then, conditionally on R_m, this subsequence is also i.i.d., with the corresponding conditional distribution. Lastly, N_m is the number of its terms up to (but not including) the first one exceeding R_m. Hence, conditionally on R_m, N_m is geometrically distributed, as stated.
- (b) The near-record values associated to R_m are precisely the terms of this subsequence observed before the next record. So, conditionally on R_m and N_m, they are i.i.d., with probabilities proportional to the corresponding p_k. In addition, from the arguments above, it is clear that the counts of each value are (conditionally) multinomial.
- (c) Note that, since the X_n are i.i.d., the σ-algebras generated by the observations in disjoint blocks between consecutive record times are independent, conditionally on ξ. Then, since each η_m is measurable with respect to the corresponding σ-algebra, the result follows.
□
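A rough Monte Carlo check of part (a) of Proposition 1, for a geometric parent distribution of our own choosing. The success probability s = (1 − p)^a of the conditional geometric law is our own computation for this particular parent, under the reconstructed definition of near-records, and it does not depend on the record value, which is why clusters can be pooled; it is not a formula quoted from the text.

```python
import math
import random

def cluster_sizes(xs, a):
    """Sizes of the near-record clusters, one entry per record, in order of occurrence."""
    sizes, m = [], float("-inf")
    for x in xs:
        if x > m:
            sizes.append(0)
            m = x
        elif x > m - a:
            sizes[-1] += 1
    return sizes

random.seed(3)
p, a, reps = 0.5, 2, 3000
all_sizes = []
for _ in range(reps):
    xs = [int(math.log(1.0 - random.random()) / math.log(1.0 - p)) for _ in range(500)]
    # Drop the last cluster, which may be censored by the end of the sample.
    all_sizes.extend(cluster_sizes(xs, a)[:-1])
s = (1.0 - p) ** a   # success probability of the conditional geometric law (our computation)
for j in range(4):
    emp = sum(c == j for c in all_sizes) / len(all_sizes)
    print(f"P(cluster size = {j}): empirical {emp:.3f}, geometric {(1 - s) ** j * s:.3f}")
```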
We compute below the p.g.fl. of the point process η_m, which is obtained from (6) by taking the conditional expectation, as shown in (10). For h measurable and m ≥ 1, we introduce the auxiliary quantities defined in (11).
Proposition 2. For a suitable function h, the conditional p.g.fl. of η_m admits a closed form in terms of the quantities defined in (11).
Proof. Suppose R_m = k, for some k ≥ 0. From (10) and (b) of Proposition 1, we obtain the stated expression, where the third and fourth equalities follow from the expressions of the p.g.f. of the multinomial and geometric distributions, respectively. □
Definition 3 (Definition 6.3.I in [15]). A (boundedly finite) point process N is a cluster point process on ℤ+, with a given centre process on ℤ+ and a given family of component point processes, if, for every bounded A, N(A) equals the sum of the component contributions to A over the atoms of the centre process.
Definition 4. For each possible centre location, let the component point process be the one whose p.g.fl. is given by the conditional p.g.fl. of Proposition 2, where h is a suitable function and the auxiliary quantity is defined in (11).
Theorem 1.
- (a) The point process η of near-records is a cluster process on ℤ+, with centre process ξ and independent component processes.
- (b) For a suitable function h, the p.g.fl. of η is given by (14). In particular, taking h = 1 − (1 − s)1_A, with A bounded, we obtain the p.g.f. of η(A), given by (15).
- (c) For every bounded A, explicit formulas hold for the expectation and variance of η(A), as well as for the covariance of η(A) and η(B), for bounded sets A and B.
Proof. (a) By construction, η(A) is the sum, over the atoms of ξ, of the numbers of associated near-records falling in A. So, according to Definition 3, η is a cluster point process, as asserted. Independence of the component processes follows from (c) in Proposition 1, because each component is measurable with respect to the corresponding σ-algebra.
(b) For h a suitable function, consider the function obtained by composing h with the component p.g.fl., which is also a suitable function. From 6.3.6 in [15], the p.g.fl. of a cluster process is the p.g.fl. of the centre process applied to this function. Therefore, by Lemma 1 (b) and Proposition 2, we obtain (14). For (15), we replace h by 1 − (1 − s)1_A in (14), noting that this function equals 1 outside A.
(c1) Observe that η(A) is the sum of the component contributions, and recall that the number of points of each component falling in A is binomial, conditional on the corresponding record value and cluster size, while the cluster size is geometric, conditional on the record value, with the parameters given above. Taking conditional expectations and summing, we obtain the expectation of η(A), which is finite since the summands are non-zero only for a finite set of i values.
(c2) From the computations above, the conditional expectation of η(A) given ξ is clear, and hence so is the variance of the conditional expectation. We compute next the expectation of the conditional variance. Observe that, because of the conditional independence of the components, the conditional variance of η(A) is the sum of the conditional variances of the components. Collecting terms from the expressions above and using the variance decomposition formula, we obtain the stated expression.
(c3) The covariance of η(A) and η(B), when A and B are disjoint, follows immediately from the formula for the variance, noting that η(A ∪ B) = η(A) + η(B). □
Corollary 1. For N ≥ 0, the r.v. η({N}) (the number of near-records taking the value N) is distributed as a mixture of a point mass at 0 and a geometric distribution, with mixture weight and success probability given in (20). Moreover, expressions for its moments are given in (21).
Proof. After simple computations, we obtain the p.g.f. of η({N}), which yields the probability mass function (p.m.f.) in (20). The formulas in (21) follow from Theorem 1 (c), applied to the set {N}. □
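The mixture structure in Corollary 1 can be checked empirically. The sketch below, for a geometric parent of our own choosing, tabulates the distribution of the number of near-records taking a fixed value N; one should observe an inflated mass at 0 and a roughly constant ratio between consecutive probabilities from j = 1 on. The exact mixture weight and success probability in (20) are not reproduced here, so only this qualitative shape is checked.

```python
import math
import random

def count_value_near_records(xs, a, N):
    """Number of near-records in xs taking exactly the value N."""
    cnt, m = 0, float("-inf")
    for x in xs:
        if x > m:
            m = x
        elif x > m - a and x == N:
            cnt += 1
    return cnt

random.seed(4)
p, a, N, reps, length = 0.5, 2, 5, 3000, 1500
counts = [count_value_near_records(
    [int(math.log(1.0 - random.random()) / math.log(1.0 - p)) for _ in range(length)], a, N)
    for _ in range(reps)]
freq = [sum(c == j for c in counts) / reps for j in range(5)]
print("empirical P(count = j), j = 0..4:", [round(f, 3) for f in freq])
print("ratios f(j+1)/f(j), j >= 1:", [round(freq[j + 1] / freq[j], 2) for j in (1, 2, 3) if freq[j] > 0])
```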
Remark 1. Note that the covariance of η(A) and η(B) vanishes if A and B are at least a units apart. In fact, the r.v. η(A) and η(B) are independent in that case, due to the independence properties established above. In other words, the r.v. η({N}), N ≥ 0, are a-dependent.
4. Finiteness of the Number of Near-Records
Theorem 2. If the series of squared hazard rates converges, then the number of near-records in the whole sequence is finite a.s. Moreover, the total number of near-records has finite expectation, given in (23).
Proof. From Proposition 1, we obtain a bound for the conditional expectation of the number of near-records associated to each record. Taking expectations, we obtain a series whose final bound follows from the Cauchy–Schwarz inequality. Therefore, by the Borel–Cantelli lemma, only finitely many values are near-record values a.s., which yields the result.
In order to compute the p.g.f. of the total number of near-records, we observe that η([0, n)) increases to it, and so, by the monotone convergence theorem, the corresponding p.g.f. converges as well, for s ∈ [0, 1]. Furthermore, from (15), we obtain the product expression in (24). The interchange of the limit and product is justified by the monotone convergence theorem, after taking logarithms, since the sequence inside the product decreases with n. Finally, (23) is obtained, for example, from the derivative of the p.g.f. at s = 1 or as the limit of the expectations. Finiteness follows from the bound used in (25), which implies the convergence of the relevant series. Indeed, for sufficiently large i, the corresponding terms are suitably bounded, and the conclusion is obtained after arguing as in (25). □
5. Asymptotic Behaviour
We now focus on the asymptotic behaviour of η([0, n)). From Theorem 2, we know that if the series of squared hazard rates converges, then the total number of near-records is finite a.s. In this section, we obtain laws of large numbers and a central limit theorem for η([0, n)) under the assumption that this series diverges.
Lemma 2. The r.v. Y_i, i ≥ 0, defined in (27), are independent, with p.g.f. given in (28).
Proof. For simplicity, we prove pairwise independence, since the argument extends easily to the general case, although the details are somewhat laborious. We compute the joint p.g.f. of Y_i and Y_j, for i < j, by conditioning on the record indicators. From the resulting formula, we recover the marginal p.g.f. of each variable, as in (28). In addition, the joint p.g.f. factorises as the product of the marginals, which implies the independence of Y_i and Y_j, because the interval of convergence of the p.g.f. can be suitably extended. □
Remark 2. Note that Y_i in (27) is the number of near-records associated to i if i is a record value, and is equal to 0 otherwise. Indeed, (28) shows that Y_i is distributed as a mixture of a point mass at 0 and a geometric random variable, with weights and parameter depending on the hazard rates. Our interest in the variables Y_i arises from the inequalities in (29), which are easily verified. The strategy of the proof is to establish the desired asymptotic results for the sum of the Y_i, which are then transferred to η([0, n)). For that purpose, we assume some minimal conditions on the hazard rates, besides the divergence of the series of their squares.
The following proposition gathers some useful facts about the variables Y_i.
Proposition 3. (a) The expectation and variance of Y_i admit explicit expressions in terms of the hazard rates.
(b) If the series of squared hazard rates diverges, then the sum of the variances of the Y_i, for i < n, diverges as n → ∞.
(c) If either
- (i) the first regularity condition on the hazard rates, or
- (ii) the second regularity condition
holds, then
- (c1) the normalising sequences based on the Y_i and those based on η([0, n)) are asymptotically interchangeable, and
- (c2) the difference between η([0, n)) and the sum of the Y_i, for i < n, suitably normalised, converges to 0 in probability.
Proof. (a) The (factorial) moments of Y_i are computed by differentiating the p.g.f. in (28).
(b) For the variance, the conclusion follows from (a) and the divergence of the expectations.
(c) Suppose that (i) holds. Then the expectations and variances of the Y_i are bounded above and hence, from (b), the limits in (c1) hold. Suppose now that (ii) holds; then analogous bounds are obtained and the same argument applies, so claim (c1) follows from Lemma A1. Finally, convergence in (c2) is obtained from (c1) and Markov’s inequality, noting the bound provided by (a). □
Theorem 3. If either
- (i) the hazard rates satisfy the first regularity condition, or
- (ii) they satisfy the second regularity condition, for some positive constant,
then a strong law of large numbers holds for η([0, n)); that is, η([0, n)), divided by a suitable normalising sequence, converges almost surely.
Proof. By (29) and (c1) in Proposition 3, the result follows if we show the almost sure convergence in (32). By the strong law of large numbers for sequences of independent r.v., (32) follows if we prove the convergence of the series in (33). Suppose first that (i) holds. Then there exists a positive constant such that the terms of the series in (33) are bounded as in (34), for all large indices, where convergence of the right-hand side of (34) follows from the Abel–Dini Theorem A1. On the other hand, if condition (ii) holds, then the relevant quantities are easily bounded, and (33) is equivalent to a series condition which follows from Proposition A1, thus proving the stated result. □
Theorem 4. If either
- (i) the hazard rates satisfy the first regularity condition, or
- (ii) they satisfy the second regularity condition,
then η([0, n)), centred and scaled as in (35), converges in distribution to a normal law, where N(μ, σ²) stands for the normal distribution with expectation μ and variance σ². Proof. First, we prove asymptotic normality for the normalised sum of the Y_i
and then transfer the result to η([0, n)). To that end, we show that the Lyapunov condition (36) holds. Indeed, from an elementary inequality for third absolute moments, we obtain the bound in (37). If (i) holds, then, from (37), the third moments are bounded by a generic constant, so the sequence in (36) is bounded above by a quantity which tends to 0, because of Proposition 3 (b). If (ii) holds, then, for sufficiently large indices and generic constants, the Lyapunov condition in (36) holds if (38) does; from the Cauchy–Schwarz inequality, convergence to 0 in (38) is obtained from Lemma A2. From the Lyapunov central limit theorem, we conclude the asymptotic normality of the normalised sum of the Y_i, which, by (29) and Proposition 3 (c2), implies the corresponding result for η([0, n)). Now, taking expectations in (29) and using (a), (b) and (c1) in Proposition 3, the centring sequence can be replaced as claimed. To conclude the proof, we must show (41). From Theorem 1 (c), Proposition 3 (a) and the bounds in (42), (41) is a direct consequence of (c1) in Proposition 3. □
6. Examples
In this section, we present some examples of application of our results to particular distributions. For each distribution, we consider the r.v. η({N}) analysed in Corollary 1. In particular, we give formulas for its expectation and variance and for the correlation between counts at nearby values. We also study the asymptotic behaviour of the number of near-records.
The distributions that we consider in this section are very different in terms of their right tails. Example 2 is devoted to a heavy-tailed distribution, similar to the Zeta distribution (see Example 3.1 in [16]), which is the discrete counterpart of the Pareto distribution. Example 3 deals with the geometric distribution, which has an exponential-like tail, while Example 4 is about the Poisson distribution, which is light-tailed.
Example 2 (Heavy-tailed distribution). Consider a heavy-tailed distribution whose survival function decays polynomially. Then, from (21), we obtain explicit formulas for the moments of η({N}). Note that the correlation tends to 0 as N → ∞, for every value of a. The p.m.f. of η({N}) can be obtained from (20).
Regarding the asymptotic behaviour of the total number of near-records, we observe that the series of squared hazard rates converges and so, from Theorem 2, the total number of near-records is finite a.s. We now compute the main characteristics of this r.v. For the expectation, from (23), we obtain that the expected total number of near-records is equal to the near-record parameter a, which is an interesting fact in itself. For the variance, we use (19), which, after some algebra (see Appendix B), yields an explicit expression. In addition, the p.g.f. of the total number of near-records is easily computed from (24), as given in (44). The p.m.f. can then be obtained from (44); for instance, particular cases can be written in closed form.
Example 3 (Geometric distribution). The geometric distribution with parameter p has p_k = p(1 − p)^k and q_k = (1 − p)^k, for k ≥ 0, so that the hazard rates are constant, r_k = p. From (21), it is easy to compute the moments of η({N}), and we observe that none of these quantities depend on N. The p.m.f. of η({N}) is given by (20), with mixture weight and success probability not depending on N.
Since the hazard rates are constant, the hypotheses of Theorems 3 and 4 hold, and so we have a strong law of large numbers and a central limit theorem for η([0, n)). For Theorem 3, note that the expectations of the Y_i do not depend on i, and therefore the normalising sequence is linear in n. For the central limit theorem, as shown in the proof of Theorem 4, we can use the sum of the variances of the Y_i, i < n, in the denominator of (35); since, from Proposition 3 (a), these variances do not depend on i, the scaling sequence is proportional to √n.
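A rough Monte Carlo sketch of the central limit behaviour in Example 3, built on the i.i.d.-sum approximation described in Remark 2. The hazard rate r_i = p, the conditional success probability (1 − p)^a and the value of E[Y_i] used below are our own computations for the geometric parent, under the reconstructed definitions; the notation Y_i follows the reconstruction used above, and the parameters are illustrative.

```python
import math
import random

def sample_sum(n, p, a, rng):
    """One draw of S_n = sum over i < n of Y_i, where Y_i = 0 unless i is a record value
    (probability p for a geometric parent) and, in that case, Y_i is geometric with
    success probability s = (1 - p) ** a (our computation for this parent)."""
    s = (1.0 - p) ** a
    total = 0
    for _ in range(n):
        if rng.random() < p:                                   # i is a record value
            total += int(math.log(1.0 - rng.random()) / math.log(1.0 - s))
    return total

rng = random.Random(5)
n, p, a, reps = 400, 0.5, 2, 5000
mu = p * (1.0 - (1.0 - p) ** a) / (1.0 - p) ** a               # E[Y_i] under these assumptions
draws = [sample_sum(n, p, a, rng) for _ in range(reps)]
mean = sum(draws) / reps
var = sum((d - mean) ** 2 for d in draws) / reps
z = [(d - n * mu) / math.sqrt(var) for d in draws]
print("empirical mean / n:", round(mean / n, 3), " theoretical mu:", round(mu, 3))
# Crude normality check: about 95% of the standardised values should fall in (-1.96, 1.96).
print("fraction in (-1.96, 1.96):", round(sum(abs(v) < 1.96 for v in z) / reps, 3))
```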
Example 4 (Poisson distribution). The Poisson distribution has p_k = e^{−λ} λ^k / k!, for k ≥ 0. Although there is no manageable closed form for the survival function and the hazard rates, the bounds in (46), taken from [17], are useful.
Explicit expressions for the quantities in (21) can be written out, but they shed little light on their dependence on λ and N. Instead, we analyse their asymptotic behaviour for large N. By (46), the hazard rates converge to 1, and the asymptotic behaviour of the formulas in (21) follows immediately. Hence, as in Example 2, the correlation coefficient converges to 0 as N → ∞.
For the asymptotic behaviour of η([0, n)), note that (46) guarantees that the series of squared hazard rates diverges and, moreover, that the relevant quantities are bounded, for all large enough values of n, between explicit sequences multiplied by positive constants C and D. Hence, the condition in Theorem 3 holds, and so does the condition in Theorem 4. In order to apply Theorem 3, note that the moments of the Y_i can be controlled through (46), and the law of large numbers follows with the corresponding normalising sequence. For the central limit theorem, as in Example 3, the scaling sequence can be taken as the square root of the sum of the variances of the Y_i, i < n. In addition, from the proof of Theorem 4, the centring sequence can be chosen as the sum of the expectations of the Y_i, i < n, which in turn can be replaced, using (46), by a simpler explicit sequence, up to a bounded error. Therefore, we conclude a central limit theorem for η([0, n)) with explicit centring and scaling sequences, where the constants involved depend on λ and a.
Remark 3. In Examples 3 and 4 above, we observe that the normalising sequences in the law of large numbers and the central limit theorem depend on the right-tail behaviour of the parent distribution of the observations. This is also the case for the speed of convergence of the standardised counts to the normal distribution in the central limit theorem, as shown in Figure 2. Convergence is very fast for the geometric distribution, while it is much slower for the Poisson distribution (the standardised distribution in the geometric case is closer to the normal than in the Poisson case).
7. Conclusions and Future Work
In this paper, we have studied the point process of near-record values from a sequence of independent and identically distributed discrete random variables. Near-records arise as a natural complement of records, with applications in statistical inference.
We have shown that this process is a Bernoulli cluster process and obtained its probability generating functional, as well as formulas for the expectation, variance, covariance and probability generating functions for related counting processes.
We have given a condition for the finiteness of the total number of near-records along the whole sequence of observations. This condition is provided in terms of convergence of the squared hazard rates series. In addition, the explicit expression of its probability generating function is obtained.
In the case where the total number of near-records is not finite, strong convergence and central limit theorems for the number of near-record values in growing intervals are derived under mild regularity conditions. Finally, we have presented examples of the application of our results to particular families, which show that the asymptotics of near-record values depends critically on the right-tail behaviour of the parent distribution.
Some interesting questions remain open, such as a more detailed analysis of the sequence of near-record counts, including the law of the iterated logarithm and large deviations, or departures from the i.i.d. hypothesis (e.g., linear trend models). These will be addressed in future work.