Poissonization Principle for a Class of Additive Statistics

Igor Borisov; Maman Jetpisbaev

doi:10.3390/math10214084

Abstract

In this paper, we consider a class of additive functionals of a finite or countable collection of the group frequencies of an empirical point process that corresponds to, at most, a countable partition of the sample space. Under broad conditions, it is shown that the asymptotic behavior of the distributions of such functionals is similar to the behavior of the distributions of the same functionals of the accompanying Poisson point process. However, the Poisson versions of the additive functionals under consideration, unlike the original ones, have the structure of sums (finite or infinite) of independent random variables that allows us to reduce the asymptotic analysis of the distributions of additive functionals of an empirical point process to classical problems of the theory of summation of independent random variables.

Keywords:

empirical point process; Poisson point process; Poissonization; group frequency; additive functional

MSC:

60F05

1. Introduction

In this paper, we study a class of additive functionals (statistics) of a finite or countable collection of group frequencies constructed by a sample of size n with a finite or countable partition of the sample space. Under broad conditions, it is shown that, as

n \to \infty

, the asymptotic behavior of distributions of the additive functionals under consideration is completely similar to the behavior of distributions of the same functionals of the accompanying Poisson point process. From here it is easy to establish that the above-mentioned weak convergence is equivalent to that for the same additive functionals but with independent group frequencies, which are constructed, respectively, using a finite or countable collection of independent copies of the original sample, when we fix in the i-th partition element only the points from the i-th independent copy of the original sample. In other words, in the case under consideration, we remove the dependence of the initial group frequencies with a multinomial distribution. This phenomenon makes it possible to directly use the diverse tools of the summation theory of independent random variables to study the limiting behavior of the additive statistics being considered.

The structure of this paper is as follows. In Section 2, we introduce the empirical and accompanying Poisson vector point processes and formulate some important results regarding their connection. In Section 3, we introduce a class of additive statistics and give a number of examples. Section 4 contains the main result of the paper, i.e., a duality theorem, which states that an original additive statistic with some normalizing and centering constants weakly converges to a limit if, and only if, its Poisson version with the same normalizing and centering constants weakly converges to the same limit. In Section 5, we discuss some applications of the duality theorem. In Section 6, we present moment inequalities connecting the original additive statistics and their Poisson versions. Section 7 is devoted to asymptotic analysis of the first two moments of additive statistics connected with an infinite multinomial urn model. Section 8 contains proofs of all results of the paper. Finally, in Section 9, we summarize the results and discuss some of their extensions.

2. Empirical and Poisson Point Processes

Let

{X_{i}^{(k)}, i \geq 1}

,

k = \bar{1, m}

be a finite set of independent copies of a sequence of independent identically distributed random variables with values in an arbitrary measurable space

(X, A)

and distribution P. For any natural

n_{1}, \dots, n_{m}

, consider m independent empirical point processes based on respective samples

X_{1}^{(k)}, \dots, X_{n_{k}}^{(k)}

,

k = \bar{1, m}

:

V_{n_{k}}^{(k)} (A) : = \sum_{i = 1}^{n_{k}} I_{A} (X_{i}^{(k)}), k = \bar{1, m}, A \in A .

Define the m independent accompanying Poisson point processes as

Π_{n_{k}}^{(k)} (A) : = \sum_{i = 1}^{π_{k} (n_{k})} I_{A} (X_{i}^{(k)}), k = \bar{1, m}, A \in A,

where

π_{k} (t)

,

k = \bar{1, m}

, are independent standard Poisson processes on the positive half-line, which do not depend on all sequences

{X_{i}^{(k)}; i \geq 1}

,

k = \bar{1, m}

. In other words,

Π_{n_{k}} (A) = V_{π_{k} (n_{k})} (A)

for all

k = \bar{1, m}

. We consider the point processes

V_{n_{k}} (\cdot)

and

Π_{n_{k}} (\cdot)

as stochastic processes with trajectories from the measurable space

(B^{A}, C)

of all bounded functions indexed by the elements of the set

A

, with the

σ

-algebra

C

of all cylindrical subsets of the space

B^{A}

. The distributions of stochastic processes

V_{n_{k}} (\cdot)

and

Π_{n_{k}} (\cdot)

on

C

are defined in a standard way.

Now, we introduce the vector-valued empirical and accompanying Poisson point processes

{\bar{V}}_{\bar{n}} (A) : = (V_{n_{1}}^{(1)} (A), \dots, V_{n_{m}}^{(m)} (A)) \equiv {\bar{V}}_{\bar{n}},

{\bar{Π}}_{\bar{n}} (A) : = (Π_{n_{1}}^{(1)} (A), \dots, Π_{n_{m}}^{(m)} (A)) \equiv {\bar{Π}}_{\bar{n}},

where

\bar{n} = (n_{1}, n_{2}, \dots, n_{m})

. The vector-valued point processes

{\bar{V}}_{\bar{n}}

and

{\bar{Π}}_{\bar{n}}

are considered as random elements with values in the measurable space

({(B^{A})}^{m}, C^{m})

.

Let

A_{0} \in A

with

p : = P (A_{0}) \in (0, 1)

. Consider the restrictions of the vector point processes

{\bar{V}}_{\bar{n}}

and

{\bar{Π}}_{\bar{n}}

to the set

\mathcal{A}_{0} : = {A \in \mathcal{A} : A \subseteq A_{0}} .

(1)

These so-called

A_{0}

-restrictions are denoted by

{\bar{V}}_{\bar{n}}^{0}

and

{\bar{Π}}_{\bar{n}}^{0}

, respectively. For the distributions

L ({\bar{V}}_{\bar{n}}^{0})

and

L ({\bar{Π}}_{\bar{n}}^{0})

in the measurable space

({(B^{A})}^{m}, C^{m})

, there are the following three assertions (some particular versions of these assertions have been proved in [1,2]).

Theorem 1.

The following inequality is valid:

L ({\bar{V}}_{\bar{n}}^{0}) \leq \frac{1}{{(1 - p)}^{m}} L ({\bar{Π}}_{\bar{n}}^{0}) .

(2)

Corollary 1.

For any non-negative measurable functional F defined on

({(B^{A})}^{m}, C^{m})

,

E F ({\bar{V}}_{\bar{n}}^{0}) \leq \frac{1}{{(1 - p)}^{m}} E F ({\bar{Π}}_{\bar{n}}^{0});

(3)

here the expectation on the right-hand side of (3) may be infinite.

The following result plays an essential role in proving the main result of the paper—a duality limit theorem for the distributions

L ({\bar{V}}_{\bar{n}})

and

L ({\bar{Π}}_{\bar{n}})

(see Theorem 3 below).

Theorem 2.

For each multi-index

\bar{n}

, one can define some vector point processes

{\bar{V}}_{\bar{n}}^{0 *}

and

{\bar{Π}}_{\bar{n}}^{0 *}

on a common probability space so that they coincide in distribution with the point processes

{\bar{V}}_{\bar{n}}^{0}

and

{\bar{Π}}_{\bar{n}}^{0}

, respectively, and

sup_{A_{c} \subseteq A_{0}} P (sup_{A \in A_{c}} ∥{\bar{V}}_{\bar{n}}^{0 *} (A) - {\bar{Π}}_{\bar{n}}^{0 *} (A)∥ \neq 0) \leq 1 - {(1 - p)}^{m} < m p,

(4)

where

∥ (z_{1}, \dots, z_{m}) ∥ : = {max}_{k \leq m} | z_{k} |

, and the outer supremum is taken over all at most countable families

A_{c}

of sets from

A_{0}

.

Remark 1.

In Theorem 2, the sup-seminorm

sup_{A \in A_{c}} ∥ \cdot ∥

is obviously measurable with respect to the cylindrical σ-algebra

C^{m}

. If instead of

A_{c}

we substitute the entire class

A_{0}

(possibly uncountable), then this measurability may fail (unless, of course, the point processes under consideration have the separability property). Nevertheless, the assertion of Theorem 2 remains valid in this case if the probability

P

is replaced by the outer probability

P^{*} (N_{o}) : = {inf}_{N \in C^{m} : N \supseteq N_{o}} P (N)

. However, the outer probability has only the property of semiadditivity, which makes it difficult to use.

Let measurable sets

Δ_{1}, Δ_{2}, \dots

form a finite or countable partition of the sample space under the condition

p_{i} : = P (Δ_{i}) > 0

for all i. Without loss of generality, we can assume that the sequence

{p_{i}}

is monotonically nonincreasing. Denote by

ν_{n_{k} 1}^{(k)}, ν_{n_{k} 2}^{(k)}, \dots

,

k = \bar{1, m}

, the corresponding group frequencies defined by the sample

X_{1}^{(k)}, \dots, X_{n_{k}}^{(k)}

. Put

{\bar{ν}}_{i \bar{n}} : = {\bar{V}}_{\bar{n}} (Δ_{i}) = (ν_{n_{1} i}^{(1)}, \dots, ν_{n_{m} i}^{(m)}), i = 1, 2, \dots .

Let us agree that everywhere below the limit relation

\bar{n} \to \infty

will be understood as

n_{k} \to \infty

for all

k = \bar{1, m}

.
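The objects of this section can be simulated directly in the simplest case m = 1 with a finite partition. The following is a minimal sketch, not part of the original paper: the function names, the Poisson sampler based on unit-rate exponential arrivals, and the three-cell probability vector are illustrative assumptions. It draws the group frequencies of an empirical point process and of its accompanying Poisson point process, for which the sample size n is replaced by an independent Poisson number with mean n.

```python
import random

def sample_poisson(lam, rng):
    """Poisson(lam) variate: number of unit-rate exponential arrivals in [0, lam]."""
    count, t = 0, rng.expovariate(1.0)
    while t <= lam:
        count += 1
        t += rng.expovariate(1.0)
    return count

def group_frequencies(n, probs, rng):
    """Cell counts of n i.i.d. observations falling into cells with probabilities `probs`."""
    counts = [0] * len(probs)
    for cell in rng.choices(range(len(probs)), weights=probs, k=n):
        counts[cell] += 1
    return counts

def poissonized_frequencies(n, probs, rng):
    """Cell counts of the accompanying Poisson process: the sample size is Poisson(n)."""
    return group_frequencies(sample_poisson(n, rng), probs, rng)

rng = random.Random(0)            # fixed seed only for reproducibility
probs = [0.5, 0.3, 0.2]           # hypothetical cell probabilities p_1 >= p_2 >= p_3
nu = group_frequencies(1000, probs, rng)        # dependent (multinomial) counts
pi = poissonized_frequencies(1000, probs, rng)  # independent Poisson counts
print(nu, pi)
```

The empirical counts always sum to n, which is the source of their dependence, whereas the Poissonized counts have a random total.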

3. Additive Statistics: Examples

In the paper, we consider a class of additive statistics of the form

Φ_{f} ({\bar{V}}_{\bar{n}}) : = \sum_{i \geq 1} f_{i \bar{n}} ({\bar{ν}}_{i \bar{n}}),

(5)

where

f \equiv {f_{i \bar{n}}}

is an array of arbitrary finite functions defined on

Z_{+}^{m}

under the condition

\sum_{i \geq 1} | f_{i \bar{n}} (0, \dots, 0) | < \infty \forall \bar{n},

(6)

which ensures the correct definition of the functional

Φ_{f} ({\bar{V}}_{\bar{n}})

in the case of a countable partition of the sample space, since the sum under consideration contains only a finite set of nonzero random vectors

{\bar{ν}}_{i \bar{n}}

. In the case of a finite partition and

m = 1

, additive functionals of the form (5) were considered in [3,4,5].

We now give some examples of such statistics.

(1) Consider a finite partition

{Δ_{i}; i = 1, \dots, N}

of the sample space. Put

f_{i \bar{n}} (\bar{x})

: = \frac{| \bar{x} - \bar{n} p_{i} |^{2}}{| \bar{n} p_{i} |}

,

i = 1, \dots, N

, where

| \cdot |

is the standard Euclidean norm in

R^{m}

. Then the functional

Φ_{χ^{2}} ({\bar{V}}_{\bar{n}}) : = \sum_{i = 1}^{N} \frac{| {\bar{ν}}_{i \bar{n}} - \bar{n} p_{i} |^{2}}{| \bar{n} p_{i} |}

(7)

is an m-variate version of a well-known

χ^{2}

-statistic. Note that, in the present paper, we are primarily interested in the case where

N \equiv N (\bar{n}) \to \infty

as

\bar{n} \to \infty

.
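As an illustration of formula (7), the statistic can be evaluated as follows. This is a sketch under the assumption that the group frequencies are given as a list of m-tuples, one per cell; the function name and argument layout are hypothetical.

```python
import math

def chi_square_statistic(freqs, sizes, probs):
    """m-variate chi-square statistic (7).

    freqs[i] is the m-vector of counts in cell i, sizes is (n_1, ..., n_m),
    and probs[i] is the cell probability p_i.
    """
    norm_sizes = math.sqrt(sum(n * n for n in sizes))  # Euclidean norm of the vector n
    total = 0.0
    for nu, p in zip(freqs, probs):
        sq_dist = sum((x - n * p) ** 2 for x, n in zip(nu, sizes))
        total += sq_dist / (norm_sizes * p)
    return total
```

For m = 1 this reduces to the classical statistic, the sum over cells of (ν_i - n p_i)^2 / (n p_i).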

(2) Let now the sizes of all m samples be equal:

n_{j} = n

,

j = 1, \dots, m

. In an equivalent reformulation of the original problem, we consider a sample of m-dimensional observations

{(X_{i}^{1}, \dots, X_{i}^{m}); i \leq n}

under the main hypothesis that the sample vector coordinates are independent and have the same N-atomic distribution with unknown masses

p_{1}, \dots, p_{N}

. In this case, the log-likelihood function can be represented as the additive functional

Φ_{log} ({\bar{V}}_{\bar{n}}) : = \sum_{i = 1}^{N} ({\bar{ν}}_{i \bar{n}}, \bar{1}) log p_{i},

where

\bar{1}

is the unit vector in

R^{m}

and

(\cdot, \cdot)

is the Euclidean inner product.

(3) Consider a finite or countable partition

{Δ_{i}; i \geq 1}

. Let

f_{i \bar{n}} (\bar{x}) \equiv f (\bar{x}) : = I_{B} (\bar{x})

be the indicator function of some subset

B \subset Z_{+}^{m}

. Then the functional

Φ_{I_{B}} ({\bar{V}}_{\bar{n}}) : = \sum_{i \geq 1} I_{B} ({\bar{ν}}_{i \bar{n}})

(8)

counts the number of partition elements (cells) containing any number of vector sample observations from the range B in a multinomial scheme (finite or infinite) of placing particles into cells (see [6,7,8,9,10,11,12]). Note that in the case of an infinite multinomial scheme in (8), it is additionally assumed that

0 \notin B

.

In the case

m = 2

and

B = {(x, y) \in Z_{+}^{2} : x = 0, y > 0}

, the two-sample statistic (8) counts the number of nonempty cells after the second (“additional”) series of trials (the “future” sample) that were empty in the first series (the “original” sample). Statistics of this kind play an important role in the theory of species sampling (for example, see [13,14]). In this case, the functional (8) is called the number of unseen species in the original sample.
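The counting functional (8) and the unseen-species set B for m = 2 can be sketched as follows. The helper names and the toy frequency list are illustrative assumptions.

```python
def count_cells(freqs, in_B):
    """Functional (8): number of cells whose frequency vector belongs to B."""
    return sum(1 for nu in freqs if in_B(nu))

def unseen_then_seen(nu):
    """Indicator of B = {(x, y): x = 0, y > 0}: empty in the original sample, hit later."""
    x, y = nu
    return x == 0 and y > 0

# Hypothetical frequencies (original sample, future sample) for five cells.
freqs = [(0, 2), (1, 0), (0, 0), (3, 1), (0, 1)]
unseen = count_cells(freqs, unseen_then_seen)
```

Here the zero vector does not belong to B, in line with the requirement imposed for infinite multinomial schemes.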

(4) In the case

m = 1

, consider the joint distribution (see [10]) of the random variables

Φ_{I_{B}} (V_{n_{1}}), Φ_{I_{B}} (V_{n_{1} + n_{2}}), \dots, Φ_{I_{B}} (V_{n_{1} + \dots + n_{m}})

defined in (8) by the sample

(X_{1}, \dots, X_{N})

, with

N = n_{1} + \dots + n_{m}

. It is clear that studying the asymptotic behavior of the joint distribution of these random variables (for example, proving the multidimensional central limit theorem) can be reduced to the study of the limit distributions of the linear combinations of the form

a_{1} Φ_{I_{B}} (V_{n_{1}}) + a_{2} Φ_{I_{B}} (V_{n_{1} + n_{2}}) + \dots + a_{m} Φ_{I_{B}} (V_{n_{1} + \dots + n_{m}})

for almost all vectors

(a_{1}, \dots, a_{m})

with respect to the Lebesgue measure on

R^{m}

. It is easy to see that, for any natural

j \leq m

,

V_{n_{1} + \dots + n_{j}} = V_{n_{1}}^{(1)} + \dots + V_{n_{j}}^{(j)},

where the empirical point processes

V_{n_{1}}^{(1)}, \dots, V_{n_{j}}^{(j)}

are defined by the above-mentioned independent subsamples. So, in this case, we deal with a functional of the form (5) defined by m independent empirical point processes corresponding to the m independent subsamples

(X_{1}, \dots, X_{n_{1}})

,

(X_{n_{1} + 1}, \dots, X_{n_{1} + n_{2}})

,…,

(X_{N - n_{m} + 1}, \dots, X_{N})

, and with the array of functions

f_{i \bar{n}} (\bar{x}) \equiv f (x_{1}, \dots, x_{m}) : = a_{1} I_{B} (x_{1}) + a_{2} I_{B} (x_{1} + x_{2}) + \dots + a_{m} I_{B} (x_{1} + \dots + x_{m}) .

(9)
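The kernel (9) acts on the partial sums of the coordinates of a cell's frequency vector. A minimal sketch, with hypothetical function names and with B passed as a predicate on nonnegative integers:

```python
from itertools import accumulate

def kernel_9(x, a, in_B):
    """Kernel (9): a_1 I_B(x_1) + a_2 I_B(x_1 + x_2) + ... + a_m I_B(x_1 + ... + x_m)."""
    return sum(a_j for a_j, partial in zip(a, accumulate(x)) if in_B(partial))

def additive_statistic(freqs, a, in_B):
    """Functional (5) with the kernel (9), summed over the cells of a finite partition."""
    return sum(kernel_9(x, a, in_B) for x in freqs)
```

For example, with B = {1} and m = 2, a cell with frequency vector (1, 0) contributes a_1 + a_2, since both partial sums equal 1.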

(5) Consider the stochastic process

{Φ_{I_{B}} ({\bar{V}}_{\bar{n}}); B \subset Z_{+}^{m}}

indexed by all subsets of

Z_{+}^{m}

. As was noted above, studying the asymptotic behavior of the joint distributions of this process can be reduced to studying the asymptotic behavior of the distributions of any linear combinations of corresponding one-dimensional projections of this process, i.e., to studying the asymptotic behavior of the distributions of functionals of the form (5) for

m = 1

and the array of functions

f_{i \bar{n}} (x) \equiv f (x) : = a_{1} I_{B_{1}} (x) + a_{2} I_{B_{2}} (x) + \dots + a_{r} I_{B_{r}} (x)

(10)

for almost all vectors

(a_{1}, \dots, a_{r})

. For one-point sets, the asymptotic analysis of the above-mentioned joint distributions can be found, for example, in [7,8,9,10,11,12].

(6) Consider the case

m = 1

and the functional

Φ_{f} (V_{n}) : = \sum_{i \geq 1} n p_{i} I_{B} (ν_{i n}),

(11)

which counts the sampling ratio of the cells containing any number of particles from the range B. For the one-point set

B = {0}

, such a functional was considered in [9]. In general, if instead of

n p_{i}

in (11) we consider arbitrary weights

g (n, i) > 0

(under condition (6)) with one or another interpretation, the functional

Φ_{f} (V_{n})

in this case will be interpreted as the total weight of the corresponding cells.

4. Poissonization: Duality Theorem

In this section, we present the main result of the paper, a duality theorem for the additive statistics under consideration. First of all, we explain the term “Poissonization”. It means that, when studying the limit behavior of the original additive statistics, we reduce the problem to studying the following “Poissonian version” of the functional (5) under condition (6):

Φ_{f} ({\bar{Π}}_{\bar{n}}) : = \sum_{i \geq 1} f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}),

(12)

where

{\bar{π}}_{i \bar{n}} = (π_{n_{1} i}^{(1)}, \dots, π_{n_{m} i}^{(m)})

,

π_{n_{k} i}^{(k)} : = Π_{n_{k}}^{(k)} (Δ_{i})

,

i \geq 1

, is a sequence of independent Poisson random variables with respective parameters

n_{k} p_{i}

. It is clear that the functional (12) is well defined with probability 1 since only a finite number of the vectors

{{\bar{π}}_{i \bar{n}}}

differ from the zero vector. Independence of the summands is a crucial difference of the Poisson version of an additive functional from the original one. Some elements of Poissonization for additive functionals of the form (8) and (10) are contained, for example, in [9,12]. In [9], the author used the well-known representation of an empirical point process as the conditional Poisson point process under the condition that the number of atoms of the accompanying Poisson point process equals n. Moreover, in [9], the simple known representation

π (n) = n + O_{p} (\sqrt{n})

was employed, where

O_{p} (\sqrt{n})

denotes a random variable such that

O_{p} (\sqrt{n}) / \sqrt{n}

is bounded in probability as

n \to \infty

. In [12], proving the multivariate central limit theorem for the above-mentioned joint distributions (in fact, for functionals of the form (10) in the case of one-point subsets

{B_{i}}

), the authors applied a reduction to the joint distributions of the Poissonian versions of additive functionals using known upper bounds for a multivariate Poisson approximation to a multinomial distribution (see also [15]). The main goal of the paper is to establish a duality theorem, which demonstrates absolute identity of the asymptotic behavior of the distributions of the additive functionals under consideration and their Poissonian versions.

First, we formulate a crucial auxiliary assertion in proving the main result.

Lemma 1.

Let

{Δ_{\bar{n}}}

be an arbitrary scalar array satisfying the condition

f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}) Δ_{\bar{n}} \overset{p}{\to} 0

as

\bar{n} \to \infty

for every fixed i. Then, for each multiindex

\bar{n}

, one can define on a common probability space a pair of point processes

{\bar{V}}_{\bar{n}, Δ_{\bar{n}}}^{*}

and

{\bar{Π}}_{\bar{n}, Δ_{\bar{n}}}^{*}

such that

L ({\bar{V}}_{\bar{n}, Δ_{\bar{n}}}^{*}) = L ({\bar{V}}_{\bar{n}})

,

L ({\bar{Π}}_{\bar{n}, Δ_{\bar{n}}}^{*}) = L ({\bar{Π}}_{\bar{n}})

, and for any

ε > 0

,

P (| Δ_{\bar{n}} | |Φ_{f} ({\bar{V}}_{\bar{n}, Δ_{\bar{n}}}^{*}) - Φ_{f} ({\bar{Π}}_{\bar{n}, Δ_{\bar{n}}}^{*})| > ε) \to 0 \quad \text{as} \quad \bar{n} \to \infty .

(13)

Remark 2.

Lemma 1 only asserts that the marginal distributions (that is, for each

\bar{n}

separately) of the arrays

{{\bar{V}}_{\bar{n}, Δ_{\bar{n}}}^{*}, \bar{n} \in Z_{+}^{m}}

and

{{\bar{V}}_{\bar{n}}, \bar{n} \in Z_{+}^{m}}

, and also

{{\bar{Π}}_{\bar{n}, Δ_{\bar{n}}}^{*}, \bar{n} \in Z_{+}^{m}}

and

{{\bar{Π}}_{\bar{n}}, \bar{n} \in Z_{+}^{m}}

, coincide. Note that the probability in (13) is precisely determined by the marginal distributions of the mentioned random arrays, i.e., formally, it also depends on

\bar{n}

. Without loss of generality, we can assume that pairs of point processes

({\bar{V}}_{\bar{n}, Δ_{\bar{n}}}^{*}

,

{\bar{Π}}_{\bar{n}, Δ_{\bar{n}}}^{*})

are independent in

\bar{n}

, and on this extended probability space, the universal probability measure

P

in (13) is given in the standard way, which no longer depends on

\bar{n}

. In this case it is correct to speak about the convergence to zero in probability of the sequence of random variables in (13).

Lemma 1 gives the key to the proof of the following duality theorem, a criterion for the weak convergence of distributions of functionals of the point processes under consideration. The essence of this result is that the asymptotic behavior of the distributions of additive functionals of the point processes

{\bar{V}}_{\bar{n}}

and

{\bar{Π}}_{\bar{n}}

is exactly the same. In addition, one can also indicate a third class of additive functionals (under condition (6)) that has the same property:

Φ_{f}^{*} : = \sum_{i \geq 1} f_{i \bar{n}} ({\bar{ν}}_{i \bar{n}}^{*}),

where

{{\bar{ν}}_{i \bar{n}}^{*}, i \geq 1}

is a sequence of independent random vectors such that

L ({\bar{ν}}_{i \bar{n}}^{*}) = L ({\bar{ν}}_{i \bar{n}})

for all i. The functional

Φ_{f}^{*}

is well defined due to the Borel–Cantelli lemma and the simple estimate

P ({\bar{ν}}_{i \bar{n}}^{*} \neq 0) = P ({\bar{ν}}_{i \bar{n}} \neq 0) \leq m ∥ \bar{n} ∥ p_{i}

.

Let us agree that the symbol “⟹” in what follows will denote the weak convergence of distributions. The main result of the paper is as follows.

Theorem 3.

Under the conditions of Lemma 1, the following three limit relations are equivalent as

\bar{n} \to \infty

:

(1) L (Φ_{f} ({\bar{V}}_{\bar{n}}) Δ_{\bar{n}} - M_{\bar{n}}) ⟹ L (γ),

(2) L (Φ_{f} ({\bar{Π}}_{\bar{n}}) Δ_{\bar{n}} - M_{\bar{n}}) ⟹ L (γ),

(3) L (Φ_{f}^{*} Δ_{\bar{n}} - M_{\bar{n}}) ⟹ L (γ),

where

M_{\bar{n}}

and

Δ_{\bar{n}}

are some scalar arrays and γ is some random variable.

5. Applications

Theorem 3 allows us to reduce the asymptotic analysis of the distributions of the additive functionals under consideration to a similar analysis of their Poissonian versions, i.e., to the asymptotic analysis of distributions of sums (finite or infinite) of independent random variables, or to reduce the problem to studying the limit behavior of the distributions

L (Φ_{f} ({\bar{V}}_{\bar{n}}))

, absolutely ignoring the dependence of the random variables

{{\bar{ν}}_{i \bar{n}}, i \geq 1}

. Note also that, under some rather broad assumptions, the law

L (γ)

will be infinitely divisible. A detailed analysis of such conditions and corresponding examples will be considered in a separate paper. Here we present only a few of these corollaries, focusing our attention on the equivalence of the first two relations of Theorem 3.

First of all, we note one useful property of the expectations of the functionals under consideration as functions of

\bar{n}

.

Lemma 2.

Let

{max}_{\bar{n}} {sup}_{\bar{x}} | f_{i \bar{n}} (\bar{x}) | \leq C_{i}

,

\sum_{i \geq 1} C_{i} p_{i} < \infty

, and

\sum_{i \geq 1} E | f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}) | < \infty \forall \bar{n} .

(14)

Then the relations

lim_{\bar{n} \to \infty} | E Φ_{f} ({\bar{V}}_{\bar{n}}) | = \infty

and

lim_{\bar{n} \to \infty} | E Φ_{f} ({\bar{Π}}_{\bar{n}}) | = \infty

are equivalent. In the case of infinite limits,

E Φ_{f} ({\bar{V}}_{\bar{n}}) \sim E Φ_{f} ({\bar{Π}}_{\bar{n}}) \quad \text{as} \quad \bar{n} \to \infty .
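The equivalence of expectations in Lemma 2 can be checked numerically for the number of occupied cells, i.e., for functional (8) with m = 1 and B = {j : j > 0}. The sketch below is only an illustration under stated assumptions: the infinite partition is replaced by a finite one with normalized power-law probabilities, and all function names are hypothetical.

```python
import math

def power_law_probs(num_cells, b):
    """Probabilities p_i proportional to i^(-1-b), normalized over a finite partition."""
    weights = [i ** (-1.0 - b) for i in range(1, num_cells + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def mean_occupied_multinomial(n, probs):
    """E of the number of nonempty cells for the empirical process: sum of 1 - (1 - p_i)^n."""
    return sum(1.0 - (1.0 - p) ** n for p in probs)

def mean_occupied_poisson(n, probs):
    """E of the number of nonempty cells for the Poisson version: sum of 1 - exp(-n p_i)."""
    return sum(1.0 - math.exp(-n * p) for p in probs)

probs = power_law_probs(2000, 1.0)
for n in (100, 10_000):
    ratio = mean_occupied_multinomial(n, probs) / mean_occupied_poisson(n, probs)
    print(n, ratio)
```

Since (1 - p)^n does not exceed exp(-n p), the Poisson expectation never exceeds the multinomial one in this example.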

Remark 3.

For functionals of the form (8) in an infinite multinomial scheme, the conditions of Lemma 2 are typical. Let

m = 1

and

B : = {j : j > k}

for any

k \geq 0

. Then

lim_{n \to \infty} E Φ_{f} (V_{n}) = lim_{n \to \infty} \sum_{i \geq 1} P (ν_{i n} > k) = \infty

since, by virtue of the law of large numbers,

lim_{n \to \infty} P (ν_{i n} > k) = 1

for every fixed i. Moreover, in the case under consideration, obviously,

E Φ_{f} (V_{n}) \leq n

. Similarly, without any restrictions on the probabilities

{p_{i}}

, the infinite limits in Lemma 2 for functionals of the form (8) (and even more so for (11)) also hold for the set B consisting of all odd natural numbers. Here the limit relation

lim_{n \to \infty} E Φ_{f} ({\bar{Π}}_{\bar{n}}) \equiv lim_{n \to \infty} \sum_{i \geq 1} P (π_{i n} \in B) = \infty

follows immediately from the equality

P (π_{i n} \in B) = \frac{1}{2} (1 - e^{- 2 n p_{i}})

.
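The closed form for the probability that a Poisson random variable is odd can be verified by summing the probability mass function directly. In this sketch the function names and the truncation level of the series are assumptions made for illustration.

```python
import math

def poisson_pmf(j, lam):
    """P(pi = j) for a Poisson variable with mean lam > 0, computed on the log scale."""
    return math.exp(-lam + j * math.log(lam) - math.lgamma(j + 1))

def prob_odd(lam, terms=200):
    """P(pi is odd), approximated by the truncated sum over j = 1, 3, 5, ..."""
    return sum(poisson_pmf(j, lam) for j in range(1, terms, 2))

def prob_odd_closed_form(lam):
    """The identity used in Remark 3: (1 - exp(-2 lam)) / 2."""
    return 0.5 * (1.0 - math.exp(-2.0 * lam))
```

With lam = n p_i, each term of the series tends to 1/2 as n grows, which is why the sum over infinitely many cells diverges.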

It is also worth noting that for some sets B the main contribution to the limit behavior of the series

\sum_{i \geq 1} P (π_{i n} \in B)

can be made not only by its initial segments but also by its tails. For example, this will be the case for any one-point sets

B_{k} : = {k}

for

k > 0

if the group probabilities are given as

p_{i} = C i^{- 1 - b}

or

p_{i} = c e^{- C_{o} i^{α}}

for some constants

c, C, C_{o}, b > 0

and

α \in (0, 1)

. In this case, for any subset B of natural numbers in the definition of the functionals (8) and (11), the expectation limits indicated in Lemma 2 will be infinite (see Section 7 and [9,12]). On the other hand, if

p_{i} = c e^{- C_{o} i}

, then for any one-point set the expectations mentioned will be bounded uniformly in n (see Section 7 and [9,12]). For more complex functionals with kernels (9) or (10) for the above-mentioned distributions

{p_{i}}

, one can find sufficiently broad conditions that ensure unbounded increase in their expectations and variances as

\bar{n} \to \infty

for almost all vectors

(a_{1}, \dots, a_{r}) \in R^{r}

(see Section 7).

Now we present one of the corollaries of Theorem 3, namely, the law of large numbers for the additive functionals under consideration, setting in this theorem

Δ_{\bar{n}} : = {(E Φ_{f} ({\bar{Π}}_{\bar{n}}))}^{- 1}

,

M_{\bar{n}} : = 0

, and

γ : = 1

.

Corollary 2.

Let the conditions of Lemma 2 be fulfilled. If

| E Φ_{f} ({\bar{Π}}_{\bar{n}}) | \to \infty

as

\bar{n} \to \infty

then the following criterion holds:

\frac{Φ_{f} ({\bar{V}}_{\bar{n}})}{E Φ_{f} ({\bar{V}}_{\bar{n}})} \overset{p}{⟶} 1 \quad \text{iff} \quad \frac{Φ_{f} ({\bar{Π}}_{\bar{n}})}{E Φ_{f} ({\bar{Π}}_{\bar{n}})} \overset{p}{⟶} 1;

in this case, the normalizations

E Φ_{f} ({\bar{V}}_{\bar{n}})

and

E Φ_{f} ({\bar{Π}}_{\bar{n}})

can be swapped.

Remark 4.

By Chebyshev’s inequality, a sufficient condition for the limit relations in Corollary 2 is as follows:

\frac{\sum_{i \geq 1} D f_{i \bar{n}} ({\bar{π}}_{i \bar{n}})}{{(\sum_{i \geq 1} E f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}))}^{2}} \to 0 .

For example, let

f_{i \bar{n}} (\cdot) \geq 0

and

sup_{\bar{x}, i, \bar{n}} f_{i \bar{n}} (\bar{x}) \leq C_{0}

. Then

D f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}) \leq C_{0} E f_{i \bar{n}} ({\bar{π}}_{i \bar{n}})

and

\frac{\sum_{i \geq 1} D f_{i \bar{n}} ({\bar{π}}_{i \bar{n}})}{{(\sum_{i \geq 1} E f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}))}^{2}} \leq C_{0} {|\sum_{i \geq 1} E f_{i \bar{n}} ({\bar{π}}_{i \bar{n}})|}^{- 1} \to 0 .

In particular, this estimate is valid in the case

f_{i \bar{n}} (\bar{x}) \equiv f (\bar{x}) : = I_{B} (\bar{x})

, with

0 \notin B

, if only

E Φ_{f} ({\bar{Π}}_{\bar{n}}) = \sum_{i \geq 1} P ({\bar{π}}_{i \bar{n}} \in B) \to \infty

.
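The bound in Remark 4 is easy to evaluate for indicator kernels. The sketch below, with hypothetical names and a finite partition assumed for illustration, computes the mean and variance of the Poissonian number of nonempty cells (m = 1, B = {j : j > 0}) and the Chebyshev ratio.

```python
import math

def poisson_occupancy_mean_var(n, probs):
    """Mean and variance of the number of nonempty cells in the Poisson scheme.

    The summands are independent Bernoulli variables with success
    probabilities 1 - exp(-n p_i), so both moments are plain sums.
    """
    empty = [math.exp(-n * p) for p in probs]      # P(cell i is empty)
    mean = sum(1.0 - q for q in empty)
    var = sum(q * (1.0 - q) for q in empty)
    return mean, var

def chebyshev_ratio(n, probs):
    """Variance divided by squared mean, the quantity required to vanish in Remark 4."""
    mean, var = poisson_occupancy_mean_var(n, probs)
    return var / mean ** 2
```

Because the kernel is bounded by C_0 = 1, the ratio is at most the reciprocal of the mean, so it vanishes whenever the mean tends to infinity.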

We now formulate an analog of Lemma 2 for the variances of the functionals under consideration.

Lemma 3.

Under the conditions

{max}_{\bar{n}} {sup}_{\bar{x}} | f_{i \bar{n}} (\bar{x}) | \leq C_{i}

\forall i

and

\sum_{i \geq 1} C_{i}^{2} p_{i} < \infty

the limit relation

lim_{\bar{n} \to \infty} D Φ_{f} ({\bar{V}}_{\bar{n}}) = \infty

holds if and only if

lim_{\bar{n} \to \infty} D Φ_{f} ({\bar{Π}}_{\bar{n}}) = \infty

. In the case of infinite limit the following equivalence is valid:

D Φ_{f} ({\bar{V}}_{\bar{n}}) \sim D Φ_{f} ({\bar{Π}}_{\bar{n}})

as

\bar{n} \to \infty

.

Lemma 3 and Theorem 3 imply the following important criterion, which allows us to reduce proving the central limit theorem for additive functionals

Φ_{f} ({\bar{V}}_{\bar{n}})

to proving the same assertion for the Poissonian version

Φ_{f} ({\bar{Π}}_{\bar{n}})

.

Corollary 3.

Under the conditions of Lemma 3 and

D Φ_{f} ({\bar{Π}}_{\bar{n}}) \to \infty

as

\bar{n} \to \infty

the limit relation

L (\frac{Φ_{f} ({\bar{V}}_{\bar{n}}) - E Φ_{f} ({\bar{V}}_{\bar{n}})}{D^{1 / 2} Φ_{f} ({\bar{V}}_{\bar{n}})}) ⟹ N (0, 1) \quad \text{as} \quad \bar{n} \to \infty,

is valid if, and only if,

L (\frac{Φ_{f} ({\bar{Π}}_{\bar{n}}) - E Φ_{f} ({\bar{Π}}_{\bar{n}})}{D^{1 / 2} Φ_{f} ({\bar{Π}}_{\bar{n}})}) ⟹ N (0, 1) \quad \text{as} \quad \bar{n} \to \infty,

where

N (0, 1)

is the standard normal distribution. In this case, the normalizing and centering sequences in these two limit relations can be, respectively, swapped.

In order to prove this corollary we should put in Theorem 3

Δ_{\bar{n}} : = D^{- 1 / 2} Φ_{f} ({\bar{Π}}_{\bar{n}})

,

M_{\bar{n}} : = E Φ_{f} ({\bar{V}}_{\bar{n}}) D^{- 1 / 2} Φ_{f} ({\bar{Π}}_{\bar{n}})

, and

L (γ) : = N (0, 1)

. In this case, Lemma 3 allows us only to replace the normalizing and centering sequences in Theorem 3 with some equivalent sequences.

Remark 5.

The validity of the central limit theorem for the sequence

Φ_{f} ({\bar{Π}}_{\bar{n}})

in Theorem 3 will be justified if, say, the third-order Lyapunov condition is met:

\frac{\sum_{i \geq 1} E {| f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}) - E f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}) |}^{3}}{{(\sum_{i \geq 1} D f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}))}^{3 / 2}} \to 0 \quad \text{as} \quad \bar{n} \to \infty .

For example, let

sup_{\bar{x}, i, \bar{n}} | f_{i \bar{n}} (\bar{x}) | \leq C_{0}

. Then it is easy to see that

\sum_{i \geq 1} E {| f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}) - E f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}) |}^{3} \leq 2 C_{0} \sum_{i \geq 1} D f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}) .

Thus, if

D Φ_{f} ({\bar{Π}}_{\bar{n}}) \to \infty

as

\bar{n} \to \infty

, then the Lyapunov condition is met and the central limit theorem for the Poissonian version holds. So an important special case

f_{i \bar{n}} (\bar{x}) : = I_{B} (\bar{x})

is included in the scheme at issue if

D Φ_{I_{B}} ({\bar{Π}}_{\bar{n}}) = \sum_{i \geq 1} P ({\bar{π}}_{i \bar{n}} \in B) (1 - P ({\bar{π}}_{i \bar{n}} \in B)) \to \infty \quad \text{as} \quad \bar{n} \to \infty .

Note that examples for which the specified variance property takes place or is violated are given, for example, in [9].

Finally, here is another consequence of Theorem 3, relating to the asymptotic behavior of

χ^{2}

-statistics in (7) at

m = 1

and

N \equiv N (n) \to \infty

. First of all, note that

E Φ_{χ^{2}} (Π_{n}) = N,

D_{n} : = D Φ_{χ^{2}} (Π_{n}) = 2 N + \sum_{i = 1}^{N} \frac{1}{n p_{i}} .
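Both identities follow cell by cell from the moments of a Poisson variable π with mean λ: the summand (π - λ)^2 / λ has expectation 1 and variance 2 + 1/λ, and the summands are independent. The following sketch checks this numerically by summing a truncated series; the function names and the truncation level are illustrative assumptions.

```python
import math

def poisson_pmf(j, lam):
    """P(pi = j) for a Poisson variable with mean lam > 0, computed on the log scale."""
    return math.exp(-lam + j * math.log(lam) - math.lgamma(j + 1))

def chi2_cell_moments(lam, terms=400):
    """Mean and variance of (pi - lam)^2 / lam, via a truncated series over the pmf."""
    mean = second = 0.0
    for j in range(terms):
        w = poisson_pmf(j, lam)
        y = (j - lam) ** 2 / lam
        mean += w * y
        second += w * y * y
    return mean, second - mean ** 2

def poisson_chi2_variance(n, probs):
    """D_n = 2 N + sum of 1 / (n p_i), obtained by adding the per-cell variances."""
    return 2.0 * len(probs) + sum(1.0 / (n * p) for p in probs)
```

Summing the per-cell means over N cells gives N, and summing the per-cell variances gives D_n.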

Corollary 4.

Let

N \equiv N (n) \to \infty

as

n \to \infty

. Then the following two asymptotic relations are equivalent:

L (\frac{Φ_{χ^{2}} (V_{n}) - N}{{D_{n}}^{1 / 2}}) ⟹ N (0, 1),

(15)

L (\frac{Φ_{χ^{2}} (Π_{n}) - N}{{D_{n}}^{1 / 2}}) ⟹ N (0, 1) .

(16)

Note that in the present case, the requirement of Lemma 1 is met, since each term

\frac{{(ν_{i n} - n p_{i})}^{2}}{n p_{i}}

(as a sequence of n) is bounded in probability due to Markov’s inequality, and therefore, with the normalizing sequence

Δ_{n} : = D_{n}^{- 1 / 2}

, this term will tend to zero in probability as

n \to \infty

.

Remark 6.

In relations (15) and (16), we can simply speak of the double limit as

N, n \to \infty

because this assertion imposes no restrictions on the growth rate of the sequence

N (n)

. The proposed formulation in Corollary 4, equivalent to the one just mentioned, is more convenient to refer to Theorem 3. Note that the centering sequence

E Φ_{χ^{2}} (Π_{n}) = N

can be replaced with its equivalent sequence

E Φ_{χ^{2}} (V_{n}) = N - 1

. Replacing, in the normalization in (15), the variance

D_{n}

with the variance of the

χ^{2}

-statistic itself, i.e., by the term (for example, see [16])

D Φ_{χ^{2}} (V_{n}) = 2 (N - 1) + \sum_{i = 1}^{N} \frac{1}{n p_{i}} - \frac{N^{2} + 2 N - 2}{n},

is possible only if these two variances are equivalent. For example, this would be the case if

{min}_{i \leq N} n p_{i} \to \infty

. This means that the growth rate of the sequence

N \equiv N (n)

is subject to appropriate constraints, which is not the case in the above corollary. So, in this assertion, we can speak of a double limit as

n, N \to \infty

.

The formulated criterion allows us to establish a fairly general sufficient condition for the asymptotic normality of

χ^{2}

-statistics with an increasing number of groups.

Theorem 4.

Let

N \equiv N (n) \to \infty

as

n \to \infty

. Then the asymptotic relation (15) is valid if

\frac{\sum_{i = 1}^{N} {(n p_{i})}^{- 2}}{{(N + \sum_{i = 1}^{N} {(n p_{i})}^{- 1})}^{3 / 2}} ⟶ 0

(17)

as

n \to \infty

.

The problem of finding more or less broad sufficient conditions for the asymptotic normality of

χ^{2}

-statistics with a growing number of groups was studied by many authors in the second half of the last century (for example, see [3,4,5,16,17,18]). Note that all known sufficient conditions for the above weak convergence imply fulfillment of the asymptotic relation (17). For example, the condition

{min}_{i \leq N} n p_{i} \to \infty

along with

N \to \infty

(see [17,18]), obviously immediately entails relation (17). It is equally obvious that the requirement of the so-called regularity of multinomial models (see [3,4,5]), i.e.,

0 < c_{1} \leq min_{i \leq N} N p_{i}, max_{i \leq N} N p_{i} < c_{2} < \infty,

where the constants

c_{1}

and

c_{2}

are independent of N, also implies (17). On the other hand, it is easy to construct examples in which the regularity requirement of the multinomial model is violated but relation (17) is valid. For example, let

p_{i} : = C_{N} i^{- 1 - b}

,

i = 1, \dots, N

, where

b > 0

and

C_{N} : = {(\sum_{i \leq N} i^{- 1 - b})}^{- 1}

. It is easy to see that, as

N \to \infty

, the sums

\sum_{i = 1}^{N} p_{i}^{- 2}

and

\sum_{i = 1}^{N} p_{i}^{- 1}

increase as

N^{3 + 2 b}

and

N^{2 + b}

, respectively. Therefore, as

n, N \to \infty

, the ratio in (17) is equivalent to

\frac{N^{3 + 2 b}}{\sqrt{n} {(N^{2 + b})}^{3 / 2}} = \frac{N^{b / 2}}{\sqrt{n}}

up to a constant factor. So, here we already need to relate the growth rate of N to that of n. Obviously, in this case, for condition (17) to hold, we need to require that

N = o (n^{1 / b})

. If the probabilities

p_{i}

decrease exponentially then the growth rate zone for N narrows to

o (log n)

. It is worth noting that, for the power-type probabilities above, the condition

{min}_{i \leq N} n p_{i} \to \infty

implies the asymptotic relation

N = o (n^{1 / (b + 1)})

that is more restrictive than the above constraint.
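As a purely illustrative numerical check of the power-law example above, the following Python sketch evaluates the left-hand side of (17) along two growth regimes of N. The function name `ratio_17`, the choice b = 2, and the exponents 0.3 and 0.8 are our assumptions for illustration only; they are not taken from the paper.

```python
def ratio_17(n: int, N: int, b: float) -> float:
    """Left-hand side of (17) for p_i = C_N * i**(-1 - b), i = 1, ..., N."""
    weights = [i ** (-1.0 - b) for i in range(1, N + 1)]
    c_N = 1.0 / sum(weights)                            # normalizing constant C_N
    inv_means = [1.0 / (n * c_N * w) for w in weights]  # the values (n p_i)^{-1}
    numerator = sum(x * x for x in inv_means)           # sum of (n p_i)^{-2}
    denominator = (N + sum(inv_means)) ** 1.5
    return numerator / denominator


if __name__ == "__main__":
    b = 2.0  # illustrative choice; the threshold exponent is then 1 / b = 0.5
    for exponent in (0.3, 0.8):
        for n in (10**3, 10**4, 10**5):
            N = max(2, round(n ** exponent))
            print(f"N = n^{exponent}: n = {n}, N = {N}, ratio = {ratio_17(n, N, b):.4g}")
```

With N growing like a power of n below the threshold exponent 1/b the computed ratio decreases with n, while above the threshold it increases, in agreement with the constraint on N derived above.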

6. Probability and Moment Inequalities

The next theorem is related to estimation of the distribution tails of additive functionals.

Theorem 5.

Let

f_{i \bar{n}} (\cdot) \geq 0

for all i. Then, for any

x > 0

,

P (Φ_{f} ({\bar{V}}_{\bar{n}}) \geq x) \leq 2 C^{*} P (Φ_{f} ({\bar{Π}}_{\bar{n}}) \geq x / 2),

(18)

where

C^{*} : = min_{j \geq 1} max {{(\sum_{i \leq j} p_{i})}^{- 1}, {(\sum_{i > j} p_{i})}^{- 1}}

. If additionally

{sup}_{x} f_{1 \bar{n}} (x) \leq c_{0}

then

P (Φ_{f} ({\bar{V}}_{\bar{n}}) \geq x) \leq p_{1}^{- 1} P (Φ_{f} ({\bar{Π}}_{\bar{n}}) \geq x - c_{0}) .

(19)
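To make the constant in (18) concrete, the following minimal Python sketch evaluates it for a finite probability vector, following the formula exactly as stated in Theorem 5. The function name `c_star` and the sample vectors are illustrative assumptions; for a finite partition the split index j is restricted to values for which both groups of cells have positive probability.

```python
import math


def c_star(probs):
    """min over j of max{1 / sum_{i<=j} p_i, 1 / sum_{i>j} p_i} for a finite vector."""
    total = sum(probs)
    best = math.inf
    head = 0.0
    for p in probs[:-1]:           # the last split would leave an empty tail
        head += p
        tail = total - head
        if head > 0.0 and tail > 0.0:
            best = min(best, max(1.0 / head, 1.0 / tail))
    return best


if __name__ == "__main__":
    print(c_star([0.5, 0.25, 0.25]))   # balanced split available at j = 1
    print(c_star([0.9, 0.1]))          # only one split, dominated by the small tail
```

The value is never smaller than 2, and it equals 2 exactly when some split divides the total mass in half; strongly unbalanced partitions make the constant, and hence the bound (18), worse.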

Remark 7.

In (19), the constant

c_{0}

may depend on

\bar{n}

. What is more, we can use the truncation of the random variable

f_{1 \bar{n}} (ν_{i_{\bar{n}}})

at the level

c_{0}

, while adding to the right-hand side of inequality (19) the probability

P (f_{1 \bar{n}} (ν_{i_{\bar{n}}}) > c_{0})

.

Corollary 5.

Under the conditions of Theorem 5, let F be a continuous nondecreasing function defined on

R_{+}

, with

F (0) = 0

. If

E F (2 Φ_{f} ({\bar{Π}}_{\bar{n}})) < \infty

then

E F (Φ_{f} ({\bar{V}}_{\bar{n}})) \leq 2 C^{*} E F (2 Φ_{f} ({\bar{Π}}_{\bar{n}})) .

(20)

As an example, consider the functional

Φ_{I_{B}} ({\bar{V}}_{\bar{n}})

defined in (8). Then, as a consequence of (19) and Chernoff’s upper bound [19] for the distribution tail of a sum of independent nonidentically distributed Bernoulli random variables (the transition from finite sums to series in this case is obvious), we obtain the following result.

Corollary 6.

Put

M_{n} (B) : = E Φ_{I_{B}} ({\bar{Π}}_{\bar{n}}) = \sum_{i \geq 1} P (π_{i n} \in B)

. Then for any

ε > {(M_{n} (B))}^{- 1}

the following inequality holds:

P (|\frac{Φ_{I_{B}} ({\bar{V}}_{\bar{n}})}{M_{n} (B)} - 1| > ε) \leq 2 p_{1}^{- 1} e^{- \frac{δ^{2} M_{n} (B)}{2 + δ}},

(21)

where

δ : = ε - \frac{1}{M_{n} (B)} > 0

.

Remark 8.

One can replace the Poissonian mean

M_{n} (B)

in (21) with the mean

E Φ_{I_{B}} ({\bar{V}}_{\bar{n}})

, which differs from

M_{n} (B)

by no more than 1 due to Barbour–Hall’s estimate of the Poisson approximation to a binomial distribution (see [15,20]). Further, if the condition

M_{n} (B) \to \infty

is met as

n \to \infty

then from (21) we obtain not only the law of large numbers (already formulated in Corollary 2), but at a certain growth rate of the sequence

M_{n} (B)

, the strong law of large numbers (SLLN) (see Section 7). If in the case

m = 1

we consider the infinite intervals

B \equiv B_{k} : = {i : i > k}

for any

k \in Z_{+}

then the SLLN holds for any rate of growth of the sequence

M_{n} (B)

to infinity. This follows from estimate (21), the monotonicity of the functions

I_{B_{k}} (x)

, and the simple technique in proving SLLN in [9,21].

7. Asymptotic Analysis of the Means and Variances of Additive Statistics

In the previous section, it was noted that when proving certain limit theorems for the introduced additive functionals, it is extremely important to have information about the behavior of their means and variances. In this section, for additive statistics (8)–(11), we demonstrate exactly how the asymptotic behavior of these moments is studied. To simplify the notation, we will consider here the case

m = 1

. The subsequent asymptotic analysis is based on the following elementary assertion, which is presented in one way or another in many papers on this topic.

Lemma 4.

Let

f_{n} (x)

be a sequence of non-negative, integrable, and piecewise monotonic functions defined on

R_{+}

. Suppose that each

f_{n} (x)

has M monotonicity intervals, where M is independent of n. Finally, assume that, as

n \to \infty

,

\int_{0}^{\infty} f_{n} (x) d x \to \infty, sup_{x \geq 0} f_{n} (x) = o (\int_{0}^{\infty} f_{n} (x) d x) .

Then, as

n \to \infty

,

\sum_{j > 0} f_{n} (j) \sim \int_{0}^{\infty} f_{n} (x) d x .

We now give a few examples of calculating the asymptotics we need.

(1) Let

B_{k} : = {i : i > k}

for any

k \in Z_{+}

. In Remark 3 it was already noted that

M_{n} (B_{k}) \to \infty

due to the strong law of large numbers for binomially distributed random variables. However, for specific classes of distributions

{p_{i}}

, one can estimate the growth rate of the sequence

{M_{n} (B_{k})}

. For example, let

p_{i} : = C i^{- 1 - b}

, where

b > 0

,

i = 1, 2, \dots

. Then, using Lemma 4 and the well-known connection between the tail of a Poisson distribution and the corresponding gamma distribution, we obtain after integration by parts and a change of the integration variable:

\begin{matrix} M_{n} (B_{k}) \equiv \sum_{i \geq 1} P (π_{i n} > k) = \sum_{i \geq 1} γ_{k + 1, 1} (n p_{i}) \\ \sim {(C n)}^{\frac{1}{1 + b}} \int_{0}^{\infty} γ_{k + 1, 1} (y^{- 1 - b}) d y = \frac{{(C n)}^{\frac{1}{1 + b}}}{k!} Γ (k + \frac{b}{1 + b}), \end{matrix}

(22)

where

γ_{k + 1, 1} (z) : = \int_{0}^{z} \frac{t^{k}}{k!} e^{- t} d t

,

Γ (z) : = \int_{0}^{\infty} t^{z - 1} e^{- t} d t

,

z > 0

, are the distribution function of the gamma-distribution with parameters

(k + 1, 1)

, and the gamma-function, respectively. For example, if

k = 0

then the asymptotics of the expectation of the number of nonempty cells is as follows (see [6,9]):

M_{n} (B_{0}) \sim {(C n)}^{\frac{1}{1 + b}} \int_{0}^{\infty} (1 - e^{- y^{- 1 - b}}) d y = {(C n)}^{\frac{1}{1 + b}} Γ (\frac{b}{1 + b}) .

(23)
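The asymptotics (23) can be checked numerically. The sketch below, which is only an illustration under stated assumptions, takes b = 1, so that C = 6/π² is the exact normalizing constant, truncates the series for the expected number of nonempty cells, and adds a first-order tail correction. The function names, the truncation level, and the tail approximation are ours and are not part of the paper.

```python
import math


def expected_nonempty_cells(n: int, b: float, C: float, terms: int = 200_000) -> float:
    """Truncated series sum_i (1 - exp(-n p_i)) with p_i = C * i**(-1 - b)."""
    total = 0.0
    for i in range(1, terms + 1):
        total += -math.expm1(-n * C * i ** (-1.0 - b))   # 1 - exp(-n p_i)
    # Crude tail correction: 1 - exp(-x) ~ x for small x,
    # and sum_{i > T} i**(-1 - b) ~ T**(-b) / b.
    total += n * C * terms ** (-b) / b
    return total


def asymptotic_23(n: int, b: float, C: float) -> float:
    """Right-hand side of (23)."""
    return (C * n) ** (1.0 / (1.0 + b)) * math.gamma(b / (1.0 + b))


if __name__ == "__main__":
    b, C = 1.0, 6.0 / math.pi ** 2   # C = 1 / zeta(2) normalizes i**(-2)
    for n in (10**4, 10**6):
        ratio = expected_nonempty_cells(n, b, C) / asymptotic_23(n, b, C)
        print(f"n = {n}: series / asymptotics = {ratio:.5f}")
```

The printed ratios should approach 1 as n grows, which is what the equivalence sign in (23) asserts.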

By analogy to the arguments in proving (22), after an appropriate change of the integration variable, we obtain for the one-point sets the following asymptotics:

\begin{matrix} M_{n} ({k}) \sim {(C n)}^{\frac{1}{1 + b}} \int_{0}^{\infty} \frac{y^{- k (1 + b)}}{k!} e^{- y^{- 1 - b}} d y \\ = \frac{{(C n)}^{\frac{1}{1 + b}}}{(1 + b) k!} \int_{0}^{\infty} x^{k - 1 - \frac{1}{1 + b}} e^{- x} d x = \frac{{(C n)}^{\frac{1}{1 + b}}}{(1 + b) k!} Γ (k - \frac{1}{1 + b}) . \end{matrix}

(24)

Thus, from (24) it follows that, for any subset B of the natural numbers, in the case of a power-law decrease in

{p_{i}}

the following asymptotic representation is true:

M_{n} (B) \sim \frac{{(C n)}^{\frac{1}{1 + b}}}{(1 + b)} \sum_{k \in B} \frac{1}{k!} Γ (k - \frac{1}{1 + b}) .

(25)

Note that, due to the countable additivity of the finite measure

M_{n} (\cdot)

and the relations (22)–(24), the sum (possibly infinite) in (25) will always be finite.

Remark 9.

Inequality (21), relation (25), and the Borel–Cantelli lemma guarantee that the strong law of large numbers holds for the sequence

{M_{n} (B)}

for any subset B of the natural numbers in the case of a power-law decrease in the probabilities

{p_{i}}

. Moreover, what has been said and the above asymptotics are also preserved for probabilities of the form

p_{i} : = C (i) i^{- 1 - b}

, where

C (x)

is a slowly varying function under certain minimal constraints (see [9,12]). In this case, in the asymptotic relations (22)–(25) instead of C one should substitute

C (n)

.

The asymptotic behavior of the variances of the functionals

Φ_{I_{B}} ({\bar{Π}}_{n})

for some B and broad conditions on the rate of decrease in the sequence

{p_{i}}

is given in [9]. Here we only demonstrate how this variance is calculated for arbitrary subsets B of the natural numbers under the above conditions on

{p_{i}}

. Analogously with (22) we have for the infinite intervals

B_{k}

:

\begin{matrix} D_{n} (B_{k}) : = D Φ_{I_{B_{k}}} ({\bar{Π}}_{n}) = \sum_{i \geq 1} P (π_{i n} > k) - \sum_{i \geq 1} P^{2} (π_{i n} > k) \\ = \sum_{i \geq 1} γ_{k + 1, 1} (n p_{i}) - \sum_{i \geq 1} γ_{k + 1, 1}^{2} (n p_{i}) \sim {(C n)}^{\frac{1}{1 + b}} \int_{0}^{\infty} (γ_{k + 1, 1} (y^{- 1 - b}) - γ_{k + 1, 1}^{2} (y^{- 1 - b})) d y . \end{matrix}

(26)

Similarly to proving (24), we derive the asymptotics of the variance for the one-point sets:

\begin{matrix} D_{n} ({k}) = \sum_{i \geq 1} P (π_{i n} = k) - \sum_{i \geq 1} P^{2} (π_{i n} = k) \\ \sim \frac{{(C n)}^{\frac{1}{1 + b}}}{(1 + b)} (\int_{0}^{\infty} \frac{1}{k!} x^{k - 1 - \frac{1}{1 + b}} e^{- x} d x - \int_{0}^{\infty} \frac{1}{{(k!)}^{2}} x^{2 k - 1 - \frac{1}{1 + b}} e^{- 2 x} d x) \\ = \frac{{(C n)}^{\frac{1}{1 + b}}}{(1 + b) k!} (Γ (k - \frac{1}{1 + b}) - \frac{2^{\frac{1}{1 + b} - 2 k}}{k!} Γ (2 k - \frac{1}{1 + b})) . \end{matrix}

(27)

Although the set function

D_{n} (\cdot)

is not additive, extending the computation to arbitrary subsets B of the natural numbers, that is, finding the asymptotics of

D_{n} (B)

presents no difficulty. Along with formula (25), which gives one term in the resulting asymptotics, we use the following representation for the second sum:

\begin{matrix} \sum_{i \geq 1} P^{2} (π_{i n} \in B) \sim \frac{{(C n)}^{\frac{1}{1 + b}}}{1 + b} \int_{0}^{\infty} {(\sum_{k \in B} \frac{x^{k}}{k!})}^{2} x^{- 1 - \frac{1}{1 + b}} e^{- 2 x} d x \\ = \frac{{(C n)}^{\frac{1}{1 + b}}}{1 + b} \sum_{k, l \in B} \frac{2^{\frac{1}{1 + b} - k - l}}{k! l!} Γ (k + l - \frac{1}{1 + b}) . \end{matrix}

(28)

Thus, the difference between the right-hand sides of (25) and (28) determines the asymptotics of

D_{n} (B)

for any subset B of the natural numbers.

(2) The asymptotics of the first two moments for the functionals (10) for pairwise disjoint sets

{B_{j}}

is derived in exactly the same way. In the case of one-point sets

B_{j} : = {k_{j}}

, the asymptotic behavior of the first moment immediately follows from the previous calculations. As for the variance, we should first note that, due to the orthogonality of the indicator random variables under consideration, we have

D \sum_{s = 1}^{r} a_{s} I_{B_{s}} (π_{i n}) = \sum_{s = 1}^{r} a_{s}^{2} P (π_{i n} = k_{s}) - {(\sum_{s = 1}^{r} a_{s} P (π_{i n} = k_{s}))}^{2}

= \sum_{s = 1}^{r} a_{s}^{2} P (π_{i n} = k_{s}) - \sum_{j, s = 1}^{r} a_{s} a_{j} P (π_{i n} = k_{s}) P (π_{i n} = k_{j}) .

Summation over i of the resulting expression and the previous calculations give the desired asymptotics:

D Φ_{f} (Π_{n}) \sim \frac{{(C n)}^{\frac{1}{1 + b}}}{b + 1} \sum_{s, j = 1}^{r} [\frac{a_{s}^{2}}{r k_{s}!} Γ (k_{s} - \frac{1}{b + 1}) - \frac{2^{\frac{1}{b + 1} - k_{s} - k_{j}} a_{s} a_{j}}{k_{s}! k_{j}!} Γ (k_{s} + k_{j} - \frac{1}{b + 1})] .

We note that the resulting representation can vanish only on a set of vectors

(a_{1}, \dots, a_{r})

of zero Lebesgue measure in

R^{r}

, i.e., on the surface defined by the relation

\sum_{s, j = 1}^{r} B_{s, j} a_{s} a_{j} = 0

for some coefficients

{B_{s, j}}

.

For infinite intervals of the form

B_{j} : = {i : i > k_{j}}

, the variance is studied in a similar way. We assume without loss of generality that

k_{1} \leq k_{2} \leq \dots \leq k_{r}

. To calculate the variance of this functional, it suffices for us to restrict ourselves to the second moment, since the asymptotics of the first one has already been studied. We have

E {(\sum_{s = 1}^{r} a_{s} I (π_{i n} > k_{s}))}^{2} = \sum_{s = 1}^{r} a_{s}^{2} P (π_{i n} > k_{s}) + 2 E \sum_{j = 1}^{r - 1} a_{j} I (π_{i n} > k_{j}) \sum_{s > j}^{r} a_{s} I (π_{i n} > k_{s})

= \sum_{s = 1}^{r} a_{s}^{2} P (π_{i n} > k_{s}) + 2 E \sum_{j = 1}^{r - 1} a_{j} \sum_{s > j}^{r} a_{s} I (π_{i n} > k_{s})

= \sum_{s = 1}^{r} a_{s}^{2} P (π_{i n} > k_{s}) + 2 \sum_{j = 1}^{r - 1} a_{j} \sum_{s > j}^{r} a_{s} P (π_{i n} > k_{s}) .

Further calculations in essence have already been made earlier. So, finally we obtain

D Φ_{f} (Π_{n}) \sim {(C n)}^{\frac{1}{1 + b}} \sum_{s, j = 1}^{r} [\frac{a_{s}^{2}}{r} \int_{0}^{\infty} γ_{k_{s} + 1, 1} (v^{- 1 - b}) d v - a_{s} a_{j} \int_{0}^{\infty} γ_{k_{s} + 1, 1} (v^{- 1 - b}) γ_{k_{j} + 1, 1} (v^{- 1 - b}) d v]

with comments similar to the above regarding the zeroing of the double sum.

To conclude this section, we give an example where the above-mentioned moments of the functional under consideration do not tend to infinity as n grows. We put

p_{j} = e^{- C j}

, with

C : = log 2

. Let us show that

sup_{n} \sum_{j \geq 1} P (π_{n j} = k) < \infty .

This estimate obviously implies that the first two moments of the functional

Φ_{I_{B}} (Π_{n})

are uniformly bounded in n for

B : = {k}

. Indeed, one has

\sum_{j \geq 1} P (π_{n j} = k) = \frac{n^{k}}{k!} \sum_{j \geq 1} e^{- n e^{- C j}} e^{- C k j} \leq \frac{e^{C k} n^{k}}{k!} \int_{1}^{\infty} e^{- n e^{- C x}} e^{- C k x} d x

= \frac{e^{C k} n^{k}}{C k!} \int_{0}^{e^{- C}} e^{- n t} t^{k - 1} d t = \frac{e^{C k}}{C k!} \int_{0}^{n e^{- C}} e^{- u} u^{k - 1} d u;

here we used the estimate

e^{- n e^{- C j}} e^{- C k j} \leq e^{C k} e^{- n e^{- C x}} e^{- C k x}

for all

x \in [j, j + 1]

, also representing the integral over the semiaxis

[1, \infty)

as a series of integrals over the indicated segments of unit length. If

n \to \infty

then the integral in the last expression converges monotonically to the quantity

Γ (k)

, which proves our assertion. Note also that a similar example is given in [9].
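The boundedness claim in this example is easy to probe numerically. The following Python sketch, given only as an illustration, sums the Poisson point probabilities for the means n 2^{-j} and compares the result with the limiting form of the bound derived above, namely 2^{k} Γ(k) / (k! log 2) = 2^{k} / (k log 2). The function names and the truncation of the series at 400 terms are our assumptions.

```python
import math


def sum_point_probs(n: float, k: int, terms: int = 400) -> float:
    """Truncated series sum_{j>=1} P(pi_{nj} = k) for Poisson means n * 2**(-j)."""
    total = 0.0
    for j in range(1, terms + 1):
        lam = n * 2.0 ** (-j)
        # Poisson probability computed through its logarithm to avoid overflow.
        total += math.exp(k * math.log(lam) - lam - math.lgamma(k + 1))
    return total


def uniform_bound(k: int) -> float:
    """e^{Ck} * Gamma(k) / (C * k!) with C = log 2, i.e. 2**k / (k * log 2)."""
    return 2.0 ** k / (k * math.log(2.0))


if __name__ == "__main__":
    for k in (1, 2, 3):
        values = [sum_point_probs(n, k) for n in (10.0, 1e3, 1e6, 1e9)]
        print(k, [round(v, 4) for v in values], round(uniform_bound(k), 4))
```

For each fixed k ≥ 1 the computed sums stay below the bound for all tested n, in contrast with the power-law case where the corresponding means grow like a power of n.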

8. Proofs

Proof of Theorem 1.

The assertion of the theorem is essentially a consequence of some results from [1,2,22,23]. First we introduce the necessary notation and recall the assertions from [22,23] we need.

Let

{Y_{i}}

be a sequence of independent identically distributed random elements taking values in a measurable Abelian group

(G, A)

with measurable operation “+”. Assume that the zero (neutral) element 0, as a one-point set, belongs to the

σ

-algebra

A

and

p : = P (Y_{1} \neq 0) \in (0, 1)

. Denote by

{Y_{i}^{0}}

a sequence of independent identically distributed random variables with marginal distribution

L (Y_{1}^{0}) = L (Y_{1} | Y_{1} \neq 0),

and also put

S_{n} : = Σ_{i = 1}^{n} Y_{i}

and

S_{n}^{0} : = Σ_{i = 1}^{n} Y_{i}^{0}

. In [1,2,22], the following assertion was obtained.

Lemma 5.

For any natural n, the following representations are valid:

L (S_{n}) = L (S_{ν (n, p)}^{0}), L (S_{π (n)}) = L (S_{π (n p)}^{0}),

(29)

where

L (ν (n, p)) \equiv B_{n, p},

is the binomial distribution with parameters n and p,

π (t)

is a standard Poisson process, and the pair

(ν (n, p), π (n p))

does not depend on the sequence

{Y_{i}^{0}} .

The second important assertion gives an estimate for the Radon–Nikodym derivative of the binomial distribution with respect to the accompanying Poisson law (see [23]).

Lemma 6.

For all

p \in (0, 1)

and natural n, the following estimate holds:

sup_{k \geq 0} \frac{B_{n, p} (k)}{L (π (n p)) (k)} \leq \frac{1}{1 - p} .

(30)

Remark 10.

There are other estimates for this Radon–Nikodym derivative. For example, in [24], it was established that

sup_{k \geq 0} \frac{B_{n, p} (k)}{L (π (n p)) (k)} \leq \frac{2}{\sqrt{1 - p}}

for any n and

p \in (0, 1)

. Note that for

p \geq 3 / 4

this estimate is more accurate than (30).
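Estimate (30) can be illustrated numerically. The sketch below computes the largest ratio of the binomial probabilities to the accompanying Poisson probabilities over k = 0, ..., n (for larger k the binomial probability is zero) and prints it next to both bounds. The function name `max_density_ratio` and the sample pairs (n, p) are our illustrative assumptions.

```python
import math


def max_density_ratio(n: int, p: float) -> float:
    """sup over k of Binomial(n, p)(k) / Poisson(n p)(k), computed in log space."""
    best = -math.inf
    for k in range(n + 1):
        # log C(n,k) p^k (1-p)^(n-k) minus log e^{-np} (np)^k / k!;
        # the terms log k! and k log p cancel.
        log_ratio = (math.lgamma(n + 1) - math.lgamma(n - k + 1)
                     - k * math.log(n)
                     + (n - k) * math.log1p(-p)
                     + n * p)
        best = max(best, log_ratio)
    return math.exp(best)


if __name__ == "__main__":
    for n, p in [(1, 0.5), (10, 0.3), (200, 0.05), (50, 0.9)]:
        r = max_density_ratio(n, p)
        print(f"n = {n}, p = {p}: sup ratio = {r:.4f}, "
              f"1/(1-p) = {1 / (1 - p):.4f}, 2/sqrt(1-p) = {2 / math.sqrt(1 - p):.4f}")
```

In these examples the supremum stays below 1/(1 - p), and for p = 0.9 the bound from [24] is visibly the sharper of the two, as noted in Remark 10.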

It is clear that it is enough to prove the assertion for

m = 1

. A proof of the general case is carried out by induction on m and immediately follows from the total probability formula and an estimate for the conditional probability when

m - 1

coordinates of the vector

{\bar{V}}_{\bar{n}}

are fixed. From (29) and (30) and the total probability formula (when the sequence

{Y_{i}^{0}}

is fixed) we obtain the inequality

L (S_{n}) \leq \frac{1}{1 - p} L (S_{π (n)}) .

(31)

Now we put

Y_{i} : = I_{A} (X_{i}^{(1)})

,

A \in A_{0}

, where

A_{0}

is defined in (1). Consider the Abelian group

G : = \{\sum_{i = 1}^{k} e_{i} I_{A} (z_{i}), A \in A_{0}; \forall k \geq 1, \forall z_{i} \in X, \forall e_{i} = - 1, 1\}

and equip this group with the cylindrical

σ

-algebra. It is clear that

Y_{i} \in G

and the following is true:

P (Y_{1} \neq 0) = P (A_{0}) = p \in (0, 1)

. So, inequality (2) follows from (31) and the above-mentioned induction on m. □

Proof of Theorem 2.

We will carry out our reasoning in the generality and notation of the proof of Theorem 1. Both relations in (29) serve as the basis of the construction, in which the sequence

{Y_{i}^{0}}

is assumed to be the same in constructing the sums

S_{n}^{0}

and

S_{π (n)}^{0}

on a common probability space. So, to prove the first two assertions of the theorem, we only need to construct on the common probability space the random variables

ν (n, p)

and

π_{n p}

so that they would be as close as possible to each other. The resulting probability space will be the direct product of two probability spaces carrying, respectively, the sequence of independent identically distributed random variables

{Y_{i}^{0}}

and the above-mentioned pair of scalar indices. For the optimal definition of random indices

ν (n, p)

and

π_{n p}

on a common probability space, we use Dobrushin’s theorem (see [25]), which guarantees the existence of marginal copies

ν^{*} (n, p)

and

π_{n p}^{*}

of the mentioned random indices defined on a common probability space so that

P (ν^{*} (n, p) \neq π_{n p}^{*}) = d_{T V} (L (ν (n, p)), L (π_{n p})),

(32)

where

d_{T V} (\cdot, \cdot)

is the total variation distance between distributions. Now we use the well-known estimate of Poisson approximation to a binomial distribution (see [15,20]):

d_{T V} (L (ν (n, p)), L (π_{n p})) \leq p \land (n p^{2}) \leq p .

(33)

Applying the described construction to each of the m independent coordinates of the vector point processes under consideration, we easily obtain from (32) and (33) the assertion of the theorem. □
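The estimate (33) used in this proof can be illustrated numerically. The sketch below computes the total variation distance between the binomial law with parameters n and p and the Poisson law with mean np directly from the definition, and compares it with the bound in (33). The function name and the sample parameters are our assumptions; the Poisson mass beyond n, where the binomial law vanishes, is recovered from the complement.

```python
import math


def tv_binomial_poisson(n: int, p: float) -> float:
    """Total variation distance between Binomial(n, p) and Poisson(n p)."""
    lam = n * p
    abs_diff = 0.0
    poisson_mass = 0.0
    for k in range(n + 1):
        log_binom = (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
                     + k * math.log(p) + (n - k) * math.log1p(-p))
        log_pois = -lam + k * math.log(lam) - math.lgamma(k + 1)
        b, q = math.exp(log_binom), math.exp(log_pois)
        abs_diff += abs(b - q)
        poisson_mass += q
    # Poisson mass on k > n, where the binomial probability is zero.
    abs_diff += max(0.0, 1.0 - poisson_mass)
    return 0.5 * abs_diff


if __name__ == "__main__":
    for n, p in [(10, 0.1), (100, 0.01), (1000, 0.2), (5, 0.5)]:
        d = tv_binomial_poisson(n, p)
        print(f"n = {n}, p = {p}: d_TV = {d:.5f}, bound = {min(p, n * p * p):.5f}")
```

By (32), the printed distance is also the smallest possible probability that the coupled indices differ, so these values show how close the two constructions can be made for a single coordinate.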

Proof of Lemma 1.

Fix a multi-index

\bar{n}

. Let us assume that the point processes

{\bar{V}}_{\bar{n}}

and

{\bar{Π}}_{\bar{n}}

are defined on the same probability space in one way or another. Then for any natural k we have the estimate

| Φ_{f} ({\bar{V}}_{\bar{n}}) - Φ_{f} ({\bar{Π}}_{\bar{n}}) | \leq \sum_{i \geq k} |f_{i \bar{n}} ({\bar{ν}}_{i \bar{n}}) - f_{i \bar{n}} ({\bar{π}}_{i \bar{n}})| + ζ_{k \bar{n}},

(34)

where

ζ_{k \bar{n}} : = \sum_{i < k} | f_{i \bar{n}} ({\bar{ν}}_{i \bar{n}}) | + \sum_{i < k} | f_{i \bar{n}} ({\bar{π}}_{i \bar{n}}) |

. Put

A_{0} : = ⋃_{i \geq k} Δ_{i}, p (k) : = P (A_{0}) = \sum_{i \geq k} p_{i}

. Note that the tail of the series on the right-hand side of inequality (34) is a functional of the

A_{0}

-restrictions of the studied vector point processes defined on a common probability space. So we can use Theorem 2, which guarantees the existence of an absolute coupling (depending on k) of the mentioned

A_{0}

-restrictions with the following lower bound for the coincidence probability (see (4); here, in order not to clutter up the notation, we omit the superscript “*”):

P (\begin{matrix} (ν_{n_{1} k}^{(1)}, ν_{n_{1} k + 1}^{(1)}, \dots) = (π_{n_{1} k}^{(1)}, π_{n_{1} k + 1}^{(1)}, \dots) \\ (ν_{n_{2} k}^{(2)}, ν_{n_{2} k + 1}^{(2)}, \dots) = (π_{n_{2} k}^{(2)}, π_{n_{2} k + 1}^{(2)}, \dots) \\ \dots \dots \dots \\ (ν_{n_{m} k}^{(m)}, ν_{n_{m} k + 1}^{(m)}, \dots) = (π_{n_{m} k}^{(m)}, π_{n_{m} k + 1}^{(m)}, \dots) \end{matrix})

= P (sup_{Δ_{j}, j \geq k} ∥V_{\bar{n}}^{0} (Δ_{j}, \dots, Δ_{j}) - Π_{\bar{n}}^{0} (Δ_{j}, \dots, Δ_{j})∥ = 0) \geq {(1 - p (k))}^{m} .

(35)

Hence, the coupling method of Theorem 2 makes the first term on the right-hand side of (34) vanish with probability no less than

{(1 - p (k))}^{m}

.

Further, by virtue of estimate (2) we conclude that

L ({\bar{ν}}_{i \bar{n}}) \leq \frac{1}{{(1 - p_{i})}^{m}} L ({\bar{π}}_{i \bar{n}})

for any i. Therefore, by virtue of the conditions of the theorem, we have

Δ_{\bar{n}} f_{i \bar{n}} (ν_{i \bar{n}}) \overset{p}{\to} 0

for any i for

\bar{n} \to \infty

. So, for any given (obviously, such construction exists) random variable

ζ_{k \bar{n}}

on the same probability space with the

A_{0}

-restrictions of the point processes mentioned above, there is the relation

Δ_{\bar{n}} ζ_{k \bar{n}} \overset{p}{\to} 0

for

\bar{n} \to \infty

for any fixed k. Therefore, using the diagonal method, one can choose

k \equiv k (\bar{n}) \to \infty

for

\bar{n} \to \infty

, for which

Δ_{\bar{n}} ζ_{k \bar{n}} \overset{p}{\to} 0

as

\bar{n} \to \infty

. After constructing the point processes under consideration on a common probability space by the method of Theorem 2 for each

\bar{n}

and already chosen

k (\bar{n})

(in this case, obviously,

p (k (n)) \to 0

), the limit relation asserted in Lemma 1 will hold. Lemma 1 is proved. □

Proof of Theorem 3.

The equivalence of items 1 and 2 directly follows from Lemma 1 and the evident two-sided estimate

P (ξ \leq x - ε) - P (| ξ - η | > ε) \leq P (η \leq x) \leq P (ξ \leq x + ε) + P (| ξ - η | > ε)

for any

x \in R

,

ε > 0

, and arbitrary random variables

ξ

and

η

defined on a common probability space. It remains to put

ξ : = Φ_{f} ({\bar{V}}_{\bar{n}, Δ_{\bar{n}}}^{*}) Δ_{\bar{n}} - M_{\bar{n}}, η : = Φ_{f} ({\bar{Π}}_{\bar{n}, Δ_{\bar{n}}}^{*}) Δ_{\bar{n}} - M_{\bar{n}},

where the point processes

{\bar{V}}_{\bar{n}, Δ_{\bar{n}}}^{*}

and

{\bar{Π}}_{\bar{n}, Δ_{\bar{n}}}^{*}

are defined in Lemma 1.

We now prove the equivalence of items 2 and 3 of the theorem. To this end we need to reformulate the assertion in Lemma 1 where we substitute

Φ_{f}^{*}

for the functional

Φ_{f} ({\bar{V}}_{\bar{n}})

. As the resulting probability space in this assertion, we consider the direct product of the probability spaces where

ν_{n i}

and

π_{n i}

are defined by Dobrushin’s theorem. We only note that, after such construction,

P ({{\bar{ν}}_{i \bar{n}}^{*}, i \geq k} \equiv {{\bar{π}}_{i \bar{n}}, i \geq k}) \geq 1 - m \sum_{i \geq k} p_{i} \sim 1

provided that

k \to \infty

. Further, we repeat the corresponding reasoning in the proof of Lemma 1 (using the corresponding analog of (34)) as well as the above-mentioned arguments in proving the equivalence of items 1 and 2. □

Proof of Lemma 2.

We restrict ourselves to the case

m = 2

. For an arbitrary m, the assertion can be easily proved by induction on m using analogues of the estimates that will be given below. So we have

E Φ_{f} ({\bar{V}}_{\bar{n}}) = \sum_{i \geq 1} \sum_{k_{1}, k_{2} \geq 0} f_{i \bar{n}} (k_{1}, k_{2}) P (ν_{i n_{1}}^{(1)} = k_{1}) P (ν_{i n_{2}}^{(2)} = k_{2}),

E Φ_{f} ({\bar{Π}}_{\bar{n}}) = \sum_{i \geq 1} \sum_{k_{1}, k_{2} \geq 0} f_{i \bar{n}} (k_{1}, k_{2}) P (π_{i n_{1}}^{(1)} = k_{1}) P (π_{i n_{2}}^{(2)} = k_{2});

here the introduction of the operator

E

under the summation sign in the second formula is justified by (14) and Fubini’s theorem. Now we estimate the total variation distance between the distributions of the vectors

(ν_{i n_{1}}^{(1)}, ν_{i n_{2}}^{(2)})

and

(π_{i n_{1}}^{(1)}, π_{i n_{2}}^{(2)})

:

\sum_{k_{1}, k_{2} \geq 0} | P (ν_{i n_{1}}^{(1)} = k_{1}) P (ν_{i n_{2}}^{(2)} = k_{2}) - P (π_{i n_{1}}^{(1)} = k_{1}) P (π_{i n_{2}}^{(2)} = k_{2}) |

\leq \sum_{k_{1}, k_{2} \geq 0} | P (ν_{i n_{1}}^{(1)} = k_{1}) - P (π_{i n_{1}}^{(1)} = k_{1}) | P (ν_{i n_{2}}^{(2)} = k_{2})

+ \sum_{k_{1}, k_{2} \geq 0} | P (ν_{i n_{2}}^{(2)} = k_{2}) - P (π_{i n_{2}}^{(2)} = k_{2}) | P (π_{i n_{1}}^{(1)} = k_{1})

= \sum_{k_{1} \geq 0} | P (ν_{i n_{1}}^{(1)} = k_{1}) - P (π_{i n_{1}}^{(1)} = k_{1}) | + \sum_{k_{2} \geq 0} | P (ν_{i n_{2}}^{(2)} = k_{2}) - P (π_{i n_{2}}^{(2)} = k_{2}) | .

We now use once more Barbour–Hall’s upper bound (see [15,20]) for the total variation distance between the distributions

L (ν_{i n_{j}}^{(j)})

and

L (π_{i n_{j}}^{(j)})

:

\sum_{k_{j} \geq 0} | P (ν_{i n_{j}}^{(j)} = k_{j}) - P (π_{i n_{j}}^{(j)} = k_{j}) | < 2 p_{i}, j = 1, \dots, m .

Then the total variation distance between the distributions of the bivariate vectors under consideration is estimated as follows:

\sum_{k_{1}, k_{2} \geq 0} | P (ν_{i n_{1}}^{(1)} = k_{1}) P (ν_{i n_{2}}^{(2)} = k_{2}) - P (π_{i n_{1}}^{(1)} = k_{1}) P (π_{i n_{2}}^{(2)} = k_{2}) | \leq 4 p_{i} .

Therefore,

\begin{matrix} |\sum_{i \geq 1} \sum_{k_{1}, k_{2} \geq 0} f_{i \bar{n}} (k_{1}, k_{2}) P (ν_{i n_{1}}^{(1)} = k_{1}) P (ν_{i n_{2}}^{(2)} = k_{2}) \\ - \sum_{i \geq 1} \sum_{k_{1}, k_{2} \geq 0} f_{i \bar{n}} (k_{1}, k_{2}) P (π_{i n_{1}}^{(1)} = k_{1}) P (π_{i n_{2}}^{(2)} = k_{2})| \\ \leq \sum_{i \geq 1} C_{i} \sum_{k_{1}, k_{2} \geq 0} |P (ν_{i n_{1}}^{(1)} = k_{1}) P (ν_{i n_{2}}^{(2)} = k_{2}) - P (π_{i n_{1}}^{(1)} = k_{1}) P (π_{i n_{2}}^{(2)} = k_{2})| \leq 4 \sum_{i \geq 1} C_{i} p_{i} \end{matrix}

or

| E Φ_{f} ({\bar{V}}_{\bar{n}}) - E Φ_{f} ({\bar{Π}}_{\bar{n}}) | \leq 4 \sum_{i \geq 1} C_{i} p_{i} .

From here we obtain the assertion we need. □

Proof of Lemma 3.

As in the proof of Lemma 2, we restrict ourselves to the case

m = 2

. It is clear that we need to examine two series

S_{1} ({\bar{V}}_{\bar{n}}) : = \sum_{i \geq 1} \sum_{k_{1}, k_{2} \geq 0} f_{i \bar{n}}^{2} (k_{1}, k_{2}) P (ν_{i n_{1}}^{(1)} = k_{1}) P (ν_{i n_{2}}^{(2)} = k_{2}),

S_{2} ({\bar{V}}_{\bar{n}}) : = \sum_{i \geq 1} {(\sum_{k_{1}, k_{2} \geq 0} f_{i \bar{n}} (k_{1}, k_{2}) P (ν_{i n_{1}}^{(1)} = k_{1}) P (ν_{i n_{2}}^{(2)} = k_{2}))}^{2} .

In the same way as in the proof of Lemma 2, we obtain

| S_{1} ({\bar{V}}_{\bar{n}}) - S_{1} ({\bar{Π}}_{\bar{n}}) | \leq 4 \sum_{i \geq 1} C_{i}^{2} p_{i} .

Similarly,

\begin{matrix} | S_{2} ({\bar{V}}_{\bar{n}}) - S_{2} ({\bar{Π}}_{\bar{n}}) | \\ \leq \sum_{i \geq 1} 2 C_{i} \sum_{k_{1}, k_{2} \geq 0} | f_{i \bar{n}} (k_{1}, k_{2}) | |P (ν_{i n_{1}}^{(1)} = k_{1}) P (ν_{i n_{2}}^{(2)} = k_{2}) - P (π_{i n_{1}}^{(1)} = k_{1}) P (π_{i n_{2}}^{(2)} = k_{2})| \\ \leq 4 \sum_{i \geq 1} C_{i}^{2} p_{i} . \end{matrix}

From these estimates it follows that

| D Φ_{f} ({\bar{Π}}_{\bar{n}}) - D Φ_{f} ({\bar{V}}_{\bar{n}}) | \leq 8 \sum_{i \geq 1} C_{i}^{2} p_{i},

whence we obtain the assertion of Lemma 3. □

Proof of Theorem 4.

By Corollary 4, it suffices to present conditions for the asymptotic normality of the Poisson version of the

χ^{2}

-statistic, i.e., conditions under which relation (16) holds. As such, we take the Lyapunov condition of third order. Indeed, consider the following triangular array of centered random variables that are independent within each row:

ξ_{i n} : = \frac{{(π_{i n} - n p_{i})}^{2}}{n p_{i}} - 1, i = 1, \dots, N (n), n \geq 1 .

The Lyapunov condition of third order, which guarantees the fulfillment of the central limit theorem (16), is as follows:

D_{n}^{- 3 / 2} \sum_{i = 1}^{N (n)} E {| ξ_{i n} |}^{3} \to 0 \quad \text{as} \quad n \to \infty .

(36)

In order to estimate the absolute third moment in (36), we need the well-known recurrence relation for the central moments of the Poisson distribution:

E {(π_{λ} - λ)}^{n} = λ \sum_{k = 0}^{n - 2} C_{n - 1}^{k} E {(π_{λ} - λ)}^{k}, n \geq 2,

where

π_{λ}

is a Poisson random variable with parameter

λ

. From here it follows that

E {(π_{λ} - λ)}^{6} = 15 λ^{3} + 25 λ^{2} + λ,

and using the elementary estimate

{| a^{2} - 1 |}^{3} \leq 4 (a^{6} + 1)

, we obtain

E | ξ_{i n} |^{3} \leq \frac{4}{{(n p_{i})}^{3}} (15 {(n p_{i})}^{3} + 25 {(n p_{i})}^{2} + n p_{i}) + 4 = 64 + \frac{100}{n p_{i}} + \frac{4}{{(n p_{i})}^{2}} .

It is clear that, to prove relation (36) it suffices to verify that, under the conditions of the theorem,

\frac{64 N + 100 \sum_{i = 1}^{N} \frac{1}{n p_{i}} + 4 \sum_{i = 1}^{N} \frac{1}{{(n p_{i})}^{2}}}{{(2 N + \sum_{i = 1}^{N} \frac{1}{n p_{i}})}^{3 / 2}}

\leq 100 {(2 N + \sum_{i = 1}^{N} \frac{1}{n p_{i}})}^{- 1 / 2} + \frac{4 \sum_{i = 1}^{N} \frac{1}{{(n p_{i})}^{2}}}{{(N + \sum_{i = 1}^{N} \frac{1}{n p_{i}})}^{3 / 2}} \to 0,

which is true by virtue of (17) and the unbounded growth of N. □
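The sixth central moment used in this proof can be verified mechanically from the recurrence relation quoted above. The following Python sketch is only an illustrative check; the function name `poisson_central_moments` is our assumption, and exact integer arithmetic is used for integer values of the parameter.

```python
from math import comb


def poisson_central_moments(lam, order: int):
    """Central moments mu_0, ..., mu_order of a Poisson law with parameter lam,
    obtained from mu_n = lam * sum_{k=0}^{n-2} C(n-1, k) * mu_k for n >= 2."""
    mu = [1, 0]                      # mu_0 = 1, mu_1 = 0
    for n in range(2, order + 1):
        mu.append(lam * sum(comb(n - 1, k) * mu[k] for k in range(n - 1)))
    return mu[: order + 1]


if __name__ == "__main__":
    for lam in (1, 2, 5):
        mu = poisson_central_moments(lam, 6)
        print(lam, mu[6], 15 * lam**3 + 25 * lam**2 + lam)
```

For each tested value the sixth moment returned by the recurrence coincides with 15 λ³ + 25 λ² + λ, and the fourth moment equals 3 λ² + λ, which gives the variance 2 + 1/λ of each summand and hence the denominator in the Lyapunov ratio.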

Proof of Theorem 5.

For any natural k, denote

Φ_{f}^{(k)} ({\bar{V}}_{\bar{n}}) : = \sum_{i \leq k} f_{i \bar{n}} ({\bar{ν}}_{i \bar{n}}) .

Then

P (Φ_{f} ({\bar{V}}_{\bar{n}}) \geq x) \leq P (Φ_{f}^{(k)} ({\bar{V}}_{\bar{n}}) \geq \frac{x}{2}) + P (Φ_{f} ({\bar{V}}_{\bar{n}}) - Φ_{f}^{(k)} ({\bar{V}}_{\bar{n}}) \geq \frac{x}{2}) .

(37)

In the notation of Theorem 1, let

V_{\bar{n}}^{0}

be the restriction of the point process

{\bar{V}}_{\bar{n}}

to the set

A_{0} : = ⋃_{i \leq k} Δ_{i}

with hit probability

p : = \sum_{i \leq k} p_{i}

. Under the sign of the first probability of the right-hand side of inequality (37), instead of the point process

{\bar{V}}_{\bar{n}}

, we can substitute

V_{\bar{n}}^{0}

and use inequality (2) for the distributions of the restrictions of the corresponding point processes.

The difference

Φ_{f} ({\bar{V}}_{\bar{n}}) - Φ_{f}^{(k)} ({\bar{V}}_{\bar{n}}) = \sum_{i > k} f_{i \bar{n}} ({\bar{ν}}_{i \bar{n}})

is also an additive functional of the restriction of the point process

{\bar{V}}_{\bar{n}}

to the additional set

A_{0} : = ⋃_{i > k} Δ_{i}

with hit probability

p : = \sum_{i > k} p_{i}

. For this functional, we also use estimate (2). As a result, from (37) and Theorem 1, taking into account the non-negativity of the terms

f_{i \bar{n}} (\cdot)

, we obtain

P (Φ_{f} ({\bar{V}}_{\bar{n}}) \geq x) \leq {(\sum_{i > k} p_{i})}^{- m} P (Φ_{f}^{(k)} ({\bar{Π}}_{\bar{n}}) \geq \frac{x}{2})

+ {(\sum_{i \leq k} p_{i})}^{- m} P (Φ_{f} ({\bar{Π}}_{\bar{n}}) - Φ_{f}^{(k)} ({\bar{Π}}_{\bar{n}}) \geq \frac{x}{2}) \leq 2 C^{*} P (Φ_{f} ({\bar{Π}}_{\bar{n}}) \geq \frac{x}{2}) .

The theorem is proved. □

Proof of Corollary 5.

The proof is based on the following well-known equality. If

ζ

is a non-negative random variable with finite mean then

E ζ = \int_{0}^{\infty} P (ζ \geq x) d x .

Applying this equality successively with

ζ

equal to

Φ_{f} ({\bar{V}}_{\bar{n}})

or

2 Φ_{f} ({\bar{Π}}_{\bar{n}})

, we easily obtain from (18) the moment inequality (20). □

9. Conclusions

In this paper, we discuss a remarkable asymptotic property of a wide class of additive statistics that allows us to ignore the dependence of the summands in the additive structure of the statistics under consideration and to reduce asymptotic analysis of their distributions to the classical theory of the central limit problem. As consequences, we obtain refinements of certain results concerning the limit behavior of some known classes of additive statistics. Although we limited ourselves only to the law of large numbers and the central limit theorem for the statistics at issue, in the model under consideration it is possible to study sufficient conditions for the weak convergence of their distributions to other infinitely divisible laws as well. In fact, we deal here with a variant of Poisson approximation of empirical point processes, or in other words, with a compound Poisson approximation of an n-th partial sum of independent random variables taking values in some function space. So, in the present paper we deal with the classical subject of Probability Theory and the Poisson approximation of sums of independent multivariate random variables (for example, see [1,12,22,23]).

Moreover, one can reformulate the above-mentioned Poissonization duality theorem for more general U-statistic-type functionals

U_{f} ({\bar{V}}_{n}) : = \sum_{i_{1} \leq \dots \leq i_{m}} f_{\bar{n}, i_{1}, \dots, i_{m}} ({\bar{ν}}_{\bar{n}, i_{1}}, \dots, {\bar{ν}}_{\bar{n}, i_{m}}),

where

f \equiv {f_{\bar{n}, i_{1}, \dots, i_{m}} (\cdot)}

is an array of finite functions defined on

Z_{+}^{d}

, with

d : = \sum_{k \leq m} n_{k}

, satisfying only the restriction

\sum_{i_{1} \leq \dots \leq i_{m}} | f_{\bar{n}, i_{1}, \dots, i_{m}} (0, \dots, 0) | < \infty \forall \bar{n} .

For example, in this more general setting, one can study the limit behavior of the functionals

U_{I} (V_{n}) : = \sum_{i \geq 1} I_{\bar{A}} (ν_{i - 1, n}) I_{A} (ν_{i, n}) \dots I_{A} (ν_{i + m - 1, n}) I_{\bar{A}} (ν_{i + m, n}),

where

\bar{A}

is the complement of an arbitrary subset

A \subset Z_{+}

, with

0 \notin A

, and

ν_{0, n} : = 0

. These functionals count the number of success runs of length exactly m in the dependent (finite or infinite) sequence of Bernoulli trials

\{ I_{A} (ν_{i, n}); i \geq 1 \}

.
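As an illustration only (it is not taken from the paper), the following Python sketch evaluates the run-counting functional for a finite partition and compares its Monte Carlo mean under multinomial group frequencies with its mean under independent Poisson frequencies with the same expectations. The function names `count_exact_runs` and `mean_run_counts`, the uniform cell probabilities, the choice A = {1, 2, ...} (occupied cells), and the treatment of cells beyond the last one as empty are all assumptions made for the example.

```python
import numpy as np


def count_exact_runs(freqs, in_A, m):
    """Count maximal runs of exactly m consecutive successes I_A(nu_i) = 1."""
    hits = np.array([bool(in_A(int(v))) for v in freqs], dtype=bool)
    # Pad with a failure on each side: nu_0 := 0 and 0 is not in A; cells beyond
    # the last one are treated as empty (an assumption for finite partitions).
    padded = np.concatenate(([False], hits, [False])).astype(int)
    steps = np.diff(padded)
    starts = np.flatnonzero(steps == 1)   # failure -> success transitions
    ends = np.flatnonzero(steps == -1)    # success -> failure transitions
    return int(np.sum((ends - starts) == m))


def mean_run_counts(n=200, cells=100, m=2, reps=2000, seed=0):
    """Monte Carlo means of the functional for multinomial and Poisson frequencies."""
    rng = np.random.default_rng(seed)
    p = np.full(cells, 1.0 / cells)       # uniform cells: illustrative choice
    occupied = lambda k: k >= 1           # A = {1, 2, ...}, so 0 is not in A
    multi = [count_exact_runs(rng.multinomial(n, p), occupied, m)
             for _ in range(reps)]
    pois = [count_exact_runs(rng.poisson(n * p), occupied, m)
            for _ in range(reps)]
    return float(np.mean(multi)), float(np.mean(pois))


if __name__ == "__main__":
    print(mean_run_counts())
```

Calling `mean_run_counts()` returns the pair (multinomial mean, Poisson mean). Such a simulation can only suggest how close the two versions are for a given n; it does not replace the conditions under which the duality theorem is stated.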

Author Contributions

Conceptualization, I.B.; formal analysis, I.B. and M.J.; methodology, I.B.; writing—original draft, I.B. and M.J.; writing—review and editing, I.B. All authors have read and agreed to the published version of the manuscript.

Funding

The study of I. Borisov was supported by the Russian Science Foundation, project no. 22-21-00414.

Acknowledgments

The authors thank the anonymous reviewers for careful reading of the paper and insightful comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Borisov, I.S. Poisson approximation of the partial sum process in Banach spaces. Sib. Math. J. 1996, 37, 627–634.
2. Borisov, I.S. Moment inequalities connected with accompanying Poisson laws in Abelian groups. Int. J. Math. Math. Sci. 2003, 44, 2771–2786.
3. Medvedev, J.I. Some theorems on the asymptotic distribution of the χ² statistic. Dokl. Akad. Nauk SSSR 1970, 192, 987–989. (In Russian)
4. Medvedev, Y.I. Decomposable statistics in a polynomial scheme. I. Theory Probab. Appl. 1977, 22, 1–15.
5. Medvedev, Y.I. Decomposable statistics in a polynomial scheme. II. Theory Probab. Appl. 1978, 22, 607–615.
6. Bahadur, R.R. On the number of distinct values in a large sample from an infinite discrete distribution. Proc. Nat. Inst. Sci. India 1960, 26A, 67–75.
7. Kolchin, V.F.; Sevastyanov, B.A.; Chistyakov, V.P. Random Assignments; Nauka: Moscow, Russia, 1976.
8. Darling, D.A. Some limit theorems associated with multinomial trials. In Fifth Berkeley Symposium on Mathematical Statistics and Probability; Part 1. Berkeley–Los Angeles; University of California Press: Berkeley, CA, USA, 1967; Volume 2, pp. 345–350.
9. Karlin, S. Central limit theorems for certain infinite urn schemes. J. Math. Mech. 1967, 17, 373–401.
10. Chebunin, M.; Kovalevskii, A.P. Functional central limit theorems for certain statistics in an infinite urn scheme. Stat. Probab. Lett. 2016, 119, 344–348.
11. Sevastyanov, B.A.; Chistyakov, V.P. Asymptotic normality in a classical problem with balls. Theory Probab. Appl. 1964, 9, 198–211.
12. Barbour, A.D.; Gnedin, A.V. Small counts in the infinite occupancy scheme. Electron. J. Probab. 2009, 14, 365–384.
13. Fisher, R.A.; Corbet, A.S.; Williams, C.B. The relation between the number of species and the number of individuals in a random sample of an animal population. J. Anim. Ecol. 1943, 12, 42–58.
14. Orlitsky, A.; Suresh, A.T.; Wu, Y. Supplementary Information for: Estimating the number of unseen species: A bird in the hand is worth log n in the bush. Proc. Natl. Acad. Sci. USA 2016, 1511, 07428.
15. Barbour, A.D.; Holst, L.; Janson, S. Poisson Approximation; Oxford University Press: Oxford, UK, 1992.
16. Kruglov, V.M. The asymptotic behavior of the Pearson statistic. Theory Probab. Appl. 2001, 45, 69–92.
17. Steck, G.P. Limit Theorems for Conditional Distributions; University of California Press: Berkeley, CA, USA, 1957; Volume 2, pp. 237–284.
18. Tumanyan, S.K. Asymptotic distribution of χ²-criterion when the size of observations and the number of groups simultaneously increase. Theory Probab. Appl. 1956, 1, 117–131.
19. Hagerup, T.; Rüb, C. A guided tour of Chernoff bounds. Inf. Process. Lett. 1990, 33, 305–308.
20. Barbour, A.D.; Hall, P. On the rate of Poisson convergence. Math. Proc. Camb. Philos. Soc. 1984, 95, 473–480.
21. Loève, M. Probability Theory; Nauchnaya Literatura: Moscow, Russia, 1962. (In Russian)
22. Borisov, I.S. Strong Poisson and mixed approximations of sums of independent random variables in Banach spaces. Sib. Adv. Math. 1993, 3, 1–13.
23. Borisov, I.S.; Ruzankin, P.S. Poisson approximation for expectations of unbounded functions of independent random variables. Ann. Probab. 2002, 30, 1657–1680.
24. Borisov, I.S. Approximation of distributions of von Mises statistics with multidimensional kernels. Sib. Math. J. 1991, 32, 554–566.
25. Dobrushin, R.L. Definition of random variables by conditional distributions. Theory Probab. Appl. 1970, 15, 458–486.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
