Abstract
In this paper, we consider a class of additive functionals of a finite or countable collection of the group frequencies of an empirical point process that corresponds to, at most, a countable partition of the sample space. Under broad conditions, it is shown that the asymptotic behavior of the distributions of such functionals is similar to the behavior of the distributions of the same functionals of the accompanying Poisson point process. However, the Poisson versions of the additive functionals under consideration, unlike the original ones, have the structure of sums (finite or infinite) of independent random variables that allows us to reduce the asymptotic analysis of the distributions of additive functionals of an empirical point process to classical problems of the theory of summation of independent random variables.
Keywords:
empirical point process; Poisson point process; Poissonization; group frequency; additive functional MSC:
60F05
1. Introduction
In this paper, we study a class of additive functionals (statistics) of a finite or countable collection of group frequencies constructed by a sample of size n with a finite or countable partition of the sample space. Under broad conditions, it is shown that, as , the asymptotic behavior of distributions of the additive functionals under consideration is completely similar to the behavior of distributions of the same functionals of the accompanying Poisson point process. From here it is easy to establish that the above-mentioned weak convergence is equivalent to that for the same additive functionals but with independent group frequencies, which are constructed, respectively, using a finite or countable collection of independent copies of the original sample, when we fix in the i-th partition element only the points from the i-th independent copy of the original sample. In other words, in the case under consideration, we remove the dependence of the initial group frequencies with a multinomial distribution. This phenomenon makes it possible to directly use the diverse tool of the summation theory of independent random variables to study the limiting behavior of the additive statistics being considered.
The structure of this paper is as follows. In Section 2, we introduce the empirical and accompanying Poisson vector point processes and formulate some important results regarding their connection. In Section 3, we introduce a class of additive statistics and give a number of examples. Section 4 contains the main result of the paper, i.e., a duality theorem, which states that an original additive statistic with some normalizing and centering constants weakly converges to a limit if, and only if, their Poisson version with the same normalizing and centering constants weakly converges to the same limit. In Section 5, we discuss some applications of the duality theorem. In Section 6, we present moment inequalities connecting the original additive statistics and their Poisson versions. Section 7 is devoted to asymptotic analysis of first two moments of additive statistics connected with an infinite multinomial urn model. Section 8 contains proofs of all results of the paper. Finally, in Section 9, we summarize the results and discuss some their extensions.
2. Empirical and Poisson Point Processes
Let , be a finite set of independent copies of a sequence of independent identically distributed random variables with values in an arbitrary measurable space and distribution P. For any natural , consider m independent empirical point processes based on respective samples , :
Define the m independent accompanying Poisson point processes as
where , , are independent standard Poisson processes on the positive half-line, which do not depend on all sequences , . In other words, for all . We consider the point processes and as stochastic processes with trajectories from the measurable space of all bounded functions indexed by the elements of the set , with the -algebra of all cylindrical subsets of the space . The distributions of stochastic processes and on are defined in a standard way.
Now, we introduce the vector-valued empirical and accompanying Poisson point processes
where . The vector-valued point processes and are considered as random elements with values in the measurable space .
Let with . Consider the restrictions of the vector point processes and to the set
These so-called -restrictions are denoted by and , respectively. For the distributions and in the measurable space , there are the following three assertions (some particular versions of these assertions have been proved in [1,2]).
Theorem 1.
The following inequality is valid:
Corollary 1.
For any non-negative measurable functional F defined on ,
the expectation on the right-hand side of (3) may be infinite at that.
The following result plays an essential role in proving the main result of the paper—a duality limit theorem for the distributions and (see Theorem 3 below).
Theorem 2.
For each multi-index , one can define some vector point processes and on a common probability space so that they coincide in distribution with the point processes and , respectively, and
where , and the outer supremum is taken over all at most countable families of sets from .
Remark 1.
In Theorem 2, the sup-seminorm is obviously measurable with respect to the cylindrical σ-algebra . If instead of we substitute the entire class (possibly uncountable) then this measurability may no longer exist (unless, of course, the point processes under consideration do not have the separability property). Nevertheless, the assertion of Theorem 2 remains valid in this case if the probability is replaced by the outer probability . However, the outer probability has only the property of semiadditivity, which makes it difficult to use.
Let measurable sets form a finite or countable partition of the sample space under the condition for all i. Without loss of generality, we can assume that the sequence is monotonically nonincreasing. Denoted by , , the corresponding group frequencies are defined by the sample . Put
Let us agree that everywhere below the limit relation will be understood as for all .
3. Additive Statistics: Examples
In the paper, we consider a class of additive statistics of the form
where is an array of arbitrary finite functions defined on under the condition
which ensures the correct definition of the functional in the case of a countable partition of the sample space, since the sum under consideration contains only a finite set of nonzero random vectors . In the case of a finite partition and , additive functionals of the form (5) were considered in [3,4,5].
We now give some examples of such statistics.
(1) Consider a finite partition of the sample space. Put , , where is the standard Euclidean norm in . Then the functional
is an m-variate version of a well-known -statistic. Note that, in the present paper, we are primarily interested in the case where as .
(2) Let now the sizes of all m samples be equal: , . In an equivalent reformulation of the original problem, we consider a sample of m-dimensional observations under the main hypothesis that the sample vector coordinates are independent and have the same N-atomic distribution with unknown masses . In this case, the log-likelihood function can be represented as the additive functional
where is the unit vector in and is the Euclidean inner product.
(3) Consider a finite or countable partition . Let be the indicator function of some subset . Then the functional
counts the number of partition elements (cells) containing any number of vector sample observations from the range B in a multinomial scheme (finite or infinite) of placing particles into cells (see [6,7,8,9,10,11,12]). Note that in the case of an infinite multinomial scheme in (8), it is additionally assumed that .
In the case and , the two-sample statistic (8) counts the number of nonempty cells after second (“additional”) series of trials (“future” sample), which were empty in the first series (“original” sample). Statistics of such a kind play an important role in the theory of species sampling (for example, see [13,14]). In this case the functional (8) is called the number of unseen species in the original sample.
(4) In the case , consider the joint distribution (see [10]) of the random variables
defined in (8) by the sample , with . It is clear that studying the asymptotic behavior of the joint distribution of these random variables (for example, proving the multidimensional central limit theorem) can be reduced to the study of the limit distributions of the linear combinations of the form
for almost all vectors with respect to the Lebesgue measure on . It is easy to see that, for any natural ,
where the empirical point processes are defined by the above-mentioned independent subsamples. So, in this case, we deal with a functional of the form (5) defined by m independent empirical point processes corresponding to the m independent subsamples , ,…, , and with the array of functions
(5) Consider the stochastic process indexed by all subsets of . As was noted above, studying the asymptotic behavior of the joint distributions of this process can be reduced to studying the asymptotic behavior of the distributions of any linear combinations of corresponding one-dimensional projections of this process, i.e., to studying the asymptotic behavior of the distributions of functionals of the form (5) for and the array of functions
for almost all vectors . For one-point sets, the asymptotic analysis of the above-mentioned joint distributions can be found, for example, in [7,8,9,10,11,12].
(6) Consider the case and the functional
which counts the sampling ratio of the cells containing any number of particles from the range B. For the one-point set , such functional was considered in [9]. In general, if instead of in (11) we consider arbitrary weights (under condition (6)) with one or another interpretation, the functional in this case will be interpreted as the total weight of the corresponding cells.
4. Poissonization: Duality Theorem
In this section, we present the main result of the paper—a duality theorem for additive statistics under consideration. First of all, we explain the term “Poissonization”. It means that studying the limit behavior of the original additive statistics, we reduce the problem to studying the following “Poissonian version” of the functional (5) under condition (6):
where , , , is a sequence of independent Poisson random variables with respective parameters . It is clear that the functional (12) is well defined with probability 1 since only a finite number of the vectors differ from the zero vector. Independence of the summands is a crucial difference of the Poisson version of an additive functional from the original one. Some elements of Poissonization for additive functionals of the form (8) and (10) are contained, for example, in [9,12]. In [9], the author used the well-known representation of an empirical point process as the conditional Poisson point process under the condition that the number of atoms of the accompanying Poisson point process equals n. Moreover, in [9], the simple known representation was employed, where denotes a random variable such that is bounded in probability as . In [12], proving the multivariate central limit theorem for the above-mentioned joint distributions (in fact, for functionals of the form (10) in the case of one-point subsets ), the authors applied a reduction to the joint distributions of the Poissonian versions of additive functionals using known upper bounds for a multivariate Poisson approximation to a multinomial distribution (see also [15]). The main goal of the paper is to establish a duality theorem, which demonstrates absolute identity of the asymptotic behavior of the distributions of the additive functionals under consideration and their Poissonian versions.
First, we formulate a crucial auxiliary assertion in proving the main result.
Lemma 1.
Let be an arbitrary scalar array satisfying the condition as for every fixed i. Then, for each multiindex , one can define on a common probability space a pair of point processes and such that , , and for any ,
Remark 2.
Lemma 1 only asserts that the marginal distributions (that is, for each separately) of the arrays and , and also and . Note that the probability in (13) is precisely determined by the marginal distributions of the mentioned random arrays, i.e., formally, it also depends on . Without loss of generality, we can assume that pairs of point processes , are independent in , and on this extended probability space, the universal probability measure in (13) is given in the standard way, which no longer depends on . In this case it is correct to speak about the convergence to zero in probability of the sequence of random variables in (13).
Lemma 1 gives the key to the proof of the following duality theorem, a criterion for the weak convergence of distributions of functionals of the point processes under consideration. The essence of this result is that the asymptotic behavior of the distributions of additive functionals of the point processes and is exactly the same. In addition, one can also indicate a third class of additive functionals (under condition (6)) that has the same property:
where is a sequence of independent random vectors such that for all i. The functional is well defined due to the Borel–Cantelli lemma and the simple estimate .
Let us agree that the symbol <<⇒>> in what follows will denote the weak convergence of distributions. The main result of the paper is as follows.
Theorem 3.
Under the conditions of Lemma 1, the following three limit relations are equivalent as :
where and are some scalar arrays and γ is some random variable.
5. Applications
Theorem 3 allows us to reduce the asymptotic analysis of the distributions of the additive functionals under consideration to a similar analysis of their Poissonian versions, i.e., to the asymptotic analysis of distributions of sums (finite or infinite) of independent random variables, or to reduce the problem to studying the limit behavior of the distributions , absolutely ignoring the dependence of the random variables . Note also that, under some rather broad assumptions, the law will be infinitely divisible. A detailed analysis of such conditions and corresponding examples will be considered in a separate paper. Here we present only a few of these corollaries, focusing our attention on the equivalence of the first two relations of Theorem 3.
First of all, we note one useful property of the expectations of the functionals under consideration as functions of .
Lemma 2.
Let , , and
Then the relations and are equivalent. In the case of infinite limits,
Remark 3.
For functionals of the form (8) in an infinite multinomial scheme, the conditions of Lemma 2 are typical. Let and for any . Then
since, by virtue of the law of large numbers, for every fixed i. Moreover, in the case under consideration, obviously, . Similarly, without any restrictions on the probabilities , the infinite limits in Lemma 2 for functionals of the form (8) (and even more so for (11)) also hold for the set B consisting of all odd natural numbers. Here the limit relation follows immediately from the equality .
It is also worth noting that for some sets B the main contribution to the limit behavior of the series can be made not only by their initial segments but also tails. For example, this will be the case for any one-point sets for if the group probabilities are given as or for some constants and . In this case, for any subset B of natural numbers in the definition of the functionals (8) and (11), the expectation limits indicated in Lemma 2 will be infinite (see Section 7 and [9,12]). On the other hand, if , then for any one-point set the expectations mentioned will be bounded uniformly in n (see Section 7 and [9,12]). For more complex functionals with kernels (9) or (10) for the above-mentioned distributions , one can find sufficiently broad conditions that ensure unbounded increase in their expectations and variances as for almost all vectors (see Section 7).
Now we present one of the corollaries of Theorem 3, namely, the law of large numbers for the additive functionals under consideration, setting in this theorem , , and .
Corollary 2.
Let the conditions of Lemma 2 be fulfilled. If as then the following criterion holds:
in this case, the normalizations and can be swapped.
Remark 4.
In consideration of Chebyshev’s inequality, a sufficient condition for the limit relations in Corollary 2 is as follows:
For example, let and . Then and
In particular, this estimate is valid in the case , with , if only .
We now formulate an analog of Lemma 2 for the variances of the functionals under consideration.
Lemma 3.
Under the conditions and the limit relation holds if and only if . In the case of infinite limit the following equivalence is valid: as .
Lemma 3 and Theorem 3 imply the following important criterion, which allows us to reduce proving the central limit theorem for additive functionals to proving the same assertion for the Poissonian version .
Corollary 3.
Under the conditions of Lemma 3 and as the limit relation
is valid if, and only if,
where is the standard normal distribution. In this case, the normalizing and centering sequences in these two limit relations can be, respectively, swapped.
In order to prove this corollary we should put in Theorem 3 , , and . In this case, Lemma 3 allows us only to replace the normalizing and centering sequences in Theorem 3 with some equivalent sequences.
Remark 5.
The validity of the central limit theorem for the sequence in Theorem 3 will be justified if, say, the third-order Lyapunov condition is met:
For example, let . Then it is easy to see that
Thus, if as , then the Lyapunov condition will be met and the approval of the above investigation will take place. So an important special case is included in the scheme at issue if
Note that examples for which the specified variance property takes place or is violated are given, for example, in [9].
Finally, here is another consequence of Theorem 3, relating to the asymptotic behavior of -statistics in (7) at and . First of all, note that
Corollary 4.
Let as . Then the following two asymptotic relations are equivalent:
Note that in the present case, the requirement of Lemma 1 is met, since each term (as a sequence of n) is bounded in probability due to Markov’s inequality, and therefore, with the normalizing sequence , this term will tend to zero in probability as .
Remark 6.
In the relations (15) and (16) we can say just about the double limit when because this assertion is missing restrictions on the rate of increase in the sequence . The proposed formulation in Corollary 4, equivalent to the one just mentioned, is more convenient to refer to Theorem 3. Note that the centering sequence can be replaced with its equivalent sequence . Replacement in the normalization in (15) the variance with the variance of the -statistic itself, i.e., by the term (for example, see [16])
is possible only if these two variances are equivalent. For example, this would be the case if . This means that the growth rate of the sequence is subject to appropriate constraints, which is not the case in the above consequence. So, in this assertion we can talk about a double limit as .
The formulated criterion allows us to establish a fairly general sufficient condition for the asymptotic normality of -statistics with an increasing number of groups.
Theorem 4.
The problem of finding more or less broad sufficient conditions for asymptotic normality -statistics with a growing number of groups were studied by many authors in the second half of the last century (for example, see [3,4,5,16,17,18]). Note that all known sufficient conditions for the above weak convergence imply fulfillment of the asymptotic relation (17). For example, the condition along with (see [17,18]), obviously immediately entails relation (17). It is equally obvious that the requirement of the so-called regularity of multinomial models (see [3,4,5]), i.e.,
where the constants and are independent of N, also implies (17). On the other hand, it is easy to construct examples in which the regularity requirement of the multinomial model is violated but relation (17) is valid. For example, let , , where and . It is easy to see that, as , the sums and increase as and , respectively. Therefore, as , the ratio in (17) is equivalent to
up to a constant factor. So, here we already need to measure the growth rate N with n. Obviously, in this case, in order to fulfill condition (17), you need to require that . If the probabilities decrease exponentially then the growth rate zone for N narrows to . It is worth to note that for the above-mentioned power-type probabilities at issue the condition implies the asymptotic relation that is more restrictive than the above constraint.
6. Probability and Moment Inequalities
The next theorem is related to estimation of the distribution tails of additive functionals.
Theorem 5.
Let for all i. Then, for any ,
where . If additionally then
Remark 7.
Corollary 5.
Under the conditions of Theorem 5, let F be a continuous nondecreasing function defined on , with . If then
As an example, consider the functional defined in (8). Then, as a consequence of (19) and Chernoff’s upper bound [19] for the distribution tail of a sum of independent nonidentically distributed Bernoulli random variables (the transition from finite sums to series in this case is obvious), we obtain the following result.
Corollary 6.
Put . Then for any the following inequality holds:
where .
Remark 8.
one can replace the Poissonian mean in (21) with the mean , which differs from by no more than 1 due to Barbour–Hall’s estimate of the Poisson approximation to a binomial distribution (see [15,20]). Further, if the condition is met as then from (21) we obtain not only the law of large numbers (already formulated in Corollary 2), but at a certain growth rate of the sequence , the strong law of large numbers (SLLN) (see Section 7). If in the case we consider the infinite intervals for any then the SLLN occurs at any speed of increasing the sequence to infinity. This follows from estimate (21), the monotonicity of the functions , and the simple technique in proving SLLN in [9,21].
7. Asymptotic Analysis of the Means and Variances of Additive Statistics
In the previous section, it was noted that when proving certain limit theorems for the introduced additive functionals, it is extremely important to have information about the behavior of their means and variances. In this section, for additive statistics (8)–(11), we demonstrate exactly how the asymptotic behavior of these moments is studied. To simplify the notation, we will consider here the case . The subsequent asymptotic analysis is based on the following elementary assertion, which is presented in one way or another in many papers on this topic.
Lemma 4.
Let be a sequence of non-negative, integrable, and piecewise monotonic functions defined on . Suppose that each has M monotonicity intervals, where M is independent of n. Finally, assume that, as ,
Then, as ,
We now give a few examples of calculating the asymptotics we need.
(1) Let for any . In Remark 3 it was already noted that due to the strong law of large numbers for binomially distributed random variables. However, for specific classes of distributions , one can estimate the growth rate of the sequence . For example, let , where , . Then, using Lemma 4 and the well-known connection between the tail of a Poisson distribution and the corresponding gamma distribution, we obtain after integration by parts and a change of the integration variable:
where , , , are the distribution function of the gamma-distribution with parameters , and the gamma-function, respectively. For example, if then the asymptotics of the expectation of the number of nonempty cells is as follows (see [6,9]):
By analogy to the arguments in proving (22), after an appropriate change of the integration variable, we obtain for the one-point sets the following asymptotics:
Thus, from (24) it follows that for any subset B of the natural series in the case under consideration of a power-law decrease in the following asymptotic representation is true:
Note that, due to the countable additivity of the finite measure and the relations (22)–(24), the sum (possibly infinite) in (25) will always be finite.
Remark 9.
Inequality (21), relation (25), and the Borel–Cantelli lemma guarantee that the strong law of large numbers holds for the sequence for any subsets B of the natural series in the case of a power-law decrease in the probabilities . Moreover, what has been said and the above asymptotics are also preserved for probabilities of the form , where is a slowly varying function under certain minimal constraints (see [9,12]). In this case, in the asymptotic relations (22)–(25) instead of C one should substitute .
Asymptotic behavior of the variances of the functionals for some B and broad conditions on the rate of decrease in the sequence is given in [9]. Here we only demonstrate how this variance is calculated for arbitrary subsets B of the natural series under the above conditions on . Analogously with (22) we have for the infinite intervals :
Similarly to proving (24), we derive the asymptotics of the variance for the one-point sets:
Although the set function is not additive, the extension to arbitrary subsets B of the natural series of computing the asymptotics of presents no difficulty. Along with formula (25), which gives one term in the resulting asymptotics, we use the following representation for the second sum:
Thus, the difference between the right-hand sides of (25) and (28) determines the asymptotic of for any subset of the natural series.
(2) The asymptotics of the first two moments for the functionals (10) for pairwise disjoint sets is derived in exactly the same way. In the case of one-point sets , the asymptotic behavior of the first moment immediately follows from the previous calculations. As for the variance, we should first note that, due to the orthogonality of the indicator random variables under consideration, we have
Summation over i of the resulting expression and the previous calculations give the desired asymptotics:
We note the resulting representation can vanish on the set of vectors of zero Lebesgue measure in , i.e., on the surface defined by the relation for some coefficients .
For infinite intervals of the form , the variance is studied in a similar way. We assume without loss of generality that . To calculate the variance of this functional, it suffices for us to restrict ourselves to the second moment, since the asymptotics of the first one has already been studied. We have
Further calculations in essence have already been made earlier. So, finally we obtain
with comments similar to the above regarding the zeroing of the double sum.
To conclude this section, we give an example where the above-mentioned moments of the functional under consideration do not tend to infinity as n grows. We put , with . Let us show that
This estimate obviously implies that the first two moments of the functional are uniformly bounded in n for . Indeed, one has
here we used the estimate for all , also representing the integral over the semiaxis as a series of integrals over the indicated segments of unit length. If then the integral in the last expression converges monotonically to the quantity , which proves our assertion. Note also that a similar example is given in [9].
8. Proofs
Proof of Theorem 1.
The assertion of the theorem is essentially a consequence of some results from [1,2,22,23]. First we introduce the necessary notation and recall the assertions from [22,23] we need.
Let be a sequence of independent identically distributed random elements taking values in a measurable Abelian group with measurable operation <<+>>. Assume that the zero (neutral) element 0, as a one-point set, belongs to -algebra and . Denote by a sequence of independent identically distributed random variables with marginal distribution
and also put and . In [1,2,22], the following assertion was obtained. □
Lemma 5.
For any natural n, the following representations are valid:
where is the binomial distribution with parameters n and p, is a standard Poisson process; wherein the pair does not depend on the sequence
The second important assertion gives an estimate for the Radon–Nikodim derivative of the binomial distribution with respect to the accompanying Poisson law (see [23]).
Lemma 6.
For all and natural n, the following estimate holds:
Remark 10.
There are other estimates for this Radon–Nikodim derivative. For example, in [24], it was established that
for any n and . Note that for this estimate is more accurate than (30).
It is clear that it is enough to prove the assertion for . A proof of the general case is carried out by induction on m and immediately follows from the total probability formula and an estimate for the conditional probability when coordinates of the vector are fixed. From (29) and (30) and the total probability formula (when the sequence is fixed) we obtain the inequality
Now we put , , where is defined in (1). Consider the Abelian group
and equip this group with the cylindric -algebra. It is clear that and the following is true: . So, inequality (2) follows from (31) and the above-mentioned induction on m.
Proof of Theorem 2.
We will carry out our reasoning in the generality and notation of the proof of Theorem 1. Both relations (29) will be the basis of construction where the sequence is assumed to be the same in constructing the sums and on a common probability space. So, to prove the first two assertion of the theorem, we only need to construct on the common probability space the random variables and so that they would be as close as possible to each other. The resulting probability space will be the direct product of the two probability spaces where are, respectively, defined the sequence of independent identically distributed random variables and the above-mentioned pair of scalar indices. For the optimal definition of random indices and on a common probability space, we use Dobrushin’s theorem (see [25]), which guarantees the existence of marginal copies and of the mentioned random indices defined on a common probability space so that
where is the total variation distance between distributions. Now we use the well-known estimate of Poisson approximation to a binomial distribution (see [15,20]):
Using the described construction to each of the m independent coordinates of the vector point processes under consideration, we easily obtain from (32) and (33) the assertion of the theorem. □
Proof of Lemma 1.
Fix a multi-index . Let us assume that the point processes and are defined on the same probability space in one way or another. Then for any natural k we have the estimate
where . Put . Note that the tail of the series on the right-hand side of inequality (34) is a functional of the -restrictions of the studied vector point processes defined on common probability space. So we can use Theorem 2, which guarantees the existence of an absolute coupling (depending on k) of the mentioned -restrictions with the following lower bound for the coincidence probability (see (4); here, in order not to clutter up the notation, we omit the upper symbol <<*>>):
Hence, the coupling method of Theorem 2 vanishes the first term on the right-hand side of (35) with a probability no less than .
Further, by virtue of estimate (2) we conclude that for any i. Therefore, by virtue of the conditions of the theorem, we have for any i for . So, for any given (obviously, such construction exists) random variable on the same probability space with the -restrictions of the point processes mentioned above, there is the relation for for any fixed k. Therefore, using the diagonal method, one can choose for , for which as . After constructing the point processes under consideration on a common probability space by the method of Theorem 2 for each and already chosen (in this case, obviously, ), the limit relation will hold. Lemma 1 is proved. □
Proof of Theorem 3.
The equivalence of items 1 and 2 directly follows from Lemma 1 and the evident two-sided estimate
for any , , and arbitrary random variables and defined on a common probability space. It remains to put
where the point processes and are defined in Lemma 1.
We now prove the equivalence of items 2 and 3 of the theorem. To this end we need to reformulate the assertion in Lemma 1 where we substitute for the functional . As the resulting probability space in this assertion, we consider the direct product of the probability spaces where and are defined by Dobrushin’s theorem. We only note that, after such construction,
if only . Further, we repeat the corresponding reasoning in the proof of Lemma 1 (using the corresponding analog of (34)) as well as the above-mentioned arguments in proving the equivalence of items 1 and 2. □
Proof of Lemma 2.
We restrict ourselves to the case . For an arbitrary m, the assertion can be easily proved by induction on m using analogues of the estimates that will be given below. So we have
here the introduction of the operator under the summation sign in the second formula is legal due to (14) and Fubini’s theorem. Now, estimate the total variation distance between the distributions of the vectors and :
We now use once more Barbour–Hall’s upper bound (see [15,20]) for the total variation distance between the distributions and :
Then the total variation distance between the distributions of the bivariate vectors under consideration is estimated as follows:
Therefore,
or
From here we obtain the assertion we need. □
Proof of Lemma 3.
As in the proof of Lemma 2, we restrict ourselves to the case . It is clear that we need to examine two series
In the same way as in the proof of Lemma 1, we obtain
Similarly,
From these estimates it follows that
whence we obtain the assertion of Lemma 2. □
Proof of Theorem 4.
By Corollary 4, it suffices to present conditions for the asymptotic normality of the Poisson version of the -statistic, i.e., conditions for the feasibility of relation (16). As such, we take the Lyapunov condition of third order. Indeed, consider the following scheme of series of independent in each series of centered random variables:
The Lyapunov condition of third order, which guarantees the fulfillment of the central limit theorem (16), is as follows:
In order to estimate the absolute third moment in (36), we need the well-known recurrence relation for the central moments of the Poisson distribution:
where is a Poisson random variable with parameter . From here it follows that
and using the elementary estimate , we obtain
It is clear that, to prove relation (36) it suffices to verify that, under the conditions of the theorem,
that is true in virtue of (17). □
Proof of Theorem 5.
For any natural k, denote
In the notation of Theorem 1, let be the restriction of the point process to the set with hit probability . Under the sign of the first probability of the right-hand side of inequality (37), instead of the point process , we can substitute and use inequality (2) for the distributions of the restrictions of the corresponding point processes.
The difference
is also an additive functional of the restriction of the point process to the additional set with hit probability . For this functional, we also use estimate (2). As a result, from (37) and Theorem 1, taking into account the non-negativity of the terms , we obtain
The theorem is proved. □
9. Conclusions
In this paper, we discuss a remarkable asymptotic property of a wide class of additive statistics that allows us to ignore the dependence of the summands in the additive structure of the statistics under consideration and to reduce asymptotic analysis of their distributions to the classical theory of the central limit problem. As consequences, we obtain refinements of certain results concerning the limit behavior of some known classes of additive statistics. Although we limited ourselves only to the law of large numbers and the central limit theorem for the statistics at issue, in the model under consideration it is possible to study sufficient conditions for the weak convergence of their distributions to other infinitely divisible laws as well. In fact, we deal here with a variant of Poisson approximation of empirical point processes, or in other words, with a compound Poisson approximation of an n-th partial sum of independent random variables taking values in some function space. So, in the present paper we deal with the classical subject of Probability Theory and the Poisson approximation of sums of independent multivariate random variables (for example, see [1,12,22,23]).
Moreover, one can reformulate the above-mentioned Poissonization duality theorem for more general U-statistic-type functionals
where is an array of finite functions defined on , with , satisfying only the restriction
For example, in this more general setting, one can study the limit behavior of the functionals
where is the complement of an arbitrary subset , with , and . These functionals count the number of success chains of length m in the dependent (finite or infinite) Bernoulli trials .
Author Contributions
Conceptualization, I.B.; formal analysis, I.B. and M.J.; methodology, I.B.; writing—original draft, I.B. and M.J.; writing—review and editing, I.B. All authors have read and agreed to the published version of the manuscript.
Funding
The study of I. Borisov was supported by the Russian Science Foundation, project no. 22-21-00414.
Acknowledgments
The authors thank the anonymous reviewers for careful reading of the paper and insightful comments and suggestions.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Borisov, I.S. Poisson approximation of the partial sum process in Banach spaces. Sib. Math. J. 1996, 37, 627–634. [Google Scholar]
- Borisov, I.S. Moment inequalities connected with accompanying Poisson laws in Abelian groups. Int. J. Math. Math. Sci. 2003, 44, 2771–2786. [Google Scholar]
- Medvedev, J.I. Some theorems on the asymptotic distribution of the χ2 statistic. Dokl. Akad. Nauk SSSR 1970, 192, 987–989. (In Russian) [Google Scholar]
- Medvedev, Y.I. Decomposable statistics in a polynomial scheme. I. Theory Probab. Appl. 1977, 22, 1–15. [Google Scholar]
- Medvedev, Y.I. Decomposable statistics in a polynomial scheme. II. Theory Probab. Appl. 1978, 22, 607–615. [Google Scholar]
- Bahadur, R.R. On the number of distinct values in a large sample from an infinite discrete distribution. Proc. Nat. Inst. Sci. India 1960, 26A, 67–75. [Google Scholar]
- Kolchin, V.F.; Sevastyanov, B.A.; Chistyakov, V.P. Random Assignments; Nauka: Moscow, Russia, 1976. [Google Scholar]
- Darling, D.A. Some limit theorems associated with multinomial trials. In Fifth Berkeley Symposium on Mathematical Statistics and Probability; Part 1. Berkley–Los Angelos; University of California Press: Berkeley, CA, USA, 1967; Volume 2, pp. 345–350. [Google Scholar]
- Karlin, S. Central limit theorems for certain infinite urn schemes. J. Math. Mech. 1967, 17, 373–401. [Google Scholar]
- Chebunin, M.; Kovalevskii, A.P. Functional central limit theorems for certain statistics in an infinite urn scheme. Stat. Probab. Lett. 2016, 119, 344–348. [Google Scholar]
- Sevastyanov, B.A.; Chistyakov, V.P. Asymptotic Normality in a Classical Problem with Balls. Theory Probab. Appl. 1964, 9, 198–211. [Google Scholar]
- Barbour, A.D.; Gnedin, A.V. Small counts in the infinite occupancy scheme. Electr. J. Probab. 2009, 14, 365–384. [Google Scholar] [CrossRef]
- Fisher, R.A.; Corbet, A.S.; Williams, C.B. The relation between the number of species and the number of individuals in a random sample of an animal population. J. Anim. Ecol. 1943, 12, 42–58. [Google Scholar] [CrossRef]
- Orlitsky, A.; Suresh, A.T.; Wu, Y. Supplementary Information for: Estimating the number of unseen species: A bird in the hand is worth log n in the bush. Proc. Natl. Acad. Sci. USA 2016, 1511, 07428. [Google Scholar]
- Barbour, A.D.; Holst, L.; Janson, S. Poisson Approximation; Oxford University Press: Oxford, UK, 1992. [Google Scholar]
- Kruglov, V.M. The asymptotic behavior of the Pearson statistic. Theory Probab. Appl. 2001, 45, 69–92. [Google Scholar]
- Steck, G.P. Limit Theorems for Conditional Distributions; University of California Press: Berkeley, CA, USA, 1957; Volume 2, pp. 237–284. [Google Scholar]
- Tumanyan, S.K. Asymptotic distribution of χ2-criterion when the size of observations and the number of groups simultaneously increase. Theory Probab. Appl. 1956, 1, 117–131. [Google Scholar]
- Hagerup, T.; Rüb, C. A guided tour of Chernoff bounds. Inf. Process. Lett. 1990, 33, 305–308. [Google Scholar] [CrossRef]
- Barbour, A.D.; Hall, P. On the rate of Poisson convergence. Math. Proc. Camb. Philos. Soc. 1984, 95, 473–480. [Google Scholar]
- Loéve, M. Probability Theory; Nauchnaya Literatura: Moscow, Russia, 1962. (In Russian) [Google Scholar]
- Borisov, I.S. Strong Poisson and mixed approximations of sums of independent random variables in Banach spaces. Sib. Adv. Math. 1993, 3, 1–13. [Google Scholar]
- Borisov, I.S.; Ruzankin, P.S. Poisson approximation for expectations of unbounded functions of independent random variables. Ann. Probab. 2002, 30, 1657–1680. [Google Scholar]
- Borisov, I.S. Approximation of distributions of von Mises statistics with multidimensional kernels. Sib. Math. J. 1991, 32, 554–566. [Google Scholar]
- Dobrushin, R.L. Definition of random variables by conditional distributions. Theory Probab. Appl. 1970, 15, 458–486. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).