Abstract
This article completes our studies on the formal construction of asymptotic approximations for statistics based on a random number of observations. Second order Chebyshev–Edgeworth expansions of asymptotically normally or chi-squared distributed statistics from samples with negative binomial or Pareto-like distributed random sample sizes are obtained. The results can have applications for a wide spectrum of asymptotically normally or chi-square distributed statistics. Random, non-random, and mixed scaling factors for each of the studied statistics produce three different limit distributions. In addition to the expected normal or chi-squared distributions, Student’s t-, Laplace, Fisher, gamma, and weighted sums of generalized gamma distributions also occur.
Keywords:
second order Chebyshev–Edgeworth expansions; negative binomially distributed sample sizes; Pareto-like distributed sample sizes; asymptotically normally distributed statistics; asymptotically chi-square distributed statistics; scaled Student’s t-distribution; normal distribution; discrete Pareto distribution; generalized Laplace distribution; weighted sums of generalized gamma distributions MSC:
62E17; 62H10; 60E05
1. Introduction
To improve the convergence properties of sums of independent identically distributed random variables in the Central Limit Theorem, asymptotic expansions of distribution functions of normalized sums were considered. The history of asymptotic expansions in nonparametric statistics is presented in detail in Wallace [], Bickel [], and Hall [], among others. Chebyshev–Edgeworth expansions, with which we are concerned here, are presented in great detail in Bhattacharya and Rao [] for random vectors and in Petrov [] for one-dimensional random variables. For instance, in Pfanzagl [] and Bentkus et al. [], the authors emphasize that asymptotic expansions can provide more effective approximations for asymptotic studies in statistical theory. Second order approximations of distribution functions of sums of random variables are of great importance because they take into account the skewness and kurtosis of the random variable in addition to the expected value and the variance, as in the Central Limit Theorem. In Burnashev [], second order expansions are proved for the asymptotically normally distributed sample median on a sample of size m and its MSE. Based on this, for a Laplace population with density , the actual MSE with exact data is compared numerically with approximations data. For the normal approximation, the influence of the remaining term is below 10% only for , while for the approximation with the second order expansion, the influence of the remaining term is below 10% already from . For a Cauchy population with smooth and heavy tailed density , for the normal approximation, the influence of the remaining term is below 10% for , while for the approximation with the second order expansion, the influence of the remaining term is below 10% already from . Consequently, as Burnashev [] pointed out, asymptotic expansions can significantly improve the exactness of statistical conclusions, even in the case of a small number of observations. The results in the abovementioned papers are based on non-random sample sizes or non-random number of observations.
When planning statistical studies, situations often arise where the sample sizes are unknown in advance and they are modeled as realizations of random variables. Many models from medicine, finance, risk theory, physics, and reliability lead to samples with random dimensions. For instance, in the papers by Nunes et al. [,,], different models in medical research random size samples were investigated in order to prevent false conclusions. In Esquível et al. [], the authors give an informative overview of statistical inference with a random number of observations and some applications. Results for mean and variance for normally distributed samples, calculation of quantiles, and interval estimates with random sample size were also proved. Döbler [] gives a detailed review of the literature on random sums as well as recent results on approximation in various metrics. In Schluter and Trede [] (Theorem 1, Proposition 1), the authors show, using the convergence of a negative binomial random sum, that the growth rate of cities is Student t-distributed with 2 degrees of freedom. Their empirical investigations verify the result. The references in the above-cited papers provide further applications for random dimension sampling.
Bening et al. [,] proved convergence rates and asymptotic expansions for distributions of statistics based on samples with random dimension . Here, is a statistic based on a non-random number of independent observations. The random variables size form a sequence of integer random sample sizes that depends on a natural parameter n with in probability for . Inequalities with a convergence rate are assumed for the approximations of the distribution functions of both the normalized statistics and the normalized random sample sizes . As examples, convergence rates and first order asymptotic expansions are derived for the statistics , where is an asymptotically normal statistic and the random sample size is either negatively binomial or Pareto-like distributed.
In Christoph et al. [], inequalities for the second order approximations of the distribution functions of normalized negative binomial and Pareto-like sample sizes were proved. Consequently, second order Chebyshev–Edgeworth approximations and the corresponding Cornish–Fisher expansions could be obtained for the distribution of the normalized arithmetic mean of a sample with normalized negative binomial or Pareto-like sample sizes where the remainders are of order .
The present work provides a supplement to our paper, Christoph and Ulyanov [], where we have developed a formal second order design for asymptotic Chebyshev–Edgeworth approximations. We considered asymptotically normal statistics with sample size having negative binomial distribution as well as asymptotically chi-squared statistics with Pareto-like distributed sample sizes. In addition to the distributions of statistic and random sample size , three scaling factors for are also introduced, leading to different expansions. It is the first paper to consider approximations for asymptotic chi-square statistics based on random sample sizes. Some more applications of random sample size sampling were also mentioned.
In the present paper, we provide similar results for asymptotically normal statistics of samples with Pareto-like distributed sample sizes and for asymptotically chi-squared statistics with sample size having negative binomial distribution.
For better reader convenience, we list in Section 2 some notations, conditions, and statements that were also used in Christoph and Ulyanov []. Section 3 states the necessary approximations for the statistics and the sample sizes . The dependence of the limit distributions of the scaled statistic on the distributions of the statistic and the sample size , as well as the scaling factors, is discussed in Section 4. Section 5 then presents the main results. As examples, we consider the same statistic as in Christoph and Ulyanov [] (Corollaries 1 and 2), but with changed sample sizes. Section 6 provides the proofs of the main results, leaving three auxiliary lemmas to Appendix A. Conclusions are presented in Section 7.
2. Notation and Preliminaries
Let be a probability space on which all occurring random variables are given.
Set positive numbers, real axis, integer part of real y, and indicator function as follows:
Let be independent identically distributed random variables. Define the statistic
based on the random sample with a non-random sample size .
Consider the sequence of discrete random variables , depending on an integer parameter . This integer indicates the random dimension of the observations . Let us assume that the sample size does not depend on , where in probability when . Define for each the statistic obtained from a random sample by
It follows from Esquível et al. [] (Theorem 2.1.1) that the statistic is well-defined in (1).
Since we want to prove second order approximations for the statistic in form of inequalities, we need the corresponding assumptions for the statistic and for the random sample size as well.
For the statistic with and the random sample sizes we suppose conditions on the structure of the approximating functions as well as on the convergence rate:
Assumption 1.
There are a distribution function , bounded functions , which are differentiable for all , , as well as such that
Assumption 2.
There exists a distribution function with , a bounded variation function , a sequence of numbers , , and such that for
Remark 1.
Assumptions 1 and 2 require inequalities for the approximations of and for all , leading to inequalities for the approximations of . See also Remark 5 below on Poisson and binomial random variables . For these sample sizes, we are so far only aware of estimates of the remaining terms with small-o or large- convergence rates. About the differences between inequalities and order bounds, see, e.g., Fujikoshi and Ulyanov [] (Chapter 1).
Remark 2.
In Bening et al. [], these conditions are formulated more generally. Assumption 1 requires the existence of ,..., with and Assumption 2 that of ,..., with . We restrict ourselves here, as in Christoph and Ulyanov [], to the required approximation functions.
Assumptions 1 and 2 lead to the approximations for the distribution functions of statistics :
Proposition 1.
Bening et al. [] proved general transfer theorems under the conditions indicated in Remark 2 only for case . Therefore, the proof is repeated in Christoph and Ulyanov [] (Appendix A.1).
3. Second Order Estimates for Both the Statistics and the Sample Sizes
First we consider the following statistics with non-random sample size m and with the corresponding second order approximations. Let the asymptotically normal statistic satisfy the following inequality:
with and refers to the standard normal distribution function with density function :
Asymptotically chi-squared distributed statistics satisfy the following inequality:
where , , denotes the chi-squared distribution function with d degrees of freedom and the density function :
In Christoph and Ulyanov [] (Sections 3.1 and 3.2), some examples of such statistics are given that satisfy (7) or (8) and consequently, Assumption 1.
As already announced, we consider the following random sample sizes with the corresponding second order approximations.
The Pareto-like random sample sizes are defined as follows:
Let , be independent discrete Pareto II random variables with parameter , which are discretized from continuous Lomax (Pareto II) random variables on , for a review, see, e.g., Buddana and Kozubowski []. For , there are defined
Proposition 2.
(Christoph and Ulyanov [], Proposition 4) Let be the discrete Pareto-like random variable whose distribution function is given in (9); then, for all integers and fixed positive , we have
with jump correcting function and does not depend on n. Furthermore,
with optimal bound in (12) for , where a is the convergence rate in (7).
Remark 3.
The inverse exponential random variable with distribution function and rate parameter is “heavy tailed” with shape parameter 1 as is . Thus, the expected values of these two random variables do not exist.
Suppose the positive integer has a (shifted by 1) negative binomial distribution with probability of success , , parameter , probabilities
In statistical studies, for counting models, the negative binomial and Poisson distributions are the two most important ones. In Schluter and Trede [] (Section 2.1), the authors emphasize that the negative binomial distribution with its two parameters can typically observe over-dispersion in count data, while this is not the case with the one-parameter Poisson distribution. They proved in a more general framework
while denotes the gamma distribution that has identical scale and shape parameters , whose density is
In Bening and Korolev [] (Lemma 2.2), the result (14) was also obtained.
Proposition 3.
(Christoph and Ulyanov [], Proposition 3) Let . The discrete random variable has probabilities and expected value given in (13). Then, for all :
where does not depend on n and with the jump correcting function ,
Moreover, negative moments satisfy the estimation for all ,
and the convergence rate in case cannot be improved.
Remark 4.
Second order Chebyshev–Edgeworth expansions (10) and (15) with were first proved in Christoph et al. [] (Theorems 4 and 1). Approximations in (10) and (15) with remainder estimations or are given, e.g., in Bening et al. [] and Gavrilenko et al. []. In Christoph et al. [] (Corollaries 5.4 and 6.5), leading terms for the negative moments of and are derived that lead to (17) and (12).
Remark 5.
The negative binomial distribution belongs to the class of Panjer distributions, which also includes the Poisson and binomial distributions. Samples with binomial or Poisson distributed sample sizes were studied among others in the above-cited papers [,,,]. Convergence rate bounds for statistics based on such samples are given in Döbler [], Korolev [], Bulinski and Slepov []. Döbler [], Korolev and Shevtsova [], Sunklodas [] obtained Berry–Esseen bounds for sums based on samples with binomial and Poisson sample sizes. To the best of the authors’ knowledge, Chebyshev–Edgeworth expansions for these lattice distributed random variables have only been proven so far with bounds of small-o or large- rates, see, e.g., Petrov [] (Chapter 6, Theorem 6) or Kolassa and McCullagh []. Therefore, inequality (3) in Assumption 2 is not fulfilled.
4. Limit Distributions of Statistics with Random Size Samples using Different Scaling Factors
We now consider the statistics and the sample sizes , which are supposed to satisfy the inequalities (2) and (3) in Assumptions 1 and 2, respectively. Let us investigate the scaled statistics with the sequence as . We analyze the two cases and as limiting distributions F in Assumption 1 with respect to the exponents and : If , then and , while if , then and . Then, conditioning on and using (2) and (3), we have
Consequently, the limit distribution of the scaled statistic is a scale mixture of underlying F with mixing distribution H: , as Refer to, e.g., Choy and Chan [], Fujikoshi et al. [] (Chapter 13), and Fujikoshi and Ulyanov [] (Chapter 2) and the references therein.
The limiting distributions therefore only arise from the leading distributions and in the inequalities (2) and (3) and also depend on the parameter .
In Christoph and Ulyanov [] (Sections 5 and 6), the cases with as well as with were considered. Now, we interchange the distributions of random sample sizes . We first study the limiting distributions of asymptotically normally distributed statistics with Pareto-like distributed sample sizes and also asymptotically chi-squared distributed statistics with negative binomial distributed sample sizes . Since and hold, the integral range in the last integral in (18) can be extended from to for further investigations.
4.1. The Case and
In Christoph and Ulyanov [,], asymptotically normally distributed statistics for samples of m-dimensional normally distributed vectors were considered: correlation coefficient as well as the three geometric features: the length of a vector, the distance, and the angle between two vectors. Inequalities for second order approximations for statistic are derived when the dimension m is replaced by Pareto-like distributed random dimension . For the median of a sample with random sample size analogous results are shown in Christoph et al. [] (Section 6). All these asymptotically normally distributed statistics with Pareto-like random dimensions or sample sizes have the same limiting distribution.
Let . Since , we choose as . Then, the limit laws for
with corresponding densities
Therefore, the limit distributions are the Laplace law with density and scale parameter for , the standard normal law and density for and for the scaled Student’s t-distribution with 2 degrees of freedom and density . These mixed scale distributions are discussed in more detail in Christoph and Ulyanov [] (Section 4.2).
4.2. The Case and
Asymptotically chi-squared distributed statistics of samples with random sample size were considered for the first time in Christoph and Ulyanov [] in case of , .
Now, negatively binomial distributed sample sizes are considered. With and , the limit distributions for
The corresponding densities are
We prove (20) for in Section 6 in the proof of Theorem 2.
The scale mixtures are the (scaled by d) F-distribution with parameters and and density for , the chi-squared distribution with d degrees of freedom and density for and a gamma distribution of generalized type occurs with density for . The modified Bessel function of the third kind or Macdonald functions also occurred in Christoph and Ulyanov [,] in generalized gamma and Laplace densities.
Remark 6.
The Macdonald function satisfying order-reflection formula and may be expressed for with integer m in closed forms. In Oldham et al. [] (Formulas 51:4:1 and 26:13:3), the Macdonald functions for are explicitly given. Using Prudnikov et al. [] (Formulas 2.3.16.1-3), the densities can be calculated:
Example 1.
Some densities for :
Remark 7.
If is an integer, the distribution functions of the densities can also be calculated explicitly by substitution and partial integration.
Example 2.
Distribution functions for given densities with :
Remark 8.
The generalized gamma distribution has two shape parameters α and β, a scale parameter λ, and the density
The density (25) is given in Korolev and Zeifman [] and Korolev and Gorshenin [] and summarizes many known densities. Generalized gamma distributions are defined in many different ways, but they do not correspond to the ones that occur above.
Remark 9.
The densities with integer are generalized gamma densities given in formula (25) or may be represented as linear combinations of such densities. The parameters and apply in all densities . The parameter β also depends on the number of derivatives in the densities .
Example 3.
Some linear combinations of generalized gamma densities:
5. Main Results
Inequalities for approximations to scaled statistics for will be presented. Here, and when the statistic is asymptotically normally distributed, or and when normalized has chi-squared limit distribution.
5.1. Asymptotically Normal Statistics and Pareto-like Sample Sizes
Let asymptotically normal statistic satisfy inequality (7) with coefficients and the rate of convergence . The Pareto-like sample size , , is given in (9), which fulfills the inequality (10). For the scaling factors, select and in formula (18).
Theorem 1.
Under the conditions given above, the following approximations apply:
- i:
- ii:
- Let . The random scaling factor with leads to the normal approximation :where is the rate of convergence in (7) and
- iii:
As applications of the Theorem 1, we now examine the Student t-distribution, the Student t-test statistic, and the sample mean as asymptotically normal statistics considered in Christoph and Ulyanov [] (Section 3.1 and Corollary 1) for the case of negative binomial sample sizes .
Corollary 1.
Let the conditions of Theorem 1 be satisfied:
- i:
- Let . In case of the Student’s t-statistic with m degrees of freedom estimated in [] (Formula (18)), inequality (7) is valid with , and . The non-random scaling factor and Pareto-like sample sizes lead to:
- ii:
- Let . Let be the Student’s t-statistic with sample mean and sample variance , which was considered in [] (Formulas (21) and (20)). The first order approximation (7) with , , , the Pareto-like random sample sizes and the random scaling factor result in:
- iii:
- Let . Considering sample mean estimated in [] (Formulas (15) and (16)), one has (7) with , , , , , Pareto-like random sample sizes and mixed scaling factor , thenwith
5.2. Asymptotically Chi-Squared Distributed with Negative Binomially Distributed Sample Sizes
Let the asymptotically chi-squared distributed statistics satisfy inequality (8) with coefficients , and the rate of convergence . The negative binomially distributed sample sizes with parameter and success probability are given in (13) and fulfill the inequality (15). For the scaling factors, choose and in formula (18).
Theorem 2.
Under the conditions given above, the following approximations apply.
- i:
- Let . The non-random scaling factor at statistics leads to approximations by the scaled F-distribution having parameters and and density given in (20) with :where
- ii:
- For and random scaling factor at , the approximation does not change:where
- iii:
- Let and . The mixed scaling factor at results in a gamma distribution of generalized type with density given in (20) for :whereThe restriction in Theorem 2(iii) has a purely proof-technical character. In Proposition 4, a result is shown with .
Remark 10.
The function can be calculated explicitly for with integer . Then, for example, and .
Example 4.
As applications to Theorem 2, we now examine Hotelling’s distribution and normalized quotients of two independent chi-square distributions as asymptotic chi-square distributions, considered in Christoph and Ulyanov [] (Section 3.2 and Corollary 2) where the sample sizes had Pareto-like distribution.
Corollary 2.
The conditions of the Theorem 2 shall be fulfilled:
- i:
- Let . Consider Hotelling’s generalized -statistic with independently distributed random matrices and having Wishart distributions and , respectively. Then, inequality (8) holds with limit distribution , , and . The non-random scaling factor by leads towhere the scaled F-distribution with density is given in (20) for
- ii:
- Let , and be independent and be scale mixtures satisfying inequality (8) with coefficients and . Random degrees of freedom instead of m and random scaling factor lead towhere
- iii:
- Let . The statistics satisfy the inequality (8) with the limiting distribution and the coefficients and . The mixed scaling factor at results in a limiting gamma distribution of generalized type . Only if is an integer, the involved Macdonald functions may be explicitly calculated. Since , we choose and find . Then, uniformly in :where and are specified in (23).
Remark 11.
6. Proofs
For the proofs of Theorems 1 and 2, we use Proposition 1. The statistics and the sample size are either asymptotically normally and discretely Pareto-like distributed (i.e., and ) or asymptotically chi-squared and negatively binomially distributed (i.e., and ). In both cases, the size defined in (6) is uniformly bounded for all , see Christoph and Ulyanov [] (Lemma A1). Next, the bounds that are required in (4) for the negative moments of sample sizes and are provided by (12) and (17). Furthermore, it follows from Christoph and Ulyanov [] (Proposition 2 and Lemma A2) that in both cases the domain of integration of the integrals in the function defined in (5) can be extended from to :
where if and or if and , respectively, and
We still have to calculate the integrals in (29) that contain , , and , respectively.
Proof of Theorem 1.
We now consider , and . Here, , and we divide the function given in (11) into two parts: and . The densities of the limit distributions were given in (20). If to calculate the integrals in (29) involving , and we use Prudnikov et al. [] (Formulas 2.3.16.2 and 2.3.16.3):
for , and , respectively. The corresponding integral with was estimated in Christoph et al. [] (see Proof of Theorem 5) by .
In case of , we obtain . To calculate the integrals with and we use [] (Formula 2.3.3.1) with and :
If , the integrals with , and are calculated using (31) with and . From Christoph and Ulyanov [] (see Proof of Theorem 8), it follows that holds: and Theorem 1 is proved. □
Proof of Theorem 2.
Now, we consider the case , and . This combination has not yet been studied in the literature. Only if , there is a result by Monahkov []; see Remark 11 above. Then, , and we divide the function given in (16) into two parts: and .
For , the density in (20) and the integrals in (29) with and are computed with (31) for . The integral with is estimated in (A1) in Lemma A1. Together with the inequality , we get (26).
In case of , we obtain . To calculate the integrals with , we use (31) with and .
If the density in (20) and the integrals with and are calculated using Prudnikov et al. [] (Formula 2.3.16.1):
with , and . We use the order-reflection formula and the recursion formula; see Oldham et al. [] (Chapter 51.5):
The integral with is estimated in (A4) in Lemma A2 and Theorem 2 is proved. □
7. Conclusions
The common goal of the present work and that of Christoph and Ulyanov [] is to develop formal second order Chebyshev–Edgeworth expansions for sample statistics with random sample sizes. Corresponding expansions are assumed for the statistics with non-random sample sizes as well as for the random sample sizes. The statistics examined are asymptotically normally distributed and, for the first time in this setting, also asymptotically chi-squared distributed. The random sample sizes have negative binomial or Pareto-like distributions. The formal construction of the approximating functions allows the results to be used for a whole family of asymptotically normal or chi-squared distributed statistics. The Student t-distribution with m degrees of freedom, the one-sample Student t-test statistic, and the sample mean are considered as examples of asymptotic normal statistics. Hotelling’s generalized statistic and scale mixture of a normalized quotient of two independent chi-squared random variables were studied as examples of the asymptotic chi-squared distributions. In addition, random, non-random, and mixed scaling factors for the statistics are considered, which have a significant influence on the limit distributions. The limit laws are scale mixtures of the normal with mixing gamma or chi-squared with mixing inverse exponential distributions. In addition to the normal distribution and the chi-square distribution, there are a variety of limit distributions: the Laplace, the scaled Student t-, the scaled Fisher, the generalized gamma, and linear combinations of generalized gamma distributions.
The remaining terms in the approximations of the scaled statistics are estimated by inequalities.
Author Contributions
Conceptualization, G.C. and V.V.U.; methodology, V.V.U. and G.C.; formal analysis, G.C. and V.V.U.; investigation, G.C. and V.V.U.; writing—original draft, G.C. and V.V.U.; writing—review and editing, V.V.U. and G.C. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding. It was carried out within the project, “Analysis of the quality of approximations in the statistical analysis of multivariate observations” of the Magdeburg University, the program of the Moscow Center for Fundamental and Applied Mathematics, Lomonosov Moscow State University, and HSE University Basic Research Programs.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
The authors thank the Editor for his support and the Reviewers for their appropriate comments which have improved the quality of this paper.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Auxiliary Lemmas
Lemma A1.
Let then
Proof of Lemma A1.
We use the Fourier series expansion of the jump correcting function at all non-integer points y; see Prudnikov et al. [] (Formula 5.4.2.9 for ):
and Prudnikov et al. [] (Formula 2.5.31.4):
Integration by parts in the integral , using (A2), interchanging sum and integral and applying (A3) with , and leads to
with
Now, we split the exponent and obtain
Since , we find uniform in
and Lemma A1 is proved. □
Lemma A2.
Let , then
Proof of Lemma A2.
Integration by parts in the integral , using the Fourier series expansion (A2), interchanging sum and integral, we find
with
In the literature, we have only found integrals with power functions and . Therefore, we integrate by parts in the integral :
where , and
Hence,
Lemma A2 is proved. □
Lemma A3.
Let , , and , then
Proof of Lemma A3.
Integration by parts in the integral , using the Fourier series expansion (A2), interchanging sum and integral, we find
with
Using Prudnikov et al. [] (Formula 2.5.37.3), with the real constants , and , we obtain
It was shown in Christoph et al. [] (Proof of Theorem 5) that Leibniz’s integral rule allows differentiation to q under the integral sign in (A6). Therefore,
Since , , , , and we find ,
and
Lemma A3 is proved. □
References
- Wallace, D.L. Asymptotic approximations to distributions. Ann. Math. Statist. 1958, 29, 635–654. [Google Scholar] [CrossRef]
- Bickel, P.J. Edgeworth expansions in nonparametric statistics. Ann. Statist. 1974, 2, 1–20. [Google Scholar] [CrossRef]
- Hall, P. The Bootstrap and Edgeworth Expansion; Springer Series in Statistics; Springer: New York, NY, USA, 1992. [Google Scholar]
- Bhattacharya, R.N.; Ranga Rao, R. Normal Approximation and Asymptotic Expansions; Wiley: New York, NY, USA, 1976. [Google Scholar]
- Petrov, V.V. Limit Theorems of Probability Theory, Sequences of Independent Random Variables; Clarendon Press: Oxford, UK, 1995. [Google Scholar]
- Pfanzagl, J. Asymptotic expansions related to minimum contrast estimators. Ann. Statist. 1973, 1, 993–1026. [Google Scholar] [CrossRef]
- Bentkus, V.; Götze, F.; van Zwet, W.R. An Edgeworth expansion for symmetric statistics. Ann. Statist. 1997, 25, 851–896. [Google Scholar] [CrossRef]
- Burnashev, M.V. Asymptotic expansions for median estimate of a parameter. Theory Probab. Appl. 1997, 41, 632–645. [Google Scholar] [CrossRef]
- Nunes, C.; Capistrano, G.; Ferreira, D.; Ferreira, S.S.; Mexia, J.T. Exact critical values for one-way fixed effects models with random sample sizes. J. Comput. Appl. Math. 2019, 354, 112–122. [Google Scholar] [CrossRef]
- Nunes, C.; Capistrano, G.; Ferreira, D.; Ferreira, S.S.; Mexia, J.T. Random sample sizes in orthogonal mixed models with stability. Comp. Math. Methods 2019, 1, e1050. [Google Scholar] [CrossRef]
- Nunes, C.; Mário, A.; Ferreira, D.; Moreira, E.M.; Ferreira, S.S.; Mexia, J.T. An algorithm for simulation in mixed models with crossed factors considering the sample sizes as random. J. Comput. Appl. Math. 2022, 404, 113463. [Google Scholar] [CrossRef]
- Esquível, M.L.; Mota, P.P.; Mexia, J.T. On some statistical models with a random number of observations. J. Stat. Theory Pract. 2016, 10, 805–823. [Google Scholar] [CrossRef]
- Döbler, C. New Berry-Esseen and Wasserstein bounds in the CLT for non-randomly centered random sums by probabilistic methods. ALEA Lat. Am. J. Probab. Math. Stat. 2015, 12, 863–902. [Google Scholar]
- Schluter, C.; Trede, M. Weak convergence to the Student and Laplace distributions. J. Appl. Probab. 2016, 53, 121–129. [Google Scholar] [CrossRef]
- Bening, V.E.; Galieva, N.K.; Korolev, V.Y. On rate of convergence in distribution of asymptotically normal statistics based on samples of random size. Ann. Math. Inform. 2012, 39, 17–28. [Google Scholar]
- Bening, V.E.; Galieva, N.K.; Korolev, V.Y. Asymptotic expansions for the distribution functions of statistics constructed from samples with random sizes. Inform. Appl. 2013, 7, 75–83. (In Russian) [Google Scholar]
- Christoph, G.; Monakhov, M.M.; Ulyanov, V.V. Second-order Chebyshev-Edgeworth and Cornish-Fisher expansions for distributions of statistics constructed with respect to samples of random size. J. Math. Sci. 2020, 244, 811–839, Translated from Zap. Nauchnykh Semin. POMI 2017, 466, 167–207. [Google Scholar] [CrossRef]
- Christoph, G.; Ulyanov, V.V. Chebyshev–Edgeworth-type approximations for statistics based on samples with random sizes. Mathematics 2021, 9, 775. [Google Scholar] [CrossRef]
- Fujikoshi, Y.; Ulyanov, V.V. Non-Asymptotic Analysis of Approximations for Multivariate Statistics; Springer: Singapore, 2020. [Google Scholar]
- Christoph, G.; Ulyanov, V.V. Second order expansions for high-dimension low-sample-size data statistics in random setting. Mathematics 2020, 8, 1151, Reprinted in Special Issue: Stability Problems for Stochastic Models: Theory and Applications; Zeifman, A.; 57 Korolev, V.; Sipin, A., Eds.; MPDI: Basel, Switzerland, 2021; pp. 259–286.. [Google Scholar] [CrossRef]
- Buddana, A.; Kozubowski, T.J. Discrete Pareto distributions. Econ. Qual. Control. 2014, 29, 143–156. [Google Scholar] [CrossRef]
- Bening, V.E.; Korolev, V.Y. On the use of Student’s distribution in problems of probability theory and mathematical statistics. Theory Probab. Appl. 2005, 49, 377–391. [Google Scholar] [CrossRef]
- Gavrilenko, S.V.; Zubov, V.N.; Korolev, V.Y. The rate of convergence of the distributions of regular statistics constructed from samples with negatively binomially distributed random sizes to the Student distribution. J. Math. Sci. 2017, 220, 701–713. [Google Scholar] [CrossRef]
- Christoph, G.; Ulyanov, V.V.; Bening, V.E. Second order expansions for sample median with random sample size. ALEA Lat. Am. J. Probab. Math. Stat. 2022, 19, 339–365. [Google Scholar] [CrossRef]
- Korolev, V. Bounds for the rate of convergence in the generalized Rényi theorem. Mathematics 2022, 10, 4252. [Google Scholar] [CrossRef]
- Bulinski, A.; Slepov, N. Sharp estimates for proximity of geometric and related sums distributions to limit laws. Mathematics 2022, 10, 4747. [Google Scholar] [CrossRef]
- Korolev, V.; Shevtsova, I. An improvement of the Berry-Esseen inequality with applications to Poisson and mixed Poisson random sums. Scand. Actuar. J. 2012, 2012, 81–105. [Google Scholar] [CrossRef]
- Sunklodas, J.K. On the normal approximation of a binomial random sum. Lith. Math. J. 2014, 54, 356–365. [Google Scholar] [CrossRef]
- Petrov, V.V. Sums of Independent Random Variables; Akademie-Verlag: Berlin, Germany, 1975. [Google Scholar]
- Kolassa, J.E.; McCullagh, P. Edgeworth series for lattice distributions. Ann. Statist. 1990, 18, 981–985. [Google Scholar] [CrossRef]
- Choy, T.B.; Chan, J.E. Scale mixtures distributions in statistical modelling. Aust. N. Z. J. Stat. 2008, 50, 135–146. [Google Scholar] [CrossRef]
- Fujikoshi, Y.; Ulyanov, V.V.; Shimizu, R. Multivariate Statistics. High-Dimensional and Large-Sample Approximations; Wiley Series in Probability and Statistics; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2010. [Google Scholar]
- Christoph, G.; Ulyanov, V.V. Random dimension low sample size asymptotics. In Recent Developments in Stochastic Methods and Applications; Shiryaev, A.N., Samouylov, K.E., Kozyrev, D.V., Eds.; Springer Proceedings in Mathematics & Statistics; Springer International Publishing: Cham, Switzerland, 2021; Volume 371, pp. 215–228. ISBN 978-3-030-83266-7 and 978-3-030-83266-0. [Google Scholar] [CrossRef]
- Oldham, K.B.; Myland, J.C.; Spanier, J. An Atlas of Functions, 2nd ed.; Springer Science + Business Media: New York, NY, USA, 2009. [Google Scholar]
- Prudnikov, A.P.; Brychkov, Y.A.; Marichev, O.I. Integrals and Series, Vol. 1: Elementary Functions, 3rd ed.; Gordon & Breach Science Publishers: New York, NY, USA, 1992. [Google Scholar]
- Korolev, V.Y.; Zeifman, A.I. Generalized negative binomial distributions as mixed geometric laws and related limit theorems. Lith. Math. J. 2019, 59, 366–388. [Google Scholar] [CrossRef]
- Korolev, V.Y.; Gorshenin, A. Probability models and statistical tests for extreme precipitation based on generalized negative binomial distributions. Mathematics 2020, 8, 604. [Google Scholar] [CrossRef]
- Monakhov, M.M. Chebyshev–Edgeworth expansions for distributions of generalized Hotelling-type statistics based on random size samples. Inform. Primen. [Informatics Its Appl.] 2021, 15, 72–81. (In Russian) [Google Scholar] [CrossRef]
- Fujikoshi, Y.; Ulyanov, V.V.; Shimizu, R. L1-norm error bounds for asymptotic expansions of multivariate scale mixtures and their applications to Hotelling’s generalized T02. J. Multivar. Anal. 2005, 96, 1–19. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).