Abstract
Only a few count smoothers are available for the widespread use of discrete associated kernel estimators, and their constructions lack systematic approaches. This paper proposes the mean dispersion technique for building count kernels. It is only applicable to count distributions that exhibit the underdispersion property, which ensures the convergence of the corresponding estimators. In addition to the well-known binomial and recent CoM-Poisson kernels, we introduce two new ones, namely the double Poisson and gamma-count kernels. Despite the challenging problem of obtaining explicit expressions, these kernels effectively smooth densities. Their good performance is demonstrated through both numerical and comparative analyses, particularly for small and moderate sample sizes. The optimal tuning parameter is investigated here through integrated squared errors. Faster computation times are an added advantage. Overall, the accuracy of the two newly suggested kernels appears to lie between that of the two existing ones. Finally, an application including a tail probability estimation on a real count dataset and some concluding remarks are given.
Keywords: binomial kernel; CoM-Poisson kernel; double Poisson distribution; gamma-count distribution; integrated squared errors; mode dispersion; normalizing constant
MSC: 62G05; 62G07; 62G99
1. Introduction
Nonparametric statistics use (discrete) asymmetric kernel methods to capture and visually represent complex relationships between variables that cannot be effectively captured by symmetric kernels. A common practice now is to employ kernels whose support coincides with the nature or support of the dataset, whether it is count, categorical, bounded, unbounded, continuous, left- or right-skewed, and so on. See, for example, refs. [1,2,3] for smoothing of probability mass, probability density, and regression functions with environmental, econometric and financial applications, among others. Discrete smoothing of probability mass functions for count and categorical data has not been studied as extensively as its continuous counterpart, primarily due to the limited options for suitable kernels. There are two main classes of discrete associated kernels. The first, called “second-order” kernels (with $\delta = 0$ in Definition 1, i.e., a vanishing limiting variance), includes kernels whose estimators are consistent and asymptotically tend towards the true function, as for the Dirac kernel. Such discrete (asymmetric and symmetric) kernels are, for instance, the triangular [4], Aitchison–Aitken or Dirac Uniform [5], Wang and Van Ryzin [6], and the recently proposed CoM-Poisson [7,8] kernels; see also [3]. The second class, called “first-order” kernels (with $\delta \in (0,1)$ in Definition 1, i.e., a nonzero limiting variance), mainly contains the binomial kernel [4], which is appropriate for small and moderate sample sizes and whose estimators do not converge. The previous discrete kernels are suitable for categorical data, with the exception of the binomial and CoM-Poisson kernels, which are appropriate for count data.
Additionally, all these associated kernels are typically underdispersed, i.e., their variance is less than their expectation. In contrast, the equidispersed Poisson and overdispersed negative binomial kernels are not recommended for count smoothing; see [4], whose authors conducted intensive simulations that highlight the superiority of these underdispersed discrete smoothers over equi-/over-dispersed ones. The reader can also refer to [9,10] for some applications of discrete kernels in survey sampling and the model specification test, respectively, and more generally to Li and Racine [11]. Here is the precise definition of a discrete associated kernel; see, e.g., Esstafa et al. [8].
Definition 1.
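Let $\mathbb{T} \subseteq \mathbb{R}$ be the discrete support of the probability mass function (pmf) f to be estimated, $x \in \mathbb{T}$ a target point and $h > 0$ a bandwidth. A parameterized pmf $K_{x,h}(\cdot)$ on the discrete support $\mathbb{S}_{x,h} \subseteq \mathbb{R}$ is called “discrete associated kernel” if the following conditions are satisfied:
$$\lim_{h \to 0} E(Z_{x,h}) = x \quad \text{and} \quad \lim_{h \to 0} \mathrm{Var}(Z_{x,h}) = \delta \in [0, 1),$$
where $Z_{x,h}$ denotes the discrete random variable with pmf $K_{x,h}(\cdot)$.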
We suppose $X_1, \ldots, X_n$ is a sample of independent and identically distributed (iid) discrete random variables with a pmf f defined on $\mathbb{T}$. The usual discrete associated kernel estimator $\widehat{f}_n$ of f is generally not a pmf. This is particularly true for some discrete associated kernels such as the binomial, triangular, and CoM-Poisson, where the total mass of the corresponding estimator does not necessarily equal one; see, e.g., [12]. Specifically, one can express the normalized and unnormalized estimators as follows:
$$\widetilde{f}_n(x) = \frac{\widehat{f}_n(x)}{C_n}, \quad x \in \mathbb{T}, \qquad (1)$$
with
$$\widehat{f}_n(x) = \frac{1}{n} \sum_{i=1}^{n} K_{x,h_n}(X_i) \quad \text{and} \quad C_n = \sum_{x \in \mathbb{T}} \widehat{f}_n(x), \qquad (2)$$
where $(h_n)_{n \ge 1}$ is an arbitrary sequence of positive smoothing (or tuning) parameters that satisfies $\lim_{n \to \infty} h_n = 0$, while $K_{x,h_n}(\cdot)$ is a suitably chosen discrete kernel function. For the three kernels of Dirac, Aitchison and Aitken [5] and Wang and Van Ryzin [6], it is easy to check that $C_n = 1$, and therefore $\widetilde{f}_n = \widehat{f}_n$. Esstafa et al. [8] recently demonstrated the effectiveness of the normalized version (1) compared to the unnormalized one (2), with illustrations using the existing count smoothers: binomial (non-convergent) and CoM-Poisson (convergent). We here use the normalized version (1) throughout.
In nonparametric (discrete or continuous) kernel estimation, the tuning (smoothing) parameter plays a crucial role in preventing overfitting and underfitting. Bandwidth selection methods can be categorized into three families: global bandwidths for all smoothers, adaptive ones for continuous kernels, and local ones for discrete (count and categorical) estimators. For example, Chu et al. [13] proposed a rule-of-thumb approach, Harfouche et al. [1] utilized cross-validation, and Somé et al. [2] employed a local Bayesian method. It is important to note that smoothers using local bandwidths, which vary according to each estimation point, are referred to as “balloon estimators”, while those using adaptive bandwidths, which vary for each data point, are known as “sample-point estimators”.
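The relation between the unnormalized estimator (2) and its normalized version (1) can be illustrated with a minimal Python sketch. It assumes the usual form of the binomial kernel of [4], namely a binomial pmf with x + 1 trials and success probability (x + h)/(x + 1). The sample, the truncated support and all function names are hypothetical choices made only for this illustration.

```python
from math import comb

def binomial_kernel(x, h, y):
    """Assumed binomial kernel of [4]: Binomial(x + 1, (x + h)/(x + 1)) pmf at y."""
    trials = x + 1
    if y < 0 or y > trials:
        return 0.0
    p = (x + h) / (x + 1)
    return comb(trials, y) * p**y * (1.0 - p) ** (trials - y)

def unnormalized_estimator(x, h, sample, kernel=binomial_kernel):
    """Kernel average (1/n) * sum_i K_{x,h}(X_i); its total mass need not be one."""
    return sum(kernel(x, h, xi) for xi in sample) / len(sample)

def normalized_estimator(h, sample, support, kernel=binomial_kernel):
    """Divide the unnormalized estimates by their total mass C_n over the support."""
    raw = {x: unnormalized_estimator(x, h, sample, kernel) for x in support}
    c_n = sum(raw.values())
    return {x: value / c_n for x, value in raw.items()}, c_n

# Hypothetical count sample; the support is truncated at 20 for illustration.
sample = [0, 1, 1, 2, 2, 2, 3, 4, 4, 6]
support = range(0, 21)
f_tilde, c_n = normalized_estimator(0.1, sample, support)
print(f"C_n = {c_n:.4f}, mass after normalization = {sum(f_tilde.values()):.4f}")
```

Passing another kernel function with the same `(x, h, y)` signature would give the corresponding normalized smoother without changing the rest of the sketch.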
In this paper, we propose two new count kernels, namely the double Poisson and the gamma-count, derived from the underdispersed parts of the corresponding distributions. These additions enrich the list of existing count kernels, namely the binomial and CoM-Poisson ones. Specifically, they reinforce the roster of “second-order” count kernels, which until now contained only the CoM-Poisson, and whose smoothers are consistent and asymptotically tend towards the Dirac kernel. Their construction is performed through the mean dispersion technique, which appears as a variant of the mode dispersion method [12] used in continuous cases. The rest of the paper is organized as follows. In Section 2, we recall some underdispersed count distributions, including the double Poisson and gamma-count, which have some specific properties in their expressions. Section 3 is devoted to building the double Poisson and gamma-count kernels after introducing the mean dispersion method of construction, despite some approximations in their properties. Section 4 presents the main results from our simulation studies and then an application to a count dataset on development days of insect pests on Hura trees. Final remarks are made in Section 5. Some other underdispersed count distributions and local Bayesian bandwidth selection are mentioned in Appendix A and Appendix B in relation to their feasibility.
2. Some Properties of Underdispersed Count Distributions
In this section, we recall three count distributions, namely the double Poisson, the gamma-count and the CoM-Poisson, which are underdispersed over part of their parameter space. Before building their corresponding associated kernels to satisfy Definition 1, we point out the main properties needed (pmf, mean and variance), even though these are generally not available in closed form. Thus, approximation and computation approaches are used for a better understanding of the parameters.
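- The double Poisson pmf is defined by
$$p_{DP}(y; \mu, \gamma) = c(\mu, \gamma)\, \gamma^{1/2} e^{-\gamma \mu} \left( \frac{e^{-y} y^{y}}{y!} \right) \left( \frac{e \mu}{y} \right)^{\gamma y}, \quad y \in \mathbb{N}, \qquad (3)$$
with
$$\frac{1}{c(\mu, \gamma)} = \sum_{y=0}^{\infty} \gamma^{1/2} e^{-\gamma \mu} \left( \frac{e^{-y} y^{y}}{y!} \right) \left( \frac{e \mu}{y} \right)^{\gamma y}$$
and where $\gamma > 0$ is the dispersion parameter and $\mu > 0$. The mean and variance do not have closed-form expressions but they can be approximated, respectively, by
$$E(Y) \approx \mu \quad \text{and} \quad \mathrm{Var}(Y) \approx \mu / \gamma.$$
We note that the values $0 < \gamma < 1$, $\gamma = 1$ and $\gamma > 1$ correspond to overdispersion, equidispersion and underdispersion, respectively. See Efron [14] and Toledo et al. [15] for further details.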
- The gamma-count pmf for the number of events $N_T$ within the time interval $(0, T)$ is given, with $\alpha, \beta > 0$, through
$$P(N_T = y) = G(\alpha y, \beta T) - G(\alpha (y + 1), \beta T), \quad y \in \mathbb{N}, \qquad (4)$$
with the cumulative distribution function
$$G(\alpha y, \beta T) = \frac{1}{\Gamma(\alpha y)} \int_{0}^{\beta T} u^{\alpha y - 1} e^{-u}\, du \quad \text{for } y \ge 1,$$
and $G(0, \beta T) = 1$; T can be set to one, without loss of generality. The parameter $\alpha$ is such that $\alpha > 1$ and $0 < \alpha < 1$ refer to underdispersion and overdispersion, respectively. Here, the mean and variance are not available in closed form but they can be computed through
$$E(N_T) = \sum_{y=1}^{\infty} G(\alpha y, \beta T) \quad \text{and} \quad \mathrm{Var}(N_T) = \sum_{y=1}^{\infty} (2y - 1)\, G(\alpha y, \beta T) - \left[ E(N_T) \right]^{2}.$$
See Winkelmann [16] for further details, Zeviani et al. [17] for an application to a regression model, and also [15].
Numerically and from Figure 1, we can observe that the mean of the gamma-count distribution is almost always a constant around ; specifically, by zooming in, we notice that the shape of the curve is logarithmic or approximately linear in for fixed . The same fact is observed for its mode, as shown in Figure 2. We also note that Figure 2 highlights the role of as a shape or location parameter and as a scale or dispersion parameter of the gamma-count distribution. Hence, the variance of the gamma-count distribution can be seen as a function of .
Figure 1. Computation of the mean of gamma-count distribution according to and .
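Figure 2. Some gamma-count distributions according to (a) and also to (b) fixed.
- The CoM-Poisson distribution with location parameter $\mu \ge 0$ and dispersion parameter $\nu > 0$ ($\nu > 1$ for underdispersion) has a pmf defined by
$$p_{CMP}(y; \mu, \nu) = \frac{\lambda(\mu, \nu)^{y}}{(y!)^{\nu}\, Z(\lambda(\mu, \nu), \nu)}, \quad y \in \mathbb{N}, \qquad (5)$$
where the function $\lambda := \lambda(\mu, \nu)$ is the solution to the equation
$$\sum_{y=0}^{\infty} (y - \mu) \frac{\lambda^{y}}{(y!)^{\nu}} = 0$$
and is used to define the normalizing constant $Z(\lambda, \nu) = \sum_{y=0}^{\infty} \lambda^{y} / (y!)^{\nu}$. Then,when as . See, for example, ref. [8] for some references.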
Also, we can refer to Appendix A for some underdispersed count distributions such as the BerPoi, generalized Poisson, an underdispersed Poisson, the BerG and the hyper-Poisson, for which their corresponding count associated kernels are inconclusive.
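As a numerical companion to the double Poisson and gamma-count distributions recalled in this section, the following Python sketch evaluates their pmfs and moments. It assumes the standard forms of Efron [14] and Winkelmann [16] with T = 1, obtains the double Poisson normalizing constant by truncated summation, and computes the regularized incomplete gamma function by a power series. The function names, truncation points and parameter values are illustrative assumptions, not the authors' implementation.

```python
from math import exp, lgamma, log

def double_poisson_pmf(mu, gamma, y_max=200):
    """Double Poisson pmf on {0, ..., y_max}, normalized by summation."""
    weights = []
    for y in range(y_max + 1):
        # log of gamma^(1/2) e^(-gamma mu) (e^-y y^y / y!) (e mu / y)^(gamma y),
        # with the convention 0^0 = 1 at y = 0
        log_w = 0.5 * log(gamma) - gamma * mu
        if y > 0:
            log_w += -y + y * log(y) - lgamma(y + 1)
            log_w += gamma * y * (1.0 + log(mu) - log(y))
        weights.append(exp(log_w))
    total = sum(weights)
    return [w / total for w in weights]

def reg_lower_gamma(a, x, tol=1e-15, max_terms=10000):
    """Regularized lower incomplete gamma P(a, x) from its power series."""
    if x <= 0:
        return 0.0
    term = exp(a * log(x) - x - lgamma(a + 1))
    total, k = term, 1
    while term > tol * total and k < max_terms:
        term *= x / (a + k)
        total += term
        k += 1
    return min(total, 1.0)

def gamma_count_pmf(alpha, beta, y_max=100):
    """Gamma-count pmf with T = 1: G(alpha*y, beta) - G(alpha*(y+1), beta)."""
    cdf = [1.0] + [reg_lower_gamma(alpha * j, beta) for j in range(1, y_max + 2)]
    return [cdf[y] - cdf[y + 1] for y in range(y_max + 1)]

def moments(pmf):
    mean = sum(y * p for y, p in enumerate(pmf))
    var = sum((y - mean) ** 2 * p for y, p in enumerate(pmf))
    return mean, var

# Illustrative underdispersed settings (gamma > 1 and alpha > 1).
dp_mean, dp_var = moments(double_poisson_pmf(mu=5.0, gamma=4.0))
gc_mean, gc_var = moments(gamma_count_pmf(alpha=2.0, beta=10.0))
print(f"double Poisson: mean = {dp_mean:.3f}, variance = {dp_var:.3f}")
print(f"gamma-count:    mean = {gc_mean:.3f}, variance = {gc_var:.3f}")
```

Setting `gamma=1.0` or `alpha=1.0` recovers the Poisson distribution in both cases, which gives a simple sanity check on the two pmfs.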
3. Associated Kernel Versions
We introduce in this section the notion of the mean dispersion-ready pmf, a new method inspired by the mode dispersion technique (see, for example, [12]) and adapted to the discrete setting. This method allows construction of discrete associated kernels and is applicable to underdispersed count distributions.
Definition 2.
A mean dispersion-ready pmf is an underdispersed parametrized pmf with discrete support , , such that has moments of second order with mode and admitting dispersion parameter D.
Remark 1.
Let be a mean dispersion-ready pmf on . The following two assertions are satisfied:
- (i)
- the mode m of always belongs to ;
- (ii)
- if μ is the mean of , then , where denotes the integer part.
In order to create discrete associated kernels from an underdispersed unimodal mean dispersion-ready pmf defined on , the mean dispersion method requires, if it exists, an explicit solution of the following system of equations:
It should be noted that this construction may not always be possible, and alternative methods can be found in [8,12,18].
Now, we illustrate the use of (6) in four examples, namely the two new double Poisson and gamma-count kernels, as well as the existing CoM-Poisson and binomial kernels.
Example 1.
The double Poisson kernel of the second order and underdispersed for any is defined on for each ,
where is the normalizing constant. It comes from (3) with the reparametrization of the system which implies
as , where is the count random variable associated to this double Poisson kernel.
Example 2.
The gamma-count kernel, which exhibits the underdispersion phenomenon for any , is derived from (4) with parametrization . It is defined on for each and any by
Example 3.
The CoM-Poisson kernel of the second order and underdispersed for any is defined with for each ,
where is the normalizing constant and represents a function of x and given by the solution of
One can refer to [7,8] for further details. This construction implies that and
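The implicit equation defining the CoM-Poisson rate has no closed-form solution, so it must be solved numerically. The sketch below is one possible approach under stated assumptions: it uses the mean-parametrized CoM-Poisson form with pmf proportional to rate^y / (y!)^nu, truncates the support at `y_max`, and finds the rate by bisection on its logarithm. The kernel parametrization (mean x, dispersion 1/h) is a hypothetical choice in the spirit of [7], not necessarily the one used in this paper.

```python
from math import exp, lgamma

def cmp_pmf(log_rate, nu, y_max=200):
    """CoM-Poisson pmf proportional to rate^y / (y!)^nu on {0, ..., y_max}."""
    log_w = [y * log_rate - nu * lgamma(y + 1) for y in range(y_max + 1)]
    shift = max(log_w)  # subtract the maximum to avoid overflow
    weights = [exp(v - shift) for v in log_w]
    total = sum(weights)
    return [w / total for w in weights]

def cmp_mean(log_rate, nu, y_max=200):
    return sum(y * p for y, p in enumerate(cmp_pmf(log_rate, nu, y_max)))

def solve_log_rate(mu, nu, lo=-500.0, hi=500.0, iterations=100):
    """Bisection on log(rate); the CoM-Poisson mean increases with the rate."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if cmp_mean(mid, nu) < mu:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def cmp_kernel(x, h, y_max=200):
    """Hypothetical parametrization (an assumption): mean x, dispersion 1/h."""
    nu = 1.0 / h
    return cmp_pmf(solve_log_rate(float(x), nu), nu, y_max)

kernel = cmp_kernel(x=3, h=0.2)
mean = sum(y * p for y, p in enumerate(kernel))
var = sum((y - mean) ** 2 * p for y, p in enumerate(kernel))
print(f"mean = {mean:.6f}, variance = {var:.4f}")
```

Repeating this root search for every target point and every candidate bandwidth is costly, which is in line with the longer execution times reported for this kernel.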
Example 4.
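The first-order and underdispersed binomial kernel is introduced by Kokonendji and Senga Kiessé [4] as follows: for each $x \in \mathbb{N}$ and $h \in (0, 1]$,
$$B_{x,h}(y) = \frac{(x+1)!}{y!\,(x+1-y)!} \left( \frac{x+h}{x+1} \right)^{y} \left( \frac{1-h}{x+1} \right)^{x+1-y}, \quad y \in \{0, 1, \ldots, x+1\},$$
with $E(Z_{x,h}) = x + h \to x$ and $\mathrm{Var}(Z_{x,h}) = (x+h)(1-h)/(x+1) \to x/(x+1) \in [0, 1)$ as $h \to 0$.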
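The first-order behaviour of the binomial kernel can be checked numerically. The short Python sketch below assumes the usual form of this kernel from [4] (a binomial pmf with x + 1 trials and success probability (x + h)/(x + 1)) and compares its mean and variance with x + h and (x + h)(1 - h)/(x + 1) as h decreases. The target x = 4 and the bandwidth values are arbitrary illustrative choices.

```python
from math import comb

def binomial_kernel_pmf(x, h):
    """Assumed binomial kernel of [4] on its support {0, ..., x + 1}."""
    trials = x + 1
    p = (x + h) / (x + 1)
    return [comb(trials, y) * p**y * (1.0 - p) ** (trials - y)
            for y in range(trials + 1)]

def mean_and_variance(pmf):
    mean = sum(y * p for y, p in enumerate(pmf))
    var = sum((y - mean) ** 2 * p for y, p in enumerate(pmf))
    return mean, var

x = 4
for h in (0.5, 0.1, 0.001):
    mean, var = mean_and_variance(binomial_kernel_pmf(x, h))
    print(f"h = {h}: mean = {mean:.4f} (x + h = {x + h:.4f}), "
          f"variance = {var:.4f} (limit x/(x+1) = {x / (x + 1):.4f})")
```

The variance approaches x/(x + 1) rather than zero, so for x > 0 the kernel does not collapse to a Dirac mass at the target, in contrast with second-order kernels.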
Figure 3 and Figure 4 show different behaviours of these four underdispersed count kernels at the origin and at , respectively, according to three values of the bandwidth . Hence, the two newly suggested count kernels appear to be better competitors to the second-order CoM-Poisson kernel compared to the binomial one.
Figure 3.
Gamma-count (GC), double Poisson (DP) kernels with parametrization compared to binomial (B) and CoM-Poisson (CMP) at with (a) ; (b) ; (c) .
Figure 4.
Double Poisson and gamma-count kernels with parametrization compared to CoM-Poisson (CMP) and binomial kernels at with (a) ; (b) ; (c) .
4. Simulation Studies and an Application to Real Data
The purpose of all numerical studies conducted here is to investigate the performances of the two new double Poisson and gamma-count kernels alongside the classical binomial and CoM-Poisson smoothers derived from (1) and (2). Computations are conducted on a 2.30 GHz PC using R 4.2.1 [19]. The previous smoothers are fitted using the rmutil, Ake and mpcmp packages [20,21,22], respectively. The corresponding four underdispersed count kernel estimators are assessed by employing the integrated squared error (ISE) method to determine the optimal bandwidth parameter
$$h_{ise} = \arg\min_{h > 0} \sum_{x \in \mathbb{N}} \left\{ \widetilde{f}_n(x) - f(x) \right\}^{2}. \qquad (7)$$
In fact, the usual cross-validation (data-driven) technique does not converge in simulations and real data for the proposed kernel estimators, namely double Poisson and gamma-count. For other methods, the reader can refer to Chu et al. [13] for the plug-in method, Harfouche et al. [1] for cross-validation and Kokonendji and Senga Kiessé [4] for mean integrated squared errors.
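In this study, we examine the performances of four count smoothers using simulated count datasets under four different scenarios denoted by A, B, C, and D. These scenarios are chosen to assess how well the estimators handle zero inflation, unimodality and multimodality. We evaluate the effectiveness of the smoothers by analyzing the empirical estimates of the ISE, specifically
$$\widehat{ISE} = \frac{1}{N_{sim}} \sum_{t=1}^{N_{sim}} \sum_{x \in \mathbb{N}} \left\{ \widetilde{f}_n^{(t)}(x) - f(x) \right\}^{2},$$
where $N_{sim}$ is the number of replications, n denotes the sample size and $\widetilde{f}_n^{(t)}$ is the estimate obtained from the t-th replication.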
- Scenario A is generated by using the Poisson distribution
- Scenario B comes from the zero-inflated Poisson distribution
- Scenario C is from a mixture of two Poisson distributions
- Scenario D comes from a mixture of three Poisson distributions
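To make the ISE-based bandwidth selection concrete, the following Python sketch simulates one sample from a two-component Poisson mixture and selects the bandwidth of the binomial smoother by a grid search on the ISE against the known pmf. The mixture weights and rates, the sample size, the truncated support and the grid are all hypothetical values chosen for illustration, and the binomial kernel form is assumed from [4].

```python
import random
from math import comb, exp, lgamma, log

def poisson_pmf(y, rate):
    return exp(-rate + y * log(rate) - lgamma(y + 1))

def true_pmf(y):
    # Hypothetical two-component Poisson mixture (illustrative parameters).
    return 0.4 * poisson_pmf(y, 2.0) + 0.6 * poisson_pmf(y, 10.0)

def draw_poisson(rate, rng):
    """Knuth's multiplication method; adequate for small rates."""
    limit, count, product = exp(-rate), 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count

def binomial_kernel(x, h, y):
    trials = x + 1
    if y < 0 or y > trials:
        return 0.0
    p = (x + h) / (x + 1)
    return comb(trials, y) * p**y * (1.0 - p) ** (trials - y)

def normalized_estimate(h, sample, support):
    raw = {x: sum(binomial_kernel(x, h, xi) for xi in sample) / len(sample)
           for x in support}
    total = sum(raw.values())
    return {x: value / total for x, value in raw.items()}

def ise(h, sample, support):
    """Integrated squared error of the normalized estimate on the support."""
    f_tilde = normalized_estimate(h, sample, support)
    return sum((f_tilde[x] - true_pmf(x)) ** 2 for x in support)

rng = random.Random(2023)
sample = [draw_poisson(2.0 if rng.random() < 0.4 else 10.0, rng)
          for _ in range(50)]
support = range(0, 31)
grid = [k / 100 for k in range(1, 101)]
h_ise = min(grid, key=lambda h: ise(h, sample, support))
print(f"h_ise = {h_ise:.2f}, ISE = {ise(h_ise, sample, support):.5f}")
```

Averaging the minimized ISE over many independently simulated samples would give the empirical criterion used to compare the smoothers.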
Table 1 presents the computation times required to perform all ISE bandwidth selection techniques (7) for gamma-count, double Poisson, binomial and CoM-Poisson smoothers based on a single replication of sample sizes ranging from to 500 for the target function C. For all sample sizes, the results show that the CoM-Poisson is the most time consuming followed by the double Poisson smoother mainly due to the normalizing constant in their expressions, (5) and (3), respectively. As the sample sizes increase, the binomial kernel outperforms in terms of CPU times due to its support , whereas the gamma-count kernel becomes the second quickest due to the integrals in its expression.
Table 1.
Comparison of execution times (in seconds) for one replication of Scenario C using gamma-count (gc), double Poisson (dp), binomial (b) and CoM-Poisson (cmp) kernel estimates.
Figure 5 depicts the true pmf and the smoothed ones using gamma-count, double Poisson, binomial and CoM-Poisson kernels for Scenario C and for one replication. The graphs show that, in general, the two new underdispersed count kernel estimators are accurate.
Figure 5.
True pmf and estimated ones by gamma-count (GC), double Poisson (DP), binomial (B) and CoM-Poisson (CMP) kernels for the bimodal Scenario C with (a) ; (b) .
Table 2 exhibits some empirical values of , obtained through ISE bandwidth selection (7) using as number of replications, according to the four Scenarios A, B, C and D and with respect to sample sizes Then, several behaviours emerge. As the sample sizes increase, the smoothings improve for all smoothers. As expected, the binomial kernel is the least efficient since it is of the first order. The three others have comparable performances. The two new count kernels, namely double Poisson and gamma-count, are slightly more precise than the CoM-Poisson one, notably for small and medium sample sizes (i.e., ) while the latter is the best for large sample sizes. Additionally, approximations made for the moments of the gamma-count distribution (4) may help clarify the performance discrepancy between the two new kernels for larger sample sizes. Finally, from a purely practical perspective, Table 1 and Table 2 highlight the following ranking in performances: double Poisson, gamma-count and CoM-Poisson.
Table 2.
Empirical mean values of with their standard deviations in parentheses over replications and with different sample sizes under four Scenarios A, B, C and D by using gamma-count (gc), double Poisson (dp), binomial (b) and CoM-Poisson (cmp) kernel estimators with the bandwidth selection.
Now, we apply these four underdispersed count kernels for smoothing the real count dataset on development days of insect pests on Hura trees with moderate sample size ; see also [8]. Practical performances are here examined via the empirical method (7) and the empirical criterion of :
where is the empirical or naive estimator. The double Poisson and the CoM-Poisson kernels are comparable and appear to be the best with , and , followed by the binomial smoother with and and finally the gamma-count smoother with and .
Figure 6 offers their graphical representations. We also evaluate the practical upper tail probability suitable for applied statisticians. Then, these tail probabilities are estimated to be , , , and for the empirical frequency , gamma-count, double Poisson, binomial and CoM-Poisson kernel estimations, respectively. Although the double Poisson and the CoM-Poisson have similar performances, we recommend, again, the first one, which is more flexible and much faster; see Table 1.
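The two practical quantities used for the real data, namely the squared distance between a smoothed pmf and the empirical frequencies, and an upper tail probability, can be computed as in the Python sketch below. The data values, the tail threshold and the bandwidth are hypothetical stand-ins (the actual dataset is not reproduced here), and the binomial kernel of [4] is assumed as the smoother purely for illustration.

```python
from collections import Counter
from math import comb

def binomial_kernel(x, h, y):
    trials = x + 1
    if y < 0 or y > trials:
        return 0.0
    p = (x + h) / (x + 1)
    return comb(trials, y) * p**y * (1.0 - p) ** (trials - y)

def smoothed_pmf(h, sample, support):
    """Normalized kernel estimate on a truncated support."""
    raw = {x: sum(binomial_kernel(x, h, xi) for xi in sample) / len(sample)
           for x in support}
    total = sum(raw.values())
    return {x: value / total for x, value in raw.items()}

def empirical_pmf(sample, support):
    """Empirical (naive) frequency estimator."""
    counts = Counter(sample)
    return {x: counts[x] / len(sample) for x in support}

def upper_tail(pmf, threshold):
    """Estimated probability of observing a count >= threshold."""
    return sum(p for x, p in pmf.items() if x >= threshold)

# Hypothetical counts standing in for the real dataset.
data = [3, 4, 4, 5, 5, 5, 6, 6, 7, 7, 8, 9, 10, 12, 15]
support = range(0, 31)
f0 = empirical_pmf(data, support)
f_tilde = smoothed_pmf(0.1, data, support)
ise0 = sum((f_tilde[x] - f0[x]) ** 2 for x in support)
print(f"ISE_0 = {ise0:.5f}")
print(f"tail (empirical) = {upper_tail(f0, 10):.4f}, "
      f"tail (smoothed) = {upper_tail(f_tilde, 10):.4f}")
```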
Figure 6.
Empirical frequency with its corresponding gamma-count (GC), double Poisson (DP), binomial (B) and CoM-Poisson (CMP) kernel estimates of count dataset of insect pests on Hura trees with .
5. Summary and Final Remarks
We introduced two novel underdispersed count kernels, specifically the double Poisson and gamma-count ones. They were developed using the proposed mean dispersion method. We also considered the integrated squared error method (7) to select the bandwidth of their corresponding estimators as quickly and efficiently as possible. Through simulation experiments and real count data analysis, we demonstrated that the performance of these kernels falls between that of the CoM-Poisson kernel (which performs the best) and that of the binomial kernel (which performs the worst). Although the CoM-Poisson and double Poisson kernels have similar performances, we strongly recommend using the latter due to its significantly lower computation time and the flexibility offered by some of its closed-form expressions.
We note that not every underdispersed count distribution leads to a corresponding associated kernel; see Appendix A and also [23,24]. It would also be worthwhile to improve the bandwidth selection with data-driven methods; Appendix B indicates a direction for local Bayesian bandwidth selection. In addition, an important fact for smoothing a pmf on with is to consider, for instance, the k-shifted version of any underdispersed count kernel. In fact, the two main properties of the associated kernel, as recalled in Definition 1, are first to adapt the support of the kernel to and second to maintain the variance property which tends to as .
Author Contributions
Conceptualization, C.C.K. and S.M.S.; methodology, C.C.K. and S.M.S.; software, S.M.S.; validation, C.C.K. and S.M.S.; formal analysis, C.C.K. and S.M.S.; investigation, S.M.S.; resources, C.C.K. and S.M.S.; data curation, C.C.K. and S.M.S.; writing—original draft preparation, C.C.K. and S.M.S.; writing—review and editing, C.C.K., S.M.S., Y.E. and M.B.; visualization, C.C.K. and S.M.S.; supervision, C.C.K.; funding acquisition: C.C.K.; project administration: C.C.K., S.M.S. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded, for the first author, by “Brazilian-French Network in Mathematics”.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
All data used in the study are publicly available.
Acknowledgments
Part of this paper was written during the visit of the first author to the Department of Statistics, Universidade Federal do Rio Grande do Norte, Natal, RN, Brasil, which was supported by a grant from the “Brazilian-French Network in Mathematics”. The authors are grateful to the Associate Editor and three anonymous referees for their constructive comments.
Conflicts of Interest
The authors declare no conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| cmp | CoM-Poisson |
| dp | Double Poisson |
| gc | gamma-count |
| iid | Independent and identically distributed |
| ISE | Integrated squared error |
| pmf | Probability mass function |
Appendix A. Some Other Underdispersed Count Distributions for Kernels Attempts
We provide five count distributions, namely BerPoi, generalized Poisson, underdispersed Poisson, BerG, and hyper-Poisson, which can exhibit the underdispersion property and have closed-form expressions for their pmf, mean, and variance. However, it is not possible to construct their corresponding associated kernels. Thus, an alternative approach to the proposed mean dispersion method may be necessary.
- The BerPoi distribution has its pmf,with and . Its mean and variance are, respectively,Bourguignon et al. [25] propose a reparametrization by mean and dispersion index ; that is, and with and . It follows from this parametrization thatandwith conditions and to ensure (A1) being underdispersed and a proper pmf.
- The generalized Poisson (GP) is defined through its pmf aswith and ; see Harris et al. [26]. The corresponding mean and variance are given byWe thus obtain underdispersion for .
- The pmf of the so-called Underdispersed Poisson distribution of Singh et al. [27] is given, for and , bywith
- The BerG distribution is defined bywith parameters and . Its mean and variance are, successively,This model presents overdispersion, equidispersion and underdispersion for , and , respectively. See Bourguignon and de Medeiros [28] for further details.
- The hyper-Poisson distribution, initially proposed by Bardwell and Crow [29], is defined as follows:with , for , r a positive integer, andas the confluent hypergeometric series. The mean and variance areThis distribution is overdispersed if , equidispersed if and underdispersed if . See also [29,30] for some details.
Appendix B. Local Bayesian Bandwidths of Discrete Kernels
Among the three approaches of the Bayesian bandwidths (global, adaptive and local), it is known that the local one is appropriate for discrete kernel estimators.
Hence, our approach involves treating h as a tuning parameter of the pmf and then constructing a Bayesian estimator for h using . We assume a prior distribution for h, and then apply the Bayes theorem to obtain the posterior distribution of h at the (local) point of estimation x:
Since f is unknown, we use in Equation (1) as the natural estimator of f, and afterward we can estimate the posterior by the so-called posterior density as
Under the squared error loss function, the Bayes estimator of the smoothing (tuning) parameter h is the mean of the previous posterior density given by
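Since the smoothing parameter h here belongs to $(0, 1]$, a natural univariate prior distribution of h is the beta distribution with positive parameters $\alpha$ and $\beta$:
$$\pi(h) = \frac{1}{B(\alpha, \beta)}\, h^{\alpha - 1} (1 - h)^{\beta - 1},$$
where $h \in (0, 1]$ and $B(\alpha, \beta)$ is the Euler beta function defined by
$$B(\alpha, \beta) = \int_{0}^{1} t^{\alpha - 1} (1 - t)^{\beta - 1}\, dt.$$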
Then, the posterior becomes
where specific of double Poisson, gamma-count, CoM-Poisson and binomial kernels are, respectively,
and
For instance, only the local bandwidths of the binomial kernel estimator have the exact expressions as
with ; see, e.g., Somé et al. [2] for more details in univariate and multivariate setups.
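When no exact expression is available, the posterior mean of h at a target point can be approximated numerically. The Python sketch below is a minimal illustration under stated assumptions: it uses a Beta(a, b) prior on (0, 1), the unnormalized kernel average as the estimate of f (a simplification made here), the binomial kernel form of [4], and a midpoint rule for the two integrals. The sample, the prior parameters and the function names are hypothetical.

```python
from math import comb

def binomial_kernel(x, h, y):
    trials = x + 1
    if y < 0 or y > trials:
        return 0.0
    p = (x + h) / (x + 1)
    return comb(trials, y) * p**y * (1.0 - p) ** (trials - y)

def local_bayes_bandwidth(x, sample, a=1.0, b=1.0, grid_size=2000):
    """Posterior mean of h at target x under a Beta(a, b) prior (midpoint rule)."""
    numerator = denominator = 0.0
    for k in range(grid_size):
        h = (k + 0.5) / grid_size
        # Unnormalized beta density; the constant B(a, b) cancels in the ratio.
        prior = h ** (a - 1.0) * (1.0 - h) ** (b - 1.0)
        # Kernel average used as the estimate of f at x (assumption of this sketch).
        f_hat = sum(binomial_kernel(x, h, xi) for xi in sample) / len(sample)
        numerator += h * f_hat * prior
        denominator += f_hat * prior
    if denominator == 0.0:
        raise ValueError("no observation lies in the kernel support at this target")
    return numerator / denominator

# Hypothetical count sample and one local bandwidth per target point.
sample = [0, 1, 1, 2, 2, 2, 3, 4, 4, 6]
bandwidths = {x: local_bayes_bandwidth(x, sample) for x in range(0, 7)}
for x, h in bandwidths.items():
    print(f"x = {x}: local bandwidth = {h:.4f}")
```

Because each target point receives its own bandwidth, the resulting smoother is a balloon estimator in the terminology of the Introduction.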
References
- Harfouche, L.; Adjabi, S.; Zougab, N.; Funke, B. Multiplicative bias correction for discrete kernels. Stat. Methods Appl. 2018, 27, 253–276.
- Somé, S.M.; Kokonendji, C.C.; Belaid, N.; Adjabi, S.; Abid, R. Bayesian local bandwidths in a flexible semiparametric kernel estimation for multivariate count data with diagnostics. Stat. Methods Appl. 2023, 32, 843–865.
- Racine, J.S.; Li, Q. Nonparametric estimation of regression functions with both categorical and continuous data. J. Econom. 2004, 119, 99–130.
- Kokonendji, C.C.; Senga Kiessé, T. Discrete associated kernels method and extensions. Stat. Methodol. 2011, 8, 497–516.
- Aitchison, J.; Aitken, C.G.G. Multivariate binary discrimination by the kernel method. Biometrika 1976, 63, 413–420.
- Wang, M.; Van Ryzin, J. A class of smooth estimators for discrete distributions. Biometrika 1981, 68, 301–309.
- Huang, A.; Sippel, L.; Fung, T. Consistent second-order discrete kernel smoothing using dispersed Conway-Maxwell-Poisson kernels. Comput. Stat. 2022, 37, 551–563.
- Esstafa, Y.; Kokonendji, C.C.; Somé, S.M. Asymptotic properties of the normalised discrete associated-kernel estimator for probability mass function. J. Nonparametric Stat. 2023, 35, 355–372.
- Sánchez-Borrego, I.; Opsomer, J.D.; Rueda, M.; Arcos, A. Nonparametric estimation with mixed data types in survey sampling. Rev. Mat. Complut. 2014, 27, 685–700.
- Hsiao, C.; Li, Q.; Racine, J.S. A consistent model specification test with mixed discrete and continuous data. J. Econom. 2007, 140, 802–826.
- Li, Q.; Racine, J.S. Nonparametric Econometrics: Theory and Practice; Princeton University Press: Princeton, NJ, USA, 2023.
- Kokonendji, C.C.; Somé, S.M. On multivariate associated kernels to estimate general density functions. J. Korean Stat. Soc. 2018, 47, 112–126.
- Chu, C.Y.; Henderson, D.J.; Parmeter, C.F. Plug-in bandwidth selection for kernel density estimation with discrete data. Econometrics 2015, 3, 199–214.
- Efron, B. Double exponential families and their use in generalized linear regression. J. Am. Stat. Assoc. 1986, 81, 709–721.
- Toledo, D.; Umetsu, C.A.; Camargo, A.F.M.; Rodrigues De Lara, I.A. Flexible models for non-equidispersed count data: Comparative performance of parametric models to deal with underdispersion. AStA Adv. Stat. Anal. 2022, 106, 473–497.
- Winkelmann, R. Duration dependence and dispersion in count-data models. J. Bus. Econ. Stat. 1995, 13, 467–474.
- Zeviani, W.M.; Ribeiro, P.J., Jr.; Bonat, W.H.; Shimakura, S.E.; Muniz, J.A. The Gamma-count distribution in the analysis of experimental underdispersed data. J. Appl. Stat. 2014, 41, 2616–2626.
- Jin, X.; Kawczak, J. Birnbaum-Saunders and lognormal kernel estimators for modelling durations in high frequency financial data. Ann. Econom. Financ. 2003, 4, 103–124.
- R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2022; Available online: http://cran.r-project.org/ (accessed on 28 March 2023).
- Swihart, B.; Lindsey, J. Rmutil: Utilities for Nonlinear Regression and Repeated Measurements Models, R Package Version 1.1.0. 2017. Available online: https://CRAN.R-project.org/package=rmutil (accessed on 28 March 2023).
- Wansouwé, W.E.; Somé, S.M.; Kokonendji, C.C. Ake: An R package for discrete and continuous associated kernel estimations. R J. 2016, 8, 258–276.
- Fung, T.; Alwan, A.; Wishart, J.; Huang, A. Mpcmp: Mean-Parametrized Conway-Maxwell Poisson (COM-Poisson) Regression, R Package Version 0.3.6. 2020. Available online: https://cran.r-project.org/web/packages/mpcmp/index.html (accessed on 28 March 2023).
- Cahoy, D.; Di Nardo, E.; Polito, F. Flexible models for overdispersed and underdispersed count data. Stat. Pap. 2021, 62, 2969–2990.
- Louzayadio, C.G.; Malouata, R.O.; Koukouatikissa, M.D. A weighted Poisson distribution for underdispersed count data. Int. J. Stat. Probab. 2021, 10, 157.
- Bourguignon, M.; Gallardo, D.I.; Medeiros, R.M. A simple and useful regression model for underdispersed count data based on Bernoulli–Poisson convolution. Stat. Pap. 2022, 63, 821–848.
- Harris, T.; Yang, Z.; Hardin, J.W. Modeling underdispersed count data with generalized Poisson regression. Stata J. 2012, 12, 736–747.
- Singh, B.P.; Singh, G.; Das, U.D.; Maurya, D.K. An Under-Dispersed Discrete Distribution and Its Application. J. Stat. Appl. Probab. Lett. 2021, 8, 205–213.
- Bourguignon, M.; de Medeiros, R.M. A simple and useful regression model for fitting count data. Test 2022, 31, 790–827.
- Bardwell, G.E.; Crow, E.L. A two-parameter family of hyper-Poisson distributions. J. Am. Stat. Assoc. 1964, 59, 133–141.
- Sáez-Castillo, A.J.; Conde-Sánchez, A. A hyper-Poisson regression model for overdispersed and underdispersed count data. Comput. Stat. Data Anal. 2013, 61, 148–157.
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).