Abstract
The Poisson distribution is commonly used as the innovation distribution for integer-valued autoregressive models, but its mean equals its variance, which limits its flexibility. The one-parameter, infinitely divisible Bell distribution may therefore be a good alternative; moreover, for small values of its parameter, the Bell distribution approaches the Poisson distribution. In this paper, we introduce a new first-order, non-negative, integer-valued autoregressive model with Bell innovations based on the binomial thinning operator. Compared with other models, the new model is not only simple but also particularly suitable for time series of counts exhibiting overdispersion. Some properties of the model are established, such as the mean, variance, joint distribution functions, and multi-step-ahead conditional measures. Conditional least squares, Yule–Walker, and conditional maximum likelihood methods are used to estimate the parameters. Simulation results are presented to assess the performance of these estimates. Two real data examples are provided.
1. Introduction
In recent years, the study of count time series has attracted considerable attention in fields such as finance, medical science, and insurance, and many models for count data have been proposed. The best-known model, the first-order integer-valued autoregressive (INAR(1)) process, was introduced by McKenzie (1985) [] and Al-Osh and Alzaid (1987) [] and is based on the binomial thinning operator ∘ (Steutel and van Harn 1979 []). Given a non-negative integer-valued random variable (r.v.) X and a constant $\alpha \in (0, 1)$, the binomial thinning operator ∘ is defined as $\alpha \circ X = \sum_{i=1}^{X} B_i$, where the counting series $\{B_i\}$ is a sequence of independent identically distributed (i.i.d.) Bernoulli r.v.s with $P(B_i = 1) = 1 - P(B_i = 0) = \alpha$. Then, the form of the INAR(1) model is
$$X_t = \alpha \circ X_{t-1} + \varepsilon_t,$$
where $\{\varepsilon_t\}$ is a sequence of i.i.d. discrete r.v.s with mean $\mu_\varepsilon$ and finite variance $\sigma_\varepsilon^2$, and $\varepsilon_t$ is independent of $B_i$ and $X_s$ for $s < t$. According to Alzaid and Al-Osh (1988) [], the mean and variance of the INAR(1) model are
$$E(X_t) = \frac{\mu_\varepsilon}{1 - \alpha}, \qquad \mathrm{Var}(X_t) = \frac{\alpha \mu_\varepsilon + \sigma_\varepsilon^2}{1 - \alpha^2}.$$
The Poisson distribution is often assumed for the innovation $\varepsilon_t$ in the INAR(1) model. A natural characteristic of the Poisson distribution is equidispersion; i.e., its mean and variance are equal. In practice, however, many datasets are overdispersed (the variance is greater than the mean) relative to the Poisson distribution. For this reason, the INAR(1) model with Poisson innovations is not always suitable for modeling integer-valued time series, and several models that describe overdispersion have been discussed in the statistical literature.
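As a minimal sketch of the binomial thinning construction and of equidispersion under Poisson innovations, the following Python code simulates an INAR(1) path and compares its sample mean and variance. The function names, the seed, and the parameter values (thinning probability 0.5, Poisson rate 2) are illustrative assumptions, not taken from the paper.

```python
import math
import random


def binomial_thinning(alpha, x, rng):
    """Thin x by alpha: count successes among x independent Bernoulli(alpha) trials."""
    return sum(1 for _ in range(x) if rng.random() < alpha)


def poisson_draw(lam, rng):
    """Draw one Poisson(lam) variate with Knuth's multiplication method."""
    limit = math.exp(-lam)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def simulate_poisson_inar1(alpha, lam, length, rng, burn_in=200):
    """Simulate X_t = (alpha thinning of X_{t-1}) + eps_t with Poisson(lam) innovations."""
    x, path = 0, []
    for step in range(burn_in + length):
        x = binomial_thinning(alpha, x, rng) + poisson_draw(lam, rng)
        if step >= burn_in:  # discard the burn-in so the path is close to stationary
            path.append(x)
    return path


rng = random.Random(2021)
path = simulate_poisson_inar1(alpha=0.5, lam=2.0, length=20000, rng=rng)
sample_mean = sum(path) / len(path)
sample_var = sum((v - sample_mean) ** 2 for v in path) / (len(path) - 1)
# Under Poisson innovations both moments equal lam / (1 - alpha) = 4 in theory.
print(sample_mean, sample_var)
```

With Poisson innovations the stationary mean and variance coincide, which is the equidispersion that motivates looking for alternative innovation distributions.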
One common approach is to change the thinning operation in the INAR(1) model. Weiß (2018) [] summarized several alternative thinning operators, such as random coefficient thinning, iterated thinning and quasi-binomial thinning. Ristić et al. (2009) [] proposed the negative binomial thinning operator and defined the corresponding INAR(1) process with geometric marginal distributions. Liu and Zhu (2021) [] generalized the binomial thinning operator to the extended binomial one.
Changing the distribution of the innovations is another way to modify the INAR(1) model. Jung et al. (2005) [] indicated that the INAR(1) model with negative binomial innovations (NB-INAR(1)) is appropriate for generating overdispersion. Jazi et al. (2012) [] used a zero-inflated Poisson distribution for the innovations (ZIP-INAR(1)), because overdispersion is frequently accompanied by more zero counts than expected under the Poisson distribution. Jazi et al. (2012) [] proposed a modification of the INAR(1) model with geometric innovations (G-INAR(1)) for modeling overdispersed count data. Schweer and Weiß (2014) [] investigated the compound Poisson INAR(1) (CP-INAR(1)) model, which is suitable for fitting datasets with overdispersion; they also showed that the negative binomial and geometric distributions both belong to the compound Poisson family. Livio et al. (2018) [] presented the INAR(1) model with Poisson–Lindley innovations, i.e., PL-INAR(1). Bourguignon et al. (2019) [] introduced the INAR(1) model with double Poisson (DP-INAR(1)) and generalized Poisson (GP-INAR(1)) innovations. Qi et al. (2019) [] considered zero-and-one inflated INAR(1)-type models, and Cunha et al. (2021) [] introduced an INAR(1) model with Borel innovations to model zero-truncated count time series.
This paper applies the second approach to dealing with overdispersion. Although several models have been proposed in recent years, most of the considered distributions are based on some generalizations of the Poisson distribution and have more than one parameter, such as the zero-inflated Poisson, compound Poisson, double Poisson, and generalized Poisson distributions. Here we use a relatively simple distribution introduced by Castellares et al. (2018) [] for the innovations, i.e., the Bell distribution. It has the advantages of having only one parameter, belonging to the exponential family, having a simple probability mass function, and having infinite divisibility. Infinite divisibility is significant for constructing the binomial thinning INAR(1) model. Further, the Bell distribution is suitable for modeling some overdispersed count data. Therefore, we introduce a new INAR(1) model with Bell innovations (BL-INAR(1)), which can account for overdispersion in an INAR(1) framework.
In order to observe whether the BL-INAR(1) model has advantages, we compare it with the INAR(1) model with Poisson innovations (P-INAR(1)), G-INAR(1), PL-INAR(1), NB-INAR(1), ZIP-INAR(1), DP-INAR(1), and GP-INAR(1) models. Different information criteria, such as Akaike’s information criterion (AIC) [], the Bayesian information criterion (BIC) [], the consistent Akaike information criterion (CAIC) [], and the Hannan–Quinn information criterion (HQIC) [], are used to compare the above eight models. By comparing the results of different information criteria, it can be seen that the BL-INAR(1) model is competitive when modeling the overdispersed integer-valued time series data, which shows that the proposed BL-INAR(1) model is meaningful; see Section 5 for more details.
We organize the remaining parts of this paper as follows. In Section 2, we briefly review the Bell distribution, including its definition and some properties. Then we propose the BL-INAR(1) model, and its basic properties are constructed; conditional mean and variance are obtained. Section 3 discusses estimates of the model parameters by using the conditional least squares (CLS), Yule–Walker (YW), and conditional maximum likelihood (CML) methods. In Section 4, a numerical simulation of the estimates is presented with some discussions. In Section 5, we compare the proposed model with the other seven INAR(1)-type models when fitting two real data examples, which show the superior performances of the proposed model. The paper concludes in Section 6.
2. The BL-INAR(1) Model
In this section, we present a brief review of the Bell distribution (Castellares et al., 2018 []). Its definition and some properties are presented. Later we introduce the BL-INAR(1) model and derive some basic properties of it.
2.1. The Bell Distribution
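First, we introduce the Bell numbers. Bell (1934) [] provided the following expansion:
$$\exp\left(e^{x} - 1\right) = \sum_{n=0}^{\infty} \frac{B_n}{n!} x^n,$$
where $B_n$ is the Bell number defined by
$$B_n = \frac{1}{e} \sum_{k=0}^{\infty} \frac{k^n}{k!}. \qquad (2)$$
The Bell number $B_n$ is the n-th moment of the Poisson distribution with parameter equal to 1. The first few Bell numbers are $B_0 = B_1 = 1$, $B_2 = 2$, $B_3 = 5$, $B_4 = 15$, $B_5 = 52$, $B_6 = 203$, $B_7 = 877$, $B_8 = 4140$, $B_9 = 21{,}147$, $B_{10} = 115{,}975$, $B_{11} = 678{,}570$, $B_{12} = 4{,}213{,}597$ and $B_{13} = 27{,}644{,}437$.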
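The Bell numbers can be generated exactly with integer arithmetic. The sketch below uses the Bell triangle recurrence, which is one standard way to compute them and is not necessarily the method used by the authors; the function name `bell_numbers` is an illustrative choice.

```python
def bell_numbers(n_max):
    """Return [B_0, ..., B_{n_max}] using the Bell triangle recurrence."""
    bells = [1]  # B_0
    row = [1]    # first row of the Bell triangle
    for _ in range(n_max):
        # Each new row starts with the last entry of the previous row; every
        # further entry adds the neighbouring entry from the previous row.
        new_row = [row[-1]]
        for value in row:
            new_row.append(new_row[-1] + value)
        bells.append(new_row[0])
        row = new_row
    return bells


print(bell_numbers(13))
```

Because Python integers are unbounded, the values stay exact even for large indices, which matters when they are later divided by factorials in the probability mass function.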
For the convenience of the reader, we introduce the following definition and properties of the Bell distribution described in Castellares et al. (2018) []:
Definition 1.
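A discrete r.v. Z taking values in $\mathbb{N}_0 = \{0, 1, 2, \ldots\}$ has a Bell distribution with parameter $\theta > 0$, denoted as $Z \sim \mathrm{Bell}(\theta)$, if its probability mass function is given by
$$P(Z = z) = \frac{\theta^{z} e^{-e^{\theta} + 1} B_z}{z!}, \quad z = 0, 1, 2, \ldots,$$
where $B_z$ is the Bell number in (2).
We can see that the Bell distribution has only one parameter, and it belongs to the one-parameter exponential family of distributions. If $Z \sim \mathrm{Bell}(\theta)$, the probability generating function is
$$G_Z(s) = E\left(s^{Z}\right) = \exp\left(e^{s\theta} - e^{\theta}\right).$$
The mean and variance of Z are
$$E(Z) = \theta e^{\theta}, \qquad \mathrm{Var}(Z) = \theta (1 + \theta) e^{\theta}. \qquad (4)$$
Note that $\mathrm{Var}(Z) / E(Z) = 1 + \theta > 1$; hence, the Bell distribution is overdispersed, which means it may be suitable for count data with overdispersion in certain situations.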
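The moment formulas can be checked numerically. The following sketch assumes the Bell probability mass function $P(Z = z) = \theta^z e^{1 - e^\theta} B_z / z!$ and truncates the support at a hypothetical cut-off `z_max`; the names and the value of the parameter are illustrative.

```python
import math


def bell_numbers(n_max):
    """Return [B_0, ..., B_{n_max}] using the Bell triangle recurrence."""
    bells, row = [1], [1]
    for _ in range(n_max):
        new_row = [row[-1]]
        for value in row:
            new_row.append(new_row[-1] + value)
        bells.append(new_row[0])
        row = new_row
    return bells


def bell_pmf_table(theta, z_max):
    """Return [P(Z = 0), ..., P(Z = z_max)] for Z ~ Bell(theta)."""
    bells = bell_numbers(z_max)
    scale = math.exp(1.0 - math.exp(theta))
    return [scale * theta ** z * bells[z] / math.factorial(z)
            for z in range(z_max + 1)]


theta = 1.0
pmf = bell_pmf_table(theta, z_max=60)  # the tail beyond 60 is negligible here
total = sum(pmf)
mean = sum(z * p for z, p in enumerate(pmf))
variance = sum((z - mean) ** 2 * p for z, p in enumerate(pmf))
# Compare with theta * e^theta and the dispersion ratio 1 + theta.
print(total, mean, variance / mean)
```

The ratio of variance to mean printed at the end should agree with $1 + \theta$, which is the overdispersion property noted above.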
The Bell distribution has some other interesting properties, including the following: (i) the Poisson distribution is not nested in the Bell family, but for small values of the parameter, the Bell distribution approaches the Poisson distribution; (ii) it is identifiable, strongly unimodal and infinitely divisible; (iii) a r.v. $Z \sim \mathrm{Bell}(\theta)$ has the same distribution as $Y_1 + \cdots + Y_N$, where the $Y_n$ are i.i.d. with a zero-truncated Poisson distribution with parameter $\theta$, and $N \sim \mathrm{Poisson}(e^{\theta} - 1)$ is independent of the $Y_n$. See Castellares et al. (2018) [] for more properties.
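Property (iii) suggests a simple way to draw Bell variates. The sketch below assumes the compound Poisson representation with $N \sim \mathrm{Poisson}(e^\theta - 1)$ and zero-truncated Poisson($\theta$) summands; the helper names, the rejection step for the truncation, and the seed are illustrative choices.

```python
import math
import random


def poisson_draw(lam, rng):
    """Draw one Poisson(lam) variate with Knuth's multiplication method."""
    limit = math.exp(-lam)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def zero_truncated_poisson_draw(theta, rng):
    """Draw from Poisson(theta) conditioned on being positive (by rejection)."""
    while True:
        value = poisson_draw(theta, rng)
        if value > 0:
            return value


def bell_draw(theta, rng):
    """Draw Z ~ Bell(theta) as a Poisson(e^theta - 1) sum of zero-truncated Poissons."""
    n_terms = poisson_draw(math.exp(theta) - 1.0, rng)
    return sum(zero_truncated_poisson_draw(theta, rng) for _ in range(n_terms))


rng = random.Random(7)
theta = 0.5
draws = [bell_draw(theta, rng) for _ in range(50000)]
draw_mean = sum(draws) / len(draws)
draw_var = sum((d - draw_mean) ** 2 for d in draws) / (len(draws) - 1)
# Theory: mean = theta * e^theta and variance / mean = 1 + theta.
print(draw_mean, draw_var / draw_mean)
```

Rejection sampling for the zero-truncated Poisson is simple but becomes slow for very small parameter values; an inversion sampler would be preferable in that regime.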
Additionally, there are some papers based on the Bell distribution, and the following are a few related references: Batsidis et al. (2020) [] proposed and studied a goodness-of-fit test for the Bell distribution, which is consistent against fixed alternatives; Castellares et al. (2020) [] presented a new two-parameter Bell–Touchard discrete distribution; Lemonte et al. (2020) [] introduced a zero-inflated Bell regression model for count data; Muhammad et al. (2021) [] proposed a Bell ridge regression as a solution to the multicollinearity problems.
2.2. Definition and Properties of the BL-INAR(1) Process
In this section, we give the definition of the BL-INAR(1) process, and its basic statistical properties are derived.
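Definition 2.
A process $\{X_t\}$ is said to be a BL-INAR(1) process if it satisfies
$$X_t = \alpha \circ X_{t-1} + \varepsilon_t, \qquad (5)$$
where $\alpha \in (0, 1)$, $\{\varepsilon_t\}$ is a sequence of i.i.d. $\mathrm{Bell}(\theta)$ r.v.s, and $\varepsilon_t$ is independent of $B_i$ and $X_s$ for $s < t$.
According to Equation (4), the mean and variance of $\varepsilon_t$ are finite; therefore, the process $\{X_t\}$ in (5) is an ergodic stationary Markov chain (Du and Li, (1991) []) with transition probabilities
$$P(X_t = j \mid X_{t-1} = i) = \sum_{k=0}^{\min(i, j)} \binom{i}{k} \alpha^{k} (1 - \alpha)^{i-k} \, \frac{\theta^{j-k} e^{-e^{\theta} + 1} B_{j-k}}{(j-k)!}.$$
Further, we can obtain the joint probability function as follows:
$$P(X_1 = x_1, \ldots, X_T = x_T) = P(X_1 = x_1) \prod_{t=2}^{T} P(X_t = x_t \mid X_{t-1} = x_{t-1}). \qquad (6)$$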
The conditional mean, conditional variance, mean, variance, covariance and autocorrelation function of the BL-INAR(1) process are given in the following lemma.
Lemma 1.
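Let $\{X_t\}$ be the process in Definition 2. Then it has the following properties:
(i) $E(X_t \mid X_{t-1}) = \alpha X_{t-1} + \theta e^{\theta}$;
(ii) $\mathrm{Var}(X_t \mid X_{t-1}) = \alpha (1 - \alpha) X_{t-1} + \theta (1 + \theta) e^{\theta}$;
(iii) $E(X_t) = \dfrac{\theta e^{\theta}}{1 - \alpha}$;
(iv) $\mathrm{Var}(X_t) = \dfrac{\theta e^{\theta} (1 + \alpha + \theta)}{1 - \alpha^2}$;
(v) $\mathrm{Cov}(X_t, X_{t+k}) = \alpha^{k} \, \mathrm{Var}(X_t)$ for $k \geq 0$;
(vi) $\mathrm{Corr}(X_t, X_{t+k}) = \alpha^{k}$ for $k \geq 0$.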
The proof of Lemma 1 is similar to that of Theorem 1 of Qi et al. (2019) [], so it is omitted.
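The transition probabilities and the conditional moments of the BL-INAR(1) process can be cross-checked numerically. The sketch below assumes that the transition probability is the convolution of a Binomial($i$, $\alpha$) survivor count with a Bell($\theta$) innovation, and that the conditional mean and variance are $\alpha i + \theta e^\theta$ and $\alpha(1-\alpha) i + \theta(1+\theta) e^\theta$; the function names and parameter values are illustrative.

```python
import math


def bell_numbers(n_max):
    """Return [B_0, ..., B_{n_max}] using the Bell triangle recurrence."""
    bells, row = [1], [1]
    for _ in range(n_max):
        new_row = [row[-1]]
        for value in row:
            new_row.append(new_row[-1] + value)
        bells.append(new_row[0])
        row = new_row
    return bells


def transition_prob(j, i, alpha, theta, bells):
    """P(X_t = j | X_{t-1} = i): binomial survivors convolved with a Bell innovation."""
    scale = math.exp(1.0 - math.exp(theta))
    total = 0.0
    for k in range(min(i, j) + 1):
        survive = math.comb(i, k) * alpha ** k * (1.0 - alpha) ** (i - k)
        innovation = scale * theta ** (j - k) * bells[j - k] / math.factorial(j - k)
        total += survive * innovation
    return total


alpha, theta, i = 0.4, 0.6, 3
bells = bell_numbers(60)
row = [transition_prob(j, i, alpha, theta, bells) for j in range(61)]
row_sum = sum(row)
cond_mean = sum(j * p for j, p in enumerate(row))
cond_var = sum((j - cond_mean) ** 2 * p for j, p in enumerate(row))
print(row_sum, cond_mean, cond_var)
```

The truncation at 60 is a hypothetical cut-off that is more than sufficient for the small parameter values used here.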
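According to Lemma 1, the dispersion index (Fisher, 1950 []) of $\{X_t\}$ is derived as follows:
$$I_X = \frac{\mathrm{Var}(X_t)}{E(X_t)} = \frac{1 + \alpha + \theta}{1 + \alpha} = 1 + \frac{\theta}{1 + \alpha} > 1;$$
thus, the BL-INAR(1) process is suited for overdispersed integer-valued time series.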
Additionally, we can obtain the k-step ahead conditional mean and k-step ahead conditional variance of the BL-INAR(1) process in the following theorem.
Theorem 1.
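The k-step ahead conditional mean and k-step ahead conditional variance of the BL-INAR(1) process are given, respectively, by:
$$E(X_{t+k} \mid X_t) = \alpha^{k} X_t + \frac{1 - \alpha^{k}}{1 - \alpha} \, \theta e^{\theta}$$
and
$$\mathrm{Var}(X_{t+k} \mid X_t) = \alpha^{k} (1 - \alpha^{k}) X_t + \frac{1 - \alpha^{k}}{1 - \alpha} \, \theta e^{\theta} + \frac{1 - \alpha^{2k}}{1 - \alpha^{2}} \, \theta^{2} e^{\theta}.$$
For more details about the proof of this theorem, see Qi et al. (2019) [] and Ristić, Bakouch, and Nastić (2009) []. It is easy to see that if $k \to \infty$, then $E(X_{t+k} \mid X_t) \to \theta e^{\theta} / (1 - \alpha)$ and $\mathrm{Var}(X_{t+k} \mid X_t) \to \theta e^{\theta} (1 + \alpha + \theta) / (1 - \alpha^{2})$, which are the unconditional mean and unconditional variance of $X_t$, respectively.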
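The k-step ahead conditional moments can be evaluated directly. The sketch below writes the conditional variance in a form obtained from the thinning representation $X_{t+k} = \alpha^k \circ X_t + \sum_{j=0}^{k-1} \alpha^j \circ \varepsilon_{t+k-j}$, which may be arranged differently from the theorem's statement; the function name and parameter values are illustrative assumptions.

```python
import math


def k_step_conditional_moments(x_t, k, alpha, theta):
    """Return (mean, variance) of X_{t+k} given X_t = x_t for a BL-INAR(1) process."""
    mu = theta * math.exp(theta)          # innovation mean
    extra = theta ** 2 * math.exp(theta)  # innovation variance minus innovation mean
    ak = alpha ** k
    mean = ak * x_t + mu * (1.0 - ak) / (1.0 - alpha)
    variance = (ak * (1.0 - ak) * x_t
                + mu * (1.0 - ak) / (1.0 - alpha)
                + extra * (1.0 - ak ** 2) / (1.0 - alpha ** 2))
    return mean, variance


alpha, theta, x_t = 0.4, 0.6, 5
for k in (1, 2, 5, 50):
    print(k, k_step_conditional_moments(x_t, k, alpha, theta))
```

For k = 1 the output reduces to the one-step conditional mean and variance, and for large k it settles at the unconditional mean and variance, so the influence of the last observation fades geometrically at rate $\alpha$.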
3. Estimation of Parameters
The true values of the parameters $\alpha$ and $\theta$ are unknown in practice; therefore, we need to estimate $(\alpha, \theta)$. In some cases, we first estimate the innovation mean $\mu = \theta e^{\theta}$ and then obtain the estimate of $\theta$ from it. In this section, we consider three methods for estimating the parameters, namely, CLS, YW and CML.
3.1. Conditional Least Squares Estimation
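The CLS estimates of the parameters $\alpha$ and $\mu = \theta e^{\theta}$ are obtained by minimizing
$$Q(\alpha, \mu) = \sum_{t=2}^{T} \left( X_t - \alpha X_{t-1} - \mu \right)^{2},$$
and the CLS estimates of $(\alpha, \mu)$ are given by
$$\hat{\alpha}_{CLS} = \frac{(T-1) \sum_{t=2}^{T} X_t X_{t-1} - \sum_{t=2}^{T} X_t \sum_{t=2}^{T} X_{t-1}}{(T-1) \sum_{t=2}^{T} X_{t-1}^{2} - \left( \sum_{t=2}^{T} X_{t-1} \right)^{2}}$$
and
$$\hat{\mu}_{CLS} = \frac{1}{T-1} \left( \sum_{t=2}^{T} X_t - \hat{\alpha}_{CLS} \sum_{t=2}^{T} X_{t-1} \right).$$
Then, the CLS estimate of $\theta$ can be obtained by solving the equation $\hat{\mu}_{CLS} = \theta e^{\theta}$.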
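The CLS step can be sketched in a few lines. The code below assumes the usual least-squares formulas for the slope and intercept of the regression of $X_t$ on $X_{t-1}$, and solves $\mu = \theta e^\theta$ by Newton's method (the solution is the Lambert W function evaluated at $\mu$). The function names, the starting value, and the toy series are illustrative assumptions.

```python
import math


def cls_estimates(x):
    """Return (alpha_hat, mu_hat) minimizing sum_t (x[t] - alpha * x[t-1] - mu)^2."""
    n = len(x) - 1
    prev, curr = x[:-1], x[1:]
    s_prev, s_curr = sum(prev), sum(curr)
    s_cross = sum(a * b for a, b in zip(prev, curr))
    s_square = sum(a * a for a in prev)
    alpha_hat = (n * s_cross - s_curr * s_prev) / (n * s_square - s_prev ** 2)
    mu_hat = (s_curr - alpha_hat * s_prev) / n
    return alpha_hat, mu_hat


def solve_theta(mu, tol=1e-12, max_iter=100):
    """Solve theta * exp(theta) = mu for theta > 0 with Newton's method."""
    if mu <= 0:
        raise ValueError("the innovation mean must be positive")
    theta = math.log1p(mu)  # starting value at or above the root
    for _ in range(max_iter):
        step = (theta * math.exp(theta) - mu) / (math.exp(theta) * (1.0 + theta))
        theta -= step
        if abs(step) < tol:
            break
    return theta


toy_series = [2, 3, 1, 0, 2, 4, 3, 5, 2, 1, 3, 2]  # hypothetical counts
alpha_hat, mu_hat = cls_estimates(toy_series)
print(alpha_hat, mu_hat)
if mu_hat > 0:
    print(solve_theta(mu_hat))
```

On short series the intercept estimate can be non-positive, in which case no admissible value of $\theta$ exists; the guard in `solve_theta` makes that failure explicit rather than returning a meaningless number.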
According to Theorems 3.1 and 3.2 in Tjøstheim (1986) [], we can establish the consistency and asymptotic normality of the CLS estimates and in the following theorem. The proofs of Theorem 2 and the following theorem are given in Appendix A.
Theorem 2.
Let $\hat{\alpha}_{CLS}$ and $\hat{\mu}_{CLS}$ be the CLS estimates of the BL-INAR(1) process; then $(\hat{\alpha}_{CLS}, \hat{\mu}_{CLS})$ is strongly consistent for (α, μ), and its asymptotic distribution is as follows:
where
and .
Using the delta method, we can obtain the limit distribution of $\hat{\theta}_{CLS}$, and it also follows that $\hat{\theta}_{CLS}$ is consistent.
3.2. Yule–Walker Estimation
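Let $X_1, \ldots, X_T$ come from the process $\{X_t\}$ in Definition 2. The sample mean is $\bar{X} = T^{-1} \sum_{t=1}^{T} X_t$, and the sample autocorrelation function is
$$\hat{\rho}(k) = \frac{\sum_{t=k+1}^{T} (X_t - \bar{X})(X_{t-k} - \bar{X})}{\sum_{t=1}^{T} (X_t - \bar{X})^{2}}.$$
From Lemma 1, we know $\rho(1) = \alpha$; thus the Yule–Walker (YW) estimate of $\alpha$ is given by
$$\hat{\alpha}_{YW} = \hat{\rho}(1),$$
and the YW estimate of the innovation mean $\mu = \theta e^{\theta}$ is
$$\hat{\mu}_{YW} = (1 - \hat{\alpha}_{YW}) \bar{X},$$
with $\hat{\mu}_{YW} = \hat{\theta}_{YW} e^{\hat{\theta}_{YW}}$; then the estimate of $\theta$ can be obtained.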
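A minimal sketch of the Yule–Walker step follows, assuming that the estimate of the thinning parameter is the lag-1 sample autocorrelation and that the innovation mean is estimated by (1 minus that estimate) times the sample mean. The function name and the toy series are illustrative.

```python
def yule_walker_estimates(x):
    """Return (alpha_hat, mu_hat) from the lag-1 sample autocorrelation and sample mean."""
    n = len(x)
    mean = sum(x) / n
    denom = sum((v - mean) ** 2 for v in x)
    numer = sum((x[t] - mean) * (x[t - 1] - mean) for t in range(1, n))
    alpha_hat = numer / denom
    mu_hat = (1.0 - alpha_hat) * mean  # estimate of the innovation mean
    return alpha_hat, mu_hat


toy_series = [2, 3, 1, 0, 2, 4, 3, 5, 2, 1, 3, 2]  # hypothetical counts
print(yule_walker_estimates(toy_series))
```

The estimate of $\theta$ then follows by solving $\hat{\mu} = \theta e^{\theta}$ numerically, exactly as for the CLS estimate.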
For asymptotic properties of the YW estimates, Freeland and McCabe (2005) [] showed that the YW and CLS estimates are asymptotically equivalent for a Poisson INAR(1) process. The next theorem shows that the conclusion holds for our BL-INAR(1) process.
Theorem 3.
In the BL-INAR(1) process, CLS and YW estimates are asymptotically equivalent, i.e.,
3.3. Conditional Maximum Likelihood Estimation
According to the joint probability function (6), the likelihood function can be obtained as:
$$L(\alpha, \theta) = P(X_1 = x_1) \prod_{t=2}^{T} P(X_t = x_t \mid X_{t-1} = x_{t-1}).$$
Conditioning on $X_1$, we obtain the conditional log-likelihood function
$$\ell(\alpha, \theta) = \sum_{t=2}^{T} \log P(X_t = x_t \mid X_{t-1} = x_{t-1}).$$
The CML estimates of $(\alpha, \theta)$ are the values $(\hat{\alpha}_{CML}, \hat{\theta}_{CML})$ obtained by maximizing the conditional log-likelihood function $\ell(\alpha, \theta)$. It is easy to check that the BL-INAR(1) process satisfies conditions (C1)–(C6) of Franke and Seligmann (1993) []; thus, the CML estimates are consistent and asymptotically normal. The proof is similar to those of Theorems 22.4 and 22.5 of Franke and Seligmann (1993) [], so it is omitted.
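The conditional log-likelihood is straightforward to evaluate once the transition probabilities are available. The sketch below assumes the convolution form of the transition probability and, purely for illustration, replaces a quasi-Newton optimizer with a coarse grid search; the function names, the grid ranges, and the toy series are assumptions.

```python
import math


def bell_numbers(n_max):
    """Return [B_0, ..., B_{n_max}] using the Bell triangle recurrence."""
    bells, row = [1], [1]
    for _ in range(n_max):
        new_row = [row[-1]]
        for value in row:
            new_row.append(new_row[-1] + value)
        bells.append(new_row[0])
        row = new_row
    return bells


def transition_prob(j, i, alpha, theta, bells):
    """P(X_t = j | X_{t-1} = i) for the BL-INAR(1) process."""
    scale = math.exp(1.0 - math.exp(theta))
    return sum(
        math.comb(i, k) * alpha ** k * (1.0 - alpha) ** (i - k)
        * scale * theta ** (j - k) * bells[j - k] / math.factorial(j - k)
        for k in range(min(i, j) + 1)
    )


def conditional_loglik(x, alpha, theta):
    """Sum of log transition probabilities, conditioning on the first observation."""
    bells = bell_numbers(max(x))
    return sum(math.log(transition_prob(x[t], x[t - 1], alpha, theta, bells))
               for t in range(1, len(x)))


def grid_search_cml(x, steps=20):
    """Coarse grid maximizer over alpha in (0, 1) and theta in (0, 2]."""
    alphas = [i / steps for i in range(1, steps)]
    thetas = [2.0 * i / steps for i in range(1, steps + 1)]
    return max((conditional_loglik(x, a, th), a, th)
               for a in alphas for th in thetas)


toy_series = [2, 3, 1, 0, 2, 4, 3]  # hypothetical counts
best_loglik, best_alpha, best_theta = grid_search_cml(toy_series)
print(best_loglik, best_alpha, best_theta)
```

In practice the grid maximizer would serve only as a starting point for a gradient-based routine, since its resolution limits the accuracy of the estimates.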
4. Simulation
A Monte Carlo simulation was conducted to study the performances of the CLS, YW, and CML estimates of the BL-INAR(1) model. The CML estimates were obtained by using the BFGS quasi-Newton nonlinear optimization algorithm with numerical derivatives. We considered YW estimates as initial values for the algorithm. The simulation was conducted using R programming language, and the size of the sample was 100, 250, 500, or 1000. The number of replicates was 1000. For the true values of parameters, we considered and and and .
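The structure of such a Monte Carlo study can be sketched as follows. This is an illustrative Python version, not the authors' R code: it uses far fewer replicates, evaluates only the Yule–Walker estimate of the thinning parameter, and draws Bell innovations through the compound Poisson representation. All names, the seed, and the parameter values are assumptions.

```python
import math
import random


def poisson_draw(lam, rng):
    """Draw one Poisson(lam) variate with Knuth's multiplication method."""
    limit = math.exp(-lam)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def bell_draw(theta, rng):
    """Draw Bell(theta) as a Poisson(e^theta - 1) sum of zero-truncated Poisson(theta)."""
    total = 0
    for _ in range(poisson_draw(math.exp(theta) - 1.0, rng)):
        y = poisson_draw(theta, rng)
        while y == 0:  # reject zeros to obtain the zero-truncated law
            y = poisson_draw(theta, rng)
        total += y
    return total


def simulate_bl_inar1(alpha, theta, length, rng, burn_in=100):
    """Simulate a BL-INAR(1) path of the given length after a burn-in period."""
    x, path = 0, []
    for step in range(burn_in + length):
        survivors = sum(1 for _ in range(x) if rng.random() < alpha)
        x = survivors + bell_draw(theta, rng)
        if step >= burn_in:
            path.append(x)
    return path


def yw_alpha(x):
    """Lag-1 sample autocorrelation, the Yule-Walker estimate of alpha."""
    mean = sum(x) / len(x)
    denom = sum((v - mean) ** 2 for v in x)
    numer = sum((x[t] - mean) * (x[t - 1] - mean) for t in range(1, len(x)))
    return numer / denom


def monte_carlo(alpha, theta, length, replicates, seed):
    """Return (empirical mean, bias, MSE) of the YW estimate of alpha."""
    rng = random.Random(seed)
    estimates = [yw_alpha(simulate_bl_inar1(alpha, theta, length, rng))
                 for _ in range(replicates)]
    mean_estimate = sum(estimates) / replicates
    mse = sum((e - alpha) ** 2 for e in estimates) / replicates
    return mean_estimate, mean_estimate - alpha, mse


mean_estimate, bias, mse = monte_carlo(alpha=0.4, theta=0.5, length=200,
                                       replicates=200, seed=123)
print(mean_estimate, bias, mse)
```

Repeating the call for increasing series lengths gives a rough picture of how the bias and mean squared error shrink with the sample size.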
First, we give the Q–Q plots of the CLS, YW, and CML estimates for the BL-INAR(1) model with sample size , , and in Figure 1. From the six Q–Q plots, we can see that they contain roughly straight lines; i.e., the estimates of the parameters are normally distributed. Then, the numerical simulation results are presented in Table 1 and Table 2. By comparing the two tables, we can find that with the same and T, the mean squared error (MSE) for the estimate of increased with the increase of , but the MSE for the estimate of decreased. Additionally, the MSE for the estimate of increased with the increase of with the same and T, but the MSE for the estimate of decreased. Furthermore, we can observe that the estimates of CLS and YW are similar, and the bias tended toward zero for all estimates as the sample size increased. The estimates of CML converged faster to the true parameter values. We conclude that the CML estimates produced the smallest mean square errors, and CML performed better than CLS and YW.
Figure 1.
The Q–Q plots of the CLS, YW, and CML estimates for the BL-INAR(1) model with sample size .
Table 1.
Empirical means and mean squared errors (in parentheses) of the estimates of the parameters for some values of and of the BL-INAR(1) model.
Table 2.
Empirical means and mean squared errors (in parentheses) of the estimates of the parameters for some values of and of the BL-INAR(1) model.
5. Real Data Examples
In this section, we present two applications of the BL-INAR(1) model to real datasets, and compare it with the P-INAR(1), G-INAR(1), PL-INAR(1), NB-INAR(1), ZIP-INAR(1), DP-INAR(1), and GP-INAR(1) models. Results of the comparison are discussed here as well.
5.1. Disconduct Data
The first dataset is a monthly count of disconduct in the first census tract in Rochester, which is available online at http://www.forecastingprinciples.com (accessed on 8 May 2012). The data comprise 132 observations ($T = 132$) starting from January 1991 and ending in December 2001.
The time plot, histogram, autocorrelation function (ACF), and partial autocorrelation function (PACF) are provided in Figure 2. We applied the Ljung–Box test (Ljung and Box (1978) []) to check whether this time series dataset has any autocorrelation. The p-value of the Ljung–Box test is , which is less than 0.05. This means that the time series data have some autocorrelation, and according to the PACF diagram, the data are first-order autocorrelated, which shows that the AR(1)-type process is appropriate for modeling this dataset.
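For reference, the Ljung–Box statistic is $Q = n(n+2) \sum_{k=1}^{h} \hat{\rho}(k)^2 / (n-k)$, which is compared with a chi-square distribution with $h$ degrees of freedom. The sketch below computes only the statistic (the p-value needs a chi-square distribution function, which the Python standard library does not provide); the function names and the toy series are illustrative.

```python
def sample_acf(x, lag):
    """Sample autocorrelation of x at the given lag."""
    n = len(x)
    mean = sum(x) / n
    denom = sum((v - mean) ** 2 for v in x)
    numer = sum((x[t] - mean) * (x[t - lag] - mean) for t in range(lag, n))
    return numer / denom


def ljung_box_statistic(x, max_lag):
    """Ljung-Box Q statistic using lags 1, ..., max_lag."""
    n = len(x)
    return n * (n + 2) * sum(sample_acf(x, k) ** 2 / (n - k)
                             for k in range(1, max_lag + 1))


toy_series = [2, 3, 1, 0, 2, 4, 3, 5, 2, 1, 3, 2]  # hypothetical counts
print(ljung_box_statistic(toy_series, max_lag=3))
```

A large value of Q relative to the chi-square reference distribution indicates autocorrelation in the series.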
Figure 2.
The time plot, histogram, ACF, and PACF of disconduct data.
The sample mean and variance of the data are and , respectively. Thus, we got the dispersion index . According to the overdispersion test of Schweer and Weiß (2014) [], the critical value of the data is 1.1994. The dispersion index exceeds the critical value, which means that the equidispersed P-INAR(1) model is not a good choice for the data.
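The sample dispersion index is simply the ratio of the sample variance to the sample mean. The sketch below assumes the unbiased (divide by n − 1) sample variance, which may differ from the convention used for the reported values, and uses a hypothetical series rather than the actual data.

```python
def dispersion_index(x):
    """Return (mean, variance, variance / mean) using the n - 1 sample variance."""
    n = len(x)
    mean = sum(x) / n
    variance = sum((v - mean) ** 2 for v in x) / (n - 1)
    return mean, variance, variance / mean


toy_series = [0, 1, 2, 3, 10]  # hypothetical counts with one large value
mean, variance, index = dispersion_index(toy_series)
print(mean, variance, index)
```

An index clearly above 1 points to overdispersion, although a formal conclusion requires comparing it with a critical value such as the one from the test cited above.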
For comparison, we calculated the CML estimates of the parameters, together with the AIC, BIC, CAIC, HQIC, fitted mean, and fitted variance, for the BL-INAR(1), P-INAR(1), G-INAR(1), PL-INAR(1), ZIP-INAR(1), NB-INAR(1), DP-INAR(1), and GP-INAR(1) models. Among the eight models, the first four are two-parameter models and the last four are three-parameter models. The results are presented in Table 3. The AIC, BIC, CAIC, and HQIC of the BL-INAR(1) model were smaller than those of the other models. The fitted means of all eight models were close to the sample mean, with the fitted mean of the PL-INAR(1) model being the closest. In terms of fitted variance, Table 3 shows that the BL-INAR(1) model performed better than the other seven models.
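The four information criteria are simple functions of the maximized log-likelihood, the number of parameters, and the sample size. The sketch below assumes the common definitions, with CAIC taken in Bozdogan's form $-2\ell + k(\log n + 1)$; the exact variants used for the reported tables are not stated in the text, and the numerical inputs are hypothetical.

```python
import math


def information_criteria(loglik, n_params, n_obs):
    """Return AIC, BIC, CAIC and HQIC for a fitted model (smaller is better)."""
    deviance = -2.0 * loglik
    return {
        "AIC": deviance + 2.0 * n_params,
        "BIC": deviance + n_params * math.log(n_obs),
        "CAIC": deviance + n_params * (math.log(n_obs) + 1.0),
        "HQIC": deviance + 2.0 * n_params * math.log(math.log(n_obs)),
    }


# Hypothetical values: a two-parameter model fitted to 132 observations.
criteria = information_criteria(loglik=-100.0, n_params=2, n_obs=132)
print(criteria)
```

All four criteria penalize additional parameters, so a two-parameter model needs a smaller log-likelihood gain than a three-parameter model to be preferred.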
Table 3.
CML estimates, AIC, BIC, CAIC, HQIC, fitted mean, fitted variance, and RMSE for eight INAR(1) models of disconduct data.
For the prediction, we used the first 126 observations to estimate the parameters, and then predicted the last six observations. The predicted values of the disconduct data could be given by . For a further comparison of models, we calculated the root mean square values of the prediction errors (RMSEs) for the last 6 months of the data, and the RMSE is defined as . We present the RMSE results of eight models in the last column of Table 3. From the table, we can see that the RMSE of the G-INAR(1) model was best. The RMSE of the BL-INAR(1) model is smaller than those of the P-INAR(1) model, the NB-INAR(1) model, the DP-INAR(1) model, and the GP-INAR(1) model; and a little larger than those of the G-INAR(1) model, the PL-INAR(1) model, and the ZIP-INAR(1) model. Although the fitted mean and RMSE of the BL-INAR(1) model are not the best, it is the best choice under the other five criteria. Further, we analyze the Pearson residuals, and Figure 3 plots the ACF, PACF, and Q–Q plots of residuals. The ACF and PACF graphs show no correlation between residuals, which is supported by the result of the Ljung–Box test with a p-value of . The Q–Q plots appear to be roughly normally distributed, as we expected. Hence, we can conclude that the BL-INAR(1) model is the most suitable among those available for this dataset.
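The forecast evaluation and residual check can be sketched as follows. The code assumes that predictions are one-step-ahead conditional means $\hat{\alpha} X_{t-1} + \hat{\theta} e^{\hat{\theta}}$ and that Pearson residuals standardize the one-step errors by the conditional standard deviation; the exact prediction formula used in the paper is not recoverable from the text, and the parameter values and series here are hypothetical.

```python
import math


def one_step_forecasts(x, alpha, theta):
    """Conditional means E(X_t | X_{t-1}) for t = 2, ..., len(x)."""
    mu = theta * math.exp(theta)
    return [alpha * prev + mu for prev in x[:-1]]


def rmse(actual, predicted):
    """Root mean square error between two equally long sequences."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted))
                     / len(actual))


def pearson_residuals(x, alpha, theta):
    """Standardized one-step errors (x_t - mean_t) / sd_t under the BL-INAR(1) model."""
    mu = theta * math.exp(theta)
    sigma2 = theta * (1.0 + theta) * math.exp(theta)
    return [(x[t] - alpha * x[t - 1] - mu)
            / math.sqrt(alpha * (1.0 - alpha) * x[t - 1] + sigma2)
            for t in range(1, len(x))]


toy_series = [2, 3, 1, 0, 2, 4, 3, 5, 2, 1, 3, 2]  # hypothetical counts
alpha_hat, theta_hat = 0.3, 0.7                    # hypothetical estimates
forecasts = one_step_forecasts(toy_series, alpha_hat, theta_hat)
print(rmse(toy_series[1:], forecasts))
print(pearson_residuals(toy_series, alpha_hat, theta_hat)[:3])
```

If the model is adequate, the Pearson residuals should have mean close to 0, variance close to 1, and no significant autocorrelation.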
Figure 3.
The ACF, PACF, and Q–Q plots of the Pearson residual for disconduct data using the BL-INAR(1) model.
5.2. Strikes Data
The second dataset, which was analyzed by Weiß (2010) [], is the monthly number of work stoppages (strikes and lock-outs) of 1000 or more workers for the period 1994–2002. It was published by the US Bureau of Labor Statistics and is available online at http://www.bls.gov/wsp/ (accessed on 8 May 2012). The data contain 108 observations, and the time plot, histogram, ACF, and PACF are provided in Figure 4. As with the previous example, the Ljung–Box test was used to check whether the strike data have any autocorrelation. The p-value of the Ljung–Box test was , which shows that the time series data have some autocorrelation, and according to the PACF diagram, it is also first-order autocorrelated, so an AR(1)-type process is appropriate for modeling this dataset.
Figure 4.
The time plot, histogram, ACF, and PACF of data on strikes.
The sample mean, variance, and dispersion index were calculated to be , , and , respectively. According to the overdispersion test, the critical value of the data is , and we observe that it was inappropriate to use the P-INAR(1) model to fit the data. The CML estimates, AIC, BIC, CAIC, HQIC, fitted mean, and fitted variance for the BL-INAR(1), P-INAR(1), G-INAR(1), PL-INAR(1), NB-INAR(1), ZIP-INAR(1), DP-INAR(1), and GP-INAR(1) models were obtained and are shown in Table 4. We see that the AIC, BIC, CAIC, and HQIC of the BL-INAR(1) model are smaller than those of others, and the fitted mean of the BL-INAR(1) model is not much different from those of the other seven models. Further, we can see that the BL-INAR(1) model performed better than others when calculating the fitted variance. Similarly to the previous example, the first 102 observations were used to estimate the parameters and predict the last six observations. The RMSE of the predictions is also presented in Table 4. We can observe that the RMSE of the G-INAR(1) model is the smallest; however, it is only 0.05 less than the RMSE of the BL-INAR(1) model. As in the previous example, although the BL-INAR(1) model was not the best under the fitted mean and RMSE criteria, it performed best under the other five criteria. Additionally, we show the Pearson residuals analysis. Figure 5 gives the ACF, PACF, and Q–Q plots of the residuals. We found that there is no evidence of any significant correlation within the residuals, a finding also supported by the Ljung–Box test with a p-value of 0.9522, which is greater than 0.05. The Q–Q plot also appears to be roughly normally distributed. Thus, according to above discussions and its simplicity, we can conclude that the BL-INAR(1) model was the most appropriate.
Table 4.
CML estimates, AIC, BIC, CAIC, HQIC, fitted mean, fitted variance, and RMSE from eight INAR(1) models of strike data.
Figure 5.
The ACF, PACF, and Q–Q plots of the Pearson residual for strike data using the BL-INAR(1) model.
Combined with the above two examples and the advantages of the Bell distribution with one parameter and a simple form, the BL-INAR(1) model is competitive with the other seven models.
6. Conclusions
A new INAR(1) model with Bell innovations based on the binomial thinning operator was introduced in this paper. Because the Bell distribution is overdispersed, the BL-INAR(1) model is suitable for overdispersed data. Some basic properties of the model were obtained, such as the transition probabilities, conditional mean, conditional variance, mean, variance, covariance, autocorrelation function, and k-step ahead conditional mean and variance. The unknown parameters were estimated by the CLS, YW, and CML methods. The Q–Q plots indicated that the estimates of the parameters are approximately normally distributed. The simulation results revealed that the CML estimates of the parameters of the BL-INAR(1) model were better than the CLS and YW estimates. Finally, the AIC, BIC, CAIC, HQIC, fitted means, fitted variances, and RMSE values of the predictions were compared among eight INAR(1) models for two real datasets; in both cases, the BL-INAR(1) model was the best under the four information criteria and the fitted variance, although not under the fitted mean and RMSE. The analysis of residuals also shows that the BL-INAR(1) model provided adequate fits to those datasets.
Although there are many overdispersed INAR(1) models, some interesting properties of the Bell distribution make the BL-INAR(1) model competitive: it has one parameter, is infinitely divisible, has a simple probability mass function, belongs to the one-parameter exponential family of distributions, and approaches the Poisson distribution for small values of its parameter. Some extensions of the Bell distribution, such as the zero-inflated Bell distribution and the Bell–Touchard distribution, provide ideas for studying related INAR models in the future.
Author Contributions
Conceptualization, F.Z.; methodology, F.Z.; software, J.H.; validation, J.H. and F.Z.; formal analysis, J.H.; investigation, J.H. and F.Z.; resources, F.Z.; data curation, J.H.; writing—original draft preparation, J.H.; writing—review and editing, F.Z.; visualization, J.H.; supervision, F.Z.; project administration, F.Z.; funding acquisition, F.Z. All authors have read and agreed to the published version of the manuscript.
Funding
Zhu’s work is supported by National Natural Science Foundation of China, grant numbers 11871027 and 11731015.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The disconduct data and the strike data are available at http://www.forecastingprinciples.com (accessed on 1 June 2021) and http://www.bls.gov/wsp/ (accessed on 1 June 2021), respectively.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
Appendix A.1. Proof of Theorem 2
To prove this theorem, we need to show that the conditions given in Theorems 3.1 and 3.2 of Tjøstheim (1986) [] are satisfied.
Define , and the true value of the unknown parameter . According to Lemma 1, we know that and that is almost surely three times differentiable in an open set containing .
Condition 1:
and , for
According to , we have
For the second derivative of , we have
Condition 2:
The vectors are linearly independent in the sense that if and are arbitrary real numbers such that
then . Note that
Then .
Condition 3:
For , there exist functions and for such that
Note that and
then we can choose , which guarantees that and .
For , it is easy to know that . So we choose , to satisfy and .
The above three conditions ensure that is a strongly consistent estimator for . According to Theorem 3.2 in Tjøstheim (1986) [], the asymptotic distribution of is
where ,
and
We can then find that
where , which follows from the following derivation:
according to Lemma 1,
then, we have
Thus, we obtain that
Appendix A.2. Proof of Theorem 3
The proof is similar to that of Theorem 4.2 in Cunha et al. (2021) []. For estimator , we have
where and . For estimator , we only need to prove is .
References
- McKenzie, E. Some simple models for discrete variate time series. Water Resour. Bull. 1985, 21, 645–650.
- Al-Osh, M.A.; Alzaid, A.A. First-order integer-valued autoregressive (INAR(1)) process. J. Time Ser. Anal. 1987, 8, 261–275.
- Steutel, F.W.; van Harn, K. Discrete analogues of self-decomposability and stability. Ann. Probab. 1979, 7, 893–899.
- Alzaid, A.A.; Al-Osh, M.A. First-order integer-valued autoregressive process: Distributional and regression properties. Stat. Neerl. 1988, 42, 53–61.
- Weiß, C.H. An Introduction to Discrete-Valued Time Series; John Wiley & Sons: Hoboken, NJ, USA, 2018.
- Ristić, M.M.; Bakouch, H.S.; Nastić, A.S. A new geometric first-order integer-valued autoregressive (NGINAR(1)) process. J. Stat. Plan. Inference 2009, 139, 2218–2226.
- Liu, Z.; Zhu, F. A new extension of thinning-based integer-valued autoregressive models for count data. Entropy 2021, 23, 62.
- Jung, R.C.; Ronning, G.; Tremayne, A.R. Estimation in conditional first order autoregression with discrete support. Stat. Pap. 2005, 46, 195–224.
- Jazi, M.A.; Jones, G.; Lai, C.-D. First-order integer valued AR processes with zero inflated Poisson innovations. J. Time Ser. Anal. 2012, 33, 954–963.
- Jazi, M.A.; Jones, G.; Lai, C.-D. Integer valued AR(1) with geometric innovations. J. Iran. Stat. Soc. 2012, 11, 173–190.
- Schweer, S.; Weiß, C.H. Compound Poisson INAR(1) processes: Stochastic properties and testing for overdispersion. Comput. Stat. Data Anal. 2014, 77, 267–284.
- Livio, T.; Mamode Khan, N.; Bourguignon, M.; Bakouch, H.S. An INAR(1) model with Poisson–Lindley innovations. Econ. Bull. 2018, 38, 1505–1513.
- Bourguignon, M.; Rodrigues, J.; Santos-Neto, M. Extended Poisson INAR(1) processes with equidispersion, underdispersion and overdispersion. J. Appl. Stat. 2019, 46, 101–118.
- Qi, X.; Li, Q.; Zhu, F. Modeling time series of count with excess zeros and ones based on INAR(1) model with zero-and-one inflated Poisson innovations. J. Comput. Appl. Math. 2019, 346, 572–590.
- Cunha, E.T.D.; Bourguignon, M.; Vasconcellos, K.L.P. On shifted integer-valued autoregressive model for count time series showing equidispersion, underdispersion or overdispersion. Commun. Stat.-Theory Methods 2021.
- Castellares, F.; Ferrari, S.L.P.; Lemonte, A.J. On the Bell distribution and its associated regression model for count data. Appl. Math. Model. 2018, 56, 172–185.
- Akaike, H. Information theory as an extension of the maximum likelihood principle. In Proceedings of the Second International Symposium on Information Theory; Petrov, B.N., Csaki, F., Eds.; Akadémiai Kiado: Budapest, Hungary, 1973; pp. 267–281.
- Schwarz, G. Estimating the Dimension of a Model. Ann. Stat. 1978, 6, 461–464.
- Bozdogan, H. Model selection and Akaike’s Information Criterion (AIC): The general theory and its analytical extensions. Psychometrika 1987, 52, 345–370.
- Hannan, E.J.; Quinn, B.G. The Determination of the Order of an Autoregression. J. R. Stat. Soc. Ser. B 1979, 41, 190–195.
- Bell, E.T. Exponential polynomials. Ann. Math. 1934, 35, 258–277.
- Batsidis, A.; Jiménez-Gamero, M.D.; Lemonte, A.J. On goodness-of-fit tests for the Bell distribution. Metrika 2020, 83, 297–319.
- Castellares, F.; Lemonte, A.J.; Moreno–Arenas, G. On the two-parameter Bell–Touchard discrete distribution. Commun. Stat.-Theory Methods 2020, 49, 4834–4852.
- Lemonte, A.J.; Moreno-Arenas, G.; Castellares, F. Zero-inflated Bell regression models for count data. J. Appl. Stat. 2020, 47, 265–286.
- Muhammad, A.; Muhammad, N.A.; Abdul, M. On the estimation of Bell regression model using ridge estimator. Commun. Stat.-Simul. Comput. 2021.
- Du, J.G.; Li, Y. The integer valued autoregressive (INAR(p)) model. J. Time Ser. Anal. 1991, 12, 129–142.
- Fisher, R.A. The significance of deviations from expectation in a Poisson series. Biometrics 1950, 6, 17–24.
- Tjøstheim, D. Estimation in nonlinear time series models. Stoch. Process. Their Appl. 1986, 21, 251–273.
- Freeland, R.K.; McCabe, B. Asymptotic properties of CLS estimates in the Poisson AR(1) model. Stat. Probab. Lett. 2005, 73, 147–153.
- Franke, J.; Seligmann, T. Conditional maximum likelihood estimates for INAR(1) processes and their application to modelling epileptic seizure counts. In Developments in Time Series Analysis; Rao, T.S., Ed.; Chapman and Hall/CRC: Boca Raton, FL, USA, 1993; pp. 310–330.
- Ljung, G.M.; Box, G.E.P. On a measure of lack of fit in time series models. Biometrika 1978, 65, 297–303.
- Weiß, C.H. The INARCH(1) model for overdispersed time series of Counts. Commun. Stat.-Simul. Comput. 2010, 39, 1269–1291.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).