Next Article in Journal
Panel Data Models for School Evaluation: The Case of High Schools’ Results in University Entrance Examinations
Previous Article in Journal
A Class of Enhanced Nonparametric Control Schemes Based on Order Statistics and Runs
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New Soft-Clipping Discrete Beta GARCH Model and Its Application on Measles Infection

School of Mathematics and Statistics, Henan University, Kaifeng 475004, China
Stats 2023, 6(1), 293-311; https://doi.org/10.3390/stats6010018
Submission received: 17 January 2023 / Revised: 29 January 2023 / Accepted: 8 February 2023 / Published: 9 February 2023

Abstract

:
In this paper, we develop a novel soft-clipping discrete beta GARCH (ScDBGARCH) model that provides an available method to model bounded time series with under-dispersion, equi-dispersion or over-dispersion. The new model not only allows positive dependence, but also negative dependence. The stochastic properties of the models are established, and these results are, in turn, used in the analysis of the asymptotic properties of the conditional maximum likelihood (CML) estimator of the new model. In addition, we apply the new model to measles infection to show its improved performance.

1. Introduction

More and more authors have underlined the importance and the common occurrence of the bounded integer-valued time series over more than two decades. McKenzie [1] proposed the binomial AR (BAR) model based on the binomial thinning operator to analyze the bounded integer-valued time series. To monitor bounded data (which increase at a certain point and then slowly decrease to the initial level), Weiß and Testik [2] further discussed the BAR model by constructing positive additive outliers; see Möller et al. [3] for its some extensions of zero inflation and Chen et al. [4] for its two types of innovative outliers. Kang et al. [5] proposed an extended binomial AR(1) model based on the generalized binomial thinning operator, which relaxes the independence assumption of the binomial thinning operator. To analyze bounded data with under-dispersion, equi-dispersion and over-dispersion, Chen et al. [6] first constructed the Conway–Maxwell–Poisson–binomial thinning operator based on the Conway–Maxwell–Poisson–binomial distribution [7], and then proposed the Conway–Maxwell–Poisson–binomial AR model. To accurately and flexibly capture the correlation structure between two random coefficients in the BAR process, Zhang et al. [8] proposed a new version of the BAR model by using the Farlie–Gumbel–Morgenstern copula, which allows both positive and negative correlations.
In fact, the volatility (especially the heteroscedasticity) is a reality for many important processes and cannot be described by the above models. Here, we take the number of districts with new cases of measles infection per week in the year 2016–2017 reported in n = 38 Germany’s districts as an example and present its path in Figure 1, which shows that there seems to be more variation at the median of the time series, where the level also appears to be higher.
For this purpose, Weiß and Pollett [9] considered a linear binomial ARCH(1) model, which is generalized to the pth-order case by Ristić et al. [10]. Lee and Lee [11] further discussed a version of the linear binomial ARCH(1) model with a feedback mechanism. Chen et al. [12] proposed two classes of dynamic binomial ARCH models to model time series with a finite range. Chen et al. [13] generalized the binomial ARCH model to the beta-binomial GARCH model, which allows both the conditional and marginal binomial indices of dispersion to be greater than one, i.e., data with extra-binomial variation can be more adequately captured than binomial GARCH-type models. See Liu et al. [14] for the bounded Poisson AR process and Liu et al. [15] for the novel category AR process.
However, the negative ACF cannot be achieved by the above models. To resolve this dilemma, Weiß and Jahn [16], inspired by the softplus INGARCH model [17], proposed the soft-clipping BGARCH model based on the soft-clipping function [18], i.e.,
S c c ( x ) = c log 1 + exp ( x / c ) 1 + exp ( ( x 1 ) / c ) , x ( 0 , 1 ) , c > 0 .
To further investigate the soft-clipping function, we give some example of the plot of S c c ( x ) in Figure 2, when c takes the value in { 0.02 , 0.05 , 0.1 , 0.3 , 0.5 , 0.7 , 1 , 1.5 } , x ( 0 , 1 ) . From Figure 2, S c c ( x ) tends to a linear function when c 0 .
“Although the beta-binomial distribution is very flexible with respect to its shape, it is, to a large extent focused on dealing with data sets which appear, in some way, to arise from binomial distributions but which are in fact overdispersed”, which was discussed by Turner [19]. Hence, another concern arises because beta-binomial distribution focuses on over-dispersion such that under-dispersed pseudo-binomial data sets (which are rare but do exist) cannot be analyzed by the beta-binomial GARCH-type models. To fill this gap, we proposed a new soft-clipping discrete beta GARCH (ScDBGARCH) model based on a re-scaled discrete beta binomial distribution. What is remarkable about the ScDBGARCH model is that it not only can be fitted to under-dispersed data (besides over-dispersed bounded data), but also allows negative dependence (besides positive dependence).
It is worth mentioning that the realization of negative dependence for the ScDBGARCH model is mainly due to the incorporated soft-clipping function. Another main contribution of this paper is that we establish the stochastic order of the discrete beta binomial distribution, and then discuss the stability property of the new model. In addition, we discuss the CML estimators and establish their asymptotic normality. Last but not least, we illustrate the availability and superiority in analyzing the count of districts with new cases of measles infection per week in the period of the year 2016–2017 reported in n = 38 of Germany’s districts.
The paper is organized as follows. Section 2 first gives a brief review of the discrete beta distribution, then gives the definition of the soft-clipping discrete beta GARCH model and its stability properties. Conditional maximum likelihood estimation and their asymptotic properties are established in Section 3. Section 4 provides real data to show the effectiveness of the new model. Conclusions are made in Section 5. Appendix A presents some auxiliary results.

2. Model Formulation and Stability Properties

2.1. Discrete Beta Distribution

For the readers’ convenience, we first give a brief review of the discrete beta distribution, which is introduced by Turner [19].
A random variable X taking values in { n bot , n bot + 1 , n bot + 2 , , n top } is said to follow a discrete beta distribution with parameters ( α , β ) if its probability mass function of X takes the form
P ( X = x | α , β , n ) = 1 Z ( α , β ) f x n bot + 1 n top n bot + 2 , x = n bot , n bot + 1 , , n top ,
where
f ( x ) = 1 B ( α , β ) x α 1 ( 1 x ) β 1 , B ( α , β ) = Γ ( α ) Γ ( β ) Γ ( α + β ) , Z ( α , β ) = x = n bot n top f x n bot + 1 n top n bot + 2 ,
where n top N is the predetermined upper limit of the range and n bot = 0 or 1 is the predetermined lower limit of the range. For simplicity, we denote X DB 1 ( n bot , n top , α , β ) .
Furthermore, the probability mass function (given in (2)) of X can be rewritten as the exponential family form, i.e.,
P ( X = x | α , β ) = h ( x ) exp α T 1 ( x ) + β T 2 ( x ) A ( α , β ) ,
where A ( α , β ) = log x = n bot n top h ( x ) exp α T 1 ( x ) + β T 2 ( x ) , T 1 ( x ) = log x n bot + 1 n top n bot + 2 , T 2 ( x ) = log n top x + 1 n top n bot + 2 ,   h ( x ) = ( n top n bot + 2 ) 2 ( x n bot + 1 ) ( n top x + 1 ) .
In fact, f ( · ) involving in (2) is the probability density function of the beta distribution with parameters α and β . By Lemma A2 in Appendix A, one can obtain the mean, variance and BID of the DB 1 ( n bot , n top , α , β ) , if n bot = 0 and n top . Similarly, the moments of DB 1 ( 1 , n top , α , β ) can be obtained if n top . It is worth mentioning that μ b = α α + β and σ b 2 = α β ( α + β ) 2 ( 1 + α + β ) (given in Lemma A2) are precisely the mean and variance of the beta distribution with parameters α and β . Hence, we consider a reparameterization of the discrete beta distribution given in (2) by setting p = α / ( α + β ) and τ = α + β . For simplicity, we rewrite DB 1 ( n bot , n top , α , β ) as DB 2 ( n bot , n top , p , τ ) .
Unfortunately, the specific range of BID for the DB 2 ( n bot , n top , p , ϕ ) distribution cannot be obtained, except the case for n top + . To solve this dilemma, we give an example of the BID in Figure 3 with n : = n top ( 2 , 4 , 6 , 8 ) and n bot = 0 , when p and ϕ are varying from 0.1 to 0.9 with increment 0.1 and 0.1 to 8.1 with increment 0.1, respectively. See Figure 4 for n bot = 1 .
From Figure 3, we have the following observations. First, for given τ , when p 1 , the BID is decreasing but greater than 1, except for that of n top = 2 (the BID is less than 1, if p 1). Second, when p takes a small value, the BID takes the maximum if τ takes the boundary value. Third, for the given p and τ , the BID tends to a greater value when n top is increasing.
From Figure 4, we have the following observations. First, for n top = 2 , the BID seems to be increasing but lower than 1 when p 1 , τ . Second, for n top > 2 , if τ takes the non-boundary value, the BID seems to be increasing and then decreasing when p 1 ; otherwise, the BID seems to be decreasing. Third, for the given p and τ , the BID tends to a greater value when n top is increasing.
To sum up, the discrete beta distribution allows to model bounded data with under-dispersion, equi-dispersion and over-dispersion.
Similar to the statistical-order property of the one-parameter exponential family in [20], Proposition 1 illustrates that it does hold for the DB 1 distribution.
Proposition 1. 
Suppose X i DB 1 ( n bot , n top , α i , β i ) , i = 1 , 2 . If α 1 α 2 and β 1 β 2 , then the following conclusions hold and are equivalent:
(1) 
X 1 l r X 2 ,
(2) 
p 1 p 2 ,
where p i = α i / ( α i + β i ) , i = 1 , 2 .
Proof. 
(1) It is easy to see that X i exhibits the following probability density function
f X i ( x ) = 1 Z ( α i , β i ) 1 B ( α i , β i ) x n bot + 1 n top n bot + 2 α i 1 1 x n bot + 1 n top n bot + 2 β i 1 ,
where Z ( α i , β i ) = x = 0 n 1 B ( α i , β i ) x n bot + 1 n top n bot + 2 α i 1 1 x n bot + 1 n top n bot + 2 β i 1 . Hence,
l ( x ) : = f X 1 ( x ) f X 2 ( x ) = Z ( α 2 , β 2 ) B ( α 2 , β 2 ) Z ( α 1 , β 1 ) B ( α 1 , β 1 ) x n bot + 1 n top n bot + 2 α 1 α 2 1 x n bot + 1 n top n bot + 2 β 1 β 2 x n bot + 1 n top n bot + 2 α 1 α 2 1 x n bot + 1 n top n bot + 2 β 1 β 2
and
l ( x ) ( α 1 α 2 ) x n bot + 1 n top n bot + 2 α 1 α 2 1 1 x n bot + 1 n top n bot + 2 β 1 β 2 ( β 1 β 2 ) x n bot + 1 n top n bot + 2 α 1 α 2 1 x n bot + 1 n top n bot + 2 β 1 β 2 1 0
with equality only if α 1 = α 2 and β 1 = β 2 . Hence, X 1 l r X 2 . Furthermore, p 1 p 2 by Theorem 4.2 in Wang [21].
(2) Note that if α 1 α 2 and β 1 β 2 ,
1 p 1 1 p 2 = α 1 + β 1 α 1 α 2 + β 2 α 2 = β 1 α 1 β 2 α 2 0 .
Hence, p 1 p 2 , if α 1 α 2 and β 1 β 2 , and vice versa. Therefore, X 1 l r X 2 . The proof is end. □

2.2. Discrete Beta GARCH(1,1) Model with a Nearly Linear Structure

Inspired by Weiß and Jahn [16] and DB 2 distribution, we give the definition of the ScDBGARCH(1,1) model by
Z t | F t 1 DB 2 ( n bot , n top , p t , τ ) , p t = S c c ( w + α 1 p t 1 + β 1 Z t 1 / n top ) ,
where F t is the σ -field generated by { Z t , p t , t Z } , τ > 0 , | α 1 | < 1 , | β 1 | < 1 and | α 1 | + | β 1 | < 1 , S c c ( x ) = c log ( 1 + exp ( x / c ) ) / ( 1 + exp ( ( x 1 ) / c ) ) , x ( 0 , 1 ) , c > 0 , n bot = 0 or 1 and n top N is the predetermined upper limit of the range.
By (3) and (4), the conditional probability mass function of { Z t } takes the form
P ( Z t = z t | F t 1 ) = h ( z t ) exp p t τ T 1 ( z t ) + ( 1 τ ) p t T 2 ( z t ) A ( p t τ , ( 1 τ ) p t ) ,
where h ( z t ) = ( n top n bot + 2 ) 2 ( z t n bot + 1 ) ( n top z t + 1 ) , T 1 ( z t ) = log z t n bot + 1 n top n bot + 2 ,   T 2 ( z t ) = log n top z t + 1 n top n bot + 2 ,   A p t τ , ( 1 τ ) p t = log i = n bot n top h ( i ) exp ( p t τ T 1 ( i ) + ( 1 τ ) p t T 2 ( i ) ) .
Note that Proposition 1 presents that the new discrete beta distribution exhibits a statistical-order property, which is similar to the one-parameter exponential family in Davis and Liu [20]. Hence, a natural idea of the stability of the ScDBGARCH model is using the theory of the iterated random function approach [22] to construct the stability properties of the ScDBGARCH model. For this purpose, we first illustrate the stochastic order of the coupling process { Z t , λ t , t Z } given in (4), and then account for the moment property of | Z i Z j | ( i j ), which is essential to derive the stability of the proposed model.
Proposition 2.
If { Z t , p t , t Z } satisfies (4), then Z 1 l r Z 2 , if p 1 l r p 2 , where “lr” denotes the likelihood ratio.
The result of Proposition 2 can be obtained by Proposition 1, i = 1 , 2 . We omit it.
Proposition 3.
For all i = 1 , 2 , if Z i DB 2 ( n bot , n top , p t , τ ) and F λ i is the cumulative distribution function of DB 2 ( n bot , n top , p t , τ ) with μ i = z = n bot n top z P ( Z i = z ) and F λ i 1 ( u ) : = inf { t 0 , F μ i ( t ) u } , then E | Z 1 Z 2 | = | λ 1 λ 2 | , where u is a uniform random variable in ( 0 , 1 ) and Z i = F λ i 1 ( u ) .
Proof. 
Denote λ i = x = n bot n top x P ( X i = x ) with X i DB 1 ( n bot , n top , α i , β i ) . Similar to the first item of Proposition 1, λ 1 l r λ 2 , if α 1 α 2 , β 1 β 2 . Therefore, Z 1 l r Z 2 by Proposition 2, i.e., Z 1 s t Z 2 and F λ 1 1 ( t ) F λ 2 1 ( t ) , t ( 0 , 1 ) . Hence E | Z 1 Z 2 | = E ( Z 2 Z 1 ) = λ 2 λ 1 = | λ 1 λ 2 | . Similarly, E | Z 1 Z 2 | = E ( Z 1 Z 2 ) = | λ 1 λ 2 | , if λ 1 λ 2 . Thus, E | Z 1 Z 2 | = | λ 1 λ 2 | . The proof is complete. □
In the following, we demonstrate that S c c ( · ) satisfies the contraction condition by using Lemma A1 in Appendix A, i.e., z 1 , z 2 0 ( z 1 z 2 ) , p 1 , p 2 0 ( p 1 p 2 ) , there exist α and β such that
| S c c w + α 1 p 1 + β 1 z 1 n top S c c w + α 1 p 2 + β 1 z 2 n top | < | α | | p 1 p 2 | + | β | | z 1 z 2 | , S c c ( w + α 1 p 1 + β 1 z 1 ) α p 1 + β z 1 + S c c ( w ) , | S c c ( w + α 1 p 1 + β 1 z 1 / n top ) S c c ( w + α 1 p 2 + β 1 z 1 / n top ) | | α | | p 1 p 2 | .
where | α 1 | < 1 and | β 1 | < 1 .
Assumption 1.
The parametric space Θ = { θ = ( w , α 1 , β 1 , ϕ ) } is compact with w R , 0 < ϕ < 1 , | α 1 | < 1 , | β 1 | < 1 and | α 1 | + | β 1 | < 1 .
Theorem 1. 
Let { Z t , t Z } satisfy (4). If Assumption 1 and the contraction condition (6) hold, then the following results hold:
(1) 
If π is a stationary distribution and p 0 π is independent of p 0 π , then { p t , t Z } is geometric-moment contracting with unique stationary distribution π and E π p 1 < .
(2) 
There exists a measurable function G : D = { ( n 1 , n 2 , ) , n i D } D such that p t = a . s . G ( Z t 1 , Z t 2 , ) , i.e., p t is F t 1 -measurable, where D = [ 0 , n ] .
(3) 
If { p t } starts from π, i.e., p 0 π , then { Z t } is a stationary time series. Furthermore, { Z t , p t } is strictly stationary and ergodic.
By Propositions 3 and (6), Theorem 1 can be proved in a similar way in Davis and Liu [20] and Chen et al. [13], and we omit it here.
It is worth mentioning that the incorporated soft-clipping function results in negative auto-regression, besides the positive auto-regression and over-dispersion. Unfortunately, because of the complexity of the discrete beta distribution, we have the closed forms of the auto-regressive coefficient. To get an idea about the abilities of the ScDBGARCH(1,1) model with c = 0.01 for explaining different autocorrelation structures, we present some ACF(2)-ACF(1) plots for the ScDBGARCH(1,1) model in Figure 5 and Figure 6. To be precise, for given n top = 10 and n bot = 0 or 1, sample size T = 200 and τ = 1 , we let β 1 = 0.05 and w = 0.5 ( 1 | α 1 | | β 1 | ) with α 1 , varying from 0.9 to 0.9 with an increment of 0.1, and we compute the values of ACF(1), ACF(2) and plot them against each other.
From Figure 5 and Figure 6, both negative ACF and non-negative ACF are allowed by the novel ScDBGARCH model, while negative ACF is rejected by the binomial GARCH-type models [10,12], i.e., the novel ScDBGARCH model is much more flexible than the classical binomial GARCH models with respect to the auto-regressive structure.
To be honest, the merit of the model ScDBGARCH goes beyond allowing negative auto-regression, and also allowing under-dispersion. To account for the dispersion, we present the plots of the BID (in Figure 7 and Figure 8) for the ScDBGARCH(1,1) model, for given n top = 10 or 2 and n bot = 0 or 1, sample size T = 200 and τ = 1 when α 1 is varying from 0.9 to 0.9 with an increment 0.1, β 1 = 0.05 and w = 0.5 ( 1 | α 1 | | β 1 | ) .
From Figure 7 and Figure 8, under-dispersion (besides over-dispersion) is allowed, especially for the ScDBGARCH model with a smaller n top . Hence, the ScDBGARCH model provides an available way to analyze bounded integer-valued time series counts.
Remark 1. 
Similar to the BGARCH(1,1) model [11] and the BBGARCH(1,1) model [13], we can define the following two models:
  • Soft-clipping beta-binomial GARCH(1,1) model with
    Z t | F t 1 BB ( n , p t , ϕ ) , p t = S c c ( w + α 1 p t 1 + β 1 Z t 1 / n ) ,
    where w R , 0 < ϕ < 1 , | α 1 | < 1 , | β 1 | < 1 and | α 1 | + | β 1 | < 1 .
    Obviously, this model, given in (7), is an example of the BBGARCH(1,1) model in [13]. For convenience, we recall it as the ScBBGARCH(1,1) model.
  • Soft-clipping binomial GARCH(1,1) model [16] with
    Z t | F t 1 Bin ( n , p t ) , p t = S c c ( w + α 1 p t 1 + β 1 Z t 1 / n ) ,
    where w R , 0 < ϕ < 1 , | α 1 | < 1 , | β 1 | < 1 and | α 1 | + | β 1 | < 1 .
    Obviously, this model, given in (8), can be regarded as a further generation of the BARCH-type model; see [10,11,12]. For convenience, we recall it as the ScBGARCH(1,1) model.

3. Parameter Estimation

In this section, we use the conditional maximum likelihood method to estimate the parameters involved in the ScDBGARCH(1,1) model and study their asymptotic behavior. Let θ = ( w , α , β , τ ) . Denote n top and n bot as the upper and lower ranges, and T N represents the size of the sample. { Z 0 , Z 1 , , Z T } is a realization of { Z t } , which can be obtained by the following steps: First, we let p 0 = S c c ( w ) and set a pre-run = 500, then generate { Z 0 , p 1 , Z 1 , , p 500 , Z 500 } , where p t is obtained by (4) and Z t is generated by using rdb ( ) function in the ddb package; see Turner [19] for more details. Second, we use p 500 as a new initial value of p t and rewrite it as p 0 , then generate { Z 0 , Z 1 , Z 2 , · · · , Z T } .
By (5), the conditional log-likelihood function of (4) can be written as
log L ( θ ) = t = 1 T log P ( Z t = z t | F t 1 ) = t = 1 T log ( h ( z t ) ) + t = 1 T η 1 T 1 ( z t ) + η 2 T 2 ( z t ) A ( η 1 , η 2 ) ,
where h ( z t ) = ( n top n bot + 2 ) 2 ( z t n bot + 1 ) ( n top z t + 1 ) , T 1 ( z t ) = log z t n bot + 1 n top n bot + 2 , η 1 = τ p t , T 2 ( z t ) = log n top z t + 1 n top n bot + 2 , A ( η 1 , η 2 ) = log i = n bot n top h ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) and η 2 = ( 1 τ ) p t . Then the CML estimator θ ^ c m l is obtained by maximizing (9).
Note that t = 1 T log ( h ( z t ) ) in (9) is a constant for a given sample. Hence, the conditional log-likelihood function given in (9) can be simplified and denoted as
( θ ) = t = 1 T l t ( θ ) = t = 1 T η 1 T 1 ( z t ) + η 2 T 2 ( z t ) A ( η 1 , η 2 )
and θ ^ c m l can be obtained by maximizing (10), i.e., θ ^ c m l is a solution of the score equation
0 = t = 1 T l t ( θ ) θ = t = 1 T T 1 ( z t ) η 1 θ T 2 ( z t ) η 2 θ A ( η 1 , η 2 ) η 1 η 1 θ + A ( η 1 , η 2 ) η 2 η 2 θ = t = 1 T T 1 ( z t ) A ( η 1 , η 2 ) η 1 η 1 θ + t = 1 T T 2 ( z t ) A ( η 1 , η 2 ) η 2 η 2 θ = t = 1 T T 1 ( z t ) A 1 ( η 1 , η 2 ) η 1 θ + t = 1 T T 2 ( z t ) A 2 ( η 1 , η 2 ) η 2 θ ,
where η 1 : = η 1 ( θ ) = τ p t , η 2 : = η 2 ( θ ) = p t ( 1 τ ) , p t = S c ( u t ) ,
A ( η 1 , η 2 ) / η 1 : = A 1 ( η 1 , η 2 ) = i = n bot n top h ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) T 1 ( i ) / B ( η 1 , η 2 ) , A ( η 1 , η 2 ) / η 2 : = A 2 ( η 1 , η 2 ) = i = n bot n top h ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) T 2 ( i ) / B ( η 1 , η 2 ) ,
η 1 θ = τ S c c ( u t ) τ S c c ( u t ) p t 1 τ S c c ( u t ) z t 1 / n top S c c ( u t ) , η 2 θ = ( 1 τ ) S c c ( u t ) ( 1 τ ) S c c ( u t ) p t 1 ( 1 τ ) S c c ( u t ) z t 1 / n top S c c ( u t )
with A ( η 1 , η 2 ) = log B ( η 1 , η 2 ) , B ( η 1 , η 2 ) = i = n bot n top h ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) ,   u t = w + α 1 p t 1 + β 1 z t 1 / n top and p t = S c c ( u t ) .
Furthermore, the Hessian matrix (denoted as H T ( θ ) ) for model (4) is obtained by further differentiation of the score equation, i.e., H T ( θ ) = t = 1 T 2 l t θ θ with 2 l t θ θ equaling to
( T 1 A 1 ) 2 η 1 θ θ + ( T 2 A 2 ) 2 η 2 θ θ A 11 η 1 θ η 1 θ A 22 η 2 θ η 2 θ ( A 12 + A 21 ) η 1 θ η 2 θ ,
where A 1 : = A 1 ( η 1 , η 2 ) , A 2 : = A 2 ( η 1 , η 2 ) , A i j : = A i j ( η 1 , η 2 ) = A i ( η 1 , η 2 ) / η j , i , j = 1 , 2 and
2 η 1 θ θ = τ S c c τ S c c p t 1 τ S c c z t 1 / n top S c c τ S c c p t 1 τ S c c p t 1 2 τ S c c p t 1 z t 1 / n top p t 1 S c c τ S c c z t 1 / n top τ S c c p t 1 z t 1 / n top τ S c c z t 1 2 / n top 2 S c c z t 1 / n top S c c p t 1 S c c S c c z t 1 / n top 0 ,
2 η 2 θ θ = ( 1 τ ) S c c ( 1 τ ) S c c p t 1 ( 1 τ ) S c c z t 1 / n top S c c ( 1 τ ) S c c p t 1 ( 1 τ ) S c c p t 1 2 ( 1 τ ) S c c ȃ p t 1 z t 1 / n top p t 1 S c c ( 1 τ ) S c c z t 1 / n top ( 1 τ ) S c c p t 1 z t 1 / n top ( 1 τ ) S c c z t 1 2 / n top 2 S c c z t 1 / n top S c c p t 1 S c c S c c z t 1 / n top 0
with S c c : = S c c ( u t ) and S c c : = S c c ( u t ) .
Lemma 1. 
Denote g ( x , η 1 , η 2 ) = T 1 ( x ) η 1 + T 2 ( x ) η 2 A ( η 1 , η 2 ) . For all x R , g ( x , η 1 , η 2 ) = g ( x , η 1 , η 2 ) if and only if η 1 = η 1 and η 2 = η 2 , where h ( x ) = ( n top n bot + 2 ) 2 ( x n bot + 1 ) ( n top x + 1 ) , T 1 ( x ) = log x n bot + 1 ) / ( n top n bot + 2 ) , T 2 ( x ) = log ( n top x + 1 ) / ( n top n bot + 2 ) , A ( η 1 , η 2 ) = log i = n bot n top h ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) , n bot = 0 or 1 and n top is considered a known quantity.
Proof. 
Note that g ( x , η 1 , η 2 ) is continuously differentiable; hence,
g ( x , η 1 , η 2 ) η 1 = T 1 ( x ) i = 0 n h ( i ) T 1 ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) i = 0 n h ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) , g ( x , η 1 , η 2 ) η 2 = T 2 ( x ) i = 0 n h ( i ) T 2 ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) i = 0 n h ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) .
Because h ( i ) T 1 ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) / h ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) is strictly increasing in terms of η 1 or η 2 , so does for
h ( i ) T 2 ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) / h ( i ) exp ( η 1 T 1 ( i ) + η 2 T 2 ( i ) ) .
Hence, g ( x , η 1 , η 2 ) / η 1 = g ( x , η 1 , η 2 ) / η 2 if and only if η 1 = η 1 and η 2 = η 2 .
To sum up, g ( x , η 1 , η 2 ) = g ( x , η 1 , η 2 ) if and only if η 1 = η 1 and η 2 = η 2 , x R . □
Assumption 2.
If there exists a t 1 such that Z t ( θ 0 ) = Z t ( θ ) , P ( z | F t 1 ) θ 0 a.s., then θ = θ 0 , where P ( z | F t 1 ) θ 0 = P ( Z t = z | F t 1 ) θ 0 is the probability measure under the true parameter θ 0 and F t 1 .
Assumption 2 establishes the identification of the ScDBGARCH(1,1) model based on Lemma 1.
Theorem 2.
Let { Z t , t Z } be a stationary and ergodic sequence with a finite range and its conditional mean process { λ t } satisfy (4), the contraction condition (6). If Assumptions 1 and 2 hold, then, as T , we obtain the following results:
(1) 
There exists an estimator θ ^ 2 c m l such that θ ^ c m l a . s . θ ;
(2) 
T ( θ ^ c m l θ ) d N 0 , H 1 ( θ ) I ( θ ) H 1 ( θ ) ,
where I ( θ ) : = E l t ( θ ) θ l t ( θ ) θ and H ( θ ) : = E 2 l t ( θ ) θ θ .
The proof of Theorem 2 is similar to Theorem 4 in Chen et al. [12]. We omit it.

4. Real Data Example

In this section, we reconsider the number of districts with new cases of measles infection per week in the year 2016–2017 reported in n = 38 of Germany’s districts. The dataset is taken from the “SurvStat” data (https://survstat.rki.de/Content/Query/Main.aspx) (accessed on 10 December 2022), which have been reported to the Robert Koch Institute by local and state health departments. See Figure 1 for its sample path.
By communication, the sample mean and variances are 4.3173 and 8.3546, respectively. The ACF and PACF plots are given in Figure 9, respectively.
Besides the ScDBGARCH(1,1) model, the ScBBGARCH(1,1) model given in (7) and the ScBGARCH(1,1) model given in (8) with c = 0.01 , we also choose the following compared models:
  • BARCH(p) model [10] with
    Z t | F t 1 Bin ( n , p t ) , p t = a 0 + k = 1 p a k Z t k / n , p = 1 , 2 ;
  • logit-BARCH(p) model [12] with
    Z t | F t 1 Bin ( n , p t ) , logit ( p t ) = α 0 + k = 1 p logit ( α k ) Z t k , p = 1 , 2 ;
  • score-BARCH(1) model [12] with
    Z t | F t 1 Bin ( n , p t ) , logit ( p t ) = α 0 + α 1 logit ( p t 1 ) + α 2 s t , s t = n p t Z t ;
  • BGARCH(1,1) model [11] with
    Z t | F t 1 Bin ( n , p t ) , p t = α 0 + α 1 p t 1 + α 2 Z t 1 / n ;
  • logit-BBGARCH(1,1) model [13] with Z t | F t 1 BB ( n , p t , ϕ ) and its mean process { λ t } satisfying logit ( λ t ) = w + α 1 logit ( λ t 1 ) + β 1 Z t 1 .
In the following, we use the above models to fit the measles infection’s data by the CML method and compare their estimated standard error (SE), −log-likelihood (−log-lik), AIC and BIC, where SE is computed by Theorem 2 and ϕ ^ = 1 / ( 1 + τ ^ ) . The CML estimates and approximated standard errors of parameters (including the fitted values of −log-lik, AIC and BIC) are summarized in Table 1.
From Table 1, we have the following observations. For the BARCH-type models, the BARCH(2) model with a linear transformation takes the smallest −log-lik, AIC and BIC. For the GARCH-type models, the ScDBGARCH(1,1) model takes the smallest −log-lik, AIC and BIC, followed by the ScBGARCH(1,1) model, which may be attributed to the merits of the soft-clipping function.For all compared models, the ScDBGARCH(1,1) model takes the smallest −log-lik, AIC and BIC. Hence, the ScDBGARCH(1,1) model is more suitable for the measles data.
To further check the adequacy of the ScDBGARCH(1,1) model, we analyze its Pearson residuals, which are defined by e t = ( Z t μ ^ t ) / σ ^ t 2 with μ ^ t = z = 0 n Z t P ( Z t = z | F t 1 ) and σ ^ t 2 = z = 0 n ( z μ ^ t ) 2 P ( Z t = z | F t 1 ) . As discussed in Weiß [23], “for an adequate model, its fitted standardized Pearson residuals are expected to be uncorrelated with a mean about 0 and a variance about 1”.
First, we calculate that the mean and variance of the Pearson residuals of the ScDBGARCH(1,1) model are 0.0059 and 1.0107 , which implies that the ScDBGARCH(1,1) model demonstrates adequacy. Second, we give its residual analysis in Figure 10, which also shows that this model does rather well.
Third, we consider the fitted values of the Ljung–Box test based on lags k = 3, 5, 7, 9, 11, 13, and 15, including their p-values and their critical values ( χ 0.95 2 ( k ) ) with 0.05 confidence, and summarize them in Table 2.
Table 2 shows that all of the Ljung–Box statistics are less than the corresponding critical values, and the p-values are much greater than the significant level 0.05. Hence, both of them further illustrate the availability of the ScDBGARCH(1,1) model in analyzing the measles data. To sum up, the ScDBGARCH(1,1) model shows better performance in analyzing the measles data.

5. Concluding and Discussion

This paper considers a new and flexible soft-clipping discrete beta GARCH(1,1) model, which not only allows positive correlation, but also negative correlation, as well as under-dispersion, equi-dispersion and over-dispersion. We discuss some properties of the new model, the CML estimate of the parameters involved in the novel model, and the large-sample property of the CML estimate. The applicability and superior of the ScDBGARCH model are illustrated by a real data example.
Like linear binomial ARCH/GARCH-type models [10,11], logit binomial ARCH-type models [12] or beta-binomial GARCH-type models [13], the ScDBGARCH model is applicable to analyze stationary non-negative data with a finite range and will be invalid for data with some time trends. Two natural methods arise, and both of them deserve a detailed analysis in a future project.
One popular method is incorporated into the covariate processes when constructing a new model. Similar to the logit-BBGARCHX model [24] and the PARX model [25], one can establish a model with covariates, taking the ScDBGARCH(1,1) model as an example:
Z t | F t 1 DB 2 ( n n top , n bot , p t , τ ) , p t = S c c ( w + α 1 p t 1 + β 1 Z t 1 / n top + f ( X t 1 , γ ) ) ,
where n top N is a predetermined upper limit of the range, n bot = 0 or 1 is a predetermined lower limit of the range, X t = ( X 1 t , X 2 t , , X d t ) is a d-dimensional exogenous covariate vector, F t is the σ -field generated by { Z s , λ s , X s , s < t } , f ( · , γ ) : R d R , γ is the additional parameter vector involved in f ( · , · ) , and ( w , α 1 , β 1 , τ ) is the parameter vector with τ > 0 , | α 1 | < 1 , | β 1 | < 1 and | α 1 | + | β 1 | < 1 . When discussing the statistical property, an essential and unavoidable point is the specific form of f ( · , · ) . See Chen and Khamthong [26] for Markov-switching cases.
Specially, if the considered data have a periodic trend, one can consider a s-periodically distributed sequence { Z t , t Z } and its mean process { λ t } satisfying (4), i.e., the s-periodicity of { Z t , t Z } is understood in the sense that Z t = d Z k s + τ for all k , t Z , τ = 1 , 2 , , s , where = d denotes equality in distribution. To highlight the periodicity, one can consider the model by letting t = k s + τ , k Z , τ = 1 , 2 , , s , and
Z k s + τ | F k s + τ 1 DB 2 ( n , p k s + τ , ϕ k s + τ ) , λ k s + τ : = ( n + 2 ) p k s + τ 1 = S c ( w k s + τ + α 1 , k s + τ λ k s + τ 1 + β 1 , k s + τ Z k s + τ 1 ) ,
where | α 1 , τ | < 1 , | β 1 , τ | < 1 and τ = 1 s | α 1 , τ | + | β 1 , τ | < 1 . See Aknouche et al. [27] for a general periodic mixed Poisson autoregression.
The other popular method is to remove the time trend by using the difference method, but having a negative value emerge (besides non-negative bounded data), i.e., Z -valued bounded data emerge. As far as we know, existing GARCH-type models are constructed by random rounding operators (see Liu and Yuan [28]), some Z -valued discrete distributions (see Alomani et al. [29], Carallo et al. [30], Cui et al. [31]), difference of two independent non-negative INGARCH models (see Gonçalves and Mendes-Lopes [32]) and non-negative INGARCH models multiplying by some special Z -valued discrete random variables (see Xu and Zhu [33]). However, they focus on Z -valued data with infinite range and cannot apply to bounded data. Hence, a future project in term of the Z -valued bounded data deserves to be considered.
In addition, as discussed in Chen et al. [6], the Conway–Maxwell–Poisson–binomial AR model shows better performance in analyzing bounded time series counts with under-dispersion, equi-dispersion and over-dispersion. A class of the Conway–Maxwell–Poisson–binomial GARCH model deserves to be considered to analyze volatility for integer-valued time series with a finite range. Similar to Bulla et al. [34] and Chen et al. [35], a signed Conway–Maxwell–Poisson–binomial (SCMPB) thinning operator and a bivariate INAR model based on the SCMPB thinning operator also deserve to be considered to analyze bivariate dependent time series with finite ranges.

Funding

Chen’s work is funded by Natural Science Foundation of Henan Province (No. 222300420127) and Postdoctoral research in Henan Province (No. 202103051).

Data Availability Statement

The number of districts with new cases of measles infection per week in the year 2016–2017 reported in n = 38 Germany’s districts is taken from the “SurvStat” (https://survstat.rki.de/Content/Query/Main.aspx) on 12 December 2019.

Acknowledgments

The author thanks the Editor-in-Chief and the anonymous referees for the valuable comments and suggestions that resulted in a substantial improvement of this paper. We acknowledge the constructive suggestions from Fukang Zhu of Jilin University on the work.

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
| x | absolute of x, x R ;
lrlikelihood ratio;
s t stochastic small;
= d equality in distribution.
a . s . almost surely convergence;
d convergence in distribution.

Appendix A. Auxiliary Results

Lemma A1. 
Let S c c ( x ) = c log 1 + exp ( x / c ) 1 + exp ( ( x 1 ) / c ) , x R , c > 0 . Then,
(1) 
S c c ( x ) = exp ( x / c ) 1 + exp ( x / c ) exp ( x 1 ) / c 1 + exp ( x 1 ) / c and | S c ( x ) | 1 / 2 ;
(2) 
x 1 R , x 2 R and x 1 x 2 , | S c c ( x 2 ) S c c ( x 1 ) | 1 2 | x 2 x 1 | ;
(3) 
S c c ( x ) = exp ( x / c ) c ( 1 + exp ( x / c ) ) 2 exp ( x 1 ) / c c 1 + exp ( x 1 ) / c 2 and | S c c ( x ) | 1 2 c ;
(4) 
S c c ( x ) = exp ( x / c ) ( exp ( x / c ) 1 ) c 2 ( 1 + exp ( ( x 1 ) / c ) ) 3 exp ( x 1 / c ) ( exp ( x 1 ) / c 1 ) c 2 1 + exp ( x 1 ) / c 3 and | S c c ( x ) | 1 4 c 2 .
Proof. 
(1) Because S c c ( x ) is a continuously differentiable function in R , S c c ( x ) exists and
S c c ( x ) = exp ( x / c ) 1 + exp ( x / c ) exp ( x 1 ) / c 1 + exp ( x 1 ) / c
and
| S c c ( x ) | exp ( x / c ) 1 + exp ( x / c ) + exp ( x 1 ) / c 1 + exp ( x 1 ) / c 1 4 + 1 4 = 1 2
by Lemma 4 in [12].
(2) By using the mean value theorem, there exists at least one point δ ( x 1 , x 2 ) , x 1 x 2 , such that
S c c ( x 2 ) S c c ( x 1 ) = S c c ( δ ) ( x 2 x 1 ) ,
where S c c ( δ ) = exp ( ξ / c ) 1 + exp ( δ / c ) exp ( ( δ 1 ) / c ) 1 + exp ( ( δ 1 ) / c ) . Hence, | S c c ( δ ) | 1 / 2 and | S c c ( x 2 ) S c ( x 1 ) | 1 2 | x 2 x 1 | , x 1 x 2 .
(3) According to item (1), S c c ( x ) is a continuously differentiable function in R , thus S c c ( x ) exists and S c c ( x ) = exp ( x / c ) c ( 1 + exp ( x / c ) ) 2 exp ( x 1 ) / c c 1 + exp ( x 1 ) / c 2 . Furthermore,
| S c c ( x ) | exp ( x / c ) c ( 1 + exp ( x / c ) ) 2 + exp ( x 1 ) / c c 1 + exp ( x 1 ) / c 2 1 / ( 4 c ) + 1 / ( 4 c ) = 1 / ( 2 c )
by ( a + b ) 2 4 a b , a R , b R .
(4) By (3), S c c ( x ) is a continuously differentiable function in R , and thus, S c c ( x ) exists and
S c c ( x ) = exp ( x / c ) ( exp ( x / c ) 1 ) c 2 ( 1 + exp ( x / c ) ) 3 exp ( ( x 1 ) / c ) ( exp ( x 1 ) / c 1 ) c 2 1 + exp ( x 1 ) / c 3 .
Furthermore, by using Lemma 4 in [12], we obtain
| S c c ( x ) | exp ( x / c ) ( exp ( x / c ) 1 ) c 2 ( 1 + exp ( x / c ) ) 3 + exp ( ( x 1 ) / c ) ( exp ( x 1 ) c 1 ) c 2 1 + exp ( x 1 ) / c 3 1 c 2 exp ( 2 x / c ) ( 1 + exp ( x / c ) ) 3 + 1 c 2 exp ( 2 ( x 1 ) / c ) 1 + exp ( x 1 ) / c 3 1 4 c 2 .
The proof is complete. □
Lemma A2.
Let X DB 1 ( n bot , n top , α , β ) with n bot = 0 and n top = n . If n + , then
(1).
E ( X ) ( n + 2 ) α α + β 1 = ( n + 2 ) μ b 1 ,
(2).
Var ( X ) ( n + 2 ) 2 σ b 2 ,
(3).
BID = n Var ( X ) E X ( n E X ) n ϕ ( n + 2 ) 2 p ( 1 p ) ( n + 2 ) 2 p ( 1 p ) ( n + 1 ) > 1 , if p ( 1 p ) > n + 1 ( 1 n ϕ ) ( n + 2 ) 2 , = 1 , if p ( 1 p ) = n + 1 ( 1 n ϕ ) ( n + 2 ) 2 , < 1 , if p ( 1 p ) < n + 1 ( 1 n ϕ ) ( n + 2 ) 2 , where μ b = p = α / ( α + β ) and σ b 2 = ϕ μ b ( 1 μ b ) with ϕ = 1 / ( 1 + α + β ) .
Proof. 
By (2), we compute that
E ( X ) = 1 Z ( α , β ) x = 0 n x f x + 1 n + 2 = ( n + 2 ) 2 Z ( α , β ) x = 0 n x + 1 n + 2 1 n + 2 f x + 1 n + 2 1 n + 2 = ( n + 2 ) 2 Z ( α , β ) x = 0 n x + 1 n + 2 f x + 1 n + 2 1 n + 2 ( n + 2 ) 2 Z ( α , β ) x = 0 n 1 n + 2 f x + 1 n + 2 1 n + 2 ( n + 2 ) 2 Z ( α , β ) 0 1 x f ( x ) d x n + 2 Z ( α , β ) 0 1 f ( x ) d x ( n + 2 ) 2 n + 2 0 1 x f ( x ) d x n + 2 n + 2 0 1 f ( x ) d x ( n + 2 ) α α + β 1 = ( n + 2 ) μ b 1 , E ( X 2 ) = 1 Z ( α , β ) x = 0 n x 2 f x + 1 n + 2 = ( n + 2 ) 3 Z ( α , β ) x = 0 n ( x + 1 ) 2 ( n + 2 ) 2 2 ( x + 1 ) ( n + 2 ) 2 + 1 ( n + 2 ) 2 f x + 1 n + 2 1 n + 2 = ( n + 2 ) 3 Z ( α , β ) x = 0 n x + 1 n + 2 2 f x + 1 n + 2 1 n + 2 2 ( n + 2 ) 2 Z ( α , β ) x = 0 n x + 1 n + 2 f x + 1 n + 2 1 n + 2 + n + 2 Z ( α , β ) x = 0 n f x + 1 n + 2 1 n + 2 = ( n + 2 ) 3 Z ( α , β ) 0 1 x 2 f ( x ) d x 0 1 x f ( x ) d x 2 + ( n + 2 ) 3 Z ( α , β ) 0 1 x f ( x ) d x 2 2 ( n + 2 ) 2 Z ( α , β ) 0 1 x f ( x ) d x + n + 2 Z ( α , β ) 0 1 f ( x ) d x ( n + 2 ) 2 α β ( α + β ) 2 ( 1 + α + β ) + ( n + 2 ) 2 α 2 ( α + β ) 2 2 ( n + 2 ) α α + β + 1 = ( n + 2 ) 2 σ b 2 + ( n + 2 ) 2 μ b 2 2 ( n + 2 ) μ b + 1 ,
where σ b 2 = α β ( α + β ) 2 ( 1 + α + β ) and μ b = α α + β . Hence,
Var ( X ) = E ( X 2 ) ( E X ) 2 ( n + 2 ) 2 σ b 2 + ( n + 2 ) 2 μ b 2 2 ( n + 2 ) μ b + 1 ( ( n + 2 ) μ b 1 ) 2 = ( n + 2 ) 2 σ b 2 .
Hence, the binomial index of dispersion (BID) of X satisfies
BID = n Var ( X ) E X ( n E X ) n ϕ ( n + 2 ) 2 p ( 1 p ) ( n + 2 ) 2 p ( 1 p ) ( n + 1 ) > 1 , if p ( 1 p ) > n + 1 ( 1 n ϕ ) ( n + 2 ) 2 , = 1 , if p ( 1 p ) = n + 1 ( 1 n ϕ ) ( n + 2 ) 2 , < 1 , if p ( 1 p ) < n + 1 ( 1 n ϕ ) ( n + 2 ) 2 .
The proof is complete. □

References

  1. McKenzie, E. Some simple models for discrete variate time series. J. Am. Water Resour. Bull. 1985, 21, 645–650. [Google Scholar] [CrossRef]
  2. Weiß, C.H.; Testik, M.C. On the Phase I analysis for monitoring time-dependent count processes. IIE Trans. 2015, 47, 294–306. [Google Scholar] [CrossRef]
  3. Möller, T.A.; Weiß, C.H.; Kim, H.Y.; Sirchenko, A. Modeling zero inflation in count data time series with bounded support. Methodol. Comput. Appl. Probab. 2018, 20, 589–609. [Google Scholar] [CrossRef]
  4. Chen, H.; Li, Q.; Zhu, F. Binomial AR(1) processes with innovational outliers. Commun. Stat. Theory Methods 2021, 50, 446–472. [Google Scholar] [CrossRef]
  5. Kang, Y.; Wang, D.; Yang, K. Extended binomial AR(1) processes with generalized binomial thinning operator. Commun. Stat. Theory Methods 2020, 49, 3498–3520. [Google Scholar] [CrossRef]
  6. Chen, H.; Zhang, J.; Liu, X. A Conway-Maxwell-Poisson-Binomial AR(1) model for bounded time series data. Entropy 2023, 25, 126. [Google Scholar] [CrossRef]
  7. Shmueli, G.; Minka, T.P.; Kadane, J.B.; Borle, S.; Boatwright, P. A useful distribution for fitting discrete data: Revival of the Conway-Maxwell-Poisson distribution. Appl. Stat. 2005, 54, 127–142. [Google Scholar] [CrossRef]
  8. Zhang, R.; Wang, D.; Li, C. Flexible binomial AR(1) processes using copulas. J. Stat. Plan. Inference 2022, 219, 306–332. [Google Scholar] [CrossRef]
  9. Weiß, C.H.; Pollett, P.K. Binomial autoregressive processes with density-dependent thinning. J. Time Ser. Anal. 2014, 35, 115–132. [Google Scholar] [CrossRef]
  10. Ristić, M.M.; Weiß, C.H.; Janjić, A.D. A binomial integer-valued ARCH model. Int. J. Biostat. 2016, 12, 20150051. [Google Scholar] [CrossRef]
  11. Lee, Y.; Lee, S. CUSUM test for general nonlinear integer–valued GARCH models: Comparison study. Ann. Inst. Stat. Math. 2019, 71, 1033–1057. [Google Scholar] [CrossRef]
  12. Chen, H.; Li, Q.; Zhu, F. Two classes of dynamic binomial integer-valued ARCH models. Braz. J. Probab. Stat. 2020, 34, 685–711. [Google Scholar] [CrossRef]
  13. Chen, H.; Li, Q.; Zhu, F. A new class of integer-valued GARCH models for time series of bounded counts with extra-binomial variation. AStA Adv. Stat. Anal. 2022, 106, 243–270. [Google Scholar] [CrossRef]
  14. Liu, M.; Zhu, F.; Zhu, K. Modeling normalcy-dominant ordinal time series: An application to air quality level. J. Time Ser. Anal. 2022, 43, 460–478. [Google Scholar] [CrossRef]
  15. Liu, M.; Li, Q.; Zhu, F. Modeling air quality level with a flexible categorical autoregression. Stoch. Environ. Res. Risk Assess. 36, 2835–2845. [CrossRef]
  16. Weiß, C.H.; Jahn, M. Soft-clipping INGARCH models for time series of bounded Counts. Stat. Model. 2022. forthcoming. [Google Scholar] [CrossRef]
  17. Weiß, C.H.; Zhu, F.; Hoshiyar, A. Softplus INGARCH models. Stat. Sin. 2022, 32, 1099–1120. [Google Scholar] [CrossRef]
  18. Klimek, M.D.; Perelstein, M. Neural network-based approach to phase space integration. SciPost Phys. 2020, 9, 053. [Google Scholar] [CrossRef]
  19. Turner, R. A new versatile discrete distribution. R J. 2021, 13, 485–506. [Google Scholar] [CrossRef]
  20. Davis, R.A.; Liu, H. Theory and inference for a class of observation-driven models with application to time series of counts. Stat. Sin. 2016, 26, 1673–1707. [Google Scholar]
  21. Wang, Z. One mixed negative binomial distribution with application. J. Stat. Plan. Inference 2011, 141, 1153–1160. [Google Scholar] [CrossRef]
  22. Wu, W.; Shao, X. Limit theorems for iterated random functions. J. Appl. Probab. 2004, 41, 425–436. [Google Scholar] [CrossRef]
  23. Weiß, C.H. An Introduction to Discrete-Valued Time Series; John Wiley & Sons: Chichester, UK, 2018. [Google Scholar]
  24. Chen, H.; Li, Q.; Zhu, F. A covariate-driven beta-binomial integer-valued GARCH model for bounded counts with an application. Metrika 2023. forthcoming. [Google Scholar] [CrossRef]
  25. Agosto, A.; Cavaliere, G.; Kristensen, D.; Rahbek, A. Modeling corporate defaults: Poisson autoregressions with exogenous covariates (PARX). J. Empir. Financ. 2016, 38, 640–663. [Google Scholar] [CrossRef]
  26. Chen, C.W.S.; Khamthong, K. Bayesian modelling of nonlinear negative binomial integer-valued GARCHX models. Stat. Model. 2020, 20, 537–561. [Google Scholar] [CrossRef]
  27. Aknouche, A.; Bentarzi, W.; Demouche, N. On periodic ergodicity of a general periodic mixed Poisson autoregression. Stat. Probab. Lett. 2018, 134, 15–21. [Google Scholar] [CrossRef]
  28. Liu, T.; Yuan, X. Random rounded integer-valued autoregressive conditional heteroskedastic process. Stat. Pap. 2013, 54, 645–683. [Google Scholar] [CrossRef]
  29. Alomani, G.A.; Alzaid, A.A.; Omair, M.A. A Skellam INGARCH model. Braz. J. Probab. Stat. 2018, 32, 200–214. [Google Scholar] [CrossRef]
  30. Carallo, G.; Casarin, R.; Robert, C.P. Generalized Poisson difference autoregressive processes. arXiv 2020, arXiv:2002.04470. [Google Scholar]
  31. Cui, Y.; Li, Q.; Zhu, F. Flexible bivariate Poisson integer-valued GARCH model. Ann. Inst. Stat. Math. 2020, 72, 1449–1477. [Google Scholar] [CrossRef]
  32. Gonçalves, E.; Mendes-Lopes, N. Signed compound Poisson integer-valued GARCH processes. Commun. Stat. Theory Methods 2020, 49, 5468–5492. [Google Scholar] [CrossRef]
  33. Xu, Y.; Zhu, F. A new GJR-GARCH model for Z-valued time series. J. Time Ser. Anal. 2022, 43, 490–500. [Google Scholar] [CrossRef]
  34. Bulla, J.; Chesneau, C.; Kachour, M. A bivariate first-order signed integer-valued autoregressive process. Commun. Stat. Theory Methods 2017, 46, 6590–6604. [Google Scholar] [CrossRef]
  35. Chen, H.; Zhu, F.; Liu, X. Two-step conditional least squares estimation for the bivariate Z-valued INAR(1) model with bivariate Skellam innovations. Commun. Stat. Theory Methods 2023. forthcoming. [Google Scholar] [CrossRef]
Figure 1. Path of the measles infection counts.
Figure 1. Path of the measles infection counts.
Stats 06 00018 g001
Figure 2. Plots of the soft-clipping function.
Figure 2. Plots of the soft-clipping function.
Stats 06 00018 g002
Figure 3. Plots of the BID when n bot = 0 .
Figure 3. Plots of the BID when n bot = 0 .
Stats 06 00018 g003
Figure 4. Plots of the BID when n bot = 1 .
Figure 4. Plots of the BID when n bot = 1 .
Stats 06 00018 g004
Figure 5. Plots of attainable pairs of ACF(2) against ACF(1) for n top = 10 with c = 0.01.
Figure 5. Plots of attainable pairs of ACF(2) against ACF(1) for n top = 10 with c = 0.01.
Stats 06 00018 g005
Figure 6. Plots of attainable pairs of ACF(2) against ACF(1) for n top = 2 with c = 0.01.
Figure 6. Plots of attainable pairs of ACF(2) against ACF(1) for n top = 2 with c = 0.01.
Stats 06 00018 g006
Figure 7. Plots of BID for n top = 10 with c = 0.01 .
Figure 7. Plots of BID for n top = 10 with c = 0.01 .
Stats 06 00018 g007
Figure 8. Plots of BID for n top = 2 with c = 0.01 .
Figure 8. Plots of BID for n top = 2 with c = 0.01 .
Stats 06 00018 g008
Figure 9. Measles infection’s counts: (a) ACF, (b) PACF.
Figure 9. Measles infection’s counts: (a) ACF, (b) PACF.
Stats 06 00018 g009
Figure 10. Pearson residual analysis: (a) ACF, (b) PACF.
Figure 10. Pearson residual analysis: (a) ACF, (b) PACF.
Stats 06 00018 g010
Table 1. Estimates and SEs in parentheses for the measles infection counts.
Table 1. Estimates and SEs in parentheses for the measles infection counts.
ModelEstimates−log-likAICBIC
BARCH(1) a ^ 0 a ^ 1 212.6574429.3148434.6036
0.03670.6844
(0.0071)(0.0651)
BARCH(2) a ^ 0 a ^ 1 a ^ 2 204.8729415.7457423.6789
0.02700.40560.3669
(0.0072)(0.0991)(0.1005)
logit-BARCH(1) a ^ 0 a ^ 1 215.9542415.7457423.5789
−2.82480.1608
(0.1002)(0.0161)
logit-BARCH(2) a ^ 0 a ^ 1 a ^ 2 207.5645421.1290429.0622
−2.94730.10420.0827
(0.1087)(0.0221)(0.0220)
score-BARCH(1) α ^ 0 α ^ 1 α ^ 2 213.1136432.2272440.1604
−0.61780.67770.1192
(0.1525)(0.0751)(0.0160)
BGARCH(1,1) a ^ 0 a ^ 1 a ^ 2 212.4199430.8399438.7730
0.03320.01750.6923
(0.0087)(0.0263)(0.0675)
ScBGARCH(1,1) w ^ α ^ 1 β ^ 1 204.9207415.8414423.7746
0.22090.51230.4292
(0.2235)(0.1002)(0.0836)
ScDBGARCH(1,1) w ^ α ^ 1 β ^ 1 ϕ ^ 203.5400415.0799425.575
0.21300.49260.45060.0196
(0.2297)(0.0963)(0.0832)(0.0028)
ScBBGARCH(1,1) w ^ α ^ 1 β ^ 1 ϕ ^ 211.9121431.8242442.4017
0.31880.49460.44010.0202
(0.3267)(0.1333)(0.1116)(0.0174)
logit-BBGARCH(1,1) w ^ α ^ 1 β ^ 1 ϕ ^ 208.9151425.8302436.4078
−1.72030.42880.11370.0020
(0.2657)(0.0933)(0.0176)(0.0040)
Table 2. Values of the Ljung–Box test for the measles data.
Table 2. Values of the Ljung–Box test for the measles data.
Lag k3579111315
p-value0.77360.95190.96420.99160.99500.98000.9934
χ 0.95 2 ( k ) 7.814711.070514.067116.919019.675122.362024.9958
Ljung–Box statistic1.11441.12451.91911.98952.60834.76364.8430
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, H. A New Soft-Clipping Discrete Beta GARCH Model and Its Application on Measles Infection. Stats 2023, 6, 293-311. https://doi.org/10.3390/stats6010018

AMA Style

Chen H. A New Soft-Clipping Discrete Beta GARCH Model and Its Application on Measles Infection. Stats. 2023; 6(1):293-311. https://doi.org/10.3390/stats6010018

Chicago/Turabian Style

Chen, Huaping. 2023. "A New Soft-Clipping Discrete Beta GARCH Model and Its Application on Measles Infection" Stats 6, no. 1: 293-311. https://doi.org/10.3390/stats6010018

APA Style

Chen, H. (2023). A New Soft-Clipping Discrete Beta GARCH Model and Its Application on Measles Infection. Stats, 6(1), 293-311. https://doi.org/10.3390/stats6010018

Article Metrics

Back to TopTop