Next Article in Journal
Combining Forecasts of Time Series with Complex Seasonality Using LSTM-Based Meta-Learning
Previous Article in Journal
Downscaling Fusion Model for CMIP5 Rainfall Projection under RCP Scenarios: The Case of Trentino-Alto Adige
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

A Novel Unconstrained Geometric BINAR(1) Model †

by
Sunecher Yuvraj
1,* and
Mamode Khan Naushad
2
1
Department of Accounting, Finance and Economics, Faculty of Business Management and Finance, University of Technology Mauritius, Port-Louis 11108, Mauritius
2
Department of Economics and Statistics, University of Mauritius, Reduit 80834, Mauritius
*
Author to whom correspondence should be addressed.
Presented at the 9th International Conference on Time Series and Forecasting, Gran Canaria, Spain, 12–14 July 2023.
Eng. Proc. 2023, 39(1), 52; https://doi.org/10.3390/engproc2023039052
Published: 5 July 2023
(This article belongs to the Proceedings of The 9th International Conference on Time Series and Forecasting)

Abstract

:
Modelling the non-stationary unconstrained bivariate integer-valued autoregressive of order 1 (NSUBINAR(1)) model is challenging due to the complex cross-correlation relationship between the counting series. Hence, this paper introduces a novel non-stationary unconstrained BINAR(1) with geometric marginals (NSUBINAR(1)GEOM) based on the assumption that the counting series are both influenced by the same time-dependent explanatory variables. The generalized quasi-likelihood (GQL) estimation method is used to estimate the regression and dependence parameters. Monte Carlo simulations and an application to a real-life accident series data are presented.

1. Introduction

In the literature, several researchers have developed first-order bivariate integer-valued autoregressive (BINAR(1)) models to analyse bivariate time series of counts. Originally, Pedeli and Karlis [1,2] developed two constrained BINAR(1) models with Poisson (CBINAR(1)P) and negative binomial (NB) (CBINAR(1)NB) innovations by extending the classical INAR(1) model of McKenzie [3] based on the binomial thinning mechanism [4]. These two models were developed under stationary moment assumptions only and the cross-correlation between the bivariate series was induced by the correlated Poisson and NB innovations, hence implying a constrained relationship. By the same token, Pedeli and Karlis [5] extended the CBINAR(1)P model to an unconstrained BINAR(1) model with Poisson innovations (UBINAR(1)P) under the same condition of stationarity. In this latter model, the cross-correlation between the series was induced by the correlated Poisson innovation terms and the relationship between the observations of each counting series with previous-lagged observations of the other series.
Likewise, Ristic, Nastic, Jayakumar and Bakouch [6] and Nastic, Ristic and Popovic [7] developed a stationary UBINAR(1) model with geometric marginals (UBINAR(1)GEOM) with independent mixed geometric innovations. Hence, the cross-correlation relationship was induced only by the relation of the current observations with previous-lagged observations of the other series via the negative binomial (NB) thinning operator. Interestingly, Nastic et al. [7] showed in their paper that the UBINAR(1)GEOM yields better AICs than the above over-dispersed BINAR(1) models. However, it is worth mentioning that the UBINAR(1)GEOM model was developed only for stationary time series and hence, cannot be used to analyse non-stationary real-life over-dispersed series.
As for non-stationary time series of counts, few BINAR(1) models have been developed. Mamodekhan, Sunecher and Jowaheer [8] developed a CBINAR(1) model with Poisson innovations under non-stationarity assumption (NSCBINAR(1)P) induced by time-dependent explanatory variables. In a similar context, Sunecher, Mamodekhan and Jowaheer [9] developed a non-stationary CBINAR(1) model with NB innovations for over-dispersed time series of counts (NSCBINAR(1)NB). However, an unconstrained BINAR(1) model under the non-stationarity context that can model over-dispersed counting series has not yet been developed in the literature. Hence, in this paper we propose to develop a non-stationary unconstrained BINAR(1) model with geometric innovations (NSUBINAR(1)GEOM) similar to Pedeli and Karlis [5]. However, such model development poses some computational challenges in estimating the unknown model parameters as it is rather difficult to specify the joint generating function [10].
In the paper by Pedeli and Karlis [2,5], the conditional maximum likelihood (CML) approach was compared with the method of moments (MoM), where the authors concluded that CML yields far better estimates than MoM, but at huge computational costs. A similar conclusion was drawn by Nastic et al. [7] who used the least-square (LS) technique, as an alternative to CML, which omits the likelihood function. In the same way, due to the computational challenges of the CML, Mamodekhan et al. [8] and Sunecher et al. [9] developed the generalized quasi-likelihood (GQL) approach in the non-stationary bivariate context. Mamodekhan et al. [8] compared the GQL with CML, where it was shown that GQL yields asymptotically equally efficient estimates as CML. Hence, based on the above findings, the unknown parameters of the NSUBINAR(1)GEOM will be estimated using the GQL.
The organization of the paper is as follows: In the next section, the NSUBINAR(1) model is developed. In Section 3, the GQL approach is developed under the non-stationarity bivariate context. Section 4 focuses on the simulation part where BINAR(1) data with geometric marginals are generated and the GQL approach is used to estimate the model parameters. In Section 5, the model is applied to the accident data in Mauritius. The conclusion is provided in the final section.

2. The Non-Stationary Unconstrained BINAR(1) with Geometric Marginals (NSUBINAR(1)GEOM)

The UBINAR(1) model is specified as:
Y t [ 1 ] = ρ 11 Y t 1 [ 1 ] + ρ 12 Y t 1 [ 2 ] + R t [ 1 ]
Y t [ 2 ] = ρ 21 Y t 1 [ 1 ] + ρ 22 Y t 1 [ 2 ] + R t [ 2 ]
based on the following assumptions:
(a)
Y t [ k ] is geometric such that Y t [ k ] Geom ( μ t [ k ] 1 + μ t [ k ] ) . Hence, E( Y t [ k ] ) = μ t [ k ] and Var( Y t [ k ] ) = μ t [ k ] ( 1 + μ t [ k ] ) , where μ t [ k ] = exp ( x t β [ k ] ) with x t = [ x t 1 , x t 2 , , x t j , , x t p ] is a ( p × 1 ) vector of covariates influencing both Y t [ 1 ] and Y t [ 2 ] , with corresponding regression coefficients β [ k ] = [ β 1 [ k ] , β 2 [ k ] , , β j [ k ] , , β p [ k ] ] for t = 1 , 2 , , T and k { 1 , 2 } .
(b)
∗ is the binomial thinning operator [4] such that ρ i j Y t 1 [ j ] = m = 1 Y t 1 [ j ] Z m with Z m Geom ( ρ i j 1 + ρ i j ) . Hence, E ( ρ i j Y t 1 [ j ] ) = ρ i j E ( Y t 1 [ j ] ) and Var ( ρ i j Y t 1 [ j ] ) = ρ i j ( 1 + ρ i j ) E ( Y t 1 [ j ] ) + ρ i j 2 Var ( Y t 1 [ j ] ) .
(c)
Corr ( R t [ 1 ] , R t [ 2 ] ) = κ 12 , t t = t , 0 t t .
E ( Y t [ 1 ] ) = E ( ρ 11 Y t 1 [ 1 ] ) + E ( ρ 12 Y t 1 [ 2 ] ) + E ( R t [ 1 ] ) μ t [ 1 ] = ρ 11 μ t 1 [ 1 ] + ρ 12 μ t 1 [ 2 ] + E ( R t [ 1 ] ) .
Re-arranging Equation (3), we have
E ( R t [ 1 ] ) = λ t [ 1 ] = μ t [ 1 ] ρ 11 μ t 1 [ 1 ] ρ 12 μ t 1 [ 2 ] .
Similarly,
E ( R t [ 2 ] ) = λ t [ 2 ] = μ t [ 2 ] ρ 21 μ t 1 [ 1 ] ρ 22 μ t 1 [ 2 ] .
Var ( Y t [ 1 ] ) = Var ( ρ 11 Y t 1 [ 1 ] + ρ 12 Y t 1 [ 2 ] + R t [ 1 ] ) Var ( Y t [ 1 ] ) = ρ 11 ( 1 + ρ 11 ) E ( Y t 1 [ 1 ] ) + ρ 11 2 Var ( Y t 1 [ 1 ] ) + ρ 12 ( 1 + ρ 12 ) E ( Y t 1 [ 2 ] ) + ρ 12 2 Var ( Y t 1 [ 2 ] ) + 2 ρ 11 ρ 12 Cov ( Y t 1 [ 1 ] , Y t 1 [ 2 ] ) + Var ( R t [ 1 ] ) μ t [ 1 ] + μ t [ 1 ] 2 = ρ 11 ( 1 + ρ 11 ) μ t 1 [ 1 ] + ρ 11 2 ( μ t 1 [ 1 ] + μ t 1 [ 1 ] 2 ) + ρ 12 ( 1 + ρ 12 ) μ t 1 [ 2 ] + ρ 12 2 ( μ t 1 [ 2 ] + μ t 1 [ 2 ] 2 ) + 2 ρ 11 ρ 12 Cov ( Y t 1 [ 1 ] , Y t 1 [ 2 ] ) + Var ( R t [ 1 ] )
Re-arranging Equation (6), we have
Var ( R t [ 1 ] ) = μ t [ 1 ] ( 1 + μ t [ 1 ] ) ρ 11 ( 1 + ρ 11 ) μ t 1 [ 1 ] ρ 11 2 μ t 1 [ 1 ] ( 1 + μ t 1 [ 1 ] ) ρ 12 ( 1 + ρ 12 ) μ t 1 [ 2 ] ρ 12 2 μ t 1 [ 2 ] ( 1 + μ t 1 [ 2 ] ) 2 ρ 11 ρ 12 Cov ( Y t 1 [ 1 ] , Y t 1 [ 2 ] )
and similarly,
Var ( R t [ 2 ] ) = μ t [ 2 ] ( 1 + μ t [ 2 ] ) ρ 21 ( 1 + ρ 21 ) μ t 1 [ 1 ] ρ 21 2 μ t 1 [ 1 ] ( 1 + μ t 1 [ 1 ] ) ρ 22 ( 1 + ρ 22 ) μ t 1 [ 2 ] ρ 22 2 μ t 1 [ 2 ] ( 1 + μ t 1 [ 2 ] ) 2 ρ 21 ρ 22 Cov ( Y t 1 [ 1 ] , Y t 1 [ 2 ] ) ,
The above moments clearly indicate that the marginal distribution of R t [ k ] is rather complex to derive. To facilitate the derivation of the cross-covariances, we write Equations (1) and (2) in vector form as follows:
Y t = A Y t 1 + R t
with Y t = [ Y t [ 1 ] , Y t [ 2 ] ] , R t = [ R t [ 1 ] , R t [ 2 ] ] , A = ρ 11 ρ 12 ρ 21 ρ 22 .
Assuming Σ h , t = Cov ( Y t [ 1 ] , Y t + h [ 1 ] ) Cov ( Y t + h [ 1 ] , Y t [ 2 ] ) Cov ( Y t [ 1 ] , Y t + h [ 2 ] ) Cov ( Y t [ 2 ] , Y t + h [ 2 ] ) , from Pedeli and Karlis [5] and Ristic et al. [6], it was shown that Σ h , t = A h Σ 0 , t and hence
Cov ( Y t [ 1 ] , Y t [ 2 ] ) = Cov ( ρ 11 Y t 1 [ 1 ] + ρ 12 Y t 1 [ 2 ] + R t [ 1 ] , ρ 21 Y t 1 [ 1 ] + ρ 22 Y t 1 [ 2 ] + R t [ 2 ] ) = ( ρ 11 ρ 22 + ρ 12 ρ 21 ) Cov ( Y t 1 [ 1 ] , Y t 1 [ 2 ] ) + ρ 11 ρ 21 ( μ t 1 [ 1 ] + μ t 1 [ 1 ] 2 ) + ρ 22 ρ 12 ( μ t 1 [ 2 ] + μ t 1 [ 2 ] 2 ) + [ κ 12 , t Var ( R t [ 1 ] ) Var ( R t [ 2 ] ) ] .
Note that if α = ρ 11 = ρ 12 , γ = ρ 21 = ρ 22 , μ = μ t 1 [ 1 ] = μ t 1 [ 2 ] and κ 12 , t = 0 , Equation (10) simply reduces to Cov ( Y t [ 1 ] , Y t [ 2 ] ) = 2 α γ 1 2 α γ μ ( 1 + μ ) , which is the same as in Ristic et al. [6].

3. Estimation Method

The GQL equation to estimate the regression parameters is specified as:
D β Σ β 1 ( f μ ) = 0
with score vector f = [ f 1 , f 2 , , f t , , f t + h , , f T ] with f t = [ Y t [ 1 ] , Y t [ 2 ] ] and μ = [ μ 1 , μ 2 , , μ t , , μ T ] with corresponding mean μ t = [ μ t [ 1 ] , μ t [ 2 ] ] for t = 1 , 2 , , T .
The covariance matrix Σ β is a ( 2 T × 2 T ) . The derivative matrix D β is denoted by D β = [ D 1 , D 2 , , D t , , D T ] with
D t = μ t [ 1 ] β [ 1 ] 0 0 μ t [ 2 ] β [ 2 ] 2 p × 2
where μ t [ k ] β j [ k ] = μ t [ k ] x t j .
The Newton–Raphson iterative technique is used to estimate the regression parameters as follows:
β ^ r + 1 [ 1 ] β ^ r + 1 [ 2 ] = β ^ r [ 1 ] β ^ r [ 2 ] + [ D β Σ β 1 D β ] r 1 [ D β Σ β 1 ( f μ ) ] r
where β ^ r [ k ] are the estimates at the rth iteration and [ . ] r are the values of the expression at the rth iteration.
For an initial value of [ ρ ^ 11 , ρ ^ 12 , ρ ^ 21 , ρ ^ 22 , κ ^ 12 , t , β ] , we solve the iterative Equation (12) until convergence. These estimates are consistent and under mild regulatory conditions, ( β ^ β ) is asymptotically normal with a mean of 0 and a covariance matrix of [ D β Σ β 1 D β ] 1 as shown in [8,9,11].
A second GQL is specified to estimate the dependence parameter ψ = [ ρ 11 , ρ 12 , ρ 21 , ρ 22 ] as follows:
D ψ Σ ψ 1 ( Y ψ μ ψ ) = 0 ,
with Y ψ = [ Y 1 [ 1 ] Y 1 [ 2 ] | Y 0 [ 1 ] , Y 0 [ 2 ] , Y 2 [ 1 ] Y 2 [ 2 ] | Y 1 [ 1 ] , Y 1 [ 2 ] , , Y t [ 1 ] Y t [ 2 ] | Y t 1 [ 1 ] , Y t 1 [ 2 ] , , Y T [ 1 ] Y T [ 2 ] | Y T 1 [ 1 ] , Y T 1 [ 2 ] ] T × 1 and μ ψ = E ( Y ψ ) .
The ( T × T ) covariance matrix Σ ψ comprises of Var ( Y t [ 1 ] Y t [ 2 ] Y t 1 [ 1 ] , Y t 1 [ 2 ] ) along the diagonal and Cov ( Y t [ 1 ] Y t [ 2 ] Y t + h [ 1 ] Y t + h [ 2 ] Y t 1 [ 1 ] , Y t 1 [ 2 ] , Y t + h 1 [ 1 ] , Y t + h 1 [ 2 ] ) in the off-diagonal entries. All the entries are of higher-order moments and hence, the ’working’ multivariate normality assumption structure is used to compute these entries as in [9,12].
As for the ( T × 4 ) derivative matrix D ψ ,
E ( Y t [ 1 ] Y t [ 2 ] Y t 1 [ 1 ] , Y t 1 [ 2 ] )
= [ κ 12 , t [ μ t [ 1 ] ( 1 + μ t [ 1 ] ) ρ 11 ( 1 + ρ 11 ) μ t 1 [ 1 ] ρ 11 2 μ t 1 [ 1 ] ( 1 + μ t 1 [ 1 ] ) ρ 12 ( 1 + ρ 12 ) μ t 1 [ 2 ] ρ 12 2 μ t 1 [ 2 ] ( 1 + μ t 1 [ 2 ] ) ] 1 2 [ μ t [ 2 ] ( 1 + μ t [ 2 ] ) ρ 21 ( 1 + ρ 21 ) μ t 1 [ 1 ] ρ 21 2 μ t 1 [ 1 ] ( 1 + μ t 1 [ 1 ] ) ρ 22 ( 1 + ρ 22 ) μ t 1 [ 2 ] ρ 22 2 μ t 1 [ 2 ] ( 1 + μ t 1 [ 2 ] ) ] 1 2 + ( ρ 11 Y t 1 [ 1 ] + ρ 12 Y t 1 [ 2 ] + μ t [ 1 ] ρ 11 μ t 1 [ 1 ] ρ 12 μ t 1 [ 2 ] ) × ( ρ 21 Y t 1 [ 1 ] + ρ 22 Y t 1 [ 2 ] + μ t [ 2 ] ρ 21 μ t 1 [ 1 ] ρ 22 μ t 1 [ 2 ] ) ]
As for the estimates of κ 12 , t , they are estimated using the method of moments as follows:
κ 12 , t = Cov ˜ ( Y t [ 1 ] , Y t [ 2 ] ) ( ρ ^ 11 ρ ^ 22 + ρ ^ 12 ρ ^ 21 ) Cov ˜ ( Y t 1 [ 1 ] , Y t 1 [ 2 ] ) ρ ^ 11 ρ ^ 21 ( μ ^ t 1 [ 1 ] + μ t 1 [ 1 ] ^ 2 ) ρ ^ 22 ρ ^ 12 ( μ ^ t 1 [ 2 ] + μ t 1 [ 2 ] ^ 2 ) Var ( R t [ 1 ] ) Var ( R t [ 2 ] )
where μ ^ 0 [ k ] = μ ^ 1 [ k ] , Cov ˜ ( Y t [ 1 ] , Y t [ 2 ] ) = 1 T t = 1 T ( y t [ 1 ] μ ^ t [ 1 ] ) ( y t [ 2 ] μ ^ t [ 2 ] ) and
Cov ˜ ( Y t 1 [ 1 ] , Y t 1 [ 2 ] ) = 1 T 1 t = 2 T ( y t 1 [ 1 ] μ ^ t 1 [ 1 ] ) ( y t 1 [ 2 ] μ ^ t 1 [ 2 ] ) .
The Newton–Raphson iteration for the second GQL yields
ψ ^ r + 1 = ψ ^ r + [ D ψ Σ ^ ψ 1 D ψ ] r 1 [ D ψ Σ ^ ψ 1 ( Y ψ μ ψ ) ] r
where ψ ^ r are the estimates at the rth iteration and [ . ] r are the values of the expression at the rth iteration.
The regression estimates β ^ obtained from Equation (12) are used to solve Equation (16) until convergence. These updated values of ψ ^ are in turn replaced in Equation (12) to obtain a new set of regression parameters, which are again used to obtain a new set of dependence parameters and this cycle continues until convergence of the two sets of parameters. These estimates are consistent and under mild regulatory conditions, ( ψ ^ ψ ) is asymptotically normal with a mean of 0 and a covariance matrix [ D ψ Σ ψ 1 D ψ ] 1 .

Forecasting Equations

The forecasting equations are derived as follows:
Given Y t [ k ] , the forecasting function is expressed as
E ( Y t + 1 [ 1 ] | Y t [ 1 ] , Y t [ 2 ] ) = μ ^ t + 1 [ 1 ] + ρ ^ 11 ( Y t [ 1 ] μ ^ t [ 1 ] ) + ρ ^ 12 ( Y t [ 2 ] μ ^ t [ 2 ] ) ,
E ( Y t + 1 [ 2 ] | Y t [ 1 ] , Y t [ 2 ] ) = μ ^ t + 1 [ 2 ] + ρ ^ 21 ( Y t [ 1 ] μ ^ t [ 1 ] ) + ρ ^ 22 ( Y t [ 2 ] μ ^ t [ 2 ] )

4. Simulation Study

In this section we generate BINAR(1) time series data with geometric marginals under the following time-varying covariate design:
x t 1 = cos ( 2 π t ) + 0.01 ( t = 1 , , T / 4 ) sin ( 2 π t ) + 0.05 ( t = ( T / 4 ) + 1 , , 3 T / 4 ) cos ( 2 π t ) + 0.10 ( t = ( 3 T / 4 ) + 1 , , T )
x t 2 = ( 1 / t ) ( t = 1 , , T / 4 ) ( 1 / t ) ( t = ( T / 4 ) + 1 , , 3 T / 4 ) t ( t = ( 3 T / 4 ) + 1 , , T )
where μ t [ k ] = exp ( x t 1 β 1 [ k ] + x t 2 β 2 [ k ] ) . Assuming [ ρ 12 , ρ 21 ] = [ 0.5 , 0.5 ] , [ ρ 11 , ρ 22 ] = [ 0.9 , 0.9 ] , [ 0.3 , 0.9 ] , [ 0.3 , 0.3 ] , β [ 1 ] = 0.5 and β [ 2 ] = 0.9 for t = 1 , 2 , , T = 100 , 500 , 1000 , we generate R t [ k ] using the inverse transformation method as in [13]. A total of 5000 Monte Carlo replications are made under the above combinations and the simulated mean estimates are shown below:
From Table 1, we observe that the GQL estimates are consistent and that the cross-correlation parameter κ 12 , 1 is close to unity. In addition, as the time points increase, we notice a decrease in the standard errors, with GQL yielding low standard errors as also demonstrated in [14]. Some details on the number of non-convergent simulations include: for ρ 11 = ρ 22 = 0.9 under GQL, around 360 simulations failed for T = 100 , 300 for T = 500 and 220 for T = 1000 . For ρ 11 = 0.3 and ρ 22 = 0.9 , around 340 GQL simulations failed for T = 100 , 275 for T = 500 and 190 for T = 1000 . However, when ρ 11 = ρ 22 = 0.3 , the GQL algorithms failed in 315 simulations for T = 100 , 215 for T = 500 and 170 for T = 1000 . The failures were mainly due to either an ill-conditioned covariance matrix or the Hessian structure in Equation (12). Hence, it is concluded from this section that GQL yields far superior estimates than GLS and GMM, and constitutes of a slightly better non-convergent computational problem.

5. Analysing the Time Series of Day and Night Road Accidents in Mauritius

In this section we analyse the monthly day and night accident series data in Mauritius collected from January 2011 to January 2020 that connects the capital city of Port-Louis and the tourist zone Grand-Bay, Mauritius, totalling 109 bivariate time series data. With a sample cross-correlation of 0.3267, it is rationale to believe that there exists a cross-correlation between the two series since both sets of data were collected on the same route. The summary statistics illustrate that day and night accidents have means (variance) of 8.6422 (27.5652) and 4.2110 (13.1310), respectively. Given the significant over-dispersion, the NSUBINAR(1)GEOM in Section 2 is applied to analyse the time series. The covariates we consider are: number of speed cameras (SC) in this area, the number of police officers deployed on street patrol in the different police stations in this area (PO), the number of times the streets in the area have been re-maintained during the years (NS) and number of roundabouts (RA) from Port-Louis to Grand-Bay.
Table 2 and Table 3 show the regression estimates and the serial and dependence estimates of the in-sample accident data from January 2011 to August 2019, totalling 104 paired observations, while the out-sample data from September 2019 to January 2020 were used to validate the model. Note, the in-sample data were also analysed using the NSCBINAR(1)NB from [9] and the estimates were compared with the NSCBINAR(1)GEOM and NSUBINAR(1)GEOM.
From Table 2, using the NSUBINAR(1)GEOM, we can note an expected decrease in the number of day and night accidents by 9 and 5%, respectively, if there is an installation of an additional speed camera along the motorway M2. As more police patrols are re-enforced in the area, the number of accidents is expected to decrease by 9% during the day and 7% during the night. Similarly, better road maintenance contributes to a decrease in day accidents by 7% and 6% in night accidents. Roundabout construction must be carefully monitored as this factor leads to an expected increase in the number of day accidents by 6% and 8% during the night. In comparison with the NSCBINAR(1)NB, the signs estimated effects are the same as in NSUBINAR(1)GEOM with little fluctuation in the estimates and their corresponding standard errors, but far better than NSCBINAR(1)GEOM. Using the corresponding forecasting in Equations (17) and (18) and in [9], and based on the out-sample observations from September 2019 to January 2020, Table 4 displays the RMSEs and mean absolute deviation (MAD) under the NSCBINAR(1)NB, NSCBINAR(1)GEOM and NSUBINAR(1)GEOM.
The standard errors, RMSE and MAD, illustrate that NSUBINAR(1)GEOM yields almost the same measures as NSCBINAR(1)NB, but better estimates than the NSCBINAR(1)GEOM.

6. Conclusions

In this paper, the unconstrained non-stationary BINAR(1) model with geometric marginals was modelled. However, during the development of this model, it is observed that the joint probability function of the innovation series is rather complex to derive and this limits the construction of a conditional likelihood function to estimate the unknown model parameters. Hence, this paper proposes an alternative GQL estimation approach that only requires the correct specification of the score and moment vectors. As for the derivation of the higher-order entries of the auto-covariance matrix, the multivariate normality assumption was used. With regard to the simulation study, it is shown that the GQL approach provides consistent parameter estimates and statistically more efficient estimates. In the analysis of the accident data, reliable estimates of the different covariates were obtained and comparable to NSCBINAR(1)NB and NSCBINAR(1)GEOM. The RMSE and MAD show that NSUBINAR(1)GEOM yields better forecasts than the other two competing models, a similar conclusion illustrated in [7]. This model is commendable to analyse over-dispersed series under non-stationary setups characterized by time-dependent effects.

Author Contributions

Conceptualization, S.Y. and M.K.N.; methodology, S.Y.; software, M.K.N.; validation, S.Y. and M.K.N.; formal analysis, S.Y.; investigation, S.Y.; resources, M.K.N.; data curation, M.K.N.; writing—original draft preparation, S.Y.; writing—review and editing, M.K.N.; visualization, S.Y. and M.K.N.; supervision, M.K.N.; project administration, S.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Pedeli, X.; Karlis, D. Bivariate INAR(1) Models; Technical Report; Athens University of Economics: Athens, Greece, 2009. [Google Scholar]
  2. Pedeli, X.; Karlis, D. A bivariate INAR(1) process with application. Stat. Model. Int. J. 2011, 11, 325–349. [Google Scholar] [CrossRef]
  3. McKenzie, E. Autoregressive moving-average processes with Negative Binomial and geometric marginal distributions. Adv. Appl. Probab. 1986, 18, 679–705. [Google Scholar] [CrossRef]
  4. Steutel, F.; Van Harn, K. Discrete analogues of self-decomposability and stability. Ann. Probab. 1979, 7, 3893–3899. [Google Scholar] [CrossRef]
  5. Pedeli, X.; Karlis, D. Some properties of multivariate INAR(1) processes. Comput. Stat. Data Anal. 2013, 67, 213–225. [Google Scholar] [CrossRef]
  6. Ristic, M.; Nastic, A.; Jayakumar, K.; Bakouch, H. A bivariate INAR(1) time series model with geometric marginals. Appl. Math. Lett. 2012, 25, 481–485. [Google Scholar] [CrossRef] [Green Version]
  7. Nastic, A.; Ristic, M.; Popovic, P. Estimation in a bivariate integer-valued autoregressive process. Commun.-Stat.-Theory Methods 2016, 45, 5660–5678. [Google Scholar] [CrossRef]
  8. Mamode Khan, N.; Sunecher, Y.; Jowaheer, V. Modelling a Non-Stationary BINAR(1) Poisson Process. J. Stat. Comput. Simul. 2016, 86, 3106–3126. [Google Scholar] [CrossRef]
  9. Sunecher, Y.; Mamodekhan, N.; Jowaheer, V. A GQL estimation approach for analysing non-stationary over-dispersed BINAR(1) time series. J. Stat. Comput. Simul. 2017, 87, 1911–1924. [Google Scholar] [CrossRef]
  10. Nastic, A.; Laketa, P.; Ristic, M. Random environment integer-valued autoregressive process. J. Time Ser. Anal. 2016, 37, 267–287. [Google Scholar] [CrossRef]
  11. Sutradhar, B.; Jowaheer, V.; Rao, P. Remarks on asymptotic efficient estimation for regression effects in stationary and non-stationary models for panel count data. Braz. J. Probab. Stat. 2014, 28, 241–254. [Google Scholar] [CrossRef]
  12. Jowaheer, V.; Mamode Khan, N.; Sunecher, Y. A non-stationary BINAR(1) process with negative binomial innovations for modeling the number of goals in the first and second half: The case study of Arsenal Football Club. Commun.-Stat.-Case Stud. Data Anal. Appl. 2017, 2, 21–33. [Google Scholar] [CrossRef]
  13. Ristic, M.; Bakouch, H.; Nastic, A. A new geometric first-order integer-valued autoregressive (NGINAR(1)) process. J. Stat. Plan. Inference 2009, 136, 2218–2226. [Google Scholar] [CrossRef]
  14. Sunecher, Y.; Mamodekhan, N.; Jowaheer, V. Estimating the parameters of a BINMA Poisson model for a non-stationary bivariate time series. Commun. Stat. Simul. Comput. 2016, 46, 6803–6827. [Google Scholar] [CrossRef]
Table 1. GQL estimates of the parameters and standard errors under the non-stationary geometric BINAR(1) model.
Table 1. GQL estimates of the parameters and standard errors under the non-stationary geometric BINAR(1) model.
ρ 11 ρ 22 TMethods β ^ 1 [ 1 ] β ^ 2 [ 1 ] β ^ 1 [ 2 ] β ^ 2 [ 2 ] ρ ^ 11 ρ ^ 22 ρ ^ 12 ρ ^ 21 κ ^ 12 , 1
0.90.9100GQL0.48230.48700.88760.88400.88190.88470.48590.48470.9815
(0.0910)(0.0931)(0.0977)(0.0946)(0.1163)(0.1115)(0.1125)(0.1153)
500GQL0.49230.49140.89500.89170.89420.89200.49450.49400.9909
(0.0517)(0.0512)(0.0547)(0.0585)(0.0649)(0.0625)(0.0623)(0.0630)
1000GQL0.49910.49880.89600.89810.89840.89940.50040.50060.9979
(0.0122)(0.0171)(0.0194)(0.0146)(0.0271)(0.0224)(0.0209)(0.0237)
0.30.9100GQL0.48940.48910.88700.88400.28560.88680.48990.48880.9826
(0.0969)(0.0977)(0.0915)(0.0959)(0.1133)(0.1172)(0.1128)(0.1119)
500GQL0.49260.49270.89440.89590.29150.89280.49650.49400.9918
(0.0526)(0.0556)(0.0507)(0.0512)(0.0681)(0.0694)(0.0671)(0.0631)
1000GQL0.49950.49750.89690.89940.29880.89630.50080.50110.9995
(0.0137)(0.0175)(0.0111)(0.0141)(0.0213)(0.0293)(0.0251)(0.0221)
0.30.3100GQL0.48230.48700.88040.88960.28120.28340.48540.48350.9819
(0.0981)(0.0911)(0.0931)(0.0928)(0.1169)(0.1160)(0.1135)(0.1197)
500GQL0.49290.49420.89100.89350.29640.29310.49200.49150.9913
(0.0594)(0.0589)(0.0562)(0.0580)(0.0614)(0.0621)(0.0677)(0.0681)
1000GQL0.49560.49920.89870.89900.29880.29800.50040.50010.9966
(0.0152)(0.0125)(0.0135)(0.0133)(0.0241)(0.0208)(0.0219)(0.0211)
Table 2. Monthly day and night accidents: GQL estimates of the regression parameters.
Table 2. Monthly day and night accidents: GQL estimates of the regression parameters.
ModelTime SeriesInterceptNSSCPORA c ^
Day Accidents2.5353−0.0815−0.0942−0.09340.07601.8475
s.e(0.0633)(0.0396)(0.0456)(0.0397)(0.0289)(0.0948)
NSCBINAR(1)NBNight Accidents0.9272−0.0790−0.0671−0.08240.09430.8965
s.e(0.0742)(0.0245)(0.0213)(0.0329)(0.0330)(0.0980)
Day Accidents2.4445−0.0824−0.0952−0.09550.0668
s.e(0.0560)(0.0273)(0.0314)(0.0215)(0.0164)
NSUBINAR(1)GEOMNight Accidents0.9106−0.0714−0.0572−0.07470.0817
s.e(0.0710)(0.0180)(0.0157)(0.0260)(0.0205)
Day Accidents2.413−0.0929−0.0866−0.09610.0852
s.e(0.0958)(0.0353)(0.0385)(0.0419)(0.0342)
NSCBINAR(1)GEOMNight Accidents0.9623−0.0876−0.0894−0.07850.0899
s.e(0.0952)(0.0367)(0.0387)(0.0375)(0.0345)
Table 3. Monthly day and night accidents: GQL estimates of the dependence parameters.
Table 3. Monthly day and night accidents: GQL estimates of the dependence parameters.
ModelTime Series ρ ^ Serial ρ ^ Cross κ ^ 12 , 1
Day Accidents0.2620 0.0065
s.e
NSCBINAR(1)NBNight Accidents0.2941
s.e
Day Accidents0.27480.07280.0025
s.e(0.0346)(0.0234)
NSUBINAR(1)GEOMNight Accidents0.24380.0563
s.e(0.0388)(0.0191)
Day Accidents0.2145 0.0034
s.e
NSCBINAR(1)GEOMNight Accidents0.2442
s.e
Table 4. RMSE and MAD for the one step-ahead forecast for the number of monthly day and night accidents.
Table 4. RMSE and MAD for the one step-ahead forecast for the number of monthly day and night accidents.
ModelRMSERMSEMADMAD
Y t [ 1 ] Y t [ 2 ] Y t [ 1 ] Y t [ 2 ]
NSCBINAR(1)NB0.1320.1410.1090.120
NSUBINAR(1)GEOM0.1200.1290.0980.104
NSCBINAR(1)GEOM0.1960.1890.1550.144
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yuvraj, S.; Naushad, M.K. A Novel Unconstrained Geometric BINAR(1) Model. Eng. Proc. 2023, 39, 52. https://doi.org/10.3390/engproc2023039052

AMA Style

Yuvraj S, Naushad MK. A Novel Unconstrained Geometric BINAR(1) Model. Engineering Proceedings. 2023; 39(1):52. https://doi.org/10.3390/engproc2023039052

Chicago/Turabian Style

Yuvraj, Sunecher, and Mamode Khan Naushad. 2023. "A Novel Unconstrained Geometric BINAR(1) Model" Engineering Proceedings 39, no. 1: 52. https://doi.org/10.3390/engproc2023039052

Article Metrics

Back to TopTop