A Novel Unconstrained Geometric BINAR(1) Model

Sunecher Yuvraj; Mamode Khan Naushad

doi:10.3390/engproc2023039052

Abstract

Modelling the non-stationary unconstrained bivariate integer-valued autoregressive of order 1 (NSUBINAR(1)) model is challenging due to the complex cross-correlation relationship between the counting series. Hence, this paper introduces a novel non-stationary unconstrained BINAR(1) with geometric marginals (NSUBINAR(1)GEOM) based on the assumption that the counting series are both influenced by the same time-dependent explanatory variables. The generalized quasi-likelihood (GQL) estimation method is used to estimate the regression and dependence parameters. Monte Carlo simulations and an application to a real-life accident series data are presented.

Keywords:

non-stationary; unconstrained; BINAR(1); GQL; geometric

1. Introduction

In the literature, several researchers have developed first-order bivariate integer-valued autoregressive (BINAR(1)) models to analyse bivariate time series of counts. Originally, Pedeli and Karlis [1,2] developed two constrained BINAR(1) models with Poisson (CBINAR(1)P) and negative binomial (NB) (CBINAR(1)NB) innovations by extending the classical INAR(1) model of McKenzie [3] based on the binomial thinning mechanism [4]. These two models were developed under stationary moment assumptions only and the cross-correlation between the bivariate series was induced by the correlated Poisson and NB innovations, hence implying a constrained relationship. By the same token, Pedeli and Karlis [5] extended the CBINAR(1)P model to an unconstrained BINAR(1) model with Poisson innovations (UBINAR(1)P) under the same condition of stationarity. In this latter model, the cross-correlation between the series was induced by the correlated Poisson innovation terms and the relationship between the observations of each counting series with previous-lagged observations of the other series.

Likewise, Ristic, Nastic, Jayakumar and Bakouch [6] and Nastic, Ristic and Popovic [7] developed a stationary UBINAR(1) model with geometric marginals (UBINAR(1)GEOM) with independent mixed geometric innovations. Hence, the cross-correlation relationship was induced only by the relation of the current observations with previous-lagged observations of the other series via the negative binomial (NB) thinning operator. Interestingly, Nastic et al. [7] showed in their paper that the UBINAR(1)GEOM yields better AICs than the above over-dispersed BINAR(1) models. However, it is worth mentioning that the UBINAR(1)GEOM model was developed only for stationary time series and hence, cannot be used to analyse non-stationary real-life over-dispersed series.

As for non-stationary time series of counts, few BINAR(1) models have been developed. Mamodekhan, Sunecher and Jowaheer [8] developed a CBINAR(1) model with Poisson innovations under non-stationarity assumption (NSCBINAR(1)P) induced by time-dependent explanatory variables. In a similar context, Sunecher, Mamodekhan and Jowaheer [9] developed a non-stationary CBINAR(1) model with NB innovations for over-dispersed time series of counts (NSCBINAR(1)NB). However, an unconstrained BINAR(1) model under the non-stationarity context that can model over-dispersed counting series has not yet been developed in the literature. Hence, in this paper we propose to develop a non-stationary unconstrained BINAR(1) model with geometric innovations (NSUBINAR(1)GEOM) similar to Pedeli and Karlis [5]. However, such model development poses some computational challenges in estimating the unknown model parameters as it is rather difficult to specify the joint generating function [10].

In the paper by Pedeli and Karlis [2,5], the conditional maximum likelihood (CML) approach was compared with the method of moments (MoM), where the authors concluded that CML yields far better estimates than MoM, but at huge computational costs. A similar conclusion was drawn by Nastic et al. [7] who used the least-square (LS) technique, as an alternative to CML, which omits the likelihood function. In the same way, due to the computational challenges of the CML, Mamodekhan et al. [8] and Sunecher et al. [9] developed the generalized quasi-likelihood (GQL) approach in the non-stationary bivariate context. Mamodekhan et al. [8] compared the GQL with CML, where it was shown that GQL yields asymptotically equally efficient estimates as CML. Hence, based on the above findings, the unknown parameters of the NSUBINAR(1)GEOM will be estimated using the GQL.

The organization of the paper is as follows: In the next section, the NSUBINAR(1) model is developed. In Section 3, the GQL approach is developed under the non-stationarity bivariate context. Section 4 focuses on the simulation part where BINAR(1) data with geometric marginals are generated and the GQL approach is used to estimate the model parameters. In Section 5, the model is applied to the accident data in Mauritius. The conclusion is provided in the final section.

2. The Non-Stationary Unconstrained BINAR(1) with Geometric Marginals (NSUBINAR(1)GEOM)

The UBINAR(1) model is specified as:

Y_{t}^{[1]} = ρ_{11} * Y_{t - 1}^{[1]} + ρ_{12} * Y_{t - 1}^{[2]} + R_{t}^{[1]}

(1)

Y_{t}^{[2]} = ρ_{21} * Y_{t - 1}^{[1]} + ρ_{22} * Y_{t - 1}^{[2]} + R_{t}^{[2]}

(2)

based on the following assumptions:

(a): $Y_{t}^{[k]}$ is geometric such that $Y_{t}^{[k]} \sim Geom (\frac{μ_{t}^{[k]}}{1 + μ_{t}^{[k]}})$ . Hence, E( $Y_{t}^{[k]}) = μ_{t}^{[k]}$ and Var( $Y_{t}^{[k]}) = μ_{t}^{[k]} (1 + μ_{t}^{[k]})$ , where $μ_{t}^{[k]} = exp (x_{t}^{^{'}} β^{[k]})$ with $x_{t} = {[{x_{t}}_{1}, {x_{t}}_{2}, \dots, {x_{t}}_{j}, \dots, {x_{t}}_{p}]}^{^{'}}$ is a $(p \times 1)$ vector of covariates influencing both $Y_{t}^{[1]}$ and $Y_{t}^{[2]}$ , with corresponding regression coefficients $β^{[k]} = {[β_{1}^{[k]}, β_{2}^{[k]}, \dots, β_{j}^{[k]}, \dots, β_{p}^{[k]}]}^{^{'}}$ for $t = 1, 2, \dots, T$ and $k \in {1, 2}$ .
(b): ∗ is the binomial thinning operator [4] such that $ρ_{i j} * Y_{t - 1}^{[j]} = \sum_{m = 1}^{Y_{t - 1}^{[j]}} Z_{m}$ with $Z_{m} \sim Geom (\frac{ρ_{i j}}{1 + ρ_{i j}})$ . Hence, $E (ρ_{i j} * Y_{t - 1}^{[j]}) = ρ_{i j} E (Y_{t - 1}^{[j]})$ and $Var (ρ_{i j} * Y_{t - 1}^{[j]}) =$ $ρ_{i j} (1 + ρ_{i j}) E (Y_{t - 1}^{[j]}) + ρ_{i j}^{2} Var (Y_{t - 1}^{[j]})$ .
(c): $Corr (R_{t}^{[1]}, R_{t^{^{'}}}^{[2]}) = \{\begin{matrix} κ_{12, t} & t = t^{^{'}}, \\ 0 & t \neq t^{^{'}} . \end{matrix}$

\begin{matrix} E (Y_{t}^{[1]}) & = E (ρ_{11} * Y_{t - 1}^{[1]}) + E (ρ_{12} * Y_{t - 1}^{[2]}) + E (R_{t}^{[1]}) \\ μ_{t}^{[1]} & = ρ_{11} μ_{t - 1}^{[1]} + ρ_{12} μ_{t - 1}^{[2]} + E (R_{t}^{[1]}) . \end{matrix}

(3)

Re-arranging Equation (3), we have

E (R_{t}^{[1]}) = λ_{t}^{[1]} = μ_{t}^{[1]} - ρ_{11} μ_{t - 1}^{[1]} - ρ_{12} μ_{t - 1}^{[2]} .

(4)

Similarly,

E (R_{t}^{[2]}) = λ_{t}^{[2]} = μ_{t}^{[2]} - ρ_{21} μ_{t - 1}^{[1]} - ρ_{22} μ_{t - 1}^{[2]} .

(5)

\begin{matrix} Var (Y_{t}^{[1]}) & = Var (ρ_{11} * Y_{t - 1}^{[1]} + ρ_{12} * Y_{t - 1}^{[2]} + R_{t}^{[1]}) \\ Var (Y_{t}^{[1]}) & = ρ_{11} (1 + ρ_{11}) E (Y_{t - 1}^{[1]}) + ρ_{11}^{2} Var (Y_{t - 1}^{[1]}) + ρ_{12} (1 + ρ_{12}) E (Y_{t - 1}^{[2]}) + ρ_{12}^{2} Var (Y_{t - 1}^{[2]}) \\ + 2 ρ_{11} ρ_{12} Cov (Y_{t - 1}^{[1]}, Y_{t - 1}^{[2]}) + Var (R_{t}^{[1]}) \\ μ_{t}^{[1]} + {μ_{t}^{[1]}}^{2} & = ρ_{11} (1 + ρ_{11}) μ_{t - 1}^{[1]} + ρ_{11}^{2} (μ_{t - 1}^{[1]} + {μ_{t - 1}^{[1]}}^{2}) + ρ_{12} (1 + ρ_{12}) μ_{t - 1}^{[2]} + ρ_{12}^{2} (μ_{t - 1}^{[2]} + {μ_{t - 1}^{[2]}}^{2}) \\ + 2 ρ_{11} ρ_{12} Cov (Y_{t - 1}^{[1]}, Y_{t - 1}^{[2]}) + Var (R_{t}^{[1]}) \end{matrix}

(6)

Re-arranging Equation (6), we have

\begin{matrix} Var (R_{t}^{[1]}) & = μ_{t}^{[1]} (1 + μ_{t}^{[1]}) - ρ_{11} (1 + ρ_{11}) μ_{t - 1}^{[1]} - ρ_{11}^{2} μ_{t - 1}^{[1]} (1 + μ_{t - 1}^{[1]}) - ρ_{12} (1 + ρ_{12}) μ_{t - 1}^{[2]} \\ - ρ_{12}^{2} μ_{t - 1}^{[2]} (1 + μ_{t - 1}^{[2]}) - 2 ρ_{11} ρ_{12} Cov (Y_{t - 1}^{[1]}, Y_{t - 1}^{[2]}) \end{matrix}

(7)

and similarly,

\begin{matrix} Var (R_{t}^{[2]}) & = μ_{t}^{[2]} (1 + μ_{t}^{[2]}) - ρ_{21} (1 + ρ_{21}) μ_{t - 1}^{[1]} - ρ_{21}^{2} μ_{t - 1}^{[1]} (1 + μ_{t - 1}^{[1]}) - ρ_{22} (1 + ρ_{22}) μ_{t - 1}^{[2]} \\ - ρ_{22}^{2} μ_{t - 1}^{[2]} (1 + μ_{t - 1}^{[2]}) - 2 ρ_{21} ρ_{22} Cov (Y_{t - 1}^{[1]}, Y_{t - 1}^{[2]}), \end{matrix}

(8)

The above moments clearly indicate that the marginal distribution of

R_{t}^{[k]}

is rather complex to derive. To facilitate the derivation of the cross-covariances, we write Equations (1) and (2) in vector form as follows:

Y_{t} = A * Y_{t - 1} + R_{t}

(9)

with

Y_{t} = {[Y_{t}^{[1]}, Y_{t}^{[2]}]}^{^{'}}

,

R_{t} = {[R_{t}^{[1]}, R_{t}^{[2]}]}^{^{'}}

,

A = (\begin{matrix} ρ_{11} & ρ_{12} \\ ρ_{21} & ρ_{22} \end{matrix})

.

Assuming

Σ_{h, t} = [\begin{matrix} Cov (Y_{t}^{[1]}, Y_{t + h}^{[1]}) & Cov (Y_{t + h}^{[1]}, Y_{t}^{[2]}) \\ Cov (Y_{t}^{[1]}, Y_{t + h}^{[2]}) & Cov (Y_{t}^{[2]}, Y_{t + h}^{[2]}) \end{matrix}]

, from Pedeli and Karlis [5] and Ristic et al. [6], it was shown that

Σ_{h, t} = A^{h} Σ_{0, t}

and hence

\begin{matrix} Cov (Y_{t}^{[1]}, Y_{t}^{[2]}) & = Cov (ρ_{11} * Y_{t - 1}^{[1]} + ρ_{12} * Y_{t - 1}^{[2]} + R_{t}^{[1]}, ρ_{21} * Y_{t - 1}^{[1]} + ρ_{22} * Y_{t - 1}^{[2]} + R_{t}^{[2]}) \\ = (ρ_{11} ρ_{22} + ρ_{12} ρ_{21}) Cov (Y_{t - 1}^{[1]}, Y_{t - 1}^{[2]}) + ρ_{11} ρ_{21} (μ_{t - 1}^{[1]} + {μ_{t - 1}^{[1]}}^{2}) \\ + ρ_{22} ρ_{12} (μ_{t - 1}^{[2]} + {μ_{t - 1}^{[2]}}^{2}) + [κ_{12, t} \sqrt{Var (R_{t}^{[1]})} \sqrt{Var (R_{t}^{[2]})}] . \end{matrix}

(10)

Note that if

α = ρ_{11} = ρ_{12}

,

γ = ρ_{21} = ρ_{22}

,

μ = μ_{t - 1}^{[1]} = μ_{t - 1}^{[2]}

and

κ_{12, t} = 0

, Equation (10) simply reduces to

Cov (Y_{t}^{[1]}, Y_{t}^{[2]}) = \frac{2 α γ}{1 - 2 α γ} μ (1 + μ)

, which is the same as in Ristic et al. [6].

3. Estimation Method

The GQL equation to estimate the regression parameters is specified as:

D_{β}^{^{'}} Σ_{β}^{- 1} (f - μ) = 0

(11)

with score vector

f = [f_{1}, f_{2}, \dots, f_{t}, \dots, f_{t + h}, \dots, f_{T}]

with

f_{t} = {[Y_{t}^{[1]}, Y_{t}^{[2]}]}^{^{'}}

and

μ = [μ_{1}, μ_{2}, \dots, μ_{t}, \dots, μ_{T}]

with corresponding mean

μ_{t} = {[μ_{t}^{[1]}, μ_{t}^{[2]}]}^{^{'}}

for

t = 1, 2, \dots, T

.

The covariance matrix

Σ_{β}

is a

(2 T \times 2 T)

. The derivative matrix

D_{β}

is denoted by

D_{β} = {[D_{1}, D_{2}, \dots, D_{t}, \dots, D_{T}]}^{'}

with

D_{t} = {(\begin{matrix} \frac{\partial μ_{t}^{[1]}}{\partial β^{[1]}} & 0 \\ 0 & \frac{\partial μ_{t}^{[2]}}{\partial β^{[2]}} \end{matrix})}_{2 p \times 2}

where

\frac{\partial μ_{t}^{[k]}}{\partial β_{j}^{[k]}} = μ_{t}^{[k]} x_{t j}^{'}

.

The Newton–Raphson iterative technique is used to estimate the regression parameters as follows:

(\begin{matrix} {\hat{β}}_{r + 1}^{[1]} \\ {\hat{β}}_{r + 1}^{[2]} \end{matrix}) = (\begin{matrix} {\hat{β}}_{r}^{[1]} \\ {\hat{β}}_{r}^{[2]} \end{matrix}) + {[D_{β}^{^{'}} Σ_{β}^{- 1} D_{β}]}_{r}^{- 1} {[D_{β}^{^{'}} Σ_{β}^{- 1} (f - μ)]}_{r}

(12)

where

{\hat{β}}_{r}^{[k]}

are the estimates at the rth iteration and

{[.]}_{r}

are the values of the expression at the rth iteration.

For an initial value of

[{\hat{ρ}}_{11}, {\hat{ρ}}_{12}, {\hat{ρ}}_{21}, {\hat{ρ}}_{22}

,

{\hat{κ}}_{12, t}, β]

, we solve the iterative Equation (12) until convergence. These estimates are consistent and under mild regulatory conditions,

{(\hat{β} - β)}^{'}

is asymptotically normal with a mean of 0 and a covariance matrix of

{[D_{β}^{^{'}} Σ_{β}^{- 1} D_{β}]}^{- 1}

as shown in [8,9,11].

A second GQL is specified to estimate the dependence parameter

ψ = [ρ_{11}, ρ_{12}, ρ_{21}, ρ_{22}]

as follows:

D_{ψ}^{^{'}} Σ_{ψ}^{- 1} (Y_{ψ} - μ_{ψ}) = 0,

(13)

with

Y_{ψ} = [Y_{1}^{[1]} Y_{1}^{[2]} | Y_{0}^{[1]}, Y_{0}^{[2]}, Y_{2}^{[1]} Y_{2}^{[2]} | Y_{1}^{[1]}, Y_{1}^{[2]}, \dots, Y_{t}^{[1]} Y_{t}^{[2]} | Y_{t - 1}^{[1]}, Y_{t - 1}^{[2]}, \dots, Y_{T}^{[1]} Y_{T}^{[2]} {| Y_{T - 1}^{[1]}, Y_{T - 1}^{[2]}]}_{T \times 1}^{^{'}}

and

μ_{ψ} = E (Y_{ψ})

.

The

(T \times T)

covariance matrix

Σ_{ψ}

comprises of Var

(Y_{t}^{[1]} Y_{t}^{[2]} ∣ Y_{t - 1}^{[1]}, Y_{t - 1}^{[2]})

along the diagonal and Cov

(Y_{t}^{[1]} Y_{t}^{[2]} Y_{t + h}^{[1]} Y_{t + h}^{[2]} ∣ Y_{t - 1}^{[1]}, Y_{t - 1}^{[2]}, Y_{t + h - 1}^{[1]}, Y_{t + h - 1}^{[2]})

in the off-diagonal entries. All the entries are of higher-order moments and hence, the ’working’ multivariate normality assumption structure is used to compute these entries as in [9,12].

As for the

(T \times 4)

derivative matrix

D_{ψ}

,

E (Y_{t}^{[1]} Y_{t}^{[2]} ∣ Y_{t - 1}^{[1]}, Y_{t - 1}^{[2]})

\begin{matrix} = [κ_{12, t} [μ_{t}^{[1]} (1 + μ_{t}^{[1]}) - ρ_{11} (1 + ρ_{11}) μ_{t - 1}^{[1]} - ρ_{11}^{2} μ_{t - 1}^{[1]} (1 + μ_{t - 1}^{[1]}) - ρ_{12} (1 + ρ_{12}) μ_{t - 1}^{[2]} \\ - ρ_{12}^{2} μ_{t - 1}^{[2]} (1 + μ_{t - 1}^{[2]})]^{\frac{1}{2}} [μ_{t}^{[2]} (1 + μ_{t}^{[2]}) - ρ_{21} (1 + ρ_{21}) μ_{t - 1}^{[1]} - ρ_{21}^{2} μ_{t - 1}^{[1]} (1 + μ_{t - 1}^{[1]}) \\ - ρ_{22} (1 + ρ_{22}) μ_{t - 1}^{[2]} - ρ_{22}^{2} μ_{t - 1}^{[2]} (1 + μ_{t - 1}^{[2]})]^{\frac{1}{2}} + (ρ_{11} Y_{t - 1}^{[1]} + ρ_{12} Y_{t - 1}^{[2]} + μ_{t}^{[1]} - ρ_{11} μ_{t - 1}^{[1]} \\ - ρ_{12} μ_{t - 1}^{[2]}) \times (ρ_{21} Y_{t - 1}^{[1]} + ρ_{22} Y_{t - 1}^{[2]} + μ_{t}^{[2]} - ρ_{21} μ_{t - 1}^{[1]} - ρ_{22} μ_{t - 1}^{[2]})] \end{matrix}

(14)

As for the estimates of

κ_{12, t}

, they are estimated using the method of moments as follows:

κ_{12, t} = \frac{\tilde{Cov} (Y_{t}^{[1]}, Y_{t}^{[2]}) - ({\hat{ρ}}_{11} {\hat{ρ}}_{22} + {\hat{ρ}}_{12} {\hat{ρ}}_{21}) \tilde{Cov} (Y_{t - 1}^{[1]}, Y_{t - 1}^{[2]}) - {\hat{ρ}}_{11} {\hat{ρ}}_{21} ({\hat{μ}}_{t - 1}^{[1]} + {\hat{μ_{t - 1}^{[1]}}}^{2}) - {\hat{ρ}}_{22} {\hat{ρ}}_{12} ({\hat{μ}}_{t - 1}^{[2]} + {\hat{μ_{t - 1}^{[2]}}}^{2})}{\sqrt{Var (R_{t}^{[1]})} \sqrt{Var (R_{t}^{[2]})}}

(15)

where

{\hat{μ}}_{0}^{[k]} = {\hat{μ}}_{1}^{[k]}

,

\tilde{Cov} (Y_{t}^{[1]}, Y_{t}^{[2]})

=

\frac{1}{T} \sum_{t = 1}^{T} (y_{t}^{[1]} - {\hat{μ}}_{t}^{[1]}) (y_{t}^{[2]} - {\hat{μ}}_{t}^{[2]})

and

\tilde{Cov} (Y_{t - 1}^{[1]}, Y_{t - 1}^{[2]})

=

\frac{1}{T - 1} \sum_{t = 2}^{T} (y_{t - 1}^{[1]} - {\hat{μ}}_{t - 1}^{[1]}) (y_{t - 1}^{[2]} - {\hat{μ}}_{t - 1}^{[2]})

.

The Newton–Raphson iteration for the second GQL yields

(\begin{matrix} {\hat{ψ}}_{r + 1} \end{matrix}) = (\begin{matrix} {\hat{ψ}}_{r} \end{matrix}) + {[D_{ψ}^{^{'}} {\hat{Σ}}_{ψ}^{- 1} D_{ψ}]}_{r}^{- 1} {[D_{ψ}^{^{'}} {\hat{Σ}}_{ψ}^{- 1} (Y_{ψ} - μ_{ψ})]}_{r}

(16)

where

{\hat{ψ}}_{r}

are the estimates at the rth iteration and

{[.]}_{r}

are the values of the expression at the rth iteration.

The regression estimates

\hat{β}

obtained from Equation (12) are used to solve Equation (16) until convergence. These updated values of

\hat{ψ}

are in turn replaced in Equation (12) to obtain a new set of regression parameters, which are again used to obtain a new set of dependence parameters and this cycle continues until convergence of the two sets of parameters. These estimates are consistent and under mild regulatory conditions,

{(\hat{ψ} - ψ)}^{^{'}}

is asymptotically normal with a mean of 0 and a covariance matrix

{[D_{ψ}^{^{'}} Σ_{ψ}^{- 1} D_{ψ}]}^{- 1}

.

Forecasting Equations

The forecasting equations are derived as follows:

Given

Y_{t}^{[k]}

, the forecasting function is expressed as

E (Y_{t + 1}^{[1]} | Y_{t}^{[1]}, Y_{t}^{[2]}) = {\hat{μ}}_{t + 1}^{[1]} + {\hat{ρ}}_{11} (Y_{t}^{[1]} - {\hat{μ}}_{t}^{[1]}) + {\hat{ρ}}_{12} (Y_{t}^{[2]} - {\hat{μ}}_{t}^{[2]}),

(17)

E (Y_{t + 1}^{[2]} | Y_{t}^{[1]}, Y_{t}^{[2]}) = {\hat{μ}}_{t + 1}^{[2]} + {\hat{ρ}}_{21} (Y_{t}^{[1]} - {\hat{μ}}_{t}^{[1]}) + {\hat{ρ}}_{22} (Y_{t}^{[2]} - {\hat{μ}}_{t}^{[2]})

(18)

4. Simulation Study

In this section we generate BINAR(1) time series data with geometric marginals under the following time-varying covariate design:

x_{t 1} = \{\begin{matrix} - \cos (2 π t) + 0.01 & (t = 1, \dots, T / 4) \\ \sin (2 π t) + 0.05 & (t = (T / 4) + 1, \dots, 3 T / 4) \\ \cos (2 π t) + 0.10 & (t = (3 T / 4) + 1, \dots, T) \end{matrix}

x_{t 2} = \{\begin{matrix} (1 / t) & (t = 1, \dots, T / 4) \\ (- 1 / t) & (t = (T / 4) + 1, \dots, 3 T / 4) \\ t & (t = (3 T / 4) + 1, \dots, T) \end{matrix}

where

μ_{t}^{[k]} = exp (x_{t 1} β_{1}^{[k]} + x_{t 2} β_{2}^{[k]})

. Assuming

[ρ_{12}, ρ_{21}] = [0.5, 0.5]

,

[ρ_{11}, ρ_{22}] =

[0.9, 0.9], [0.3, 0.9], [0.3, 0.3]

,

β^{[1]} = 0.5

and

β^{[2]} = 0.9

for

t = 1, 2, \dots, T = 100, 500, 1000

, we generate

R_{t}^{[k]}

using the inverse transformation method as in [13]. A total of 5000 Monte Carlo replications are made under the above combinations and the simulated mean estimates are shown below:

From Table 1, we observe that the GQL estimates are consistent and that the cross-correlation parameter

κ_{12, 1}

is close to unity. In addition, as the time points increase, we notice a decrease in the standard errors, with GQL yielding low standard errors as also demonstrated in [14]. Some details on the number of non-convergent simulations include: for

ρ_{11} = ρ_{22} = 0.9

under GQL, around 360 simulations failed for

T = 100

, 300 for

T = 500

and 220 for

T = 1000

. For

ρ_{11} = 0.3

and

ρ_{22} = 0.9

, around 340 GQL simulations failed for

T = 100

, 275 for

T = 500

and 190 for

T = 1000

. However, when

ρ_{11} = ρ_{22} = 0.3

, the GQL algorithms failed in 315 simulations for

T = 100

, 215 for

T = 500

and 170 for

T = 1000

. The failures were mainly due to either an ill-conditioned covariance matrix or the Hessian structure in Equation (12). Hence, it is concluded from this section that GQL yields far superior estimates than GLS and GMM, and constitutes of a slightly better non-convergent computational problem.

Table 1. GQL estimates of the parameters and standard errors under the non-stationary geometric BINAR(1) model.

5. Analysing the Time Series of Day and Night Road Accidents in Mauritius

In this section we analyse the monthly day and night accident series data in Mauritius collected from January 2011 to January 2020 that connects the capital city of Port-Louis and the tourist zone Grand-Bay, Mauritius, totalling 109 bivariate time series data. With a sample cross-correlation of 0.3267, it is rationale to believe that there exists a cross-correlation between the two series since both sets of data were collected on the same route. The summary statistics illustrate that day and night accidents have means (variance) of 8.6422 (27.5652) and 4.2110 (13.1310), respectively. Given the significant over-dispersion, the NSUBINAR(1)GEOM in Section 2 is applied to analyse the time series. The covariates we consider are: number of speed cameras (SC) in this area, the number of police officers deployed on street patrol in the different police stations in this area (PO), the number of times the streets in the area have been re-maintained during the years (NS) and number of roundabouts (RA) from Port-Louis to Grand-Bay.

Table 2 and Table 3 show the regression estimates and the serial and dependence estimates of the in-sample accident data from January 2011 to August 2019, totalling 104 paired observations, while the out-sample data from September 2019 to January 2020 were used to validate the model. Note, the in-sample data were also analysed using the NSCBINAR(1)NB from [9] and the estimates were compared with the NSCBINAR(1)GEOM and NSUBINAR(1)GEOM.

Table 2. Monthly day and night accidents: GQL estimates of the regression parameters.

Table 3. Monthly day and night accidents: GQL estimates of the dependence parameters.

From Table 2, using the NSUBINAR(1)GEOM, we can note an expected decrease in the number of day and night accidents by 9 and 5%, respectively, if there is an installation of an additional speed camera along the motorway M2. As more police patrols are re-enforced in the area, the number of accidents is expected to decrease by 9% during the day and 7% during the night. Similarly, better road maintenance contributes to a decrease in day accidents by 7% and 6% in night accidents. Roundabout construction must be carefully monitored as this factor leads to an expected increase in the number of day accidents by 6% and 8% during the night. In comparison with the NSCBINAR(1)NB, the signs estimated effects are the same as in NSUBINAR(1)GEOM with little fluctuation in the estimates and their corresponding standard errors, but far better than NSCBINAR(1)GEOM. Using the corresponding forecasting in Equations (17) and (18) and in [9], and based on the out-sample observations from September 2019 to January 2020, Table 4 displays the RMSEs and mean absolute deviation (MAD) under the NSCBINAR(1)NB, NSCBINAR(1)GEOM and NSUBINAR(1)GEOM.

Table 4. RMSE and MAD for the one step-ahead forecast for the number of monthly day and night accidents.

The standard errors, RMSE and MAD, illustrate that NSUBINAR(1)GEOM yields almost the same measures as NSCBINAR(1)NB, but better estimates than the NSCBINAR(1)GEOM.

6. Conclusions

In this paper, the unconstrained non-stationary BINAR(1) model with geometric marginals was modelled. However, during the development of this model, it is observed that the joint probability function of the innovation series is rather complex to derive and this limits the construction of a conditional likelihood function to estimate the unknown model parameters. Hence, this paper proposes an alternative GQL estimation approach that only requires the correct specification of the score and moment vectors. As for the derivation of the higher-order entries of the auto-covariance matrix, the multivariate normality assumption was used. With regard to the simulation study, it is shown that the GQL approach provides consistent parameter estimates and statistically more efficient estimates. In the analysis of the accident data, reliable estimates of the different covariates were obtained and comparable to NSCBINAR(1)NB and NSCBINAR(1)GEOM. The RMSE and MAD show that NSUBINAR(1)GEOM yields better forecasts than the other two competing models, a similar conclusion illustrated in [7]. This model is commendable to analyse over-dispersed series under non-stationary setups characterized by time-dependent effects.

Author Contributions

Conceptualization, S.Y. and M.K.N.; methodology, S.Y.; software, M.K.N.; validation, S.Y. and M.K.N.; formal analysis, S.Y.; investigation, S.Y.; resources, M.K.N.; data curation, M.K.N.; writing—original draft preparation, S.Y.; writing—review and editing, M.K.N.; visualization, S.Y. and M.K.N.; supervision, M.K.N.; project administration, S.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

https://cso.gov.tt/subject/traffic-statistics (accessed on 27 June 2023).

Conflicts of Interest

The authors declare no conflict of interest.

References

Pedeli, X.; Karlis, D. Bivariate INAR(1) Models; Technical Report; Athens University of Economics: Athens, Greece, 2009. [Google Scholar]
Pedeli, X.; Karlis, D. A bivariate INAR(1) process with application. Stat. Model. Int. J. 2011, 11, 325–349. [Google Scholar] [CrossRef]
McKenzie, E. Autoregressive moving-average processes with Negative Binomial and geometric marginal distributions. Adv. Appl. Probab. 1986, 18, 679–705. [Google Scholar] [CrossRef]
Steutel, F.; Van Harn, K. Discrete analogues of self-decomposability and stability. Ann. Probab. 1979, 7, 3893–3899. [Google Scholar] [CrossRef]
Pedeli, X.; Karlis, D. Some properties of multivariate INAR(1) processes. Comput. Stat. Data Anal. 2013, 67, 213–225. [Google Scholar] [CrossRef]
Ristic, M.; Nastic, A.; Jayakumar, K.; Bakouch, H. A bivariate INAR(1) time series model with geometric marginals. Appl. Math. Lett. 2012, 25, 481–485. [Google Scholar] [CrossRef]
Nastic, A.; Ristic, M.; Popovic, P. Estimation in a bivariate integer-valued autoregressive process. Commun.-Stat.-Theory Methods 2016, 45, 5660–5678. [Google Scholar] [CrossRef]
Mamode Khan, N.; Sunecher, Y.; Jowaheer, V. Modelling a Non-Stationary BINAR(1) Poisson Process. J. Stat. Comput. Simul. 2016, 86, 3106–3126. [Google Scholar] [CrossRef]
Sunecher, Y.; Mamodekhan, N.; Jowaheer, V. A GQL estimation approach for analysing non-stationary over-dispersed BINAR(1) time series. J. Stat. Comput. Simul. 2017, 87, 1911–1924. [Google Scholar] [CrossRef]
Nastic, A.; Laketa, P.; Ristic, M. Random environment integer-valued autoregressive process. J. Time Ser. Anal. 2016, 37, 267–287. [Google Scholar] [CrossRef]
Sutradhar, B.; Jowaheer, V.; Rao, P. Remarks on asymptotic efficient estimation for regression effects in stationary and non-stationary models for panel count data. Braz. J. Probab. Stat. 2014, 28, 241–254. [Google Scholar] [CrossRef]
Jowaheer, V.; Mamode Khan, N.; Sunecher, Y. A non-stationary BINAR(1) process with negative binomial innovations for modeling the number of goals in the first and second half: The case study of Arsenal Football Club. Commun.-Stat.-Case Stud. Data Anal. Appl. 2017, 2, 21–33. [Google Scholar] [CrossRef]
Ristic, M.; Bakouch, H.; Nastic, A. A new geometric first-order integer-valued autoregressive (NGINAR(1)) process. J. Stat. Plan. Inference 2009, 136, 2218–2226. [Google Scholar] [CrossRef]
Sunecher, Y.; Mamodekhan, N.; Jowaheer, V. Estimating the parameters of a BINMA Poisson model for a non-stationary bivariate time series. Commun. Stat. Simul. Comput. 2016, 46, 6803–6827. [Google Scholar] [CrossRef]

Table 1. GQL estimates of the parameters and standard errors under the non-stationary geometric BINAR(1) model.

$ρ_{11}$	$ρ_{22}$	T	Methods	${\hat{β}}_{1}^{[1]}$	${\hat{β}}_{2}^{[1]}$	${\hat{β}}_{1}^{[2]}$	${\hat{β}}_{2}^{[2]}$	${\hat{ρ}}_{11}$	${\hat{ρ}}_{22}$	${\hat{ρ}}_{12}$	${\hat{ρ}}_{21}$	${\hat{κ}}_{12, 1}$
0.9	0.9	100	GQL	0.4823	0.4870	0.8876	0.8840	0.8819	0.8847	0.4859	0.4847	0.9815
				(0.0910)	(0.0931)	(0.0977)	(0.0946)	(0.1163)	(0.1115)	(0.1125)	(0.1153)
		500	GQL	0.4923	0.4914	0.8950	0.8917	0.8942	0.8920	0.4945	0.4940	0.9909
				(0.0517)	(0.0512)	(0.0547)	(0.0585)	(0.0649)	(0.0625)	(0.0623)	(0.0630)
		1000	GQL	0.4991	0.4988	0.8960	0.8981	0.8984	0.8994	0.5004	0.5006	0.9979
				(0.0122)	(0.0171)	(0.0194)	(0.0146)	(0.0271)	(0.0224)	(0.0209)	(0.0237)
0.3	0.9	100	GQL	0.4894	0.4891	0.8870	0.8840	0.2856	0.8868	0.4899	0.4888	0.9826
				(0.0969)	(0.0977)	(0.0915)	(0.0959)	(0.1133)	(0.1172)	(0.1128)	(0.1119)
		500	GQL	0.4926	0.4927	0.8944	0.8959	0.2915	0.8928	0.4965	0.4940	0.9918
				(0.0526)	(0.0556)	(0.0507)	(0.0512)	(0.0681)	(0.0694)	(0.0671)	(0.0631)
		1000	GQL	0.4995	0.4975	0.8969	0.8994	0.2988	0.8963	0.5008	0.5011	0.9995
				(0.0137)	(0.0175)	(0.0111)	(0.0141)	(0.0213)	(0.0293)	(0.0251)	(0.0221)
0.3	0.3	100	GQL	0.4823	0.4870	0.8804	0.8896	0.2812	0.2834	0.4854	0.4835	0.9819
				(0.0981)	(0.0911)	(0.0931)	(0.0928)	(0.1169)	(0.1160)	(0.1135)	(0.1197)
		500	GQL	0.4929	0.4942	0.8910	0.8935	0.2964	0.2931	0.4920	0.4915	0.9913
				(0.0594)	(0.0589)	(0.0562)	(0.0580)	(0.0614)	(0.0621)	(0.0677)	(0.0681)
		1000	GQL	0.4956	0.4992	0.8987	0.8990	0.2988	0.2980	0.5004	0.5001	0.9966
				(0.0152)	(0.0125)	(0.0135)	(0.0133)	(0.0241)	(0.0208)	(0.0219)	(0.0211)

Table 2. Monthly day and night accidents: GQL estimates of the regression parameters.

Model	Time Series	Intercept	NS	SC	PO	RA	$\hat{c}$
	Day Accidents	2.5353	−0.0815	−0.0942	−0.0934	0.0760	1.8475
	s.e	(0.0633)	(0.0396)	(0.0456)	(0.0397)	(0.0289)	(0.0948)
NSCBINAR(1)NB	Night Accidents	0.9272	−0.0790	−0.0671	−0.0824	0.0943	0.8965
	s.e	(0.0742)	(0.0245)	(0.0213)	(0.0329)	(0.0330)	(0.0980)
	Day Accidents	2.4445	−0.0824	−0.0952	−0.0955	0.0668
	s.e	(0.0560)	(0.0273)	(0.0314)	(0.0215)	(0.0164)
NSUBINAR(1)GEOM	Night Accidents	0.9106	−0.0714	−0.0572	−0.0747	0.0817
	s.e	(0.0710)	(0.0180)	(0.0157)	(0.0260)	(0.0205)
	Day Accidents	2.413	−0.0929	−0.0866	−0.0961	0.0852
	s.e	(0.0958)	(0.0353)	(0.0385)	(0.0419)	(0.0342)
NSCBINAR(1)GEOM	Night Accidents	0.9623	−0.0876	−0.0894	−0.0785	0.0899
	s.e	(0.0952)	(0.0367)	(0.0387)	(0.0375)	(0.0345)

Table 3. Monthly day and night accidents: GQL estimates of the dependence parameters.

Model	Time Series	$\hat{ρ} - Serial$	$\hat{ρ} - Cross$	${\hat{κ}}_{12, 1}$
	Day Accidents	0.2620		0.0065
	s.e
NSCBINAR(1)NB	Night Accidents	0.2941
	s.e
	Day Accidents	0.2748	0.0728	0.0025
	s.e	(0.0346)	(0.0234)
NSUBINAR(1)GEOM	Night Accidents	0.2438	0.0563
	s.e	(0.0388)	(0.0191)
	Day Accidents	0.2145		0.0034
	s.e
NSCBINAR(1)GEOM	Night Accidents	0.2442
	s.e

Table 4. RMSE and MAD for the one step-ahead forecast for the number of monthly day and night accidents.

Model	RMSE	RMSE	MAD	MAD
	$Y_{t}^{[1]}$	$Y_{t}^{[2]}$	$Y_{t}^{[1]}$	$Y_{t}^{[2]}$
NSCBINAR(1)NB	0.132	0.141	0.109	0.120
NSUBINAR(1)GEOM	0.120	0.129	0.098	0.104
NSCBINAR(1)GEOM	0.196	0.189	0.155	0.144

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.

A Novel Unconstrained Geometric BINAR(1) Model †

Abstract

1. Introduction

2. The Non-Stationary Unconstrained BINAR(1) with Geometric Marginals (NSUBINAR(1)GEOM)

3. Estimation Method

Forecasting Equations

4. Simulation Study

5. Analysing the Time Series of Day and Night Road Accidents in Mauritius

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Article Access Statistics

A Novel Unconstrained Geometric BINAR(1) Model^†