Modeling High-Frequency Zeros in Time Series with Generalized Autoregressive Score Models with Explanatory Variables: An Application to Precipitation

Pedro Vidal-Gutiérrez; Sergio Contreras-Espinoza; Francisco Novoa-Muñoz

doi:10.3390/axioms13010015

¹ Departamento de Estadística, Facultad de Ciencias, Universidad del Bío-Bío, Concepción 4051381, Chile

² Departamento de Enfermería, Facultad de Ciencias de la Salud y de los Alimentos, Universidad del Bío-Bío, Chillán 3800708, Chile

* Author to whom correspondence should be addressed.

Axioms 2024, 13(1), 15; https://doi.org/10.3390/axioms13010015

This article belongs to the Special Issue Statistical Methods and Applications

Abstract

An extension of the Generalized Autoregressive Score (GAS) model proposed by Harvey and Ito for time series with excess zero observations is presented; the extension incorporates explanatory variables. The extended model is applied to precipitation data from a city in Chile. It is concluded that the model provides adequate prediction. In addition, the relationship between precipitation and the explanatory variables is analyzed and compared with the meteorology literature, with which it agrees.

Keywords:

generalized autoregressive score; zero-augmented; dynamic conditional score model; generalized beta distribution of the second kind; time series

MSC:

62M10

1. Introduction

In recent times, models with varying parameters have gained increasing popularity for working with time-series data. One of these models is the Generalized Autoregressive Score (GAS), or the Dynamic Conditional Score (DCS). According to Creal et al. [1], these models belong to the class of observation-driven models.

Sometimes, a significant proportion of the observations in a time series are zeros, while the remaining observations are positive and measured on a continuous scale. An example is daily precipitation: on the many days with no rainfall, the recorded value is zero. This type of data calls for the zero-augmented distributions introduced by Hautsch et al. [2]. Building on that approach, Harvey and Ito [3] developed a framework for working with GAS models in the presence of a significant frequency of zeros.

The objective and contribution of this paper is to extend the model proposed in [3] to include explanatory variables. To achieve this, this research is structured as follows: Section 2 briefly introduces the necessary theory for conducting this study and incorporates the explanatory variables. Section 3 presents the obtained results, which are discussed in Section 4. Finally, the conclusions are drawn in Section 5.

2. Materials and Methods

This section provides a summary of the GAS models [1], the zero-augmented distributions [2], and the integration of these concepts. The goal is to subsequently expand the model using explanatory variables.

2.1. GAS Models

GAS models [1], also known as DCS models [4], are observation-driven models. Blasques et al. [5] define these models for an observed time series $y_{1}, \dots, y_{T}$ with a density given by $y_{t} \sim p_{y} (y_{t} ∣ f_{t}, y^{1 : t - 1}; θ)$, $t = 1, \dots, T$. This density depends on the time-varying parameter $f_{t}$, past observations $y^{1 : t - 1} := \{y_{1}, y_{2}, \dots, y_{t - 1}\}$, and static parameters $θ$. The time-varying parameter $f_{t}$ is defined as the function $f_{t} := f_{t} (y^{1 : t - 1}; θ)$.

An example of the update equation is $f_{t + 1} = ω + β f_{t} + α s (y_{t}, f_{t}; θ)$, where $θ = (ω, α, β)$, $s (y_{t}, f_{t}; θ) = S_{t} (f_{t}; θ) \frac{\partial \log p_{y} (y_{t} ∣ f_{t}, y^{1 : t - 1}; θ)}{\partial f_{t}}$ is the weighted score, $S_{t} (f_{t}; θ) = I_{t}^{- d}$ with $d \in \{0, 1\}$, and $I_{t} = - E_{t} \left[\frac{\partial^{2} \log p_{y} (y_{t} ∣ f_{t}, y^{1 : t - 1}; θ)}{\partial f_{t}^{2}}\right]$.

Since the vector $θ$ is unknown, it is estimated using the maximum likelihood method, maximizing $ℓ (θ) := \sum_{t = 1}^{T} \log p_{y} (y_{t} ∣ f_{t}, y^{1 : t - 1}; θ)$.

2.2. Zero-Augmented Distribution for Non-Negative Variables

Consider a non-negative continuous random variable $X$ with independent observations $\{X_{t}\}_{t = 1}^{n}$. To account for excess zeros, Hautsch et al. [2] allocate a probability mass at the exact zero value and define probabilities $π := P (X > 0)$ and $1 - π := P (X = 0)$.

Conditional on $X > 0$, $X$ follows a continuous distribution with density $g_{X} (x) := f_{X} (x ∣ X > 0)$, which is continuous for $x \in (0, \infty)$. Consequently, the unconditional distribution of $X$ is semicontinuous with a discontinuity at zero. This implies the density $f_{X} (x) = (1 - π) δ (x) + π g_{X} (x) I_{(x > 0)}$, where $0 \leq π \leq 1$, $δ (x)$ is a point probability mass at $x = 0$, and $I_{(x > 0)}$ denotes the indicator function that takes the value 1 for $x > 0$ and 0 otherwise. The probability $π$ is treated as a parameter of the distribution that determines how much probability mass is assigned to the strictly positive part of the support of $X$. In [3], a GAS model using a zero-augmented distribution is presented and applied to precipitation data. That work raises the possibility of extending the model with explanatory variables, which is addressed in this paper.

2.3. Dynamic Model for the Zero-Augmented Distribution Model

To model time series with excess null observations, Harvey and Ito [3] defined a probability density function $g (\cdot)$ for which it is possible to identify a scale parameter $φ$. In the context of GAS models, it is necessary to use a link function to introduce dynamics to the parameter, making $φ = \exp (λ)$.

According to Harvey and Ito [3], in a parameter-driven model, the dynamics should be introduced through the parameter $λ$. Conversely, the DCS model is observation-driven, with the predictive distribution defined conditional on a filtered value of $λ$, denoted as $λ_{t ∣ t - 1}$.

For an observed time series $y_{1}, y_{2}, \dots, y_{T}$, let $y_{t} \sim f (y_{t} ∣ λ_{t ∣ t - 1}; θ)$, where $f (\cdot)$ is the probability density function of $y_{t}$ obtained from a zero-augmented distribution. In other words, $f (y_{t} ∣ λ_{t ∣ t - 1}) = (1 - π) (1 - I_{(y_{t} > 0)}) + π g (y_{t} ∣ λ_{t ∣ t - 1}) I_{(y_{t} > 0)}$. Harvey and Ito [3] introduced dynamics to $π$ through a logistic transformation, so when $π_{t}$ depends on $λ_{t ∣ t - 1}$, it yields:

$$π_{t ∣ t - 1} = \frac{\exp (δ_{0} + δ_{1} λ_{t ∣ t - 1})}{1 + \exp (δ_{0} + δ_{1} λ_{t ∣ t - 1})} . \tag{1}$$

Thus, the probability density function associated with $y_{t}$ takes the form:

$$f (y_{t} ∣ π_{t ∣ t - 1}; λ_{t ∣ t - 1}) = (1 - π_{t ∣ t - 1}) (1 - I_{(y_{t} > 0)}) + π_{t ∣ t - 1} g (y_{t} ∣ λ_{t ∣ t - 1}) I_{(y_{t} > 0)} . \tag{2}$$

2.4. Derivation of the Model’s Score

To obtain the score of the model, (2) is rewritten as follows:

$$f (y_{t} ∣ λ_{t ∣ t - 1}) = \begin{cases} 1 - π_{t ∣ t - 1}, & \text{if } y_{t} = 0 \\ π_{t ∣ t - 1} g (y_{t} ∣ λ_{t ∣ t - 1}), & \text{if } y_{t} > 0 \end{cases} . \tag{3}$$

By taking the derivative of the logarithm of (3) with respect to $λ_{t ∣ t - 1}$ and considering (1), the score of the model is given by:

$$\frac{\partial \log f (y_{t} ∣ λ_{t ∣ t - 1})}{\partial λ_{t ∣ t - 1}} = \begin{cases} - δ_{1} π_{t ∣ t - 1}, & \text{if } y_{t} = 0 \\ δ_{1} (1 - π_{t ∣ t - 1}) + \frac{\partial \log [g (y_{t} ∣ λ_{t ∣ t - 1})]}{\partial λ_{t ∣ t - 1}}, & \text{if } y_{t} > 0 \end{cases} .$$

When expressed in terms of the indicator function $I_{(y_{t} > 0)}$, this becomes:

$$\frac{\partial \log f (y_{t} ∣ λ_{t ∣ t - 1})}{\partial λ_{t ∣ t - 1}} = - δ_{1} π_{t ∣ t - 1} (1 - I_{(y_{t} > 0)}) + \left\{δ_{1} (1 - π_{t ∣ t - 1}) + \frac{\partial \log [g (y_{t} ∣ λ_{t ∣ t - 1})]}{\partial λ_{t ∣ t - 1}}\right\} I_{(y_{t} > 0)} . \tag{4}$$
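The step from (3) to the score relies on the derivative of the logistic link in (1). As a short worked derivation, using only the definitions above and writing $π$ for $π_{t ∣ t - 1}$ and $λ$ for $λ_{t ∣ t - 1}$:

$$\frac{\partial π}{\partial λ} = \frac{δ_{1} \exp (δ_{0} + δ_{1} λ)}{[1 + \exp (δ_{0} + δ_{1} λ)]^{2}} = δ_{1} π (1 - π),$$

so that

$$\frac{\partial \log (1 - π)}{\partial λ} = - \frac{δ_{1} π (1 - π)}{1 - π} = - δ_{1} π, \qquad \frac{\partial \log π}{\partial λ} = \frac{δ_{1} π (1 - π)}{π} = δ_{1} (1 - π).$$

The first expression is the score for $y_{t} = 0$; the second is the part of the score for $y_{t} > 0$ that does not involve $g$.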

2.5. Generalized Beta Distribution of the Second Kind

For precipitation data, Harvey and Ito [3] recommend using the generalized beta distribution of the second kind [6], which is given by:

$$g (y ∣ a, b, p, q) = \begin{cases} \frac{a (y / b)^{a p - 1}}{b B (p, q) [1 + (y / b)^{a}]^{p + q}}, & 0 < y < + \infty \\ 0, & \text{otherwise} \end{cases} , \tag{5}$$

where $a, b, p, q > 0$, with $b$ being the scale parameter and $a$, $p$, and $q$ the shape parameters. According to Kleiber and Kotz [7], the non-central moments of order $k \in \mathbb{N}$ are given by:

$$E (Y^{k}) = \frac{b^{k} B (p + k / a, q - k / a)}{B (p, q)} = \frac{b^{k} Γ (p + k / a) Γ (q - k / a)}{Γ (p) Γ (q)} . \tag{6}$$

At the same time, the density of a generalized beta distribution of the second kind exhibits considerable flexibility, as demonstrated in Figure 1.

Figure 1. Generalized beta distribution of the second kind density function for $b = 1$, $p = 0.5$, $q = 2$.

Special cases encompass a broad range of distributions for non-negative variables. For instance, when $p = 1$, the distribution becomes the Burr distribution, and when, in addition, $q = 1$ (that is, $p = q = 1$), it becomes a log-logistic distribution (McDonald [8]).

2.6. GAS Model for a Zero-Augmented Distribution

We work with the generalized beta distribution of the second kind, for which the density is given by (5). Using the exponential link function $b = \exp (λ)$ and incorporating the time dynamics yields:

$$g (y_{t} ∣ λ_{t ∣ t - 1}) = \begin{cases} \frac{a (y_{t} / \exp (λ_{t ∣ t - 1}))^{a p - 1}}{\exp (λ_{t ∣ t - 1}) B (p, q) [1 + (y_{t} / \exp (λ_{t ∣ t - 1}))^{a}]^{p + q}}, & 0 < y_{t} < + \infty \\ 0, & \text{otherwise} \end{cases} . \tag{7}$$

Taking the logarithm of (7) and differentiating with respect to $λ_{t ∣ t - 1}$ gives the term required in (4):

$$\frac{\partial \log [g (y_{t} ∣ λ_{t ∣ t - 1})]}{\partial λ_{t ∣ t - 1}} = a (p + q) \frac{[y_{t} \exp (- λ_{t ∣ t - 1})]^{a}}{[y_{t} \exp (- λ_{t ∣ t - 1})]^{a} + 1} - a p . \tag{8}$$
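Equation (8) can be checked numerically against a finite-difference derivative of the log of (7). The following is a minimal sketch, not the authors' implementation; the function names and the parameter values are illustrative assumptions.

```python
import math

def gb2_logpdf(y, lam, a, p, q):
    """Log of density (7): GB2 with scale b = exp(lam), valid for y > 0."""
    z = y * math.exp(-lam)                       # y / b
    log_beta = math.lgamma(p) + math.lgamma(q) - math.lgamma(p + q)
    return (math.log(a) + (a * p - 1.0) * math.log(z) - lam
            - log_beta - (p + q) * math.log1p(z ** a))

def gb2_score(y, lam, a, p, q):
    """Analytic derivative of log g with respect to lam, Equation (8)."""
    za = (y * math.exp(-lam)) ** a
    return a * (p + q) * za / (za + 1.0) - a * p

# Illustrative (hypothetical) values, not estimates from the paper.
y, lam, a, p, q = 3.2, 0.7, 1.5, 0.5, 2.0
h = 1e-6
numeric = (gb2_logpdf(y, lam + h, a, p, q)
           - gb2_logpdf(y, lam - h, a, p, q)) / (2.0 * h)
analytic = gb2_score(y, lam, a, p, q)
print(abs(numeric - analytic) < 1e-6)
```

A useful by-product of (8) is that this part of the score is bounded between $- a p$ and $a q$, so a single extreme observation cannot move $λ$ arbitrarily far.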

Thus, the model for $y_{t}$ in terms of the time-varying parameter $λ_{t ∣ t - 1}$ is:

$$y_{t} \sim f (y_{t} ∣ y_{1}, \dots, y_{t - 1}, π_{t ∣ t - 1}; λ_{t ∣ t - 1}; θ),$$

with a probability density function given by:

$$f (y_{t} ∣ y_{1}, \dots, y_{t - 1}, π_{t ∣ t - 1}; λ_{t ∣ t - 1}; θ) = (1 - π_{t ∣ t - 1}) (1 - I_{(y_{t} > 0)}) + π_{t ∣ t - 1} g (y_{t} ∣ λ_{t ∣ t - 1}) I_{(y_{t} > 0)},$$

where $g (\cdot)$ is the density of a generalized beta distribution of the second kind given in (7), $θ = (a, p, q, ω, ϕ, κ, δ_{0}, δ_{1})$, $π_{t ∣ t - 1}$ is defined in (1), and

$$λ_{t + 1 ∣ t} = ω + ϕ λ_{t ∣ t - 1} + κ u_{t},$$

where $u_{t}$ is the conditional score of the model and $κ$ is the weight assigned to it.

2.7. Explanatory Variables

Harvey and Luati [9] show that, in a model whose location parameter $μ$ is time-varying, a set of explanatory variables collected in a $k \times 1$ vector $w_{t}$ can be included alongside the past values and the score through the following formulation:

$$μ_{t ∣ t - 1} = ω + w_{t}^{\prime} β + μ_{t ∣ t - 1}^{†}, \quad t = 1, \dots, T, \tag{9}$$

$$μ_{t + 1 ∣ t}^{†} = ϕ μ_{t ∣ t - 1}^{†} + κ u_{t}, \quad t = 1, \dots, T, \tag{10}$$

where $β$ is also a $k \times 1$ vector containing one parameter to be estimated for each explanatory variable.

2.8. Diagnosis

Diebold et al. [10] state that, to evaluate whether a model for $y_{t}$ is well-fitted, it should be demonstrated that the probability integral transforms (PITs), $z_{t} = \int_{- \infty}^{y_{t}} p_{t} (u) \, d u$, are independent and identically distributed as the uniform distribution $U (0, 1)$, where $p_{t} (\cdot)$ represents the density forecasts of the generating process $f_{y} (y_{t})$.
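As an illustration of the PIT diagnostic, the sketch below uses the Burr special case ($p = 1$) of the generalized beta distribution of the second kind, chosen here only because its distribution function has the closed form $F (y) = 1 - [1 + (y / b)^{a}]^{- q}$. It is an assumption made for the example, not the model fitted in the paper, and all parameter values are hypothetical. The Kolmogorov–Smirnov distance to $U (0, 1)$ is small when the PITs come from the correct model and large when the scale is misspecified.

```python
import random

def burr_cdf(y, b, a, q):
    """CDF of the GB2 special case p = 1 (Burr), which has a closed form."""
    return 1.0 - (1.0 + (y / b) ** a) ** (-q)

def burr_draw(rng, b, a, q):
    """Inverse-CDF draw from the same Burr distribution."""
    u = rng.random()
    return b * ((1.0 - u) ** (-1.0 / q) - 1.0) ** (1.0 / a)

def ks_distance(z):
    """Kolmogorov-Smirnov distance between the ECDF of z and the U(0, 1) CDF."""
    z = sorted(z)
    n = len(z)
    return max(max((i + 1) / n - v, v - i / n) for i, v in enumerate(z))

rng = random.Random(0)
b, a, q = 4.0, 1.5, 2.0                      # hypothetical parameter values
y = [burr_draw(rng, b, a, q) for _ in range(2000)]

pit_good = [burr_cdf(v, b, a, q) for v in y]         # PITs under the true model
pit_bad = [burr_cdf(v, 2.0 * b, a, q) for v in y]    # PITs with a wrong scale
ks_good = ks_distance(pit_good)
ks_bad = ks_distance(pit_bad)
print(ks_good, ks_bad)
```

In the application below, the same idea is applied to the positive observations only, with the fitted conditional distribution in place of the Burr distribution.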

2.9. Prediction

To obtain predictions, Blasques et al. [5] create confidence bands for the time-varying parameter $f_{t + 1}$. They consider the model for an observed time series $y_{1}, y_{2}, \dots, y_{T}$ given by $y_{t} \sim p_{y} (y_{t} ∣ f_{t}; θ)$ with the update equation

$$f_{t + 1} = ϕ (y_{t}, f_{t}; θ) . \tag{11}$$

In GAS models, $f_{T + 1}$, by construction, depends only on $y_{1}, y_{2}, \dots, y_{T}$ and is therefore known at time $T$, so the time-varying parameter needs to be simulated only from time $T + 2$ onward.

Harvey and Ito [3] accomplish this through computational simulation, following the steps outlined below for $n \geq 2$:

(A): Given the point estimate by maximum likelihood ${\hat{θ}}_{T}$ and the filtered value ${\hat{f}}_{T + 1}$ obtained from (11) for $θ = {\hat{θ}}_{T}$ and $t = T$, simulate $S$ realizations $y_{T + 1}^{1}, \dots, y_{T + 1}^{S}$ from the estimated conditional density at time $T + 1$. In other words,

$y_{T + 1}^{s} \sim p_{y} (y_{T + 1} ∣ {\hat{f}}_{T + 1}; {\hat{θ}}_{T}), \quad s = 1, \dots, S .$

(B): Given the simulated observations $y_{T + 1}^{1}, \dots, y_{T + 1}^{S}$ and Equation (11), obtain the filtered values ${\hat{f}}_{T + 2}^{1}, \dots, {\hat{f}}_{T + 2}^{S}$, conditioned on ${\hat{θ}}_{T}$ and ${\hat{f}}_{T + 1}$, using:

${\hat{f}}_{T + 2}^{s} = ϕ (y_{T + 1}^{s}, {\hat{f}}_{T + 1}; {\hat{θ}}_{T}), \quad s = 1, \dots, S .$

(C): For ${\hat{f}}_{T + 2}^{s}, s = 1, \dots, S$, repeat steps (A) and (B) for the periods $T + 2, \dots, T + n$.

(D): Use ${\hat{f}}_{T + n}^{s}$, $s = 1, \dots, S$, to calculate forecast bands at the desired percentiles.
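Steps (A) to (D) can be sketched as follows for the zero-augmented model of Section 2.6, with $f_{t}$ played by $λ_{t ∣ t - 1}$. This is a minimal illustration under stated assumptions, not the authors' R code: the function names and all parameter values are hypothetical, the recursion is the one without explanatory variables, and positive values are drawn by transforming a beta variate, using the fact that $b [Z / (1 - Z)]^{1 / a}$ follows (5) when $Z \sim \text{Beta} (p, q)$.

```python
import math
import random

def pi_of(lam, d0, d1):
    """Probability of a positive observation, logistic link of Equation (1)."""
    return 1.0 / (1.0 + math.exp(-(d0 + d1 * lam)))

def score(y, lam, th):
    """Conditional score of Equation (4), with Equation (8) for the GB2 part."""
    pi = pi_of(lam, th["d0"], th["d1"])
    if y == 0.0:
        return -th["d1"] * pi
    za = (y * math.exp(-lam)) ** th["a"]
    dlogg = th["a"] * (th["p"] + th["q"]) * za / (za + 1.0) - th["a"] * th["p"]
    return th["d1"] * (1.0 - pi) + dlogg

def draw(rng, lam, th):
    """One draw from the zero-augmented GB2 with scale exp(lam)."""
    if rng.random() >= pi_of(lam, th["d0"], th["d1"]):
        return 0.0
    z = min(rng.betavariate(th["p"], th["q"]), 1.0 - 1e-12)   # Z ~ Beta(p, q)
    return math.exp(lam) * (z / (1.0 - z)) ** (1.0 / th["a"])

def forecast_bands(lam_next, th, n, S, rng, probs=(0.025, 0.5, 0.975)):
    """Simulate S paths of lambda and return percentile bands for T+1..T+n."""
    lam = [lam_next] * S                  # every path starts at the filtered value
    bands = []
    for _ in range(n):
        srt = sorted(lam)
        bands.append([srt[min(S - 1, int(pr * S))] for pr in probs])   # step (D)
        ys = [draw(rng, l, th) for l in lam]                            # step (A)
        lam = [th["omega"] + th["phi"] * l + th["kappa"] * score(y, l, th)
               for y, l in zip(ys, lam)]                                # step (B)
    return bands                          # the loop itself is step (C)

theta = {"omega": 0.05, "phi": 0.95, "kappa": 0.05, "d0": -1.0, "d1": 1.0,
         "a": 1.5, "p": 0.5, "q": 2.0}    # hypothetical values
bands = forecast_bands(lam_next=1.0, th=theta, n=5, S=500, rng=random.Random(1))
for h, (lo, med, hi) in enumerate(bands, start=1):
    print(f"T+{h}: {lo:.3f} {med:.3f} {hi:.3f}")
```

The band for $T + 1$ is degenerate because the filtered value is known; the bands widen from $T + 2$ onward as simulated observations feed the recursion. Bands for the scale or for the probability of rain follow by applying $\exp (\cdot)$ or the logistic link to each simulated path before taking percentiles.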

2.10. Brier Probability Score

To evaluate the quality of the prediction, the Brier probability score (BPS) is used as a measure of accuracy. This metric is widely employed in such cases (Wilks [11]). BPS was introduced by Brier et al. [12] and is given by:

$$BPS = \frac{1}{n} \sum_{t = 1}^{n} (p_{t} - α_{t})^{2},$$

where $n$ represents the number of predicted values, $p_{t}$ is the predicted probability at time $t$, and $α_{t}$ takes the value 1 if the event occurred at time $t$ and 0 otherwise. Since $0 \leq BPS \leq 1$, Salvador [13] suggests that predictions are acceptable if $BPS \leq 0.35$.
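The score is straightforward to compute. The snippet below is a small illustration with made-up forecasts; the function name `brier_score` and the data are assumptions for the example only.

```python
def brier_score(probs, outcomes):
    """Mean squared difference between forecast probabilities and 0/1 outcomes."""
    if len(probs) != len(outcomes) or not probs:
        raise ValueError("probs and outcomes must be non-empty and of equal length")
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(probs)

# Hypothetical forecasts of "no rain" and what was observed (1 = no rain).
p_no_rain = [0.8, 0.6, 0.3, 0.9]
observed = [1, 0, 0, 1]
bps = brier_score(p_no_rain, observed)
print(round(bps, 3))
```

A forecaster who always issues probability 0.5 obtains a score of 0.25 regardless of the outcomes, which gives a simple reference point when reading a reported value.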

2.11. Application

The zero-augmented GAS model to be formulated will be applied to precipitation data. Let $y_{t}$ represent the amount of precipitation in period $t$. Then,

$$y_{t} \sim p_{y} (y_{t} ∣ y_{1}, \dots, y_{t - 1}, λ_{t ∣ t - 1}; θ) .$$

It is assumed that the data-generating process for precipitation follows the zero-augmented generalized beta distribution of the second kind. As a result, the conditional density of $y_{t}$ is defined as:

$$p (y_{t} ∣ y_{1}, \dots, y_{t - 1}, λ_{t ∣ t - 1}; θ) = (1 - π_{t ∣ t - 1}) (1 - I_{(y_{t} > 0)}) + π_{t ∣ t - 1} g (y_{t} ∣ λ_{t ∣ t - 1}) I_{(y_{t} > 0)},$$

where $π_{t ∣ t - 1}$ is given in (1) and $g (y_{t} ∣ λ_{t ∣ t - 1})$ is the density of the generalized beta distribution of the second kind. According to Harvey and Ito [3], improved estimates are obtained when (7) is reparameterized in terms of the reciprocal of the tail index, $\bar{η} = 1 / η$, where $η = a q$ is the tail index. This leads to:

$$g (y_{t} ∣ λ_{t ∣ t - 1}) = \begin{cases} \frac{a (y_{t} / \exp (λ_{t ∣ t - 1}))^{a p - 1}}{\exp (λ_{t ∣ t - 1}) B (p, \frac{1}{a \bar{η}}) [1 + (y_{t} / \exp (λ_{t ∣ t - 1}))^{a}]^{p + \frac{1}{a \bar{η}}}}, & 0 < y_{t} < + \infty, \\ 0, & \text{otherwise}, \end{cases}$$

where $a, p > 0$ are shape parameters and $\exp (λ_{t ∣ t - 1})$ is the scale parameter, modeled through $λ_{t ∣ t - 1}$, which acts as the location parameter. Replacing $μ_{t ∣ t - 1}$ with $λ_{t ∣ t - 1}$ in Equations (9) and (10) results in:

$$\begin{aligned} λ_{t ∣ t - 1} &= ω + w_{t}^{\prime} β + λ_{t ∣ t - 1}^{†}, & t = 1, \dots, T, \\ λ_{t + 1 ∣ t}^{†} &= ϕ λ_{t ∣ t - 1}^{†} + κ u_{t}, & t = 1, \dots, T, \end{aligned} \tag{12}$$

where $ϕ$ and $κ$ are parameters to be estimated; $w_{t}^{\prime} = (w_{1}, w_{2}, \dots, w_{k})$, where $w_{i}$ with $i \in \{1, \dots, k\}$ are the explanatory variables; $β = (β_{1}, β_{2}, \dots, β_{k})$, where $β_{i}$ with $i \in \{1, \dots, k\}$ are the parameters to be estimated; and $u_{t}$ is the conditional score of the model, given by:

$$\begin{aligned} u_{t} &= \frac{\partial \log p (y_{t} ∣ y_{1}, \dots, y_{t - 1}, λ_{t ∣ t - 1}; θ)}{\partial λ_{t ∣ t - 1}} \\ &= - δ_{1} π_{t ∣ t - 1} (1 - I_{(y_{t} > 0)}) + \left\{δ_{1} (1 - π_{t ∣ t - 1}) + \frac{\partial \log [g (y_{t} ∣ λ_{t ∣ t - 1})]}{\partial λ_{t ∣ t - 1}}\right\} I_{(y_{t} > 0)}, \end{aligned}$$

where $\frac{\partial \log [g (y_{t} ∣ λ_{t ∣ t - 1})]}{\partial λ_{t ∣ t - 1}}$ is given by Equation (8).

The conditional mean, obtained directly from (6), is given by:

$$E (y_{t} ∣ Y_{t - 1}) = π_{t ∣ t - 1} \exp (λ_{t ∣ t - 1}) \frac{B (p + 1 / a, 1 / (a \bar{η}) - 1 / a)}{B (p, \frac{1}{a \bar{η}})} . \tag{13}$$
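Putting (1), (8), (12) and (13) together, the filter can be sketched as below. This is an illustrative reading of the equations, not the authors' code. The coefficients in `beta` are the estimates quoted later in the Discussion; every other parameter value, the covariate values, the start value of zero for $λ^{†}$, and the name `run_filter` are assumptions made for the example. Note that (13) requires $\bar{η} < 1$ so that the second argument of the beta function in the numerator is positive.

```python
import math

def log_beta(x, y):
    """Logarithm of the beta function B(x, y)."""
    return math.lgamma(x) + math.lgamma(y) - math.lgamma(x + y)

def run_filter(y, W, th):
    """Filter of Equation (12); returns (lambda, pi, conditional mean) per period."""
    a, p, eta_bar = th["a"], th["p"], th["eta_bar"]
    q = 1.0 / (a * eta_bar)                  # eta = a q and eta_bar = 1 / eta
    # Ratio of beta functions in (13); needs q > 1 / a, i.e. eta_bar < 1.
    mean_factor = math.exp(log_beta(p + 1.0 / a, q - 1.0 / a) - log_beta(p, q))
    dagger = 0.0                             # assumed start value for lambda-dagger
    out = []
    for y_t, w_t in zip(y, W):
        lam = th["omega"] + sum(w * b for w, b in zip(w_t, th["beta"])) + dagger
        pi = 1.0 / (1.0 + math.exp(-(th["d0"] + th["d1"] * lam)))
        out.append((lam, pi, pi * math.exp(lam) * mean_factor))
        if y_t > 0.0:                        # score u_t from Equations (4) and (8)
            za = (y_t * math.exp(-lam)) ** a
            u = th["d1"] * (1.0 - pi) + a * (p + q) * za / (za + 1.0) - a * p
        else:
            u = -th["d1"] * pi
        dagger = th["phi"] * dagger + th["kappa"] * u
    return out

theta = {"omega": -1.0, "phi": 0.9, "kappa": 0.05, "d0": -0.5, "d1": 0.8,
         "a": 1.5, "p": 0.5, "eta_bar": 0.25,          # hypothetical values
         "beta": (0.0457, -0.0010, -0.0884)}           # quoted in the Discussion
y = [0.0, 2.5, 0.0, 7.1]                               # made-up precipitation (mm)
W = [[85.0, 1012.0, 14.0], [92.0, 1005.0, 12.0],
     [70.0, 1018.0, 19.0], [95.0, 1001.0, 11.0]]       # made-up covariates
path = run_filter(y, W, theta)
for lam, pi, mean in path:
    print(f"lambda={lam:.4f}  pi={pi:.4f}  mean={mean:.4f}")
```

Summing the logarithm of the conditional density over the same loop would give the log-likelihood that is maximized in Section 2.13.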

2.12. Dataset

The time series used corresponds to the daily precipitation in the city of Puerto Montt in Chile, as shown in Figure 2. This variable is measured in millimeters (mm) and is equivalent to the liters of water that have fallen per square meter. The dataset was divided into two parts: the first part was used for model estimation and covers 1 January 2011 to 31 December 2020, with a total of 3653 observations, of which 1648 are zeros. The second part consists of the following 244 observations, of which 127 are zeros. The data were obtained from the website of the Dirección Meteorológica de Chile, http://www.meteochile.gob.cl/ (accessed on 22 November 2022), and the records belong to the El Tepual Puerto Montt Ap Station (code 410005).

Figure 2. Precipitation in Puerto Montt, Chile.

As explanatory variables, the following were used: $w_{1} :=$ relative humidity, measured in percentage (%); $w_{2} :=$ atmospheric pressure, measured in hectopascals (hPa); and $w_{3} :=$ temperature, measured in degrees Celsius (°C). The daily maximum values reached by these variables were used.

Figure 3, Figure 4 and Figure 5 present the graphs of the explanatory variables, while Table 1 shows the descriptive statistics of these explanatory variables.

Figure 3. Humidity in Puerto Montt, Chile.

Figure 4. Pressure in Puerto Montt, Chile.

Figure 5. Temperature in Puerto Montt, Chile.

Table 1. Descriptive statistics.

2.13. Parameter Estimation

The estimation of the vector $θ = (ω, ϕ, κ, δ_{0}, δ_{1}, a, p, \bar{η}, β_{1}, β_{2}, β_{3})$ was performed using the method of maximum likelihood, formulating the maximization problem as:

$$\hat{θ} = \arg \max_{θ} \sum_{t = 1}^{T} \log p (y_{t} ∣ y_{1}, \dots, y_{t - 1}, λ_{t ∣ t - 1}; θ) .$$

The calculations were performed in the R programming language using the GB2 package (Graf et al. [14]), maxLik package (Henningsen and Toomet [15]), pracma package (Borchers [16]), and DEoptim package (Mullen et al. [17]).

3. Results

In Table 2, the parameter estimates of the model are presented, along with their statistical significance and standard deviation in parentheses.

Table 2. Estimated parameters of the model.

Figure 6 presents a graph of the precipitation in Puerto Montt and the adjusted mean.

Figure 6. Fitted model for rainfall in Puerto Montt, Chile.

Figure 7 plots the empirical cumulative distribution function (ECDF) against the probability integral transforms for the positive observations, while Table 3 displays the result of the Kolmogorov–Smirnov test along with its p-value.

Figure 7. Probability integral transform (PIT) against the empirical cumulative distribution function (ECDF).

Table 3. Kolmogorov–Smirnov test results.

The graph for the predicted scale parameter is shown in Figure 8, and Figure 9 displays the graph for the prediction of the conditional mean $E (y_{T + ℓ} ∣ Y_{T})$, as given in (13).

Figure 8. Prediction of the scale parameter $\exp (λ_{T + ℓ ∣ T})$ with $ℓ \in \{1, 2, \dots, n\}$ ($T = 3653$, $n = 244$) and the confidence band for each time within the observation period.

Figure 9. Prediction of the conditional mean $E (y_{T + ℓ} ∣ Y_{T})$ with $ℓ \in \{1, 2, \dots, n\}$ ($T = 3653$, $n = 244$) and its corresponding confidence band for each time within the observation period.

Figure 10 illustrates the predictions for the probability of no rainfall. The shaded regions represent the 95% confidence bands. Notably, $BPS = 0.24$.

Figure 10. Prediction of $(1 - π_{T + ℓ ∣ T})$ with $ℓ \in \{1, 2, \dots, n\}$ ($T = 3653$, $n = 244$) and the confidence band for each time within the observation period.

4. Discussion

The goodness-of-fit of the model is evident from Figure 6 and is supported by the information in Table 2, where most of the estimated parameters are significant.

Additionally, the plot of PITs against the ECDF in Figure 7 suggests that the data follow the distribution estimated by the model. This alignment is further supported by the Kolmogorov–Smirnov test results presented in Table 3, which are consistent with the model's PITs following a uniform distribution $U (0, 1)$.

To assess whether the PITs follow a uniform distribution $U (0, 1)$, two methods were considered:

(a): The classic Kolmogorov–Smirnov test;
(b): A permutation and bootstrap approach. For this, the algorithm described in Præstgaard [18] was implemented, as suggested by one of the reviewers.

Figure 8 illustrates the predictions of the scale parameter of the model, which, combined with the estimated parameter vector $θ$, allows for density function forecasts at each prediction time point. Using these density functions (obtained from estimated parameters), conditional means are calculated and presented in Figure 9 as point estimates.

Figure 10 depicts the behavior of the parameter associated with the probability of $y_{t}$ taking a value of zero in the predictions. These values contribute to calculating the Brier probability score of the model, which has been calculated as $0.24$, indicating adequate model performance according to Salvador [13].

In Figure 8, Figure 9 and Figure 10, from January to March, it can be observed that the predicted values cluster around one end of the band. This behavior arises from the nature of the zero-augmented distribution model, as explained below:

Initially, in period $T + 1$, the values of the scale and the probability of rainfall need to be determined, yielding $b_{T + 1 ∣ T} = 4.4489$ and $π_{T + 1 ∣ T} = 0.2416$, respectively. As the value of $π_{T + 1 ∣ T}$ is low, the simulations initially produce many more zeros than positive values. This directly explains the behavior observed at the lower end of Figure 8 and Figure 9 and at the upper end of Figure 10, as the latter corresponds to the predictions of $1 - π_{T + 1 ∣ T}$.

With the values from the preceding paragraph along with the estimation of $θ$, the density presented in Figure 11 is fully determined. Following the procedure outlined by Blasques et al. [5], using this density, values of $y$ are simulated. For this purpose, 1000 simulations were conducted, and the resulting histogram is displayed in Figure 12.

Figure 11. Probability density function $p (y_{T + 1} ∣ y_{1}, \dots, y_{T}, λ_{T + 1 ∣ T}; θ)$ with $T = 3653$.

Figure 12. Simulations of $y_{T + 1} \sim p (y_{T + 1} ∣ y_{1}, \dots, y_{T}, λ_{T + 1 ∣ T}; θ)$ with $T = 3653$.

As is characteristic of a zero-augmented distribution density, Figure 12 exhibits a high frequency of zeros since the probability of no precipitation is $1 - 0.2416 = 0.7584$. Therefore, such a proportion of zeros was expected.
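The expected share of zeros can be reproduced with a small simulation. In the sketch below, the probability of rain and the scale are the values reported above for period $T + 1$, while the shape parameters $a$, $p$ and $q$ are hypothetical, since the estimates in Table 2 are not reproduced in the text; the function name is likewise an assumption.

```python
import random

def draw_zero_augmented_gb2(rng, pi, b, a, p, q):
    """Zero with probability 1 - pi, otherwise a GB2(a, b, p, q) draw."""
    if rng.random() >= pi:
        return 0.0
    z = min(rng.betavariate(p, q), 1.0 - 1e-12)   # Z ~ Beta(p, q)
    return b * (z / (1.0 - z)) ** (1.0 / a)

rng = random.Random(2024)
pi, b = 0.2416, 4.4489           # values reported in the text for period T + 1
a, p, q = 1.5, 0.5, 2.0          # hypothetical shape parameters
draws = [draw_zero_augmented_gb2(rng, pi, b, a, p, q) for _ in range(1000)]
share_zero = sum(d == 0.0 for d in draws) / len(draws)
print(share_zero)
```

With 1000 draws, the simulated share of zeros should fall close to 0.7584, within ordinary sampling variation.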

With the aforementioned results, the conditional score, $u_{t}$, is obtained, as shown in Figure 13. It is noticeable that it inherits the shape of the graph in Figure 11.

Figure 13. Simulations of the score $u_{T + 1}$ with $T = 3653$.

Now, it is possible to calculate the scale parameter for time $T + 2$, as depicted in Figure 14, which exhibits a similar pattern to that of Figure 11.

Figure 14. Scale simulations $\exp (λ_{T + 2 ∣ T})$ with $T = 3653$.

The same applies to the parameter for the probability of rain for time $T + 2$, which is presented in Figure 15.

Figure 15. Simulations of $π_{T + 2 ∣ T}$ with $T = 3653$.

From the histograms of Figure 12, Figure 13, Figure 14, Figure 15, Figure 16 and Figure 17, the high frequency of zeros in the simulated observations causes the calculated values to inherit the same pattern. Therefore, if a specific point prediction is desired, such as the median, for instance, it should be approximated towards the side where the highest frequency lies. As mentioned before, this situation occurs during the period from January to March, which is natural due to it being the summer season in Chile. In other words, the probability of no rainfall is significantly higher compared to the other months. This pattern changes in the following months as the probabilities of no rainfall decrease (see Figure 10), and this is reflected in the corresponding predictions.

Figure 16. Simulations of $1 - π_{T + 2 ∣ T}$ with $T = 3653$.

Figure 17. Simulations of $E (y_{T + 2} ∣ Y_{T})$ with $T = 3653$.

Next, the estimated coefficients of the explanatory variables $β_{1}$, $β_{2}$, and $β_{3}$ are interpreted. From (12), it follows that:

$$λ_{t ∣ t - 1} = ω + w_{1} β_{1} + w_{2} β_{2} + w_{3} β_{3} + λ_{t ∣ t - 1}^{†} .$$

Since the scale parameter of the model is

$$b_{t ∣ t - 1} = \exp (λ_{t ∣ t - 1}),$$

differentiating it with respect to any of the explanatory variables, $w_{i}$ with $i \in \{1, 2, 3\}$, gives:

$$\frac{\partial b_{t ∣ t - 1}}{\partial w_{i}} = \frac{\partial b_{t ∣ t - 1}}{\partial λ_{t ∣ t - 1}} \frac{\partial λ_{t ∣ t - 1}}{\partial w_{i}} = \exp (λ_{t ∣ t - 1}) β_{i};$$

hence, the sign of $β_{i}$ determines whether $b_{t ∣ t - 1}$ increases or decreases. If $β_{i} > 0$, then $b_{t ∣ t - 1}$ grows, and if $β_{i} < 0$, then $b_{t ∣ t - 1}$ decreases. Additionally, higher values of the scale parameter result in greater dispersion of the density, while lower values of the scale parameter lead the density to concentrate more around zero. This concentration causes a decrease in the probabilities of high values of the variable, in contrast to when the density becomes more spread out.

Regarding the probability of rain, $π_{t ∣ t - 1}$ given in (1), differentiating it with respect to any of the explanatory variables, $w_{i}$ with $i \in \{1, 2, 3\}$, gives:

$$\frac{\partial π_{t ∣ t - 1}}{\partial w_{i}} = \frac{\partial π_{t ∣ t - 1}}{\partial λ_{t ∣ t - 1}} \frac{\partial λ_{t ∣ t - 1}}{\partial w_{i}} = \frac{δ_{1} \exp (δ_{0} + δ_{1} λ_{t ∣ t - 1})}{[1 + \exp (δ_{0} + δ_{1} λ_{t ∣ t - 1})]^{2}} β_{i} .$$

Since $δ_{1} > 0$ (see Table 2), the sign of $β_{i}$ determines whether $π_{t ∣ t - 1}$ increases or decreases.

Finally, by differentiating the conditional mean, $E (y_{t} ∣ Y_{t - 1})$ given in (13), with respect to any of the explanatory variables, $w_{i}$ with $i \in \{1, 2, 3\}$, we have:

$$\begin{aligned} \frac{\partial E (y_{t} ∣ Y_{t - 1})}{\partial w_{i}} &= \frac{B (p + 1 / a, 1 / (a \bar{η}) - 1 / a)}{B (p, \frac{1}{a \bar{η}})} \frac{\partial}{\partial w_{i}} (π_{t ∣ t - 1} b_{t ∣ t - 1}) \\ &= \frac{B (p + 1 / a, 1 / (a \bar{η}) - 1 / a)}{B (p, \frac{1}{a \bar{η}})} \left(\frac{\partial π_{t ∣ t - 1}}{\partial w_{i}} b_{t ∣ t - 1} + π_{t ∣ t - 1} \frac{\partial b_{t ∣ t - 1}}{\partial w_{i}}\right) . \end{aligned}$$

From this, the sign of $β_{i}$ determines whether the conditional mean increases or decreases. If $β_{i} > 0$, then $b_{t ∣ t - 1}$, $π_{t ∣ t - 1}$, and the derivatives within the last parentheses are positive, and when $β_{i} < 0$, the derivatives are negative, so the conditional mean decreases.

In summary, the following cases can be observed:

(I): If $β_{i} > 0$ , then the scale, $b_{t ∣ t - 1}$ , increases, increasing the dispersion for $y_{t} > 0$ , making higher values more likely. Additionally, the probability of rain, $π_{t ∣ t - 1}$ , increases, and the conditional mean, $E (y_{t} ∣ Y_{t - 1})$ , also increases.
(II): If $β_{i} < 0$ , then the scale, $b_{t ∣ t - 1}$ , decreases, concentrating the density of the distribution around zero for $y_{t} > 0$ , making higher values less likely. Additionally, the probability of rain, $π_{t ∣ t - 1}$ , decreases, and the conditional mean, $E (y_{t} ∣ Y_{t - 1})$ , also decreases.
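The sign pattern in cases (I) and (II) can be illustrated numerically. In the sketch below, the coefficients $β_{i}$ are those quoted from Table 2 in the following paragraphs; the covariate values and the remaining parameters ($ω$, $λ^{†}$, $δ_{0}$, $δ_{1}$) are hypothetical, with $δ_{1} > 0$ as stated above, and the function names are assumptions.

```python
import math

def scale_and_prob(w, beta, omega, dagger, d0, d1):
    """Scale b = exp(lambda) and probability of rain pi from (12) and (1)."""
    lam = omega + sum(wi * bi for wi, bi in zip(w, beta)) + dagger
    b = math.exp(lam)
    pi = 1.0 / (1.0 + math.exp(-(d0 + d1 * lam)))
    return b, pi

def marginal_effects(w, beta, omega, dagger, d0, d1):
    """Analytic derivatives of b and pi with respect to each w_i."""
    b, pi = scale_and_prob(w, beta, omega, dagger, d0, d1)
    db = [b * bi for bi in beta]
    dpi = [d1 * pi * (1.0 - pi) * bi for bi in beta]   # logistic derivative
    return db, dpi

beta = (0.0457, -0.0010, -0.0884)    # humidity, pressure, temperature (Table 2)
w = (85.0, 1012.0, 14.0)             # hypothetical covariate values
omega, dagger, d0, d1 = -1.0, 0.0, -0.5, 0.8   # hypothetical; d1 > 0

db, dpi = marginal_effects(w, beta, omega, dagger, d0, d1)
signs = [(x > 0, y > 0) for x, y in zip(db, dpi)]
print(signs)

# Finite-difference check of the first derivative (humidity).
h = 1e-5
b_hi, pi_hi = scale_and_prob((w[0] + h, w[1], w[2]), beta, omega, dagger, d0, d1)
b_lo, pi_lo = scale_and_prob((w[0] - h, w[1], w[2]), beta, omega, dagger, d0, d1)
fd_b = (b_hi - b_lo) / (2.0 * h)
fd_pi = (pi_hi - pi_lo) / (2.0 * h)
```

Because $b_{t ∣ t - 1}$, $π_{t ∣ t - 1}$ and $δ_{1}$ are positive, the signs depend only on $β_{i}$, whatever values are assumed for the other quantities.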

The coefficient associated with humidity is $β_{1} = 0.0457 > 0$ (see Table 2), which corresponds to case (I). This is consistent with Llasat Botija et al. [19], according to whom humidity promotes the formation of clouds that lead to rainfall.

On the other hand, $β_{2} = - 0.0010 < 0$ (see Table 2), which is the coefficient associated with pressure and corresponds to case (II). According to García de Pedraza [20], when pressure increases, the skies are clearer, a condition that does not favor rainfall. Conversely, if the pressure decreases, it is a condition that favors cloud formation and rain. Therefore, the results align with meteorological science.

Meanwhile, $β_{3} = - 0.0884 < 0$ (see Table 2), which is the coefficient associated with temperature and also corresponds to case (II). Regarding this, Trenberth et al. [21] mention that during the warm season over continents, higher temperatures are associated with lower precipitation amounts, while in colder seasons, lower temperatures indicate higher precipitation. Thus, an inverse relationship between temperature and rainfall would exist, but it is more related to the time of year. It is worth noting that this relationship is complex, and exceptions can occur. For example, higher temperatures could also promote cloud formation through water evaporation.

5. Conclusions

A model has been extended for data originating from a zero-augmented distribution; that is, it is intended for time series with a high proportion of zeros. The non-zero data are assumed to come from a continuous distribution with positive support, following the GAS framework of Harvey and Ito [3], since such data cannot be handled with classical models such as those of Box & Jenkins [22]. The model has been applied in meteorology to precipitation data from a city in Chile. It has been successfully fitted and responds well to diagnostic tests.

When evaluating the predictive capability of the proposed model, the Brier probability score yielded a value of $0.24$, categorizing the model as suitable, in contrast to the values presented by Harvey and Ito [3], which were around $0.72$ and $0.75$. The low value of the Brier probability score for the proposed model could signal that incorporating explanatory variables improves the fit of this type of model.

Regarding the explanatory variables, it was also very interesting to provide an interpretation of the estimated coefficients associated with each explanatory variable and to confirm that the results of the proposed model, regarding the relationship between precipitation and the explanatory variables humidity, pressure, and temperature, generally align with what is established in meteorology.

It is interesting to analyze how these models behave when the distribution associated with the non-zero part is not necessarily positive and/or continuous. For example, a discrete distribution could be used to analyze time series of the number of COVID-19 fatalities, where there is a high frequency of zeros. This could help determine whether the prediction quality remains consistent under such circumstances.

When it comes to applications in meteorology, it would be compelling to explore how to incorporate explanatory variables related to wind. Such variables are referred to in the literature as "circular data" and require a distinctive treatment. This aspect has been studied in works by Harvey et al. [23] and Fisher and Lee [24].

It could also be important to analyze the scenario where a specific distribution cannot be identified for the non-zero part. In this case, it could be relevant to explore how to incorporate a more advanced system into these models, such as kernel density estimations for time series. These have also been studied in works such as those by Harvey and Oryshchenko [25] and Harvey [4], where non-parametric statistical tools are used to create distribution-free time-series models.

Author Contributions

Conceptualization, S.C.-E. and P.V.-G.; methodology, F.N.-M.; software, P.V.-G.; validation, S.C.-E., F.N.-M. and P.V.-G.; formal analysis, F.N.-M.; investigation, S.C.-E. and P.V.-G.; resources, F.N.-M.; data curation, P.V.-G.; writing—original draft preparation, F.N.-M.; writing—review and editing, S.C.-E.; visualization, P.V.-G.; supervision, F.N.-M.; project administration, S.C.-E. All authors have read and agreed to the published version of the manuscript.

Funding

Novoa-Muñoz’s research was fully supported by project 2220529 IF/R and Fondo de Apoyo a la Participación a Eventos Internacionales (FAPEI) at Universidad del Bío-Bío, Chile. Contreras-Espinoza was supported by Fondo de Apoyo a la Participación a Eventos Internacionales at Universidad del Bío-Bío, Chile.

Data Availability Statement

The data were obtained from the Meteorological Directorate of Chile, “http://www.meteochile.gob.cl/” (accessed on 22 November 2022), specifically from “https://climatologia.meteochile.gob.cl/” (accessed on 22 November 2022). The records belong to the El Tepual Puerto Montt Ap Station (code 410005).

Acknowledgments

The authors would like to thank the anonymous reviewers and the editor of this journal for their valuable time and for their careful comments and suggestions, which have improved the quality of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

GAS	Generalized Autoregressive Score
DCS	Dynamic Conditional Score
BPS	Brier Probability Score
ECDF	Empirical Cumulative Distribution Function
PIT	Probability Integral Transform

References

1. Creal, D.; Koopman, S.J.; Lucas, A. Generalized autoregressive score models with applications. J. Appl. Econom. 2013, 28, 777–795.
2. Hautsch, N.; Malec, P.; Schienle, M. Capturing the zero: A new class of zero-augmented distributions and multiplicative error processes. J. Financ. Econom. 2014, 12, 89–121.
3. Harvey, A.; Ito, R. Modeling time series when some observations are zero. J. Econom. 2020, 214, 33–45.
4. Harvey, A.C. Dynamic Models for Volatility and Heavy Tails: With Applications to Financial and Economic Time Series; Cambridge University Press: New York, NY, USA, 2013; Volume 52.
5. Blasques, F.; Koopman, S.J.; Łasak, K.; Lucas, A. In-sample confidence bands and out-of-sample forecast bands for time-varying parameters in observation-driven models. Int. J. Forecast. 2016, 32, 875–887.
6. McDonald, J.B.; Xu, Y.J. A generalization of the beta distribution with applications. J. Econom. 1995, 66, 133–152.
7. Kleiber, C.; Kotz, S. Statistical Size Distributions in Economics and Actuarial Sciences; John Wiley & Sons: Hoboken, NJ, USA, 2003.
8. McDonald, J.B. Some generalized functions for the size distribution of income. In Modeling Income Distributions and Lorenz Curves; Springer: New York, NY, USA, 2008; pp. 37–55.
9. Harvey, A.; Luati, A. Filtering with heavy tails. J. Am. Stat. Assoc. 2014, 109, 1112–1122.
10. Diebold, F.X.; Gunther, T.A.; Tay, A.S. Evaluating Density Forecasts with Applications to Financial Risk Management. Int. Econ. Rev. 1998, 39, 863–883.
11. Wilks, D.S. Statistical Methods in the Atmospheric Sciences; Academic Press: Cambridge, MA, USA, 2011; Volume 100.
12. Brier, G.W. Verification of forecasts expressed in terms of probability. Mon. Weather Rev. 1950, 78, 1–3.
13. Salvador, J.A.F. Data Analysis Advances in Marine Science for Fisheries Management: Supervised Classification Applications. Ph.D. Thesis, Department of Computer Science and Artificial Intelligence of the University of the Basque Country, Leioa, Spain, 2011.
14. Graf, M.; Nedyalkova, D. GB2: Generalized Beta Distribution of the Second Kind: Properties, Likelihood, Estimation. R Package Version 2.1.1. 2022. Available online: https://CRAN.R-project.org/package=GB2 (accessed on 22 November 2022).
15. Henningsen, A.; Toomet, O. maxLik: A package for maximum likelihood estimation in R. Comput. Stat. 2011, 26, 443–458.
16. Borchers, H. pracma: Practical Numerical Math Functions. R Package Version 2.4.2. 2022. Available online: https://CRAN.R-project.org/package=pracma (accessed on 22 November 2022).
17. Mullen, K.; Ardia, D.; Gil, D.L.; Windover, D.; Cline, J. DEoptim: An R package for global optimization by differential evolution. J. Stat. Softw. 2011, 40, 1–26.
18. Præstgaard, J.T. Permutation and Bootstrap Kolmogorov-Smirnov Tests for the Equality of Two Distributions. Scand. J. Stat. 1995, 22, 305–322.
19. Llasat, B.M.D.C.; Llasat-Botija, M.; Ter, C.A. Con el agua al cuello. 2009. Available online: http://hdl.handle.net/2445/8727 (accessed on 1 November 2022).
20. García de Pedraza, L. Adecuado uso del barómetro. 2002. Available online: http://hdl.handle.net/20.500.11765/12031 (accessed on 1 November 2022).
21. Trenberth, K.E.; Jones, P.D.; Ambenje, P.; Bojariu, R.; Easterling, D.; Klein, T.A.; Parker, D.; Rahimzadeh, F.; Renwick, J.A.; Rusticucci, M.; et al. Observations. Surface and Atmospheric Climate Change; Cambridge University Press: Cambridge, UK, 2007; Chapter 3.
22. Box, G.E.; Jenkins, G.M.; Reinsel, G.C.; Ljung, G.M. Time Series Analysis: Forecasting and Control; John Wiley & Sons: Hoboken, NJ, USA, 2015.
23. Harvey, A.; Hurn, S.; Thiele, S. Modeling Directional (Circular) Time Series; Apollo, University of Cambridge Repository: Cambridge, UK, 2019.
24. Fisher, N.I.; Lee, A. Time series analysis of circular data. J. R. Stat. Soc. Ser. B (Methodol.) 1994, 56, 327–339.
25. Harvey, A.; Oryshchenko, V. Kernel density estimation for time series data. Int. J. Forecast. 2012, 28, 3–14.

Figure 1. Generalized beta distribution of the second kind density function for $b = 1$, $p = 0.5$, $q = 2$.

Figure 2. Precipitation in Puerto Montt, Chile.

Figure 3. Humidity in Puerto Montt, Chile.

Figure 4. Pressure in Puerto Montt, Chile.

Figure 5. Temperature in Puerto Montt, Chile.

Figure 6. Fitted model for rainfall in Puerto Montt, Chile.

Figure 7. Probability integral transform (PIT) against the empirical cumulative distribution function (ECDF).

Figure 8. Prediction of the scale parameter $\exp(λ_{T + ℓ \mid T})$ with $ℓ \in \{1, 2, \dots, n\}$ ($T = 3653$, $n = 244$) and the confidence band for each time within the observation period.

Figure 9. Prediction of the conditional mean $E(y_{T + ℓ} \mid Y_{T})$ with $ℓ \in \{1, 2, \dots, n\}$ ($T = 3653$, $n = 244$) and its corresponding confidence band for each time within the observation period.

Figure 10. Prediction of $(1 - π_{T + ℓ \mid T})$ with $ℓ \in \{1, 2, \dots, n\}$ ($T = 3653$, $n = 244$) and the confidence band for each time within the observation period.

Figure 11. Probability density function $p(y_{T + 1} \mid y_{1}, \dots, y_{T}, λ_{T + 1 \mid T}; θ)$ with $T = 3653$.

Figure 12. Simulations of $y_{T + 1} \sim p(y_{T + 1} \mid y_{1}, \dots, y_{T}, λ_{T + 1 \mid T}; θ)$ with $T = 3653$.

Figure 13. Simulations of the score $u_{T + 1}$ with $T = 3653$.

Figure 14. Scale simulations $\exp(λ_{T + 2 \mid T})$ with $T = 3653$.

Figure 15. Simulations of $π_{T + 2 \mid T}$ with $T = 3653$.

Figure 16. Simulations of $1 - π_{T + 2 \mid T}$ with $T = 3653$.

Figure 17. Simulations of $E(y_{T + 2} \mid Y_{T})$ with $T = 3653$.

Table 1. Descriptive statistics.

Statistic	Precipitation	Humidity	Pressure	Temperature
Mean	3.9453	96.4646	1009.912	14.7442
Standard Deviation	7.2298	2.8045	5.1589	4.2798
Minimum	0	71	987.3	4
Maximum	69	100	1028	34.1
Skewness	2.9967	−1.3453	−0.0555	0.4220
Kurtosis	15.6330	9.0157	3.6322	2.9062

Table 2. Estimated parameters of the model.

Parameter	Estimate
$ω$	0.0647 (1.0330)
$ϕ$	0.3356 *** (0.0603)
$κ$	0.2418 *** (0.0323)
$δ_{0}$	−4.3049 *** (1.0917)
$δ_{1}$	2.1178 *** (0.1693)
$a$	0.9785 *** (0.1001)
$p$	1.0032 *** (0.1384)
$\bar{η}$	0.4585 *** (0.1114)
$β_{1}$	0.0457 *** (0.0085)
$β_{2}$	−0.0010 *** (0.0003)
$β_{3}$	−0.0884 *** (0.0071)

*** $p < 0.01$.

Table 3. Kolmogorov–Smirnov test results.

Kolmogorov–Smirnov Test
p-value KS	0.0518
p-value bootstrap KS	0.0510

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
