Analyzing the Gaver–Lewis Pareto Process under an Extremal Perspective

Ferreira, Marta; Ferreira, Helena

doi:10.3390/risks5030033


by Marta Ferreira ^1,2,3,* and Helena Ferreira ^4

^1 Centro de Matemática da Universidade do Minho, Campus de Gualtar 4710-057 Braga, Portugal

^2 Centro de Matemática Computacional e Estocástica, Departamento de Matemática-Instituto Superior Técnico Av. Rovisco Pais 1, 1049-001 Lisboa, Portugal

^3 Centro de Estatística e Aplicações, Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisboa, Portugal

^4 Universidade da Beira Interior, Centro de Matemática e Aplicações (CMA-UBI), Avenida Marquês d’Avila e Bolama, Covilhã 6200-001, Portugal

^* Author to whom correspondence should be addressed.

Risks 2017, 5(3), 33; https://doi.org/10.3390/risks5030033

Submission received: 10 April 2017 / Revised: 14 June 2017 / Accepted: 16 June 2017 / Published: 27 June 2017


Abstract: Pareto processes are suitable for modeling stationary heavy-tailed data. Here, we consider the autoregressive Gaver–Lewis Pareto process and study its tail behavior, characterizing its local and long-range dependence. We show that consecutive observations are asymptotically tail independent, a feature that is often misevaluated by the most common extremal models and that is highly relevant to tail inference. The analysis also reveals clustering at “penultimate” levels. Linear correlation may not exist in a heavy-tailed context, so an alternative diagnostic tool is presented. The derived properties relate to the autoregressive parameter of the process and yield estimators for it. The proposals are compared through simulation, and an application to a real dataset illustrates the procedure.

Keywords:

extreme value theory; autoregressive processes; extremal index; asymptotic tail independence

MSC:

60G70; 60G10

1. Introduction

Increased exposure to catastrophic losses and the complexity of financial instruments require sophisticated risk assessment tools in areas such as (re)insurance, banking and finance, among others. Extreme value theory plays an important methodological role in risk management by providing appropriate instruments to deal with values as large as, or even larger than, those ever observed. These techniques include heavy-tailed models and measures to evaluate tail dependence, namely to infer to what extent the occurrence of a risk value in some variable influences an analogous occurrence in another variable.

Linear ARMA (autoregressive moving average) models with heavy-tailed noise may be suitable to model time series presenting peaks of observations. However, a maximum operator in place of a summation is more convenient for deriving extremal properties. Max-autoregressive and moving maximum models were developed in this spirit, such as MARMA (max-autoregressive moving average) processes, introduced in Davis and Resnick (1989), and M4 (multivariate maxima of moving maxima) processes, presented in Smith and Weissman (1996). The Pareto model, which is closed under geometric multiplication and minimization, also motivated the first-order Pareto processes presented in Arnold (2001). Further analysis may be found in Ferreira (2016) and references therein.

A random variable (rv) is modeled by Pareto($σ, α$) if it has distribution function (df)

$$F_{X}(x) = 1 - (x / σ)^{-α}, \quad x > σ, \; σ > 0, \; α > 0. \tag{1}$$

This model is a particular case of Pareto-type tail models, the class of regularly varying tail distributions, that is,

$$1 - F(x) := \bar{F}(x) = x^{-α} L(x), \tag{2}$$

where $L(x)$ is a slowly varying function (i.e., $L(x)$ is a real function of positive real values satisfying $L(tx) / L(x) \to 1$, as $x \to \infty$, $\forall t > 0$). Parameter $1 / α$ is usually denoted as the tail index of the Pareto rv.

Consider $\{X_{n}\}_{n \geq 1}$ a Gaver–Lewis Pareto (GLP) process, presented in Arnold (2001). More precisely, for each positive integer $n$, we have

$$X_{n} = σ^{p} X_{n-1}^{1-p} ϵ_{n}^{U_{n}}, \tag{3}$$

where $\{ϵ_{n}\}_{n \geq 1}$ is an independent and identically distributed (iid) sequence with common df Pareto($σ, α$) given in Equation (1) and independent of the Bernoulli($p$) iid sequence $\{U_{n}\}_{n \geq 1}$ with $0 < p < 1$, and $ϵ_{i}$ independent of $X_{j}$, $\forall i > j$. If $X_{0} \sim$ Pareto($σ, α$), then $\{X_{n}\}_{n \geq 1}$ is stationary also with marginal df Pareto($σ, α$). The GLP process corresponds to the exponentiated version of the Gaver–Lewis process introduced in Gaver and Lewis (1980), and hence its name. Simulated samples from the GLP process with marginals Pareto($1, 1$) and $p = 0.25, 0.5, 0.75$ are plotted in Figure 1.
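As a rough illustration of the recursion in Equation (3), the following Python sketch simulates a GLP sample path. It assumes $σ = 1$ (the normalization the paper adopts in Section 3) and draws the Pareto innovations by inverse-transform sampling. The function name `simulate_glp` and its arguments are hypothetical choices made for this example; they do not come from the paper.

```python
import random


def simulate_glp(n, p, alpha=1.0, seed=None):
    """Simulate X_1..X_n from X_n = X_{n-1}^(1-p) * eps_n^(U_n), assuming sigma = 1."""
    rng = random.Random(seed)

    def pareto():
        # Inverse transform for F(x) = 1 - x^(-alpha), x > 1.
        # 1 - rng.random() lies in (0, 1], so the power is always finite.
        return (1.0 - rng.random()) ** (-1.0 / alpha)

    x = pareto()  # X_0 ~ Pareto(1, alpha): start from the stationary law
    path = []
    for _ in range(n):
        u = rng.random() < p           # U_n ~ Bernoulli(p)
        eps = pareto() if u else 1.0   # eps_n^(U_n): innovation only when U_n = 1
        x = x ** (1.0 - p) * eps
        path.append(x)
    return path


if __name__ == "__main__":
    for p in (0.25, 0.5, 0.75):
        sample = simulate_glp(1000, p, alpha=1.0, seed=1)
        print(p, max(sample))
```

Fixing `seed` makes a path reproducible; the three values of `p` mirror those used for Figure 1, although the paths themselves will differ from the published ones.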

This is a model within the heavy-tailed class, where moments of some orders may not exist. For instance, in this case, the mean exists only if $α > 1$ and the variance/covariance exists whenever $α > 2$. In Arnold (2001), the autocorrelation was derived as

$$ρ(X_{n-1}, X_{n}) = \frac{(1-p)(α-2)}{α+p-2}, \tag{4}$$

for $α > 2$. Moreover, in heavy-tailed models, the extremal observations are important, and a dependence analysis based on central measures such as the usual autocorrelation may be misleading if the dependence in the tails has a different structure from that in the rest of the distribution.

Here, we focus on the extremal behavior of the GLP process, namely the tail dependence structure. Despite being a heavy-tailed model, it is practically unknown in modeling extreme values. We shall see that it has interesting properties, such as asymptotic tail independence, i.e., the probability that one observation exceeds an increasingly large value, given that the previous one has already exceeded it, approaches zero. The rate of this convergence, usually denoted the coefficient of asymptotic tail independence $η$ (Ledford and Tawn (1996); Wadsworth and Tawn (2012)), captures a residual tail dependence, revealing a kind of “penultimate” clustering, i.e., an aggregation of not so high values. This behavior is not unusual in real applications and can be observed in the well-known Gaussian processes (see Bortot and Tawn (1998), Ramos and Ledford (2013), and references therein). In practice, ignoring this phenomenon will result in misleading inference (see, e.g., Poon et al. (2003)). However, not all series associated with environmental, social or economic phenomena are susceptible to Gaussian modeling, especially when they have heavy tails. The most common extremal models, MARMA and M4, as well as the Yeh–Arnold–Robertson Pareto (III) and the Lawrence–Lewis Pareto processes (see, respectively, Ferreira (2012) and Ferreira (2016) and references therein), are not tail independent, and new processes have been appearing (Heffernan et al. (2007); Ferreira and Canto e Castro (2010); Ferreira and Ferreira (2014) and Ferreira and Ferreira (2015)). The GLP is an additional contribution within this class. Coefficient $η$ will also be extended to observations that are lag-$m$ apart, providing an alternative to the autocorrelation function (acf).

This paper is organized as follows. The tail dependence measures and conditions to be analyzed are detailed in Section 2 and applied to the GLP process in Section 3. The tail characterization provides us with methods to estimate the autoregressive parameter p, which will be compared through simulation in Section 4. An illustration with a real dataset is addressed in Section 5.

2. Tail Dependence

A stationary sequence $\{X_{n}\}_{n \geq 1}$ has extremal index $θ \in [0, 1]$ if, for each $τ > 0$, there is a sequence of normalized levels $\{u_{n} \equiv u_{n}^{(τ)}\}_{n \geq 1}$, i.e.,

$$n(1 - F(u_{n})) \to τ,$$

as $n \to \infty$, such that

$$P(\max(X_{1}, \dots, X_{n}) \leq u_{n}) \to e^{-θτ} \tag{5}$$

(Leadbetter et al. (1983)). The extremal index is a measure of the clustering propensity, being interpreted as the reciprocal of the mean number of exceedances of an increasing threshold per independent cluster. The null case, corresponding to a degenerate limiting distribution for the maximum, is often ignored. The value $θ = 1$ is associated with iid sequences, but not only these. Below, we shall see that it is a form of asymptotic independence of extremes.

Some dependence conditions allow us to derive the extremal index through the joint distribution of $s$ consecutive terms of $\{X_{n}\}_{n \geq 1}$.

The long-range condition D($u_{n}$) of Leadbetter (1974) states that $α_{n, l_{n}} \to 0$, as $n \to \infty$, for some sequence $l_{n} = o(n)$, where

$$\begin{aligned} α_{n, l} = \sup \{ & | P(M_{i_{1}, i_{1}+p} \leq u_{n}, M_{j_{1}, j_{1}+q} \leq u_{n}) - P(M_{i_{1}, i_{1}+p} \leq u_{n}) P(M_{j_{1}, j_{1}+q} \leq u_{n}) | : \\ & 1 \leq i_{1} < i_{1}+p+l \leq j_{1} < j_{1}+q \leq n \}, \end{aligned} \tag{6}$$

with $M_{i, j} = \max_{s = i+1}^{j} (X_{s})$, $M_{0, j} = M_{j}$ and $M_{i, j} = -\infty$ for $i \geq j$. Consider $\{k_{n}\}_{n \geq 1}$ such that

$$k_{n} \to \infty, \quad k_{n} α_{n, l_{n}} \to 0, \quad k_{n} l_{n} / n \to 0, \quad \text{as } n \to \infty. \tag{7}$$

Observe that D($u_{n}$) is a milder condition than the usual mixing conditions, such as strong mixing.

Under condition D($u_{n}$), we say that the local dependence condition D$^{(s)}$($u_{n}$) of Chernick et al. (1991) holds for $\{X_{n}\}_{n \geq 1}$ if, for some $\{k_{n}\}_{n \geq 1}$ satisfying Equation (7), we have

$$n P(X_{1} > u_{n}, M_{1, s} \leq u_{n} < M_{s, r_{n}}) \underset{n \to \infty}{\longrightarrow} 0,$$

with $\{r_{n} = [n / k_{n}]\}_{n \geq 1}$ ($[x]$ denoting the integer part of $x$). The validity of D$^{(s)}$($u_{n}$) implies that D$^{(s')}$($u_{n}$) holds for $s' > s$.

If D$^{(s)}$($u_{n}$) holds, the extremal index exists and can be computed through (Chernick et al. (1991))

$$θ = \lim_{n \to \infty} P(M_{1, s} \leq u_{n} | X_{1} > u_{n}). \tag{8}$$

Observe that, under D$^{(1)}$($u_{n}$), we have $θ = 1$. Condition D$^{(s)}$($u_{n}$) is also implied by

$$n \sum_{j = s+1}^{r_{n}} P(X_{1} > u_{n}, M_{1, s} \leq u_{n} < X_{j}) \underset{n \to \infty}{\longrightarrow} 0.$$

This corresponds to condition D$'$($u_{n}$) of Leadbetter et al. (1983) whenever $s = 1$, which locally restricts the occurrence of clusters of exceedances and thus leads to $θ = 1$. Condition D$''$($u_{n}$) of Leadbetter and Nandagopalan (1989) is obtained with $s = 2$ and locally restricts upcrossing clustering.

Observe that, under D$''$($u_{n}$), we can write

$$θ = \lim_{n \to \infty} P(X_{2} \leq u_{n} | X_{1} > u_{n})$$

and thus state

$$θ(u_{n}) \sim 1 - P(X_{2} > u_{n} | X_{1} > u_{n}), \quad n \to \infty,$$

where $a(x) \sim b(x)$ means $a(x) / b(x) \to c$, as $x \to \infty$, for some constant $c$. Observe that, if $θ < 1$, then $\lim_{n \to \infty} P(X_{2} > u_{n} | X_{1} > u_{n}) > 0$, meaning that consecutive observations are tail dependent. On the other hand, under a unit extremal index, we have $P(X_{2} > u_{n} | X_{1} > u_{n}) \to 0$, as $n \to \infty$, and thus consecutive observations are asymptotically tail independent. This feature has been observed in some real data and theoretical examples (Ledford and Tawn (1996); Bortot and Tawn (1998); Ramos and Ledford (2013)).

Ledford and Tawn (1996) introduced the asymptotic tail independence coefficient, $η$, in order to measure the rate of convergence of $P(X_{2} > F^{-1}(1-t) | X_{1} > F^{-1}(1-t))$ towards 0, where $F^{-1}$ is the quantile function, capturing a kind of pre-asymptotic dependence. More precisely, the asymptotic tail independence coefficient, $η \in (0, 1]$, exists whenever

$$P(X_{1} > F^{-1}(1-t), X_{2} > F^{-1}(1-t)) \sim t^{1/η} L(1/t), \quad t \downarrow 0. \tag{9}$$

Thus, under Equation (9), we can state

$$θ(u_{n}) \sim 1 - P(X_{2} > u_{n} | X_{1} > u_{n}) \sim 1 - u_{n}^{1 - 1/η} L(u_{n}), \quad n \to \infty.$$

Observe that, if $η = 1$ and $L(u_{n}) \not\to 0$, we have $θ < 1$ and thus an effect of clustering of high values. Under a unit extremal index, the coefficient $η < 1$ measures the rate of convergence of $θ(u_{n})$ towards 1, capturing a kind of pre-asymptotic clustering, even though the process resembles an iid sequence at increasingly high thresholds.

By analogy with the acf, we extend the coefficient $η$ to measure the tail dependence within random pairs that are lag-$m$ apart, $(X_{i}, X_{i+m})$, $i \geq 1$, through the coefficient $η_{m}$, i.e.,

$$P(X_{1} > F^{-1}(1-t), X_{1+m} > F^{-1}(1-t)) \sim t^{1/η_{m}} L(1/t), \quad t \downarrow 0,$$

where $η_{1} \equiv η$.

The tail dependent class has been greatly developed within extreme value methodology. However, this approach results in the overestimation of extremal dependence if the series is actually asymptotically independent. An illustration with financial data may be seen in Poon et al. (2003). The most recent literature has been addressing this issue, namely with the introduction of new models comprising asymptotic tail independence (Heffernan et al. (2007); Ferreira and Canto e Castro (2010); Ferreira and Ferreira (2014, 2015)). In the next section, we will show that the GLP belongs to this latter class of models.

3. The Tail Dependence of the Gaver–Lewis Process

In the following, and without loss of generality, we will take $σ = 1$.

“Mixing” conditions roughly state that two rvs become increasingly independent as they get further apart in time. One of its forms is the $β$-mixing condition, defined by

$$β(l) := \sup_{p \in \mathbb{N}} E\left( \sup_{B \in \mathcal{F}(X_{p+l+1}, \dots)} | P(B | \mathcal{F}(X_{1}, \dots, X_{p})) - P(B) | \right) \underset{l \to \infty}{\longrightarrow} 0,$$

with $\mathcal{F}(\cdot)$ denoting the $σ$-field generated by the indicated random variables (Bradley (2005)).

Proposition 1. The GLP process is $β$-mixing.

Proof. The $β$-mixing condition will be proved through the sufficient conditions of regeneration and aperiodicity (see, e.g., Bradley (2005); Corollary 3.6).

In the following, consider the notation $Q^{m}(x, ]1, y]) = P(X_{1+m} \leq y | X_{1} = x)$, with $Q(x, ]1, y]) \equiv Q^{1}(x, ]1, y])$.

Observe that

$$Q(x, ]1, y]) = P(X_{2} \leq y | X_{1} = x) = P\left(ϵ_{2}^{U_{2}} \leq \frac{y}{x^{1-p}}\right) = F_{ϵ}\left(\frac{y}{x^{1-p}}\right) p + 1 - p, \quad y \geq x^{1-p}.$$

First, we show that GLP is regenerative, that is, it has a regeneration set, i.e., a recurrent set $R$ such that, for some $m \in \mathbb{N}$, a distribution $φ$ and $κ \in (0, 1)$, we have

$$Q^{m}(x, B) \geq κ φ(B), \quad x \in R,$$

for all Borel sets $B$ over $\mathbb{R}$. If, for any regeneration set $R$ and any Borel set $B$ over $\mathbb{R}$, we have

$$Q^{m+1}(x, B) \geq κ_{1} φ(B) \quad \text{and} \quad Q^{m}(x, B) \geq κ_{2} φ(B), \quad \forall x \in R, \tag{10}$$

for some $m \in \mathbb{N}$ and $κ_{1}, κ_{2} \in (0, 1)$, then the process is said to be aperiodic (Asmussen (1987)).

Consider $R = ]1, r[$ (and thus recurrent since it is in the state space $]1, \infty[$ of the process) and $B$ a Borel set over $\mathbb{R}$. Let $x \in R$, $S = [r, r^{1/(1-p)}]$ and $V \sim$ Pareto($r^{1-p}, α$). For all $x \in R$, we have

$$Q(x, B) \geq \int_{B \cap S} dQ(x, z) \geq P(V \in B \cap S) p,$$

and thus regeneration holds by considering $m = 1$, $φ(B) = P(V \in B | V \in S)$ and $κ = P(V \in S) p$. Observe that $S$ is also regenerative since it is recurrent ($S \subset ]1, \infty[$) and, $\forall z \in S$, $z^{1-p} \leq r < y$, for any $y \in S$, and thus $Q(z, B) \geq κ φ(B)$, with $φ(B)$ and $κ$ as above. Now, we have

$$Q^{2}(x, B) \geq \int_{S} Q(z, B) dQ(x, z) \geq κ φ(B) Q(x, S) \geq κ φ(B) p P(V \in S).$$

Therefore, the aperiodicity condition in Equation (10) is satisfied if we take $κ_{1} = κ p P(V \in S)$ and $κ_{2} = κ$. ☐

Note that condition D($u_{n}$) given in Equation (6) is weaker than $β$-mixing and thus holds for GLP by the previous result.

Proposition 2. The GLP process satisfies condition D$'$($u_{n}$) for sequences $\{k_{n}\}_{n \geq 1}$ satisfying Equation (7) and such that $(2-p)^{n/k_{n}} / n^{p} \to 0$, as $n \to \infty$.

Proof. We have successively, for $r_{n} = [n / k_{n}]$, $n \geq 1$,

$$\begin{aligned}
& n \sum_{j=2}^{r_{n}} P(X_{1} > u_{n}, X_{j} > u_{n}) \\
& = n \sum_{j=2}^{r_{n}} P(X_{1} > u_{n}, X_{j} > u_{n}, U_{2} = \dots = U_{j} = 0) + n \sum_{j=2}^{r_{n}} \sum_{k=1}^{j-1} P\left(X_{1} > u_{n}, X_{j} > u_{n}, \sum_{i=2}^{j} \mathbb{1}_{\{U_{i} = 1\}} = k\right) \\
& \leq \sum_{j=2}^{r_{n}} \left( \frac{τ (1-p)^{j-1}}{(n/τ)^{1/(1-p)^{j-1} - 1}} + \sum_{k=1}^{j-1} \sum_{2 \leq s_{1} < \dots < s_{k} \leq j} \frac{τ}{(n/τ)^{1 - (1-p)^{j-1}}} \prod_{\substack{i = 0 \\ s_{0} = 1}}^{k-1} \frac{p (1-p)^{j-1-k}}{1 - (1-p)^{s_{k} - s_{i}}} \right) \\
& \leq \sum_{j=2}^{r_{n}} \left( \frac{τ (1-p)^{j-1}}{(n/τ)^{1/(1-p) - 1}} + \sum_{k=1}^{j-1} \sum_{2 \leq s_{1} < \dots < s_{k} \leq j} \frac{τ (1-p)^{j-1-k}}{(n/τ)^{p}} \right) \\
& \leq \frac{τ}{(n/τ)^{p}} \sum_{j=2}^{r_{n}} \sum_{k=0}^{j-1} \binom{j-1}{k} (1-p)^{j-1-k} \\
& \leq \frac{τ^{p+1}}{1-p} \frac{(2-p)^{r_{n}}}{n^{p}}.
\end{aligned} \tag{11}$$

Since $r_{n} \leq n / k_{n}$, the last bound tends to zero, as $n \to \infty$, by the assumption on $\{k_{n}\}_{n \geq 1}$. ☐

Corollary 1. The GLP process has $θ = 1$.

This result reveals that high observations of the GLP process behave similarly to an iid scenario. However, there is a weak dependence that may be evaluated through the Ledford and Tawn coefficient $η$ in Equation (9). Moreover, we will see that it is related to the parameter $p$ of the process.

Proposition 3. The GLP process has $η = 1 / (1 + p)$.

Proof. Consider $a_{t} = F_{X}^{-1}(1-t)$ and take $t \downarrow 0$. Observe that

$$\begin{aligned}
P(X_{1} > a_{t}, X_{2} > a_{t}) & = p P(X_{1} > a_{t}, X_{1}^{1-p} ϵ_{2} > a_{t}) + (1-p) P(X_{1} > a_{t}, X_{1}^{1-p} > a_{t}) \\
& = p \int_{a_{t}}^{\infty} \bar{F}_{ϵ_{2}}(a_{t} / x^{1-p}) dF_{X}(x) + (1-p) \bar{F}_{X}(a_{t}^{1/(1-p)}) \\
& = a_{t}^{-α(1+p)} + (1-p) a_{t}^{-α/(1-p)} \sim t^{p+1} + t^{1/(1-p)} (1-p), \quad \text{as } t \downarrow 0.
\end{aligned}$$

Since $1/(1-p) > 1 + p$ for $0 < p < 1$, the term $t^{1+p}$ dominates, and comparison with Equation (9) gives $1/η = 1 + p$. ☐
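The rate in Proposition 3 can be checked numerically. The sketch below is a Monte Carlo illustration under the assumptions $σ = 1$ and $α = 1$: it compares the empirical frequency of joint exceedances of consecutive observations with the leading term $t^{1+p}$. The helper names `simulate_glp` and `joint_exceedance`, the seed and the sample size are illustrative choices, not part of the paper.

```python
import random


def simulate_glp(n, p, alpha=1.0, seed=None):
    """GLP path with sigma = 1 (assumption): X_n = X_{n-1}^(1-p) * eps_n^(U_n)."""
    rng = random.Random(seed)

    def pareto():
        return (1.0 - rng.random()) ** (-1.0 / alpha)  # Pareto(1, alpha) draw

    x, path = pareto(), []
    for _ in range(n):
        eps = pareto() if rng.random() < p else 1.0
        x = x ** (1.0 - p) * eps
        path.append(x)
    return path


def joint_exceedance(path, t, alpha=1.0):
    """Empirical P(X_i > a_t, X_{i+1} > a_t), with a_t = F^{-1}(1 - t) = t^(-1/alpha)."""
    a_t = t ** (-1.0 / alpha)
    hits = sum(1 for x, y in zip(path, path[1:]) if x > a_t and y > a_t)
    return hits / (len(path) - 1)


p, t = 0.5, 0.1
path = simulate_glp(200_000, p, seed=2017)
empirical = joint_exceedance(path, t)
leading_term = t ** (1.0 + p)  # t^(1/eta) with eta = 1/(1+p)
print(empirical, leading_term)
```

A moderate `t` is used so that enough joint exceedances occur; for much smaller `t` the sample size would have to grow accordingly.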

The fluctuation probability in the GLP process, given by

$$f := P(X_{n-1} < X_{n}) = p P(X_{n-1}^{p} < ϵ_{n}) = p \int_{1}^{\infty} F_{X}(x^{1/p}) dF_{ϵ}(x) = p / (1 + p),$$

is a simple measure that will be useful in the following.

Corollary 2.

The GLP process verifies the following equalities:

(i) $p = 1 / (1 - f) - 1$ ;
(ii) $p = 1 / η - 1$ ;
(iii) $η = 1 - f$ .

This result states a characterizing feature that can be helpful in model specification. Moreover, in order to satisfy $0 < p < 1$, we must have $0 < f < 1/2$ and $1/2 < η < 1$.
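The relations in Corollary 2 amount to a change of parametrization, which the following sketch writes out. The function names are hypothetical, and the range checks simply encode the constraints $0 < f < 1/2$ and $1/2 < η < 1$ stated above.

```python
def f_from_p(p):
    """Fluctuation probability f = p / (1 + p)."""
    return p / (1.0 + p)


def eta_from_p(p):
    """Tail independence coefficient eta = 1 / (1 + p)."""
    return 1.0 / (1.0 + p)


def p_from_f(f):
    """Relation (i): p = 1 / (1 - f) - 1, valid for 0 < f < 1/2."""
    if not 0.0 < f < 0.5:
        raise ValueError("f must lie in (0, 1/2) for 0 < p < 1")
    return 1.0 / (1.0 - f) - 1.0


def p_from_eta(eta):
    """Relation (ii): p = 1 / eta - 1, valid for 1/2 < eta < 1."""
    if not 0.5 < eta < 1.0:
        raise ValueError("eta must lie in (1/2, 1) for 0 < p < 1")
    return 1.0 / eta - 1.0


p = 0.25
f, eta = f_from_p(p), eta_from_p(p)
print(f, eta, p_from_f(f), p_from_eta(eta))
```

For $p = 0.25$ this gives $f = 0.2$ and $η = 0.8$, in agreement with relation (iii), $η = 1 - f$.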

Another interesting property for model identification is based on the lag-$m$ coefficient $η_{m}$, analogous to the acf for linear models. The plots in Figure 2 exhibit a power decay, like the acf of AR(1) processes. Observe also that the smaller the value of $p$, the larger the lag $m$ must be chosen in order to have “almost” independent observations, i.e., $η_{m} \approx 1/2$.

Proposition 4. The GLP process has lag-$m$ coefficient $η_{m}$ given by

$$η_{m} = 1 / (2 - (1-p)^{m}). \tag{12}$$

Proof. Consider $a_{t} = F_{X}^{-1}(1-t)$. The product of powered Pareto rvs is still Pareto-type tail distributed (see, e.g., Arnold (2001)) and thus, applying Equation (2) and the dominated convergence theorem, we have successively, as $t \downarrow 0$,

$$\begin{aligned}
P(X_{1} > a_{t}, X_{1+m} > a_{t}) & = \int_{a_{t}}^{\infty} P\left( x^{(1-p)^{m}} \prod_{j=0}^{m-1} ϵ_{m+1-j}^{U_{m+1-j} (1-p)^{j}} > a_{t} \right) dF_{X}(x) \\
& \sim \int_{a_{t}}^{a_{t}^{(1-p)^{-m}}} (a_{t} x^{-(1-p)^{m}})^{-α} L(x/t) dF_{X}(x) + \int_{a_{t}^{(1-p)^{-m}}}^{\infty} dF_{X}(x) \\
& \sim a_{t}^{-α} L(1/t) \int_{a_{t}}^{a_{t}^{(1-p)^{-m}}} α x^{-α(1 - (1-p)^{m}) - 1} dx + a_{t}^{-α (1-p)^{-m}} \\
& = \frac{a_{t}^{-α} L(1/t)}{1 - (1-p)^{m}} \left( a_{t}^{-α(1 - (1-p)^{m})} - a_{t}^{-α((1-p)^{-m} - 1)} \right) + a_{t}^{-α (1-p)^{-m}} \\
& \sim L^{*}(1/t) \, t^{2 - (1-p)^{m}},
\end{aligned} \tag{13}$$

where $L(1/t)$ and $L^{*}(1/t)$ are slowly varying functions. Comparing with the definition of $η_{m}$ gives $1/η_{m} = 2 - (1-p)^{m}$. ☐
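Equation (12) is easy to tabulate. The short sketch below evaluates $η_{m}$ for lags $m = 1, \dots, 6$ and for the values of $p$ used in Figure 2; the function name `eta_lag` is a hypothetical label introduced for this example.

```python
def eta_lag(p, m):
    """Lag-m coefficient from Equation (12): eta_m = 1 / (2 - (1 - p)^m)."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie in (0, 1)")
    if m < 1:
        raise ValueError("m must be a positive integer")
    return 1.0 / (2.0 - (1.0 - p) ** m)


for p in (0.3, 0.5, 0.7):
    values = [round(eta_lag(p, m), 4) for m in range(1, 7)]
    print(p, values)
```

For $m = 1$ the function reduces to $1/(1+p)$, the value in Proposition 3, and as $m$ grows the values decrease towards $1/2$, the value under exact independence.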

4. Estimation

Relations (i) and (ii) stated in Corollary 2 provide us with estimators for the autoregressive parameter $p$. More precisely, from (i), we have

$$\hat{p}^{(F)} = \frac{1}{1 - \hat{f}} - 1, \tag{14}$$

with $\hat{f}$ corresponding to the empirical counterpart of $f$,

$$\hat{f} = \frac{1}{n-1} \sum_{j=2}^{n} \mathbb{1}_{\{X_{j-1} < X_{j}\}},$$

provided that $\hat{f} < 1/2$ (notation $\mathbb{1}_{\{\cdot\}}$ means the indicator function). From the iid property of the generating process $\{ϵ_{n}\}$ (and with $X_{0}$ as specified for stationarity), we have ergodicity and thus consistency of the proposed estimator. In addition, $\hat{f}$ corresponds to the mean of Bernoulli trials with Markov dependence. From Klotz (1973), we have that $\sqrt{n}(\hat{f} - f)$ converges in distribution to a centered Gaussian law and thus, by the delta method, so does $\sqrt{n}(\hat{p}^{(F)} - p)$, as $n \to \infty$. For more details, see Ferreira (2012).
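A minimal sketch of the estimator in Equation (14) follows, applied to a simulated GLP path. It assumes $σ = 1$ and $α = 1$. The function names are hypothetical, and returning `None` when $\hat{f} \geq 1/2$ is just one possible convention for the case in which the estimator is not defined.

```python
import random


def simulate_glp(n, p, alpha=1.0, seed=None):
    """GLP path with sigma = 1 (assumption): X_n = X_{n-1}^(1-p) * eps_n^(U_n)."""
    rng = random.Random(seed)

    def pareto():
        return (1.0 - rng.random()) ** (-1.0 / alpha)  # Pareto(1, alpha) draw

    x, path = pareto(), []
    for _ in range(n):
        eps = pareto() if rng.random() < p else 1.0
        x = x ** (1.0 - p) * eps
        path.append(x)
    return path


def fluctuation_frequency(path):
    """f_hat: proportion of upward moves X_{j-1} < X_j among the n - 1 consecutive pairs."""
    ups = sum(1 for prev, cur in zip(path, path[1:]) if prev < cur)
    return ups / (len(path) - 1)


def estimate_p_fluctuation(path):
    """p_hat^(F) = 1 / (1 - f_hat) - 1; None when f_hat >= 1/2 (estimator undefined)."""
    f_hat = fluctuation_frequency(path)
    if f_hat >= 0.5:
        return None
    return 1.0 / (1.0 - f_hat) - 1.0


path = simulate_glp(5000, p=0.25, seed=3)
print(fluctuation_frequency(path), estimate_p_fluctuation(path))
```

With $p = 0.25$ the theoretical fluctuation probability is $f = 0.2$, so the printed frequency should be close to that value for a long path.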

From (ii), the estimation of $p$ is based on the estimation of $η$ through

$$\hat{p}^{(H)} = \frac{1}{\hat{η}^{(H)}} - 1, \tag{15}$$

as long as $\hat{η}^{(H)} > 1/2$. Observe that $η$ corresponds to the tail index of $T = \min(1/(1 - F(X_{1})), 1/(1 - F(X_{2})))$. The most common method in the literature is the Hill estimator (Hill (1975)), hence the superscript “H”. More precisely, we have

$$\hat{η}^{(H)} = \frac{1}{k} \sum_{i=1}^{k} \log T_{n-i+1:n} - \log u,$$

where $T_{n-k+1:n}, \dots, T_{n:n}$ are the $k$ largest order statistics of $T$ that exceed $u$. It is usual to consider $u \in [T_{n-k:n}, T_{n-k+1:n}[$ and plot $\hat{η}^{(H)}$ as a function of $k$. In Figure 3, we can see the Hill trajectories of $\hat{η}$ for the respective GLP models considered in Figure 1. The paths are quite stable around the true value of $η$ for a large range of values of $k$. Indeed, variable $T$ corresponds to the minimum of unit Pareto rvs, where the Hill estimator behaves particularly well. Consistency and asymptotic normality of the Hill estimator $\hat{η}^{(H)}$ can be seen in Draisma et al. (2004).
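The Hill-based estimator can be sketched as follows. Since $F$ is unknown in practice, this example replaces it by the empirical df through ranks, $\hat{F}(X_{i}) = \text{rank}_{i} / (n + 1)$, which is an assumption of this sketch and not necessarily the authors' implementation. The threshold is taken as $u = T_{N-k:N}$, where $N$ is the number of pairs, and the `lag` argument is an illustrative extension to pairs $(X_{i}, X_{i+m})$. All function names are hypothetical.

```python
import math
import random


def simulate_glp(n, p, alpha=1.0, seed=None):
    """GLP path with sigma = 1 (assumption): X_n = X_{n-1}^(1-p) * eps_n^(U_n)."""
    rng = random.Random(seed)

    def pareto():
        return (1.0 - rng.random()) ** (-1.0 / alpha)  # Pareto(1, alpha) draw

    x, path = pareto(), []
    for _ in range(n):
        eps = pareto() if rng.random() < p else 1.0
        x = x ** (1.0 - p) * eps
        path.append(x)
    return path


def unit_pareto_scores(path):
    """Rank transform 1 / (1 - F_hat(x_i)), with F_hat(x_i) = rank_i / (n + 1)."""
    n = len(path)
    order = sorted(range(n), key=path.__getitem__)
    scores = [0.0] * n
    for rank, idx in enumerate(order, start=1):
        scores[idx] = (n + 1) / (n + 1 - rank)
    return scores


def hill_eta(path, k, lag=1):
    """Hill estimate of eta from T_i = min(score_i, score_{i+lag}), using the k largest T."""
    s = unit_pareto_scores(path)
    t = sorted(min(a, b) for a, b in zip(s, s[lag:]))
    if not 1 <= k < len(t):
        raise ValueError("k must satisfy 1 <= k < number of pairs")
    log_u = math.log(t[-k - 1])  # threshold u = T_{N-k:N}
    return sum(math.log(v) for v in t[-k:]) / k - log_u


def p_from_eta_hat(eta_hat):
    """Equation (15); None in the 'fail' case eta_hat <= 1/2."""
    return 1.0 / eta_hat - 1.0 if eta_hat > 0.5 else None


path = simulate_glp(20000, p=0.5, seed=7)
eta_hat = hill_eta(path, k=1000)
print(eta_hat, p_from_eta_hat(eta_hat))
```

Evaluating `hill_eta` over a grid of `k` values would give a trajectory of the kind shown in Figure 3; the single value of `k` used here is arbitrary.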

Expecting to observe time series data behaving exactly as the GLP functional Equation (3) is not realistic. At best, we might observe perturbed versions of the GLP process, for instance “noisy” processes of the form $X_{n}^{(δ)} = X_{n} + δ Z_{n}$, $n \geq 1$, where $\{Z_{n}\}_{n \geq 1}$ is an iid sequence of standard Gaussian rvs and $δ > 0$. Thus, the simulations cover the GLP and “noisy” GLP sample paths for $δ = 0, 0.1, 1$. We consider 1000 replicas of sizes $n = 100, 1000, 5000$ for $p = 0.25, 0.5, 0.75$, and marginals Pareto($1, 1$). The computed estimates of the root mean squared error (rmse) and absolute bias (abias) are reported in Table 1, where the estimator $\hat{p}^{(H)}$ is based on thresholds $u$ corresponding to the sample minimum ($q_{0}$), the median ($q_{50}$) and the 80th percentile ($q_{80}$). We also register the number of fails resulting, respectively, from $\hat{f} \geq 1/2$ and $\hat{η}^{(H)} \leq 1/2$. Not surprisingly, they are mostly associated with small sample sizes, where the case $p = 0.75$ seems particularly sensitive. Indeed, the results tend to be slightly worse under the large value $p = 0.75$, where the process is closer to independence. In practice, the difficulty in deciding between tail dependence ($η = 1$) and asymptotic independence ($η < 1$) is well known. For a survey on this topic, see, e.g., Poon et al. (2003) and Beirlant et al. (2004). The results get better as the sample size increases. We observe that estimator $\hat{p}^{(F)}$ is the best for the GLP process but not so robust for “noisy” GLP. As for estimator $\hat{p}^{(H)}$, it seems to present an overall better performance under $u = q_{0}$.
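To make the simulation design concrete, the sketch below runs a much smaller version of the study for the fluctuation-based estimator only. It assumes Pareto($1, 1$) marginals, uses 200 replicas instead of 1000 and a single sample size, and reports the rmse and the number of fails. All names, the seed and the replica count are illustrative, so the figures it prints are not expected to reproduce Table 1.

```python
import math
import random


def simulate_noisy_glp(n, p, delta, rng, alpha=1.0):
    """Noisy GLP path X_n + delta * Z_n, with sigma = 1 (assumption) and Z_n standard Gaussian."""
    def pareto():
        return (1.0 - rng.random()) ** (-1.0 / alpha)  # Pareto(1, alpha) draw

    x, path = pareto(), []
    for _ in range(n):
        eps = pareto() if rng.random() < p else 1.0
        x = x ** (1.0 - p) * eps
        path.append(x + delta * rng.gauss(0.0, 1.0))
    return path


def estimate_p_fluctuation(path):
    """p_hat^(F) from Equation (14); None when f_hat >= 1/2 (counted as a fail)."""
    ups = sum(1 for prev, cur in zip(path, path[1:]) if prev < cur)
    f_hat = ups / (len(path) - 1)
    return None if f_hat >= 0.5 else 1.0 / (1.0 - f_hat) - 1.0


def rmse_study(p, delta, n=100, replicas=200, seed=0):
    """Return (rmse, number of fails) of p_hat^(F) over independent replicas."""
    rng = random.Random(seed)
    errors, fails = [], 0
    for _ in range(replicas):
        estimate = estimate_p_fluctuation(simulate_noisy_glp(n, p, delta, rng))
        if estimate is None:
            fails += 1
        else:
            errors.append(estimate - p)
    if not errors:
        return float("nan"), fails
    return math.sqrt(sum(e * e for e in errors) / len(errors)), fails


for delta in (0.0, 0.1, 1.0):
    print(delta, rmse_study(p=0.25, delta=delta))
```

The rmse is computed over the replicas where the estimator is defined, with fails counted separately, which is one reasonable reading of how the number of fails is reported alongside the rmse.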

The estimation of the tail index parameter $α$ may be conducted through the Hill estimator $\hat{α}^{(H)}$ (Hill (1975)), which is consistent and asymptotically normal under strong mixing conditions (see Rootzén et al. (1990)).

5. Application

Insurance loss data are typically well modeled by heavy-tailed processes. We consider the daily closing values for the Danish fire losses registered from January 1980 to December 1990, plotted in Figure 4 (left). Observe the high values that appear suddenly, similar to the GLP simulated sample paths (see Figure 1), as well as the close linearity of the Pareto quantile-quantile plot (right panel of Figure 4). In Figure 5 (left), the almost flat region of Hill’s sample path led us to the estimate $\hat{α}^{(H)} \approx 1.4$. Thus, we cannot assume the existence of the acf and should avoid an analysis based on this tool. We estimate the GLP parameter $p$ through $\hat{p}^{(F)}$ and $\hat{p}^{(H)}$. More precisely, based on Equation (14), we obtain $\hat{p}^{(F)} = 0.9945$ and, using Equation (15), we derive $\hat{p}_{q_{0}}^{(H)} = 0.9610$, $\hat{p}_{q_{50}}^{(H)} = 0.9715$ and $\hat{p}_{q_{80}}^{(H)} = 0.9996$, where the quantiles $q_{0}$, $q_{50}$ and $q_{80}$ were considered according to the simulation study (see also the sample path estimates of Hill in the right panel of Figure 5). Formula (12) of the lag-$m$ coefficient $η_{m}$ of Proposition 4 plays a role similar to that of the acf in identifying linear models. Table 2 presents the estimates of $η_{m}$, for $m = 1, 2, 3$, obtained from estimator $\hat{η}_{m}^{(H)}$, which consists of the Hill estimator applied to the lag-$m$ apart random pairs $(X_{i}, X_{i+m})$, as well as estimates $\hat{η}_{m}(\hat{p}^{(F)})$ and $\hat{η}_{m}(\hat{p}^{(H)})$ derived from Equation (12) by replacing $p$, respectively, by $\hat{p}^{(F)}$ and $\hat{p}^{(H)}$. The closeness between the two types of estimates, $\hat{η}_{m}^{(H)}$ and $\hat{η}_{m}(\hat{p}^{(\cdot)})$, is further evidence in favor of the model.

The GLP process thus seems to have potential in the modeling of this type of data. More tools regarding goodness-of-fit analysis will be addressed in a future work.

Acknowledgments

The authors wish to thank the reviewers for their important comments that have improved this work. The first author was financed by Portuguese Funds through FCT (Fundação para a Ciência e a Tecnologia) within the Project UID/MAT/00013/2013 and by the research center CEMAT (Instituto Superior Técnico, Universidade de Lisboa) through the Project UID/Multi/04621/2013. The second author’s research was partially supported by the research unit UID/MAT/00212/2013.

Author Contributions

The authors contributed equally to the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Arnold, Barry C. 2001. Pareto Processes. In Handbook of Statistics. Edited by D. N. Shanbhag and C. R. Rao. Vol. 19, Amsterdam: Elsevier Science B.V.
Asmussen, Søren. 1987. Applied Probability and Queues. Hoboken: John Wiley & Sons.
Beirlant, Jan, Yuri Goegebeur, Johan Segers, and Jozef Teugels. 2004. Statistics of Extremes: Theory and Applications. Hoboken: John Wiley & Sons.
Bortot, Paola, and Jonathan A. Tawn. 1998. Models for the extremes of Markov chains. Biometrika 85: 851–67.
Bradley, Richard C. 2005. Basic Properties of Strong Mixing Conditions. A Survey and Some Open Questions. Probability Surveys 2: 107–44.
Chernick, Michael R., Tailen Hsing, and William P. McCormick. 1991. Calculating the extremal index for a class of stationary sequences. Advances in Applied Probability 23: 835–50.
Davis, Richard A., and Sidney I. Resnick. 1989. Basic properties and prediction of max-ARMA processes. Advances in Applied Probability 21: 781–803.
Draisma, Gerrit, Holger Drees, Ana Ferreira, and Laurens de Haan. 2004. Bivariate tail estimation: dependence in asymptotic independence. Bernoulli 10: 251–80.
Ferreira, Marta. 2016. The Lawrence-Lewis Pareto process: an extremal approach. Electronic Journal of Applied Statistical Analysis 9: 68–82.
Ferreira, Marta. 2012. On the extremal behavior of a Pareto process: An alternative for armax modeling. Kybernetika 48: 31–49.
Ferreira, Marta, and Luísa Canto e Castro. 2010. Modeling rare events through a pRARMAX process. Journal of Statistical Planning and Inference 140: 3552–66.
Ferreira, Helena, and Marta Ferreira. 2015. Extremes of scale mixtures of multivariate time series. Journal of Multivariate Analysis 137: 82–99.
Ferreira, Helena, and Marta Ferreira. 2014. Extremal behavior of pMAX processes. Statistics & Probability Letters 93: 46–57.
Gaver, Donald, and P. A. W. Lewis. 1980. First-Order Autoregressive Gamma Sequences and Point Processes. Advances in Applied Probability 12: 727–45.
Heffernan, Janet E., Jonathan A. Tawn, and Zhengjun Zhang. 2007. Asymptotically (in)dependent multivariate maxima of moving maxima processes. Extremes 10: 57–82.
Hill, Bruce M. 1975. A Simple General Approach to Inference About the Tail of a Distribution. The Annals of Statistics 3: 1163–74.
Klotz, Jerome. 1973. Statistical inference in Bernoulli trials with dependence. The Annals of Statistics 1: 373–79.
Leadbetter, M. Ross. 1974. On extreme values in stationary sequences. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete 28: 289–303.
Leadbetter, Malcolm R., Georg Lindgren, and Holger Rootzén. 1983. Extremes and Related Properties of Random Sequences and Processes. New York: Springer.
Leadbetter, M. Ross, and S. Nandagopalan. 1989. On exceedance point processes for stationary sequences under mild oscillation restrictions. Lecture Notes in Statistics 51: 69–80.
Ledford, Anthony W., and Jonathan A. Tawn. 1996. Statistics for near independence in multivariate extreme values. Biometrika 83: 169–87.
Poon, Ser-Huang, Michael Rockinger, and Jonathan Tawn. 2003. Modelling Extreme-Value Dependence in International Stock Markets. Statistica Sinica 13: 929–53.
Ramos, Alexandra, and Anthony Ledford. 2013. Estimation of the extremal index function in case of asymptotically independent Markov chains and its application to stock market indexes. In Studies in Theoretical and Applied Statistics: Subseries B: Recent Developments in Modeling and Applications in Statistics (SPE2010). Edited by P. Oliveira, M. G. Temido, C. Henriques and M. Vichi. New York: Springer, pp. 89–96.
Rootzén, Holger, Malcolm R. Leadbetter, and Laurens De Haan. 1990. Tail and Quantile Estimation for Strongly Mixing Stationary Sequences. Series: Report 9024/A; Rotterdam: Erasmus University.
Smith, Richard L., and Ishay Weissman. 1996. Characterization and Estimation of the Multivariate Extremal Index. Technical Report. Chapel Hill: University of North Carolina.
Wadsworth, Jennifer L., and Jonathan A. Tawn. 2012. Dependence modelling for spatial extremes. Biometrika 99: 253–72.

Figure 1. Simulated sample paths of the GLP process with Pareto($1, 1$) marginals for $p = 0.25$ (left), $p = 0.5$ (middle) and $p = 0.75$ (right).
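
Paths of the kind shown in Figure 1 can be generated with a few lines of code. The sketch below is only an illustration under an assumed construction, not the authors' code: it exponentiates the Gaver–Lewis exponential AR(1) recursion $E_t = p E_{t-1} + ε_t$, where $ε_t = 0$ with probability $p$ and is standard exponential otherwise, so that $X_t = σ \exp(E_t / α)$ has Pareto($σ, α$) marginals. The function name `simulate_glp` and its arguments are hypothetical, and the role given to $p$ here should be checked against the definition of the process in the main text.

```python
import math
import random


def simulate_glp(n, p, alpha=1.0, sigma=1.0, seed=None):
    """Simulate n values of a stationary path with Pareto(sigma, alpha) marginals.

    Assumed construction (an illustration, not taken from the article):
    E_t = p * E_{t-1} + eps_t, with eps_t = 0 with probability p and
    eps_t ~ Exp(1) with probability 1 - p, then X_t = sigma * exp(E_t / alpha).
    """
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    rng = random.Random(seed)
    e = rng.expovariate(1.0)  # start in the stationary law: E_0 ~ Exp(1)
    path = []
    for _ in range(n):
        # Innovation is exactly zero with probability p, exponential otherwise.
        innovation = 0.0 if rng.random() < p else rng.expovariate(1.0)
        e = p * e + innovation
        path.append(sigma * math.exp(e / alpha))
    return path


if __name__ == "__main__":
    # The three values of p used in the panels of Figure 1.
    for p in (0.25, 0.5, 0.75):
        path = simulate_glp(1000, p, seed=2017)
        print(p, min(path), max(path))
```

Under this construction a zero innovation gives $X_t = σ (X_{t-1}/σ)^{p}$ exactly, which produces the deterministic decay between jumps that is typical of Gaver–Lewis type paths.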

Figure 2. Plots of $η_{m}$ for the GLP process with $p = 0.3$ (left), $p = 0.5$ (middle) and $p = 0.7$ (right), for lags $m = 1, \dots, 6$.

Figure 3. Hill plots of ${\hat{η}}^{(H)}$ for the GLP process with Pareto($1, 1$) marginals and $p = 0.25$ (left), $p = 0.5$ (middle) and $p = 0.75$ (right).
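
A Hill plot such as those in Figure 3 displays the Hill (1975) estimate against the number $k$ of upper order statistics used. The sketch below is a minimal illustration and not the authors' implementation. It assumes, in the spirit of Ledford and Tawn (1996), that $η$ at lag $m$ is estimated as the tail index of the componentwise minima of lagged pairs whose margins are already unit Pareto. The names `hill_estimates`, `lagged_minima` and `unit_pareto_sample` are hypothetical.

```python
import math
import random


def hill_estimates(sample, k_max):
    """Return the Hill estimates H_k for k = 1, ..., k_max.

    H_k = (1/k) * sum_{i=1..k} log X_(i) - log X_(k+1),
    where X_(1) >= X_(2) >= ... are the descending order statistics.
    """
    logs = sorted((math.log(x) for x in sample), reverse=True)
    if not 1 <= k_max < len(logs):
        raise ValueError("k_max must satisfy 1 <= k_max < len(sample)")
    estimates = []
    running_sum = 0.0
    for k in range(1, k_max + 1):
        running_sum += logs[k - 1]          # sum of the k largest log-values
        estimates.append(running_sum / k - logs[k])
    return estimates


def lagged_minima(path, m):
    """Componentwise minima min(X_t, X_{t+m}); assumes unit Pareto margins."""
    return [min(a, b) for a, b in zip(path, path[m:])]


def unit_pareto_sample(n, seed=None):
    """Independent Pareto(1, 1) draws, used only to exercise the estimator."""
    rng = random.Random(seed)
    return [1.0 / (1.0 - rng.random()) for _ in range(n)]


if __name__ == "__main__":
    data = unit_pareto_sample(5000, seed=0)
    # For independent data the minima have tail index 1/2.
    print(hill_estimates(lagged_minima(data, 1), 500)[-1])
```

Plotting the returned list against $k$ gives the Hill plot. For margins other than unit Pareto, the data would first need a rank or probability-integral transform, which this sketch omits.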

Figure 4. Danish fire losses: daily closing values from January 1980 to December 1990 (left); Pareto quantile-quantile plot (right).

Figure 5. Danish fire losses: trajectories of the Hill estimates ${\hat{α}}^{(H)}$ (left) and ${\hat{η}}^{(H)}$ (right).

Table 1. Simulation results of the root mean squared error (RMSE) and of the absolute bias (abias). The last three columns correspond to the number of fails (nf) in each case.

		RMSE			abias
$n = 100$		$δ = 0$	$δ = 0.1$	$δ = 1$	$δ = 0$	$δ = 0.1$	$δ = 1$	$nf_{δ = 0}$	$nf_{δ = 0.1}$	$nf_{δ = 1}$
$p = 0.25$	${\hat{p}}^{(F)}$	0.0548	0.1844	0.5050	0.0005	0.1693	0.4940	0	0	24
	${\hat{p}}_{q_{0}}^{(H)}$	0.0775	0.0837	0.2302	0.0547	0.0070	0.2127	0	0	1
	${\hat{p}}_{q_{50}}^{(H)}$	0.1225	0.1140	0.1095	0.0801	0.0130	0.0202	0	0	1
	${\hat{p}}_{q_{80}}^{(H)}$	0.2280	0.2236	0.2098	0.1587	0.0500	0.0842	8	7	35
$p = 0.5$	${\hat{p}}^{(F)}$	0.0707	0.1225	0.3317	0.0013	0.0943	0.3200	0	0	68
	${\hat{p}}_{q_{0}}^{(H)}$	0.0894	0.0894	0.1789	0.0492	0.0557	0.1576	0	0	0
	${\hat{p}}_{q_{50}}^{(H)}$	0.1342	0.1414	0.1342	0.0586	0.0633	0.0197	2	2	1
	${\hat{p}}_{q_{80}}^{(H)}$	0.2191	0.2121	0.2049	0.0816	0.0783	0.0182	100	83	68
$p = 0.75$	${\hat{p}}^{(F)}$	0.0894	0.0548	0.1483	0.0001	0.0265	0.1287	10	0	222
	${\hat{p}}_{q_{0}}^{(H)}$	0.1000	0.0949	0.1265	0.0419	0.0422	0.0917	21	22	93
	${\hat{p}}_{q_{50}}^{(H)}$	0.1265	0.1225	0.1342	0.0116	0.0120	0.0028	143	123	121
	${\hat{p}}_{q_{80}}^{(H)}$	0.1975	0.1897	0.1975	0.0611	0.0526	0.0686	311	274	267
$n = 1000$		$δ = 0$	$δ = 0.1$	$δ = 1$	$δ = 0$	$δ = 0.1$	$δ = 1$	$nf_{δ = 0}$	$nf_{δ = 0.1}$	$nf_{δ = 1}$
$p = 0.25$	${\hat{p}}^{(F)}$	0.0000	0.1673	0.5010	0.0003	0.1656	0.4998	0	0	0
	${\hat{p}}_{q_{0}}^{(H)}$	0.0775	0.0000	0.1449	0.0547	0.0169	0.1423	0	0	0
	${\hat{p}}_{q_{50}}^{(H)}$	0.1225	0.0316	0.0775	0.0801	0.0080	0.0742	0	0	0
	${\hat{p}}_{q_{80}}^{(H)}$	0.2280	0.0548	0.0775	0.1587	0.0209	0.0494	8	0	0
$p = 0.5$	${\hat{p}}^{(F)}$	0.0000	0.0949	0.3302	0.0005	0.0943	0.3285	0	0	0
	${\hat{p}}_{q_{0}}^{(H)}$	0.0316	0.0316	0.1095	0.0058	0.0122	0.1068	0	0	0
	${\hat{p}}_{q_{50}}^{(H)}$	0.0447	0.0447	0.0632	0.0087	0.0072	0.0508	0	0	0
	${\hat{p}}_{q_{80}}^{(H)}$	0.0775	0.0775	0.0949	0.0158	0.0143	0.0459	0	0	0
$p = 0.75$	${\hat{p}}^{(F)}$	0.0316	0.0548	0.1732	0.0006	0.0459	0.1695	0	0	10
	${\hat{p}}_{q_{0}}^{(H)}$	0.0316	0.0316	0.0707	0.0053	0.0075	0.0636	0	0	0
	${\hat{p}}_{q_{50}}^{(H)}$	0.0548	0.0548	0.0548	0.0084	0.0062	0.0186	0	0	0
	${\hat{p}}_{q_{80}}^{(H)}$	0.1000	0.0949	0.1000	0.0120	0.0047	0.0287	16	16	7
$n = 5000$		$δ = 0$	$δ = 0.1$	$δ = 1$	$δ = 0$	$δ = 0.1$	$δ = 1$	$nf_{δ = 0}$	$nf_{δ = 0.1}$	$nf_{δ = 1}$
$p = 0.25$	${\hat{p}}^{(F)}$	0.0000	0.1643	0.5000	0.0002	0.1649	0.4994	0	0	0
	${\hat{p}}_{q_{0}}^{(H)}$	0.0000	0.0000	0.1378	0.0015	0.0107	0.1360	0	0	0
	${\hat{p}}_{q_{50}}^{(H)}$	0.0000	0.0000	0.0837	0.0027	0.0004	0.0836	0	0	0
	${\hat{p}}_{q_{80}}^{(H)}$	0.0316	0.0316	0.0707	0.0054	0.0044	0.0618	0	0	0
$p = 0.5$	${\hat{p}}^{(F)}$	0.0000	0.0949	0.3302	0.0002	0.0939	0.3302	0	0	0
	${\hat{p}}_{q_{0}}^{(H)}$	0.0000	0.0000	0.1049	0.0004	0.0061	0.1025	0	0	0
	${\hat{p}}_{q_{50}}^{(H)}$	0.0000	0.0000	0.0632	0.0007	0.0004	0.0567	0	0	0
	${\hat{p}}_{q_{80}}^{(H)}$	0.0316	0.0316	0.0632	0.0020	0.0021	0.0563	0	0	0
$p = 0.75$	${\hat{p}}^{(F)}$	0.0000	0.0447	0.1703	0.0002	0.0451	0.1685	0	0	0
	${\hat{p}}_{q_{0}}^{(H)}$	0.0000	0.0000	0.0632	0.0011	0.0034	0.0585	0	0	0
	${\hat{p}}_{q_{50}}^{(H)}$	0.0316	0.0316	0.0316	0.0013	0.0000	0.0257	0	0	0
	${\hat{p}}_{q_{80}}^{(H)}$	0.0447	0.0447	0.0548	0.0037	0.0026	0.0367	0	0	0
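
The summary measures in Table 1 can be computed from a vector of replicate estimates as sketched below. This is an illustration under stated assumptions, not the authors' code: the absolute bias is taken to be $|\bar{\hat{p}} - p|$, a failed replicate is represented by `None` and excluded from both measures, and the function name `summarize_estimates` is hypothetical.

```python
import math


def summarize_estimates(estimates, true_value):
    """Return (rmse, abias, nf) for a list of replicate estimates.

    Assumptions made for illustration: a failed replicate is stored as None,
    fails are counted in nf and left out of the RMSE and the absolute bias.
    """
    valid = [e for e in estimates if e is not None]
    nf = len(estimates) - len(valid)
    if not valid:
        return float("nan"), float("nan"), nf
    mse = sum((e - true_value) ** 2 for e in valid) / len(valid)
    mean_estimate = sum(valid) / len(valid)
    return math.sqrt(mse), abs(mean_estimate - true_value), nf


if __name__ == "__main__":
    # Toy input: three valid replicates and one fail, true value p = 0.25.
    print(summarize_estimates([0.2, 0.3, None, 0.25], 0.25))
```

Since the RMSE can never be smaller than the absolute bias computed on the same replicates, the entries of Table 1 that show an RMSE of 0.0000 next to a positive abias are presumably a rounding effect. Several tabulated RMSE values coincide with square roots of three-decimal numbers (for example $0.0548 \approx \sqrt{0.003}$ and $0.0316 \approx \sqrt{0.001}$), which suggests that the mean squared error was rounded before the root was taken. This is an inference from the numbers and is not stated in the caption.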

Table 2. Danish fire losses: estimates of the lag-$m$ coefficient $η_{m}$.

${\hat{η}}_{1} ({\hat{p}}^{(F)})$		0.5014
${\hat{η}}_{2} ({\hat{p}}^{(F)})$		0.5000
${\hat{η}}_{3} ({\hat{p}}^{(F)})$		0.5000
${\hat{η}}_{m}$	$q_{0}$	$q_{50}$	$q_{80}$
${\hat{η}}_{1}^{(H)} \equiv {\hat{η}}_{1} ({\hat{p}}^{(H)})$	0.5099	0.5072	0.5001
${\hat{η}}_{2}^{(H)}$	0.5094	0.4995	0.5209
${\hat{η}}_{2} ({\hat{p}}^{(H)})$	0.5004	0.5002	0.5000
${\hat{η}}_{3}^{(H)}$	0.5081	0.4985	0.4820
${\hat{η}}_{3} ({\hat{p}}^{(H)})$	0.5000	0.5000	0.5000

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
