A Dependent Lindeberg Central Limit Theorem for Cluster Functionals on Stationary Random Fields

José G. Gómez-García; Christophe Chesneau

doi:10.3390/math9030212

and

UNICAEN, CNRS, LMNO, Laboratory of Mathematics Nicolas Oresme, UFR des Sciences, Bld Maréchal Juin, Campus 2, Normandie Université, 14032 Caen, France

^*

Author to whom correspondence should be addressed.

^†

The first author was funded by the Normandy Region RIN program.

Mathematics2021, 9(3), 212;https://doi.org/10.3390/math9030212

This article belongs to the Special Issue Stochastic Models and Methods with Applications

Version Notes

Order Reprints

Abstract

In this paper, we provide a central limit theorem for the finite-dimensional marginal distributions of empirical processes

{(Z_{n} (f))}_{f \in F}

whose index set

F

is a family of cluster functionals valued on blocks of values of a stationary random field. The practicality and applicability of the result depend mainly on the usual Lindeberg condition and on a sequence

T_{n}

which summarizes the dependence between the blocks of the random field values. Finally, in application, we use the previous result in order to show the Gaussian asymptotic behavior of the proposed iso-extremogram estimator.

Keywords:

central limit theorem; cluster functional; weak dependence; Lindeberg method; extremogram

MSC:

60G60; 60F05; 60G70

1. Introduction

Recent developments in massive data processing lead us to think differently about certain problems in statistics. In particular, it is interesting to develop the construction of statistics as functions of data blocks and to study their inference. On the other hand, in some applications, only very little data are relevant to the estimates, not to mention that the estimates are also hidden among a large mass of “raw data”. We can refer the reader to Davis and Mikosch [1] for examples in extremes and to Long and De Sousa [2] for examples in astronomy. This leads us to think of clusters of data deemed “relevant” (or extremal type, within the framework of extreme value theory), where we say that two relevant values belong to two different clusters if they belong to two different blocks. Moreover, these relevant values are in the cores of blocks, where the core of a block B is defined as the smaller sub-block

C (B)

of B which contains all relevant values of B, if they exist.

In this context, we consider functionals that act on these clusters of relevant values and we develop useful lemmas in order to simplify the essential step to establish a Lindeberg central limit theorem (CLT) for these “cluster functionals” on stationary random fields, inspired by the definitions of Drees and Rootzén [3] and the approach of Bardet et al. [4] and Gómez-García [5].

The mathematical background is as follows. Let

d \in N : = {1, 2, \dots}

, and denote

n : = (n_{1}, \dots, n_{d})

,

1 : = (1, \dots, 1) \in N^{d}

and

[j] : = [1 : j]

, where

[i : j] : = {i, i + 1, \dots, j} \subset Z

. Let

X = \{X_{t} : t \in N^{d}\}

be a

R^{k} -

valued stationary random field and let

X = \{X_{n, t} : t \in

{[n_{1}] \times \dots \times [n_{d}]\}}_{n \in N^{d}}

be the corresponding normalized random observations from the random field X, defined by

X_{n, t} = L_{n} (X_{t}) I_{A} (L_{n} (X_{t}))

for some measurable functions

L_{n} : R^{k} ⟶ R^{k}

, such that

P (X_{n, 1} \in \cdot| X_{n, 1} \in A) \underset{n \to \infty}{⟶} G (\cdot),

(1)

where G is a non-degenerate distribution and

A \subseteq R^{k} \ {0}

is the so-called relevance set. Here,

I_{A} (\cdot)

denotes the usual indicator function of a subset A and the tendency

n \to \infty

means that

n_{i} \to \infty

for all

i \in [d]

. In particular, the convergence (1) is fulfilled if the random vector

X_{1}

is regularly varying. For more details on regularly varying vectors, one can refer to Resnick [6,7].

For each

i \in [d]

, let

r_{i} : = r_{n_{i}, i}

be an integer value such that

r_{i} = o (n_{i})

and

m_{i} : = ⌊ n_{i} / r_{i} ⌋ : = max \{k \in N : k \leq n_{i} / r_{i}\}

. We define the

d -

blocks (or simply blocks) of

X

by

Y_{n, j_{1} \dots j_{d}} : = {(X_{n, t})}_{t \in \prod_{i = 1}^{d} [(j_{i} - 1) r_{i} + 1 : j_{i} r_{i}]},

(2)

where

(j_{1}, \dots, j_{d}) \in D_{n, d} : = \prod_{i = 1}^{d} [m_{i}]

. Thus, we have

m_{1} m_{2} \dots m_{d}

complete blocks

Y_{n, j_{1} \dots j_{d}}

, and no more than

(m_{1} + 1) (m_{2} + 1) \dots (m_{d} + 1) - m_{1} m_{2} \dots m_{d}

incomplete ones which we ignore because we consider

m_{i}

large enough. Moreover, as usual,

\prod_{i = 1}^{d} A_{i}

denotes the Cartesian product

A_{1} \times \dots \times A_{d}

and, by stationarity, we denote

Y_{n} \overset{D}{=} Y_{n, 1}

as a generic block of

X

.

We are now going to formally define the core of a block, cluster functional and the empirical process of cluster functionals, which are generalizations of the definitions of Yun [8], Segers [9] and Drees and Rootzén [3] to

d -

blocks.

Let

y = {(x_{t})}_{t \in \prod_{i = 1}^{d} [r_{i}]}

be a

d -

block. The core of the block y with respect to the relevance set A is defined as

C (y) = \{\begin{matrix} {(x_{t})}_{t \in \prod_{i = 1}^{d} [r_{i, I} : r_{i, S}]}, & if x_{t} \in A for some t \in \prod_{i = 1}^{d} [r_{i}]; \\ 0, & otherwise, \end{matrix}

where, for each

i \in [d]

,

r_{i, I} : = min P_{i}

and

r_{i, S} : = max P_{i}

with

P_{i} = \{j_{i} \in [r_{i}] : x_{(j_{1}, \dots, j_{i}, \dots, j_{d})} \in A, for some (j_{1}, \dots, j_{i - 1}, j_{i + 1}, \dots, j_{d}) \in \prod_{k \in [d] \ {i}} [r_{k}]\} .

Let

(E, E)

be a measurable subspace of

(R^{k}, B (R^{k}))

for some

k \geq 1

such that

0 \in E

and let

B_{l_{1}, \dots, l_{d}} (E)

be the set of

E -

valued blocks (or arrays) of size

l_{1} \times l_{2} \times \dots \times l_{d}

, with

l_{1}, \dots, l_{d} \in N

. Consider now the set

E_{\cup} : = ⋃_{l_{1}, \dots, l_{d} = 1}^{\infty} B_{l_{1}, \dots, l_{d}} (E),

which is equipped with the

σ -

field

E_{\cup}

induced by the Borel

- σ -

fields on

B_{l_{1}, \dots, l_{d}} (E)

, for

l_{1}, \dots, l_{d} \in N

. A cluster functional is a measurable map

f : (E_{\cup}, E_{\cup}) ⟶ (R, B (R))

such that

f (y) = f (C (y)), for all y \in E_{\cup}, and f (0) = 0 .

(3)

Let

F

be a class of cluster functionals and let

\{Y_{n, j_{1} j_{2} \dots j_{d}} : (j_{1}, \dots, j_{d}) \in D_{n, d}\}

be the family of blocks of size

r_{1} \times r_{2} \times \dots \times r_{d}

defined in (2). The empirical process

Z_{n}

of cluster functionals in

F

, is the process

{(Z_{n} (f))}_{f \in F}

defined by

Z_{n} (f) : = \frac{1}{\sqrt{n_{n} v_{n}}} \sum_{(j_{1}, \dots, j_{d}) \in D_{n, d}} (f (Y_{n, j_{1} \dots j_{d}}) - E f (Y_{n, j_{1} \dots j_{d}})),

(4)

where

n_{n} = n_{1} \dots n_{d}

and

v_{n} : = P (X_{n, 1} \in A)

with

A \subseteq E \ {0}

denoting the relevance set.

Under the Lindeberg condition and the convergence to zero of a sequence

T_{n}

that summarizes the dependence between the blocks of values of the random field, we prove that the finite-dimensional marginal distributions (fidis) of the empirical process (4) converge to a Gaussian process. The proof basically consists of the “Lindeberg method” for a CLT of stationary time series as in Bardet et al. [4], but adapted here to stationary random fields.

Since Bardet et al. [4] gave a Lindeberg CLT for time series, Gómez-García [5] used this approach in order to obtain a Lindeberg CLT for cluster functionals on time series whose convergence depends mainly on the Lindeberg condition and the convergence to zero of

T_{n}

that summarizes the dependence. Moreover, Gómez-García [5] simplified

T_{n}

by using the coefficients of weak-dependence of Doukhan and Louhichi [10]. This allowed the attainment of partially more general results than Drees and Rootzén [3] which are established under mixing. Note that the family of weakly dependent processes of Doukhan and Louhichi [10] is more general that the family of mixing processes, see Andrews [11].

In the context of random fields, the approach is not very simple. In fact, we must first generalize the results of Bardet et al. [4] within the framework of random fields, then we could simplify the term of dependence by fixing short range dependence conditions on the random field X like convenient conditions for the decay rates of the weak-dependence coefficients of Doukhan and Louhichi [10]. In this work, we concentrate on the first part and we introduce a measure (and its estimator) which motivates the choice of this generalization: the iso-extremogram, which can be viewed as a correlogram for extreme values of space–time processes.

The rest of the paper consists of three complementary sections. In Section 2, we provide useful lemmas in order to establish the CLT for the fidis of the cluster functionals empirical process (4). Then, in Section 3, we introduce the iso-extremogram and we use the CLT of Section 2 in order to show that, under appropriate additional conditions, the iso-extremogram estimator has an asymptotically Gaussian behavior. Section 4 is dedicated to the conclusions and perspectives of this approach.

2. Results

In this section, we provide useful lemmas which notably simplify the essential step to establish a CLT for the fidis of the empirical process defined in (4). The proof consists of the same techniques as Bardet et al. [4] used in the demonstrations of their dependent and independent Lindeberg lemmas, but generalized here to random fields.

In order to establish the CLT, firstly, consider the following basic assumption:

(Bas): The vector $r = (r_{1}, \dots, r_{d}) \in N^{d}$ is such that $r_{i} ≪ n_{i}$ for each $i \in [d]$ .
In addition, denoting $r_{n} = r_{1} \dots r_{d}$ , we have $r_{n} v_{n} ⟶ τ < \infty$ and $n_{n} v_{n} ⟶ \infty$ , as $n \to \infty$ .

Secondly, consider the following essential convergence assumptions:

(Lin): ${(r_{n} v_{n})}^{- 1} E [{(f (Y_{n}) - E f (Y_{n}))}^{2} I_{\{| f (Y_{n}) - E f (Y_{n}) | > ϵ \sqrt{n_{n} v_{n}}\}}] = o (1)$ , $\forall ϵ > 0$ , $\forall f \in F$ ;
(Cov): ${(r_{n} v_{n})}^{- 1} Cov (f (Y_{n}), g (Y_{n})) ⟶ c (f, g)$ , $\forall f, g \in F$ .

Consider now the random blocks

Y_{n, j_{1} \dots j_{d}}

, with

(j_{1}, \dots, j_{d}) \in D_{n, d}

defined in (2). For each

k -

tuple of cluster functionals

f_{k} = (f_{1}, \dots, f_{k})

and each

(j_{1}, \dots, j_{d}) \in D_{n, d}

, we define the following random vector:

W_{n, j_{1} \dots j_{d}} : = \frac{1}{\sqrt{n_{n} v_{n}}} (f_{1} (Y_{n, j_{1} \dots j_{d}}) - E f_{1} (Y_{n, j_{1} \dots j_{d}}), \dots, f_{k} (Y_{n, j_{1} \dots j_{d}}) - E f_{k} (Y_{n, j_{1} \dots j_{d}})) .

(5)

Without loss of generality and in order to simplify the writing, we consider

d = 2

in the rest of this section.

Let

{(W_{n, i j}^{'})}_{(i, j) \in D_{n, 2}}

be a sequence of zero mean independent

R^{k}

-valued random variables, independent of the sequence

{(W_{n, i j})}_{(i, j) \in D_{n, 2}}

, such that

W_{n, i j}^{'} \sim N_{k} (0, Cov (W_{n, i j}))

, for all

(i, j) \in D_{n, 2}

. Denote by

C_{b}^{3}

the set of bounded functions

h : R^{k} ⟶ R

with bounded and continuous partial derivatives up to order 3 and, for

h \in C_{b}^{3}

and

n = (n_{1}, n_{2}) \in N^{2}

, define

Δ_{n} : = |E [h (\sum_{(i, j) \in D_{n, 2}} W_{n, i j}) - h (\sum_{(i, j) \in D_{n, 2}} W_{n, i j}^{'})]| .

(6)

The following assumption allows us to present, in a useful and simplified form, lemmas of Lindeberg under independence and dependence.

(Lin’): There exists $δ \in (0, 1]$ such that, for any $(i, j) \in D_{n, 2}$ , we have

$E {∥W_{n, i j}∥}^{2 + δ} < \infty$

for all $n \in N^{2}$ and all $k -$ tuple of cluster functionals $(f_{1}, \dots, f_{k}) \in F^{k}$ .

Moreover, denote

A_{n} : = \sum_{(i, j) \in D_{n, 2}} E {∥W_{n, i j}∥}^{2 + δ} .

Lemma 1

(Lindeberg under independence). Suppose that the blocks

{(Y_{n, i j})}_{(i, j) \in D_{n, 2}}

are independent and that the random variables

{(W_{n, i j})}_{(i, j) \in D_{n, 2}}

defined in (5) satisfy Assumption (Lin’). Then, for all

n \in N^{2}

, we have

Δ_{n} \leq 6 ∥ h^{(2)} ∥_{\infty}^{1 - δ} {∥ h^{(3)} ∥}_{\infty}^{δ} A_{n} .

Proof.

First, notice that

Δ_{n} \leq \sum_{(i, j) \in D_{n, 2}} Δ_{n, i j},

(7)

where

\begin{matrix} Δ_{n, i j} & : = |E [h_{i j} (V_{n, i j} + W_{n, i j}) - h_{i j} (V_{n, i j} + W_{n, i j}^{'})]|, \forall (i, j) \in D_{n, 2}; \\ V_{n, i j} & : = \sum_{(u, v) \in D_{n, 2} \ (⋃_{l = 0}^{i - 1} L_{l}^{m_{2}} \cup L_{i}^{j})} W_{n, u v}, \forall (i, j) \in D_{n, 2} \ {(m_{1}, m_{2})}, \\ V_{n, m_{1} m_{2}} & = 0; and \\ h_{i j} (x) & : = E [h (x + \sum_{u = 0}^{i - 1} \sum_{v = 1}^{m_{2}} W_{n, u v}^{'} + \sum_{v = 0}^{j - 1} W_{n, i v}^{'})] . \end{matrix}

Furthermore, we adopt the convention

W_{n, i j} = 0

, if either

i = 0

or

j = 0

.

Now, we use some lines of the proof of Lemma 1 in Bardet et al. [4].

Let

v, w \in R^{k}

. From Taylor’s formula, there exist vectors

v_{1, w}, v_{2, w} \in R^{k}

such that

\begin{matrix} h (v + w) & = h (v) + h^{(1)} (v) (w) + \frac{1}{2} h^{(2)} (v_{1, w}) (w, w) \\ = h (v) + h^{(1)} (v) (w) + \frac{1}{2} h^{(2)} (v) (w, w) + \frac{1}{6} h^{(3)} (v_{2, w}) (w, w, w), \end{matrix}

where, for

j = 1, 2, 3

,

h^{(j)} (v) (w_{1}, w_{2}, \dots, w_{j})

stands for the value of the symmetric

j -

linear form from

h^{(j)}

of

(w_{1}, \dots, w_{j})

at v. Moreover, denote

∥ h^{(j)} {(v) ∥}_{1} = sup_{∥ w_{1} ∥, \dots, ∥ w_{j} ∥ \leq 1} | h^{(j)} (v) (w_{1}, \dots, w_{j}) | and ∥ h^{(j)} ∥_{\infty} = sup_{v \in R^{k}} {∥ h^{(j)} (v) ∥}_{1} .

Thus, for

v, w, w^{'} \in R^{k}

, there exist some suitable vectors

v_{1, w}, v_{2, w}, v_{1, w^{'}}, v_{2, w^{'}} \in R^{k}

such that

\begin{matrix} h (v + w) - h (v + w^{'}) = h^{(1)} (v) (w - w^{'}) + \frac{1}{2} (h^{(2)} (v) (w, w) - h^{(2)} (v) (w^{'}, w^{'})) \\ + \frac{1}{2} ((h^{(2)} (v_{1, w}) - h^{(2)} (v)) (w, w) - (h^{(2)} (v_{1, w^{'}}) - h^{(2)} (v)) (w^{'}, w^{'})), \end{matrix}

by using the approximation of Taylor of order 2, and

\begin{matrix} h (v + w) - h (v + w^{'}) = h^{(1)} (v) (w - w^{'}) + \frac{1}{2} (h^{(2)} (v) (w, w) - h^{(2)} (v) (w^{'}, w^{'})) \\ + \frac{1}{6} (h^{(3)} (v_{2, w}) (w, w, w) - h^{(3)} (v_{2, w^{'}}) (w^{'}, w^{'}, w^{'})), \end{matrix}

by using the approximation of Taylor of order 3.

Thus,

γ = h (v + w) - h (v + w^{'}) - h^{(1)} (v) (w - w^{'}) - \frac{1}{2} (h^{(2)} (v) (w, w) - h^{(2)} (v) (w^{'}, w^{'}))

satisfies

\begin{matrix} | γ | \leq (({∥ w ∥}^{2} + {∥ w^{'} ∥}^{2}) {∥ h^{(2)} ∥}_{\infty}) \land (\frac{1}{6} ({∥ w ∥}^{3} + {∥ w^{'} ∥}^{3}) {∥ h^{(3)} ∥}_{\infty}) \\ \leq ({∥ w ∥}^{2} {∥ h^{(2)} ∥}_{\infty}) \land (\frac{1}{6} {∥ w ∥}^{3} {∥ h^{(3)} ∥}_{\infty}) + ({∥ w ∥}^{2} {∥ h^{(2)} ∥}_{\infty}) \land (\frac{1}{6} ∥ w^{'} ∥^{3} {∥ h^{(3)} ∥}_{\infty}) \\ + (∥ w^{'} ∥^{2} {∥ h^{(2)} ∥}_{\infty}) \land (\frac{1}{6} {∥ w ∥}^{3} {∥ h^{(3)} ∥}_{\infty}) + (∥ w^{'} ∥^{2} {∥ h^{(2)} ∥}_{\infty}) \land (\frac{1}{6} ∥ w^{'} ∥^{3} {∥ h^{(3)} ∥}_{\infty}) \\ \leq \frac{1}{6^{δ}} ∥ h^{(2)} ∥_{\infty}^{1 - δ} {∥ h^{(3)} ∥}_{\infty}^{δ} ({∥ w ∥}^{2 + δ} + {∥ w ∥}^{2 (1 - δ)} ∥ w^{'} ∥^{3 δ} + {∥ w ∥}^{3 δ} ∥ w^{'} ∥^{2 (1 - δ)} + {∥ w^{'} ∥}^{2 + δ}), \end{matrix}

(8)

where (8) is given by using the inequality

1 \land a \leq a^{δ}

, with

a \geq 0

and

δ \in [0, 1]

.

Substituting

h_{i j}, V_{n, i j}, W_{n, i j}

and

W_{n, i j}^{'}

for

h, v, w

and

w^{'}

in the preceding inequality (8) and taking expectations, we obtain a bound for

Δ_{n, i j}

. Indeed, we have

\begin{matrix} E [h_{i j} (V_{n, i j} + W_{n, i j}) - h_{i j} (V_{n, i j} + W_{n, i j}^{'})] \\ = E [h_{i j} (V_{n, i j} + W_{n, i j}) - h_{i j} (V_{n, i j} + W_{n, i j}^{'})] + 0 \\ = E [h_{i j} (V_{n, i j} + W_{n, i j}) - h_{i j} (V_{n, i j} + W_{n, i j}^{'})] - E [h_{i j}^{(1)} (V_{n, i j}) (W_{n, i j} - W_{n, i j}^{'})] \\ - \frac{1}{2} E [h_{i j}^{(2)} (V_{n, i j}) (W_{n, i j}, W_{n, i j}) - h_{i j}^{(2)} (V_{n, i j}) (W_{n, i j}^{'}, W_{n, i j}^{'})], \end{matrix}

because

V_{n, i j}

is independent of

W_{n, i j}

and

W_{n, i j}^{'}

, and because

E W_{n, i j} = E W_{n, i j}^{'} = 0

and

Cov (W_{n, i j}) = Cov (W_{n, i j}^{'})

for all

(i, j) \in D_{n, 2}

.

On the other hand, using Jensen’s inequality, we derive

E ∥ W_{n, i j}^{'} ∥^{2 + δ} \leq {(E ∥ W_{n, i j}^{'} ∥^{4})}^{\frac{1}{2} + \frac{δ}{4}}

, and

E ∥ W_{n, i j}^{'} ∥^{4} \leq 3 {(E ∥ W_{n, i j} ∥^{2})}^{2}

because

W_{n, i j}^{'}

is a Gaussian random variable with the same covariance as

W_{n, i j}

.

Therefore,

\begin{matrix} E ∥ W_{n, i j}^{'} ∥^{2 + δ} \leq {(3 {(E ∥ W_{n, i j} ∥^{2})}^{2})}^{\frac{1}{2} + \frac{δ}{4}} = 3^{\frac{1}{2} + \frac{δ}{4}} {(E ∥ W_{n, i j} ∥^{2})}^{1 + \frac{δ}{2}} \leq 3^{\frac{1}{2} + \frac{δ}{4}} E {∥ W_{n, i j} ∥}^{2 + δ} \end{matrix}

(9)

and

\begin{matrix} E ∥ W_{n, i j}^{'} ∥^{2 (1 - δ)} E {∥ W_{n, i j} ∥}^{3 δ} & \leq {(E ∥ W_{n, i j}^{'} ∥^{2})}^{1 - δ} E {∥ W_{n, i j} ∥}^{3 δ} \\ \leq {(E ∥ W_{n, i j} ∥^{2})}^{1 - δ} E ∥ W_{n, i j} ∥^{3 δ} \leq E {∥ W_{n, i j} ∥}^{2 + δ} . \end{matrix}

(10)

In addition, for

3 δ < 2

,

\begin{matrix} E ∥ W_{n, i j} ∥^{2 (1 - δ)} E ∥ W_{n, i j}^{'} ∥^{3 δ} \leq E ∥ W_{n, i j} ∥^{2 (1 - δ)} {(E ∥ W_{n, i j}^{'} ∥^{2})}^{\frac{3 δ}{2}} \leq E {∥ W_{n, i j} ∥}^{2 + δ}, \end{matrix}

(11)

else

\begin{matrix} E ∥ W_{n, i j} ∥^{2 (1 - δ)} E ∥ W_{n, i j}^{'} ∥^{3 δ} \leq E {∥ W_{n, i j} ∥}^{2 (1 - δ)} {(E ∥ W_{n, i j}^{'} ∥^{4})}^{\frac{3 δ}{4}}, because 3 δ \leq 4 \\ \leq 3^{\frac{3 δ}{4}} E ∥ W_{n, i j} ∥^{2 (1 - δ)} {(E ∥ W_{n, i j} ∥^{2})}^{\frac{3 δ}{2}} \leq 3^{\frac{1}{2} + \frac{δ}{4}} E {∥ W_{n, i j} ∥}^{2 + δ} . \end{matrix}

(12)

The inequalities (9)–(12) allow to simplify the terms between parentheses in the last inequality in (8). Recall that

∥ h_{i j}^{(k)} ∥_{\infty} \leq {∥ h^{(k)} ∥}_{\infty}

for all

(i, j) \in D_{n, 2}

and

0 \leq k \leq 3

. Therefore, we obtain

Δ_{n, i j} \leq \frac{2 (1 + 3^{\frac{1}{2} + \frac{δ}{4}})}{6^{δ}} ∥ h^{(2)} ∥_{\infty}^{1 - δ} ∥ h^{(3)} ∥_{\infty}^{δ} E ∥ W_{n, i j} ∥^{2 + δ} \leq 6 ∥ h^{(2)} ∥_{\infty}^{1 - δ} ∥ h^{(3)} ∥_{\infty}^{δ} E {∥ W_{n, i j} ∥}^{2 + δ},

because, for all

δ \in [0, 1]

,

C (δ) = \frac{2 (1 + 3^{\frac{1}{2} + \frac{δ}{4}})}{6^{δ}} \leq C (0) = 2 (1 + \sqrt{3}) < 6

.

As a consequence, from Assumption (Lin’), we obtain

Δ_{n} \leq 6 ∥ h^{(2)} ∥_{\infty}^{1 - δ} {∥ h^{(3)} ∥}_{\infty}^{δ} A_{n}

. The proof of Lemma 1 ends. □

Remark 1.

By taking

ϵ < 6 ∥ h^{(2)} ∥_{\infty} (∥ h^{(3)} {∥_{\infty})}^{- 1}

and suitably using the second inequality of (8) in the proof of Lemma 1, the classical Lindeberg conditions can be used:

Δ_{n} \leq 2 ∥ h^{(2)} ∥_{\infty} B_{n} (ϵ) + {∥ h^{(3)} ∥}_{\infty} a_{n} (\frac{4}{3} ϵ + \sqrt{B_{n} (ϵ)}),

(13)

where

\begin{matrix} B_{n} (ϵ) & = \sum_{(i, j) \in D_{n, 2}} E [∥ W_{n, i j} ∥^{2} I_{{∥ W_{n, i j} ∥ > ϵ}}], ϵ > 0, n \in N^{2}; \\ a_{n} & = \sum_{(i . j) \in D_{n, 2}} E {∥ W_{n, i j} ∥}^{2} < \infty, n \in N^{2} . \end{matrix}

Moreover, these classical Lindeberg conditions imply the conditions from Lemma 1. Indeed, we have

Δ_{n} \leq 2 ∥ h^{(2)} ∥_{\infty} ϵ^{- δ} A_{n} + {∥ h^{(3)} ∥}_{\infty} a_{n} (\frac{4}{3} ϵ + ϵ^{- δ / 2} \sqrt{A_{n}}),

for

δ \in (0, 1)

and

ϵ > 0

.

The proof of this remark for general independent random vectors is given in (Bardet et al. [4], p. 165).

Remark 2.

Observe that Assumptions (Lin) and (Cov) imply that

B_{n} (ϵ) \underset{n \to \infty}{⟶} 0

and that

a_{n} = \sum_{i = 1}^{k} {(r_{n} v_{n})}^{- 1} Cov (f_{i} (Y_{n}), f_{i} (Y_{n})) \underset{n \to \infty}{⟶} \sum_{i = 1}^{k} c (f_{i}, f_{i}) < \infty

, respectively. Therefore, if the blocks

{(Y_{n, i j})}_{(i, j) \in D_{n, 2}}

are independent and if Assumptions (Lin) and (Cov) hold, then from Lemma 1 and Remark 1, the fidis of the empirical process

{(Z_{n} (f))}_{f \in F}

of cluster functionals converges to the fidis of a Gaussian process

{(Z (f))}_{f \in F}

with covariance function c.

For the dependent case, we need to consider more notations:

Let

L_{i}^{j} : = \{(i, v) : v \in [j]\} \subset D_{n, 2}

, for all

(i, j) \in D_{n, 2}

. We set

L_{i}^{0} = L_{0}^{j} = \emptyset

for any

i \in [m_{1}]

and any

j \in [m_{2}]

. For each

k \in N

,

f_{k} = (f_{1}, \dots, f_{k}) \in F^{k}

,

t \in R^{k}

and

n \in N^{2}

, we define

\begin{matrix} T_{n, t} (f_{k}) : = \\ \sum_{(j_{1}, j_{2}) \in D_{n, 2}} |Cov (exp (i ⟨ t, \sum_{(u_{1}, u_{2}) \in D_{n, 2} \ (⋃_{l = 0}^{j_{1} - 1} L_{l}^{m_{2}} \cup L_{j_{1}}^{j_{2}})} W_{n, u_{1} u_{2}} ⟩), exp (i ⟨ t, W_{n, j_{1} j_{2}} ⟩))| . \end{matrix}

Lemma 2

(Dependent Lindeberg lemma). Suppose that the random variables

{(W_{n, i j})}_{(i, j) \in D_{n, 2}}

defined in (5) satisfy Assumption (Lin’). Consider the special case of complex exponential functions

h (w) = exp (i ⟨ t, w ⟩)

with

t \in R^{k}

. Then, for each

k \in N

and each

k -

tuple

f_{k} = (f_{1}, \dots, f_{k})

of cluster functionals, the following inequality holds:

Δ_{n} \leq T_{n, t} (f_{k}) + 6 {∥ t ∥}^{2 + δ} A_{n}, n \in N^{2} .

Proof.

Consider

{(W_{n, j_{1} j_{2}}^{*})}_{(j_{1}, j_{2}) \in D_{n, 2}}

an array of independent random variables satisfying Assumption (Lin’) and such that

{(W_{n, j_{1} j_{2}}^{*})}_{(j_{1}, j_{2}) \in D_{n, 2}}

is independent of

{(W_{n, j_{1} j_{2}})}_{(j_{1}, j_{2}) \in D_{n, 2}}

and

{(W_{n, j_{1} j_{2}}^{'})}_{(j_{1}, j_{2}) \in D_{n, 2}}

. Moreover, assume that

W_{n, j_{1} j_{2}}^{*}

has the same distribution as

W_{n, j_{1} j_{2}}

for

(j_{1}, j_{2}) \in D_{n, 2}

.

Then, using the same decomposition (7) in the proof of the previous lemma, one can also write

\begin{matrix} Δ_{n, j_{1} j_{2}} \leq |E [h_{j_{1} j_{2}} (V_{n, j_{1} j_{2}} + W_{n, j_{1} j_{2}}) - h_{j_{1} j_{2}} (V_{n, j_{1} j_{2}} + W_{n, j_{1} j_{2}}^{*})]| \\ + |E [h_{j_{1} j_{2}} (V_{n, j_{1} j_{2}} + W_{n, j_{1} j_{2}}^{*}) - h_{j_{1} j_{2}} (V_{n, j_{1} j_{2}} + W_{n, j_{1} j_{2}}^{'})]| . \end{matrix}

(14)

Then, from the previous lemma, the second term of the right-hand side (RHS) of the inequality (14) is bounded by

6 ∥ h^{(2)} ∥_{\infty}^{1 - δ} ∥ h^{(3)} ∥_{\infty}^{δ} E ∥ W_{n, j_{1} j_{2}} ∥^{2 + δ} \leq {6 ∥ t ∥}^{2 + δ} E {∥ W_{n, j_{1} j_{2}} ∥}^{2 + δ} .

For the first term of the RHS of the inequality (14), first notice that, for a

R^{k} -

valued random vector X independent from

{(W_{n, j_{1} j_{2}}^{'})}_{(j_{1}, j_{2}) \in D_{n, 2}}

, we have

\begin{matrix} E h_{j_{1} j_{2}} (X) & = E [h (X + \sum_{u = 0}^{j_{1} - 1} \sum_{v = 1}^{m_{2}} W_{n, u v}^{'} + \sum_{v = 0}^{j_{2} - 1} W_{n, j_{1} v}^{'})] \\ = exp (- \frac{1}{2} t^{T} (\sum_{u = 0}^{j_{1} - 1} \sum_{v = 1}^{m_{2}} C_{n, u v} + \sum_{v = 0}^{j_{2} - 1} C_{n, j_{1} v}) t) E [exp (i ⟨ t, X ⟩)], \end{matrix}

because

W_{n, j_{1} j_{2}}^{'} \sim N_{k} (0, C_{n, j_{1} j_{2}})

, where

C_{n, j_{1} j_{2}} : = Cov (W_{n, j_{1} j_{2}})

is the covariance matrix of the vector

W_{n, j_{1} j_{2}}

, for

(j_{1}, j_{2}) \in D_{n, 2}

. For

j_{1} = 0

or

j_{2} = 0

, recall that

W_{n, j_{1} j_{2}} = 0

. In this case, we also set

C_{n, j_{1} j_{2}} = 0

. Thus,

\begin{matrix} |E [h_{j_{1} j_{2}} (V_{n, j_{1} j_{2}} + W_{n, j_{1} j_{2}}) - h_{j_{1} j_{2}} (V_{n, j_{1} j_{2}} + W_{n, j_{1} j_{2}}^{*})]| \\ = |exp (- \frac{1}{2} t^{T} (\sum_{u = 0}^{j_{1} - 1} \sum_{v = 1}^{m_{2}} C_{n, u v} + \sum_{v = 0}^{j_{2} - 1} C_{n, j_{1} v}) t) \\ \times E [exp (i ⟨ t, V_{n, j_{1} j_{2}} ⟩) (exp (i ⟨ t, W_{n, j_{1} j_{2}} ⟩) - exp (i ⟨ t, W_{n, j_{1} j_{2}}^{*} ⟩))]| \\ = |exp (- \frac{1}{2} t^{T} (\sum_{u = 0}^{j_{1} - 1} \sum_{v = 1}^{m_{2}} C_{n, u v} + \sum_{v = 0}^{j_{2} - 1} C_{n, j_{1} v}) t)| \\ \times |Cov (exp (i ⟨ t, V_{n, j_{1} j_{2}} ⟩), exp (i ⟨ t, W_{n, j_{1} j_{2}} ⟩))| \\ \leq |Cov (exp (i ⟨ t, V_{n, j_{1} j_{2}} ⟩), exp (i ⟨ t, W_{n, j_{1} j_{2}} ⟩))| . \end{matrix}

Therefore,

\begin{matrix} Δ_{n} = \sum_{(j_{1}, j_{2}) \in D_{n, 2}} Δ_{n, j_{1} j_{2}} \\ \leq \sum_{(j_{1}, j_{2}) \in D_{n, 2}} (|Cov (exp (i ⟨ t, V_{n, j_{1} j_{2}} ⟩), exp (i ⟨ t, W_{n, j_{1} j_{2}} ⟩))| + {6 ∥ t ∥}^{2 + δ} E {∥ W_{n, j_{1} j_{2}} ∥}^{2 + δ}) \\ = T_{n, t} (f_{k}) + 6 {∥ t ∥}^{2 + δ} A_{n} . \end{matrix}

This completes the proof of Lemma 2. □

The previous lemma together with Remark 1 imply the following theorem.

Theorem 1

(CLT for cluster functionals on random fields). Suppose that the basic Assumption (Bas) holds and that Assumptions (Lin) and (Cov) are satisfied. Then, if for each

k \in N

,

T_{n, t} (f_{k})

converges to zero as

n \to \infty

, for all

t \in R^{k}

and all

k -

tuple

f_{k} = (f_{1}, \dots, f_{k}) \in F^{k}

of cluster functionals, the fidis of the empirical process

{(Z_{n} (f))}_{f \in F}

of cluster functionals converges to the fidis of a Gaussian process

{(Z (f))}_{f \in F}

with covariance function c defined in (Cov).

Proof.

The assumptions (Lin) and (Cov) imply that, as

n \to \infty

,

B_{n} (ϵ) ⟶ 0

and

a_{n} ⟶ \sum_{s = 1}^{k} c (f_{s}, f_{s}) < \infty

, respectively. Therefore, taking into account Remark 1, we obtain from Lemma 2 that, for each

k \in N

,

Δ_{n} = |E [h (\sum_{(i, j) \in D_{n, 2}} W_{n, i j}) - h (\sum_{(i, j) \in D_{n, 2}} W_{n, i j}^{'})]| \underset{n \to \infty}{⟶} 0,

for all

t \in R^{k}

, with

h (w) = exp (i ⟨ t, w ⟩)

, because by hypothesis,

T_{n, t} (f_{k}) \underset{n \to \infty}{⟶} 0

for all

t \in R^{k}

and all

f_{k} = (f_{1}, \dots, f_{k}) \in F^{k}

.

Notice that

W_{n}^{'} : = \sum_{(i, j) \in D_{n, 2}} W_{n, i j}^{'} \sim N_{k} (0, m_{1} m_{2} Cov (W_{n, 11}))

and that

|E (h (W_{n}^{'}) - h (W))| \underset{n \to \infty}{⟶} 0

, where

W \sim N_{k} (0, Σ_{k})

, with

Σ_{k} = {(c (f_{i}, f_{j}))}_{(i, j) \in {[k]}^{2}}

.

Using triangular inequality, we deduce that

|E [h (\sum_{(i, j) \in D_{n, 2}} W_{n, i j}) - h (W)]| \underset{n \to \infty}{⟶} 0,

and therefore

(Z_{n} (f_{1}), \dots, Z_{n} (f_{k})) = \sum_{(i, j) \in D_{n, 2}} W_{n, i j} \overset{D}{\underset{n \to \infty}{⟶}} W

. The proof of Theorem 1 is complete. □

Remark 3.

The previous theorem can be formulated for

d = 3

as follows. Define

S_{i} = {(u, v, w) : u \in [i], v \in [m_{2}], w \in [m_{3}]} \subseteq D_{n, 3}

, for

i \in [m_{1}]

, with the convention

S_{0} = \emptyset

. Moreover,

L_{i j}^{k} = {(i, j, w) : w \in [k]}

, for

(i, j, k) \in D_{n, 3}

, and

L_{i j}^{k} = \emptyset

if i, j or k is zero. Then, if Assumptions (Bas), (Lin), (Cov) are satisfied (for

d = 3

), and if for each

k \in N

,

\begin{matrix} T_{n, t}^{*} (f_{k}) = \sum_{(j_{1}, j_{2}, j_{3}) \in D_{n, 3}} |Cov (exp (i ⟨ t, V_{n, j_{1} j_{2} j_{3}} ⟩), exp (i ⟨ t, W_{n, j_{1} j_{2} j_{3}} ⟩))| \end{matrix}

(15)

converges to zero as

n \to \infty

for all

t \in R^{k}

and all

k -

tuple

f_{k} = (f_{1}, \dots, f_{k}) \in F^{k}

of cluster functionals, with

V_{n, j_{1} j_{2} j_{3}} : = \sum_{(u_{1}, u_{2}, u_{3}) \in D_{n, 3} \ (S_{j_{1} - 1} \cup ⋃_{l = 0}^{j_{2} - 1} L_{j_{1} l}^{m_{3}} \cup L_{j_{1} j_{2}}^{j_{3}})} W_{n, u_{1} u_{2} u_{3}},

the fidis of the empirical process

{(Z_{n} (f))}_{f \in F}

of cluster functionals converges to the fidis of a Gaussian process

{(Z (f))}_{f \in F}

with covariance function c.

Remark 4.

We have mentioned earlier that

n = (n_{1}, \dots, n_{d}) \to \infty

means

n_{i} \to \infty

for each

i \in [d]

. However, the limits of the sequences indexed with

n

, as

n \to \infty

, could be reformulated in terms of the limits of such sequences as “

n \to \infty

along a monotone path on the lattice

N^{d}

”, i.e., along

n = (⌈ ϑ_{1} (n) ⌉, \dots, ⌈ ϑ_{d} (n) ⌉)

for some strictly increasing continuous functions

ϑ_{i} : [1, \infty) ⟶ [1, \infty)

, with

i \in [d]

, such that

ϑ_{i} (n) ⟶ \infty

as

n \to \infty

, for

i \in [d]

.

Suppose that from each block

Y_{n}

we extract a sub-block

Y_{n}^{'}

and that the remaining parts

R_{n} = Y_{n} - Y_{n}^{'}

of the blocks

Y_{n}

do not influence the process

Z_{n} (f)

. In particular, this last statement is fulfilled if

{(r_{n} v_{n})}^{- 1} E {| Δ_{n} (f) - E Δ_{n} (f) |}^{2} I_{{| Δ_{n} (f) - E Δ_{n} (f) | \leq \sqrt{n_{n} v_{n}}}} = o (1)

and

P (| Δ_{n} (f) - E Δ_{n} (f) | > \sqrt{n_{n} v_{n}}) = o (r_{n} / n_{n})

, where

Δ_{n} (f) : = f (Y_{n}) - f (Y_{n}^{'})

. This assumption would allow us to consider

T_{n, t} (f_{k})

(or

T_{n, t}^{*} (f_{k})

) as a function of the blocks

Y_{n}^{'}

(separated by

l_{n}

) instead of the blocks

Y_{n}

, in order to provide them bounds based on either the strong mixing coefficient of Rosenblatt [12] or the weak-dependence coefficients of Doukhan and Louhichi [10] for stationary random fields. These bounds are developed in Gómez-García [5] for the case of weakly-dependent time series. However, we do not develop them in the random field context as this is not the aim of this work. This topic will be addressed in a forthcoming applied statistics paper with numerical simulations.

3. Asymptotic Behavior of the Extremogram for Space–Time Processes

In this section, we propose a measure (in two versions) of serial dependence on space and time of extreme values of space–time processes. We provide an estimator for this measure and we use Theorem 1 in order to establish an asymptotic result. This work is inspired by the extremogram for time series defined in Davis and Mikosch [13].

Let

X = {X_{t} (s) : s \in Z^{d}, t \geq 0}

be a

R^{k} -

valued space–time process, which is stationary in both space and time. We define the extremogram of X for two sets A and B both bounded away from zero by

ρ_{A, B} (s, h_{t}) : = lim_{x \to \infty} P (x^{- 1} X_{h_{t}} (s) \in B |x^{- 1} X_{0} (0) \in A),

(16)

with

(s, h_{t}) \in Z^{d} \times [0, \infty)

, provided that the limit exists.

In estimating the extremogram, the limit on x in (16) is replaced by a high quantile

u_{n}

of the process. Defining

u_{n}

as the

(1 - 1 / k_{n}) -

quantile of the stationary distribution of

∥ X_{t} (s) ∥

or related quantity, with

k_{n} = o (n) ⟶ \infty

, as

n \to \infty

, one can redefine (16) by

ρ_{A, B} (s, h_{t}) = lim_{n \to \infty} P (u_{n}^{- 1} X_{h_{t}} (s) \in B |u_{n}^{- 1} X_{0} (0) \in A),

(17)

with

(s, h_{t}) \in Z^{d} \times [0, \infty)

.

The choice of such a sequence of quantiles

{(u_{n})}_{n \in N}

is not arbitrary. The main condition to guarantee the existence of the limit (17) for any two sets A and B bounded away from zero, is that it must satisfy the following convergence

k_{n} P (u_{n}^{- 1} (X_{t_{1}} (s_{1}), \dots, X_{t_{p}} (s_{p})) \in \cdot) \underset{n \to \infty}{\overset{v a g u e}{⟶}} m_{(s_{1}, t_{1}), \dots, (s_{p}, t_{p})} (\cdot),

(18)

for all

(s_{i}, t_{i}) \in Z^{d} \times [0, \infty)

,

i \in [p]

,

p \in N

, where

{(m_{(s_{1}, t_{1}), \dots, (s_{p}, t_{p})})}_{(s_{i}, t_{i}) \in Z^{d} \times [0, \infty), i \in [p], p \in N}

is a collection of Radon measures on the Borel

σ -

field

B ({\bar{R}}^{k p} \ {0})

, not all of them being the null measure, with

m_{(s_{1}, t_{1}), \dots, (s_{p}, t_{p})} ({\bar{R}}^{k p} \ R^{k p}) = 0

. In this case, we have

\begin{matrix} P (u_{n}^{- 1} X_{h_{t}} (s) \in B |u_{n}^{- 1} X_{0} (0) \in A) = & \frac{k_{n} P (u_{n}^{- 1} (X_{0} (0), X_{h_{t}} (s)) \in A \times B)}{k_{n} P (u_{n}^{- 1} X_{0} (0) \in A)} \\ ⟶ & \frac{m_{(0, 0), (s, h_{t})} (A \times B)}{m_{(0, 0)} (A)} = ρ_{A, B} (s, h_{t}), \end{matrix}

provided that

m_{(0, 0)} (A) > 0

.

Remark 5.

The condition (18) is particularly satisfied if the space–time process X is regularly varying. For details and examples of regularly varying space–time processes and time series, see Davis and Mikosch [1] and Basrak and Segers [14], respectively.

Note that the extremogram (17) is a function of two lags: a spatial-lag

s \in Z^{d}

and a non-negative time-lag

h_{t}

. Due to all the spatial values that the spatial-lag

s

takes, in practice, it is very complicated to analyze the results of estimating such an extremogram. Moreover, the calculation would be very slow in terms of computation. To obtain a simpler interpretation and to simplify the calculations, we assume that the space–time process X satisfies the following “isotropy” condition:

(I): For each pair of non-negative integers $h_{t}$ and $h_{s}$ ,

$P (X_{0} (0) \in A, X_{h_{t}} (s) \in B) = P (X_{0} (0) \in A, X_{h_{t}} (s^{'}) \in B), \forall s, s^{'} \in S_{h_{s}}^{d - 1},$

where

S_{h}^{d - 1} : = \{s \in Z^{d} : {∥ s ∥}_{\infty} = h\}

with

h \geq 0

and

∥ (s_{1}, \dots, s_{d}) ∥_{\infty} = {max}_{i = 1, \dots, d} | s_{i} |

.

Under this condition, the extremogram (17) can be redefined using only two non-negative integer lags: a spatial-lag

h_{s}

and a time-lag

h_{t}

. Indeed, under Condition (I), we define the iso-extremogram of X for two sets A and B both bounded away from zero by

ρ_{A, B}^{*} (h_{s}, h_{t}) = ρ_{A, B} (h_{s} {\vec{e}}_{1}, h_{t}), h_{s}, h_{t} \in N_{0} : = {0} \cup N,

(19)

where

{\vec{e}}_{1} = (1, 0, 0, 0, \dots, 0) \in R^{d}

is the first element of the canonical basis of

R^{d}

.

We now propose an estimator for the iso-extremogram. For this, without loss of generality, consider

d = 2

because the case

d > 2

can be treated in the same way.

Let

X_{n} : = \{X_{t} (i, j) : (i, j, t) \in [n_{1}] \times [n_{2}] \times [n_{3}]\}

be the observations from a

R^{k}

-valued space–time process X, stationary in both space and time, and which satisfies Condition (I). Let us set

n = n_{1} n_{2} n_{3}

. The sample iso-extremogram based on the observations

X_{n}

is given by

{\hat{ρ^{*}}}_{A, B} (h_{s}, h_{t}) : = \frac{\sum_{(j_{1}, j_{2}) \in [m_{1}] \times [m_{2}]} \sum_{t = 1}^{n_{3} - h_{t}} \sum_{(i_{1}, i_{2}) \in S_{h_{s}} (c_{j_{1} j_{2}})} \frac{I_{\{\frac{X_{t + h_{t}} (i_{1}, i_{2})}{u_{n}} \in B, \frac{X_{t} (c_{j_{1} j_{2}})}{u_{n}} \in A\}}}{# S_{h_{s}} (c_{j_{1} j_{2}})}}{\sum_{(j_{1}, j_{2}) \in [m_{1}] \times [m_{2}]} \sum_{t = 1}^{n_{3}} I_{\{\frac{X_{t} (c_{j_{1} j_{2}})}{u_{n}} \in A\}}},

(20)

for

h_{s} = 0, 1, 2, \dots, ⌈ 2^{- 1} min {r_{1}, r_{2}} ⌉ - 1

, and

h_{t} = 0, \dots, n - 1

, where

c_{i j} : = (⌈\frac{(2 i - 1) r_{1} + 1}{2}⌉, ⌈\frac{(2 j - 1) r_{2} + 1}{2}⌉)

denotes the “center” of the block

B_{i j} = [(i - 1) r_{1} + 1 : i r_{1}] \times [(j - 1) r_{2} + 1 : j r_{2}]

, for

(i, j) \in [m_{1}] \times [m_{2}]

. Moreover,

S_{h} (u, v) : = {(i, j) \in [n_{1}] \times [n_{2}] : ∥ (u, v) - (i, j) ∥_{\infty} = h}

with

h \geq 0

and

# E

denotes the cardinality of the set E. We recall that

r_{i} = r_{n_{i}, i}

and

m_{i} = ⌈ n_{i} / r_{i} ⌉

, for

i = 1, 2, 3

.

Defining the cluster functional

f_{A, B, h_{1}, h_{2}} : (⋃_{l_{1}, l_{2}, l_{3} = 1}^{\infty} B_{l_{1} l_{2} l_{3}} (R^{k}), R_{\cup}) ⟶ (R, B (R)),

for

h_{1}, h_{2} = 0, 1, 2, \dots

, such that

f_{A, B, h_{1}, h_{2}} ({(x_{(i_{1}, i_{2}, i_{3})})}_{(i_{1}, i_{2}, i_{3}) \in [l_{1}] \times [l_{2}] \times [l_{3}]}) = \sum_{(i_{1}, i_{2}) \in S_{h_{1}} (c)} \sum_{i_{3} = 1}^{l_{3} - h_{2}} \frac{I_{A \times B} (x_{(c, i_{3})}, x_{(i_{1}, i_{2}, i_{3} + h_{2})})}{# S_{h_{1}} (c)},

(21)

with

c = (⌈ (l_{1} + 1) / 2 ⌉, ⌈ (l_{2} + 1) / 2 ⌉) \in [l_{1}] \times [l_{2}]

(the “center” of the block

B = [l_{1}] \times [l_{2}]

), we can rewrite the estimator (20) as

{\hat{ρ^{*}}}_{A, B} (h_{s}, h_{t}) = \frac{\sum_{(j_{1}, j_{2}, j_{3}) \in D_{n, 3}} f_{A, B, h_{s}, h_{t}} (Y_{n, j_{1} j_{2} j_{3}}) + δ_{n} + R_{A, B, h_{s}, h_{t}}}{\sum_{(j_{1}, j_{2}, j_{3}) \in D_{n, 3}} f_{A, A, 0, 0} (Y_{n, j_{1} j_{2} j_{3}}) + R_{A, A, 0, 0}},

(22)

where

\begin{matrix} δ_{n} : & = \sum_{(j_{1}, j_{2}, j_{3}) \in D_{n, 3}} \sum_{(i_{1}, i_{2}) \in S_{h_{s}} (c_{j_{1} j_{2}})} \sum_{t = j_{3} r_{3} - h_{t} + 1}^{j_{3} r_{3}} \frac{I_{\{\frac{X_{t + h_{t}} (i_{1}, i_{2})}{u_{n}} \in B, \frac{X_{t} (c_{j_{1} j_{2}})}{u_{n}} \in A\}}}{# S_{h_{s}} (c_{j_{1} j_{2}})}, \\ R_{A, B, h_{s}, h_{t}} : & = \sum_{(j_{1}, j_{2}) \in [m_{1}] \times [m_{2}]} \sum_{(i_{1}, i_{2}) \in S_{h_{2}} (c_{j_{1} j_{2}})} \sum_{t = m_{3} r_{3} + 1}^{n_{3} - h_{t}} \frac{I_{\{\frac{X_{t + h_{t}} (i_{1}, i_{2})}{u_{n}} \in B, \frac{X_{t} (c_{j_{1} j_{2}})}{u_{n}} \in A\}}}{# S_{h_{s}} (c_{j_{1} j_{2}})} . \end{matrix}

We can therefore write (22) in terms of empirical processes of cluster functionals (4) and use Lindeberg CLT for cluster functionals on random fields (Theorem 1) together with suitable conditions of joint distributions, in order to prove the convergence in distribution of the iso-extremogram estimator.

For this, first of all, we make some considerations: the normalized random variables are defined here by

X_{n, (i_{1}, i_{2}, t)} = u_{n}^{- 1} X_{t} (i_{1}, i_{2})

, where

n = (n_{1}, n_{2}, n_{3})

and

n = n_{1} n_{2} n_{3}

; and the random blocks

{(Y_{n, j_{1} j_{2} j_{3}})}_{(j_{1}, j_{2}, j_{3}) \in D_{n, 3}}

as in (2). We define

N_{0} : = N \cup {0}

and

F_{A, B} : = \{f_{A, B, h_{s}, h_{t}} : h_{s}, h_{t} \in N_{0}\}

as the family of cluster functionals defined in (21). Moreover, for the set A, bounded away from zero, let

v_{n} : = P (u_{n}^{- 1} X_{0} (0, 0) \in A)

.

Secondly, consider the following conditions:

(Cov’): For each $h_{s}, h_{s}^{'}, h_{t}, h_{t}^{'} \in N_{0}$ ,

\sum_{i \in S_{h_{s}} (c)} \sum_{i^{'} \in S_{h_{s}^{'}} (c)} \sum_{t = 1}^{r_{3} - h_{t}} \sum_{t^{'} = 1}^{r_{3} - h_{t}^{'}} \frac{P (u_{n}^{- 1} (X_{t} (c), X_{t^{'}} (c)) \in A^{2}, (X_{t + h_{t}} (i), X_{t^{'} + h_{t}^{'}} (i^{'})) \in B^{2})}{r v_{n} \cdot # S_{h_{s}} (c) \cdot # S_{h_{s}^{'}} (c)}

and

\sum_{i \in S_{h_{s}} (c)} \sum_{t = 1}^{r_{3} - h_{t}} \sum_{t^{'} = 1}^{r_{3}} \frac{P (u_{n}^{- 1} (X_{t} (c), X_{t^{'}} (c)) \in A^{2}, X_{t + h_{t}} (i) \in B)}{r v_{n} \cdot # S_{h_{s}} (c)}

converge to

σ_{A, B} ((h_{s}, h_{t}), (h_{s}^{'}, h_{t}^{'}))

and

σ_{A, B}^{'} (h_{s}, h_{t})

, respectively, where

r = r_{1} r_{2} r_{3}

and

c = (⌈ (r_{1} + 1) / 2 ⌉, ⌈ (r_{2} + 1) / 2 ⌉)

(the “center” of the block

B_{11} = [r_{1}] \times [r_{2}]

).

(C): $\sum_{(c, t), (c^{'}, t^{'}) \in C (r_{1}, r_{2}) \times [n_{3}]} P (u_{n}^{- 1} (X_{t} (c), X_{t^{'}} (c^{'})) \in A \times A) = O (1)$ ,

where

C (r_{1}, r_{2}) : = {c_{i j} \in [n_{1}] \times [n_{2}] : (i, j) \in [m_{1}] \times [m_{2}]}

is set of the “centers” of the blocks

B_{i j} = [(i - 1) r_{1} + 1 : i r_{1}] \times [(j - 1) r_{2} + 1 : j r_{2}]

.

Proposition 1

(CLT for the iso-extremogram estimator). Assume that the following conditions hold for the

R^{k}

-valued space–time process

X = \{X_{t} (s) : (s, t) \in Z^{2} \times [0, \infty)\} .

The process X is stationary in both space and time and satisfies Condition (I).
The sequence $(u_{n})$ is such that $()$ holds. Moreover, $r ≪ v_{n}^{- 1} ≪ n$ and $\sqrt{n v_{n}} ≪ r ≪ n v_{n} r_{3}$ , where $n = n_{1} n_{2} n_{3}$ , $r = r_{1} r_{2} r_{3}$ , $r_{i} ≪ n_{i}$ and $r_{i} = r_{n_{i}, i} ⟶ \infty$ , for $i = 1, 2, 3$ .
Conditions (Cov’) and (C) hold, and the Lindeberg condition (Lin) is satisfied for the normalized variables $X_{n, (s, t)} = u_{n}^{- 1} X_{t} (s)$ together with the family of cluster functionals $F_{A, B}$ . Moreover, for each $k \in N$ , the coefficient $T_{n, t}^{*} (f_{k})$ defined in (15) converges to zero as $n \to \infty$ , for all $k -$ tuple of cluster functionals $(f_{1}, \dots, f_{k}) \in F_{A, B}^{k}$ and all $t \in R^{k}$ . The same assumption holds together with the family $F_{A} : = \{f_{A, A, 0, 0}\}$ , which contains a single functional.

Then, for each

(L_{s}, L_{t}) \in N_{0} \times N_{0}

,

\frac{\sqrt{n v_{n}}}{r_{1} r_{2}} {({\hat{ρ^{*}}}_{A, B} (h_{s}, h_{t}) - ρ_{A, B, n}^{*} (h_{s}, h_{t}))}_{0 \leq h_{s} \leq L_{s}, 0 \leq h_{t} \leq L_{t}} \overset{D}{\underset{n \to \infty}{⟶}} N (0, Σ_{A, B, L_{s}, L_{t}}),

(23)

where

ρ_{A, B, n}^{*} (h_{s}, h_{t}) : = P (u_{n}^{- 1} X_{h_{t}} (h_{s} {\vec{e}}_{1}) \in B |u_{n}^{- 1} X_{0} (0) \in A)

and

Σ_{A, B, L_{s}, L_{t}}

is the covariance matrix, defined by the coefficients

σ_{h, h^{'}} = σ_{A, B} (h, h^{'}) - ρ_{A, B}^{*} (h^{'}) σ_{A, B}^{'} (h) - ρ_{A, B}^{*} (h) σ_{A, B}^{'} (h^{'}) + ρ_{A, B}^{*} (h) ρ_{A, B}^{*} (h^{'}) σ_{A, A}^{'} (0),

with

h, h^{'} \in [0 : L_{s}] \times [0 : L_{t}]

.

Proof.

Consider the expression (22) of the iso-extremogram estimator. Then, for

(h_{s}, h_{t}) \in [0 : L_{s}] \times [0, L_{t}]

, we obtain that

\begin{matrix} \frac{\sqrt{n v_{n}}}{r_{1} r_{2}} ({\hat{ρ^{*}}}_{A, B} (h_{s}, h_{t}) - ρ_{A, B, n}^{*} (h_{s}, h_{t})) \\ = \frac{Z_{n} (f_{A, B, h_{s}, h_{t}}) - (\frac{m h_{t} v_{n}}{\sqrt{n v_{n}}} + Z_{n} (f_{A, A, 0, 0})) ρ_{A, B, n}^{*} (h_{s}, h_{t}) + \frac{δ_{n}}{\sqrt{n v_{n}}} + R}{\frac{r_{1} r_{2}}{\sqrt{n v_{n}}} Z_{n} (f_{A, A, 0, 0}) + 1 + \frac{r_{1} r_{2} R_{A, A, 0, 0}}{n v_{n}}}, \end{matrix}

(24)

where

Z_{n} (\cdot)

denotes the empirical process of cluster functionals (4). Furthermore, here

R = {(n v_{n})}^{- 1} (R_{A, B, h_{s}, h_{t}} - ρ_{A, B, n}^{*} R_{A, A, 0, 0})

and

m = m_{1} m_{2} m_{3}

.

Now, notice that Chebyshev’s inequality applied on the random variables R and

r_{1} r_{2} R_{A, A, 0, 0} /

(n v_{n})

implies that they converge to zero in probability as

n \to \infty

. Similarly, applying Chebyshev’s inequality together with the condition

\sqrt{n v_{n}} = o (r)

, we prove that

{(n v_{n})}^{- 1 / 2} δ_{n} \overset{P}{⟶} 0

, as

n \to \infty

. This last condition (

\sqrt{n v_{n}} = o (r)

) also guarantees that

m h_{t} v_{n} {(n v_{n})}^{- 1 / 2} \underset{n \to \infty}{⟶} 0

. Again, Chebyshev’s inequality on the random variable

\frac{r_{1} r_{2}}{\sqrt{n v_{n}}} Z_{n} (f_{A, A, 0, 0})

, followed by Condition (C) and

r = o (n v_{n} r_{3})

, implies that this converges to zero in probability as

n \to \infty

. Thus,

\begin{matrix} \frac{\sqrt{n v_{n}}}{r_{1} r_{2}} ({\hat{ρ^{*}}}_{A, B} (h_{s}, h_{t}) - ρ_{A, B, n}^{*} (h_{s}, h_{t})) \\ = Z_{n} (f_{A, B, h_{s}, h_{t}}) - ρ_{A, B, n}^{*} (h_{s}, h_{t}) Z_{n} (f_{A, A, 0, 0}) + o (1) . \end{matrix}

From Theorem 1, the assumption 3 implies that

{(Z_{n} (f_{A, B, h_{s}, h_{t}}))}_{(h_{s}, h_{t}) \in [0 : L_{s}] \times [0 : L_{t}]}

converges to a centered Gaussian random variable with covariance matrix

{(σ_{A, B} (h, h^{'}))}_{h, h^{'} \in [0 : L_{s}] \times [0 : L_{t}]},

for each

(L_{s}, L_{t}) \in N_{0}^{2}

. Using the same argument, we prove that

Z_{n} (f_{A, A, 0, 0})

converges to a centered Gaussian variable with variance

σ_{A, A} (0, 0)

.

Finally, considering the existence of

σ_{A, B}^{'}

in (Cov’), we obtain the desired result. □

4. Conclusions and Perspectives

We have proved Lindeberg lemmas for cluster functionals on stationary random fields. This allowed us to obtain a CLT for the finite-dimensional marginal distributions of the empirical process (4) of cluster functionals of stationary random fields under the classical Lindeberg condition and the convergence to zero of a sequence

T_{n}

that summarizes the dependence between the blocks of values of the random field. Moreover, we have introduced a new spatio–temporal measure of serial extremal dependence: the iso-extremogram, a type of correlogram for extreme values of space–time processes. Under precise conditions, we have proved that the iso-extremogram estimator is asymptotically Gaussian.

In all our results, it can be noted that the sequence

T_{n}

converges to zero if the random field satisfies short range dependence conditions; either mixing or weak-dependence conditions. However, in this work we do not specify such conditions because it is not the aim of this paper, but of course it will be presented in a forthcoming applied statistics article including numerical simulations. To obtain a general idea of how to simplify the coefficient

T_{n}

using weak dependence coefficients, the reader is referred to Gómez-García [5] which deals with the time series framework.

Author Contributions

Conceptualization, J.G.G.-G.; methodology, J.G.G.-G. and C.C.; investigation, J.G.G.-G.; writing—original draft preparation, J.G.G.-G.; writing—review and editing, J.G.G.-G. and C.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We would like to thank the two referees and an Associate Editor for their constructive comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

Davis, R.A.; Mikosch, T. Extreme value theory for space-time processes with heavy-tailed distributions. Stoch. Process. Their Appl. 2008, 118, 560–584. [Google Scholar] [CrossRef]
Long, J.P.; De Sousa, R.S. Wiley StatsRef: Statistics Reference Online. In Statistical Methods in Astronomy; American Cancer Society: Atlanta, GA, USA, 2018; pp. 1–11. [Google Scholar] [CrossRef]
Drees, H.; Rootzén, H. Limit theorems for empirical processes of cluster functionals. Ann. Stat. 2010, 38, 2145–2186. [Google Scholar] [CrossRef]
Bardet, J.; Doukhan, P.; Lang, G.; Ragache, N. Dependent Lindeberg Central Limit Theorem and Some Applications. ESAIM Probab. Stat. 2007, 12, 154–172. [Google Scholar] [CrossRef][Green Version]
Gómez-García, J. Dependent Lindeberg central limit theorem for the fidis of empirical processes of cluster functionals. Statistics 2018, 52, 955–979. [Google Scholar] [CrossRef]
Resnick, S. Point processes, regular variation and weak convergence. Adv. Appl. Probab. 1986, 18, 66–138. [Google Scholar] [CrossRef]
Resnick, S. Extreme Values, Regular Variation, and Point Processes; Springer: Berlin, Germany, 1987. [Google Scholar]
Yun, S. The distributions of cluster functionals of extreme events in a dth-order Markov chain. J. Appl. Probab. 2000, 37, 29–44. [Google Scholar] [CrossRef]
Segers, J. Functionals of clusters of extremes. Adv. Appl. Probab. 2003, 35, 1028–1045. [Google Scholar] [CrossRef]
Doukhan, P.; Louhichi, S. A new weak dependence condition and applications to moment inequalities. Stoch. Process. Their Appl. 1999, 84, 313–342. [Google Scholar] [CrossRef]
Andrews, D.K.W. Non strong mixing autoregressive processes. J. Appl. Probab. 1984, 21, 930–934. [Google Scholar] [CrossRef]
Rosenblatt, M. A central limit theorem and a strong mixing condition. Proc. Natl. Acad. Sci. USA 1956, 42, 43–47. [Google Scholar] [CrossRef] [PubMed]
Davis, R.A.; Mikosch, T. The extremogram: A correlogram for extreme events. Bernoulli 2009, 15, 977–1009. [Google Scholar] [CrossRef]
Basrak, B.; Segers, J. Regularly varying multivariate time series. Stoch. Process. Their Appl. 2009, 119, 1055–1080. [Google Scholar] [CrossRef]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

A Dependent Lindeberg Central Limit Theorem for Cluster Functionals on Stationary Random Fields

Abstract

1. Introduction

2. Results

3. Asymptotic Behavior of the Extremogram for Space–Time Processes

4. Conclusions and Perspectives

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics