Spatio-Functional Nadaraya–Watson Estimator of the Expectile Shortfall Regression

Alamari, Mohammed B.; Almulhim, Fatimah A.; Kaid, Zoulikha; Laksaci, Ali

doi:10.3390/axioms13100678

Open AccessArticle

Spatio-Functional Nadaraya–Watson Estimator of the Expectile Shortfall Regression

¹

Department of Mathematics, College of Science, King Khalid University, Abha 62529, Saudi Arabia

²

Department of Mathematical Sciences, College of Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia

^*

Author to whom correspondence should be addressed.

Axioms 2024, 13(10), 678; https://doi.org/10.3390/axioms13100678

Submission received: 12 August 2024 / Revised: 17 September 2024 / Accepted: 23 September 2024 / Published: 30 September 2024

(This article belongs to the Special Issue Advances in Functional and Topological Data Analysis)

Download

Browse Figures

Versions Notes

Abstract

The main aim of this paper is to consider a new risk metric that permits taking into account the spatial interactions of data. The considered risk metric explores the spatial tail-expectation of the data. Indeed, it is obtained by combining the ideas of expected shortfall regression with an expectile risk model. A spatio-functional Nadaraya–Watson estimator of the studied metric risk is constructed. The main asymptotic results of this work are the establishment of almost complete convergence under a mixed spatial structure. The claimed asymptotic result is obtained under standard assumptions covering the double functionality of the model as well as the data. The impact of the spatial interaction of the data in the proposed risk metric is evaluated using simulated data. A real experiment was conducted to measure the feasibility of the Spatio-Functional Expectile Shortfall Regression (SFESR) in practice.

Keywords:

financial risk; complete consistency; expected shortfall; functional data; kernel method; expectile regression; quantile regression

MSC:

62G08; 62G10; 62G35; 62G07; 62G32; 62G30; 62H12

1. Introduction

Currently, the spatial correlation of data has a potential impact on financial risk management. Indeed, with the rapid development of internet technology, investors are increasingly interested in international financial assets, which requires taking into account the spatial dependence of international stock markets. Of course, unlike standard spatial data analysis, the spatial correlation in spatio-financial time series data is not necessarily measured by the geographic coordinates of the stock markets. This is the principal motivation for introducing a financial risk metric to cover the spatial component of risk management. Recall that spatial data cannot be treated as independent (see [1,2], among others). In practice, the challenging issue of spatial data analysis comes from the fact that points are in multi-dimensional space without linear order.

Statistical analysis of spatial data has become widely developed in the last decade. Concerning the nonparametric approach, the first results were obtained by the author of [3], who obtained the asymptotic normality for the density kernel estimator. The regression function was studied in [4,5], in which the authors employed an estimator from the Nadaraya–Watson weights techniques. We refer to [6] for the nonparametric kernel estimator for the variogram, considering Nadaraya–Watson weights. Ref. [7] investigated the local linear estimation for the regression function (see also [8] for the spatial auto-regression model) and proved the uniform convergence of the constructed estimator. Their convergence rate is optimal according to the

L_{\infty}

-norm. In [9], we found an alternative local linear estimator of the spatial regression, which was obtained using the least absolute deviation. In this cited work, the authors have derived the asymptotic normality of their estimator. We return to [10] for estimation using the nearest neighbor method. In functional statistics, the authors of [11] have constructed an estimator using the spatiotemporal process. They proved the almost complete convergence (a.co.) of their estimator when the input variable is a continuous time process. The spatial quantile regression was estimated by [12]. Their estimator was constructed by inverting the estimator of the cumulative distribution function. For a more bibliographic discussion of spatio-functional data analysis, we refer the reader to [13,14,15,16].

The second important component of this study is the shortfall function (ES). This is a risk management model and was created by [17]. The principal motivation of the expected shortfall function as a risk metric is its coherency property. The estimation of the ES model is performed using multiple algorithms such as parametric, nonparametric, or semi-parametric approaches. The recent advances and references on the parametric approaches can be found in [18,19,20]. While the nonparametric estimation was developed by [21], we also cite [22] for the functional Nadaraya–Watson estimator of the functional expected shortfall regression (FESR), in which the authors studied the asymptotic properties of FESR under the mixing assumption. The weak dependence case was treated by the authors of [23], who almost established complete consistency of the kernel estimator of the FESR using the quasi-associated structure. We point out that in previous studies, the expected loss in FSER is defined through the Value at Risk (VaR)-level, the so-called FSER-VaR. In this work, we introduce an alternative risk threshold defined by the expectile regression, the so-called FSER-expectile. The expectile regression is an alternative risk metric based on tail expectation, unlike the VaR function, which is based on tail frequency. For this reason, the use of the expectile instead of the VaR function is more informative because it is more sensitive to outliers. This feature increases its ability to fit the financial risk located in the extreme values. In recent years, the expectile model has gained popularity in risk analysis (see, for instance, [24,25,26,27] for more motivations for these models). Although previous studies focus on the unconditional models, in this paper, we focus on the regression case. This version of the expectile has been studied in multivariate statistics by many authors. The first results date back to [28]. In the last decade, multivariate expectile regression has been employed for many statistical issues, including additive models [29], neural network models [30], and machine learning models [27]. However, financial risk analysis seems to be the principal applied area of the expectile regression model. In this context, ref. [31] proposes an estimation of the value at risk (VaR) using an expectile model. Ref. [32] presents different approaches used to preserve the coherence properties of multivariate expectiles. The same authors in [33] established the asymptotic behavior of the multivariate expectiles for the Fréchet model. The treatment of the functional case was recently considered in [13], in which the authors considered expectile regression (ER) with a functional covariate. They constructed an estimator of the functional ER using the nonparametric kernel approach. An alternative approach was studied by the authors of [34] using the functional parametric ER. The authors employed a Hilbert structure using a reproducing kernel. More recent advances in functional expectile regression can be found in [15] and the references therein. We may return to [35,36,37] for more recent development in FTSA.

As discussed below, the main purpose of the present paper is to introduce a new risk metric based on the expectile shortfall regression. The developed risk metric has many advantages over the old shortfall model. These advantages are because the expectile is elicitable and coherent, unlike the VaR, and additionally, it is more sensitive to the magnitude of the tail, unlike the VaR function. Thus, the expectile shortfall with expectile (ESE) is more efficient than the standard shortfall. In this paper, we consider a more complex functional structure based on the spatial correlation. The spatial correlation is more general than the standard functional time series structure. It allows for controlling the spatial interaction of the data, which is more interactive in risk management. Furthermore, the principal outcomes of this work are the construction of a computational estimator and the establishment of its asymptotic properties using spatial dependence. The practical use of this risk metric is evaluated using simulated and real data. To the best of our knowledge, spatial expected shortfall regression has not yet been fully explored, and this is the first study in this direction.

This paper is organized as follows: We present our model as well as its spatial estimator in the next section. Section 2 is dedicated to introducing the spatio-functional time series framework. The almost complete convergence of the constructed estimator is shown in Section 3. Section 4 is devoted to examining the easy implementation of the estimator using simulated data. In Section 5, we apply our model to analyze the extreme values in environmental time series data. Some concluding remarks, as well as some future prospects, are discussed in Section 6. Finally, the proofs of the auxiliary results are given in Appendix A.

2. Model and Estimator

Consider

(A_{i}, B_{i})

,

i \in {Z Z}^{N}

,

N \geq 1

, a stationary spatial process defined on a probability space

(Ω, A, I P)

and valued

F \times I R

.

F

is a semi-metric space with d denoting the corresponding semi-metric. A point

i

will be referred to as a site and is defined by the components

(i_{1}, \dots, i_{N}) \in {Z Z}^{N}

. In this work, we focus on increasing domain asymptotic, where the underlining process,

(A_{i}, B_{i})

, is observed over a rectangular domain

I_{n} = \{i = (i_{1}, \dots, i_{N}) \in {Z Z}^{N},

1 \leq i_{k} \leq n_{k}, k = 1, \dots, N\}

,

n = (n_{1}, \dots, n_{N}) \in {Z Z}^{N}

. Therefore, the index-vector

n \to \infty

means

\min {n_{k}} \to \infty

and

| \frac{n_{j}}{n_{k}} | < C

for all

j, k

such that

1 \leq j, k \leq N

and for a given constant C such that

0 < C < \infty

. This kind of design is known as an asymptotically increasing domain, which allows the area of observations to become larger without large distances between the sites. Moreover, for

n = (n_{1}, \dots, n_{N}) \in {Z Z}^{N}

, we set

\bar{n} = \prod_{i = 1}^{N} n_{i}

. The spectral structure of the functional random field

(A_{i}, B_{i}), i \in {Z Z}^{N}

, is controlled through the following mixing condition:

\{\begin{matrix} There exists a function ψ (t) ↓ 0 a s t \to \infty, such that \\ \forall X, X^{'} subsets of {Z Z}^{N} has finite cardinals \\ α (B (X), B (X^{'})) = \sup_{B \in B (X), C \in B (X^{'})} |I P (B \cap C) - I P (B) I P (C)| \\ \leq ϕ (Card (X), Card (X^{'})) ψ (dist (X, X^{'})), \end{matrix}

(1)

where

B (X)

(respectively,

B (X^{'})

) means the Borel

σ

-field generated by

(A_{i}, i \in X)

(respectively,

(A_{i}, i \in X^{'})

), Card

(X)

(respectively, Card

(X^{'})

) is the cardinality of

X

(respectively,

X^{'}

), dist

(X, X^{'})

is the Euclidean distance between

X

and

X^{'}

and

ϕ : {Z Z}^{2} \to I R^{+}

is a symmetric positive function nondecreasing in each variable, such that

\forall n, m, \in Z Z

\begin{matrix} ϕ (n, m) \leq C \min (n, m), C > 0 . \end{matrix}

(2)

\begin{matrix} \sum_{i = 1}^{\infty} i^{δ} ψ (i) < \infty, δ > 0 . \end{matrix}

(3)

Note that condition (2) can be replaced by

\begin{matrix} ϕ (n, m) \leq C {(n + m + 1)}^{\tilde{β}} for some \tilde{β} > 1 . \end{matrix}

(4)

Both conditions (2) and (4) are used in Tran [3] and Carbon et al. [8], and are satisfied by many spatial models (see [38] for some examples). It should be noted that if

N = 1,

then

(A_{i}, B_{i})

is called a strongly mixing process.

Throughout this paper, for a fixed point

z \in F

, we denote by

N_{z}

for a given neighborhood of

z

. We assume that

(A_{i}, B_{i})

’s have the same distribution as

(A, B)

. We put

C D F (\cdot | z^{'})

, the conditional distribution of B given

A = z^{'}

, and we assume the regular version of this conditional distribution exists for any

z^{'} \in N_{z}

. Additionally, we suppose that

C D F (\cdot | z)

has a continuous density

f (\cdot | z)

with respect to Lebesgue’s measure over

I R

.

Recall that the standard FESR regression is defined

for all z \in F, by R E S_{p} (z) = I E [B | B > R V a R_{p} (z), A = z],

where

R V a R_{p}

is the conditional quantile of order

1 - p

. Clearly, it is defined through the tail quantile, which is frequency-tail. Alternatively, it would be more interesting to evaluate this metric using the expectation tail. To do that, we introduce the FESR-expectile defined

for all z \in F, by R E A_{p} (z) = I E [B | B > R E X P_{p} (z), A = z],

where

{R E X P}_{p}

is

the expectile regression {R E X P}_{p} (z) = \arg \min_{t \in I R} \{I E {[p {(B - t)}^{2} 1 I}_{{(B - t) > 0}} ∣ A = z]

+ I E {[(1 - p) {(B - t)}^{2} 1 I}_{{(B - t) \leq 0}} ∣ A = z],

where

1_{C}

is the indicator function of the set

C

. It should be noted that the replacement of

R V a R_{p}

by

{R E X P}_{p}

is important in practice, as it permits remedying the lack of risk insensitivity of

R V a R_{p}

to the extreme values.

Now, to estimate

R E A_{p} (z)

using the kernel estimator, we consider

F (\cdot)

, a measurable function,

r = r_{n}

a positive sequence of real numbers tending to zero as

n

tends to infinity, and we estimate the FESR-expectile by

\hat{{R E A}_{p}} (z) = \frac{\sum_{i \in I_{n}} F [r^{- 1} d (z, A_{i})] B_{i} 1_{B_{i} > {\hat{R E X P}}_{p} (z)}}{\sum_{i \in I_{n}} F (r^{- 1} d (z, A_{i}))},

(5)

where

{\hat{R E X P}}_{p}

is the kernel estimator of

{R E X P}_{p}

, defined as the solution of

\hat{G} ({\hat{R E X P}}_{p} (t; z)) = \frac{p}{1 - p}

with

\tilde{G} (t; z) = \frac{- \sum_{i \in I_{n}} F_{n i} (z) (B_{i} - t) 1 I {(B_{i} - t) \leq 0}}{\sum_{i \in I_{n}} F_{n i} (z) (B_{i} - t) 1 I {(B_{i} - t) \leq 0}}, for t \in I R,

where

\begin{matrix} F_{n i} (z) = \frac{F [r^{- 1} d (z, A_{i})]}{\sum_{i \in I_{n}} F [r^{- 1} d (z, A_{i})]} . \end{matrix}

We refer to [13] for more discussion on the construction of the estimator

{\hat{R E X P}}_{p}

. While the estimator

\hat{{R E A}_{p}}

is constructed using similar ideas to those used for classical regression [39], it is clear that the choice of the parameter r is primordial in this smoothing approach. It is crucial for the estimation of

\hat{{R E A}_{p}}

as well as for

{\hat{R E X P}}_{p}

. Motivated by the strong relationship between the expectile and the mean squared error (MSE), the MSE-based cross-validation criterion is an appropriate rule with which to address this issue. The latter is common in nonparametric functional data analysis:

r_{o p t} = \arg \min_{r} \sum_{i \in I_{n}} {(B_{i} - {\hat{R E X P}}_{0.5} (A_{i}))}^{2} .

(6)

The popularity of this approach comes from its easy implementation in real data analysis, using the fact that the conditional mean

I E [Y | X]

is associated with

{\hat{R E X P}}_{p}

with

p = 0.5

.

3. Main Asymptotic Result

Before stating the asymptotic properties of the estimator

\hat{{R E A}_{p}}

, we need to introduce some notations and assumptions. Firstly, we set

C_{z}

or

C_{z}^{'}

as some strictly positive generic constants, and for all

t \in I R

, we define

E S (t, z) = I E [B 1_{B > t} | A = z]

. Now, to formulate our main results, we will use the hypotheses listed below:

(P1): $P (A \in B (z, r)) = ϕ (z, r) > 0$ where $B (z, r) = \{x^{'} \in F : d (z^{'}, z) < r\}$ .
(P2): $\exists δ > 0, \forall (t_{1}, t_{2}) \in [{R E X P}_{p} (z) - δ, {R E X P}_{p} (z) + δ]$ , $\forall (z_{1}, z_{2}) \in N_{x}^{2}$ ,

$| E S (t_{1}, z_{1}) - E S (t_{2}, z_{2}) | \leq C_{x} (d^{b} (z_{1}, z_{2}) + {| t_{1} - t_{2} |}^{b}), b > 0 .$
(P3): The sequence ${(A_{i}, B_{i})}_{i \in I_{n}}$ such that

$\{\begin{matrix} \forall i \neq j, 0 < \sup_{i \neq j} I P [(A_{i}, A_{j}) \in B (x, r) \times B (x, r)] \leq C_{1} {(ϕ (z, r))}^{(a + 1) / a}, \\ for some 1 < a < δ N^{- 1} . \\ \forall t \in [θ_{x} - δ, θ_{x} + δ], I E [B_{i} B_{j} | A_{i}, A_{j}] \leq C < \infty, \\ I E [{|B|}^{2} | X] < C < \infty and I E [{|B|}^{p}] < C < \infty, p > 1 \end{matrix}$
(P4): $F$ is a function with support $(0, 1)$ such that
$0 < C {1 I}_{(0, 1)}$ < F(t) < C ′ ${1 I}_{(0, 1)}$ < ∞.
(P5): There exists $η_{0} > 0$ , such that,

$C {\bar{n}}^{\frac{(b - 1) N - b δ}{b δ} + η_{0}} \leq ϕ (z, r)$

Comments on the hypotheses.
Hypothesis (P1) is checked for several continuous time processes (see, for instance, [40] for a general Gaussian process). The local dependency in the first part (P3) allows us to obtain the same convergence rate as in the i.i.d. case. These hypotheses could be weakened, but the convergence rate would be perturbed by the presence of covariance terms (see Liebscher [41]). (P3) is a mild regularity hypothesis imposed to evaluate the bias term. The assumptions (P4)–(P5) are technical conditions for simplifying the proofs.

Now, we obtain the convergence rate of the almost complete convergence (a.co.) of the estimator

\hat{{R E A}_{p}} (z)

to

R E A_{p} (z)

. This stochastic convergence is stronger than the convergence in probability and almost sure convergence.

Theorem 1.

Under the suppositions (P1)–(P5), we have

|\hat{{R E A}_{p}} (z) - R E A_{p} (z)| = O (r^{b}) + O ({(\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)})}^{1 / 2}) a . c o . as n \to \infty .

(7)

4. Simulated Data

In this section, we aim to evaluate the impact of the spatial dependency on the finite-sample performance of the spatio-functional expectile shortfall estimator. In order to highlight the main feature of our procedure, we compare its sensitivity to the volatility of the data in two situations (homoscedastic and heteroscedastic cases). For this purpose, we generate the data from the following regression relationship

\begin{matrix} Model M 1 : & Y_{i} = \int_{0}^{1} 5 c o s ({(4 - A_{i} (t))}^{2} π) d t + ϵ_{i}, i = (i_{1}, i_{2}) N = 2 \\ Model M 2 : & Y_{i} = \int_{0}^{1} 1.5 e x p (A_{i} (t)) d t + (\int (5 \log ({(4 - A_{i} (t))}^{2})) d t) ϵ_{i} . \end{matrix}

where

ϵ_{i}

is a Gaussian random field that has an exponential covariogram function,

\begin{matrix} C (u) = σ^{2} e^{\frac{- u}{ϕ}} u \in [0, \infty) . \end{matrix}

(8)

Now, in order to fit the financial risk management context, we draw the spatio-functional input variables using a spatial ARCH process. This consideration allows us to simulate the spatial interaction in the co-movement of stock markets. Indeed, let

R_{t, i}

, the log-return of a financial asset at time t on the stock market

i

, be generated by a spatial ARCH process

R_{t, i} = Σ_{t, i} Z_{t, i},

where

Z_{t, i}

is a sequence of random variables that are independent in t and identically distributed with zero mean, unit variance, and constant covariance matrix C. The conditional variance

Σ_{t, i}

is defined by

Σ_{t, i}^{2} = α^{'} + ρ \sum_{j} w_{i, j} P_{t - 1, j}^{2},

where

w_{i, i}

is a known Spatial Weight Matrix (SWM). In fact, this kind of spatio-functional process is obtained using the routine code sim.spARCH in the R-package spGARCH. A sample of the functional co-variate is plotted in Figure 1.

Recall that the principal feature of the FESR-expectile is its high sensitivity to the outliers. To measure the impact of this characteristic, we use the routine code ODM in the R-Package OutlierDM to detect the number of outliers in each model. It appears that the first model contains

4 %

versus

28 %

for the second one. On the other hand, the spatial-heterogeneity of the data constitutes a second principal issue of our study. The latter is controlled through the parameters

σ

,

ϕ

and the spatial weight matrix

w_{i, i}

. So, we calculate

M S E (p) = {\bar{n}}^{- 1} \sum_{i \in I_{n}} {(B_{i} - \hat{{R E A}_{p}} (A_{i}))}^{2} {1 I}_{B_{i} > \hat{{R E A}_{p}} (A_{i})}

for various values of the mentioned parameters.

Now, for this empirical study, we choose the smoothing parameter r via the local mean square cross-validation method as in (6). In the sense that the optimization of the mean square rule is performed over a discrete set defined by the

k^{t h}

-distance from the location point. The integer number k is obtained from

{5, 10, 15, 20, 25, 30, \dots 50}

. For the kernel

F

, we use the

β

-kernel. Finally, the metric is chosen according to the nature of the functional variable and its smoothing property. It appears that the principal component (pca) metric is more suitable for this type of discontinuous functional regressor.

The simulation results are given in Table 1.

We observe that the behavior of the estimator

\hat{{R E A}_{p}}

is strongly affected by the different parameters of this study, such as the rate of the outliers and the spatial dependency degree. The high variability of the error between these different situations highlights the importance of the FESR- expectile as a risk-metric. In particular, the MSE varies between

0.018

and

0.045

with respect to the spatial level, while the horizontal variability, which describes the sensitivity to the outliers rate, ranges between

0.018

and

0.095

. These results incorporate the theoretical study, where the convergence rate is strongly affected by the local dependency of the spatio-functional data. In the sense that the computational part proves that the performance of the estimator is strongly impacted by the degree of spatial correlation of the data. Such a conclusion highlights the importance of the expectile-based-shortfall. The latter is very sensitive to the variability or deviation of the data, allowing more reliability in risk detection. This feature makes the expectile-based-shortfall more appropriate as a risk metric than the standard expected shortfall. We point out that the standard expected shortfall is based on the quantile, which is a robust model with low sensitivity to the variability in the risk analysis, because the risk is often located in the extremes. Such a characteristic is not beneficial in risk analysis. Finally, we can say that the estimator

\hat{{R E A}_{p}}

is very easy to implement and has good performance according to the nature of the treated data.

5. Real Data Application

After demonstrating the straightforward implementation of the estimator in the last section, we now focus on the applicability of our model to real spatial time series data. More specifically, we compare the performance of the new FESR-expectile

\hat{{R E A}_{p}}

to the classical one

\tilde{R E S_{p}} (s) = \frac{\sum_{i \in I_{n}} F (r^{- 1} d (z, A_{i}) (a G (a^{- 1} (\hat{R V a R_{p}} (z) - B_{i})) + B_{i} (1 - H (a^{- 1} (\hat{R V a R_{p}} (z) - B_{i}))))}{p \sum_{i \in I_{n}} F (r^{- 1} d (z, A_{i}))},

where

G (s) = \int_{s}^{\infty} u F (u)

and

H (s) = \int_{- i n f t y}^{s} F (u) d u .

In the previous section, we evaluated the impact of spatial correlation using the ARCH model, which is well-solicited as an appropriate method for fitting the financial time series data. Alternatively, in this part, we employ the FESR-expectile model for another area, specifically in the environmental domain. This application emphasizes the importance and versatility of the FESR model. The environmental domain is a particularly relevant area for risk management, as air quality significantly affects the quality of life. Moreover, the extreme values models have usually been employed to model the risk in this area. Here, we aim to compare the efficiency of the FESR- expectile

\hat{{R E A}_{p}}

with the FESR-VaR

\tilde{R E S_{p}}

in terms of risk prevention in air quality domain. For this goal, we analyze the air quality data used by [42], which concerns the ozone concentration in Beijing. These data are available on the website https://dataverse.harvard.edu/dataverse/beijing-air (accessed on 8 August 2024). Furthermore, there are many indices of air quality, such as Ozone (O₃), Particulate Matter (PM2.5 and PM10), Nitrogen Dioxide (NO₂), Carbon, and Sulfur Dioxide (SO₂). However, in this section, we concentrate on the ozone quantity (O₃) and sulfur dioxide (SO₂). Recall that the (SO₂) and the ultraviolet rays have a significant impact on the stratospheric ozone. Specifically, we collect the data from 120 monitoring stations in Beijing and we define

A_{i}

as the daily curve of SO₂ at the station

i

(on 30 December 2016). The response variable

B_{i}

represents the total ozone measured the day before at the same station

i

. The daily curves for the sulphur dioxide are shown in Figure 2.

Now, in order to explore the spatial correlation of the data, we follow the same strategy considered by [43]. This strategy permits us to estimate the spatial trend using the classical regression as follows. Indeed, we define

{\tilde{A}}_{i} = r_{1} (i) + A_{i} and {\tilde{B}}_{i} = r_{2} (i) + B_{i} .

Therefore, before computing the estimators

\hat{{R E A}_{p}}

and

\tilde{R E S_{p}}

, we start by estimating the statistics

{({\hat{A}}_{i}, {\hat{B}}_{i})}_{i}

. The latter is estimated by

{\hat{A}}_{i} = {\tilde{A}}_{i} - {\hat{r}}_{1} (i) and {\hat{B}}_{i} = {\tilde{B}}_{i} - {\hat{r}}_{2} (i),

where

{\hat{r}}_{1} (.)

and

{\hat{r}}_{2} (.)

are the kernel estimators of the functions

r_{1}

and

r_{2}

which are

{\hat{m}}_{1} (i_{0}) = \frac{\sum_{i \in I_{n}} F_{1} (r^{- 1} ∥ i_{0} - i ∥) A_{i}}{\sum_{i \in I_{n}} F_{1} (r^{- 1} ∥ i_{0} - i ∥)} (resp . {\hat{m}}_{2} (j_{0}) = \frac{\sum_{j \in I_{n}} F_{2} (r^{- 1} ∥ j_{0} - j ∥) B_{j}}{\sum_{i \in I_{n}} F_{2} (r^{- 1} ∥ j_{0} - j ∥)}),

where

F_{1}, F_{2}

are kernel functions. Such estimators are obtained using the routine code npreg in the R-package np with

F_{1} = F_{2}

being the quadratic kernel. This step is fundamental for spatio-functional data analysis and is referred to as the detrending step. To highlight the potential impact of spatial correlation, we compare our expected shortfall to the standard one in both cases: with or without detrending. Specifically, the estimation with detrending is calculated by

{({\hat{A}}_{i}, {\hat{B}}_{i})}_{i}

, while in the other case (without detrending), we use the initial observation

{(A_{i}, B_{i})}_{i}

to compute the estimators.

Furthermore, to calculate both estimators, we follow the same procedures used in the simulation section. In other words, we use the

(0, 1)

quadratic kernel and the pca-metric, along with local cross-validation for the bandwidth parameter. The efficiency of both estimators is evaluated by computing

M S E (p) = {\bar{n}}^{- 1} \sum_{i \in I_{n}} {(B_{i} - \hat{Θ_{p}} (A_{i}))}^{2} {1 I}_{B_{i} > \hat{{REX}^{p}} {(A_{i})}^{'}}

where

\hat{Θ_{p}}

represents

\hat{{REA}_{p}}

or

\tilde{{RES}_{p}}

. The values of

M S E ()

are evaluated as a function of

p

. In Figure 3 and Figure 4, we show the values of

M S E

of both estimators

\hat{{REA}_{p}}

(black line) and

\tilde{{RES}_{p}}

(red line) in both cases (with detrending and without detrending step—see Figure 3 and Figure 4).

The graphs show the superiority of the FESR-expectile regression over the FESR-quantile model. This statement can be confirmed by the position of the black line, which is under the red line in most cases. These results show that the FESR-expectile detects the excessive level of ozone concentration more effectively, even in cases of high variability. This feature is not surprising. The slow variability of the VaR level is due to the robustness of the quantile regression, which reduces its sensitivity to extreme values. Additionally, this advantage seems to be more significant in the detrending step compared to the non-detrending case. This statement can be confirmed using the cover test developed by Bayer and Dimitriadis [44]. This test allows us to examine the goodness-of-fit of our approach. The proposed test is an alternative approach to the procedure introduced by [45] for forecasting. Since the risk prediction differs significantly from standard prediction, we have opted to examine the feasibility of our risk-metric using the Bayer–Dimitriadis test. Specifically, we compare both functional approaches

\tilde{R E S_{p}}

and

\hat{{R E A}_{p}}

using the routine code esr-backtest from the R-package esrback. We have employed this code with

α = 0.05

. Unsurprisingly, the obtained results confirm that both models are significantly good for this risk management issue. Typically, the cover-test gives a p-value of

\hat{{R E A}_{p}}

equal to 0.001, compared to 0.004 for the model

\tilde{R E S_{p}}

.

6. Conclusions and Prospects

In this contribution, we have considered the nonparametric estimation of the FESR-regression-expectile under the spatial structure. We have constructed the functional version of the kernel estimator of this model as a risk-metric. This study covers a more general case of the functional random field. In the theoretical part, we have established the Borell–Contelli convergence under strong spatial mixing assumptions. Such theoretical development provides indispensable mathematical support for the use of the newly developed risk-metric. Additionally, the obtained asymptotic result was derived under general conditions and with the precision of the pointwise convergence rate. The computational part shows the applicability of the estimator and its very easy implementation in practice. Additionally, we applied the new model to an environmental spatio-functional random process. The result confirms the superiority of the FESR-expectile over FESR-VaR. On the other hand, the importance of this contribution can be viewed through several open future directions. For instance, we will address more dependent cases, such as the quasi-associated spatio-functional time series. This situation allows us to control the co-movement of different stock exchanges using weak dependence. The second issue is determining the uniform UNN convergence of the estimator, which will help in resolving the smoothing parameter selection. Furthermore, we can also estimate the model using either the additive or the linear case.

Author Contributions

The authors contributed approximately equally to this work. Formal analysis, A.L.; Validation, M.B.A.; Writing—review & editing, Z.K. and F.A.A. All authors have read and agreed to the final version of the manuscript.

Funding

This research is funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2024R515), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia; and the Deanship of Scientific Research and Graduate Studies at King Khalid University through the Research Groups Program under grant number R.G.P./128/45.

Data Availability Statement

The data used in this study are available through the link https://dataverse.harvard.edu/dataverse/beijing-air (accessed on 8 August 2024).

Acknowledgments

The authors thank and extend their appreciation to the funders of this work: This work was supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2024R515), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia, and the Deanship of Scientific Research and Graduate Studies at King Khalid University through the Research Groups Program under grant number R.G.P./128/45.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

This appendix is dedicated to proving the mathematical results of the paper.

Proof of the Theorem 1.

We start by writing, for all

t \in I R

,

\hat{E S} (t, z) = \frac{\sum_{i \in I_{n}} F [r^{- 1} d (z, A_{i})] B_{i} 1_{B_{i} > t}}{\sum_{i \in I_{n}} F [r^{- 1} d (z, A_{i})]} .

Thus,

\hat{E S} ({\hat{R E X P}}_{p} (z), z) = \hat{{R E A}_{p}} (z), and E S ({R E X P}_{p} (z), z) = R E A_{p} (z) .

So,

\hat{{R E A}_{p}} (z) - R E A_{p} (z) = \hat{E S} ({\hat{R E X P}}_{p} (z), z) - E S ({\hat{R E X P}}_{p} (z), z)

+ E S ({\hat{R E X P}}_{p} (z), z) - E S ({R E X P}_{p} (z), z) .

Then,

\begin{matrix} | \hat{{R E A}_{p}} (z) - R E A_{p} (z) | \leq \sup_{t \in [{R E X P}_{p} (z) - δ, {R E X P}_{p} (z) + δ]} | \hat{E S} (t, z) - E S (t, z) | \\ + C | {\hat{R E X P}}_{p} (z) - {R E X P}_{p} (z) | . \end{matrix}

So, the convergence rate in Theorem 1 is consequence of

\sup_{t \in [{R E X P}_{p} (z) - δ, {R E X P}_{p} (z) + δ]} | \hat{E S} (t, z) - E S (t, z) | = O (r^{b}) + O ({(\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)})}^{1 / 2}) a . c o .

(A1)

and

| {\hat{R E X P}}_{p} (z) - {R E X P}_{p} (z) | = O (r^{b}) + O ({(\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)})}^{1 / 2}) a . c o .

(A2)

As (A2) is proved in [13], it suffices to establish (A1). For this, we have

I E [{\hat{E S}}_{D} (z)] = 1

and we write, for

t \in I R

\hat{E S} (t, z) - \hat{E S} (t, z) = \frac{1}{{\hat{E S}}_{D} (z)} [({\hat{E S}}_{N} (t, z) - I E [{\hat{E S}}_{N} (t, z)])

- (\hat{E S} (t, z)) - I E [{\hat{E S}}_{N} (t, z)])] - \frac{{\hat{E S}}_{N} (t, z)}{{\hat{E S}}_{D} (z)} [{\hat{E S}}_{D} (z) - I E [{\hat{E S}}_{D} (z)]] .

Finally, the proof is a consequence of Lemmas A1–A3. □

Lemma A1.

Under the suppositions (P1) and (P3)–(P5), we have

{\hat{E S}}_{D} (z) - I E [{\hat{E S}}_{D} (z)] = O {(\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)})}^{1 / 2} a . c o .

Additionally,

\sum_{n} I P ({\hat{E S}}_{D} (z) < \frac{1}{2}) < \infty .

Proof of Lemma A1.

To prove this lemma, we use the classical spatial block decomposition (see [3]). Set

{\hat{E S}}_{D} (z) = \frac{1}{\bar{n}} \sum_{i \in I_{n}} \frac{F [r^{- 1} d (z, A_{i})]}{I E [F [r^{- 1} d (z, A_{i})]]} .

We put

F_{i} = F [r^{- 1} d (z, A_{i})]

and

D_{i} = F_{i} - I E [F_{i}]

\begin{matrix} {\hat{E S}}_{D} (z) - 1 = \frac{1}{\bar{n} I E [F_{1}]} \sum_{i \in I_{n}} D_{i} . \end{matrix}

So, we consider a sequence

p_{n}

and decompose the sum into

2^{N}

partial sums of random variables as follows:

\begin{matrix} Y (1, n, x, j) = \sum_{i_{k} = 2 j_{k} p_{n} + 1 k = 1, \dots, N}^{2 j_{k} p_{n} + p_{n}} D_{i}, \end{matrix}

\begin{matrix} Y (2, n, x, j) = \sum_{i_{k} = 2 j_{k} p_{n} + 1 k = 1, \dots, N - 1}^{2 j_{k} p_{n} + p_{n}} \sum_{i_{N} = 2 j_{N} p_{n} + p_{n} + 1}^{(j_{N} + 1) p_{n}} D_{i}, \end{matrix}

\begin{matrix} Y (3, n, x, j) = \sum_{i_{k} = 2 j_{k} p_{n} + 1 k = 1, \dots, N - 2}^{2 j_{k} p_{n} + p_{n}} \sum_{i_{N - 1} = 2 j_{N - 1} p_{n} + p_{n} + 1}^{2 (j_{N - 1} + 1) p_{n}} \sum_{i_{N} = 2 j_{N} p_{n} + 1}^{2 j_{N} p_{n} + p_{n}} D_{i}, \end{matrix}

\begin{matrix} Y (4, n, x, j) = \sum_{i_{k} = 2 j_{k} p_{n} + 1 k = 1, \dots, N - 2}^{2 j_{k} p_{n}} \sum_{i_{N - 1} = 2 j_{N - 1} p_{n} + p_{n} + 1}^{2 (j_{N - 1} + 1) p_{n}} \sum_{i_{N} = 2 j_{N} p_{n} + p_{n} + 1}^{2 (j_{N} + 1) p_{n}} D_{i}, \end{matrix}

and so on. Finally

\begin{matrix} Y (2^{N - 1}, n, x, j) = \sum_{i_{k} = 2 j_{k} p_{n} + p_{n} + 1 k = 1, \dots, N - 1}^{2 (j_{k} + 1) p_{n}} \sum_{i_{N} = 2 j_{N} p_{n} + 1}^{2 j_{N} p_{n} + p_{n}} D_{i}, \end{matrix}

\begin{matrix} Y (2^{N}, n, x, j) = \sum_{i_{k} = 2 j_{k} p_{n} + p_{n} + 1 k = 1, \dots, N}^{2 (j_{k} + 1) p_{n}} D_{i} . \end{matrix}

Setting

J = {0, \dots, r_{1} - 1} \times \dots \times {0, \dots, r_{N} - 1}

, where

r_{i} = 2 n_{i} p_{n}^{- 1}, i = 1, \dots, N

and we denote by

\begin{matrix} T (n, x, i) = \sum_{j \in J} Y (i, n, x, j) . \end{matrix}

Now, we write,

\begin{matrix} | {\hat{E S}}_{D} (z) - I E [{\hat{E S}}_{D} (z)] | = \frac{1}{\bar{n} I E [F_{1}]} \sum_{i = 1}^{2^{N}} T (n, x, i) . \end{matrix}

As regards this last inequality, we have

\forall η > 0

I P (| {\hat{E S}}_{D} (z) - I E [{\hat{E S}}_{D} (z)] | \geq η) \leq 2^{N} \max_{i = 1, \dots} I P (T (n, x, i) \geq η \bar{n} I E [F_{1}]) .

Finally,

I P (T (n, x, i) \geq η \bar{n} I E [F_{1}]), for all i = 1, \dots, 2^{N} .

We enumerate the

M = \prod_{k = 1}^{N} r_{k} = 2^{- N} \bar{n} p_{n}^{- N} \leq \bar{n} p_{n}^{- N}

random variables

Y (1, n, x, j);

j \in J

in the arbitrary way

X_{1}, \dots X_{M}

. Thus, for each

X_{j}

there exists a certain

j_{j}

in

J

such that

X_{j} = \sum_{i \in I (1, n, x, j_{j})} D_{i},

where

I (1, n, x, j_{j}) = \{i : 2 j_{k_{j}} p_{n} + 1 \leq i_{k} \leq 2 j_{k_{j}} p_{n} + p_{n}; k = 1, \dots N\} .

Observe that these sets contain

p_{n}^{N}

sites and are far apart by the distance of

p_{n}^{N}

.

Now, we apply Lemma [8]. It permits the approximation of

X_{1}

,

X_{2}

, …,

X_{M}

by some independent random variables

X_{1}^{*}, \dots X_{M}^{*}

, which have the same low as

X_{j = 1, \dots M}

, and such that

\sum_{j = 1}^{r} I E | X_{j} - X_{j}^{*} | \leq 2 C M p_{n}^{N} ϕ ((M - 1) p_{n}^{N}, p_{n}^{N}) ψ (p_{n}) .

So, we have to evaluate

I P (T (n, x, 1) \geq η) .

For that, we employ Bernstein and Markov inequalities that

\begin{matrix} I P (T (n, x, i) \geq η \bar{n} I E [F_{1}]) \leq B_{1} + B_{2} \end{matrix}

where

B_{1} = I P (|\sum_{j = 1}^{M} X_{j}^{*}| \geq \frac{M η \bar{n} I E [F_{1}]}{2 M}) \leq 2 exp (- \frac{{(η \bar{n} I E [F_{1}])}^{2}}{M V a r [X_{1}^{*}] + C p_{n}^{N} η \bar{n} I E [F_{1}]})

and

\begin{matrix} B_{2} & = & I P (\sum_{j = 1}^{M} | X_{j} - X_{j}^{*} | \geq \frac{η \bar{n} I E [F_{1}]}{2}) \\ \leq & \frac{1}{η \bar{n} I E [F_{1}]} \sum_{j = 1}^{M} I E | X_{j} - X_{j}^{*} | \\ \leq & 2 M p_{n}^{N} {(η \bar{n} I E [F_{1}])}^{- 1} ϕ ((M - 1) p_{n}^{N}, p_{n}^{N}) ψ (p_{n}) . \end{matrix}

Since

\bar{n} = 2^{N} M p_{n}^{N}

and

ϕ ((M - 1) p_{n}^{N}, p_{n}^{N}) \leq p_{n}^{N}

, we get for

η = η_{0} \sqrt{\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)}}

B_{2} \leq \bar{n} p_{n}^{N} {(\ln \bar{n})}^{- 1 / 2} {(\bar{n} ϕ (z, r))}^{- 1 / 2} φ (p_{n}) .

As

p_{n} = C {(\frac{\bar{n} ϕ (z, r)}{\ln \bar{n}})}^{1 / 2 N}

, we write

\begin{matrix} B_{2} \leq \bar{n} ψ (p_{n}) . \end{matrix}

(A3)

Consequently, from (P5), we have

Σ_{n} \bar{n} ψ (p_{n}) < \infty .

Let us focus now on

B_{1}

. Indeed,

V a r [X_{1}^{*}] = V a r [Σ_{i \in I (1, n, x, 1)} D_{i}] = Σ_{i, j \in I (1, n, x, 1)} |C o v (D_{i}, D_{j})| .

Let

Q_{n} = Σ_{i \in I (1, n, x, 1)} V a r [D_{i}]

and

R_{n} = Σ_{i \neq j \in I (1, n, x, 1)} |C o v (D_{i}, D_{j})|

. By Assumptions (P1) and (P2), we have

V a r [D_{i}] \leq C ({(ϕ (z, r))}^{(a + 1) / a} + {(ϕ (z, r))}^{2});

therefore,

\begin{matrix} Q_{n} = O (p_{n}^{N} ϕ (z, r)) . \end{matrix}

Concerning

R_{n}

, we introduce

S_{1} = {i, j \in I (1, n, x, 1) : 0 < ∥i - j∥ \leq c_{n}},

S_{2} = {i, j \in I (1, n, x, 1) : ∥i - j∥ > c_{n}},

where

c_{n}

is a real sequence that converges to

+ \infty

. Split this sum over subsets in

S_{1}

and

S_{2}

\begin{matrix} R_{n} & = & Σ_{(i, j) \in S_{1}} |C o v (D_{i}, D_{j})| + Σ_{(i, j) \in S_{2}} |C o v (D_{i}, D_{j})| \\ = & R_{n}^{1} + R_{n}^{2} . \end{matrix}

First,

\begin{matrix} R_{n}^{1} & = & Σ_{(i, j) \in S_{1}} |I E [F_{i} F_{j}] - I E [F_{i}] I E [F_{j}]| \\ \leq & C p_{n}^{N} c_{n}^{N} ϕ (z, r) ({(ϕ (z, r))}^{1 / a} + ϕ (z, r)) \\ \leq & C p_{n}^{N} c_{n}^{N} {(ϕ (z, r))}^{(a + 1) / a} . \end{matrix}

On the other hand, we have

\begin{matrix} R_{n}^{2} = Σ_{(i, j) \in S_{2}} |C o v (D_{i}, D_{j})| . \end{matrix}

We deduce, from Lemma 4.1 in [8] that

|C o v (D_{i}, D_{j})| \leq C ψ (∥i - j∥),

thus

\begin{matrix} R_{n}^{2} & \leq & C Σ_{(i, j) \in S_{2}} ψ (∥i - j∥) \leq C p_{n}^{N} Σ_{i : ∥ i ∥ \geq c_{n}} ψ (∥i∥) \\ \leq & C p_{n}^{N} c_{n}^{- N a} Σ_{i : ∥ i ∥ \geq c_{n}} {∥i∥}^{N a} ψ (∥i∥) . \end{matrix}

Let

c_{n} = {(ϕ (z, r))}^{- 1 / N a}

, then

\begin{matrix} R_{n}^{2} & \leq & C p_{n}^{N} c_{n}^{- N a} Σ_{i : ∥ i ∥ \geq c_{n}} {∥i∥}^{N a} ψ (∥i∥) \\ \leq & C p_{n}^{N} ϕ (z, r) Σ_{i : ∥ i ∥ \geq c_{n}} {∥i∥}^{N a} ψ (∥i∥) . \end{matrix}

Because of (P2)

R_{n}^{2} \leq C p_{n}^{N} ϕ (z, r) .

Furthermore,

R_{n}^{1} \leq C p_{n}^{N} ϕ (z, r) .

Hence,

\begin{matrix} V a r [X_{1}^{*}] = O (p_{n}^{N} ϕ (z, r)) . \end{matrix}

This last gives

\begin{matrix} B_{1} \leq \exp (- C (η_{0}) \ln \bar{n}) . \end{matrix}

Consequently, a good choice of

η_{0}

gives the claimed result of the lemma. Additionally,

Σ_{n} I P (|{\hat{E S}}_{D} (z)| \leq 1 / 2) \leq Σ_{n} I P (|{\hat{E S}}_{D} (z) - I E [{\hat{E S}}_{D} (z)]| > 1 / 2) < \infty .

□

Lemma A2.

Under the supposition (P1)–(P2) and (P4)–(P5), we have

\sup_{t \in [{R E X P}_{p} (z) - δ, {R E X P}_{p} (z) + δ]} |E S (t, z) - I E [{\hat{E S}}_{N} (t, z)]| = O (r^{b}) .

Proof of Lemma A2.

Writing

E S (t, x) - I E [{\hat{E S}}_{N} (t, z)] = \frac{1}{I E [F_{1} (z)]} I E {[F_{1} (z) 1 I}_{B (z, r)} (z_{1}) (E S (t, x) - E S (t, A_{1})) .

By (P2), we get

{1 I}_{B (z, r)} (A_{1}) | E S (t, x) - E S (t, A_{1}) | \leq {C r}^{b} .

Thus

\sup_{t \in [{R E X P}_{p} (z) - δ, {R E X P}_{p} (z) + δ]} | E S (t, x) - I E [{\hat{E S}}_{N} (t, z)] | \leq C r^{b},

which gives

\sup_{t \in [{R E X P}_{p} (z) - δ, {R E X P}_{p} (z) + δ]} | E S (t, x) - I E [{\hat{E S}}_{N} (t, z)] | = O (r^{b})

□

Lemma A3.

Under the suppositions (P1)–(P5), we have

\sup_{t \in [{R E X P}_{p} (z) - δ, {R E X P}_{p} (z) + δ]} |{\hat{E S}}_{N} (t, z) - I E [{\hat{E S}}_{N} (t, z)]| = O ({(\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)})}^{1 / 2}, a . c o .

Proof of Lemma A3.

Since

[{R E X P}_{p} (z) - δ, {R E X P}_{p} (z) + δ]

then by the compactness feature we get

[{R E X P}_{p} (z) - δ, {R E X P}_{p} (z) + δ] \subset ⋃_{j = 1}^{l_{n}}] B_{j} - d_{n}, B_{j} + d_{n} [

(A4)

for

d_{n} = O (\frac{1}{\sqrt{{\bar{n}}^{b}}})

and

l_{n} = O (\sqrt{{\bar{n}}^{b}})

. The two functions

I E [{\hat{E S}}_{N} (\cdot, z)]

and

{\hat{E S}}_{N} (\cdot, z)

are increasing. Thus, for

1 \leq j \leq l_{n}

,

I E {\hat{E S}}_{N} ((B_{j} - d_{n}, z) \leq \sup_{t \in] B_{j} - d_{n}, B_{j} + d_{n} [} I E {\hat{E S}}_{N} (t, z) \leq I E {\hat{E S}}_{N} (B_{j} + d_{n}, z)

{\hat{E S}}_{N} (t, z) B_{j} - d_{n}, z) \leq \sup_{t \in] B_{j} - d_{n}, B_{j} + d_{n} [} {\hat{E S}}_{N} (t, z) \leq {\hat{E S}}_{N} (B_{j} + d_{n}, t) .

(A5)

Now, by (P2)

\forall t_{1}, t_{2} \in {R E X P}_{p} (z) - δ, {R E X P}_{p} (z) + δ,

we have

|I E {\hat{E S}}_{N} (t_{1}, z) - I E {\hat{E S}}_{N} (t_{2}, z)| \leq C {| t_{1} - t_{2} |}^{b} .

Hence,

\sup_{t \in [{R E X P}_{p} (z) - δ, {R E X P}_{p} (z) + δ]} |{\hat{E S}}_{N} (t, z) - I E {\hat{E S}}_{N} (t, z)|

\leq \max_{1 \leq j \leq l_{n}} \max_{z \in {B_{j} - d_{n}, B_{j} + d_{n}}} |{\hat{E S}}_{N} (z, z) - I E {\hat{E S}}_{N} (z, z)| + C d_{n}^{b} .

Clearly,

d_{n}^{b} = {\bar{n}}^{- 1 / 2} = o {(\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)})}^{1 / 2} .

Therefore, it suffices that

\max_{1 \leq j \leq l_{n}} \max_{z \in {B_{j} - d_{n}, B_{j} + d_{n}}} |{\hat{E S}}_{N} (z, z) - I E {\hat{E S}}_{N} (z, z)| = O {(\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)})}^{1 / 2}, a . c o .

Then,

\forall η > 0

,

\begin{matrix} I P (\max_{1 \leq j \leq l_{n}} \max_{z \in {B_{j} - d_{n}, B_{j} + d_{n}}} |{\hat{E S}}_{N} (z, z) - I E {\hat{E S}}_{N} (z, z)| > η \sqrt{\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)}}) \\ \leq 2 l_{n} \max_{1 \leq j \leq l_{n}} \max_{z \in {B_{j} - d_{n}, B_{j} + d_{n}}} I P (|{\hat{E S}}_{N} (z, z) - I E {\hat{E S}}_{N} (z, z)| > η \sqrt{\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)}}) . \end{matrix}

It remains to prove

I P (|{\hat{E S}}_{N} (z, z) - I E {\hat{E S}}_{N} (z, z)| > η \sqrt{\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)}}) .

Indeed,

{\tilde{F}}_{i} = \frac{1}{I E [F_{1}]} {[F_{i} B_{i} 1 I}_{\{B_{i} \leq z\}} - I E {[F_{i} B_{i} 1 I}_{\{B_{1} \leq z\}}]] .

We write

\forall ε > 0

I P [| {\hat{E S}}_{N} (z, z) - I E {\hat{E S}}_{N} (z, z) | > ε] = I P (\max_{z \in G_{n}} |{\hat{E S}}_{N} (z, z) - I E [{\hat{E S}}_{N} (z, z)]| > ε)

\leq \sum_{z \in G_{n}} I P (|{\hat{E S}}_{N} (z, z) - I E [{\hat{E S}}_{N} (z, z)]| > ε) .

(A6)

Since B is not necessarily bounded, we employ a truncation method by introducing

{\hat{E S}}_{N}^{*} (z, t) = \frac{1}{n I E [F (h^{- 1} d (z, A_{1}))]} \sum_{i \in I_{n}} F (r^{- 1} d (z, A_{i})) B_{i}^{*}

with

B^{*} = B {1 I}_{(B < γ_{n})}

with

γ_{n} = {\bar{n}}^{a / p}

. Thus, the result is a consequence of

\begin{matrix} d_{n} \max_{z \in G_{n}} |I E [{\hat{E S}}_{N}^{*} (z, z)] - I E [{\hat{E S}}_{N} (z, z)]| = O_{a . c o .} {(\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)})}^{1 / 2}, \end{matrix}

(A7)

\begin{matrix} d_{n} \max_{z \in G_{n}} |{\hat{E S}}_{N}^{*} (z, z) - {\hat{E S}}_{N} (z, z)| = O_{a . c o .} {(\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)})}^{1 / 2} \end{matrix}

(A8)

and

\begin{matrix} d_{n} \max_{z \in G_{n}} |{\hat{E S}}_{N}^{*} (z, z) - I E [{\hat{E S}}_{N}^{*} (z, z)]| = O_{a . c o .} {(\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)})}^{1 / 2} . \end{matrix}

(A9)

For (A7) we write, ∀

z \in G_{n}

|I E [{\hat{E S}}_{N}^{*} (z, z)] - I E [{\hat{E S}}_{N} (z, z)]| \leq C \frac{1}{ϕ (z, r)} I E {[|B| 1 I}_{B \geq γ_{n}}} F (r^{- 1} d (z, X))] .

By the inequality of Holder, for

α

and

β

such that

\frac{1}{α} + \frac{1}{β} = 1

, and

α = \frac{p}{2}

\begin{matrix} \forall z \in G_{n} \\ I E {[|B| 1 I}_{{B \geq γ_{n}}} F (r^{- 1} d (z, A_{1}))] & \leq & {I E}^{1 / α} [| B^{α} | 1 I {B \geq γ_{n}}] {I E}^{1 / β} [F^{β} (r^{- 1} d (z, A_{1}))] \\ \leq & γ_{n}^{- 1} {I E}^{1 / α} [| B^{2 α} |] {I E}^{1 / β} [F^{β} (r^{- 1} d (z, A_{1}))] \\ \leq & γ_{n}^{- 1} {I E}^{1 / α} [| B^{p} |] {I E}^{1 / β} [F^{β} (r^{- 1} d (z, A_{1}))] \\ \leq & C γ_{n}^{- 1} ϕ^{1 / β} (z, r) . \end{matrix}

Thus,

d_{n} \max_{z \in G_{n}} |{\hat{E S}}_{N}^{*} (z, z) - I E [{\hat{E S}}_{N}^{*} (z, z)]| \leq {\bar{n}}^{1 / 2 - a / p} ϕ^{(1 - β) / β} .

Finally, (A7) is because

a > p

.

Now, for (A8) we use the Markov’s inequality to show that

\forall z \in G_{n}

,

\forall ϵ > 0

\begin{matrix} I P (|{\hat{E S}}_{N}^{*} (z, z) - {\hat{E S}}_{N} (z, z)| > ϵ) & \leq & \sum_{i \in I_{n}} I P (B_{i} > n^{a / p}) \\ \leq & \bar{n} I P (B > n^{a / p}) \\ \leq & {\bar{n}}^{1 - a} I E [B^{p}] . \end{matrix}

Choosing

ϵ = ϵ_{0} (\sqrt{(\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)})})

and using

a > 5 / 2

,

d_{n} \max_{z \in G_{n}} I P (| {\hat{E S}}_{N} (z, z) - {\hat{E S}}_{N}^{*} (z, z) | > ϵ_{0} (\sqrt{(\frac{\ln \bar{n}}{\bar{n} ϕ (z, r)})})) \leq {\bar{n}}^{3 / 2 - a} < C {\bar{n}}^{- 1 - ν} .

Now for (A9), define

z \in G_{n}

,

D_{i} = F_{i} B_{i}^{*} - I E [F_{1} B_{i}^{*}] .

Therefore, ∀

ε > 0

\begin{matrix} I P \{|{\hat{E S}}_{N}^{*} (z, z) - I E [{\hat{E S}}_{N}^{*} (z, z)]| > ε\} & = & I P \{|\frac{1}{\bar{n} I E [F_{1}]} \sum_{i \in I_{n}} D_{i}| > ε\} \\ \leq & I P \{|\sum_{i \in I_{n}} D_{i}| > ε \bar{n} I E [F_{1}]\} . \end{matrix}

Using the spatial blocks decomposition to write

{\hat{E S}}_{N}^{*} (z, z) - I E [{\hat{E S}}_{N}^{*} (z, z)] = \frac{1}{\hat{n} I E [F_{1} (x)]} \sum_{i = 1}^{2^{N}} T (n, i),

(A10)

with

T (n, i) = \sum_{j \in J} Λ (i, n, j)

with

where J = {0, \dots, r_{1} - 1} \times \dots \times {0, \dots, r_{N} - 1}; r_{i} = 2 n_{i} p_{n}^{- 1}, i = 1, \dots, N .

and

Λ (1, n, j) = \sum_{\begin{matrix} i_{k} = 2 j_{k} p_{n} + 1 \\ k = 1, \dots, N \end{matrix}}^{2 j_{k} p_{n} + p_{n}} D_{i},

Λ (2, n, j) = \sum_{\begin{matrix} i_{k} = 2 j_{k} p_{n} + 1 \\ k = 1, \dots, N - 1 \end{matrix}}^{2 j_{k} p_{n} + p_{n}} \sum_{i_{N} = 2 j_{N} p_{n} + p_{n} + 1}^{(j_{N} + 1) p_{n}} D_{i},

Λ (3, n, j) = \sum_{\begin{matrix} i_{k} = 2 j_{k} p_{n} + 1 \\ k = 1, \dots, N - 2 \end{matrix}}^{2 j_{k} p_{n} + p_{n}} \sum_{i_{N - 1} = 2 j_{N - 1} p_{n} + p_{n} + 1}^{2 (j_{N - 1} + 1) p_{n}} \sum_{i_{N} = 2 j_{N} p_{n} + 1}^{2 j_{N} p_{n} + p_{n}} D_{i},

Λ (4, n, j) = \sum_{\begin{matrix} i_{k} = 2 j_{k} p_{n} + 1 \\ k = 1, \dots, N - 2 \end{matrix}}^{2 j_{k} p_{n}} \sum_{i_{N - 1} = 2 j_{N - 1} p_{n} + p_{n} + 1}^{2 (j_{N - 1} + 1) p_{n}} \sum_{i_{N} = 2 j_{N} p_{n} + p_{n} + 1}^{2 (j_{N} + 1) p_{n}} D_{i}, ⋮

Finally

Λ (2^{N}, n, j) = \sum_{\begin{matrix} i_{k} = 2 j_{k} p_{n} + p_{n} + 1 \\ k = 1, \dots, N \end{matrix}}^{2 (j_{k} + 1) p_{n}} D_{i}

Clearly,

T (n, 1)

is the sum of the random variables

D_{i}

over big blocks, whereas the other terms

T (n, i),

2 \leq i \leq 2^{N}

are sums over small blocks.

Furthermore, from (A10), we get, for all

η > 0

,

I P (| {\hat{E S}}_{N}^{*} (z, z) - I E [{\hat{E S}}_{N}^{*} (z, z)] | \geq η) \leq 2^{N} \max_{i = 1, \dots 2^{N}} I P (T (n, i) \geq η \hat{n} I E [F_{1} (x)]) .

So, the required result is based on the evaluation of the quantities

I P (T (n, i) \geq η \hat{n} I E [F_{1} (x)]), for all i = 1, \dots, 2^{N} .

For the sake of shortness, we treat only the case

i = 1

. The other case can be treated in the same manner. For the rest of the proof, we enumerate the

M = \prod_{k = 1}^{N} r_{k} = 2^{- N} \hat{n} p_{n}^{- N} \leq \hat{n} p_{n}^{- N}

random variables

Λ (1, n, j); j \in J

in the arbitrary way

Z_{1}, \dots Z_{M}

. Thus, for each

Z_{j}

, there exists a certain

j

in

J

such that

Z_{j} = \sum_{i \in I (1, n, j)} D_{i},

where

I (1, n, j) = \{i : 2 j_{k} p_{n} + 1 \leq i_{k} \leq 2 j_{k} p_{n} + p_{n}; k = 1, \dots N\} .

Clearly the subsets

I (1, n, j)

contain

p_{n}^{N}

sites and are far apart by a distance of

p_{n}

at least. So, under (P4) and (P5),

F (r_{n}^{- 1} d (x, A_{i})) B_{i}^{*} \leq C γ_{n} .

So, according to the Lemma of [8] Carbon et al. (2007) we obtain M independent random variables

Z_{1}^{*}, \dots Z_{M}^{*}

having the same low as

Z_{j = 1, \dots M}

and such that

\sum_{j = 1}^{r} I E | Z_{j} - Z_{j}^{*} | \leq 2 C γ_{n} M p_{n}^{N} ϕ (M - 1) p_{n}^{N}, p_{n}^{N}) ϕ (p_{n}) .

(A11)

Therefore,

I P (T (n, i) \geq η \hat{n} I E [F_{1} (x)]) \leq B_{1} (n) + B_{2} (n),

where

B_{1} (n) = I P (|\sum_{j = 1}^{M} Z_{j}^{*}| \geq \frac{M η \hat{n} I E [F_{1} (x)]}{2 M})

B_{2} (n) = I P (\sum_{j = 1}^{M} | Z_{j} - Z_{j}^{*} | \geq \frac{η \hat{n} I E [F_{1} (x)]}{2}) .

Concerning

B_{2} (n)

, we write

B_{2} (n) \leq 2 M γ_{n} p_{n}^{N} {(η \hat{n} I E [F_{1} (x)])}^{- 1} ϕ ((M - 1) p_{n}^{N}, p_{n}^{N}) ψ (p_{n}) .

Now, since

I E [F_{1} (x)] \leq C ϕ (z, r)

,

\hat{n} = 2^{N} M p_{n}^{N}

and

ϕ ((M - 1) p_{n}^{N}, p_{n}^{N}) \leq p_{n}^{N}

, we obtain for

η = η_{0} \sqrt{\frac{\ln \hat{n}}{\hat{n} ϕ (z, r)}}

B_{2} (n) \leq \hat{n} γ_{n} p_{n}^{N} {(\ln \hat{n})}^{- 1 / 2} {(\hat{n} ϕ (z, r))}^{- 1 / 2} ψ (p_{n}) .

Therefore, for

p_{n}

p_{n} = C {(\frac{\hat{n} ϕ (z, r)}{\ln \hat{n} γ_{n}^{2}})}^{1 / 2 N}

B_{2} (n) \leq \hat{n} ψ (p_{n}) .

We conclude

\sum_{n} B_{2} (n) < \infty .

Next, for

B_{1}

,

B_{1} (n) \leq 2 \exp (- \frac{{(η \hat{n} I E [F_{1} (x)])}^{2}}{M V a r [Z_{1}^{*}] + C η γ_{n} p_{n}^{N} \hat{n} I E [F_{1} (x)]}) .

(A12)

Furthermore,

V a r [Z_{1}^{*}] = V a r [\sum_{i \in I (1, n, 1)} D_{i}] .

As

I E [B_{i}^{p} | A_{i}] < \infty

, for

p > 2

, then

\begin{matrix} V a r [D_{i}^{k}] & \leq C I E [F_{i}^{2} B_{i}^{* 2}] \leq C I E [F_{i}^{2} B_{i}^{2}] \\ \leq C I E [F_{i}^{2} I E [B_{i}^{2} | A_{i}]] \\ \leq C I E [F_{i}^{2}] \leq C ϕ (z, r), \end{matrix}

since

I E [| B_{i} B_{j} | | A_{i} A_{j}] < \infty

we get

\begin{matrix} for all i \neq j C o v (D_{i}, D_{j}) & \leq C I E [F_{i} | B_{i}^{*} | F_{j} | B_{j}^{*} |] \\ \leq C I E [F_{i} F_{j} | B_{i} B_{j} |] \\ \leq C I E [F_{i} F_{j} I E [| B_{i} B_{j} | | A_{i} A_{j}]] \\ \leq C I E [F_{i} F_{j}] \leq C {(ϕ (z, r))}^{(a + 1) / a} (h) . \end{matrix}

Furthermore, as

I E [B_{i}^{p} | A_{i}] < \infty

\begin{matrix} for all i \neq j C o v (D_{i}, D_{j}) & \leq ∥ D_{i} ∥_{p}^{2} ψ^{1 - 2 / p} (∥ i - j ∥) \\ \leq C ∥ F_{i} B_{i}^{*} ∥_{p}^{2} ψ^{1 - 2 / p} (∥ i - j ∥) \\ \leq C ∥ F_{i} B_{i} ∥_{p}^{2} ψ^{1 - 2 / p} (∥ i - j ∥) \\ \leq C ∥ F_{i} ∥_{p}^{2} ψ^{1 - 2 / p} (∥ i - j ∥) \\ \leq C {(ϕ (z, r))}^{2 / p} (h) ψ^{1 - 2 / p} (∥ i - j ∥)) . \end{matrix}

Observe that

\sum_{i \in I (1, n, 1)} V a r [D_{i}] = O (p_{n}^{N} ϕ (z, r)) .

For a real sequence

d_{n}

tends to

+ \infty

we write

\begin{matrix} \sum_{i \neq j \in I (1, n, 1)} |C o v (D_{i}, D_{j})| & \leq \sum_{{i, j \in I (1, n, 1) ∥i - j∥ \leq d_{n}}} |C o v (D_{i}, D_{j})| \\ + \sum_{{i, j \in I (1, n, 1) ∥i - j∥ > d_{n}}} |C o v (D_{i}, D_{j})| \\ \leq C p_{n}^{N} ϕ (z, r) (d_{n}^{N} {(ϕ (z, r))}^{1 / a} \\ + d_{n}^{- N a} {(ϕ (z, r))}^{2 / p - 1} (h) \sum_{i : ∥ i ∥ \geq d_{n}} {∥i∥}^{N a} ψ^{1 - 2 / p} (∥i∥)) . \end{matrix}

Choosing

d_{n} = {(ϕ (z, r))}^{2 / N p (a + 1) - 1 / N a}

to

\sum_{i \neq j \in I (1, n, 1)} |C o v (D_{i}, D_{j})| \leq C p_{n}^{N} (ϕ (z, r))

So,

V a r [\sum_{i \in I (1, n, 1)} D_{i}] = O (p_{n}^{N} (ϕ (z, r))) .

We replace

V a r [Z_{1}^{*}] = O (p_{n}^{N} (ϕ (z, r)))

in (A12)

\begin{matrix} B_{1} (n) \leq \exp (- C (η_{0}) \ln \hat{n}) \end{matrix}

Finally, a good choice of

η_{0}

gives

\sum_{n} B_{1} (n) < \infty .

which completes the proof of the lemma. □

References

Cressie, N.A. Statistics for Spatial Data; Wiley: New York, NY, USA, 1993. [Google Scholar]
Diggle, P.; Ribeiro, P.J. Model-Based Geostatistics; Springer: New York, NY, USA, 2007. [Google Scholar]
Tran, L.T. Kernel density estimation on random fields. J. Multivar. Anal. 1990, 34, 37–53. [Google Scholar] [CrossRef]
Lu, Z.; Chen, X. Spatial kernel regression: Weak consistency. Statist. Probab. Lett. 2004, 68, 125–136. [Google Scholar] [CrossRef]
Biau, G.; Cadre, B. Nonparametric spatial prediction. Stat. Inference Stoch. Process. 2004, 7, 327–349. [Google Scholar] [CrossRef]
García-Soidán, P.H.; Febrero-Bande, M.; González-Manteiga, W. Nonparametric kernel estimation of an isotropic variogram. J. Stat. Plan. Inference 2004, 121, 65–92. [Google Scholar] [CrossRef]
Hallin, M.; Lu, Z.; Tran, L.T. Local linear spatial regression. Ann. Stat. 2004, 32, 2469–2500. [Google Scholar] [CrossRef]
Carbon, M.; Francq, C.; Tran, L.T. Kernel regression estimation for random fields. J. Stat. Plan. Inference 2007, 137, 778–798. [Google Scholar] [CrossRef]
Xu, R.; Wang, J. L¹-estimation for spatial nonparametric regression. J. Nonparametr. Stat. 2008, 20, 523–537. [Google Scholar] [CrossRef]
Li, J.; Tran, L.T. Nonparametric estimation of conditional expectation. J. Stat. Plan. Inference 2009, 139, 164–175. [Google Scholar] [CrossRef]
Dabo-Niang, S.; Yao, A.F. Kernel regression estimation for continuous spatial processes. Math. Meth. Stat. 2007, 16, 298–317. [Google Scholar] [CrossRef]
Laksaci, A.; Maref, F. Estimation non paramétrique de quantiles conditionnels pour des variables fonctionnelles spatialement dépendantes. C. R. Math. 2009, 347, 1075–1080. [Google Scholar] [CrossRef]
Mohammedi, M.; Bouzebda, S.; Laksaci, A. The consistency and asymptotic normality of the kernel type expectile regression estimator for functional data. J. Multivar. Anal. 2021, 181, 104673. [Google Scholar] [CrossRef]
Aneiros, G.; Cao, R.; Fraiman, R.; Genest, C.; Vieu, P. Recent advances in functional data analysis and high-dimensional statistics. J. Multivar. Anal. 2019, 170, 3–9. [Google Scholar] [CrossRef]
Almanjahie, I.M.; Bouzebda, S.; Kaid, Z.; Laksaci, A. The local linear functional kNN estimator of the conditional expectile: Uniform consistency in number of neighbors. Metrika 2024, 1–29. [Google Scholar] [CrossRef]
Litimein, O.; Laksaci, A.; Ait-Hennani, L.; Mechab, B.; Rachdi, M. Asymptotic normality of the local linear estimator of the functional expectile regression. J. Multivar. Anal. 2024, 202, 105281. [Google Scholar] [CrossRef]
Artzner, P.; Delbaen, F.; Eber, J.M.; Heath, D. Coherent measures of risk. Math. Financ. 1999, 9, 203–228. [Google Scholar] [CrossRef]
Righi, M.B.; Ceretta, P.S. A comparison of expected shortfall estimation models. J. Econ. Bus. 2015, 78, 14–47. [Google Scholar] [CrossRef]
Lazar, E.; Pan, J.; Wang, S. On the estimation of Value-at-Risk and Expected Shortfall at extreme levels. J. Commod. Mark. 2024, 34, 100391. [Google Scholar] [CrossRef]
Moutanabbir, K.; Bouaddi, M. A new non-parametric estimation of the expected shortfall for dependent financial losses. J. Stat. Plan. Inference 2024, 232, 106151. [Google Scholar] [CrossRef]
Scaillet, O. Nonparametric estimation and sensitivity analysis of expected shortfall. Math. Financ. Int. J. Math. Stat. Financ. Econ. 2004, 14, 115–129. [Google Scholar] [CrossRef]
Ferraty, F.; Quintela-Del-Río, A. Conditional VAR and expected shortfall: A new functional approach. Econom. Rev. 2016, 35, 263–292. [Google Scholar] [CrossRef]
Ait-Hennani, L.; Kaid, Z.; Laksaci, A.; Rachdi, M. Nonparametric estimation of the expected shortfall regression for quasi-associated functional data. Mathematics 2022, 10, 4508. [Google Scholar] [CrossRef]
Waltrup, L.S.; Sobotka, F.; Kneib, T.; Kauermann, G. Expectile and quantile regression—David and Goliath? Stat. Model. 2015, 15, 433–456. [Google Scholar] [CrossRef]
Bellini, F.; Di Bernardino, E.D. Risk management with expectiles. Eur. J. Financ. 2017, 23, 487–506. [Google Scholar] [CrossRef]
Bellini, F.; Negri, I.; Pyatkova, M. Backtesting VaR and expectiles with realized scores. Stat. Methods Appl. 2019, 28, 119–142. [Google Scholar] [CrossRef]
Farooq, M.; Steinwart, I. Learning rates for kernel-based expectile regression. Mach. Learn. 2019, 108, 203–227. [Google Scholar] [CrossRef]
Efron, B. Regression percentiles using asymmetric squared error loss. Stat. Sin. 1991, 1, 93–125. [Google Scholar]
Sobotka, F.; Kneib, T. Geoadditive expectile regression. Comput. Stat. Data Anal. 2012, 56, 755–767. [Google Scholar] [CrossRef]
Jiang, C.; Jiang, M.; Xu, Q.; Huang, X. Expectile regression neural network model with applications. Neurocomputing 2017, 247, 73–86. [Google Scholar] [CrossRef]
Daouia, A.; Girard, S.; Stupfler, G. Estimation of tail risk based on extreme expectiles. J. R. Stat. Soc. Ser. B Stat. Methodol. 2018, 80, 263–292. [Google Scholar] [CrossRef]
Maume-Deschamps, V.; Rullière, D.; Said, K. Multivariate extensions of expectiles risk measures. Depend. Model. 2017, 5, 20–44. [Google Scholar] [CrossRef]
Maume-Deschamps, V.; Rullière, D.; Said, K. Asymptotics multivariate expectiles. arXiv 2018, arXiv:1704.07152v2. [Google Scholar]
Girard, S.; Stupfler, G.; Usseglio-Carleve, A. Functional estimation of extreme conditional expectiles. Econom. Stat. 2022, 21, 131–158. [Google Scholar] [CrossRef]
Goia, A.; Vieu, P. An introduction to recent advances in high/infinite dimensional statistics. J. Multivar. Anal. 2016, 170, 1–6. [Google Scholar] [CrossRef]
Yu, D.; Pietrosanu, M.; Mizera, I.; Jiang, B.; Kong, L.; Tu, W. Functional Linear Partial Quantile Regression with Guaranteed Convergence for Neuroimaging Data Analysis. Stat. Biosci. 2024, 1–17. [Google Scholar] [CrossRef]
Di Bernardino, E.; Laloe, T.; Pakzad, C. Estimation of extreme multivariate expectiles with functional covariates. J. Multivar. Anal. 2024, 202, 105292. [Google Scholar] [CrossRef]
Guyon, X. Estimation d’un champ par pseudo-vraisemblance conditionnelle: Etude asymptotique et application au cas Markovien. In Proceedings of the Sixth Franco-Belgian Meeting of Statisticians, Bruxelles, Belguim, 14–15 November 1987. [Google Scholar]
Ferraty, F.; Vieu, P. Nonparametric Functional Data Analysis: Theory and Practice; Springer Series in Statistics; Springer: New York, NY, USA, 2006. [Google Scholar]
Li, W.V.; Shao, Q.M. Gaussian processes: Inequalities, small ball probabilities and applications. Hanbook Stat. 2001, 19, 533–597. [Google Scholar]
Liebscher, E. Estimation of the density and the regression function under mixing conditions. Stat. Decis. 2001, 19, 9–26. [Google Scholar] [CrossRef]
Rachdi, M.; Laksaci, A.; Al-Kandari, N.M. Expectile regression for spatial functional data analysis (sFDA). Metrika 2022, 85, 627–655. [Google Scholar] [CrossRef]
Hallin, M.; Lu, Z.; Yu, K. Local linear spatial quantile regression. Bernoulli 2009, 15, 659–686. [Google Scholar] [CrossRef]
Bayer, S.; Dimitriadis, T. Regression-Based Expected Shortfall Backtesting. J. Financ. Econom. 2022, 20, 437–471. [Google Scholar] [CrossRef]
Hassani, H.; Silva, E.S. A Kolmogorov-Smirnov based test for comparing the predictive accuracy of two sets of forecasts. Econometrics 2015, 3, 590–609. [Google Scholar] [CrossRef]

Figure 1. The ARCH process for

α^{'} = 0.05

and

ρ = 0.8

.

Figure 1. The ARCH process for

α^{'} = 0.05

and

ρ = 0.8

.

Figure 2. The SO₂ and O₃ daily curves.

Figure 3. Comparison of the

M S E

values between FESR-expectile and FESR-VaR without detrending cases. The black line represents

\hat{R E A_{p}}

, and the red line represents

\tilde{R E S_{p}}

.

Figure 3. Comparison of the

M S E

values between FESR-expectile and FESR-VaR without detrending cases. The black line represents

\hat{R E A_{p}}

, and the red line represents

\tilde{R E S_{p}}

.

Figure 4. Comparison of the

M S E

values between FESR-expectile and FESR-VaR with detrending cases. The black line represents

\hat{R E A_{p}}

, and the red line represents

\tilde{R E S_{p}}

.

Figure 4. Comparison of the

M S E

values between FESR-expectile and FESR-VaR with detrending cases. The black line represents

\hat{R E A_{p}}

, and the red line represents

\tilde{R E S_{p}}

.

Table 1. Comparison results.

Model	n1	n2	$σ$	$ϕ$	SWM	MSE (0.01)	MSE (0.05)	MSE (0.5)	MSE (0.90)
M1	20	50	0.09	0.03	Queen	0.023	0.018	0.014	0.026
	50	30	0.09	0.03	Bishop	0.034	0.027	0.018	0.032
	20	30	0.79	0.93	Bishop	0.042	0.032	0.026	0.045
	20	50	0.09	0.03	Rook	0.042	0.020	0.018	0.037
	50	30	0.79	0.03	Rook	0.021	0.016	0.022	0.031
M2	20	50	0.09	0.03	Queen	0.045	0.036	0.028	0.044
	50	30	0.09	0.03	Bishop	0.071	0.053	0.026	0.059
	20	30	0.75	0.93	Bishop	0.096	0.052	0.048	0.105
	20	50	0.09	0.03	Rook	0.086	0.054	0.032	0.049
	50	30	0.79	0.03	Rook	0.039	0.025	0.047	0.055

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Alamari, M.B.; Almulhim, F.A.; Kaid, Z.; Laksaci, A. Spatio-Functional Nadaraya–Watson Estimator of the Expectile Shortfall Regression. Axioms 2024, 13, 678. https://doi.org/10.3390/axioms13100678

AMA Style

Alamari MB, Almulhim FA, Kaid Z, Laksaci A. Spatio-Functional Nadaraya–Watson Estimator of the Expectile Shortfall Regression. Axioms. 2024; 13(10):678. https://doi.org/10.3390/axioms13100678

Chicago/Turabian Style

Alamari, Mohammed B., Fatimah A. Almulhim, Zoulikha Kaid, and Ali Laksaci. 2024. "Spatio-Functional Nadaraya–Watson Estimator of the Expectile Shortfall Regression" Axioms 13, no. 10: 678. https://doi.org/10.3390/axioms13100678

APA Style

Alamari, M. B., Almulhim, F. A., Kaid, Z., & Laksaci, A. (2024). Spatio-Functional Nadaraya–Watson Estimator of the Expectile Shortfall Regression. Axioms, 13(10), 678. https://doi.org/10.3390/axioms13100678

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Spatio-Functional Nadaraya–Watson Estimator of the Expectile Shortfall Regression

Abstract

1. Introduction

2. Model and Estimator

3. Main Asymptotic Result

4. Simulated Data

5. Real Data Application

6. Conclusions and Prospects

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI