Improved Confidence Intervals for Expectiles

Penev, Spiridon; Maesono, Yoshihiko

doi:10.3390/math13030510

Open AccessArticle

Improved Confidence Intervals for Expectiles

by

Spiridon Penev

^1,*,†

and

Yoshihiko Maesono

^2,†

¹

Department of Statistics, The University of New South Wales Sydney, Kensington, NSW 2052, Australia

²

Department of Mathematics, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, Tokyo 112-8551, Japan

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Mathematics 2025, 13(3), 510; https://doi.org/10.3390/math13030510

Submission received: 14 December 2024 / Revised: 25 January 2025 / Accepted: 31 January 2025 / Published: 4 February 2025

(This article belongs to the Special Issue Nonparametric and Semiparametric Approaches in Statistical Inference and Data Science: Theory, Methods and Applications)

Download

Browse Figures

Versions Notes

Abstract

Expectiles were introduced to statistics around 40 years ago, but have recently gained renewed interest due to their relevance in financial risk management. In particular, the 2007–2009 global financial crisis highlighted the need for more robust risk evaluation tools, leading to the adoption of inference methods for expectiles. While first-order asymptotic inference results for expectiles are well established, higher-order asymptotic results remain underdeveloped. This study aims to fill that gap by deriving higher-order asymptotic results for expectiles, ultimately improving the accuracy of confidence intervals. The paper outlines the derivation of the Edgeworth expansion for both the standardized and studentized versions of the kernel-based estimator of the expectile, using large deviation results on U-statistics. The expansion is then inverted to construct more precise confidence intervals for the expectile. These theoretical results were applied to moderate sample sizes ranging from 20 to 200. To demonstrate the advantages of this methodology, an example from risk management is presented. The enhanced confidence intervals consistently outperformed those based on the first-order normal approximation. The methodology introduced in this paper can also be extended to other contexts.

Keywords:

nonparametric statistics; expectile; kernel; smoothing; risk; Edgeworth expansion

MSC:

62G05; 62G20

1. Introduction

Quantile-based inference has long been a topic of interest in risk evaluation in the financial industry and other contexts. Notable applications include the following: the widely used risk measure value at risk (VaR), which corresponds directly to a quantile; coherent risk measures, which are often derived from quantile transformations; and quantile regression, which has been employed as a tool for making portfolio investment decisions.

For a continuous random variable X with a cumulative distribution function (CDF)

F (x),

density function

f (x)

, and

E | X | < \infty

the p-th quantile is defined as

Q (p) = inf {x : F (x) \geq p} .

Given a sample

X_{1}, X_{2}, \dots, X_{n}

from

F,

the simplest estimator of

Q (p)

is the sample quantile. Under mild conditions, it is asymptotically normal, but its asymptotic variance

σ^{2} = \frac{p (1 - p)}{f^{2} (Q (p))}

is large, particularly in the tails. Hence, it behaves poorly for small sample sizes, and alternative estimators are needed. An obvious choice is the kernel quantile estimator

{\hat{Q}}_{p, h_{n}} = \frac{1}{h_{n}} \int_{0}^{1} F_{n}^{- 1} (x) K (\frac{x - p}{h_{n}}) d x,

(1)

where

F_{n}^{- 1} (x)

is the inverse of the empirical distribution function,

K (.)

is a suitably chosen kernel, and

h_{n}

is a bandwidth. Conditions on the bandwidth and the kernel must be imposed to ensure consistency, asymptotic normality, and the higher-order accuracy of

{\hat{Q}}_{p, h_{n}} .

Quantile-based inference has also been of significant interest to the authors of this paper. We derived in [1] a higher-order expansion for the standardized kernel quantile estimator, thus extending the long-standing flagship results obtained in [2,3]. Our expansion was non-trivial, because the influence of the bandwidth made the balance between bias and variance delicate.

In [4], we derived an Edgeworth expansion for the studentized version of the kernel quantile estimator, where the variance of the estimator was estimated using the jackknife method. This result is particularly important for practical applications, as the variance is rarely known in real-world scenarios. By inverting the Edgeworth expansion, we achieved a uniform improvement in coverage accuracy compared to the inversion of the asymptotically normal approximation. Our results are applicable in improving the inference for quantiles when the sample sizes are small-to-moderate. This situation often occurs in practice. For example, if monthly loss data are used in risk analysis, then the accumulated data for 10 years amounts to 120 observations. Another example is that of the daily data gathered from the stock market, which has approximately 250 active trading days each year.

The global financial crisis (GFC) from mid-2007 to early 2009 had a profound effect on many banks around the world. There were numerous reasons for the crisis; however, in mathematical terms, one important reason was that the banks’ risk estimates based on the widespread measure value at risk (VaR) happened to be very inaccurate. The key shortcomings of VaR are that it assumes normal market conditions, tends to ignore the tail risk, and shows a tendency to reduce risk estimates during calm periods (encouraging leverage) and increase them during volatile periods (forcing deleveraging). These limitations were seriously scrutinized after the GFC, with bank regulators proposing alternative risk measures. As a result, the seminal paper of Newey and Powell [5] introducing expectiles was rediscovered, and many advantageous properties of expectiles were noted and extended.

The coherency requirement for a risk measure in finance was first formulated in the seminal paper by [6] and has been widely used since then. We note that VaR is not a coherent risk measure, mainly because it does not satisfy the subadditivity property. The average value at risk (AVaR) does satisfy the subadditivity property and turns out to be a coherent risk measure. It was considered around 2009 as an alternative risk measure by the Basel Committee of Banking Supervision. Meanwhile, academic research on the properties of risk measures continued. The elicitability property was pointed out as another essential requirement in [7]. The latter property is an important requirement associated with effective backtesting. It then turned out that expectiles “have it all”, as they are simultaneously coherent and elicitable. Moreover, it has been shown (for example, in [8]) that expectiles are the only law-invariant risk measures that are simultaneously coherent and elicitable. Furthermore, ref. [9] noted that another desirable property, the isotonicity with respect to the usual stochastic order, also holds.

Due to the above properties, expectiles became widely adopted in risk management after the GFC. The asymptotic properties of expectile regression estimators and test statistics were investigated more deeply following the paper by [5]. Asymptotic properties of the sample expectiles, such as uniform consistency and asymptotic normality, were shown under weak assumptions, and a central limit theorem for the expectile process was proven in [10]. Expectile estimators for heavy-tailed distributions were also investigated in the same paper. Under strict stationarity assumptions, the authors in [11] showed several first-order asymptotic properties, such as consistency, asymptotic normality, and qualitative robustness. In [12], the authors compared the merits of estimators in the quantile regression and expectile regression models.

Less attention has been paid in the literature to confidence interval construction, with the only exception, to our knowledge, being the paper by [13]. Again, these intervals are constructed based on first-order asymptotics using asymptotic normality.

From a practical point of view, ref. [14] is a review paper which discusses known properties of expectiles and their financial meaning, and which presents real-data examples. The paper also refines some of the results in [15]. Another similar review discussing the regulatory implementation of expectiles is [16].

While first-order asymptotic inference results for expectiles are well established, developing higher-order asymptotics is more challenging.

It was natural for us, therefore, to turn to expectiles and to propose methods for improved inference about expectiles for small-to-moderate sample sizes. The methodology to achieve this goal is to derive the higher-order Edgeworth expansion for both the standardized and studentized versions of the kernel-based estimator of the expectile. By inverting the expansion, we can construct improved confidence intervals for the expectile. This article suggests this methodology and illustrates its effectiveness. The article is organized as follows. In Section 2, we introduce some notations, definitions, and auxiliary statements needed in the subsequent sections. Section 3 presents our main results about the Edgeworth expansion of the asymptotic distribution of the estimator. This section is subdivided into three subsections. The first subsection deals with the standardized kernel-based expectile estimator. The second subsection discusses the related results for the studentized kernel-based expectile estimator. Whilst the first subsection is mainly of theoretical interest, the results of the second subsection can be directly applied to derive more accurate confidence intervals for the expectile of the population for small-to-moderate samples. The third subsection discusses a Cornish–Fisher-type approximation for the quantile of the kernel-based expectile estimator. The application for accurate confidence interval construction presents the main purpose of our methodology. Its efficiency is illustrated numerically in Section 4. Section 5 summarizes our findings. The technical results and proofs of the main statements of the paper are postponed to Appendix A.

2. Materials and Methods

From a methodological standpoint,

{\hat{Q}}_{p, h_{n}}

is an L-estimator: it can be written as a weighted sum of the order statistics

X_{(i)}, i = 1, 2, \dots, n :

{\hat{Q}}_{p, h_{n}} = \sum_{i = 1}^{n} v_{i, n} X_{(i)}, v_{i, n} = \frac{1}{h_{n}} \int_{\frac{i - 1}{n}}^{\frac{i}{n}} K (\frac{x - p}{h_{n}}) d x .

(2)

Consider a random variable

X \in L^{2} (Ω, F, P) .

Using the notations

x_{+} = max (x, 0),

x_{-} = max (- x, 0),

Newey and Powel introduced in [5] the expectile

e_{τ}

as the minimizer of the asymmetric quadratic loss,

e_{τ} (X) = {argmin}_{y \in R} {[τ | | (X - y)}_{+} {| |}_{2}^{2} + {(1 - τ) | | (X - y)}_{-} {| |}_{2}^{2}],

(3)

and we realize that its empirical variant can also be represented as an L-statistic. When

τ = 1 / 2

we obtain

e_{1 / 2} (X) = E (X)

, which means that the expectiles can be interpreted as an asymmetric generalization of the mean. In addition, it has been shown in several papers (see, for example, [14]) that the so-called expectile-VaR (

E V a R_{τ} = - e_{τ} (X)

is a coherent risk measure when

τ \in (0, 1 / 2]

, as it satisfies the four coherency axioms from [6]. Any value of

τ \in (0, 1)

can be used in the definition of the expectile but, for the reason mentioned in the previous sentence, we will assume that

τ \in (0, 1 / 2]

in the theoretical developments of this paper.

The asymptotic properties of L-statistics are usually discussed by first decomposing them into a U-statistic plus a small-order remainder term and then applying the asymptotic theory of the U-statistic. Initially, the L statistic is written as

\int_{0}^{1} F_{n}^{- 1} (u) J (u) d u

, where

F_{n}

is the empirical distribution function and the score function

J (u)

does not involve

n .

For the presentation (2), however, such a decomposition is impossible as the “score function” becomes a delta function in the limit. Therefore, a novel dedicated approach is needed. In the case of quantiles, details about such an approach are given in [4]. Our current paper shows how the issue can also be resolved in the case of expectiles. The main tools in our derivations are some large deviation results on U-statistics from [17] and standard asymptotic results from [18].

Remark 1.

We mention, in passing, that in the paper by [19] it is shown that expectiles of a distribution F are in a one-to-one correspondence to quantiles of another distribution G that is related to F by an explicit formula. It was tempting to use this relation to utilize our results from [4] for constructing confidence intervals for the expectiles of F. We examined this option and realized that the correspondence is quite complicated, involving functionals of F that need to be estimated by the data. Our conclusion was that proceeding this way was not an option for constructing precise confidence intervals for the expectiles. Hence, our way to proceed is to deal directly from the very beginning with the definition of the expectile of

F .

We start with some initial notations, definitions, and auxiliary statements.

We consider a sample

X_{1}, \dots, X_{n}

of n independently and identically distributed random variables with density and cumulative distribution functions

f, F,

respectively.

Let us define

I_{τ} (x, y) = τ (y - x) I (y \geq x) - (1 - τ) (x - y) I (y < x)

where

I (\cdot)

is an indicator function. Looking at the original definition (3), we can define the (true theoretical) expectile

y^{*}

as a solution to the equation

E [I_{τ} (y^{*}, X_{1})] = 0 .

(4)

Using the relation

{(y - x)}_{+} = {(x - y)}_{-}

, we realize that the defining Equation (4) leads to the same solution as the defining Equation (2) in [14] or the defining Equation (2) in [10].

As discussed in [10], the

τ

-expectile

y^{*}

satisfies the Equation

τ \int_{y^{*}}^{\infty} {1 - F (s)} d s - (1 - τ) \int_{- \infty}^{y^{*}} F (s) d s = 0 .

(5)

Using integration by parts, we have the following proposition:

Proposition 1.

Assume that

E | X_{1} | < \infty

. Then, we have

τ (μ - y^{*}) - (1 - 2 τ) F^{[2]} (y^{*}) = 0,

(6)

where

μ = E (X_{1})

and

F^{[2]} (x) = \int_{- \infty}^{x} F (s) d s .

Thus, an estimator of the expectile is given by a solution of the equation

τ (\bar{X} - \tilde{y}) - (1 - 2 τ) \int_{- \infty}^{\tilde{y}} F_{n} (s) d s = 0,

where

\bar{X}

is the sample mean and

F_{n} (\cdot)

is the empirical distribution function.

Holzmann and Klar in [10] showed the uniform consistency and asymptotic normality of

\tilde{y}

. In this paper, we study the higher-order asymptotic properties of the expectile estimator. To study the higher-order asymptotics, we use a kernel-type estimator

\hat{F} (\cdot)

of the distribution function

F (\cdot)

, instead of

F_{n} (\cdot)

.

Let us define kernel estimators of the density and distribution function:

\begin{matrix} \hat{f} (x) & = & \frac{1}{n h} \sum_{i = 1}^{n} K (\frac{x - X_{i}}{h}), \\ \hat{F} (x) & = & \frac{1}{n} \sum_{i = 1}^{n} W (\frac{x - X_{i}}{h}) \end{matrix}

where

K (\cdot)

is a kernel function and

W (\cdot)

is an integral of

K (\cdot)

. We assume that

(a 1) K (s) \geq 0, \int_{- \infty}^{\infty} K (s) = 1, K (- s) = K (s),

and

W (x) = \int_{- \infty}^{x} K (s) d s .

Here, h is a bandwidth where

h \to 0, n h \to \infty

. Hereafter, we assume that the bandwidth

h = O (n^{- \frac{1}{4}} {(log n)}^{- 1})

.

As in the case of quantile estimation, we are using a kernel-smoothed estimator of the cumulative distribution function in the construction of the expectile estimator. The reason for switching to the kernel-smoothed version of the empirical distribution function in the definition of our expectile estimator is that only for this version is it possible to show the validity of the Edgeworth expansion. As discussed in detail in [4], if we use a kernel estimator with an informed choice of bandwidth and a suitable kernel then the resulting expectile estimator can be easily studentized; the Edgeworth expansion up to order

o (n^{- 1 / 2})

for the studentized version will be derived and the theoretical quantities involved in the expansion can be shown to be easily estimated. In addition, a Cornish–Fisher inversion can be used to construct confidence intervals for the expectile (which is the main goal of the inference in this paper). The resulting confidence intervals are more precise than the intervals obtained via inversion of the normal approximation and can be used to improve the coverage accuracy for moderate sample sizes.

Hence, from now on we will discuss the higher-order asymptotic properties of the estimator

\hat{y}

of

y^{*}

that satisfy

τ \int_{\hat{y}}^{\infty} {1 - \hat{F} (s)} d s - (1 - τ) \int_{- \infty}^{\hat{y}} \hat{F} (s) d s = 0 .

Let us define

A (t) = \int_{- \infty}^{t} W (s) d s, {\hat{F}}^{[2]} (s) = \frac{1}{n} \sum_{i = 1}^{n} h A (\frac{s - X_{i}}{h}) .

Then, similarly to Proposition 1, we have the following Proposition:

Proposition 2.

If the kernel function satisfies the condition (a1), the kernel expectile estimator

\hat{y}

is given by the solution of the equation

τ (\bar{X} - \hat{y}) - (1 - 2 τ) {\hat{F}}^{[2]} (\hat{y}) = 0 .

(7)

For our further discussion, we define the following quantities:

Definition 1.

\begin{matrix} u_{1} (X_{i}) & = & - (1 - 2 τ) \{h A (\frac{y^{*} - X_{i}}{h}) - F^{[2]} (y^{*})\} + τ (X_{i} - μ), \\ b_{1 n} & = & - (1 - 2 τ) \{E [h A (\frac{y^{*} - X_{i}}{h})] - F^{[2]} (y^{*})\}, \\ {\tilde{u}}_{1} (X_{i}) & = & u_{1} (X_{i}) - b_{1 n}, \\ ξ_{n}^{2} & = & E ({\tilde{u}}_{1}^{2} (X_{1})), \\ b_{2 n} & = & E [W (\frac{y^{*} - X_{i}}{h})] - F (y^{*}), \\ \tilde{W} (X_{i}) & = & W (\frac{y^{*} - X_{i}}{h}) - F (y^{*}) - b_{2 n}, \\ b_{3 n} & = & E [\frac{1}{h} K (\frac{y^{*} - X_{i}}{h})] - f (y^{*}), \\ \tilde{K} (X_{i}) & = & \frac{1}{h} K (\frac{y^{*} - X_{i}}{h}) - f (y^{*}) - b_{3 n}, \\ C & = & τ + (1 - 2 τ) F (y^{*}) \\ \tilde{C} & = & τ + (1 - 2 τ) \hat{F} (y^{*}), \\ \hat{C} & = & τ + (1 - 2 τ) \hat{F} (\hat{y}) . \end{matrix}

Note that

E [{\tilde{u}}_{1} (X_{i})] = E [\tilde{W} (X_{i})] = E [\tilde{K} (X_{i})] = 0

holds and that

b_{1 n}, b_{2 n}

, and

b_{3 n}

denote the biases of the kernel estimators of

F^{[2]} (y^{*}), F (y^{*})

, and

f (y^{*}) .

Using Equations (6) and (7), we have

τ (\bar{X} - μ) - τ (\hat{y} - y^{*}) - (1 - 2 τ) \{{\hat{F}}^{[2]} (\hat{y}) - F^{[2]} (y^{*})\} = 0 .

(8)

Since we intend to discuss the Edgeworth expansion with a residual term

o (n^{- 1 / 2})

, we will obtain the asymptotic representation with residual term

o_{L} (n^{- 1 / 2})

, where

P \{| o_{L} (n^{- 1 / 2}) | \geq n^{- 1 / 2} ε_{n}\} = o (n^{- 1 / 2})

as

ε_{n} \to 0

. When we obtain the Edgeworth expansion until the order

n^{- 1 / 2}

, it follows from Esseen’s smoothing lemma that we can ignore the terms of order

o_{L} (n^{- 1 / 2})

.

Similarly to the

o_{L} (n^{- 1 / 2})

notation, we will also be using the

o_{l} (n^{- 1 / 2})

notation that follows the definition

P \{| o_{l} (n^{- 1 / 2}) | \geq n^{- 1 / 2} {(log n)}^{- 1}\} = o (n^{- 1 / 2}) .

Note that we can also ignore the

o_{l} (n^{- 1 / 2})

terms when we discuss the Edgeworth expansion with residual term

o (n^{- 1 / 2})

.

3. Results

3.1. Edgeworth Expansion for the Standardized Expectile

In this subsection, we will obtain an asymptotic representation of the standardized expectile

\frac{\sqrt{n} C}{ξ_{n}} (\hat{y} - y^{*})

and its Edgeworth expansion. This expansion is of theoretical interest mainly, as the constant C and the normalizing quantity

ξ_{n}

involved depend on parameters of the unknown population distribution. Later, in Section 3.2 we will formulate the related but more practicable expansions of the studentized expectile. They do not depend on unknown population parameters and can also be inverted to deliver accurate asymptotic confidence intervals for the expectile.

We assume that the following conditions hold:

\begin{matrix} (a 2) & C \neq 0, \\ (a 3) & E | X_{1} {- μ |}^{5 + δ} < \infty . \end{matrix}

Theorem 1.

We assume the conditions

(a 1), (a 2), (a 3)

hold. Furthermore, we assume that

K (\cdot)

, the derivatives

f^{'} (\cdot)

, and

K^{'} (\cdot)

are bounded, and that

\int s^{4} K (s) d s < \infty

. Then:

(1): For the standardized expectile estimator, we have

\begin{matrix} \frac{\sqrt{n} C}{ξ_{n}} (\hat{y} - y^{*}) \\ = & n^{1 / 2} \frac{b_{1 n}}{ξ_{n}} + n^{- 1 / 2} \frac{ν_{B}}{ξ_{n}} + n^{- 1 / 2} \sum_{i = 1}^{n} \frac{{\tilde{u}}_{1} (X_{i})}{ξ_{n}} \\ + n^{- 3 / 2} \sum_{1 \leq i < j \leq n} \frac{ν_{2} (X_{i}, X_{j})}{ξ_{n}} + o_{L} (n^{- 1 / 2}) \end{matrix}

where

\begin{matrix} ν_{B} & = & - (1 - 2 τ) \{\frac{f (y^{*})}{2 C^{2}} E [{\tilde{u}}_{1}^{2} (X_{1})] + \frac{1}{C} E [\tilde{W} (X_{1}) {\tilde{u}}_{1} (X_{1})]\} \\ ν_{2} (X_{i}, X_{j}) & = & - (1 - 2 τ) \{\frac{f (y^{*})}{C^{2}} {\tilde{u}}_{1} (X_{i}) {\tilde{u}}_{1} (X_{j}) \\ + \frac{1}{C} [\tilde{W} (X_{i}) {\tilde{u}}_{1} (X_{j}) + \tilde{W} (X_{j}) {\tilde{u}}_{1} (X_{i})]\} . \end{matrix}

(2): The asymptotic variance of $\hat{y}$ is equal to $\frac{1}{C^{2}} V a r [\tilde{u} (X_{1})]$ , where

V a r [\tilde{u} (X_{1})] = (2 - 4 τ) F^{[3]} (y^{*}) + τ^{2} {(y^{*} - μ)}^{2} + τ^{2} σ^{2} + O (h),

σ^{2} = V (X_{1})

, and

F^{[3]} (x) = \int_{- \infty}^{x} F^{[2]} (s) d s .

(3): The Edgeworth expansion is given by

P \{\frac{\sqrt{n} C}{ξ_{n}} (\hat{y} - y^{*}) \leq x\} = Q (x - \frac{n^{1 / 2} b_{1 n}}{ξ_{n}}) + o (n^{- 1 / 2})

where

κ = \frac{1}{ξ_{n}^{3}} E [{\tilde{u}}_{1}^{3} (X_{1})] + \frac{3}{ξ_{n}^{3}} E [{\tilde{u}}_{1} (X_{1}) {\tilde{u}}_{1} (X_{2}) ν_{2} (X_{1}, X_{2})]

and

Q (x) = Φ (x) - ϕ (x) n^{- 1 / 2} \{\frac{ν_{B}}{ξ_{n}} + \frac{κ (x^{2} - 1)}{6}\} .

(9)

Remark 2.

It follows from the results of Holzmann and Klar in [10] that

\begin{matrix} E [I_{τ}^{2} (y^{*}, X_{1})] \\ = & E [τ^{2} {(X_{1} - y^{*})}^{2} I_{(X_{1} \geq y^{*})} + {(1 - τ)}^{2} {(y^{*} - X_{1})}^{2} I_{(X_{1} < y^{*})}] \\ = & τ^{2} \int_{y^{*}}^{\infty} {(s - y^{*})}^{2} f (s) d s + {(1 - τ)}^{2} \int_{- \infty}^{y^{*}} {(y^{*} - s)}^{2} f (s) d s \\ = & τ^{2} \int_{- \infty}^{\infty} {(s - y^{*})}^{2} f (s) d s + (1 - 2 τ) \int_{- \infty}^{y^{*}} {(y^{*} - s)}^{2} f (s) d s \\ = & τ^{2} \int_{- \infty}^{\infty} {(s - μ + μ - y^{*})}^{2} f (s) d s \\ + (1 - 2 τ) \{{[{(y^{*} - s)}^{2} F (s)]}_{- \infty}^{y^{*}} + 2 \int_{- \infty}^{y^{*}} (y^{*} - s) F (s) d s\} \\ = & τ^{2} σ^{2} + τ {(μ - y^{*})}^{2} + (1 - 2 τ) \{{[2 (y^{*} - s) F^{[2]} (s)]}_{- \infty}^{y^{*}} + 2 \int_{- \infty}^{y^{*}} F^{[2]} (s) d s\} \\ = & τ^{2} σ^{2} + τ^{2} {(μ - y^{*})}^{2} + (2 - 4 τ) F^{[3]} (y^{*}) . \end{matrix}

Compared with the result of Corollary 4 on p. 2359 in [10], we realized that, as expected, the first-order approximations of the asymptotic variances of the kernel-smoothed expectile estimator in our paper and of the empirical distribution-based estimator discussed in [10] coincide.

It is easy to see that

\begin{matrix} \frac{3}{ξ_{n}^{3}} E [{\tilde{u}}_{1} (X_{1}) {\tilde{u}}_{1} (X_{2}) ν_{2} (X_{1}, X_{2})] \\ = & - \frac{3 (1 - 2 τ)}{ξ_{n}^{3}} \{\frac{ξ_{n}^{4} f (y^{*})}{C^{2}} + \frac{ξ_{n}^{2}}{C} E [\tilde{W} (X_{1}) {\tilde{u}}_{1} (X_{1})]\} \\ = & - 3 (1 - 2 τ) \{\frac{ξ_{n} f (y^{*})}{C^{2}} + \frac{1}{C ξ_{n}} E [\tilde{W} (X_{1}) {\tilde{u}}_{1} (X_{1})]\} \end{matrix}

and this relation can be used to write down the formula for

κ

in (9) in an alternative way.

The proof of the above Theorem will be presented in Appendix A. It relies heavily on some large deviations results for the U-statistics that we summarized in Lemma A1 (whose formulations and proof are also postponed to Appendix A). Using the results of the latter Lemma, we can obtain the evaluation of the order of the asymptotic approximations and expansions of the differences

(\hat{y} - y^{*}), {(\hat{y} - y^{*})}^{2}, {(\hat{y} - y^{*})}^{3}

(also presented in Lemma A2 in Appendix A). Combining the evaluations from Lemma A2 essentially guarantees the asymptotic representation in the standardized case given in Theorem 1.

3.2. Edgeworth Expansion for the Studentized Expectile

As pointed out by many papers, the Edgeworth expansion of the studentized estimator is more important than the expansion of the standardized estimator. In addition, it is the studentized version that can be inverted in practical settings to deliver confidence intervals for the unknown expectile. To obtain the asymptotic representation of the studentized estimator, we need to construct suitable estimators

\hat{C}

and

{\hat{ξ}}_{n}

.

\hat{C}

is given in (Definition 1) above. Next, we proceed to obtain an estimator of

E [u_{1}^{2} (X_{1})]

. Similarly, as the ordinal estimation of the population variance, putting

{\hat{u}}_{1} (X_{i}) = - (1 - 2 τ) h A (\frac{\hat{y} - X_{i}}{h}) + τ X_{i},

we can obtain a consistent estimator:

{\hat{ξ}}_{n}^{2} = \frac{1}{n - 1} \sum_{i = 1}^{n} {\hat{u}}_{1}^{2} (X_{i}) - \frac{n}{n - 1} {\{\frac{1}{n} \sum_{i = 1}^{n} {\hat{u}}_{1} (X_{i})\}}^{2} .

Let us now analyze the following studentized expectile estimator:

\frac{\hat{C}}{{\hat{ξ}}_{n}} (\hat{y} - y^{*}) .

We now introduce our next set of notations:

Definition 2.

Let us define

\begin{matrix} ζ (X_{i}) = \frac{1}{2 ξ_{n}^{3}} {{\tilde{u}}_{1}^{2} (X_{i}) - ξ_{n}^{2}}, \\ B_{n} = \frac{1 - 2 τ}{2 C^{2}} f (y^{*}) ξ_{n} - E [{\tilde{u}}_{1} (X_{1}) ζ (X_{1})], \\ u_{1}^{*} (X_{i}) = \frac{{\tilde{u}}_{1} (X_{i})}{ξ_{n}}, \\ u_{2}^{*} (X_{i}, X_{j}) = \frac{1 - 2 τ}{C^{2} ξ_{n}} f (y^{*}) {\tilde{u}}_{1} (X_{i}) {\tilde{u}}_{1} (X_{j}) - {\tilde{u}}_{1} (X_{i}) ζ (X_{j}) - {\tilde{u}}_{1} (X_{j}) ζ (X_{i}) . \end{matrix}

Similarly to the standardized expectile estimator, we can derive the asymptotic representation of the studentized estimator in the next Theorem:

Theorem 2.

Under the same assumptions as in Theorem 1:

(1): For the studentized expectile estimator, we obtain

\begin{matrix} \frac{n^{1 / 2} \hat{C} (\hat{y} - y^{*})}{{\hat{ξ}}_{n}} & = & \frac{n^{1 / 2} b_{1 n}}{ξ_{n}} + n^{- 1 / 2} B_{n} + n^{- 1 / 2} \sum_{i = 1}^{n} u_{1}^{*} (X_{i}) \\ + n^{- 3 / 2} \sum_{1 \leq i < j \leq n} u_{2}^{*} (X_{i}, X_{j}) + o_{l} (n^{- 1 / 2}) . \end{matrix}

(10)

(2): We have the Edgeworth expansion with residual term $o (n^{- 1 / 2})$ :

P \{\frac{\sqrt{n} \hat{C} (\hat{y} - y^{*})}{{\hat{ξ}}_{n}} \leq x\} = Q_{S} (x - \frac{n^{1 / 2} b_{1 n}}{ξ_{n}}) + o (n^{- 1 / 2})

(11)

where

κ_{S} = E [{u_{1}^{*} (X_{1})}^{3}] + 3 E [u_{1}^{*} (X_{1}) u_{1}^{*} (X_{2}) u_{2}^{*} (X_{1}, X_{2})]

and

Q_{S} (x) = Φ (x) - n^{- 1 / 2} ϕ (x) \{B_{n} + \frac{κ_{S} (x^{2} - 1)}{6}\} .

The proof of Theorem 2 is presented in Appendix A.

3.3. Cornish–Fisher-Type Approximation of the $α$ -Quantile of the Studentized Expectile

Here, we will obtain an approximation of

α

-quantile

e_{α}

of the studentized expectile, where

α = P \{\frac{\sqrt{n} \hat{C} (\hat{y} - y^{*})}{{\hat{ξ}}_{n}} \leq e_{α}\} = Q_{S} (e_{α} - \frac{n^{1 / 2} b_{1 n}}{ξ_{n}}) + o (n^{- 1 / 2}) .

Let us define

e_{α}^{*} = e_{α} - \frac{n^{1 / 2} b_{1 n}}{ξ_{n}} .

Then, expanding around the

α

-quantile

z_{α}

of

N (0, 1)

, we have

\begin{matrix} e_{α}^{*} & = & z_{α} + n^{- 1 / 2} \frac{ϕ (z_{α}) \{B_{n} + P_{S} (z_{α})\}}{Q_{S}^{'} (z_{α})} + O (n^{- 1}) \\ = & z_{α} + n^{- 1 / 2} \{B_{n} + P_{S} (z_{α})\} + O (n^{- 1}) \end{matrix}

where

P_{S} (x) = \frac{κ_{S} (x^{2} - 1)}{6}, κ_{S} = 3 \frac{(1 - 2 τ) f (y^{*})}{C^{2}} ξ_{n} - \frac{2 E [{\tilde{u}}_{1}^{3} (X_{1})]}{ξ_{n}^{3}} .

For the

α

-quantile

e_{α}

, we have

e_{α} = z_{α} + n^{1 / 2} \frac{b_{1 n}}{ξ_{n}} + n^{- 1 / 2} \{B_{n} + P_{S} (z_{α})\} + O (n^{- 1}) .

Since

b_{1 n} = - h^{2} \frac{(1 - 2 τ) f (y^{*})}{2} \int_{- \infty}^{\infty} u^{2} K (u) d u + O (h^{4}),

we have an estimator of

b_{1 n}

{\hat{b}}_{1 n} = - h^{2} \frac{(1 - 2 τ) \hat{f} (\hat{y})}{2} \int_{- \infty}^{\infty} u^{2} K (u) d u .

It is easy to see that

B_{n} = \frac{1 - 2 τ}{2 C^{2}} f (y^{*}) ξ_{n} - \frac{E [{{\tilde{u}}_{1} (X_{1})}^{3}]}{2 ξ_{n}^{3}} .

Thus, we have estimators of

B_{n}

and

ν_{3}

as follows:

\begin{matrix} {\hat{B}}_{n} & = & \frac{1 - 2 τ}{2 {\hat{C}}^{2}} \hat{f} (\hat{y}) {\hat{ξ}}_{n} - \frac{{\hat{μ}}_{3}}{2 {\hat{ξ}}_{n}^{3}}, \\ {\hat{κ}}_{S} & = & 3 \frac{(1 - 2 τ)}{{\hat{C}}^{2}} \hat{f} (\hat{y}) {\hat{ξ}}_{n} - \frac{2 {\hat{μ}}_{3}}{{\hat{ξ}}_{n}^{3}} \end{matrix}

where

{\hat{μ}}_{3} = \frac{1}{n} \sum_{i = 1}^{n} {\{\hat{u} (X_{i}) - \frac{1}{n} \sum_{j = 1}^{n} \hat{u} (X_{j})\}}^{3} .

Therefore, we have an estimator of the

α

-quantile

e_{α}

:

{\hat{e}}_{α} = z_{α} + n^{1 / 2} \frac{{\hat{b}}_{1 n}}{{\hat{ξ}}_{n}} + n^{- 1 / 2} \{{\hat{B}}_{n} + \frac{{\hat{κ}}_{S} (z_{α}^{2} - 1)}{6}\} .

(12)

4. Discussion

Given that the main application domain of expectiles has been risk management, we also want to illustrate the application of our methodology in this area. As discussed in Section 2,

E V a R = - e_{τ} (X)

is a coherent risk measure when

τ \in (0, 1 / 2] .

It is easy to check (or compare p. 46 of ([15])) that

- e_{1 - \tilde{τ}} (X) = e_{\tilde{τ}} (- X)

holds. In addition, most interest in risk management is in the tails. If the random variable of interest X represents an outcome, then

- X

represents a loss, and one would be interested in losses in the tail. To illustrate the effectiveness of our approach for constructing improved confidence intervals, we need to compare simulation outcomes with the population distribution for which the true expectile is known precisely. Such examples are relatively scarce in the literature. Some suitable exceptions are discussed in [14]. One of these exceptions is the exponential distribution, which we chose for our illustrations below.

Setting

Z = (- X)

to be standard exponentially distributed, we have for the values of

\tilde{τ}

the relation

e_{\tilde{τ}} (Z) = 1 + W \{\frac{2 \tilde{τ} - 1}{(1 - \tilde{τ}) e}\},

(13)

where

W (\cdot)

is the Lambert

W

function defined implicitly by means of the equation

W (z) exp (W (z)) = z

(and we note that

W (x) \sim l o g (x)

for

x \to \infty

holds). The Formula (13) for the expectile of the exponential distribution is derived on page 495 in [14].

We have used a symmetric compactly supported on

(- 1, 1)

kernel

K (x) .

It is said to be of order m, where m is the mth derivative

K^{(m)} \in L i p (β)

for some

β > 0

and

\int_{- 1}^{1} K (x) d x = 1, \int_{- 1}^{1} x^{i} K (x) d x = 0, i = 1, 2, \dots, m - 1, \int_{- 1}^{1} x^{m} K (x) d x \neq 0 .

our numerical experiments, we used the classical second-order Epanechnikov kernel,

K (x) = \frac{3}{4} (1 - x^{2}) I (| x | \leq 1) .

(14)

With it, the factor in the definition of the estimator

{\hat{b}}_{1 n}

becomes

\int_{- \infty}^{\infty} u^{2} K (u) d u = 0.2

.

There are at least two ways to produce accurate confidence intervals at level

(1 - α)

for the expectile when exploiting the Edgeworth expansion of its studentized version. One approach (we call it the CF method) is based on using the estimated values

({\hat{e}}_{α / 2}

and

{\hat{e}}_{1 - (α / 2)})

obtained by using the Formula (12). Then, the left-and right-hand sides of the

(1 - α) \times 100 %

confidence interval for

E V a R = - e_{τ} (X)

at given

τ

are obtained as

(\hat{y} - {\hat{e}}_{1 - \frac{α}{2}} {\hat{ξ}}_{n} / ({\hat{C}}_{n} \sqrt{n}), \hat{y} - {\hat{e}}_{\frac{α}{2}} {\hat{ξ}}_{n} / ({\hat{C}}_{n} \sqrt{n})) .

(15)

Another approach (we call it numerical inversion) is to use numerical root-finder procedures to solve the two equations

{\hat{Q}}_{S} (η_{1} - \frac{n^{1 / 2} {\hat{b}}_{1 n}}{{\hat{ξ}}_{n}}) = \frac{α}{2}, {\hat{Q}}_{S} (η_{2} - \frac{n^{1 / 2} {\hat{b}}_{1 n}}{{\hat{ξ}}_{n}}) = 1 - \frac{α}{2},

(16)

and construct the confidence interval as

(η_{1}, η_{2}) .

Here,

{\hat{Q}}_{S} (x) = Φ (x) - n^{- 1 / 2} ϕ (x) \{{\hat{B}}_{n} + \frac{{\hat{κ}}_{S} (x^{2} - 1)}{6}\} .

These two methods should be asymptotically equivalent but they deliver different intervals for small sample sizes, with the numerical inversion delivering significantly better results, in terms of closeness to the nominal coverage level.

We provide some details about the numerical implementation at the end of Appendix A.

With the standardized version, we achieved a better approximation and more precise coverage probabilities across very low sample sizes, such as

n = 10, 12, 15, or 20,

across a range of

τ

values, such as

τ = 0.1, 0.2, 0.3, 0.4

and a range of values of

α

, such as

α = 0.1, 0.05, 0.01

for the

(1 - α) * 100 %

confidence intervals. The approximations were extremely accurate for such small sample sizes. We have not reproduce all these here, as our main goal was to investigate the practically more relevant studentized case. We only include one graph (Figure 1) where the case

n = 12, τ = 0.2

is demonstrated graphically. The “true” cdf of the estimator for comparison was obtained via the empirical cdf based on 50,000 simulations from standard exponentially distributed data of size

n = 12 .

The resulting confidence intervals at a nominal

90 %

level had actual coverage of 0.90 for the Edgeworth and 0.9014 for the normal approximation. At nominal

95 %

, they were 0.9490 and 0.9453, respectively. At nominal

99 %,

they were 0.9858 for Edgeworth versus 0.9804 for the normal approximation.

In the practically relevant studentized case, we were unable to obtain such good results for sample sizes as low as the ones from the standardized case. This is, of course, to be expected, as in this case there was a need to estimate the

ξ_{n}

and the C quantities using the data. The moderate sample sizes at which the CF and numerical inversion methods deliver significantly more precise results depend, of course, on the distribution of C itself. For the exponential distribution, these turn out to be in the range of 20, 50, 100, 150 to about 200. For larger sample sizes, all three methods—the normal theory-based confidence intervals, the ones obtained by the numerical inversion and the CF-based intervals—become very accurate but the discrepancy between their accuracy becomes negligibly small and, for that reason, we do not report it here.

Before presenting thorough numerical simulations, we include one illustrative graph (Figure 2) for the case

n = 50, τ = 0.3

where the studentized case is demonstrated. The graph demonstrates the virtually uniform improvement when the Edgeworth approximation is used instead of the simple normal approximation. A comparison was made with the “true” cdf (obtained via the empirical cdf based on 50,000 simulations from standard exponentially distributed data of size

n = 50

). We found that at 50,000 replications a stabilization occurred and that further increase of the replications seemed unnecessary. The resulting confidence intervals at a nominal

90 %

level had actual coverage of 0.877 for the numerical inversion of the Edgeworth, with 0.869 for the normal approximation. At

95 %

nominal level, the actual coverage was 0.921 for the numerical inversion of the Edgeworth versus 0.917 for the normal approximation.

Next, we include Table 1 and Table 2 showing the effects of applying our methodology for constructing confidence intervals for the expectiles. The moderate samples included in the comparison were chosen as 20, 50, 100, 150, and 200. The two tables illustrate the results for two common confidence levels used in practice (

90 %

for Table 1 and

95 %

for Table 2). The best performer in each row is in bold font. Examination of Table 1 and Table 2 shows that the “true” coverage probabilities approached the nominal probabilities when the sample size increased. As expected, the discrepancies in accuracy between the different approximations also decreased when the sample size n increased. As the confidence intervals were based on asymptotic arguments, this demonstrates the consistency of our procedure. For the chosen levels of confidence, our new confidence intervals virtually always outperformed the ones based on the normal approximation. There appears to be a downward bias in the coverage probabilities across the tables for both normal and Edgeworth-based methods. This bias grows smaller as n increases above 200. We observe that the value of

τ

also influenced the bias, with smaller values of

τ

impacting the bias more significantly. This was expected, as these values of

τ

lead to expectiles that are further in the right tail of the distribution of the loss variable X and, hence, are more difficult to estimate. As is known, the Edgeworth approximation’s strength is in the central part of the distribution. However, at small values of

τ

we focus on the tail, where it does not necessarily improve over the normal approximation.

5. Conclusions

Edgeworth expansions are expected to deliver better approximations to the standardized and studentized versions of estimators of parameters of interest. The inversion of these expansions can be applied for constructing more accurate confidence intervals for these parameters when sample sizes are small or moderate. To justify the validity of these expansions, one needs to switch to using a kernel-smoothed version of the empirical distribution function in the definition of the estimator. We applied the technique for estimating the expectile, which has recently found fruitful applications in risk management. We illustrated the advantages of our procedure on simulations that utilized the exponential distribution as an example. We chose this distribution because the expectile is known precisely (not only approximately) for it. However, we stress that our procedure is fully nonparametric and can be applied for any distribution, as long as the conditions of Theorem 2 are satisfied.

Furthermore, we focused on the applications of inference about expectiles in financial data. However, this is by no means the only domain of application. As discussed in Remark 1. above, expectiles of a distribution F are in a one-to-one correspondence to quantiles of another distribution

G .

Anywhere, where quantiles-based inference is of interest, expectiles-based inference can be an alternative. In particular, a referee has suggested applications in constructing nonparametric control charts. The extension of our methodology to expectile regression settings is particularly important for applications. These applications could be a topic for further research.

Author Contributions

Conceptualization, Y.M. and S.P.; methodology, Y.M. and S.P.; software, S.P. and Y.M.; validation, S.P. and Y.M.; formal analysis, Y.M. and S.P.; writing—original draft preparation, S.P. and Y.M.; writing—S.P. and Y.M.; visualization, S.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were analyzed in this study. Simulated data were created using the R programming language.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Proof of Proposition 1.

Using the integration by parts, we have

\begin{matrix} 0 & = & τ \int_{y^{*}}^{\infty} {1 - F (s)} d s - (1 - τ) \int_{- \infty}^{y^{*}} F (s) d s \\ = & τ {[s {1 - F (s)}]}_{y^{*}}^{\infty} + τ \int_{y^{*}}^{\infty} s f (s) d s - (1 - τ) F^{[2]} (y^{*}) \\ = & - τ y^{*} {1 - F (y^{*})} + τ [\int_{- \infty}^{\infty} s f (s) d s - \int_{- \infty}^{y^{*}} s f (s) d s] - (1 - τ) F^{[2]} (y^{*}) \\ = & - τ y^{*} + τ y^{*} F (y^{*}) + τ \{μ - {[s F (s)]}_{- \infty}^{y^{*}} + \int_{- \infty}^{y^{*}} F (s) d s\} - (1 - τ) F^{[2]} (y^{*}) \\ = & τ (μ - y^{*}) - (1 - 2 τ) F^{[2]} (y^{*}) \end{matrix}

□

Proof of Proposition 2.

Since the kernel function satisfies

\int_{- \infty}^{\infty} s K (s) d s = 0

, we have

\begin{matrix} \int_{- \infty}^{\infty} s \hat{f} (s) d s & = & \frac{1}{n} \sum_{i = 1}^{n} \int_{- \infty}^{\infty} \frac{1}{h} K (\frac{s - X_{i}}{h}) d s \\ = & \frac{1}{n} \sum_{i = 1}^{n} \int_{- \infty}^{\infty} (t h + X_{i}) K (t) d t \\ = & \frac{1}{n} \sum_{i = 1}^{n} X_{i} = \bar{X} . \end{matrix}

Similarly to the derivation in (Proposition 1), we obtain

\begin{matrix} 0 & = & τ \int_{\hat{y}}^{\infty} {1 - {\hat{F}}_{n} (s)} d s - (1 - τ) \int_{- \infty}^{\hat{y}} {\hat{F}}_{n} (s) d s \\ = & τ {[s \{1 - \hat{F} (s)\}]}_{\hat{y}}^{\infty} + τ \int_{\hat{y}}^{\infty} s \hat{f} (s) d s - (1 - τ) {\hat{F}}^{[2]} (\hat{y}) \\ = & - τ \hat{y} {1 - \hat{F} (\hat{y})} + τ [\int_{- \infty}^{\infty} s \hat{f} (s) d s - \int_{- \infty}^{\hat{y}} s \hat{f} (s) d s] - (1 - τ) {\hat{F}}^{[2]} (\hat{y}) . \end{matrix}

It is easy to see that

\int_{- \infty}^{\hat{y}} s \hat{f} (s) d s = {[s \hat{F} (s)]}_{- \infty}^{\hat{y}} - \int_{- \infty}^{\hat{y}} \hat{F} (s) d s = \hat{y} \hat{F} (\hat{y}) - {\hat{F}}^{[2]} (\hat{y}) .

Thus, we have

τ (\bar{X} - \hat{y}) - (1 - 2 τ) {\hat{F}}^{[2]} (\hat{y}) = 0 .

□

First, we note the moment conditions that help us to obtain the asymptotic representations of the statistics.

Lemma A1.

Under the conditions of Theorem 1, for some

δ > 0

\begin{matrix} E {|\tilde{K} (X_{1})|}^{3 + δ} < \infty, \\ E {|\tilde{W} (X_{1})|}^{3 + δ} < \infty \end{matrix}

and

E {|\tilde{u} (X_{1})|}^{3 + δ} < \infty .

Proof.

Since

K (\cdot)

is bounded and

W (\cdot)

is a cumulative distribution function, we have the first and second inequalities. For

\tilde{u} (\cdot)

, it is sufficient to prove

E {|h A (\frac{y^{*} - X_{1}}{h})|}^{3 + δ} < \infty .

From the definition, we have

A (t) = \int_{- \infty}^{t} W (s) d s = {[s W (s)]}_{- \infty}^{t} - \int_{- \infty}^{t} s K (s) d s .

Since

\int s^{2} K (s) d s < \infty

, we have

\begin{matrix} lim_{s \to - \infty} s W (s) = lim_{s \to - \infty} \frac{W (s)}{1 / s} \\ = & lim_{s \to - \infty} (- \frac{K (s)}{1 / s^{2}}) = lim_{s \to - \infty} {- s^{2} K (s)} = 0 . \end{matrix}

From the condition (a3) and

| W (s) | \leq 1

, we have

h^{3 + δ} E {|(\frac{y^{*} - X_{1}}{h}) W (\frac{y^{*} - X_{1}}{h})|}^{3 + δ} \leq E {| y^{*} - X_{1} |}^{3 + δ} < \infty .

Furthermore, for a constant

β > 0

, we have

|\int_{- \infty}^{t} s K (s) d s| \leq β

. Thus, we obtain the desired result. □

Let us define the order evaluation

o_{M} (n^{- 1 / 2})

via

E | o_{M} (n^{- 1 / 2}) |^{r} = O (n^{- 1 / 2 - r / 2 - δ})

for some

r > 0

. As stated in our discussion in Section 2, when we consider an Edgeworth expansion with residual term

o (n^{- 1 / 2})

, we can ignore expressions of order

o_{l} (n^{- 1 / 2})

and

o_{L} (n^{- 1 / 2})

. Now, we observe that if a certain residual term

R_{n}

satisfies

R_{n} = o_{M} (n^{- 1 / 2})

then it is

o_{l} (n^{- 1 / 2})

.

Furthermore, let us define a U-statistic,

U_{n} = {(\binom{n}{r})}^{- 1} \sum_{1 \leq i_{1} < i_{2} < \dots \leq n} g (X_{i_{1}}, X_{i_{2}}, \dots, X_{i_{r}})

where

g (x_{1}, x_{2}, \dots, x_{r})

is symmetric in its arguments. For

U_{n}

, using large deviation theory, we have the following Lemma:

Lemma A2.

If

E (g^{2}) < \infty

we can obtain the following evaluations:

(1): It follows from Malevich and Abdalimov’s results [17] that

P \{|\frac{U_{n} - E (U_{n})}{\sqrt{V (U_{n})}}| \geq log n\} = o (n^{- 1 / 2}) .

(2): For $h = n^{- 1 / 4} {(log n)}^{- 1}$ , we have

P \{h^{2} |\frac{U_{n} - E (U_{n})}{\sqrt{V (U_{n})}}| \geq n^{- 1 / 2} {(log n)}^{- 1}\} = o (n^{- 1 / 2})

and then

h^{2} \frac{U_{n} - E (U_{n})}{\sqrt{V (U_{n})}} = o_{l} (n^{- 1 / 2}) .

(3): Let $β (\cdot)$ and $γ (\cdot)$ be functions satisfying $E [β (X_{1})] = E [γ (X_{1})] = 0$ , $E [β^{2} (X_{1})] = O (1)$ and $E [γ^{2} (X_{1})] = O (1)$ . Then, we have

\begin{matrix} n^{- 3 / 2} \sum_{i = 1}^{n} \sum_{j = 1}^{n} β (X_{i}) γ (X_{j}) \\ = & n^{- 1 / 2} E [β (X_{1}) γ (X_{1})] + n^{- 3 / 2} \sum_{1 \leq i < j \leq n} {β (X_{i}) γ (X_{j}) + β (X_{j}) γ (X_{i})} \\ + o_{M} (n^{- 1 / 2}) . \end{matrix}

(4): For $U_{n}$ and $o_{M} (n^{- 1 / 2})$ , we have

P \{|o_{M} (n^{- 1 / 2}) \frac{U_{n} - E (U_{n})}{\sqrt{V (U_{n})}}| \geq n^{- 1 / 2} {(log n)}^{- 1}\} = o (n^{- 1 / 2})

and then

o_{M} (n^{- 1 / 2}) \frac{U_{n} - E (U_{n})}{\sqrt{V (U_{n})}} = o_{M} (n^{- 1 / 2}) .

Proof.

(1) The equation directly follows from [17].

(2) Since

n^{- 1 / 2} h^{- 2} {(log n)}^{- 1} = n^{2 d - 1 / 2} {(log n)}^{- 1} \geq log n,

we have the desired result.

(3) Since

E [β^{2} (X_{1})] = O (1)

and

E [γ^{2} (X_{1})] = O (1)

, we have

\begin{matrix} E {|n^{- 3 / 2} \sum_{i = 1}^{n} {β (X_{i}) γ (X_{i}) - E [β (X_{1}) γ (X_{1})]}|}^{2} \\ = & O (n^{- 3}) n E [β^{2} (X_{1})] E [γ^{2} (X_{1})] \\ = & O (n^{- 3} n) = O (n^{- 1 / 2 - 2 / 2 - 1 / 2}) \end{matrix}

and then

n^{- 3 / 2} \sum_{i = 1}^{n} {β (X_{i}) γ (X_{i}) - E [β (X_{1}) γ (X_{1})]} = o_{l} (n^{- 1 / 2}) .

(4) It is easy to see that

\begin{matrix} P \{|o_{M} (n^{- 1 / 2}) \frac{U_{n} - E (U_{n})}{\sqrt{V (U_{n})}}| \geq n^{- 1 / 2} {(log n)}^{- 1}\} \\ = & P \{|o_{M} (n^{- 1 / 2}) \frac{U_{n} - E (U_{n})}{\sqrt{V (U_{n})}}| \geq n^{- 1 / 2} {(log n)}^{- 1}, |\frac{U_{n} - E (U_{n})}{\sqrt{V (U_{n})}}| \geq log n\} \\ + P \{|o_{M} (n^{- 1 / 2}) \frac{U_{n} - E (U_{n})}{\sqrt{V (U_{n})}}| \geq n^{- 1 / 2} {(log n)}^{- 1}, |\frac{U_{n} - E (U_{n})}{\sqrt{V (U_{n})}}| < log n\} \\ \leq & P \{|\frac{U_{n} - E (U_{n})}{\sqrt{V (U_{n})}}| \geq log n\} + P \{| o_{M} (n^{- 1 / 2}) | \geq n^{- 1 / 2} {(log n)}^{- 2}\} \\ = & o (n^{- 1 / 2}) . \end{matrix}

Thus, we have the desired result. □

Lemma A3.

For

0 < τ < 1

, under the assumption of Theorem 1 we have following approximations:

(1)

{(\hat{y} - y^{*})}^{3} = n^{- 1 / 2} o_{M} (n^{- 1 / 2}) .

(2)

\begin{matrix} \tilde{C} (\hat{y} - y^{*}) \\ = & \frac{1}{n} \sum_{i = 1}^{n} u_{1} (X_{i}) - \frac{1 - 2 τ}{2} {(\hat{y} - y^{*})}^{2} \hat{f} (y^{*}) + n^{- 1 / 2} o_{M} (n^{- 1 / 2}) . \end{matrix}

(3)

\begin{matrix} (\tilde{C} - C) (\hat{y} - y^{*}) \\ = & \frac{1 - 2 τ}{C} (n^{- 1} E [\tilde{W} (X_{1}) {\tilde{u}}_{1} (X_{1})] + n^{- 2} \sum_{1 \leq i < j \leq n} \{\tilde{W} (X_{i}) {\tilde{u}}_{1} (X_{j}) + \tilde{W} (X_{j}) {\tilde{u}}_{1} (X_{i})\}) \\ + n^{- 1 / 2} o_{l} (n^{- 1 / 2}) . \end{matrix}

(4)

\begin{matrix} \hat{f} (y^{*}) {(\hat{y} - y^{*})}^{2} \\ = & \frac{2 f (y^{*})}{n^{2} C^{2}} \sum_{1 \leq i < j \leq n} {\tilde{u}}_{1} (X_{i}) {\tilde{u}}_{1} (X_{j}) + \frac{f (y^{*})}{n C^{2}} E [{\tilde{u}}_{1}^{2} (X_{1})] + n^{- 1 / 2} o_{l} (n^{- 1 / 2}) . \end{matrix}

Proof.

(1) It follows from Equation (8) that

τ (\bar{X} - μ) - [τ + (1 - 2 τ) \hat{F} (y^{♯})] (\hat{y} - y^{*}) = 0

and

τ^{3} {(\bar{X} - μ)}^{3} = {[τ + (1 - 2 τ) \hat{F} (y^{♯})]}^{3} {(\hat{y} - y^{*})}^{3}

where

y^{♯} \in {[\hat{y}, y^{*}] or [y^{*}, \hat{y}]}

. For fixed

τ (0 < τ < 1)

and

0 \leq x \leq 1

, we can show that

min {τ, 1 - τ} \leq τ + (1 - 2 τ) x \leq max {τ, 1 - τ}

Thus,

{(\hat{y} - y^{*})}^{3}

and

{(\bar{X} - μ)}^{3}

converge to 0 with the same stochastic order. It follows from the moment evaluation of U-statistics that

E | \bar{X} {- μ |}^{r} = O (n^{- r / 2}) (r \geq 2)

. Then, we obtain

\begin{matrix} E | n^{1 / 2} {(\bar{X} - μ)}^{3} |^{4 / 3} = n^{2 / 3} E {| \bar{X} - μ |}^{4} \\ = & O (n^{- 4 / 3}) = O (n^{- 1 / 2 - 2 / 3 - 1 / 6}) . \end{matrix}

Thus, we have the desired result.

(2) Using the Taylor expansion, we have

\begin{matrix} {\hat{F}}^{[2]} (\hat{y}) - {\hat{F}}^{[2]} (y^{*}) \\ = & \hat{F} (y^{*}) (\hat{y} - y^{*}) + \frac{1}{2} \hat{f} (y^{*}) {(\hat{y} - y^{*})}^{2} + \frac{1}{6} {\hat{f}}^{'} (y^{*}) {(\hat{y} - y^{*})}^{3} \\ + \frac{1}{24} {\hat{f}}^{(2)} (y^{*}) {(\hat{y} - y^{*})}^{4} + \frac{1}{120} {\hat{f}}^{(3)} (y^{♯}) {(\hat{y} - y^{*})}^{5} \end{matrix}

where

y^{♯}

is between

\hat{y}

and

y^{*}

. Since

K^{(3)} (\cdot)

is bounded, we have

\begin{matrix} |n^{1 / 2} f^{(3)} (y^{♯}) {(\bar{X} - μ)}^{5}| \\ = & |n^{1 / 2} \frac{1}{n h^{4}} \sum_{i = 1}^{n} K^{(3)} (\frac{y^{♯} - X_{i}}{h}) {(\bar{X} - μ)}^{5}| \\ \leq & M h^{- 4} n^{1 / 2} {| \bar{X} - μ |}^{5} . \end{matrix}

Then, it follows from the moment evaluation that

\begin{matrix} E {|M h^{- 4} n^{1 / 2} {| \bar{X} - μ |}^{5}|}^{1 + δ / 5} \\ = & O (n^{3 / 2 + 3 δ / 10} n^{- (5 + δ) / 2} {(log n)}^{4 + 4 δ / 5}) \\ = & O (n^{- 1 / 2 - (1 + δ / 5) / 2 - δ / 10} {(log n)}^{4 + 4 δ / 5}) . \end{matrix}

Thus, we have

\frac{1}{120} {\hat{f}}^{(3)} (y^{♯}) {(\hat{y} - y^{*})}^{5} = n^{- 1 / 2} o_{M} (n^{- 1 / 2}) .

In the same way as the evaluation of the asymptotic mean squared error of the kernel density estimator, we can show that

E [{\hat{f}}^{'} (y^{*})] = f^{'} (y^{*}) + O (h^{2}) and E {|{\hat{f}}^{'} (y^{*}) - f^{'} (y^{*})|}^{k} = O (h^{- 2 k + 1}) .

Thus, for some constant

d > 0

,

\begin{matrix} E [{\{{\hat{f}}^{'} (y^{*}) - f^{'} (y^{*})\}}^{4}] \\ \leq & d (E {\{{\hat{f}}^{'} (y^{*}) - E [\frac{1}{h^{2}} K^{'} (\frac{x - X_{1}}{h})]\}}^{4} \\ + {\{E [\frac{1}{h^{2}} K^{'} (\frac{x - X_{1}}{h})] - f^{'} (y^{*})\}}^{4}) \\ = & O (n^{- 2} h^{- 7} + h^{8}) \end{matrix}

From Hölder’s inequality, we obtain

\begin{matrix} E {|\{{\hat{f}}^{'} (y^{*}) - f^{'} (y^{*})\} {(\hat{y} - y^{*})}^{3}|}^{4 / 3} \\ \leq & M {\{E {|{\hat{f}}^{'} (y^{*}) - f^{'} (y^{*})|}^{4}\}}^{1 / 4} {\{E | \bar{X} {- μ |}^{4}\}}^{3 / 4} \\ = & O (n^{- 3 / 2}) = o (n^{- 1 / 2 - 2 / 3 - 1 / 3}) . \end{matrix}

Therefore, we have

\frac{1}{6} {\hat{f}}^{'} (y^{*}) {(\hat{y} - y^{*})}^{3} = n^{- 1 / 2} o_{M} (n^{- 1 / 2}) .

Similarly, we can show that

\frac{1}{24} {\hat{f}}^{(2)} (y^{*}) {(\hat{y} - y^{*})}^{4} = n^{- 1 / 2} o_{M} (n^{- 1 / 2}) .

Thus, we have

\begin{matrix} \tilde{C} (\hat{y} - y^{*}) \\ = & [τ + (1 - 2 τ) \hat{F} (y^{*})] (\hat{y} - y^{*}) \\ = & \frac{1}{n} \sum_{i = 1}^{n} u_{1} (X_{i}) - \frac{1 - 2 τ}{2} {(\hat{y} - y^{*})}^{2} \hat{f} (y^{*}) + n^{- 1 / 2} o_{M} (n^{- 1 / 2}) . \end{matrix}

Substituting the above equation into Equation (8), we have the desired result.

(3) From the definition, we can show that

\begin{matrix} \tilde{C} - C & = & (1 - 2 τ) \{\hat{F} (y^{*}) - F (y^{*})\} \\ = & (1 - 2 τ) \frac{1}{n} \sum_{i = 1}^{n} \tilde{W} (X_{i}) + (1 - 2 τ) b_{2 n}, \\ C (\hat{y} - y^{*}) & = & \tilde{C} (\hat{y} - y^{*}) - (\tilde{C} - C) (\hat{y} - y^{*}) \end{matrix}

and

\hat{y} - y^{*} = \frac{1}{n C} \sum_{i = 1}^{n} {\tilde{u}}_{1} (X_{i}) + o_{l} (n^{- 1 / 2}) .

It follows from the equation in (Lemma A2) that

b_{2 n} \frac{1}{n C} \sum_{i = 1}^{n} {\tilde{u}}_{1} (X_{i}) = n^{- 1 / 2} o_{l} (n^{- 1 / 2}) .

Then, we can show that

\begin{matrix} (\tilde{C} - C) (\hat{y} - y^{*}) \\ = & \{(1 - 2 τ) \frac{1}{n} \sum_{i = 1}^{n} \tilde{W} (X_{i}) + (1 - 2 τ) b_{2 n}\} \{\frac{1}{n C} \sum_{i = 1}^{n} {\tilde{u}}_{1} (X_{i}) + o_{l} (n^{- 1 / 2})\} \\ = & \frac{1 - 2 τ}{C} (n^{- 1} E [\tilde{W} (X_{1}) {\tilde{u}}_{1} (X_{1})] + n^{- 2} \sum_{1 \leq i < j \leq n} \{\tilde{W} (X_{i}) {\tilde{u}}_{1} (X_{j}) + \tilde{W} (X_{j}) {\tilde{u}}_{1} (X_{i})\}) \\ + n^{- 1 / 2} o_{l} (n^{- 1 / 2}) . \end{matrix}

(4) For the kernel-type estimators, we have

E [{\tilde{K}}^{2} (X_{1})] = O (h^{- 1}), E [{\tilde{W}}^{2} (X_{1})] = O (1), E [{\tilde{u}}_{1}^{2} (X_{1})] = O (1) .

First, we consider

(1 - 2 τ) {(\hat{y} - y^{*})}^{2} {\hat{f} (y^{*}) - f (y^{*})}

. Let us evaluate the following terms:

\begin{matrix} n^{- 5 / 2} \sum_{1 \leq i < j < k \leq n} {\tilde{u}}_{1} (X_{i}) {\tilde{u}}_{1} (X_{j}) \tilde{K} (X_{k}), n^{- 3 / 2} b_{1 n} \sum_{1 \leq i < j \leq n} {\tilde{u}}_{1} (X_{i}) \tilde{K} (X_{j}), \\ n^{- 3 / 2} b_{3 n} \sum_{1 \leq i < j \leq n} {\tilde{u}}_{1} (X_{i}) \tilde{u} (X_{j}) . \end{matrix}

Using the moment evaluations for U-statistics, we can show that

\begin{matrix} E {|n^{- 5 / 2} \sum_{1 \leq i < j < k \leq n} {\tilde{u}}_{1} (X_{i}) {\tilde{u}}_{1} (X_{j}) \tilde{K} (X_{k})|}^{2} \\ \leq & O (n^{- 5}) n^{3} {\{E [{\tilde{u}}_{1}^{2} (X_{i})]\}}^{2} E [{\tilde{K}}^{2} (X_{k})] \\ = & O (n^{- 2}) O (h^{- 1}) = O (n^{- 1 / 2 - 1 - (1 / 2 - d)}) . \end{matrix}

Similarly, we can show that

\begin{matrix} E {|n^{- 3 / 2} b_{3 n} \sum_{1 \leq i < j \leq n} {\tilde{u}}_{1} (X_{i}) \tilde{u} (X_{j})|}^{2} \\ = & O (n^{- 3} h^{4} n^{2}) = O (n^{- 1 - 4 d}) = O (n^{- 1 / 2 - 1 - 4 (d - 1 / 8)}) \end{matrix}

and

\begin{matrix} E {|n^{- 3 / 2} b_{n} \sum_{1 \leq i < j \leq n} {\tilde{u}}_{1} (X_{i}) \tilde{K} (X_{j})|}^{2} \\ = & O (n^{- 3} h^{4} n^{2} h^{- 1}) = O (n^{- 1 - 3 d}) = O (n^{- 1 / 2 - 1 - 3 (d - 1 / 6)}) . \end{matrix}

Furthermore, it is easy to see that

n^{- 3 / 2} \sum_{i = 1}^{n} {{\tilde{u}}_{1}^{2} (X_{i}) - E [{\tilde{u}}_{1}^{2} (X_{i})]} = n^{1 / 2} n^{- 1 / 2} o_{l} (n^{- 1 / 2}) = o_{l} (n^{- 1 / 2})

and then

n^{- 3 / 2} \sum_{i = 1}^{n} {\tilde{u}}_{1}^{2} (X_{i}) = n^{- 1 / 2} E [{\tilde{u}}_{1}^{2} (X_{i})] + o_{l} (n^{- 1 / 2}) .

It follows from Equation (4) in the standardized version that we have

\hat{y} - y^{*} = \frac{1}{n C} \sum_{i = 1}^{n} {\tilde{u}}_{1} (X_{i}) + o_{l} (n^{- 1 / 2}) .

Using the large deviation in (Lemma A2), we have

\begin{matrix} {(\hat{y} - y^{*})}^{2} & = & \frac{2}{n^{2} C^{2}} \sum_{1 \leq i < j \leq n} {\tilde{u}}_{1} (X_{i}) {\tilde{u}}_{1} (X_{j}) \\ + \frac{1}{n C^{2}} E [{\tilde{u}}_{1}^{2} (X_{1})] + n^{- 1 / 2} o_{l} (n^{- 1 / 2}) . \end{matrix}

Then, we can show that

\begin{matrix} \hat{f} (y^{*}) {(\hat{y} - y^{*})}^{2} \\ = & \frac{2 f (y^{*})}{n^{2} C^{2}} \sum_{1 \leq i < j \leq n} {\tilde{u}}_{1} (X_{i}) {\tilde{u}}_{1} (X_{j}) + \frac{f (y^{*})}{n C^{2}} E [{\tilde{u}}_{1}^{2} (X_{1})] + n^{- 1 / 2} o_{l} (n^{- 1 / 2}) . \end{matrix}

□

Proof of Theorem 1.

(1) Using (Lemma A3), we can easily obtain the asymptotic representation of the standardized expectile

\frac{\sqrt{n} C}{ξ_{n}} (\hat{y} - y^{*})

.

(2) Using the Taylor expansion, we can easily obtain

\begin{matrix} E [h A (\frac{y^{*} - X_{1}}{h})] \\ = & \int_{- \infty}^{\infty} h A (\frac{y^{*} - s}{h}) f (s) d s \\ = & {[h A (\frac{y^{*} - s}{h}) F (s)]}_{- \infty}^{\infty} + \int_{- \infty}^{\infty} W (\frac{y^{*} - s}{h}) F (s) d s \\ = & {[W (\frac{y^{*} - s}{h}) F^{[2]} (s)]}_{- \infty}^{\infty} + \int_{- \infty}^{\infty} \frac{1}{h} K (\frac{y^{*} - s}{h}) F^{[2]} (s) d s \\ = & \int_{- \infty}^{\infty} K (u) F^{[2]} (y^{*} - h u) d u = F^{[2]} (y^{*}) + O (h) . \end{matrix}

Similarly, we obtain

\begin{matrix} E [{\{h A (\frac{y^{*} - X_{1}}{h})\}}^{2}] \\ = & \int_{- \infty}^{\infty} {\{h A (\frac{y^{*} - s}{h})\}}^{2} f (s) d s \\ = & {[{\{h A (\frac{y^{*} - s}{h})\}}^{2} F (s)]}_{- \infty}^{\infty} + 2 h \int_{- \infty}^{\infty} A (\frac{y^{*} - s}{h}) W (\frac{y^{*} - s}{h}) F (s) d s \\ = & 2 h {[A (\frac{y^{*} - s}{h}) W (\frac{y^{*} - s}{h}) F^{[2]} (s)]}_{- \infty}^{\infty} \\ + 2 \int_{- \infty}^{\infty} \{W^{2} (\frac{y^{*} - s}{h}) + A (\frac{y^{*} - s}{h}) K (\frac{y^{*} - s}{h})\} F^{[2]} (s) d s \\ = & 2 \int_{- \infty}^{\infty} W^{2} (\frac{y^{*} - s}{h}) F^{[2]} (s) d s + 2 \int_{- \infty}^{\infty} A (\frac{y^{*} - s}{h}) K (\frac{y^{*} - s}{h}) F^{[2]} (s) d s . \end{matrix}

For the first term, we can show that

\begin{matrix} 2 \int_{- \infty}^{\infty} W^{2} (\frac{y^{*} - s}{h}) F^{[2]} d s \\ = & 2 {[W^{2} (\frac{y^{*} - s}{h}) F^{[3]} (s)]}_{- \infty}^{\infty} + \frac{4}{h} \int_{- \infty}^{\infty} W (\frac{y^{*} - s}{h}) K (\frac{y^{*} - s}{h}) F^{[3]} (s) d s \\ = & 4 \int_{- \infty}^{\infty} W (u) K (u) F^{[3]} (y^{*} - h u) d u \\ = & 4 F^{[3]} (y^{*}) \int_{- \infty}^{\infty} W (u) K (u) d u + O (h) \\ = & 2 F^{[3]} (y^{*}) + O (h) \end{matrix}

where

F^{[3]} (x) = \int_{- \infty}^{x} F^{[2]} (u) d u .

For the second term, we have

\begin{matrix} 2 \int_{- \infty}^{\infty} A (\frac{y^{*} - s}{h}) K (\frac{y^{*} - s}{h}) F^{[2]} (s) d s \\ = & 2 h \int_{- \infty}^{\infty} A (u) K (u) F^{[2]} (y^{*} - h u) d u = O (h) . \end{matrix}

Then. we can obtain

\begin{matrix} V a r [(1 - 2 τ) \{h A (\frac{y^{*} - X_{1}}{h})\}] \\ = & {(1 - 2 τ)}^{2} [2 F^{[3]} (y^{*}) - {\{F^{[2]} (y^{*})\}}^{2}] + O (h) . \end{matrix}

Similarly, we can obtain the covariance. Since

\begin{matrix} E [h A (\frac{y^{*} - X_{1}}{h}) X_{i}] = \int_{- \infty}^{\infty} h A (\frac{y^{*} - s}{h}) s f (s) d s \\ = & {[h A (\frac{y^{*} - s}{h}) \{s F (s) - F^{[2]} (s)\}]}_{- \infty}^{\infty} \\ + \int_{- \infty}^{\infty} W (\frac{y^{*} - s}{h}) \{s F (s) - F^{[2]} (s)\} d s \\ = & {[W (\frac{y^{*} - s}{h}) \{s F^{[2]} (s) - 2 F^{[3]} (s)\}]}_{- \infty}^{\infty} \\ + \int_{- \infty}^{\infty} \frac{1}{h} K (\frac{y^{*} - s}{h}) \{s F^{[2]} (s) - 2 F^{[3]} (s)\} d s \\ = & \int_{- \infty}^{\infty} K (u) \{(y^{*} - h u) F^{[2]} (y^{*} - h u) - 2 F^{[3]} (y^{*} - h u)\} d u \\ = & y^{*} F^{[2]} (y^{*}) - 2 F^{[3]} (y^{*}) + O (h), \end{matrix}

and then we have

\begin{matrix} C o v (h A (\frac{y^{*} - X_{1}}{h}), X_{i} - μ) \\ = & E [h A (\frac{y^{*} - X_{1}}{h}) X_{i}] - μ E [h A (\frac{y^{*} - X_{1}}{h})] \\ = & (y^{*} - μ) F^{[2]} (y^{*}) - 2 F^{[3]} (y^{*}) + O (h) . \end{matrix}

Combining the above calculations, we can obtain

\begin{matrix} V a r [u (X_{1})] \\ = & {(1 - 2 τ)}^{2} [2 F^{[3]} (y^{*}) - {\{F^{[2]} (y^{*})\}}^{2} \\ + 2 h F^{[2]} (y^{*}) \{\int_{- \infty}^{\infty} A (u) K (u) d u - 2 \int_{- \infty}^{\infty} u W (u) K (u) d u\}] \\ - 2 (1 - 2 τ) τ \{(y^{*} - μ) F^{[2]} (y^{*}) - 2 F^{[3]} (y^{*})\} + τ^{2} σ^{2} + O (h) \\ = & F^{[3]} (y^{*}) \{2 {(1 - τ)}^{2} + 4 (1 - 2 τ)\} \\ - (1 - 2 τ) F^{[2]} (y^{*}) \{τ (y^{*} - μ) + (1 - 2 τ) F^{[2]} (y^{*})\} \\ - (1 - 2 τ) F^{[2]} (y^{*}) τ (y^{*} - μ) \\ + τ^{2} σ^{2} + O (h) . \end{matrix}

It follows from the Equation (6) that

(1 - 2 τ) F^{[2]} (y^{*}) τ (y^{*} - μ) = - τ^{2} {(y^{*} - μ)}^{2} .

Thus, we obtain

V a r [u (X_{1})] = (2 - 4 τ) F^{[3]} (y^{*}) + τ^{2} {(y^{*} - μ)}^{2} + τ^{2} σ^{2} + O (h) .

(3) Using the Edgeworth expansion for the asymptotic U-statistics (see [4]), we can easily obtain the Edgeworth expansion. □

Proof of Theorem 2.

From the definition, we obtain

\hat{C} (\hat{y} - y^{*}) = (\hat{C} - \tilde{C}) (\hat{y} - y^{*}) + \tilde{C} (\hat{y} - y^{*}) .

Using the Taylor expansion and the previous approximations, we obtain

\begin{matrix} (\hat{C} - \tilde{C}) (\hat{y} - y^{*}) & = & (1 - 2 τ) \{\hat{F} (\hat{y}) - \hat{F} (y^{*})\} (\hat{y} - y^{*}) \\ = & (1 - 2 τ) \hat{f} (y^{*}) {(\hat{y} - y^{*})}^{2} + n^{- 1 / 2} o_{l} (n^{- 1 / 2}) . \end{matrix}

Thus, we have

\hat{C} (\hat{y} - y^{*}) = \frac{1}{n} \sum_{i = 1}^{n} u_{1} (X_{i}) + \frac{1 - 2 τ}{2} {(\hat{y} - y^{*})}^{2} \hat{f} (y^{*}) + n^{- 1 / 2} o_{l} (n^{- 1 / 2}) .

It follows from (2) in (Lemma A3) that

\begin{matrix} \hat{C} (\hat{y} - y^{*}) \\ = & b_{n} + \frac{1 - 2 τ}{2 n C^{2}} f (y^{*}) E [{\tilde{u}}_{1}^{2} (X_{1})] + \frac{1}{n} \sum_{i = 1}^{n} {\tilde{u}}_{1} (X_{i}) \\ + \frac{1 - 2 τ}{n^{2} C^{2}} f (y^{*}) \sum_{1 \leq i < j \leq n} {\tilde{u}}_{1} (X_{i}) {\tilde{u}}_{1} (X_{j}) + n^{- 1 / 2} o_{l} (n^{- 1 / 2}) . \end{matrix}

It is easy to see that

ξ_{n}^{2} = E [{\tilde{u}}_{1}^{2} (X_{1})] = E [{u_{1} (X_{1}) - b_{n}}^{2}] = E [u_{1}^{2} (X_{1})] + O (h^{4}) .

It follows from the theory of U-statistics for the sample variance and the Taylor expansion that

{\hat{ξ}}_{n}^{2} = ξ_{n}^{2} + \frac{1}{n} \sum_{i = 1}^{n} \{{\tilde{u}}_{1}^{2} (X_{i}) - ξ_{n}^{2}\} + o_{l} (n^{- 1 / 2}) .

Furthermore, using the Taylor expansion, we obtain

{\hat{ξ}}_{n}^{- 1} = ξ_{n}^{- 1} - \frac{1}{2 n ξ_{n}^{3}} \sum_{i = 1}^{n} \{{\tilde{u}}_{1}^{2} (X_{i}) - ξ_{n}^{2}\} + o_{l} (n^{- 1 / 2}) .

Similarly to the proof of (Lemma 2) in Maesono and Penev [4], we can obtain the asymptotic representation of the studentized expectile.

Using the same argument as in Maesono and Penev [4], it is easy to obtain the Edgeworth expansion for the studentized expectile estimator. □

Details about the numerical implementation

We used the R programming language. The kernel was the Epanechnikov kernel (14). The bandwidth was set to

h = n^{- 1 / 4} {(log n)}^{- 1} .

For performance comparisons, we needed the true expectile of the exponential distribution (13). This was calculated by using the R function lambertW0 from the R package lamW. The kernel functions

W (.)

and

A (.)

were calculated analytically using simple piece-wise polynomial integration starting with the Epanechnikov kernel. We describe the main steps in the implementation of the practically relevant studentized confidence intervals (the steps for the standardized intervals are similar):

Enter the n data and the values of $τ$ and $α .$
Write simple R functions to calculate $\hat{f} (\cdot),$ $\hat{F} (\cdot)$ . and ${\hat{F}}^{[2]} (\cdot)$ , using $h, W (\cdot),$ and $A (\cdot) .$
Solve the Equation (7) to find the estimated expectile $\hat{y}$ , using the uniroot function in R.
Find $\hat{C}$ from Definition 1.
Find the estimator ${\hat{ξ}}_{n}^{2} .$
Find ${\hat{μ}}_{3} .$
Find ${\hat{B}}_{n}$ , using $\hat{C}, \hat{f}, \hat{y}, {\hat{μ}}_{3}$ , and ${\hat{ξ}}_{n} .$
Find ${\hat{κ}}_{S}$ , using $\hat{C}, \hat{f}, \hat{y}, {\hat{μ}}_{3}$ , and ${\hat{ξ}}_{n} .$
Find ${\hat{b}}_{1 n}$ , using $h, \hat{f} (\hat{y})$ , and $\int_{- \infty}^{\infty} u^{2} K (u) d u = 0.2 .$
Find the estimator of ${\hat{e}}_{α}$ in (12).
Substitute in the Formula (15) to obtain the confidence interval by the CF method.
Using the numerical foot-finder uniroot to find the confidence internal by using the numerical inversion method (16).

References

Maesono, Y.; Penev, S. Edgeworth Expansion for the Kernel Quantile Estimator. Ann. Inst. Stat. Math. 2011, 63, 617–644. [Google Scholar] [CrossRef]
Falk, M. Relative deficiency of kernel type estimators of quantiles. Ann. Stat. 1984, 12, 261–268. [Google Scholar] [CrossRef]
Falk, M. Asymptotic normality of the kernel quantile estimator. Ann. Stat. 1985, 13, 428–433. [Google Scholar] [CrossRef]
Maesono, Y.; Penev, S. Improved confidence intervals for quantiles. Ann. Inst. Stat. Math. 2013, 65, 167–189. [Google Scholar] [CrossRef]
Newey, W.; Powel, J. Asymmetric least squares estimation and testing. Econometrica 1987, 55, 819–847. [Google Scholar] [CrossRef]
Artzner, P.; Delbaen, F.; Eber, J.; Heath, D. Coherent measures of risk. Math. Financ. 1999, 9, 203–228. [Google Scholar] [CrossRef]
Gneiting, T. Making and Evaluating Point Forecasts. J. Am. Stat. Assocation 2011, 106, 746–762. [Google Scholar] [CrossRef]
Ziegel, J. Coherence and elicitability. Math. Financ. 2016, 26, 901–918. [Google Scholar] [CrossRef]
Bellini, F. Isotonicity properties of generalized quantiles. Stat. Probab. Lett. 2012, 82, 2017–2024. [Google Scholar] [CrossRef]
Holzmann, H.; Klar, B. Expectile asymptotics. Electornic J. Stat. 2016, 10, 2355–2371. [Google Scholar] [CrossRef]
Krätschmer, V.; Zähle, H. Statistical Inference for Expectile-based Risk Measures. Scand. J. Stat. 2017, 44, 425–454. [Google Scholar] [CrossRef]
Waltrup, L.S.; Sobotka, F.; Kneib, T.; Kauermann, G. Expectile and quantile regression- Davis and Goliath? Stat. Model. 2015, 15, 433–456. [Google Scholar] [CrossRef]
Sobotka, F.; Kauermann, G.; Waltrup, L.S.; Kneip, T. On confidence intervals for semiparametric expectile regression. Stat. Comput. 2013, 23, 135–148. [Google Scholar] [CrossRef]
Bellini, F.; Di Bernardino, E. Risk management with expectiles. The Eur. J. Financ. 2017, 23, 487–506. [Google Scholar] [CrossRef]
Bellini, F.; Klar, B.; Müller, A.; Rosazza Gianin, E. Generalized quantiles as risk measures. Insur. Math. Econ. 2014, 54, 41–48. [Google Scholar] [CrossRef]
Chen, J.M. On Exactitute in Financial Regulation: Value-at-Risk, Expected Shortfall, and Expectiles. Risks 2018, 6, 61. [Google Scholar] [CrossRef]
Malevich, T.L.; Abdalimov, B. Large Deviation Probabilities for U-Statistics. Theory Probab. Appl. 1979, 24, 215–220. [Google Scholar] [CrossRef]
Van der Vaart, A.W. Asymptotic Statistics; Cambridge University Press: Cambridge, UK, 1998. [Google Scholar]
Jones, M.C. Expeciles and m-quantiles are quantiles. Stat. Probab. Lett. 1994, 20, 149–153. [Google Scholar] [CrossRef]

Figure 1. True cdf, normal, and Edgeworth approximation for the standardized estimator with exponential data.

Figure 2. True cdf, normal, and Edgeworth approximation for the studentized estimator with exponential data.

Table 1. Symmetric confidence intervals for the expectile of the standard exponential distribution.

Sample Size	$τ$	Nominal Coverage 90%
		Normal	Numerical Inversion	CF Method
20	0.5	0.85630	$0.87444$	0.85730
20	0.4	0.84648	$0.86409$	0.83894
20	0.3	0.83056	$0.84846$	0.81052
20	0.2	0.80674	$0.82362$	0.76108
20	0.1	0.75308	$0.77348$	0.66668
50	0.5	0.88008	$0.89212$	0.87974
50	0.4	0.87504	$0.88542$	0.86940
50	0.3	0.86872	$0.87670$	0.85496
50	0.2	0.85734	$0.86368$	0.82832
50	0.1	0.82998	$0.83878$	0.76416
100	0.5	0.89244	$0.90010$	0.89138
100	0.4	0.88938	$0.89648$	0.88616
100	0.3	0.88588	$0.89196$	0.87762
100	0.2	0.87834	$0.88486$	0.86204
100	0.1	0.86184	$0.86668$	0.82268
150	0.5	0.89558	$0.90240$	0.89476
150	0.4	0.89284	$0.90126$	0.89164
150	0.3	0.89070	$0.89698$	0.88544
150	0.2	0.88532	$0.89032$	0.87424
150	0.1	0.87374	$0.87888$	0.84714
200	0.5	0.89714	$0.90322$	0.89666
200	0.4	0.89522	$0.90150$	0.89374
200	0.3	0.89330	$0.89952$	0.88916
200	0.2	0.88714	$0.89406$	0.88008
200	0.1	0.88094	$0.88540$	0.86120

Table 2. Symmetric confidence intervals for the expectile of the standard exponential distribution.

Sample Size	$τ$	Nominal Coverage 95%
		Normal	Numerical Inversion	CF Method
20	0.5	0.90454	$0.92016$	0.91170
20	0.4	0.89414	$0.90844$	0.89408
20	0.3	0.87992	$0.89130$	0.86164
20	0.2	0.85692	$0.86550$	0.80174
20	0.1	0.80530	$0.81386$	0.69500
50	0.5	0.93000	$0.93760$	0.93170
50	0.4	0.92404	$0.93012$	0.92260
50	0.3	0.91726	$0.92100$	0.90848
50	0.2	0.90608	$0.90632$	0.87640
50	0.1	0.87604	$0.87832$	0.79596
100	0.5	0.94104	$0.94588$	0.94194
100	0.4	0.93896	$0.94170$	0.93718
100	0.3	0.93500	$0.93612$	0.92922
100	0.2	0.92642	$0.92652$	0.91332
100	0.1	0.90554	$0.90666$	0.86584
150	0.5	0.94426	$0.94874$	0.94580
150	0.4	0.94230	$0.94580$	0.94266
150	0.3	0.93858	$0.94200$	0.93744
150	0.2	0.93386	$0.93430$	0.92570
150	0.1	$0.92224$	0.91924	0.89484
200	0.5	0.94588	$0.95002$	0.94688
200	0.4	0.94478	$0.94690$	0.94372
200	0.3	0.94282	$0.94286$	0.93926
200	0.2	0.93688	$0.93704$	0.93140
200	0.1	$0.92776$	0.92606	0.90954

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Penev, S.; Maesono, Y. Improved Confidence Intervals for Expectiles. Mathematics 2025, 13, 510. https://doi.org/10.3390/math13030510

AMA Style

Penev S, Maesono Y. Improved Confidence Intervals for Expectiles. Mathematics. 2025; 13(3):510. https://doi.org/10.3390/math13030510

Chicago/Turabian Style

Penev, Spiridon, and Yoshihiko Maesono. 2025. "Improved Confidence Intervals for Expectiles" Mathematics 13, no. 3: 510. https://doi.org/10.3390/math13030510

APA Style

Penev, S., & Maesono, Y. (2025). Improved Confidence Intervals for Expectiles. Mathematics, 13(3), 510. https://doi.org/10.3390/math13030510

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Improved Confidence Intervals for Expectiles

Abstract

1. Introduction

2. Materials and Methods

3. Results

3.1. Edgeworth Expansion for the Standardized Expectile

3.2. Edgeworth Expansion for the Studentized Expectile

3.3. Cornish–Fisher-Type Approximation of the $α$ -Quantile of the Studentized Expectile

4. Discussion

5. Conclusions

Author Contributions

Funding

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Improved Confidence Intervals for Expectiles

Abstract

1. Introduction

2. Materials and Methods

3. Results

3.1. Edgeworth Expansion for the Standardized Expectile

3.2. Edgeworth Expansion for the Studentized Expectile

3.3. Cornish–Fisher-Type Approximation of the α -Quantile of the Studentized Expectile

4. Discussion

5. Conclusions

Author Contributions

Funding

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3.3. Cornish–Fisher-Type Approximation of the $α$ -Quantile of the Studentized Expectile