Next Article in Journal
Economic and Financial Crime: Corruption, Shadow Economy, and Money Laundering: Book Review. Written by Monica Violeta Achim and Sorin Nicolae Borlea. Springer Nature: Cham, Switzerland, 2021. ISBN 978-3-030-51780-9
Previous Article in Journal
Forecasting in Small Business Management
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Optimal Tail Selection in Risk Measurement

by
Małgorzata Just
1,* and
Krzysztof Echaust
2
1
Department of Finance and Accounting, Poznań University of Life Sciences, 60-637 Poznań, Poland
2
Department of Operations Research and Mathematical Economics, Poznań University of Economics and Business, 61-875 Poznań, Poland
*
Author to whom correspondence should be addressed.
Risks 2021, 9(4), 70; https://doi.org/10.3390/risks9040070
Submission received: 3 March 2021 / Revised: 30 March 2021 / Accepted: 7 April 2021 / Published: 9 April 2021

Abstract

:
The appropriate choice of a threshold level, which separates the tails of the probability distribution of a random variable from its middle part, is considered to be a very complex and challenging task. This paper provides an empirical study on various methods of the optimal tail selection in risk measurement. The results indicate which method may be useful in practice for investors and financial and regulatory institutions. Some methods that perform well in simulation studies, based on theoretical distributions, may not perform well when real data are in use. We analyze twelve methods with different parameters for forty-eight world indices using returns from the period of 2000–Q1 2020 and four sub-periods. The research objective is to compare the methods and to identify those which can be recognized as useful in risk measurement. The results suggest that only four tail selection methods, i.e., the Path Stability algorithm, the minimization of the Asymptotic Mean Squared Error approach, the automated Eyeball method with carefully selected tuning parameters and the Hall single bootstrap procedure may be useful in practical applications.

1. Introduction

Extreme value theory (EVT) is a branch of statistics dealing with extreme values. It focuses on the asymptotic behavior of extreme values of random variables instead of the whole distribution. Such an approach enables us to model extreme values more accurately. Nowadays, EVT is widely used in many disciplines such as hydrology, environmental sciences, engineering, insurance and finance (Akhundjanov and Chamberlain 2019; Embrechts et al. 1997; Embrechts and Schmidli 1994; Gilli and Këllezi 2006; Longin 2000; Loretan and Phillips 1994; Mcneil 1999; McNeil and Frey 2000; Roth et al. 2016; Smith 2003; Vilasuso and Katz 2000; Wang et al. 2010). One of the key problems in EVT is related to the search for the optimal tail fraction. The appropriate choice of the threshold (the beginning of a tail of distribution) is ambiguous but fundamental in the estimation of the tail index or parameters of a Generalized Pareto Distribution (GPD) and then in the calculation of tail-related risk measures, e.g., Value at Risk (VaR) or Expected Shortfall (ES).
While there are many concepts and approaches presented in the literature, none of them has been indicated to be suitable, and there is no single answer where the distribution tail begins. Searching for the tail of distribution is always a trade-off between bias and variance. If a threshold is specified too low, the method estimates indicate a high bias. The more the threshold is away from the tail, the more the empirical distribution of extrema deviates from the theoretical model. On the other hand, a too high threshold results in high variance of the model estimates, since not many data exceed the threshold (Coles 2001). Numerous authors apply a fixed percentile of the total sample size as the threshold, usually 10%, 7.5%, 5% or 1% of the upper statistics (Karmakar and Shukla 2014; Jones 2003; Bee et al. 2016; Fernandez 2005; Totić and Božović 2016; Gençay et al. 2003; Gençay and Selçuk 2004; Echaust and Just 2020a; Daníelsson and Morimoto 2000; Echaust 2021; Longin 2000; McNeil and Frey 2000). More sophisticated approaches to threshold selection are based on graphical techniques. A frequently used procedure relies on the analysis of a mean excess plot, which represents the mean of the excesses of the threshold u. This method was applied in Aboura (2014), Cifter (2011), Gilli and Këllezi (2006), Łuczak and Just (2020) and Omari et al. (2017). The change in the pattern for a very high threshold is observed in this plot; therefore, the choice of a threshold is ambiguous. The second graphical procedure for threshold selection is to estimate parameters of the Generalized Pareto Distribution at a range of thresholds. Above a threshold u, at which the model is valid, the estimate of the shape parameter should be approximately constant, whereas the estimate of the scale parameter should be linear. Another very popular threshold selection technique involves a graphical representation of Hill (Beirlant et al. 2011; Hill 1975), Pickands (Pickands 1975) or Dekkers–Einmahl–de Haan estimators (Dekkers et al. 1989). However, the graphical-based threshold choice procedures require identification of stable regions in the graphs; therefore, they are highly subjective and difficult to apply in empirical studies. Finally, there are studies that use the choice of threshold based on optimization procedures (Caeiro and Gomes 2016; Danielsson et al. 2016; Echaust and Just 2020b; Scarrott and MacDonald 2012). An extensive overview of such methods is provided in Section 2.
This paper provides an empirical study on various methods of optimal tail selection in risk measurement. The results are to indicate which of them may be used in practice by investors and financial and regulatory institutions. According to the Amendment to Basel I (1996), market risk should be calculated as the Value at Risk using the 99th percentile confidence level. A number of weaknesses have been identified by using VaR for determining capital requirement during the global financial crisis (GFC) of 2007–2009. Therefore, in the Basel III document (2013) the Committee proposed to replace VaR with ES at the 97.5% confidence level. This confidence level is to provide a similar level of risk as the 99th percentile VaR while ensuring a number of benefits, including generally more stable risk estimates and often lesser sensitivity to extreme outlier observations. Moreover, in contrast to VaR, ES is a subadditive risk measure and thus does not hinder the appropriate risk measurement of investment portfolios. In our empirical study, we analyze twelve automatic methods of threshold selection. Unfortunately, some of these methods estimate a threshold at a very high level to estimate these risk measures at the recommended confidence levels. The research objective of this paper is to compare these methods and to identify these methods that can be recognized as useful in risk measurement. Some models that perform well in simulation studies, based on theoretical distributions, may not perform well when real data are in use (Danielsson et al. 2016). This paper sheds new light on the linkage between tail selection and financial applications.
This study contributes to the literature in two ways. Firstly, we provide a review of tail selection methods most popular in the literature. Secondly, we compare the results of the tail choice computed with a variety of methods. All threshold estimates are calculated in the tea R package (Ossberger 2020). We are able to specify which of them tend to produce systematically too high threshold levels to calculate risk measures at commonly accepted confidence levels. Therefore, our results may be helpful for all investors, risk managers and academics who apply tail models in practice.
The remainder of the paper is organized as follows. Section 2 provides an overview of optimal tail selection methods. Section 3 describes the data used in the empirical study. Section 4 presents empirical results and their discussion, while Section 5 concludes the study.

2. Optimization Approaches for Threshold Selection

In the empirical study we use twelve methods to choose the optimal number of data in the tail:
  • The Mean Absolute Deviation Distance metric (MAD-Distance metric) method,
  • The Kolmogorov–Smirnov Distance metric (KS-Distance metric) method,
  • The Reiss and Thomas (RT) procedures,
  • The Path Stability (PS) method,
  • The automated Eyeball (Eyeball) method,
  • The Guillou and Hall (GH) procedure,
  • The minimization of the Asymptotic Mean Squared Error (dAMSE) method,
  • The Hall and Welsh (HW) procedure,
  • The single bootstrap (Hall) procedure proposed by Hall,
  • The single bootstrap (Himp) procedure proposed by Caeiro and Gomes,
  • The double bootstrap (Gomes) procedure proposed by Gomes, Figueiredo and Neves,
  • The double bootstrap (Danielsson) procedure proposed by Danielsson, de Haan, Peng and de Vries.

2.1. Mean Absolute Deviation Distance Metric (MAD-Distance Metric) and Kolmogorov–Smirnov Distance Metric (KS-Distance Metric) Methods

The MAD-Distance metric (MAD Dis) and KS-Distance metric (KS Dis) methods presented by Danielsson et al. (2016) are based on minimizing the distance between the largest upper order statistics of the dataset, i.e., the distance between the empirical tail and the theoretical tail of a Pareto distribution. The tail index α of this distribution is estimated using the Hill estimator (Hill 1975):
ξ ^ k , n H = 1 α ^ = 1 k i = 0 k 1 ( log ( x n i , n ) log ( x n k , n ) ) ,
where k is the number of upper order statistics used to estimate parameter α . The distance, which is measured in the quantile dimension, is minimized with respect to k . The optimal number of the upper order statistics k is called k 0 . Determining k 0 is equivalent to specifying the number of extreme values (sample size in the tail). The MAD-Distance metric method uses the mean absolute deviation penalty functions:
Q n M A D = 1 T j = 1 T | x n j , n q ( j , k ) |  
and the KS-Distance metric takes the maximum absolute deviation penalty functions:
Q n K S = inf k [ sup j | x n j , n q ( j , k ) | ] ,   ( j = 1 , , T ) ,
where T > k is the region over which the distance metric is measured, x n j , n is the empirical quantile, and q ( j , k ) is the quantile estimated by x n k + 1 , n ( k / j ) 1 / α ^ k with the Hill estimator α ^ k . In these approaches, the size of the upper tail is denoted by t s . The motivation for the selection of these methods is that the tail-related risk measures (e.g., VaR and ES) are concepts related to quantiles at a given probability level.

2.2. The Reiss and Thomas (RT) Procedures

Reiss and Thomas (2007) proposed alternative procedures to select the optimal number of data in the tail, k 0 . These approaches select the lowest upper order statistic by minimizing the expression (RT1):
1 k i = 1 k i β | ξ ^ i H median ( ξ ^ i H , , ξ ^ k H ) |
or expression (RT2):
1 k i = 1 k i β ( ξ ^ i H ξ ^ k H ) 2 ,
where ξ ^ i H is the Hill estimator, and the tuning parameter satisfies 0 β 0.5 .
In practice, automated implementation of these procedures is unreliable for small k ; thus, a minimum value of k , k m i n , is usually used (Scarrott and MacDonald 2012.)

2.3. The Path Stability (PS) Method

The Path Stability method is a heuristic algorithm proposed by Caeiro and Gomes (2016). The algorithm looks for a stable region of the path stability, i.e., the plot of a tail index with respect to k . The optimal number of data in the tail, k 0 , is identified by the algorithm that consists of six steps:
First step. Given an observed data ( x 1 , , x n ) , compute T ( k ) = ξ ^ k , n H ( k = 1 , , n 1 ) using the Hill estimator.
Second step. Take j 0 as a minimum value of j , a non-negative integer, such that the rounded values, to j decimal places, of the estimates in the first step are distinct. Define a k ( T ) ( j ) = round ( T ( k ) , j ) ( k = 1 , , n 1 ) , the rounded values of T ( k ) to j decimal places.
Third step. Consider the set of k values associated to equal consecutive values of a k ( T ) ( j 0 ) obtained in the second step. Set k m a x ( T ) and k m i n ( T ) the maximum and minimum values, respectively, of the set with the largest range. The largest run size is l T = k m a x ( T ) k m i n ( T ) .
Fourth step. Consider all those estimates, T ( k ) ( k m i n ( T ) k k m a x ( T ) ) , now with two additional decimal places, i.e., compute T ( k ) = a k ( T ) ( j 0 + 2 ) . Obtain the mode of T ( k ) and denote K T the set of k -values associated with this mode.
Fifth step. Obtain k ^ T , the maximum value of K T .
Sixth step. Finally calculate ξ ^ H | P S = ξ ^ k ^ T , n H .

2.4. The Automated Eyeball (Eyeball) Method

Another heuristic algorithm is the automated Eyeball method proposed by Resnick and Stărică (1997) and explored in (Danielsson et al. 2016). The method finds a stable region in the Hill plot by defining a moving window. This method is based on a heuristic algorithm, which looks for a substantial drop in variance of the Hill plot as k is increased. The following estimator is used to select the optimal number of data in the tail:
k 0 e y e = min { k { 2 , , n + w } | h < 1 w i = 1 w I { α ^ ( k ) ε < α ^ ( k + i ) < α ^ ( k ) + ε } } ,
where w is the size of the moving window (usually 1% of the full sample); n + is the number of positive data; ε is the size of the range in which estimates may vary (e.g., 0.3); and h is the percentage of data inside the moving window, which should occur in the tolerable range (usually around 90%).

2.5. The Guillou and Hall (GH) Procedure

The procedure for choosing the optimal threshold when fitting the Hill estimator of a tail exponent to extreme value data proposed by Guillou and Hall (2001) is based on bias reduction. The optimal number of data in the tail, k 0 , in this method is identified by the procedure, which consists of three steps (e.g., see, Caeiro and Gomes 2016):
First step. Given an observed data ( x 1 , , x n ) , compute:
Q n ( k ) = { 1 2 k / 2 + 1 j = k k / 2 k + k / 2 T n 2 ( j ) } 1 / 2 ( 1 k   and   k + k / 2 < n ) ,
where
T n ( k ) = 3 k 3 i = 1 k ( k 2 i + 1 ) U i 1 k i = 1 k U i ,   U i = i i = 1 k ( log x n i + 1 : n log x n i : n ) ( 1 i k < n ) .
Second step. Given c c r i t = 1.25 , consider the choice
k 0 G H = inf { k : | Q n ( j ) | c c r i t , j k } .
Third step. Next, obtain ξ ^ G H = ξ ^ k ^ 0 G H , n G H .
Guillou and Hall (2001) investigated a wider range of c c r i t , but they suggest values between 1.25 and 1.5.

2.6. Minimization of the Asymptotic Mean Squared Error (dAMSE) Method

The minimization of the Asymptotic Mean Squared Error (dAMSE) method was presented by Caeiro and Gomes (2016). The optimal number of data in the tail, k 0 , in this method is identified by minimizing the Asymptotic Mean Squared Error (AMSE) of the Hill estimator with respect to k . This algorithm consists of five steps:
First step. Given an observed data ( x 1 , , x n ) , calculate for the tuning parameters τ = 0 and τ = 1 , the values of ρ ^ τ ( k ) , that have the form:
ρ ^ τ ( k ) = | 3 ( W k , n ( τ ) 1 ) / ( W k , n ( τ ) 3 ) |
dependent on the statistics:
W k , n ( τ ) = { ( M k , n ( 1 ) ) τ ( M k , n ( 2 ) / 2 ) τ / 2 ( M k , n ( 2 ) / 2 ) τ / 2 ( M k , n ( 3 ) / 6 ) τ / 3 ,   if   τ 0 ,   log ( M k , n ( 1 ) ) log ( M k , n ( 2 ) / 2 ) / 2 log ( M k , n ( 2 ) / 2 ) / 2 log ( M k , n ( 3 ) / 6 ) / 3 ,   if   τ = 0 ,  
where
M k , n ( j ) = 1 k i = 1 k ( log x n i + 1 : n log x n k : n ) j ( j = 1 , 2 , 3 ) .
Second step. Consider K = ( n 0.995 , n 0.999 ) . Calculate the median of { ρ ^ τ ( k ) } k K , denoted χ τ , and compute I τ = k K ( ρ ^ τ ( k ) χ τ ) 2   ( τ = 0 , 1 ) .
Then, select the tuning parameter, τ * = 0 , if I 0 I 1 , otherwise select τ * = 1 .
Third step. Work with ρ ^ ρ ^ τ * ( k ) = ρ ^ τ * ( k 01 ) and β ^ β ^ τ * ( k ) = β ^ ρ ^ τ * ( k 01 ) , with k 01 = n 0.999 and
β ^ ρ ^ ( k ) = ( k n ) ρ ^ d k ( ρ ^ ) D k ( 0 ) D k ( ρ ^ ) d k ( ρ ^ ) D k ( ρ ^ ) D k ( 2 ρ ^ ) ,
dependent on the estimator ρ ^ = ρ ^ τ * ( k 01 ) , and where for any α 0 ,
d k ( α ) = 1 k i = 1 k ( i / k ) α , D k ( α ) = 1 k i = 1 k ( i / k ) α U i ,
where
U i = i i = 1 k ( log x n i + 1 : n log x n k : n ) ( 1 i k < n ) ,
the scaled log-spacings.
Fourth step. On the basis of estimators β ^ and ρ ^ , calculate
k ^ 0 H = ( ( 1 ρ ^ ) 2 n 2 ρ ^ 2 ρ ^ β ^ 2 ) 1 / ( 1 2 ρ ^ ) .
Fifth step. Finally, compute ξ ^ H = ξ ^ k ^ 0 H , n H .

2.7. The Hall and Welsh (HW) Procedure

Hall and Welsh (1985) proposed a procedure to identify the optimal number of upper order statistics for the Hill estimator by minimizing the AMSE criterion. This method is based on a similar algorithm as that in the dAMSE method but with a different estimation of second-order parameters dependent upon three tuning parameters, σ < τ 1 < τ 2 (Caeiro and Gomes 2016). The Hall and Welsh procedure is quite sensitive to changes in the tuning parameters, but the parameter values proposed by Hall and Welsh ( σ = 0.5 , τ 1 = 0.9 , τ 2 = 0.95 ) seem to work well (Gomes and Oliveira 2001). This method may be presented in five steps (e.g., see Gomes and Oliveira 2001; Hall and Welsh 1985):
First step. For σ = 0.5 , τ 1 = 0.9 , τ 2 = 0.95 , obtain s = [ n σ ] , t 1 = [ n τ 1 ] , t 2 = [ n τ 2 ] .
Second step. Obtain
ρ ^ n = | log | ( 1 / M t 1 , n ( 1 ) ) ( 1 / M s , n ( 1 ) ) ( 1 / M t 2 , n ( 1 ) ) ( 1 / M s , n ( 1 ) ) | / log t 1 t 2   | ,
where M k , n ( 1 ) = 1 k i = 1 k ( log x n i + 1 : n log x n k : n )   ( k = t 1 ,   t 2 ,   s ) .
Third step. Put
λ ^ n = | 2 ρ ^ n ( n t 1 ) ρ ^ n M s , n ( 1 ) M t 1 , n ( 1 ) M t 1 , n ( 1 )   | 2 / ( 1 2 ρ ^ n ) .
Fourth step. Compute
k ^ 0 H W = [ λ ^ n 2 ρ ^ n / ( 1 2 ρ ^ n ) ] .
If k ^ 0 H W [ 1 , n ) , select another method, otherwise continue.
Fifth step. Finally, compute ξ ^ H W = ξ ^ k ^ 0 H W , n H W .
The consistency of this procedure holds only for ρ ( σ / 2 ( 1 τ 1 ) ,   σ / 2 ( 1 σ ) ) .

2.8. The Hall Single Bootstrap (Hall) Procedure

In 1990, Hall introduced a bootstrap-based methodology for the estimation of the optimal tail fraction (see, e.g., Hall 1990; Gomes and Oliveira 2001). This method is applied to the Hall class of Pareto-type tails (Danielsson et al. 2016). Given a simple X _ n = ( X 1 , , X n ) from an unknown model F, and the functional ξ ^ k , n H = ϕ k ( X _ n ) , 1 k < n , consider the bootstrap sample X _ n 1 * = ( X 1 * , , X n 1 * ) , where n 1 n , from the model F n * ( x ) = 1 n i = 1 n I { X i x } , the empirical d.f. associated with the original sample X _ n . Given an initial value k , i.e., k a u x , such that ξ ^ k a u x , n H is a consistent estimator of ξ , Hall proposes the minimization of the bootstrap estimate of the Mean Squared Error (MSE) of ξ ^ k , n 1 H :
M S E * ( n 1 ,   k ) = E [ { ξ ^ k , n 1 H * ξ ^ k a u x , n H } 2 | X _ n ] .
Next, the value k 0 * ( n 1 ) is selected so that it minimizes M S E * ( n 1 ,   k ) , and the tail fraction is determined from the formula:
k ^ 0 * ( n ) = k 0 * ( n 1 ) ( n / n 1 ) 2 / 3 .
In the algorithm, the number of bootstrap repetitions is denoted by B .
Most often, k a u x and n 1 = n ε are taken as 2 n and n 0.955 , respectively (Caeiro and Gomes 2016). Gomes and Oliveira (2001) noted that there is a disturbing sensitivity of the method to the initial value of k a u x ; in turn, there is almost independence on n 1 . These facts contributed to the search for an alternative bootstrap methodology.

2.9. The Single Bootstrap (Himp) Procedure Proposed by Caeiro and Gomes

An improved version of Hall bootstrap methodology was introduced in Caeiro and Gomes (2014). They proposed a single bootstrap procedure consisting of five steps:
First step. Given an observed sample ( x 1 , , x n ) , compute the estimates ρ ^ and β ^ of the second order parameters ρ and β as described in the dAMSE algorithm.
Second step. Consider a sub-sample size n 1 = o ( n ) . For l from 1 until B , generate independently B bootstrap samples ( x 1 * , , x n 1 * ) , of sizes n 1 , from the empirical distribution F n * ( x ) = 1 n i = 1 n I { X i x } associated with the observed data ( x 1 , , x n ) .
Third step. Denoting T k , n 1 ( 1 ) * the bootstrap counterpart of T k , n 1 ( 1 ) = ξ ^ k / 2 , n 1 H ξ ^ k , n 1 H , obtain t k , n 1 , l * ( 1 l B ) , the observed values of T k , n 1 ( 1 ) . For k = 2 ,   ,   n 1 1 , compute
M S E * ( n 1 ,   k ) = 1 B l = 1 B ( t k , n 1 , l * ) 2
and obtain
k ^ 0 | T * ( n 1   ) = argmin 1 < k < n 1 M S E * ( n 1 ,   k ) .
Fourth step. Compute the threshold estimate
k ^ 0 * ( n ) = ( 1 2 ρ ^ ) 2 / ( 1 2 ρ ^ ) k ^ 0 | T * ( n 1 ) / ( n / n 1 ) 2 ρ ^ / ( 1 2 ρ ^ ) + 1 .
If k ^ 0 * ( n ) [ 1 , n ) go back to the second step generating different bootstrap samples.
Fifth step. Finally, obtain ξ ^ * = ξ ^ k ^ 0 * ( n ) ,   n H .

2.10. The Double Bootstrap (Gomes) Procedure Proposed by Gomes, Figueiredo and Neves

The double bootstrap procedure described by Gomes et al. (2012) leads to an increased precision of the result with the same number B of bootstrap samples generated (Caeiro and Gomes 2016). This algorithm consists of five steps:
Let ξ ^ k , n H denote the Hill estimator.
First step. Given an observed sample ( x 1 , , x n ) , compute the estimates ρ ^ and β ^ of the second order parameters ρ and β as described in the dAMSE algorithm.
Second step. Consider a sub-sample size n 1 = o ( n ) and n 2 = n 1 2 / n + 1 . For l from 1 until B , generate independently B bootstrap samples ( x 1 * , , x n 2 * ) and ( x 1 * , , x n 2 * , x n 2 + 1 * , , x n 1 * ) of sizes n 2 and n 1 , respectively, from the empirical distribution F n * ( x ) = 1 n i = 1 n I { X i x } associated with the observed data ( x 1 , , x n ) .
Third step. Denoting T k , n i * the bootstrap counterpart of T k , n i = ξ ^ k / 2 , n i H ξ ^ k , n i H , obtain t k , n i , l * ( 1 l B , i = 1 , 2 ) , the observed values of T k , n . For k = 2 ,   ,   n i 1 , and i = 1.2 compute
M S E * ( n i ,   k ) = 1 B l = 1 B ( t k , n i , l * ) 2
and obtain
k ^ 0 | T * ( n i   ) = argmin 1 < k < n i M S E * ( n i ,   k ) .
Fourth step. Compute the threshold estimate
k ^ 0 * ( n ) = ( 1 2 ρ ^ ) 2 / ( 1 2 ρ ^ ) ( k ^ 0 | T * ( n 1 ) ) 2 / k ^ 0 | T * ( n 2 ) + 1 .
If k ^ 0 * ( n ) [ 1 , n ) , go back to the second step generating different bootstrap samples.
Fifth step. Finally obtain ξ ^ * = ξ ^ k ^ 0 * ( n ) , n H .

2.11. The Double Bootstrap (Danielsson) Procedure Proposed by Danielsson, de Haan, Peng and de Vries

The procedure proposed in Danielsson et al. (2001) for selecting the optimal sample fraction in tail index estimation simulates the AMSE criterion of the Hill estimator using an auxiliary statistic. For the AMSE, ξ is unknown. To solve this problem, the theoretical ξ in the AMSE formula is replaced with a control variate. Since a simple bootstrap is inconsistent in the tail area, a sub-sample bootstrap is used. Moreover, to be able to scale the sub-sample MSE back to the sample size, a second, even smaller sub-sample bootstrap is needed. The AMSE of the control variate is
T k 1 , n 1 = ( [ M k 1 ,   n 1   * ( 2 ) 2 ( ξ k 1 , n 1 * H ) 2 ] 2 ) ,
where
M k 1 , n 1   * ( 2 ) = 1 k 1 i = 1 k 1   ( log ( x n 1 i + 1 : n 1 ) log ( x n 1 k 1 : n 1 ) ) 2   .
In this procedure, n 1 = n ε is the sub-sample for the bootstrap (the number of bootstrap repetitions is denoted by B ). In the first step, the T function is minimized over two dimensions: n 1 and k 1 . In the next step, given the optimal n 1 and k 1 , a second bootstrap with a sub-sample size n 2 = n 1 2 / n is made to find k 2 . Finally, the optimal number of order statistics is determined as follows:
k ^ 0 * ( n ) = k 1 2 k 2 [ ( log ( k 1 ) ) 2 ( 2 log ( n 1 ) log ( k 1 ) ) 2 ] ( log ( n 1 ) log ( k 1 ) ) / log ( n 1 ) + 1 .

3. Data

We employed several approaches and compared the results of the optimal tail selection for forty-eight world indices computed using twelve methods with different parameters (56 different variants of tail estimation for each asset). The source of the price data is the Polish financial service stooq.pl. The country sample encompasses both developed and emerging markets and comes from different continents. We conducted the analysis for the period 2000–Q1 2020 and selected four sub-periods to demonstrate the robustness of our analysis with reference to the time period under consideration. The first sub-period covers 750 initial index quotes, the second sub-period covers the next 750 quotes, the third sub-period covers 750 index quotes starting in 2008 in the subprime crisis, and the fourth sub-period covers another 750 index quotes ending at the beginning of the COVID-19 crisis. This means that for each sub-period there are 749 log returns. In turn, the entire period ranges from 4520 to 5176 log-returns depending on the number of trading days in the market taken into account. The basic characteristics for the entire research period are given in Table 1. The average return ranges from −0.05% for Greece to 0.08% for Argentina. Higher volatility is found for the emerging markets, especially for Ukraine, Russia, Turkey and Argentina. Since our research period covers two crises, i.e., the global financial crisis in 2007–2009, the European debt crisis (EDC) in 2010–2012 and the beginning of the COVID-19 outbreak, almost all time-series of returns demonstrate left skewness. The Iceland index is characterized by distinctive kurtosis. Certainly, it is an effect of Iceland’s systemic banking collapse in 2008.

4. Results of the Empirical Study

Table 2 presents the results of an optimal threshold selection for the left and right tails. More precisely, it shows the percentile ranks for the entire period. Table 3 and Table 4 show the results for the first and third sub-periods. For the sake of brevity, we do not report the results for the second and fourth sub-periods. Note that the results for these periods do not differ substantially from those presented in Table 3 and Table 4.
Not all methods perform well in finite samples. Some that perform well in simulation studies based on theoretical distributions may not be suitable in financial applications. We can distinguish several methods that produce very high threshold estimates and pick a small number of data in the tails. They include the MAD Dis, KS Dis, RT1, RT2, Eyeball, GH, Himp, Gomes and Danielsson approaches. These methods tend to produce thresholds above the 99th percentile (max. above 0.99); thus, they prevent an estimation of VaR or ES at commonly accepted confidence levels. In particular, the KS Dis, RT1 and RT2 for kmin = 2, Eyeball with the tuning parameter w = 0.01, GH and Danielsson methods systematically produce high threshold estimates, since the median exceeds the 98th percentile. The findings support the results of Danielsson et al. (2016), who argued that the Eyeball and KS Dis methods tend to pick the threshold close to the maximum of distribution. For shorter time-series (in sub-periods), the RT1, RT2 for kmin = 2 and Danielsson methods are the most restrictive and systematically set the threshold at too high levels to be used in financial applications. This finding is in line with the observation by Scarrott and MacDonald (2012), who pointed out that the RT1 approach is unreliable for a small k despite the weighting by i β . In turn, Reiss and Thomas (2007) suggested using alternative distance metrics or weighting schemes when we deal with limited data. Methods based on minimizing the asymptotic MSE, especially the bootstrap-based methods, do not perform well in empirical studies (Danielsson et al. 2016). Similarly, Ferreira et al. (2003) noted that these methods do not give satisfactory results for samples of size under approximately 2000.
The other methods estimate the threshold in a more conservative way, lower than the 99th quantile. Such a choice of threshold guarantees enough data in the tail to calculate Value at Risk at the 99th confidence level. These are the HW, dAMSE, Hall (excluding parameters satisfying B = 10,000, ε = 0.955, k a u x   =   n + ), Eyeball ( w   =   0.025 , h   =   0.9 , ε   =   0.3 ) and PS approaches. However, only two of them, i.e., the PS and HW algorithms, satisfy the Basel III requirements with reference to the Expected Shortfall, since their maxima are below the 97.5th percentile. Although HW performs well for long time-series, it fails in shorter series. In sub-periods, the optimizing procedure does not converge for several assets either for the upper or lower tails. Two algorithms, i.e., PS and HW, exhibit a very high range and standard deviation. It means that the threshold is highly volatile and cannot be substituted by a fixed percentile of the dataset as a threshold in practical applications. On the other hand, the Eyeball and dAMSE approaches produce less volatile estimations. The dAMSE method establishes the threshold close to the 95th percentile, while the Eyeball method is even more stable, but it indicates a much higher threshold, i.e., at the 98th percentile. Thus, we cannot decide which method is the optimal choice for financial applications. However, we can indicate those approaches which may be useful in practice. A majority of the analyzed methods, i.e., MAD Dis, KS Dis, PS, GH, HW, Eyeball, dAMSE, Gomes and Himp, indicate a higher left threshold than the right one. High returns might be perceived as extreme returns when they are positive, but they do not have to be recognized as extreme when they are negative. This result suggests asymmetry between the left and right tails. This result seems to be general since it holds in the entire period as well as the sub-periods.
Having specified the distribution of tails, the next step in our study comprises a comparison of the methods recognized as applicable in a risk measurement. Two different distances between threshold estimates are compared. The results for the entire research period are presented in Table 5 and Table 6. Average absolute differences are shown in Table 5 and root mean squared differences in Table 6. As can be seen, the methods may be divided into two groups. The Eyeball, dAMSE and Hall methods show a relatively large deviation from the two other methods. Note that all the three above-mentioned methodologies utilize a low fraction (median above the 97th and below the 99th percentiles) of the total sample. The other group consists of the PS and HW methods. The distance between the threshold estimates is relatively close, and they both pick the threshold far away from the maximum (median below the 90th percentile). Danielsson et al. (2016) documented the similarity in choosing optimal sample fractions between the KS Distance and Eyeball methods and between the approach proposed by Drees and Kaufmann (1998) and the Danielsson method. However, they cast doubt on the applicability of the latter methods for real-world empirical estimations. The results for the right tails and the left tails do not indicate any significant differences, but approximately the same relative differences are preserved between the methodologies.

5. Conclusions

The selection of the threshold which separates the tail from the middle part of a return distribution is crucial in the estimation of tail-related risk measures. Unfortunately, the right threshold is unknown in empirical applications. This paper presents evidence for twelve different optimal tail selection methods in risk measurement. We selected the optimal tail fraction for daily return time-series for forty-eight world indices. We found that many methods tend to set the optimal tail above the 99th percentile; therefore, their applicability in risk management is very limited. The methods that perform well in long and relatively short time-series include the minimization of the Asymptotic Mean Squared Error (dAMSE), the single bootstrap procedure (Hall), Eyeball with carefully selected tuning parameters and Path Stability (PS). The methods can be divided into two categories. The first three methods produce threshold estimates around the 98th percentile for the entire research period and their estimates are close to one another. This level of the threshold allows the estimation of the Value at Risk at the 99th confidence level, but not the Expected Shortfall at the 97.5th confidence level recommended by the Basel Committee. In the sub-periods, the estimated thresholds are lower, and for the dAMSE and Hall (excluding parametrization: B = 10,000, ε = 0.955, kaux = n + ) methods, their maximum is below the 97.5th percentile. The PS method sets the optimal threshold much lower than the other methodologies (below the 95th percentile for the entire research period and sub-periods). The estimates of PS are relatively volatile, suggesting a difference from the fixed percentage of the total sample size, commonly presented in the literature. The PS method is based on a rather simple algorithm; thus, it can be easily implemented in a risk management process.

Author Contributions

Conceptualization, M.J. and K.E.; methodology, M.J. and K.E.; software, M.J. and K.E.; validation, M.J. and K.E.; formal analysis, M.J. and K.E.; investigation, M.J. and K.E.; resources, M.J.; data curation, M.J.; writing—original draft preparation, M.J. and K.E.; writing—review and editing, M.J. and K.E. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Poznań University of Life Sciences.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Aboura, Sofiane. 2014. When the U.S. Stock Market Becomes Extreme? Risks 2: 211–25. [Google Scholar] [CrossRef] [Green Version]
  2. Akhundjanov, Sherzod B., and Lauren Chamberlain. 2019. The Power-Law Distribution of Agricultural Land Size. Journal of Applied Statistics 46: 3044–56. [Google Scholar] [CrossRef] [Green Version]
  3. Amendment to Basel I. 1996. Basel Committee on Banking Supervision. Amendment to the Capital Accord to Incorporate Market Risk. Available online: https://www.bis.org/publ/bcbs24.htm (accessed on 8 April 2021).
  4. Basel III document. 2013. Basel Committee on Banking Supervision Consultative Document. Fundamental Review of the Trading Book: A Revised Market Risk Framework. Available online: https://www.google.com.hk/url?sa=t&rct=j&q=&esrc=s&source=web&cd=&ved=2ahUKEwjHn96N3vDvAhWQc3AKHayJA68QFjADegQIBRAD&url=https%3A%2F%2Fwww.bis.org%2Fpubl%2Fbcbs265.pdf&usg=AOvVaw3aSWbbf3GU7A7n90x5zjvi (accessed on 8 April 2021).
  5. Bee, Marco, Debbie J. Dupuis, and Luca Trapin. 2016. Realizing the Extremes: Estimation of Tail-Risk Measures from a High-Frequency Perspective. Journal of Empirical Finance. [Google Scholar] [CrossRef]
  6. Beirlant, Jan, Edwin Boniphace, and Goedele Dierckx. 2011. Generalized Sum Plots. Revstat Statistical Journal 9: 181–98. [Google Scholar]
  7. Caeiro, Frederico, and M. Ivette Gomes. 2014. On the Bootstrap Methodology for the Estimation of the Tail Sample Fraction. Proceedings of COMPSTAT 2014: 545–52. [Google Scholar]
  8. Caeiro, Frederico, and M. Ivette Gomes. 2016. Threshold Selection in Extreme Value Analysis. Extreme Value Modeling and Risk Analysis: Methods and Applications 1: 69–86. [Google Scholar] [CrossRef]
  9. Cifter, Atilla. 2011. Value-at-Risk Estimation with Wavelet-Based Extreme Value Theory: Evidence from Emerging Markets. Physica A: Statistical Mechanics and Its Applications 390: 2356–67. [Google Scholar] [CrossRef]
  10. Coles, Stuart. 2001. An Introduction to Statistical Modeling of Extreme Values. Springer Series in Statistics; London: Springer. [Google Scholar]
  11. Daníelsson, Jón, and Yuji Morimoto. 2000. Forecasting Extreme Financial Risk: A Critical Analysis of Practical Methods for the Japanese Market. Forecasting Extreme Financial Risk: A Critical Analysis of Practical Methods for the Japanese Market 18: 25–48. [Google Scholar]
  12. Danielsson, Jon, Laurens de Haan, Liang Peng, and Casper G. de Vries. 2001. Using a Bootstrap Method to Choose the Sample Fraction in Tail Index Estimation. Journal of Multivariate Analysis 76: 226–48. [Google Scholar] [CrossRef] [Green Version]
  13. Danielsson, Jon, Lerby Murat Ergun, Laurens de Haan, and Casper G. de Vries. 2016. Tail Index Estimation: Quantile Driven Threshold Selection. SRC Working Paper 58. London: Systemic Risk Centre, The London School of Economics and Political Science. [Google Scholar]
  14. Dekkers, Arnold L. M., John H. J. Einmahl, and Laurens de Haan. 1989. A Moment Estimator for the Index of an Extreme-Value Distribution. The Annals of Statistics 1989: 1833–55. [Google Scholar] [CrossRef]
  15. Drees, Holger, and Edgar Kaufmann. 1998. Selecting the Optimal Sample Fraction in Univariate Extreme Value Estimation. Stochastic Processes and Their Applications 75: 149–72. [Google Scholar] [CrossRef] [Green Version]
  16. Echaust, Krzysztof. 2021. Asymmetric Tail Dependence between Stock Market Returns and Implied Volatility. Journal of Economic Asymmetries 23: e00190. [Google Scholar] [CrossRef]
  17. Echaust, Krzysztof, and Małgorzata Just. 2020a. A Comparison of Conditional and Unconditional VaR Models. Paper presented at the International Scientific Conference Hradec Economic Days, Hradec Králové, Czech Republic, April 2–3; vol. 10, pp. 124–35. [Google Scholar] [CrossRef]
  18. Echaust, Krzysztof, and Małgorzata Just. 2020b. Value at Risk Estimation Using the Garch-Evt Approach with Optimal Tail Selection. Mathematics 8: 114. [Google Scholar] [CrossRef] [Green Version]
  19. Embrechts, Paul, and Hanspeter Schmidli. 1994. Modelling of Extremal Events in Insurance and Finance. ZOR Zeitschrift Für Operations Research Mathematical Methods of Operations Research 39: 1–34. [Google Scholar] [CrossRef]
  20. Embrechts, Paul, Claudia Klüppelberg, and Thomas Mikosch. 1997. Modelling Extremal Events for Insurance and Finance. New York: Springer. [Google Scholar]
  21. Fernandez, Viviana. 2005. Risk Management under Extreme Events. International Review of Financial Analysis 14: 113–48. [Google Scholar] [CrossRef]
  22. Ferreira, Ana, Laurens de Haan, and Liang Peng. 2003. On Optimising the Estimation of High Quantiles of a Probability Distribution. Statistics 37: 401–34. [Google Scholar] [CrossRef]
  23. Gençay, Ramazan, and Faruk Selçuk. 2004. Extreme Value Theory and Value-at-Risk: Relative Performance in Emerging Markets. International Journal of Forecasting 20: 287–3033. [Google Scholar] [CrossRef] [Green Version]
  24. Gençay, Ramazan, Faruk Selçuk, and Abdurrahman Ulugülyaǧci. 2003. High Volatility, Thick Tails and Extreme Value Theory in Value-at-Risk Estimation. Insurance: Mathematics and Economics 33: 337–56. [Google Scholar] [CrossRef] [Green Version]
  25. Gilli, Manfred, and Evis Këllezi. 2006. An Application of Extreme Value Theory for Measuring Financial Risk. Computational Economics 27: 207–28. [Google Scholar] [CrossRef] [Green Version]
  26. Gomes, M. Ivette, and Orlando Oliveira. 2001. The Bootstrap Methodology in Statistics of Extremes—Choice of the Optimal Sample Fraction. Extremes 4: 331–58. [Google Scholar] [CrossRef]
  27. Gomes, M. Ivette, Fernanda Figueiredo, and M. Manuela Neves. 2012. Adaptive Estimation of Heavy Right Tails: Resampling-Based Methods in Action. Extremes 15: 463–89. [Google Scholar] [CrossRef]
  28. Guillou, Armelle, and Peter Hall. 2001. A Diagnostic for Selecting the Threshold in Extreme Value Analysis. Journal of the Royal Statistical Society. Series B: Statistical Methodology 63: 293–305. [Google Scholar] [CrossRef]
  29. Hall, Peter. 1990. Using the Bootstrap to Estimate Mean Squared Error and Select Smoothing Parameter in Nonparametric Problems. Journal of Multivariate Analysis 32: 177–203. [Google Scholar] [CrossRef] [Green Version]
  30. Hall, Peter, and Alan H. Welsh. 1985. Adaptive Estimates of Parameters of Regular Variation. The Annals of Statistics 13: 331–41. [Google Scholar] [CrossRef]
  31. Hill, Bruce M. 1975. A Simple General Approach to Inference About the Tail of a Distribution. The Annals of Statistics 3: 1163–74. [Google Scholar] [CrossRef]
  32. Jones, Christopher S. 2003. The Dynamics of Stochastic Volatility: Evidence from Underlying and Options Markets. Journal of Econometrics 116: 181–224. [Google Scholar] [CrossRef] [Green Version]
  33. Karmakar, Madhusudan, and Girja K. Shukla. 2014. Managing Extreme Risk in Some Major Stock Markets: An Extreme Value Approach. International Review of Economics and Finance 35: 1–25. [Google Scholar] [CrossRef]
  34. Longin, François M. 2000. From Value at Risk to Stress Testing: The Extreme Value Approach. Journal of Banking and Finance 24: 1097–130. [Google Scholar] [CrossRef]
  35. Loretan, Mico, and Peter C. B. Phillips. 1994. Testing the Covariance Stationarity to Several Financial Datasets. Journal of Empirical Finance 1: 211–48. [Google Scholar] [CrossRef]
  36. Łuczak, Aleksandra, and Małgorzata Just. 2020. The Positional MEF-TOPSIS Method for the Assessment of Complex Economic Phenomena in Territorial Units. Statistics in Transition 21: 157–72. [Google Scholar] [CrossRef]
  37. Mcneil, Alexander J. 1999. Extreme Value Theory for Risk Managers. Departement Mathematik ETH Zentrum 12: 217–37. [Google Scholar]
  38. McNeil, Alexander J., and Rüdiger Frey. 2000. Estimation of Tail-Related Risk Measures for Heteroscedastic Financial Time Series: An Extreme Value Approach. Journal of Empirical Finance 7: 271–300. [Google Scholar] [CrossRef]
  39. Omari, Cyprian O., Peter N. Mwita, and Antony G. Waititu. 2017. Using Conditional Extreme Value Theory to Estimate Value-at-Risk for Daily Currency Exchange Rates. Journal of Mathematical Finance 7: 846–70. [Google Scholar] [CrossRef] [Green Version]
  40. Ossberger, Johannes. 2020. Tea: Threshold Estimation Approaches. R Package Version 1.1. Madison: R Package. [Google Scholar] [CrossRef]
  41. Pickands, James. 1975. Statistical Inference Using Extreme Order Statistics. The Annals of Statistics 3. [Google Scholar] [CrossRef]
  42. Reiss, Rolf-Dieter, and Michael Thomas. 2007. Statistical Analysis of Extreme Values: With Applications to Insurance, Finance, Hydrology and Other Fields, 3rd ed. Basel: Birkhäuser. [Google Scholar] [CrossRef]
  43. Resnick, Sidney, and Cătălin Stărică. 1997. Smoothing the Hill Estimator. Advances in Applied Probability 29: 271–93. [Google Scholar] [CrossRef]
  44. Roth, Martin, Geurt Jongbloed, and T. Adri Buishand. 2016. Threshold Selection for Regional Peaks-over-Threshold Data. Journal of Applied Statistics 43: 1291–309. [Google Scholar] [CrossRef]
  45. Scarrott, Carl, and Anna MacDonald. 2012. A Review of Extreme Value Threshold Estimation and Uncertainty Quantification. Revstat Statistical Journal 10: 33–60. [Google Scholar]
  46. Smith, Richard L. 2003. Statistics of Extremes, with Applications in Environment, Insurance, and Finance. In Extreme Values in Finance, Telecommunications, and the Environment. Boca Raton: CRC Press. [Google Scholar] [CrossRef]
  47. Totić, Selena, and Miloš Božović. 2016. Tail Risk in Emerging Markets of Southeastern Europe. Applied Economics. [Google Scholar] [CrossRef]
  48. Vilasuso, Jon, and David Katz. 2000. Estimates of the Likelihood of Extreme Returns in International Stock Markets. Journal of Applied Statistics 27: 119–30. [Google Scholar] [CrossRef]
  49. Wang, Zongrun, Weitao Wu, Chao Chen, and Yanju Zhou. 2010. The Exchange Rate Risk of Chinese Yuan: Using VaR and ES Based on Extreme Value Theory. Journal of Applied Statistics 37: 265–82. [Google Scholar] [CrossRef]
Table 1. Summary statistics for the sample of countries.
Table 1. Summary statistics for the sample of countries.
Index Min Mean Max Sd. Dev.Skewness Ex. Kurtosis
ATX (Austria)−0.14670.00010.12020.0143−0.5910.01
AEX (Netherlands)−0.1138−0.00010.10030.0140−0.227.49
ATH (Greece)−0.1771−0.00050.13430.0189−0.557.84
BEL20 (Belgium)−0.15330.00000.09330.0126−0.4410.36
BET (Romania)−0.11900.00060.14580.0150−0.3411.67
BUX (Hungary)−0.12650.00030.13180.0149−0.247.08
CAC (France)−0.1310−0.00010.10590.0144−0.226.52
DAX (Germany)−0.13050.00010.10800.0148−0.175.99
FMIB (Italy)−0.1854−0.00020.10870.0154−0.599.34
HEX (Finland)−0.1740−0.00010.14560.0173−0.407.86
IBEX (Spain)−0.1515−0.00010.13480.0147−0.328.13
ICEX (Iceland)−1.06220.00000.05060.0186−39.032164.64
MOEX (Russia)−0.20660.00050.25230.0200−0.2515.99
OMXR (Latvia)−0.16330.00040.12090.0140−0.5620.56
OMXS (Sweden)−0.11170.00000.09870.0148−0.084.37
OMXT (Estonia)−0.10600.00040.12090.0105−0.2512.85
OMXV (Lithuania)−0.11940.00040.11000.0099−0.7724.07
OSEAX (Norway)−0.09830.00030.09190.0137−0.696.76
PSI20 (Portugal)−0.1038−0.00020.10200.0119−0.427.49
PX (Czechia)−0.16190.00010.12360.0134−0.5413.73
RTS (Russia)−0.21200.00030.20200.0215−0.549.27
SAX (Slovakia)−0.14810.00030.11880.0118−0.6914.81
SMI (Switzerland)−0.10130.00000.10790.0117−0.297.95
SOFIX (Bulgaria)−0.11360.00030.08390.0123−0.6411.56
UKX (UK)−0.11510.00000.09380.0119−0.348.34
UX (Ukraine)−0.34370.00050.39080.02600.8553.49
WIG20 (Poland)−0.14250.00000.08150.0150−0.334.22
XU100 (Turkey)−0.19980.00030.17770.0207−0.107.91
AOR (Australia)−0.10010.00010.06350.0098−0.919.63
HSI (Hong Kong)−0.13580.00010.13410.0146−0.117.90
JCI (Indonesia)−0.10950.00040.09700.0134−0.637.18
KLCI (Malaysia)−0.09980.00010.06630.0081−0.7911.20
KOSPI (South Korea)−0.12800.00010.11280.0150−0.577.16
NKX (Japan)−0.12110.00000.13230.0150−0.386.42
NZ50 (New Zealand)−0.07890.00040.06940.0072−0.7210.19
PSEI (Philippines)−0.14320.00020.16180.0131−0.1516.91
SET (Thailand)−0.16060.00020.10580.0132−0.9311.77
SHBS (China)−0.10290.00040.18400.0198−0.027.12
SNX (India)−0.14100.00030.15990.0147−0.419.37
STI (Singapore)−0.09090.00000.07530.0112−0.406.79
TWSE (Taiwan)−0.09940.00000.06520.0134−0.293.89
BVP (Brazil)−0.15990.00030.13680.0182−0.386.64
DJI (US)−0.13840.00010.10760.0119−0.4013.63
IPC (Mexico)−0.08270.00030.10440.0128−0.055.47
IPSA (Chile)−0.15220.00020.11800.0104−0.8422.35
MRV (Argentina)−0.47690.00080.16120.0232−2.0239.79
SPX (US)−0.12770.00010.10960.0125−0.4011.37
TSX (Canada)−0.13180.00010.11290.0114−0.9617.30
Note: Sd. Dev. means standard deviation; Ex. Kurtosis—excess kurtosis; BET—the beginning of trading 31 October 2000; SOFIX—the beginning of trading 26 November 2001; NZ50—the beginning of trading 3 January 2001. Symbols for indices are consistent with the symbols on the financial website, www.stooq.pl (Accessed on: 16 April 2020).
Table 2. Estimates of the threshold under different methods in the entire research period.
Table 2. Estimates of the threshold under different methods in the entire research period.
MethodLower TailUpper Tail
MinMeanMedianMaxSd. Dev.MinMeanMedianMaxSd. Dev.
MAD Dis ts = 0.250.89700.91920.90780.99960.03120.88320.91320.90210.99920.0342
MAD Dis ts = 0.200.90920.93370.92130.99860.02880.90210.93190.91630.99980.0336
MAD Dis ts = 0.150.91480.94990.94000.99860.02430.89980.95050.93610.99980.0290
MAD Dis ts = 0.100.94120.96490.95750.99920.01630.93890.96340.95390.99610.0188
MAD Dis ts = 0.050.95170.97920.97890.99680.01000.95610.98080.98340.99680.0113
KS Dis ts = 0.250.96050.99610.99820.99960.00740.96200.99330.99630.99960.0078
KS Dis ts = 0.200.96050.99590.99820.99960.00740.96200.99340.99680.99960.0078
KS Dis ts = 0.150.96050.99560.99800.99960.00750.96200.99330.99650.99960.0079
KS Dis ts = 0.100.96050.99580.99800.99960.00730.96200.99350.99650.99940.0077
KS Dis ts = 0.050.96050.99580.99800.99960.00730.96200.99360.99660.99940.0078
RT1 β = 0, kmin = 20.90840.97620.98000.99980.02380.93110.98300.99200.99980.0205
RT1 β = 0.1, kmin = 20.91850.98210.99170.99980.02140.93390.98630.99460.99980.0178
RT1 β = 0.2, kmin = 20.92590.98610.99700.99980.01930.93440.98970.99780.99980.0166
RT1 β = 0.3, kmin = 20.94780.99410.99900.99980.01080.94210.99310.99920.99980.0132
RT1 β = 0, kmin = 30.90860.97380.97730.99980.02410.93130.97900.98810.99980.0212
RT1 β = 0, kmin = 40.90880.97260.97720.99980.02440.90130.97490.97930.99980.0236
RT1 β = 0, kmin = 50.89790.96720.97050.99980.02620.90150.97440.97950.99980.0230
RT1 β = 0, kmin = 100.89890.96620.96940.99980.02490.90250.97320.97650.99980.0221
RT1 β = 0, kmin = 0.003n0.89970.96620.96870.99980.02410.90330.97410.97740.99980.0220
RT1 β = 0, kmin = 0.005n0.90170.96640.97020.99980.02300.90530.97100.97200.99980.0226
RT2 β = 0, kmin = 20.90640.98160.98840.99980.02190.93520.98390.99230.99980.0196
RT2 β = 0.1, kmin = 20.91180.98350.99170.99980.02060.93520.98480.99330.99980.0189
RT2 β = 0.2, kmin = 20.93990.98550.99450.99980.01780.94140.98860.99720.99980.0160
RT2 β = 0.3, kmin = 20.94410.98760.99670.99980.01690.94140.98950.99840.99980.0155
RT2 β = 0, kmin = 30.90660.98050.98530.99980.02140.93540.98070.98980.99980.0201
RT2 β = 0, kmin = 40.90680.98060.98550.99980.02140.90190.97610.98170.99980.0228
RT2 β = 0, kmin = 50.90700.97670.98420.99980.02310.90210.97480.97930.99980.0220
RT2 β = 0, kmin = 100.90800.97230.97590.99980.02190.90310.97450.97830.99940.0213
RT2 β = 0, kmin = 0.003n0.90900.97240.97590.99960.02120.90390.97530.97930.99980.0212
RT2 β = 0, kmin = 0.005n0.91100.97310.97650.99960.02060.90590.97330.97350.99860.0203
PS j = 10.76510.87910.88730.95110.04850.75130.88130.88690.95710.0472
PS j = 00.73240.82720.82620.90130.04200.64090.79940.80510.89610.0476
Eyeball w = 0.01, ε = 0.3, h = 0.90.99100.99390.99400.99560.00080.99120.99330.99330.99520.0008
Eyeball w = 0.02, ε = 0.3, h = 0.90.98600.98950.98960.99150.00090.98580.98860.98870.99090.0009
Eyeball w = 0.025, ε = 0.3, h = 0.90.98380.98740.98740.98990.00090.98360.98620.98620.98870.0009
GH0.94730.98710.99670.99940.01530.94880.98800.99320.99940.0143
dAMSE0.97210.97380.97380.97700.00070.97040.97230.97240.97470.0007
HW0.64820.89650.90860.94590.05000.71450.87880.89010.93270.0409
Hall B = 10,000, ε = 0.955, kaux = 2 n + 0.91870.97140.97510.98360.01290.92450.97310.97650.98480.0109
Hall B = 10,000, ε = 0.995, kaux = 2 n + 0.90850.96870.97230.98170.01430.91870.97110.97540.98280.0117
Hall B = 10,000, ε = 0.9, kaux = 2 n + 0.92890.97480.97820.98600.01120.93150.97570.97860.98630.0098
Hall B = 10,000, ε = 0.955 kaux = n + 0.93850.97730.98190.99300.01510.89470.97710.98070.99410.0168
Hall B = 10,000, ε = 0.955, kaux = 3 n + 0.94640.96810.97130.97630.00780.94360.96610.96770.97460.0076
Hall B = 1000, ε = 0.955, kaux = 2 n + 0.91830.97140.97510.98370.01280.92690.97330.97700.98380.0106
Hall(r) B = 10,000, ε = 0.955, kaux = 2 n + 0.91870.97140.97480.98360.01270.92550.97310.97620.98420.0108
Himp B = 10,000, ε = 0.9550.96400.98480.98630.99980.00720.96180.98260.98310.99880.0088
Himp B = 10,000, ε = 0.9950.96040.98390.98560.99940.00780.95640.98120.98310.99860.0099
Himp B = 10,000, ε = 0.90.96920.98620.98610.99980.00620.96620.98400.98500.99920.0077
Himp B = 1000, ε = 0.9550.96400.98520.98630.99980.00690.96200.98270.98410.99880.0088
Himp (r) B = 10,000, ε = 0.9550.96400.98450.98600.99980.00720.96200.98260.98330.99880.0088
Gomes B = 10,000, ε = 0.9950.95980.98400.98510.99800.00810.95500.98080.98260.99860.0102
Gomes B = 10,000, ε = 0.9550.95860.98270.98460.99980.00850.95480.98060.98180.99880.0105
Gomes B = 10,000, ε = 0.90.95960.98380.98190.99980.00980.95400.98100.98120.99980.0120
Gomes B = 1000, ε = 0.9950.95800.98380.98510.99750.00810.95540.98190.98330.99860.0098
Gomes (r) B = 10,000, ε = 0.9950.96000.98400.98530.99760.00800.95600.98090.98280.99860.0100
Danielsson B = 500, ε = 0.90.93900.98590.98990.99980.01470.93980.98860.99340.99980.0130
Note: (r) means repeating the calculation; Sd. Dev.—standard deviation.
Table 3. Estimates of the threshold under different methods in the first sub-period.
Table 3. Estimates of the threshold under different methods in the first sub-period.
MethodLower TailUpper Tail
MinMeanMedianMaxSd. Dev.MinMeanMedianMaxSd. Dev.
MAD Dis ts = 0.250.86110.90700.89050.99730.04140.84380.89460.88050.99730.0395
MAD Dis ts = 0.200.86920.91730.90390.99730.03740.86650.90560.89920.98930.0248
MAD Dis ts = 0.150.87320.93590.92660.99870.03020.87720.93050.92320.99730.0265
MAD Dis ts = 0.100.91050.95210.94860.99200.02270.90790.95300.94660.99870.0227
MAD Dis ts = 0.050.95730.97390.97130.99870.01000.95190.97100.96930.99200.0101
KS Dis ts = 0.250.93990.98300.98800.99870.01400.95190.97910.97930.99730.0118
KS Dis ts = 0.200.92790.98280.98400.99870.01400.94390.97900.98060.99730.0129
KS Dis ts = 0.150.94790.98410.98660.99870.01170.94660.98120.98130.99730.0108
KS Dis ts = 0.100.94790.98430.98800.99870.01180.93590.98010.98060.99730.0126
KS Dis ts = 0.050.96800.98770.98930.99870.00780.95190.98440.98600.99870.0091
RT1 β = 0, kmin = 20.80240.96610.99070.99870.04400.87450.98010.99330.99870.0279
RT1 β = 0.1, kmin = 20.87720.97280.99200.99870.03400.88520.98100.99470.99870.0267
RT1 β = 0.2, kmin = 20.90920.97690.99470.99870.03070.90650.98540.99600.99870.0217
RT1 β = 0.3, kmin = 20.90920.98240.99470.99870.02590.91590.98650.99600.99870.0203
RT1 β = 0, kmin = 30.80370.96130.97660.99870.04320.87580.97820.98870.99870.0273
RT1 β = 0, kmin = 40.80510.96200.97800.99870.04280.87720.97520.98660.99870.0287
RT1 β = 0, kmin = 50.80640.95810.97130.99870.04380.87850.97420.98730.99870.0277
RT1 β = 0, kmin = 100.80640.94910.96460.99870.04920.81170.97040.98130.99870.0365
RT1 β = 0, kmin = 0.003n0.80240.96610.99070.99870.04400.87450.98010.99330.99870.0279
RT1 β = 0, kmin = 0.005n0.80370.96130.97660.99870.04320.87580.97820.98870.99870.0273
RT2 β = 0, kmin = 20.88250.97010.99070.99870.03530.88920.98020.99200.99870.0281
RT2 β = 0.1, kmin = 20.88250.97210.99330.99870.03340.88920.98110.99400.99870.0270
RT2 β = 0.2, kmin = 20.88250.97310.99330.99870.03340.89990.98260.99600.99870.0251
RT2 β = 0.3, kmin = 20.91050.97770.99400.99870.02850.89990.98350.99600.99870.0244
RT2 β = 0, kmin = 30.88380.96300.97600.99870.03590.89050.97940.99200.99870.0273
RT2 β = 0, kmin = 40.88520.96410.97730.99870.03570.89190.97680.98930.99870.0275
RT2 β = 0, kmin = 50.88650.96200.97800.99870.03560.89320.97720.98930.99870.0268
RT2 β = 0, kmin = 100.81310.95540.96600.99870.04250.84250.97240.98660.99870.0326
RT2 β = 0, kmin = 0.003n0.88250.97010.99070.99870.03530.88920.98020.99200.99870.0281
RT2 β = 0, kmin = 0.005n0.88380.96300.97600.99870.03590.89050.97940.99200.99870.0273
PS j = 10.73700.84570.85310.95060.05420.71300.86420.86650.98400.0646
PS j = 00.60880.75100.75570.88520.06320.66490.76230.76440.87980.0477
Eyeball w = 0.01, ε = 0.3, h = 0.90.98130.98660.98660.99330.00330.97330.98460.98530.99330.0044
Eyeball w = 0.02, ε = 0.3, h = 0.90.96930.98010.98000.98800.00360.96800.97800.97800.98930.0043
GH0.87450.96380.97660.99600.03060.90790.97020.97800.99600.0238
dAMSE0.84510.94000.94130.94790.01420.81980.93850.94390.94790.0234
HWxxxxxxxxxx
Hall B = 10,000, ε = 0.955, kaux = 2 n + 0.83310.93720.94730.95860.02710.83310.94620.94990.96530.0199
Hall B = 10,000, ε = 0.995, kaux = 2 n + 0.81980.93240.94390.96530.02940.82110.94200.94660.95460.0216
Hall B = 10,000, ε = 0.9, kaux = 2 n + 0.85050.94350.95060.96260.02390.85050.95180.95460.96930.0175
Hall B = 10,000, ε = 0.955 kaux = n + 0.88920.95820.97130.98530.02330.75700.96310.97130.98130.0332
Hall B = 10,000, ε = 0.955, kaux = 3 n + 0.84380.91930.92460.94530.02190.87720.92520.92520.94130.0122
Hall B = 1000, ε = 0.955, kaux = 2 n + 0.82510.93770.94790.96130.02760.82780.94680.94930.96260.0200
Hall (r) B = 10,000, ε = 0.955, kaux = 2 n + 0.83440.93730.94590.95860.02690.83310.94610.95060.96530.0198
Himp B = 10,000, ε = 0.9550.92260.96950.97060.99730.01770.91990.97850.98060.99870.0165
Himp B = 10,000, ε = 0.9950.91460.96850.96930.99870.02050.91990.97740.98060.99870.0177
Himp B = 10,000, ε = 0.90.93320.96960.97060.99870.01560.92920.98010.98200.99870.0149
Himp B = 1000, ε = 0.9550.92260.96960.97130.99730.01800.92120.97890.98200.99870.0167
Himp (r) B = 10,000, ε = 0.9550.92390.96930.97000.99730.01780.92120.97850.98060.99870.0166
Gomes B = 10,000, ε = 0.9950.91460.96840.96860.99870.02050.92260.97780.98060.99870.0177
Gomes B = 10,000, ε = 0.9550.91320.96830.96930.99870.02090.92260.97690.97930.99870.0177
Gomes B = 10,000, ε = 0.90.90920.96550.96660.99870.02240.92120.97730.97860.99870.0183
Gomes B = 1000, ε = 0.9950.91460.96840.97000.99870.02020.90650.97740.98000.99870.0186
Gomes (r) B = 10,000, ε = 0.9950.91590.96820.96860.99870.02050.91050.97730.98060.99870.0186
Danielsson B = 500, ε = 0.90.91720.98420.99330.99870.02040.86650.98960.99800.99870.0218
Danielsson B = 500, ε = 0.9550.92390.98150.99330.99870.02260.93320.99310.99870.99870.0117
Note: (r) means repeating the calculation; Sd. Dev.—standard deviation; x—cannot calculate, the method is not appropriate for the data.
Table 4. Estimates of the threshold under different methods in the third sub-period.
Table 4. Estimates of the threshold under different methods in the third sub-period.
MethodLower TailUpper Tail
MinMeanMedianMaxSd. Dev.MinMeanMedianMaxSd. Dev.
MAD Dis ts = 0.250.86780.90950.89450.99200.03550.84110.92790.91520.99870.0480
MAD Dis ts = 0.200.87320.92490.91590.99330.03650.85050.93630.93990.99600.0412
MAD Dis ts = 0.150.89590.94580.94260.99330.02970.86650.94600.95330.99330.0336
MAD Dis ts = 0.100.92120.95790.95590.99330.01940.90520.95190.95190.99330.0278
MAD Dis ts = 0.050.95330.96900.96860.99600.01090.95190.97230.96930.99730.0136
KS Dis ts = 0.250.93320.98110.98260.99870.01350.89850.97530.98260.99730.0217
KS Dis ts = 0.200.93860.98200.98530.99870.01380.89850.97440.98330.99730.0226
KS Dis ts = 0.150.93860.98310.98730.99870.01380.89850.97330.97930.99730.0221
KS Dis ts = 0.100.93860.98260.98530.99870.01400.90790.97230.97860.99730.0213
KS Dis ts = 0.050.95190.98690.98930.99870.00920.95330.97870.98260.99730.0137
RT1 β = 0, kmin = 20.79040.97590.99730.99870.03950.82380.96030.98260.99870.0475
RT1 β = 0.1, kmin = 20.79970.98160.99730.99870.03620.82510.96850.99470.99870.0431
RT1 β = 0.2, kmin = 20.81310.98550.99730.99870.03180.88250.98370.99600.99870.0254
RT1 β = 0.3, kmin = 20.92120.99100.99730.99870.01700.88250.98430.99600.99870.0253
RT1 β = 0, kmin = 30.79170.97020.98800.99870.03900.82510.95470.97330.99870.0477
RT1 β = 0, kmin = 40.79310.96200.97660.99870.04280.82640.94980.95390.99870.0475
RT1 β = 0, kmin = 50.79440.96300.97800.99870.04250.82780.94380.94260.99870.0483
RT1 β = 0, kmin = 100.80110.96280.97130.99870.04040.83440.93050.93190.99870.0479
RT1 β = 0, kmin = 0.003n0.79040.97590.99730.99870.03950.82380.96030.98260.99870.0475
RT1 β = 0, kmin = 0.005n0.79170.97020.98800.99870.03900.82510.95470.97330.99870.0477
RT2 β = 0, kmin = 20.76640.98190.99730.99870.03940.84250.96960.99470.99870.0408
RT2 β = 0.1, kmin = 20.76900.98290.99730.99870.03810.84250.96960.99470.99870.0408
RT2 β = 0.2, kmin = 20.76900.98290.99730.99870.03810.88380.97990.99530.99870.0275
RT2 β = 0.3, kmin = 20.80240.98410.99730.99870.03320.88380.98180.99600.99870.0265
RT2 β = 0, kmin = 30.76770.97560.99330.99870.03920.84380.96400.98200.99870.0422
RT2 β = 0, kmin = 40.76900.96890.98800.99870.04240.84510.95950.98000.99870.0427
RT2 β = 0, kmin = 50.77040.96870.98800.99870.04200.84650.95160.96330.99870.0451
RT2 β = 0, kmin = 100.77700.96490.97000.99870.03900.85310.94280.94530.99730.0420
RT2 β = 0, kmin = 0.003n0.76640.98190.99730.99870.03940.84250.96960.99470.99870.0408
RT2 β = 0, kmin = 0.005n0.76770.97560.99330.99870.03920.84380.96400.98200.99870.0422
PS j = 10.73700.86000.86110.94790.05520.73430.86290.86450.96660.0537
PS j = 00.64350.79710.80710.89850.05650.61280.77280.76970.89590.0489
Eyeball w = 0.01, ε = 0.3, h = 0.90.97730.98680.98730.99470.00340.98260.98780.98800.99330.0027
Eyeball w = 0.02, ε = 0.3, h = 0.90.97060.98020.98000.98930.00400.97600.98130.98130.98660.0027
GH0.89320.96820.98200.99600.02700.87980.95270.97060.99730.0402
dAMSE0.82640.93860.94260.95730.02160.81580.93860.94130.95460.0184
HWxxxxxxxxxx
Hall B = 10,000, ε = 0.955, kaux = 2 n + 0.85050.94430.94930.96400.01890.86650.92890.93320.96260.0248
Hall B = 10,000, ε = 0.995, kaux = 2 n + 0.84250.94060.94660.96260.02020.85850.92400.93120.95860.0263
Hall B = 10,000, ε = 0.9, kaux = 2 n + 0.86780.94970.95390.96930.01700.88380.93500.93930.96530.0220
Hall B = 10,000, ε = 0.955 kaux = n + 0.78370.95560.96330.98260.03360.85580.93840.94530.98260.0362
Hall B = 10,000, ε = 0.955, kaux = 3 n + 0.86650.92270.92390.95590.01630.86650.91730.92120.95190.0204
Hall B = 1000, ε = 0.955, kaux = 2 n + 0.85050.94390.94790.96530.01900.86650.92830.93320.96260.0249
Hall (r) B = 10,000, ε = 0.955, kaux = 2 n + 0.84910.94360.94860.96530.01920.86780.92890.93320.96130.0248
Himp B = 10,000, ε = 0.9550.91860.96820.97200.99870.01870.92920.96400.96330.99870.0162
Himp B = 10,000, ε = 0.9950.93060.96910.97060.99870.01870.92260.96110.96130.99730.0178
Himp B = 10,000, ε = 0.90.92660.96970.97330.99870.01630.93720.96400.96400.99870.0131
Himp B = 1000, ε = 0.9550.91720.96940.97130.99870.01890.92660.96310.96330.99870.0172
Himp(r) B = 10,000, ε = 0.9550.91860.96780.97060.99870.01880.92920.96320.96330.99870.0158
Gomes B = 10,000, ε = 0.9950.92790.96870.97060.99870.01900.92120.96060.95990.99600.0178
Gomes B = 10,000, ε = 0.9550.91050.96510.96930.99870.02090.91990.96140.96190.99870.0192
Gomes B = 10,000, ε = 0.90.90520.96450.96800.99870.02190.90790.95850.95930.99870.0186
Gomes B = 1000, ε = 0.9950.92520.96900.97200.99870.01920.92260.96160.96130.99870.0183
Gomes (r) B = 10,000, ε = 0.9950.92790.96870.97130.99870.01910.92260.96070.96130.99870.0180
Danielsson B = 500, ε = 0.90.84650.98390.99530.99870.02610.88920.97200.97730.99870.0275
Danielsson B = 500, ε = 0.9550.83440.98490.99870.99870.02740.90120.97270.97800.99870.0266
Note: (r) means repeating the calculation; Sd. Dev.—standard deviation; x—cannot calculate, the method is not appropriate for the data.
Table 5. The mean absolute differences (MAE) of the threshold estimates in the left tail (below diagonal) and right tail (above diagonal) for five methods.
Table 5. The mean absolute differences (MAE) of the threshold estimates in the left tail (below diagonal) and right tail (above diagonal) for five methods.
MethodPSEyeballdAMSEHWHall 10,000Hall 1000
PS00.11480.10230.04780.10430.1044
Eyeball0.118400.01390.11470.01710.0168
dAMSE0.10600.013600.10200.01070.0105
HW0.07180.10320.091800.10440.1046
Hall 10,0000.10690.02060.01300.093000.0018
Hall 10000.10700.02060.01290.09300.00130
Note: PS— j = 1 ; Eyeball— w = 0.025 , h = 0.9 , ε = 0.3 ; Hall 1000— B = 1000 , ε = 0.955 , k a u x = 2 n + ; Hall 10,000— B = 10,000 , ε = 0.955 , k a u x = 2 n + .
Table 6. The root mean squared differences (RMSE) of the threshold estimates in the left tail (below diagonal) and right tail (above diagonal) for five methods.
Table 6. The root mean squared differences (RMSE) of the threshold estimates in the left tail (below diagonal) and right tail (above diagonal) for five methods.
MethodPSEyeballdAMSEHWHall 10,000Hall 1000
PS00.11480.10230.04780.10430.1044
Eyeball0.118400.01390.11470.01710.0168
dAMSE0.10600.013600.10200.01070.0105
HW0.07180.10320.091800.10440.1046
Hall 10,0000.10690.02060.01300.093000.0018
Hall 10000.10700.02060.01290.09300.00130
Note: PS— j = 1 ; Eyeball— w = 0.025 , h = 0.9 , ε = 0.3 ; Hall 1000— B = 1000 , ε = 0.955 , k a u x = 2 n + ; Hall 10,000— B = 10,000 , ε = 0.955 , k a u x = 2 n + .
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Just, M.; Echaust, K. An Optimal Tail Selection in Risk Measurement. Risks 2021, 9, 70. https://doi.org/10.3390/risks9040070

AMA Style

Just M, Echaust K. An Optimal Tail Selection in Risk Measurement. Risks. 2021; 9(4):70. https://doi.org/10.3390/risks9040070

Chicago/Turabian Style

Just, Małgorzata, and Krzysztof Echaust. 2021. "An Optimal Tail Selection in Risk Measurement" Risks 9, no. 4: 70. https://doi.org/10.3390/risks9040070

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop