A Simple Traffic Light Approach to Backtesting Expected Shortfall

Nick Costanzino; Michael Curran

doi:10.3390/risks6010002

and

¹

Barclays Capital, 745 7th Ave, New York, NY 10019, USA

²

Department of Finance and Risk Engineering, NYU Tandon School of Engineering, New York, NY 11201, USA

³

Riskcare, New York, NY 10018, USA

^*

Authors to whom correspondence should be addressed.

Risks2018, 6(1), 2;https://doi.org/10.3390/risks6010002

Version Notes

Order Reprints

Abstract

We propose a Traffic Light approach to backtesting Expected Shortfall which is completely consistent with, and analogous to, the Traffic Light approach to backtesting VaR (Value at Risk) initially proposed by the Basel Committee on Banking Supervision in their 1996 consultative document Basle Committee on Banking Supervision (1996). The approach relies on the generalized coverage test for Expected Shortfall developed in Costanzino and Curran (2015).

Keywords:

Expected Shortfall; backtesting; Fundamental Review of the Trading Book

1. Introduction

Even before the initial Basel Committee consultative document (Bank for International Settlements 2012) there had been a push by both risk managers and academics to replace VaR (Value at Risk) with another risk measure that addresses VaR’s deficiencies. In particular, coherent risk measures (Acerbi and Taschea 2002a, 2002b; Hall 2007) satisfy the basic desired properties required by a risk measure as outlined in Artzner et al. (1999). Expected Shortfall (ES) is the natural choice among all coherent risk measures, and therefore there is no surprise that it has been chosen by the Basel Committee as the risk measure to replace VaR. However, unlike the case of VaR, there is no well-established backtesting framework for Expected Shortfall. Indeed, the current Basel proposal to backtest ES at the 97.5 quantile is to backtest the related VaR estimate at the 97.5 and 99 quantiles, which is a grossly insufficient test. Nevertheless, some recent backtesting methods have been proposed including, but not limited to, Acerbi and Szekeley (2014); Costanzino and Curran (2015); Du and Escanciano (2017); Fissler et al. (2015); Gordy et al. (2017); Kratz et al. (2016).

The main result of this note is the development of a Traffic Light backtest for Expected Shortfall which extends the Traffic Light backtest for VaR. The test relies on the computation of critical values derived from the finite-sample distribution of the ES test statistic (9) first introduced in Costanzino and Curran (2015).

The note is organized as follows. In Section 2 we briefly review the VaR Traffic Light to provide context for our corresponding test for ES. In Section 3 we define the Traffic Light test for ES and compute the distribution of the finite-sample statistic from which we calculate the critical values using a numerical root-finding algorithm. Finally, in Section 4 we discuss the test and some implications.

2. Review of the VaR Traffic Light Test

Let

{t_{i}}_{i = 0}^{N}

be a sequence of historical trading days and

{L_{i}}_{i = 1}^{N}

the corresponding realized trading losses. The most basic approach to assessing the accuracy of a VaR forecast calculation for those trading days is to backtest using the VaR Coverage Test which essentially counts the number of VaR breaches. This leads to the Traffic Light approach to backtesting VaR originally proposed by the Basel Committee for Banking Supervision in Basle Committee on Banking Supervision (1996), which we describe below.

For each

i = 1, \dots, N

, let

{VaR}_{i} (α)

denote the forecast VaR at level

α

defined by

VaR (α) : = inf {z \in R : F_{L} (z) \geq α}

(1)

where

F_{L}

is the cumulative distribution of the random loss variable L. For each trading day i we define the VaR breach indicator

X_{VaR}^{(i)} : [0, 1] \to {0, 1}

as

X_{VaR}^{(i)} (α) : = 1_{{L_{i} \leq {V a R}_{i} (α)}} = \{\begin{matrix} 0 & if L_{i} > V a R_{i} (α) \\ 1 & if L_{i} \leq V a R_{i} (α) . \end{matrix}

(2)

That is,

X_{VaR}^{(i)}

keeps track of whether a breach occurred for trading day i. Then, the total number of breaches over all N trading days, denoted by

X_{VaR}^{N} : [0, 1] \to {0, 1, 2, \dots, N}

, is

\begin{matrix} X_{VaR}^{N} (α) : = \sum_{i = 1}^{N} 1_{{L_{i} \leq {VaR}_{i} (α)}} \end{matrix}

(3)

Under the null hypothesis that the VaR model is correct,

E [X_{VaR}^{N} (α)] = N α

. Thus, for the Basel parameters

α = 1 %

and

N = 250

, we expect

2.5

breaches. Of course, in any backtest it is very rare that one observes exactly

2.5

breaches (in fact impossible since

X_{VaR}^{N}

must be an integer), and thus we appeal to statistical analysis to understand the probability of obtaining significantly fewer or more breaches than would be expected if we had a correct model. For fixed N and level

α

we define the cumulative probability

Ψ_{VaR}^{α, N}

of obtaining x or fewer breaches as

\begin{matrix} Ψ_{VaR}^{α, N} (x) : = P [X_{VaR}^{N} (α) \leq x] \end{matrix}

(4)

The Basel Committee on Banking Supervision proposed a Traffic Light approach to statistical significance of VaR breaches in their 1996 document Basle Committee on Banking Supervision (1996). Therein the Basel Committee defines three color zones through cumulative probabilities of the number of realized VaR breaches. The Green Zone is defined as the number of breaches under the null hypothesis whereby the cumulative probability of obtaining that many breaches or fewer is less than

95 %

. The Yellow Zone is defined as the number of breaches whereby the cumulative probability of obtaining that many breaches or fewer is greater than

95 %

but less than

99 %

. Finally, the Red Zone is defined by a cumulative probability of

99.99 %

or more. Thus, the boundary between the Green and Yellow zones is defined as the largest integer x such that

Ψ_{VaR}^{α, N} (x) < 0.95

and the boundary between the Yellow and Red zones is similarly defined as the largest integer x such that

Ψ_{VaR}^{α, N} (x) < 0.9999

.

Table 1 gives the resulting color zones for different breaches values under the VaR Basel parameters

α = 1 %

and

N = 250

observations. The true Binomial Null Distribution is used to compute the Cumulative Probabilities rather than the asymptotic Normal distribution.

Table 1. Traffic Light zone boundaries are computed assuming

α = 1 %

and

N = 250

observations.

3. Derivation of the Expected Shortfall Traffic Light Test

We now define a Traffic Light approach to backtesting Expected Shortfall based on the Coverage Test in Costanzino and Curran (2015). The test relies on an appropriate extension of the VaR breach indicator (2) to the case of ES. The resulting new breach indicator (6) takes into account the severity of the breach (i.e., losses beyond the VaR level) and is a continuous variable rather than discrete.

We begin the derivation by defining Expected Shortfall as

\begin{matrix} ES (α) : = \frac{1}{α} \int_{0}^{α} VaR (p) d p \end{matrix}

(5)

In analogy to

X_{VaR}^{(i)}

(2), we define the ES generalized breach indicator

X_{ES}^{(i)} : [0, 1] \to [0, 1]

by

\begin{matrix} X_{ES}^{(i)} (α) & : = \frac{1}{α} \int_{0}^{α} 1_{{L_{i} \leq {VaR}_{i} (p}} d p \end{matrix}

(6)

\begin{matrix} = \underset{severity of the breach}{\underset{︸}{(1 - \frac{F_{L} (L_{i})}{α})}} 1_{{L_{i} \leq {VaR}_{i} (α)}} \end{matrix}

(7)

\begin{matrix} = θ^{(i)} (α) \cdot X_{VaR}^{(i)} \end{matrix}

(8)

where have used (2) and have set

θ^{(i)} (α) = 1 - F_{L} (L_{i}) / α

, where

F_{L}

is the cumulative distribution implicitly defined in (1). We note that compared to

X_{VaR}^{(i)}

(2),

X_{ES}^{(i)}

(6) has an extra term

F (L_{i}) / α

which determines the severity of the breach. That is, suppose

L_{i} = {VaR}_{i}

. Then

F (L_{i}) / α = 1

so

X_{ES}^{(i)} = 0

whereas

X_{VaR}^{(i)} = 1

. On the other hand suppose

L_{i}

is very negative. Then,

F (L_{i}) = 0

so that

X_{ES}^{(i)} = 1

and similarly

X_{VaR}^{(i)} = 1

. Thus,

X_{ES}^{(i)}

keeps track of whether a breach happened on trading day i as well as the severity. Then, the total severity of breaches over all N trading days, denoted by

X_{ES}^{N} : [0, 1] \to [0, N]

, is

\begin{matrix} \begin{matrix} X_{ES}^{N} (α) & : = \sum_{i = 1}^{N} \frac{1}{α} \int_{0}^{α} 1_{{L_{i} \leq {VaR}_{i} (p)}} d p \\ = \sum_{i = 1}^{N} (1 - \frac{F (L_{i})}{α}) 1_{{L_{i} \leq {VaR}_{i} (α)}} \\ = \sum_{i = 1}^{N} θ^{(i)} (α) X_{VaR}^{(i)} (α) \end{matrix} \end{matrix}

(9)

For fixed N and level

α

we define the cumulative probability

Ψ_{ES}^{α, N}

of obtaining x or fewer breaches as

\begin{matrix} Ψ_{ES}^{α, N} (x) : = P [X_{ES}^{N} (α) \leq x] \end{matrix}

(10)

Therefore, for any quantile q, we can compute the corresponding Generalized Breach Value x by inverting the equation

\begin{matrix} sup_{x \in [0, \infty)} Ψ_{ES}^{α, N} (x) < q \end{matrix}

(11)

Note that in the case of the VaR Traffic Light Test (see Table 1), it makes sense to compute the quantiles for different breach values (i.e.,

1, 2, \dots, 10

). For Expected Shortfall, the breach indicator is a continuous variable and it no longer makes sense to choose the breach value and compute an associated quantile. Rather, we choose the quantile and then invert to obtain the corresponding breach value. In particular, we borrow the color zone boundaries from the VaR Traffic Light Test, which yield a Green Zone if

q < 0.95

, Yellow Zone if

0.95 \leq q < 0.9999

, and Red Zone if

0.9999 \leq q

; i.e.,

\begin{matrix} {Boundary}_{G Y} : = sup_{x \in [0, \infty)} Ψ_{ES}^{α, N} (x) < 0.95 \end{matrix}

(12)

and the boundary between the Yellow and Red zones is given by

\begin{matrix} {Boundary}_{Y R} : = sup_{x \in [0, \infty)} Ψ_{ES}^{α, N} (x) < 0.9999 \end{matrix}

(13)

To compute these boundaries, and other values of x one needs to compute the distribution of the test statistic

X_{ES}^{N} (α)

under the null-hypothesis

H_{0}

given by

H_{0} : {X_{ES}^{(i)}}_{i = 1}^{N} i . i . d . \forall i \neq j, and P [L_{i} \leq {VaR}_{i} (p)] = p \forall p \in [0, α] .

(14)

A similar argument as in Costanzino and Curran (2015) shows that for any

α \in (0, 1)

,

\begin{matrix} \sqrt{N} (X_{ES}^{N} (α) - μ_{ES}) \overset{[}{N} \to \infty] D \overset{}{\to} N (0, σ_{ES}^{2}) \end{matrix}

(15)

where

\begin{matrix} μ_{ES} = \frac{1}{2} α N \end{matrix}

(16)

\begin{matrix} σ_{ES}^{2} = α (\frac{4 - 3 α}{12}) \end{matrix}

(17)

and hence

\begin{matrix} lim_{N \to \infty} X_{ES}^{N} (α) \sim N (μ_{ES}, σ_{ES}^{2}) \end{matrix}

(18)

Hence, as a crude approximation, we can compute (12) and (13) using the asymptotic test distribution (18) to obtain

{Boundary}_{G Y}^{approx} = 5.4768

and

{Boundary}_{Y R}^{approx} = 9.2229

. These values are approximate since they use the asymptotic distribution

Ψ_{ES}^{α, \infty} (x)

of the test statistic rather than the finite-sample one

Ψ_{ES}^{α, N} (x)

. We now derive the finite-sample distribution

Ψ_{ES}^{α, N} (x)

and use a numerical root-finding procedure to accurately estimate the critical values.

The derivation of the ES Traffic Light test relies on the computation of the finite-sample cumulative distribution of the test statistic

X_{E S}^{N} (α)

(9). A key observation in the derivation is that under the null-hypothesis, the distribution of

X_{ES}^{(i)} (α)

conditional on a breach having occurred is uniform in the

α

-tail, and thus using the law of total probability we have

\begin{matrix} \begin{matrix} P [X_{E S}^{N} (α) \leq x] & = \sum_{n = 1}^{\infty} P [X_{ES}^{N} (α) \leq x ∣ X_{VaR}^{N} (α) = n] \cdot P [X_{VaR}^{N} (α) = n] \\ = \sum_{n = 1}^{\infty} I_{n} (x) \cdot B (n, N, α) \end{matrix} \end{matrix}

(19)

where

I_{n} (x)

is the Irwin–Hall distribution (c.f. Hall 1927; Irwin 1927; Marengo et al. 2017) defined by

\begin{matrix} I_{n} (x) = \frac{1}{2 (n - 1)!} \sum_{k = 0}^{n} {(- 1)}^{k} (\binom{n}{k}) {(x - k)}^{n - 1} sgn (x - k) \end{matrix}

(20)

and

B (n, N, α)

binomial probability mass function

\begin{matrix} B (n, N, α) = (\binom{n}{N}) α^{n} {(1 - α)}^{N - n} \end{matrix}

(21)

We then use this probability calculation (19) and a root-finding algorithm to solve the equation

\begin{matrix} P [X_{ES}^{N} (α) \leq Boundary] = q \end{matrix}

(22)

for

Boundary

where q is the appropriate quantile level. In particular assuming the Basel parameters

α = 2.5 %

and

N = 250

, then for

q = q_{GY} = 0.05

and

q = q_{YR} = 0.0001

we obtain

\begin{matrix} {Boundary}_{G Y} = 5.7049 \end{matrix}

(23)

\begin{matrix} {Boundary}_{Y R} = 9.8833 \end{matrix}

(24)

Table 2 gives the resulting quantiles and color zones for different breach values under the ES Basel parameters

α = 2.5 %

and

N = 250

observations where the cumulative probabilities were computed using (19). Of particular note is the breach values and cumulative probabilities for Expected Shortfall at the 97.5 quantile (i.e.,

α = 2.5 %

) are very similar to the VaR values at the 99 quantile (i.e.,

α = 1 %

). In addition, the finite-sample Breach Value at the 50th quantile (3.0276) is very similar to the asymptotic Breach Value at the 50th quantile (

\frac{α}{2} N = 250 \times 0.025 / 2 = 3.125)

. Furthermore, note that in the case of Expected Shortfall, the breach values are continuous, and therefore infinitesimally small changes in breach value may result in a change in the color zone.

Table 2. Expected Shortfall Traffic Light zone boundaries are computed assuming

α = 2.5 %

and

N = 250

observations.

4. Discussion

First, the values and quantiles for VaR at

α = 1 %

are similar to the values and quantiles for ES at

α = 2.5 %

. This happens because there are more VaR breaches at

α = 2.5 %

than at

α = 1 %

, but the severity of the breach in ES is smaller than unity so these two mechanisms average each other out.

We also note that along with color zones, the Basel document (Basle Committee on Banking Supervision 1996) defines market risk capital multipliers based on the cumulative probability

C_{VaR}

of the number of realized exceptions,

X_{VaR}

. In particular, a multiplier

k_{VaR}

ranging from 0 to 1 is given depending on the number of breaches; i.e.,

k_{VaR} = f_{VaR} (C_{VaR})

for some function

f_{VaR}

. The same can obviously be done for Expected Shortfall; i.e.,

k_{ES} = f_{ES} (C_{ES})

for some function

f_{ES}

. However, the continuous nature of the breach values from (9) leads to the need for

k_{ES}

to be a continuous function so as to avoid the case where small changes in breach value give rise to large changes in multiplier.

5. Conclusions

By defining an appropriate breach statistic (6) that measures the severity of each breach and using the results in (Costanzino and Curran 2015; Clift et al. 2015), we propose a Traffic Light test for Expected Shortfall using the finite-sample distribution of the test statistic under the null hypothesis. The test itself as well as the actual values of the zone boundaries are analogous to the Basel Traffic Light test for VaR.

Acknowledgments

The authors would like to thank the three anonymous referees for helpful suggestions, as well as Marco Naldi (Barclays) for a careful reading of the manuscript.

Author Contributions

The two authors contribute equally to this article.

Conflicts of Interest

The authors declare no conflict of interest.

References

Acerbi, Carlo, and Balazs Szekely. 2014. Back-Testing Expected Shortfall. Risk Magazine, November. [Google Scholar]
Acerbi, Carlo, and Dirk Tasche. 2002a. Expected Shortfall: A natural coherent alternative to Value at Risk. Economic Notes 31: 379–88. [Google Scholar] [CrossRef]
Acerbi, Carlo, and Dirk Tasche. 2002b. On the Coherence of Expected Shortfall. Journal of Banking & Finance 26: 1487–503. [Google Scholar]
Artzner, Philippe, Freddy Delbaen, Jean-Marc Eber, and David Heath. 1999. Coherent Measures of Risk. Mathematical Finance 9: 203–28. [Google Scholar] [CrossRef]
Basle Committee on Banking Supervision. 1996. Supervisory Framework For The Use of Back-Testing in Conjunction With The Internal Models Approach to Market Risk Capital Requirements. Available online: www.bis.org/publ/bcbs22.htm (accessed on 7 January 2018).
Costanzino, Nick, and Mike Curran. 2015. Backtesting General Spectral Risk Measures with Application to Expected Shortfall. Available online: https://ssrn.com/abstract=2514403 (accessed on 7 January 2018).
Clift, Simon, Nick Costanzino, and Mike Curran. 2015. Comparison of Backtesting Methods for Expected Shortfall. Preprint. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2618345 (accessed on 7 January 2018).
Du, Zaichao, and Juan Carlos Escanciano. 2017. Backtesting Expected Shortfall: Accounting for Tail Risk. Management Science 63: 940–58. [Google Scholar] [CrossRef]
Fissler, Tobias, Johanna F. Ziegel, and Tilmann Gneiting. 2015. Expected Shortfall is Jointly Elicitable with Value at Risk—Implications for Backtesting. Risk Magazine, December 21. [Google Scholar]
Bank for International Settlements. 2012. Fundamental Review of the Trading Book: A Revised Market Risk Framework. Consultative Document. Basel: Basel Committee on Banking Supervision. [Google Scholar]
Gordy, Michael B., Hsiao Yen Lok, and Alexander J. McNeil. 2017. Spectral backtests of forecast distributions with application to risk management. arXiv Preprint, arXiv:1708.01489. [Google Scholar]
Hall, Philip. 1927. The distribution of means for samples of size N drawn from a population in which the variate takes values between 0 and 1, all such values being equally probable. Biometrika 19: 240–45. [Google Scholar] [CrossRef]
Hull, John. 2007. VaR versus Expected Shortfall. Risk Magazine, March 1. [Google Scholar]
Irwin, Joseph Oscar. 1927. On the frequency distribution of the means of samples from a population having any law of frequency with finite moments, with special reference to Pearson’s Type II. Biometrika 19: 225–39. [Google Scholar] [CrossRef]
Kratz, Marie, Yen H. Lok, and Alexander J. McNeil. 2016. Multinomial VaR Backtests: A simple implicit approach to backtesting expected shortfall. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2898688 (accessed on 7 January 2018).
Marengo, James E., David L. Farnsworth, and Lucas Stefanic. 2017. A Geometric Derivation of the Irwin-Hall Distribution. International Journal of Mathematics and Mathematical Sciences 2017: 3571419. [Google Scholar] [CrossRef]

Table 1. Traffic Light zone boundaries are computed assuming

α = 1 %

and

N = 250

observations.

Table 1. Traffic Light zone boundaries are computed assuming

α = 1 %

and

N = 250

observations.

Basel Traffic Light Approach to VaR
Zone	Breach Value	Cumulative Probability
Green	0	8.11%
	1	28.58%
	2	54.32%
	3	75.81%
	4	89.22%
Yellow	5	95.88%
	6	98.63%
	7	99.60%
	8	99.89%
	9	99.97%
Red	more than 10	99.99%

Table 2. Expected Shortfall Traffic Light zone boundaries are computed assuming

α = 2.5 %

and

N = 250

observations.

Table 2. Expected Shortfall Traffic Light zone boundaries are computed assuming

α = 2.5 %

and

N = 250

observations.

Traffic Light Approach to Expected Shortfall
Zone	Generalized Breach Value	Cumulative Probability
Green	0	0.18%
	1.3929	10%
	2.1131	25%
	3.0276	50%
	4.0520	75%
	5.0622	90%
	5.7049	95%
Yellow	5.7049	95%
	6.9844	99%
	8.5285	99.9%
	9.8833	99.99%
Red	more than 9.8833	99.99%

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

A Simple Traffic Light Approach to Backtesting Expected Shortfall

Abstract

1. Introduction

2. Review of the VaR Traffic Light Test

3. Derivation of the Expected Shortfall Traffic Light Test

4. Discussion

5. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics