We have estimated the 10-day VaR and the 10-day ES of the S&P 500 Index every 10 days from 21 December 1973 to 29 December 2023 by the proposed method, the crude Monte Carlo simulation with normal innovations, and the crude Monte Carlo simulation with standardized Student’s
t innovations. We call the Monte Carlo methods SIS, CMC-N, and CMC-t, respectively. We denote by
the set of every 10 days from 21 December 1973 to 29 December 2023. The size of set
is 1261. In order to statistically test whether the estimates are correct, we have performed some backtestings; the conditional coverage test (
Christoffersen 1998) for the 10-day VaR estimates and the
test (
Acerbi and Szekely 2014) for the 10-day ES estimates.
4.1. Estimation of Risk Measures
By applying CMC-N, CMC-t, and SIS, we obtained
, the 10-day VaR estimates at every 10 days from 21 December 1973 to 29 December 2023, and
, the 10-day ES estimates at every 10 days during the same period. The confidence levels of the estimates were set to be 0.95, 0.975, 0.99. In each method, the rolling window method with the window size of 750 was applied. At the beginning of each day
t,
, we fitted the GJR-GARCH(1,1) model to the daily log returns of previous 750 days, and obtained the estimated daily volatilities and the residuals during the previous 750 days. When we fitted the GJR-GARCH(1,1) model in CMC-N and SIS, we assumed that the innovations follow the standard normal distribution. In CMC-t, the innovations were assumed to follow the standardized Student’s
t-distribution. In CMC-N and CMC-t, we applied the crude Monte Carlo simulation described in
Section 2.4 to generate
sample processes of
, the daily log returns over next 10 days, where the approximate distribution of the innovations is the standard normal distribution in CMC-N, and the standardized Student’s
t-distribution in CMC-t. In SIS, we also have generated
importance sample processes of
as described by
Section 3. The value of
in Equation (
30) was set to be 0.25. We implemented CMC-N, CMC-t, and SIS in R (
R Core Team 2024). The estimation of the risk measures described above was performed in a desktop computer with six cores and 8 GB RAM.
For
t,
,
and
were estimated simultaneously from the simulated
sample processes of
. A total of 1261 number of 10-day VaR and 10-day ES estimations were performed for each method and confidence level.
Table 2 shows the elapsed time to obtain the estimates for each estimation method and confidence level.
For all confidence levels, CMC-t took more time to obtain the estimates than CMC-N and SIS. This is due to the fact that fitting the GJR-GARCH model with the standardized Student’s
t innovations to the observed log returns takes more time than fitting with the normal innovations. In SIS, we find the optimal value of the twisting parameter
and compute the unnormalized likelihood ratio for each of the importance samples of
,
, which made the time difference between CMC-N and SIS in
Table 2. However, the time difference was not so much. SIS took approximately 20% more time than CMC-N. We can see that the computational burden incurred by using SIS instead of CMC-N is not that large, and that SIS is more efficient than CMC-t in terms of the computational burden.
In estimating the
(10),
, we obtained the simulated values of the 10-day log return from the simulated
. We denote them by
, where
N is equal to
. We also computed the unnormalized likelihood ratio for each
in SIS. We divide the set
into 10 sets of equal numbers of
. For each of the sets, we computed the 10-day VaR estimate from Equation (
17) in CMC-N and CMC-t, and from Equation (
38) in SIS. Then, the sample mean of these 10-day VaR estimates becomes
in each method. The standard error of the sample mean of these 10-day VaR estimates becomes the standard error of
. We also computed the relative error of
, which is the S.E. of
divided by
.
Figure 2 shows the negated 10-day VaR estimates obtained by SIS. In the figure, the confidence level of the VaR estimates is the
. The figure also shows 10-day log returns computed at day
t,
. The behavior of VaR estimates by CMC-N (and CMC-t) looks similar to the figure.
If
at day
t, then we call that a violation occurs at day
t, and call such a loss the excessive loss, i.e., a 10-day excessive loss means a negated 10-day log return larger than the corresponding 10-day VaR estimate. We computed
the violation process defined in Equation (
19) with
k being equal to 10, and counted the number of violations. If a method estimates
correctly for
, then a violation occurs at day
t with probability
p. Since a total of 1261 number of 10-day VaR estimations were performed, the expected number of violationsis 1261 ×
p.
Table 3 shows the expected number of violation, and the number of observed violation for each method and confidence level. The table also shows the 95% confidence interval of the number of violations under the null hypothesis (
20). In the table, the numbers of observed violations are within their confidence intervals for all confidence levels and methods. It seems that all methods estimated the 10-day VaR appropriately. We will discuss this point in more detail in the next subsection.
Table 4 shows the average S.E. and R.E. of
for each method when the confidence levels are 0.95, 0.975, and 0.99. We can see that CMC-N gives the lowest average S.E and average R.E. for all confidence levels. However, when the confidence level is 0.99, the average S.E. (and also the average R.E.) of CMC-N and SIS are similar. Thus, SIS was not effective in reducing the average S.E. of the 10-day VaR estimates. This is due to the fact that the twisting parameter of SIS is found to minimize the estimation error of the 10-day ES, not the 10-day VaR.
When we estimate a value by a Monte Carlo simulation, the efficiency of the Monte Carlo simulation is inversely proportional to the product of the variance of the estimate and the simulation time to obtain the estimate (
Glynn and Whitt 1992;
Sak and Hörmann 2012). If the estimate obtained by a Monte Carlo simulation has the same variance as the estimate obtained by another Monte Carlo simulation, then the ratio of the products represents the ratio of the simulation times taken for the two Monte Carlo simulations to obtain the estimates with the same accuracy. Suppose that it took the same simulation time for both the Monte Carlo simulations to obtain the estimate. If we recall that the variance of the estimate is inversely proportional to the sample size, and that the simulation time increases almost linearly with the sample size, then we can see that the ratio represents how much simulation time will be taken for a Monte Carlo simulation to obtain the estimate with the same accuracy as that by the other Monte Carlo simulation.
We have obtained a series of estimates to the 10-day VaRs, not a single estimate. Thus, the efficiency of a method is inversely proportional to the product of the square of the average S.E., and the simulation time to obtain the estimates. We call the product the time-variance. The lower the time-variance of a method is, the more efficient the method is.
Table 4 shows the time-variance of the 10-day VaR estimation in each method. It can be seen from the table that CMC-N is the most efficient, regardless of the confidence level. Both CMC-T and SIS took much time to obtain the estimates, and have the larger S.E. than CMC-N. These result in the higher time-variances of CMC-T and SIS compared to CMC-N. When we computed the time-variance for each method and confidence level in
Table 4, we used the elapsed time in
Table 2 as the simulation time taken to obtain
. Computing
requires only one more step than computing
in all methods. This additional step takes very little time compared to the overall process to obtain
. Thus, the elapsed time shown in
Table 2 can be used as the approximate simulation time to obtain
.
After we obtained
,
, we also computed the 10-day ES estimate for each of the 10 sets of
obtained from
. In the computation, we applied Equation (
18) in both CMC-N and CMC-t, and applied Equation (
39) in SIS. Then, the sample mean of these 10-day ES estimates becomes
. The standard error of the sample mean of these 10-day ES estimates becomes the standard error of
. The relative error of
is computed to be the S.E. of
divided by
.
Table 5 shows the average S.E. and the average R.E. of
for each method and confidence level. We can see that SIS gives the lowest average S.E and R.E. for all confidence levels. Using SIS, the average S.E. (and also the average R.E.) of the 10-day ES estimates is reduced by about 3 to 5 times compared to CMC-N, and by about 3 to 7 times compared to CMC-t. We can see that our proposed scheme to find the optimal twisting parameter works well in reducing the estimation error of the 10-day ES.
Table 5 also shows the time-variance of the 10-day ES estimation in each method. We can see from the table that SIS is the most efficient for all confidence levels. Using SIS, the time-variance of the estimates is reduced by about 5 to 20 times compared to CMC-N, which means that SIS will take about 5 to 20 times less time than CMC-N to obtain a 10-day ES estimate with the same accuracy. Using SIS, the time-variance of the estimates is reduced by about 12 to 60 times compared to CMC-t. Thus, SIS will take significantly less time than CMC-t to obtain a 10-day ES estimate with the same accuracy.
4.2. Backtesting on 10-Day VaR Estimates
In this subsection, we apply the backtestings described in
Section 2.5 to statistically test whether our proposed method, as well as CMC-N and CMC-t, estimate the 10-day VaRs appropriately. For each method and confidence level, we obtained the violation process
from
and the observed 10-day log returns
. In order to test whether
,
, i.e., the violation rate is
p, we computed
in Equation (
22).
Table 6 shows the value of
and the significant probability of it. We can see that for all methods and confidence levels the estimates to the 10-day VaRs are appropriate in the sense that we can not reject the null hypothesis (
20) at
significant level. In other words, if the violations are independent, then there is no reason to deny that the violation rate is
p.
We have tested the temporal independence of the violation process
.
Table 6 shows the value of
in Equation (
24) and the significant probability of it for each method and confidence level. The table says that the null hypothesis
in (
23) can not be rejected at
significant level for all methods and confidence levels, and that the violation process follows a Bernoulli process rather than a Markov chain, i.e., there is no temporal dependence in the violation process.
In order to test whether
is temporally independent with desired violation rate
p, i.e., the violation process follows a Bernoulli process with success process
p, we have performed the conditional coverage test on
.
Table 6 shows the value of
and the significant probability of it for each method and confidence level. We can see from the table that for all methods and confidence levels, the null hypothesis
in (
26) can not be rejected at at
significant level. Thus, we accept that, for all estimation methods and confidence levels, the violation process follows a Bernoulli process with success process
p, equivalently,
,
. The past information of violations is not helpful in the current estimation of the 10-day VaR. We conclude that SIS as well as CMC-N and CMC-t estimated the 10-day VaR accurately in this sense.
4.3. Backtesting on 10-Day ES Estimates
From the conclusion on the 10-day VaR estimates, we can assume that
for all methods and confidence levels. In order to test statistically whether CMC-N, CMC-t, and SIS estimate the 10-day ESs accurately, we applied the
test described in
Section 2.5. When
and
were obtained by CMC-N, we first computed the
statistic of the observed 10-day log returns
from Equation (
28). To obtain the
p-value of
under
, we have generated
sample processes of
by the same manner as in CMC-N. In order to obtain a sample process of
, we applied the rolling window method with window size 750 to estimate the parameters of GJR-GARCH(1,1) with normal innovations every
, and generated the daily log returns
by applying the crude Monte Carlo method described in
Section 2.4. In the generation of daily log returns, the innovations were assumed to follow the standard normal distribution. We repeated the procedure
times, and denote by
the
i-th sample process of the 10-day log returns. By substituting
for
in Equation (
28), we obtained the simulated
statistic of the sample process
. We call it
,
. Then, the
p-value of
under
is obtained from Equation (
29).
When
and
were obtained by CMC-t, we have performed the hypothesis test (
27) in the same manner as described in the previous paragraph, except that in applying the rolling window method, we fitted the GJR-GARCH(1,1) with standardized Student’s
t innovations to the observed log return process, and that the innovations were assumed to follow the standardized Student’s
t-distribution in applying the crude Monte Carlo method to generating the daily log returns
.
In the case that
and
were obtained by SIS, they are estimated risk measures on the 10-day log returns under the assumption that the log return process follows the GJR-GARCH(1,1) with innovations having time-varying distributions, and that the distribution at a time is approximated well by the Gaussian kernel smoothing (
6) of previous innovations. Thus, for computing the
p-value of
in this case, we need the sample processes of
following the assumption. In the generation of a sample process, we fitted the GJR-GARCH(1,1) with standard normal innovations to the observed log return process in applying the rolling window method, and applied the crude Monte Carlo method with innovations of pdf (
6) as described in
Section 2.4.
Table 7 shows the value of
and its
p-value for each method and confidence level of 0.95, 0.975, and 0.99. If we set the significance level at
, then
by CMC-N appears to be underestimated. In other cases,
could not be rejected, and we conclude that all methods worked well in the estimation of the 10-day ES.