The Weak Convergence Rate of Two Semi-Exact Discretization Schemes for the Heston Model

Annalena Mickel; Andreas Neuenkirch

doi:10.3390/risks9010023

¹ DFG Research Training Group 1953, University of Mannheim, B6, 26, D-68131 Mannheim, Germany

² Mathematical Institute, University of Mannheim, B6, 26, D-68131 Mannheim, Germany

* Author to whom correspondence should be addressed.

Risks 2021, 9(1), 23; https://doi.org/10.3390/risks9010023

This article belongs to the Special Issue Computational Finance and Risk Analysis in Insurance

Abstract

Inspired by the article Weak Convergence Rate of a Time-Discrete Scheme for the Heston Stochastic Volatility Model, Chao Zheng, SIAM Journal on Numerical Analysis 2017, 55:3, 1243–1263, we studied the weak error of discretization schemes for the Heston model, which are based on exact simulation of the underlying volatility process. Both for an Euler- and a trapezoidal-type scheme for the log-asset price, we established weak order one for smooth payoffs without any assumptions on the Feller index of the volatility process. In our analysis, we also observed the usual trade-off between the smoothness assumption on the payoff and the restriction on the Feller index. Moreover, we provided error expansions, which could be used to construct second order schemes via extrapolation. In this paper, we illustrate our theoretical findings by several numerical examples.

Keywords:

Heston model; discretization schemes for SDEs; exact simulation of the CIR process; Kolmogorov PDE; Malliavin calculus

MSC:

60H07; 60H35; 65C05; 91G60

1. Introduction and Main Results

The Heston Model Heston (1993) is a widely used stochastic volatility model to price financial options. It consists of two stochastic differential equations (SDEs) for an asset price process S and its volatility V:

\begin{matrix} \begin{matrix} d S_{t} & = μ S_{t} d t + \sqrt{V_{t}} S_{t} (ρ d W_{t} + \sqrt{1 - ρ^{2}} d B_{t}), \\ d V_{t} & = κ (θ - V_{t}) d t + σ \sqrt{V_{t}} d W_{t}, \end{matrix} \end{matrix}

(1)

with

S_{0}, V_{0}, κ, θ, σ > 0

,

μ \in R

,

ρ \in [- 1, 1]

,

T > 0

and independent Brownian motions

W = {(W_{t})}_{t \in [0, T]}

,

B = {(B_{t})}_{t \in [0, T]}

, which are defined on a filtered probability space

(Ω, F, {(F_{t})}_{t \in [0, T]}, P)

, where the filtration satisfies the usual conditions. It is a simple and popular extension of the Black–Scholes model, in which the volatility of the asset is assumed to be constant. As a consequence, the Heston model accounts for the asymmetry and excess kurtosis of financial asset returns that are typically observed in real market data. The volatility is given by the so-called Cox–Ingersoll–Ross (CIR) process. Its Feller index

ν = \frac{2 κ θ}{σ^{2}}

will be an important parameter for our results. Throughout this article, the initial values

S_{0}

,

V_{0}

are assumed to be deterministic.

To price options with maturity at time T, one is interested in the value of

\begin{matrix} E [g (S_{T})], \end{matrix}

where

g : [0, \infty) \to R

is the payoff function. Closed formulae for

E [g (S_{T})]

are rarely known and often Monte Carlo methods are applied, for which in turn the simulation of

S_{T}

is required. Usually, the log-Heston model instead of the Heston model is considered in numerical practice. This yields the SDE

\begin{matrix} \begin{matrix} d (log (S_{t})) & = (μ - \frac{1}{2} V_{t}) d t + \sqrt{V_{t}} d (ρ W_{t} + \sqrt{1 - ρ^{2}} B_{t}), \\ d V_{t} & = κ (θ - V_{t}) d t + σ \sqrt{V_{t}} d W_{t}, \end{matrix} \end{matrix}

(2)

and the exponential is then incorporated in the payoff, i.e., g is replaced by

f : R \to R

with

f (x) = g (exp (x))

.

While exact simulation schemes and their refinements are known (see, e.g., Broadie and Kaya (2006); Glasserman and Kim (2011); Malham and Wiese (2013); Smith (2007)), discretization schemes such as those in Altmayer and Neuenkirch (2017); Andersen (2008); Kahl and Jäckel (2006); Lord et al. (2009) are very popular for the Heston model. These discretization schemes can be easily extended to the multi-dimensional case and avoid computational bottlenecks of the exact schemes. In particular, Euler-type methods, such as the fully truncated Euler scheme, seem to be very efficient (see, e.g., Coskun and Korn (2018); Lord et al. (2009)), but, to the best of our knowledge, no weak error analysis is available for them.

A second order discretization scheme for the log-Heston model has been introduced in Andersen (2008) and analyzed in Zheng (2017). The so-called Broadie-Kaya trick and a removal of the drift, detailed in Section 3.1, reduce the simulation of the log-Heston model to the joint simulation of

\begin{matrix} \begin{matrix} d X_{t} & = (\frac{ρ κ}{σ} - \frac{1}{2}) V_{t} d t + \sqrt{1 - ρ^{2}} \sqrt{V_{t}} d B_{t}, \\ d V_{t} & = κ (θ - V_{t}) d t + σ \sqrt{V_{t}} d W_{t} . \end{matrix} \end{matrix}

(3)

Moreover, since the transition density of the CIR process

V = {(V_{t})}_{t \in [0, T]}

follows a non-central chi-square distribution, it can be simulated exactly. Trapezoidal discretizations of the first component

X = {(X_{t})}_{t \in [0, T]}

lead to the trapezoidal scheme

\begin{matrix} x_{k + 1} = x_{k} & + (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{v_{k + 1} + v_{k}}{2} (t_{k + 1} - t_{k}) \\ + \sqrt{1 - ρ^{2}} \sqrt{\frac{v_{k + 1} + v_{k}}{2}} Δ_{k} B, k = 0, \dots, N - 1, \end{matrix}

where

0 = t_{0} < \dots < t_{k} < \dots < t_{N} = T

,

v_{k} = V_{t_{k}}

and

Δ_{k} B = B_{t_{k + 1}} - B_{t_{k}}

. In particular, this discretization avoids the cumbersome exact simulation of the integrated volatility. Zheng (2017) establishes weak order two for polynomial test functions by transferring the error analysis to that of a trapezoidal rule for multidimensional deterministic integrals. Our original intention was to extend this result to a larger class of test functions f by using the Kolmogorov PDE approach. However, the required Itō-Taylor expansions turned out not to be feasible. Instead, we analyzed the following two semi-exact discretization schemes: the Euler-type scheme

\begin{matrix} x_{k + 1} = x_{k} + (\frac{ρ κ}{σ} - \frac{1}{2}) v_{k} (t_{k + 1} - t_{k}) + \sqrt{1 - ρ^{2}} \sqrt{v_{k}} Δ_{k} B \end{matrix}

(4)

and the semi-trapezoidal scheme

\begin{matrix} x_{k + 1} = x_{k} + (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{v_{k + 1} + v_{k}}{2} (t_{k + 1} - t_{k}) + \sqrt{1 - ρ^{2}} \sqrt{v_{k}} Δ_{k} B . \end{matrix}

(5)

In both schemes, the CIR process is simulated exactly. In our opinion, the analysis of these schemes gives valuable insights into the weak error analysis of discretization schemes for the log-Heston model and is also a good starting point for the analysis of full Euler-type discretization schemes.
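The two semi-exact schemes (4) and (5) can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the implementation used for the experiments in Section 2, which were carried out in MATLAB. The function names `sample_poisson`, `cir_exact_step`, and `simulate_semi_exact` are hypothetical, an equidistant grid is assumed, and the non-central chi-square variate is drawn through its Poisson mixture representation with a simple Poisson sampler that is adequate only for moderate non-centrality parameters.

```python
import math
import random


def sample_poisson(rng, mean):
    """Knuth's multiplication method; only suitable for moderate means."""
    limit = math.exp(-mean)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def cir_exact_step(rng, v, h, kappa, theta, sigma):
    """Draw V_{t+h} given V_t = v from its scaled non-central chi-square law."""
    c = sigma**2 * (1.0 - math.exp(-kappa * h)) / (4.0 * kappa)
    dof = 4.0 * kappa * theta / sigma**2
    nonc = v * math.exp(-kappa * h) / c
    # Poisson mixture: a non-central chi-square variate with parameters
    # (dof, nonc) is a central chi-square with dof + 2*J degrees of freedom,
    # where J ~ Poisson(nonc / 2); chi-square(k) equals 2 * Gamma(k / 2, 1).
    j = sample_poisson(rng, nonc / 2.0)
    return c * 2.0 * rng.gammavariate(dof / 2.0 + j, 1.0)


def simulate_semi_exact(rng, x0, v0, T, n_steps, kappa, theta, sigma, rho,
                        trapezoidal=False):
    """Return (x_N, v_N) for scheme (4), or scheme (5) if trapezoidal=True."""
    h = T / n_steps
    drift = rho * kappa / sigma - 0.5
    x, v = x0, v0
    for _ in range(n_steps):
        v_next = cir_exact_step(rng, v, h, kappa, theta, sigma)
        # Scheme (5) averages v_k and v_{k+1} in the drift only.
        v_drift = 0.5 * (v + v_next) if trapezoidal else v
        delta_b = math.sqrt(h) * rng.gauss(0.0, 1.0)  # increment of B
        x += drift * v_drift * h + math.sqrt((1.0 - rho**2) * v) * delta_b
        v = v_next
    return x, v


# Illustrative call with the Model 3 parameters of Section 2 and eight steps.
rng = random.Random(42)
x_N, v_N = simulate_semi_exact(rng, 0.0, 0.0457, 2.0, 8,
                               kappa=5.07, theta=0.0457, sigma=0.48, rho=-0.767)
print(x_N, v_N)
```

Both schemes consume the random numbers in the same order, so for a fixed seed they produce the same volatility path and differ only in the drift term of x.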

Our error analysis relies on two regularity results for the Heston PDE (Briani et al. (2018); Feehan and Pop (2013)), the Kolmogorov PDE approach for the weak error analysis from Talay and Tubaro (1990), and Malliavin calculus. We also observe the usual trade-off between the smoothness assumption on the payoff and the restriction on the Feller index. For payoffs of lower smoothness, a restriction on the Feller index

ν = 2 κ θ / σ^{2}

is required, which arises from the use of Malliavin calculus tools.

In the following, we use the notation

Δ t = max_{k = 1, \dots, N} | t_{k} - t_{k - 1} |

for the maximal step size and the usual notations for the spaces of differentiable functions. In particular, the subscript c denotes compact support and

p o l

denotes polynomial growth; see Section 3.1 for the precise definitions. The results of Feehan and Pop (2013) require compact support of the test functions f, while the results of Briani et al. (2018) allow polynomial growth but require higher smoothness for f.

Theorem 1.

Let

ε > 0

. (i) If

f \in C_{c}^{2 + ε} (R \times R_{+}; R)

and

\frac{2 κ θ}{σ^{2}} > \frac{3}{2}

, then both schemes satisfy

\begin{matrix} E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})] = O (Δ t) . \end{matrix}

(ii) If

f \in C_{c}^{4 + ε} (R \times R_{+}; R)

, then both schemes satisfy

\begin{matrix} E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})] = O (Δ t) . \end{matrix}

Assuming more smoothness of f, we obtain more detailed results:

Theorem 2.

Suppose that

f \in C_{p o l}^{8} (R \times R_{+}; R)

. (i) Then, the Euler scheme (4) satisfies

\begin{matrix} E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})] = \sum_{n = 0}^{N - 1} \int_{t_{n}}^{t_{n + 1}} \int_{t_{n}}^{t} E [H (s, t, {\hat{x}}_{s}, {\hat{x}}_{t}, V_{s}, V_{t})] d s d t + O ({(Δ t)}^{2}), \end{matrix}

where

\begin{matrix} H (s, t, {\hat{x}}_{s}, {\hat{x}}_{t}, V_{s}, V_{t}) & = (\frac{1}{2} - \frac{ρ κ}{σ}) (κ (θ - V_{s}) u_{x} (t, {\hat{x}}_{t}, V_{t}) + σ^{2} V_{s} u_{x v} (s, {\hat{x}}_{s}, V_{s})) \\ - \frac{(1 - ρ^{2})}{2} (κ (θ - V_{s}) u_{x x} (t, {\hat{x}}_{t}, V_{t}) + σ^{2} V_{s} u_{x x v} (s, {\hat{x}}_{s}, V_{s})) \end{matrix}

and

\begin{matrix} {\hat{x}}_{t} & = x_{n} + (\frac{ρ κ}{σ} - \frac{1}{2}) v_{n} (t - t_{n}) + \sqrt{1 - ρ^{2}} \sqrt{v_{n}} (B_{t} - B_{t_{n}}), t \in [t_{n}, t_{n + 1}], \end{matrix}

for

n = 0, \dots, N - 1

.

In particular, for an equidistant discretization with

t_{k} = k T / N

,

k = 0, \dots, N

, we have

\begin{matrix} lim_{N \to \infty} N (E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})]) = \frac{T}{2} \int_{0}^{T} E [H (t, t, X_{t}, X_{t}, V_{t}, V_{t})] d t . \end{matrix}

Here, u denotes the solution of the associated Kolmogorov PDE; see Equation (7).

(ii) For the semi-trapezoidal scheme (5), we have

\begin{matrix} E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})] = \sum_{n = 0}^{N - 1} \int_{t_{n}}^{t_{n + 1}} \int_{t_{n}}^{t} E [H (s, t, {\hat{x}}_{s}, {\hat{x}}_{t}, V_{s}, V_{t})] d s d t + O ({(Δ t)}^{2}), \end{matrix}

where

\begin{matrix} H (s, t, {\hat{x}}_{s}, {\hat{x}}_{t}, V_{s}, V_{t}) & = - \frac{(1 - ρ^{2})}{2} (κ (θ - V_{s}) u_{x x} (t, {\hat{x}}_{t}, V_{t}) + σ^{2} V_{s} u_{x x v} (s, {\hat{x}}_{s}, V_{s})) \end{matrix}

and

\begin{matrix} {\hat{x}}_{t} = x_{n} & + (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{V_{t} + v_{n}}{2} (t - t_{n}) + \sqrt{1 - ρ^{2}} \sqrt{v_{n}} (B_{t} - B_{t_{n}}), t \in [t_{n}, t_{n + 1}], \end{matrix}

for

n = 0, \dots, N - 1

.

In particular, for an equidistant discretization

t_{k} = k T / N

,

k = 0, \dots, N

, it holds

\begin{matrix} lim_{N \to \infty} N (E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})]) = \frac{T}{2} \int_{0}^{T} E [H (t, t, X_{t}, X_{t}, V_{t}, V_{t})] d t . \end{matrix}

Here, u denotes again the solution of the associated Kolmogorov PDE; see Equation (7).

Thus, the semi-trapezoidal rule eliminates the first two terms of the error expansion of the Euler scheme.

Remarks

Remark 1.

We expect that the error expansions for an equidistant discretization for both schemes satisfy

\begin{matrix} E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})] = (\frac{T}{2} \int_{0}^{T} E [H (t, t, X_{t}, X_{t}, V_{t}, V_{t})] d t) \cdot N^{- 1} + O (N^{- 2}) . \end{matrix}

(6)

However, to establish this, we would require error estimates for functionals of the type

E [f (λ, X_{T}, V_{T})]

with

λ \in [0, T]

, which are uniform in λ. (Compare, e.g., Proposition 2 in Talay and Tubaro (1990).) This, in turn, would require uniform regularity estimates for the Heston PDE, which are not available at the moment.

Remark 2.

Property (6) allows us to construct a second order scheme via extrapolation: if (6) holds, then

Y_{N} = 2 f (x_{2 N}, v_{2 N}) - f (x_{N}, v_{N})

satisfies

E Y_{N} = E f (X_{T}, V_{T}) + O ({(Δ t)}^{2}),

where

(x_{2 N}, v_{2 N})

uses the stepsize

T / (2 N)

and

(x_{N}, v_{N})

the stepsize

T / N

.
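The cancellation behind this extrapolation can be illustrated on a deterministic toy example. The following sketch does not simulate the Heston model. It assumes a hypothetical first-order approximation, the sequence (1 + 1/n)^n for Euler's number, whose error has the same structure C/n + O(n^{-2}) as in (6), and applies the combination 2 a(2n) - a(n). The function names are illustrative only.

```python
import math


def first_order_approx(n):
    """Toy quantity with error C/n + O(n^-2): (1 + 1/n)^n approximates e."""
    return (1.0 + 1.0 / n) ** n


def extrapolate(approx, n):
    """Combination 2*a(2n) - a(n), which cancels the C/n error term."""
    return 2.0 * approx(2 * n) - approx(n)


for n in (8, 16, 32, 64):
    plain = abs(first_order_approx(2 * n) - math.e)
    extra = abs(extrapolate(first_order_approx, n) - math.e)
    print(f"n={n:3d}  plain error={plain:.2e}  extrapolated error={extra:.2e}")
```

In a Monte Carlo setting the same combination is applied to the two estimators with step sizes T/(2N) and T/N, and the statistical error of both estimators has to be small enough for the second order to become visible.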

Remark 3.

We require smoothness assumptions for f that are not met by the payoffs in practice, which are at most Lipschitz continuous or even discontinuous. However, this is a typical problem for weak approximation of SDEs such as the Heston SDE, which do not satisfy the so-called standard assumptions on the coefficients. In Bally and Talay (1996), only bounded and measurable test functions f are treated assuming uniform hypoellipticity of the coefficients of the SDE. However, the Heston model does not satisfy this property. An adaptation of the strategy of Bally and Talay (1996) to the Heston model yields a strong assumption on the Feller index (see Altmayer (2015)), which we want to avoid here.

Remark 4.

Schemes built on the Broadie-Kaya trick, i.e., Equation (3), have a different structure than schemes which arise by a direct discretization of the log-Heston model as, e.g., the schemes studied in Altmayer and Neuenkirch (2017); Lord et al. (2009). For example, the so-called absorbed Euler discretization reads as

\begin{matrix} z_{k + 1} & = z_{k} - \frac{1}{2} v_{k} (t_{k + 1} - t_{k}) + \sqrt{v_{k}} (ρ (W_{t_{k + 1}} - W_{t_{k}}) + \sqrt{1 - ρ^{2}} (B_{t_{k + 1}} - B_{t_{k}})), \\ v_{k + 1} & = {(v_{k} + κ (θ - v_{k}) (t_{k + 1} - t_{k}) + σ \sqrt{v_{k}} (W_{t_{k + 1}} - W_{t_{k}}))}^{+} . \end{matrix}

Here, the volatility

V = {(V_{t})}_{t \in [0, T]}

is discretized by an Euler scheme, a fix for retaining the positivity is introduced by using the positive part, and the equation for the log-Heston price

Z = {(log (S_{t}))}_{t \in [0, T]}

is discretized instead of the one for

X = {(X_{t})}_{t \in [0, T]}

.
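For comparison with the semi-exact schemes, one step of the absorbed Euler discretization displayed above can be sketched as follows. This is a minimal sketch with hypothetical function names. It takes the Brownian increments as arguments so that a single step can be checked by hand, and a small driver on an equidistant grid is added for illustration.

```python
import math
import random


def absorbed_euler_step(z, v, h, d_w, d_b, kappa, theta, sigma, rho):
    """One step of the absorbed Euler scheme for (log-price, volatility)."""
    z_next = z - 0.5 * v * h + math.sqrt(v) * (
        rho * d_w + math.sqrt(1.0 - rho**2) * d_b)
    # The positive part keeps the discretized volatility non-negative.
    v_next = max(v + kappa * (theta - v) * h + sigma * math.sqrt(v) * d_w, 0.0)
    return z_next, v_next


def simulate_absorbed_euler(rng, z0, v0, T, n_steps, kappa, theta, sigma, rho):
    """Return (z_N, v_N) on an equidistant grid with n_steps steps."""
    h = T / n_steps
    z, v = z0, v0
    for _ in range(n_steps):
        d_w = math.sqrt(h) * rng.gauss(0.0, 1.0)
        d_b = math.sqrt(h) * rng.gauss(0.0, 1.0)
        z, v = absorbed_euler_step(z, v, h, d_w, d_b, kappa, theta, sigma, rho)
    return z, v
```

In contrast to schemes (4) and (5), both Brownian motions W and B are discretized here, and no sampling from the non-central chi-square distribution is needed.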

Remark 5.

The Broadie-Kaya trick is a particular case of a more general transformation procedure, which has been introduced in Cui et al. (2018) for a general class of stochastic volatility models. In addition, in Cui et al. (2018), the weak convergence of a Markov chain approximation for these equations is established, which had been introduced in Cui et al. (2020). Markov chain approximations have also been studied in Briani et al. (2018) for the Heston and Bates models and are an alternative to a classical discretization of stochastic differential equations. In particular, they can be beneficial for pricing American options.

2. Numerical Results

In this section, we will test numerically whether the convergence rates for the Euler Scheme (4) and the Semi-Trapezoidal Scheme (5) are attained even under milder assumptions than those from Theorems 1 and 2. We use the following model parameters:

Model 1: $S_{0} = 100, V_{0} = 0.010201, K = 100, κ = 6.21, θ = 0.019, σ = 0.61, ρ = - 0.7, T = 1, r = 0.0319$ ;
Model 2: $S_{0} = 100, V_{0} = 0.09, K = 100, κ = 2, θ = 0.09, σ = 1, ρ = - 0.3, T = 5, r = 0.05$ ;
Model 3: $S_{0} = 100, V_{0} = 0.0457, K = 100, κ = 5.07, θ = 0.0457, σ = 0.48, ρ = - 0.767, T = 2, r = 0.00$ .

The Feller index is

ν = \frac{2 κ θ}{σ^{2}} \approx 0.63

in Model 1,

ν \approx 0.36

in Model 2, and

ν \approx 2.01

in Model 3. For each model, we use the following payoff functions:

1. European Call: $g_{1} (S_{T}) = e^{- r T} max \{S_{T} - K, 0\}$ ;
2. European Put: $g_{2} (S_{T}) = e^{- r T} max \{K - S_{T}, 0\}$ ;
3. Indicator: $g_{3} (S_{T}) = e^{- r T} 1_{[0, K]} (S_{T})$ .
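The stated Feller indices and the three payoffs can be reproduced with a short script. This is an illustrative sketch only: the dictionary layout and the function names are assumptions, while the parameter values are those of Models 1 to 3 above.

```python
import math

# Model parameters as listed above.
MODELS = {
    1: dict(S0=100.0, V0=0.010201, K=100.0, kappa=6.21, theta=0.019,
            sigma=0.61, rho=-0.7, T=1.0, r=0.0319),
    2: dict(S0=100.0, V0=0.09, K=100.0, kappa=2.0, theta=0.09,
            sigma=1.0, rho=-0.3, T=5.0, r=0.05),
    3: dict(S0=100.0, V0=0.0457, K=100.0, kappa=5.07, theta=0.0457,
            sigma=0.48, rho=-0.767, T=2.0, r=0.0),
}


def feller_index(kappa, theta, sigma):
    """Feller index nu = 2 * kappa * theta / sigma^2 of the CIR process."""
    return 2.0 * kappa * theta / sigma**2


def call_payoff(s, K, r, T):
    """Discounted European call payoff g_1."""
    return math.exp(-r * T) * max(s - K, 0.0)


def put_payoff(s, K, r, T):
    """Discounted European put payoff g_2."""
    return math.exp(-r * T) * max(K - s, 0.0)


def indicator_payoff(s, K, r, T):
    """Discounted indicator payoff g_3 of the event S_T in [0, K]."""
    return math.exp(-r * T) * (1.0 if 0.0 <= s <= K else 0.0)


for number, m in MODELS.items():
    nu = feller_index(m["kappa"], m["theta"], m["sigma"])
    print(f"Model {number}: Feller index = {nu:.2f}")
```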

Note that none of these payoffs satisfies the assumptions of our Theorems. Thus, the presented numerical experiments explore whether the Theorems are valid under milder assumptions. In order to measure the weak error rate, we simulated

M = 2 \cdot 10^{7}

independent copies

g_{i} (s_{N}^{(j)})

,

j = 1, \dots, M

, of

g_{i} (s_{N})

to estimate

E (g_{i} (s_{N}))

by

p_{M, N} = \frac{1}{M} \sum_{j = 1}^{M} g_{i} (s_{N}^{(j)})

for each combination of model parameters, functional and number of steps

N \in \{2^{1}, \dots, 2^{6}\}

where

Δ t = \frac{T}{N}

. The number of Monte Carlo samples is chosen in such a way that the Monte Carlo error is sufficiently small, i.e., does not dominate the theoretically expected convergence rates. The Monte Carlo mean of these samples was then compared to a reference solution

p_{ref}

, i.e.,

e (N) = | p_{ref} - p_{M, N} |,

and the error

e (N)

is plotted in Figures 1–18. We then measured the rate of convergence, i.e., the decay rate of

e (N)

, by the slope of a least-squares fit in logarithmic coordinates. The reference solutions can be computed with sufficiently high accuracy from semi-explicit formulae via Fourier methods. In particular, the put price can be calculated from the call-price formula given in Heston (1993) via put-call parity. The price of the digital option can be computed from the probability

P_{2}

given in Heston (1993); it equals

e^{- r T} (1 - P_{2})

. In addition to the Euler and Semi-Trapezoidal schemes, we simulated the Trapezoidal scheme as in Zheng (2017) and the two extrapolation schemes from Remark 2. Moreover, to present a broader picture, we estimated the weak error order of two Euler-type discretizations of the full Heston Model, the Full Truncation Euler (FTE) as in Lord et al. (2009), and the Symmetrized Euler as in Bossy and Diop (2015). For clarity, we show two plots for each combination of model parameters and functional: one with the suspected order one schemes (Euler, Semi-Trapezoidal, FTE, and Symmetrized Euler) and one with the suspected order two schemes (Trapezoidal, Extrapolated Euler, and Extrapolated Semi-Trapezoidal).
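The rate measurement described above, i.e., the slope of a least-squares fit of the error in logarithmic coordinates, can be sketched as follows. The function name `estimated_rate` and the use of base-2 logarithms are assumptions for illustration; the fitted slope does not depend on the base of the logarithm.

```python
import math


def estimated_rate(steps, errors):
    """Negative least-squares slope of log2(error) against log2(steps)."""
    xs = [math.log2(n) for n in steps]
    ys = [math.log2(e) for e in errors]
    x_bar = sum(xs) / len(xs)
    y_bar = sum(ys) / len(ys)
    slope = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
             / sum((x - x_bar) ** 2 for x in xs))
    # An error decaying like N^(-rate) has slope -rate in log-log coordinates.
    return -slope


# Synthetic errors that decay exactly like 1/N, on the grid N = 2, ..., 64.
steps = [2**k for k in range(1, 7)]
print(estimated_rate(steps, [3.0 / n for n in steps]))
```

With Monte Carlo estimates in place of exact errors, the fitted slope is itself random, which is one reason why the number of samples has to be large.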

Figure 1. Call Model 1.

Figure 2. Call Model 1.

Figure 3. Put Model 1.

Figure 4. Put Model 1.

Figure 5. Indicator Model 1.

Figure 6. Indicator Model 1.

Figure 7. Call Model 2.

Figure 8. Call Model 2.

Figure 9. Put Model 2.

Figure 10. Put Model 2.

Figure 11. Indicator Model 2.

Figure 12. Indicator Model 2.

Figure 13. Call Model 3.

Figure 14. Call Model 3.

Figure 15. Put Model 3.

Figure 16. Put Model 3.

Figure 17. Indicator Model 3.

Figure 18. Indicator Model 3.

2.1. Model 1

In Table 1, we can see the measured convergence rates for this model with a Feller index of

ν \approx 0.63 .

The associated plots are shown in Figures 1–6.

Table 1. Measured convergence rates Model 1.

All “Order 1” schemes seem to have a very regular convergence behavior except for the Semi-Trapezoidal scheme for the Indicator, which could be explained by the low absolute error. Especially for the Call and the Indicator, both schemes from Theorem 1 seem to have very high weak convergence rates. Since the Feller index is only 0.63 in this model, this indicates that the assertions of Theorems 1 and 2 could hold under weaker assumptions. The extremely low estimated convergence rate for the Semi-Trapezoidal scheme in combination with the Put could be due to the low error. The estimated weak error order of the FTE scheme is noticeably higher than 1, whereas the Symmetrized Euler has low convergence rates. The convergence behavior of the “Order 2” schemes is a bit less regular. The Extrapolated Euler scheme seems to converge with order 2 for all payoff functions, whereas the Extrapolated Semi-Trapezoidal scheme seems to have only order 1 for the Indicator. Again, however, we note that the error for just 2 discretization steps already starts at around 2⁻¹⁰, which is extremely low.

2.2. Model 2

Here, we have an even lower Feller index of

ν \approx 0.36

. We can see that the estimated convergence rates for all “Order 1” schemes are lower than before, see Table 2. However, the Semi-Trapezoidal scheme and the FTE scheme seem to converge with order 1. The convergence behavior is still quite regular as we can see in Figure 7, Figure 9, and Figure 11. In absolute terms, the errors of the schemes from Theorem 1 are the lowest, especially for Put and Indicator. Looking at the “Order 2” schemes, the Trapezoidal discretization still shows an estimated weak convergence rate of around 2, whereas the two extrapolation schemes show a weaker performance. But, especially for the Indicator, all three schemes seem to have a very low error and a quite regular convergence behavior.

Table 2. Measured convergence rates Model 2.

2.3. Model 3

Here, we have the highest Feller Index with

ν \approx 2.01

. It is, therefore, a bit surprising that the Euler scheme seems to have a convergence rate of less than 1 in this case. In general, the errors for the “Order 1” schemes show a more irregular behavior, as can be seen from Figure 13, Figure 15, and Figure 17. The Semi-Trapezoidal and the FTE scheme work especially well in this scenario as we can see in Table 3. This is also the only case where the Symmetrized Euler shows an estimated convergence order of around 1. The extrapolation definitely improves the convergence rate of the Euler scheme with order 2 for the Indicator, but this is not the case for the Semi-Trapezoidal scheme.

Table 3. Measured convergence rates Model 3.

2.4. Computational Times

The computational times show the expected behavior, i.e., the simulation times for the semi-exact schemes increase as the Feller index decreases. See Table 4 and Table 5. This is a well known feature of the MATLAB-generator ncx2rnd for the non-central chi-square distribution, which we used. (All simulations were carried out in MATLAB.)

Table 4. Computational times (sec.) of the semi-exact schemes for 2⁶ time steps and 2 × 10⁷ paths.

Table 5. Computational times (sec.) of Euler-type discretizations for 2⁶ time steps and 2 × 10⁷ paths.

2.5. Conclusions

Except for the Euler scheme for the Call in Model 3, the simulation studies support the conjecture that the convergence rates of Theorems 1 and 2 hold under weaker assumptions. We have no explanation for this behavior of the Euler scheme other than possibly pre-asymptotic step sizes. For the extrapolated schemes, which might have order two, the situation is less clear. Since the behavior of the trapezoidal scheme is regular, an overly large Monte Carlo error seems an unlikely explanation. Possible explanations are again the pre-asymptotic step sizes or the non-smoothness of the considered payoffs.

3. Auxiliary Results

In this section, we will collect and establish, respectively, several auxiliary results for the weak error analysis.

3.1. Kolmogorov PDE

Recall that the stochastic integral equations for the log-Heston model for

0 \leq s < t \leq T

read as

\begin{matrix} log (S_{t}) & = log (S_{s}) + \int_{s}^{t} (r - \frac{1}{2} V_{u}) d u + \int_{s}^{t} \sqrt{V_{u}} d (ρ W_{u} + \sqrt{1 - ρ^{2}} B_{u}), \\ V_{t} & = V_{s} + \int_{s}^{t} κ (θ - V_{u}) d u + σ \int_{s}^{t} \sqrt{V_{u}} d W_{u} . \end{matrix}

Now, we apply the so-called Broadie-Kaya trick from Broadie and Kaya (2006). We can rearrange the second equation:

\begin{matrix} \int_{s}^{t} \sqrt{V_{u}} d W_{u} = \frac{1}{σ} (V_{t} - V_{s} - κ θ (t - s) + κ \int_{s}^{t} V_{u} d u) . \end{matrix}

Then, we plug this equation into the first one:

\begin{matrix} log (S_{t}) - log (S_{s}) & = r (t - s) + \frac{ρ}{σ} (V_{t} - V_{s} - κ θ (t - s)) + (\frac{ρ κ}{σ} - \frac{1}{2}) \int_{s}^{t} V_{u} d u \\ & + \sqrt{1 - ρ^{2}} \int_{s}^{t} \sqrt{V_{u}} d B_{u} . \end{matrix}

Without loss of generality, we can neglect the non-integral part in

log (S_{t}) - log (S_{s})

, since we have

f (log (S_{T}), V_{T}) = f (X_{T} + r T + \frac{ρ}{σ} (V_{T} - V_{0} - κ θ T), V_{T})

with

X_{T} = X_{T}^{0, log (S_{0}), V_{0}}

given below. To get the Kolmogorov backward PDE, we look at the following integral equations:

\begin{matrix} V_{t}^{s, v} & = v + \int_{s}^{t} κ (θ - V_{r}^{s, v}) d r + σ \int_{s}^{t} \sqrt{V_{r}^{s, v}} d W_{r}, \\ X_{t}^{s, x, v} & = x + (\frac{ρ κ}{σ} - \frac{1}{2}) \int_{s}^{t} V_{r}^{s, v} d r + \sqrt{1 - ρ^{2}} \int_{s}^{t} \sqrt{V_{r}^{s, v}} d B_{r} . \end{matrix}

We set

\begin{matrix} u (t, x, v) & = E [f (X_{T}^{t, x, v}, V_{T}^{t, v})], t \in [0, T], x \in R, v \geq 0 \end{matrix}

and obtain for

f : R \times [0, \infty) \to R

bounded and continuous the Kolmogorov backward PDE by an application of the Feynman-Kac Theorem (see, e.g., Theorem 5.7.6 in Karatzas and Shreve (1991)):

\begin{matrix} u_{t} (t, x, v) = & - (\frac{ρ κ}{σ} - \frac{1}{2}) v u_{x} (t, x, v) - κ (θ - v) u_{v} (t, x, v) \\ - \frac{v}{2} ((1 - ρ^{2}) u_{x x} (t, x, v) + σ^{2} u_{v v} (t, x, v)), t \in (0, T), x \in R, v > 0, \\ u (T, x, v) = & f (x, v), x \in R, v \geq 0 . \end{matrix}

(7)

In our error analysis, we will follow the now classical approach of Talay and Tubaro (1990), which exploits the regularity of the Kolmogorov backward PDE. For the latter we will rely on the works of Feehan and Pop (2013) and Briani et al. (2018). To state these regularity results, we will need the following notation:

For a multi-index

l = (l_{1}, . . ., l_{d}) \in N^{d}

, we define

| l | = \sum_{j = 1}^{d} l_{j}

and for

y \in R^{d}

, we define

\partial_{y}^{l} = \partial_{y_{1}}^{l_{1}} \dots \partial_{y_{d}}^{l_{d}}

. Moreover, we denote by

| y |

the standard Euclidean norm in

R^{d}

. Let

D \subset R^{d}

be a domain and

q \in N

. The set

C^{q} (D; R)

is the set of all real-valued functions on

D

which are q-times continuously differentiable. For

ε \in (0, 1)

, we denote by

C^{q + ε} (D; R)

the set of all functions from

C^{q} (D; R)

whose partial derivatives of order q are Hölder continuous of order

ε

, and

C_{c}^{q + ε} (D; R)

is the set of all functions from

C^{q + ε} (D; R)

, which have compact support. Moreover,

C_{p o l}^{q} (D; R)

is the set of functions

g \in C^{q} (D; R)

such that there exist

C, a > 0

for which

\begin{matrix} | \partial_{y}^{l} g (y) | \leq C (1 + {| y |}^{a}), y \in D, | l | \leq q . \end{matrix}

Finally, we denote by

C_{p o l, T}^{q} (D; R)

the set of functions

v \in C_{p o l}^{⌊ q / 2 ⌋, q} ([0, T) \times D; R)

such that there exist

C, a > 0

for which

\begin{matrix} sup_{t < T} | \partial_{t}^{k} \partial_{y}^{l} v (t, y) | \leq C (1 + {| y |}^{a}), y \in D, 2 k + | l | \leq q . \end{matrix}

The work of Feehan and Pop deals with general degenerate parabolic equations and establishes a priori regularity estimates for them. In the context of Equation (7), the main result of Feehan and Pop (2013), i.e., Theorem 1.1, reads as follows:

Theorem 3.

Let

ε > 0

and

f \in C_{c}^{2 + ε} (R \times R_{+}; R)

. Then, there exists a constant

c > 0

, depending only on

f, T, ρ, κ, θ

and σ such that the solution u of PDE (7) satisfies

\begin{matrix} sup_{(t, x, v) \in [0, T] \times R \times [0, \infty)} (| u (t, x, v) | + | \partial_{t} u (t, x, v) | + | \partial_{v} u (t, x, v) | + | \partial_{x} u (t, x, v) |) \leq c, \\ sup_{(t, x, v) \in [0, T] \times R \times [0, 1]} (| v \partial_{x x} u (t, x, v) | + | v \partial_{x v} u (t, x, v) | + | v \partial_{v v} u (t, x, v) |) \leq c, \\ sup_{(t, x, v) \in [0, T] \times R \times [1, \infty)} (| \partial_{x x} u (t, x, v) | + | \partial_{x v} u (t, x, v) | + | \partial_{v v} u (t, x, v) |) \leq c . \end{matrix}

So, under the above assumptions on f, the solution u and the first order derivatives are bounded. Moreover, the second order derivatives are also bounded, if they are damped by v for

v \in [0, 1]

.

Assuming more smoothness on f, we can achieve more regularity for u using the above result, at least for the partial derivatives with respect to x. Set

\begin{matrix} \hat{u} (t, x, v) & : = u_{x} (t, x, v) = E [f_{x} (X_{T}^{t, x, v}, V_{T}^{t, v})], \\ \tilde{u} (t, x, v) & : = u_{x x} (t, x, v) = E [f_{x x} (X_{T}^{t, x, v}, V_{T}^{t, v})] . \end{matrix}

This is well defined: by continuity and boundedness of

f_{x}

and dominated convergence we have

\begin{matrix} u_{x} (t, x, v) & = \frac{\partial}{\partial x} E [f (X_{T}^{t, x, v}, V_{T}^{t, v})] = \frac{\partial}{\partial x} E [f (x + Z_{T}^{t, v}, V_{T}^{t, v})] \\ = lim_{δ \to 0} E [\frac{f (x + δ + Z_{T}^{t, v}, V_{T}^{t, v}) - f (x + Z_{T}^{t, v}, V_{T}^{t, v})}{δ}] \\ = lim_{δ \to 0} \int_{0}^{1} E [f_{x} (x + δ λ + Z_{T}^{t, v}, V_{T}^{t, v})] d λ \\ = \int_{0}^{1} E [f_{x} (x + Z_{T}^{t, v}, V_{T}^{t, v})] d λ \\ = E [f_{x} (x + Z_{T}^{t, v}, V_{T}^{t, v})] = E [f_{x} (X_{T}^{t, x, v}, V_{T}^{t, v})] \end{matrix}

with

\begin{matrix} Z_{T}^{t, v} = (\frac{ρ κ}{σ} - \frac{1}{2}) \int_{t}^{T} V_{r}^{t, v} d r + \sqrt{1 - ρ^{2}} \int_{t}^{T} \sqrt{V_{r}^{t, v}} d B_{r} . \end{matrix}

An analogous calculation for

u_{x x} (t, x, v)

shows that

u_{x x} (t, x, v) = E [f_{x x} (X_{T}^{t, x, v}, V_{T}^{t, v})]

. Thus,

u_{x x}

is also bounded, if

f \in C_{c}^{2 + ε} (R \times R_{+}; R)

. Moreover,

\hat{u}

fulfills the Kolmogorov backward PDE

\begin{matrix} {\hat{u}}_{t} (t, x, v) = & - (\frac{ρ κ}{σ} - \frac{1}{2}) v {\hat{u}}_{x} (t, x, v) - κ (θ - v) {\hat{u}}_{v} (t, x, v) \\ - \frac{v}{2} ((1 - ρ^{2}) {\hat{u}}_{x x} (t, x, v) + σ^{2} {\hat{u}}_{v v} (t, x, v)), t \in (0, T), x \in R, v > 0, \\ \hat{u} (T, x, v) = & f_{x} (x, v), x \in R, v \geq 0, \end{matrix}

while

\tilde{u}

fulfills the same PDE with terminal condition

\begin{matrix} \tilde{u} (T, x, v) = & f_{x x} (x, v), x \in R, v \geq 0 . \end{matrix}

Applying Theorem 3 now to

\hat{u}

and

\tilde{u}

, we obtain the following additional bounds (case (ii)) for the derivatives of u:

Corollary 1.

(i) Let

ε > 0

and

f \in C_{c}^{2 + ε} (R \times R_{+}; R)

. Then, there exists a constant

c > 0

, depending only on

f, T, ρ, κ, θ

and σ such that the solution u of PDE (7) satisfies

\begin{matrix} sup_{(t, x, v) \in [0, T] \times R \times [0, \infty)} | \partial_{x x} u (t, x, v) | \leq c . \end{matrix}

(ii) Let

ε > 0

and

f \in C_{c}^{4 + ε} (R \times R_{+}; R)

. Then, there exists a constant

c > 0

, depending only on

f, T, ρ, κ, θ

and σ such that the solution u of PDE (7) satisfies

\begin{matrix} sup_{(t, x, v) \in [0, T] \times R \times [0, \infty)} (| \partial_{x v} u (t, x, v) | + | \partial_{x x} u (t, x, v) | + | \partial_{x x v} u (t, x, v) | + | \partial_{x x x} u (t, x, v) |) \leq c . \end{matrix}

The recent work of Briani et al. is a specialized approach for the log-Bates model, of which the log-Heston model is a particular case. In our setting, they obtain in Proposition 5.3 and Remark 5.4 of Briani et al. (2018) the following:

Theorem 4.

Let

q \in N

,

q \geq 2

and suppose that

f \in C_{p o l}^{2 q} (R \times R_{+}; R)

. Then, the solution u of PDE (7) satisfies

u \in C_{p o l, T}^{q} (R \times R_{+}; R)

.

In contrast to the results of Feehan and Pop, the result of Briani et al. requires more smoothness of f but allows polynomial growth instead of compact support.

3.2. Properties of the CIR Process

We recall here the following estimates for the CIR process, which are well known or can be found in Hurd and Kuznetsov (2008).

Lemma 1.

(1) We have

E [sup_{t \in [0, T]} V_{t}^{p}] < \infty

for all

p \geq 1

and

sup_{t \in [0, T]} E V_{t}^{p} < \infty \quad \text{iff} \quad p > - \frac{2 κ θ}{σ^{2}} .

(2) For all

p \geq 1

, there exist constants

c > 0

, depending only on

p, κ, θ, σ, T,

and

V_{0}

, such that

E | V_{t} - V_{s} |^{p} \leq c \cdot {| t - s |}^{p / 2}, s, t \in [0, T] .

We will need the following bound on the growth of the

L^{q}

-norm of a specific stochastic integral of the CIR process:

Lemma 2.

For all

q \in [2, \frac{4 κ θ}{σ^{2}})

, it holds that

\begin{matrix} sup_{t \in [0, T]} t^{- q / 2} E [| \int_{0}^{t} \frac{1}{\sqrt{V_{u}}} d B_{u} |^{q}] < \infty . \end{matrix}

Proof.

With the Burkholder-Davis-Gundy inequality and the Hölder inequality, we have

\begin{matrix} t^{- q / 2} E [| \int_{0}^{t} \frac{1}{\sqrt{V_{u}}} d B_{u} |^{q}] & \leq t^{- q / 2} E [{|\int_{0}^{t} \frac{1}{V_{u}} d u|}^{q / 2}] \\ \leq t^{- q / 2} E [{({|\int_{0}^{t} {(\frac{1}{V_{u}})}^{q / 2} d u|}^{2 / q} {|\int_{0}^{t} d r|}^{(q - 2) / q})}^{q / 2}] \\ = t^{- q / 2} (\int_{0}^{t} E [{(\frac{1}{V_{u}})}^{q / 2}] d u) t^{(q - 2) / 2} \\ \leq sup_{u \in [0, t]} E [V_{u}^{- q / 2}] \end{matrix}

for all

t \in [0, T]

. The assertion now follows from Lemma 1 (1). □
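To spell out the last step: Lemma 1 (1) is applied with the negative exponent p = -q/2, and the condition on p translates into the upper bound on q as follows.

```latex
\sup_{u \in [0, T]} E [V_{u}^{- q / 2}] < \infty
\iff - \frac{q}{2} > - \frac{2 κ θ}{σ^{2}}
\iff q < \frac{4 κ θ}{σ^{2}} = 2 ν .
```

In particular, the admissible range for q in Lemma 2 is non-empty only if the Feller index satisfies ν > 1.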

3.3. Malliavin Calculus

When working with low smoothness assumptions on f, we will use a Malliavin integration by parts procedure to establish weak convergence order one. As in Altmayer and Neuenkirch (2017), this paragraph gives a short introduction to Malliavin calculus; for more details, we refer to Nualart (1995).

Malliavin calculus adds a derivative operator to stochastic analysis. Basically, if Y is a random variable and

{(W_{t}, B_{t})}_{t \in [0, T]}

a two-dimensional Brownian motion, then the Malliavin derivative measures the dependence of Y on

(W, B)

. The Malliavin derivative is defined by a standard extension procedure: Let

S

be the set of smooth random variables of the form

S = φ (\int_{0}^{T} h_{1} (s) d (W_{s}, B_{s}), \dots, \int_{0}^{T} h_{k} (s) d (W_{s}, B_{s}))

with

φ \in C^{\infty} (R^{k}; R)

bounded with bounded derivatives,

h_{i} \in L^{2} ([0, T]; R^{2})

,

i = 1, \dots, k

, and the stochastic integrals

\int_{0}^{T} h_{j} (s) d (W_{s}, B_{s}) = \int_{0}^{T} h_{j}^{(1)} (s) d W_{s} + \int_{0}^{T} h_{j}^{(2)} (s) d B_{s} .

The derivative operator D of such a smooth random variable is defined as

D S = \sum_{i = 1}^{k} \frac{\partial φ}{\partial x_{i}} (\int_{0}^{T} h_{1} (s) d (W_{s}, B_{s}), \dots, \int_{0}^{T} h_{k} (s) d (W_{s}, B_{s})) h_{i} .

This operator is closable from

L^{p} (Ω)

into

L^{p} (Ω; H)

with

H = L^{2} ([0, T]; R^{2})

and the Sobolev space

D^{1, p}

denotes the closure of

S

with respect to the norm

{∥ Y ∥}_{1, p} = {({E | Y |}^{p} + E {|\int_{0}^{T} {| D_{s} Y |}^{2} d s|}^{p / 2})}^{1 / p} .

In particular, if

D^{W}

denotes the first component of the Malliavin derivative, i.e., the derivative with respect to W, we have

D_{t}^{W} Y = \{\begin{matrix} 1_{[0, t]} & if & Y = W \\ 0 & if & Y = B \end{matrix}

and vice versa for the derivative with respect to B, i.e.,

D_{t}^{B} Y = \{\begin{matrix} 1_{[0, t]} & if & Y = B \\ 0 & if & Y = W \end{matrix}

This, in particular, implies that, if

Y \in D^{1, 2}

is independent of B, then

D^{B} Y = 0

.

For the CIR process, we will, therefore, have that

D^{B} V_{t} = 0

for all

t \in [0, T]

.

The derivative operator follows rules similar to ordinary calculus.

Proposition 1.

Let

X = (X_{1}, \dots, X_{d})

be a random variable with components in

D^{1, p}

. If

(i): $ϕ : R^{d} \to R$ is in $C^{1} (R^{d}; R)$ ,
(ii): $ϕ (X) \in L^{p} (Ω)$ ,
(iii): $\partial_{i} ϕ (X) \cdot D X_{i} \in L^{p} (Ω; H)$ for all $i = 1, \dots, d$ ,

then the chain rule holds:

ϕ (X) \in D^{1, p}

and

\begin{matrix} D ϕ (X) = \sum_{i = 1}^{d} \partial_{i} ϕ (X) \cdot D X_{i} . \end{matrix}

For example, for a random variable

Y \in D^{1, p}

and

g \in C^{1} (R; R)

with bounded derivative, the chain rule reads as

\begin{matrix} D g (Y) = g^{'} (Y) D Y . \end{matrix}

Another simple example for the application of this chain rule is

D_{r}^{W} [{(W_{t} - W_{s})}^{2}] = 2 (W_{t} - W_{s}) 1_{(s, t]} (r), r, s, t \in [0, T], s \leq t .

The divergence operator

δ

is the adjoint of the derivative operator. If a random variable

u \in L^{2} (Ω; L^{2} ([0, T]; R^{2}))

belongs to

dom (δ)

, the domain of the divergence operator, then

δ (u)

is defined by the duality relationship (also called integration by parts)

\begin{matrix} E [Y δ (u)] = E [\int_{0}^{T} ⟨ D_{s} Y, u_{s} ⟩ d s] for all Y \in D^{1, 2} . \end{matrix}

(8)

If u is adapted to the canonical filtration generated by

(W, B)

and satisfies

E \int_{0}^{T} {| u_{t} |}^{2} d t < \infty

, then

u \in dom (δ)

and

δ (u)

coincides with the Itō integral

\int_{0}^{T} u_{1} (s) d W_{s} + \int_{0}^{T} u_{2} (s) d B_{s}

. For the Malliavin regularity of the CIR process, the following is well known. See, e.g., Proposition 4.5 and Theorem 4.6 in Altmayer (2015) or Proposition 4.1 in Alos and Ewald (2008).

Lemma 3.

Let

t \in [0, T]

and

\frac{2 κ θ}{σ^{2}} > 1

. Then, we have

\sqrt{V_{t}} \in D^{1, \infty}

and

V_{t} \in D^{1, \infty}

with

D_{r} (\sqrt{V_{t}}) = \frac{σ}{2} exp (\int_{r}^{t} ((\frac{σ^{2}}{8} - \frac{κ θ}{2}) \frac{1}{V_{u}} - \frac{κ}{2}) d u) 1_{[0, t]} (r), r \in [0, T] .

In Altmayer and Neuenkirch (2015), this result and the integration by parts formula were used to establish

E (g (X_{T})) = \frac{1}{T \sqrt{1 - ρ^{2}}} \cdot E (G (X_{T}) \cdot \int_{0}^{T} \frac{1}{\sqrt{V_{t}}} d B_{t}),

under the assumption

\frac{2 κ θ}{σ^{2}} > 1

with

G : R \to R

differentiable and

g = G^{'}

bounded, see Proposition 4.1 in Altmayer and Neuenkirch (2015). Indeed, using

u_{t} = 1 / \sqrt{V_{t}}

and

E \int_{0}^{T} {| u_{t} |}^{2} d t < \infty

,

D_{r}^{B} X_{t} = \sqrt{1 - ρ^{2}} \sqrt{V_{r}} 1_{[0, t]} (r)

and the chain rule, i.e.,

D_{r}^{B} G (X_{T}) = g (X_{T}) D_{r}^{B} X_{T},

we have

\begin{matrix} E (G (X_{T}) \cdot \int_{0}^{T} \frac{1}{\sqrt{V_{t}}} d B_{t}) & = E (\int_{0}^{T} g (X_{T}) \cdot D_{t}^{B} X_{T} \cdot \frac{1}{\sqrt{V_{t}}} d t) \\ = E (\int_{0}^{T} g (X_{T}) \sqrt{1 - ρ^{2}} \sqrt{V_{t}} \cdot \frac{1}{\sqrt{V_{t}}} d t) = T \sqrt{1 - ρ^{2}} E [g (X_{T})], \end{matrix}

where the first equality is due to the integration by parts formula.
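The mechanism behind this identity can be seen in a toy case. The sketch below assumes the degenerate situation V ≡ 1, ρ = 0 and X_T = B_T (this is not the Heston model), in which the formula reduces to E[g(B_T)] = (1/T) E[G(B_T) B_T]. The choices G = sin, g = cos, the value of T, and the helper name `gaussian_expectation` are illustrative assumptions; both sides are evaluated by Gauss-Hermite quadrature.

```python
import numpy as np
from numpy.polynomial.hermite_e import hermegauss

def gaussian_expectation(func, variance, n_nodes=80):
    """E[func(Z)] for Z ~ N(0, variance) via Gauss-Hermite quadrature."""
    nodes, weights = hermegauss(n_nodes)   # quadrature for weight exp(-x^2 / 2)
    z = np.sqrt(variance) * nodes
    return np.sum(weights * func(z)) / np.sqrt(2.0 * np.pi)

T = 1.5
G = np.sin        # antiderivative of the payoff
g = np.cos        # payoff, g = G'

lhs = gaussian_expectation(g, T)                       # E[g(B_T)]
rhs = gaussian_expectation(lambda b: G(b) * b, T) / T  # (1/T) E[G(B_T) B_T]
```

In this toy case both sides equal exp(-T/2), so the derivative of G has been traded for the weight B_T / T, which is the role played by the stochastic integral of 1/\sqrt{V_t} in the formula above.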

In Lemmas 5 and 9, we will establish discrete counterparts for this integration by parts result, i.e., on the level of the approximation schemes. In this context, we will also need the Malliavin differentiability of

\int_{s}^{t} \sqrt{V_{u}} d W_{u}

. Since

\begin{matrix} \int_{s}^{t} \sqrt{V_{u}} d W_{u} = \frac{1}{σ} (V_{t} - V_{s} - κ θ (t - s) + κ \int_{s}^{t} V_{u} d u), \end{matrix}

we obtain

\begin{matrix} D_{r}^{W} (\int_{s}^{t} \sqrt{V_{u}} d W_{u}) & = \frac{1}{σ} (D_{r}^{W} (V_{t} - V_{s}) + κ \int_{s}^{t} D_{r}^{W} V_{u} d u), \\ D_{r}^{B} (\int_{s}^{t} \sqrt{V_{u}} d W_{u}) & = 0, \end{matrix}

by exchanging the Riemann integral and the Malliavin derivative (via a standard approximation argument for the Riemann integral, Lemma 3 and Lemma 1.2.3 in Nualart (1995)) and the independence of

(V, W)

and B. Thus, we can conclude that

\begin{matrix} \int_{s}^{t} \sqrt{V_{u}} d W_{u} \in D^{1, \infty}, 0 \leq s < t \leq T . \end{matrix}

(9)

3.4. Properties of the Euler Discretization

Recall that the Euler discretization of the price process is given by

\begin{matrix} x_{k + 1} = x_{k} + (\frac{ρ κ}{σ} - \frac{1}{2}) v_{k} (t_{k + 1} - t_{k}) + \sqrt{1 - ρ^{2}} \sqrt{v_{k}} Δ_{k} B \end{matrix}

with

Δ_{k} B = B_{t_{k + 1}} - B_{t_{k}}

. We extend this discretization in every interval

[t_{n}, t_{n + 1}]

as the following Itō process:

\begin{matrix} {\hat{x}}_{t} & = x_{n (t)} + (\frac{ρ κ}{σ} - \frac{1}{2}) \int_{η (t)}^{t} v_{n (t)} d s + \sqrt{1 - ρ^{2}} \int_{η (t)}^{t} \sqrt{v_{n (t)}} d B_{s} . \end{matrix}

Here, we have set

n (t) : = max \{n \in \{0, \dots, N\} : t_{n} \leq t\}

,

η (t) : = t_{n (t)}

and

v_{k} = V_{t_{k}}

.
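The recursion above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not a reference implementation: the variance values v_k = V_{t_k} are sampled exactly via the noncentral chi-square transition law of the CIR process, the increments of B are drawn independently of V, and the names `cir_exact_path`, `euler_semi_exact`, and all parameter values are hypothetical.

```python
import numpy as np

def cir_exact_path(v0, times, kappa, theta, sigma, n_paths, rng):
    """Exact CIR samples on the grid `times`; shape (len(times), n_paths)."""
    v = np.empty((len(times), n_paths))
    v[0] = v0
    df = 4.0 * kappa * theta / sigma**2
    for k in range(len(times) - 1):
        dt = times[k + 1] - times[k]
        c = sigma**2 * (1.0 - np.exp(-kappa * dt)) / (4.0 * kappa)
        nonc = v[k] * np.exp(-kappa * dt) / c
        v[k + 1] = c * rng.noncentral_chisquare(df, nonc)
    return v

def euler_semi_exact(x0, v, times, rho, kappa, sigma, rng):
    """x_{k+1} = x_k + (rho*kappa/sigma - 1/2) v_k dt + sqrt(1 - rho^2) sqrt(v_k) dB_k."""
    x = np.full(v.shape[1], float(x0))
    drift = rho * kappa / sigma - 0.5
    for k in range(len(times) - 1):
        dt = times[k + 1] - times[k]
        # B is independent of (V, W), so its increments are drawn separately.
        dB = rng.normal(0.0, np.sqrt(dt), size=v.shape[1])
        x = x + drift * v[k] * dt + np.sqrt(1.0 - rho**2) * np.sqrt(v[k]) * dB
    return x

# Hypothetical parameters; v0 = theta keeps E[v_k] = theta for all k.
rng = np.random.default_rng(42)
kappa, theta, sigma, rho, T, N = 2.0, 0.04, 0.3, -0.7, 1.0, 20
times = np.linspace(0.0, T, N + 1)
v = cir_exact_path(theta, times, kappa, theta, sigma, 100_000, rng)
x_N = euler_semi_exact(0.0, v, times, rho, kappa, sigma, rng)
```

With v_0 = θ, the sample mean of `x_N` should be close to (ρκ/σ − 1/2) θ T, since the stochastic integral term has mean zero.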

We have the following result on the Malliavin regularity of the Euler discretization:

Lemma 4.

Let

t \in [0, T]

and

\frac{2 κ θ}{σ^{2}} > 1

. Then,

{\hat{x}}_{t} \in D^{1, \infty}

, and we have

\begin{matrix} D_{r}^{B} {\hat{x}}_{t} = \sqrt{1 - ρ^{2}} \sqrt{v_{n (r)}} 1_{[0, t]} (r) . \end{matrix}

Proof.

We have

\begin{matrix} {\hat{x}}_{t} = {\hat{x}}_{η (t)} + (\frac{ρ κ}{σ} - \frac{1}{2}) v_{n (t)} (t - η (t)) + \sqrt{1 - ρ^{2}} \sqrt{v_{n (t)}} (B_{t} - B_{η (t)}) \end{matrix}

and

\begin{matrix} {\hat{x}}_{η (t)} = (\frac{ρ κ}{σ} - \frac{1}{2}) \sum_{k = 0}^{n (t) - 1} v_{k} (t_{k + 1} - t_{k}) + \sqrt{1 - ρ^{2}} \sum_{k = 0}^{n (t) - 1} \sqrt{v_{k}} (B_{t_{k + 1}} - B_{t_{k}}) . \end{matrix}

Following the steps of the proof of Lemma 3.5 from Altmayer and Neuenkirch (2017), we then have

{\hat{x}}_{t} \in D^{1, \infty}

exploiting that

\sqrt{V_{t}} \in D^{1, \infty}

and

V_{t} \in D^{1, \infty}

under the assumption

\frac{2 κ θ}{σ^{2}} > 1

. The chain rule from Proposition 1 yields

\begin{matrix} D_{r}^{B} {\hat{x}}_{η (t)} = \sqrt{1 - ρ^{2}} \sum_{k = 0}^{n (t) - 1} \sqrt{v_{k}} 1_{(t_{k}, t_{k + 1}]} (r) \end{matrix}

and

\begin{matrix} D_{r}^{B} {\hat{x}}_{t} = D_{r}^{B} {\hat{x}}_{η (t)} + \sqrt{1 - ρ^{2}} \sqrt{v_{n (t)}} 1_{(t_{n (t)}, t]} (r) . \end{matrix}

□

Note that we write, in the following,

v_{t}

instead of

V_{t}

to unify the notation. With the above result, we can express

E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x x} (t, {\hat{x}}_{t}, v_{t})]

without the second order derivative of u, which will be needed later on.

Lemma 5.

Let

t \in [0, T]

. Under the assumptions of Theorem 3 and

\frac{2 κ θ}{σ^{2}} > 1

, we have

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x x} (t, {\hat{x}}_{t}, v_{t})] = \frac{1}{t \sqrt{1 - ρ^{2}}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x} (t, {\hat{x}}_{t}, v_{t}) \int_{0}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r}] . \end{matrix}

Proof.

To avoid stronger restrictions on the Feller index we will use a localization procedure. So, for

ε > 0

, let

ψ_{ε}

be a function such that

1.: $ψ_{ε} : R \to R$ is continuously differentiable with bounded derivative,
2.: $0 \leq ψ_{ε} (x) \leq 1$ on $[0, \infty)$ ,
3.: $ψ_{ε} (x) = 1$ on $[2 ε, \infty)$ ,
4.: $ψ_{ε} (x) = 0$ on $(- \infty, ε]$ .

Since

(V, W)

and B are independent, the chain rule from Proposition 1 implies

\begin{matrix} D_{r}^{B} (\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} ψ_{ε} (v_{t}) u_{x} (t, {\hat{x}}_{t}, v_{t})) = \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} ψ_{ε} (v_{t}) u_{x x} (t, {\hat{x}}_{t}, v_{t}) D_{r}^{B} {\hat{x}}_{t} \end{matrix}

with

D_{r}^{B} {\hat{x}}_{t} = \sqrt{1 - ρ^{2}} \sqrt{v_{η (r)}} 1_{[0, t]} (r)

. Recall the integration by parts formula from Equation (8), i.e.,

\begin{matrix} E [Y (\int_{0}^{T} u_{1} (s) d W_{s} + \int_{0}^{T} u_{2} (s) d B_{s})] = E [\int_{0}^{T} ⟨ D_{s} Y, u_{s} ⟩ d s], \end{matrix}

where we now choose

\begin{matrix} D_{r} Y & = (\begin{matrix} D_{r}^{W} (\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} ψ_{ε} (v_{t}) u_{x} (t, {\hat{x}}_{t}, v_{t})) \\ D_{r}^{B} (\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} ψ_{ε} (v_{t}) u_{x} (t, {\hat{x}}_{t}, v_{t})) \end{matrix}), u_{r} = (\begin{matrix} 0 \\ \frac{1}{\sqrt{v_{η (r)}}} 1_{[0, t]} (r) \end{matrix}) . \end{matrix}

Before we can apply the integration by parts rule, we need to check whether

\begin{matrix} \int_{0}^{T} E [{|D_{r}^{W} (\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}) ψ_{ε} (v_{t}) u_{x} (t, {\hat{x}}_{t}, v_{t})|}^{2}] d r < \infty, \\ \int_{0}^{T} E [{|\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} ψ_{ε}^{'} (v_{t}) u_{x} (t, {\hat{x}}_{t}, v_{t}) D_{r}^{W} v_{t}|}^{2}] d r < \infty, \\ \int_{0}^{T} E [{|\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} ψ_{ε} (v_{t}) (u_{x x} (t, {\hat{x}}_{t}, v_{t}) D_{r}^{W} {\hat{x}}_{t} + u_{x v} (t, {\hat{x}}_{t}, v_{t}) D_{r}^{W} v_{t})|}^{2}] d r < \infty, \\ \int_{0}^{T} E [{|\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} ψ_{ε} (v_{t}) u_{x x} (t, {\hat{x}}_{t}, v_{t}) D_{r}^{B} {\hat{x}}_{t}|}^{2}] d r < \infty, \end{matrix}

(10)

for

t > 0

. We deduced these terms by using again the chain rule for

D_{r} Y

. Note that the properties of the localizing function and Theorem 3 imply that

ψ_{ε} (v) u_{x} (t, x, v), ψ_{ε}^{'} (v) u_{x} (t, x, v), ψ_{ε} (v) u_{x x} (t, x, v), ψ_{ε} (v) u_{x v} (t, x, v)

are all uniformly bounded in

(t, x, v)

. Hence, Equation (10) holds by Lemma 1, Lemma 3, Equation (9), and Lemma 4.

Since

\int_{0}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r}

is also well-defined by Lemma 1 due to

\frac{2 κ θ}{σ^{2}} > 1

, we obtain now

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} ψ_{ε} (v_{t}) u_{x x} (t, {\hat{x}}_{t}, v_{t})] \\ = \frac{1}{t} E [\int_{0}^{t} (\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} ψ_{ε} (v_{t}) u_{x x} (t, {\hat{x}}_{t}, v_{t})) d r] \\ = \frac{1}{t \sqrt{1 - ρ^{2}}} E [\int_{0}^{t} D_{r}^{B} (\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} ψ_{ε} (v_{t}) u_{x} (t, {\hat{x}}_{t}, v_{t})) \frac{1}{\sqrt{v_{η (r)}}} d r] \\ = \frac{1}{t \sqrt{1 - ρ^{2}}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} ψ_{ε} (v_{t}) u_{x} (t, {\hat{x}}_{t}, v_{t}) \int_{0}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r}] . \end{matrix}

Due to Corollary 1 (i), not only

u_{x}

but also

u_{x x}

is bounded. Since

ψ_{ε} (v_{t}) \to 1

almost surely for

ε \to 0

and

| ψ_{ε} (v_{t}) | \leq 1

for all

ε > 0

, the assertion now follows by dominated convergence, using the Itō isometry and again Lemma 1. □
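A concrete localizing function with the four properties required in the proof of Lemma 5 is easy to write down. The sketch below uses a cubic smoothstep between ε and 2ε; this particular choice and the name `psi` are assumptions made for illustration, and any other function with the same properties would serve equally well.

```python
import numpy as np

def psi(x, eps):
    """C^1 cutoff: 0 on (-inf, eps], 1 on [2*eps, inf), cubic smoothstep in between."""
    s = np.clip((np.asarray(x, dtype=float) - eps) / eps, 0.0, 1.0)
    # 3 s^2 - 2 s^3 has vanishing derivative at s = 0 and s = 1,
    # so the pieces join with a continuous first derivative.
    return s * s * (3.0 - 2.0 * s)

eps = 0.1
values = psi(np.array([-1.0, 0.05, eps, 1.5 * eps, 2.0 * eps, 3.0]), eps)
```

The derivative of this cutoff is 6 s (1 − s) / ε on (ε, 2ε), hence bounded by 3 / (2ε), and ψ_ε(v) → 1 for every v > 0 as ε → 0.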

We also need the following

L^{p}

-convergence result:

Lemma 6.

Let

p \geq 1

. There exists a constant

c > 0

, depending only on

p, T, ρ, κ, θ, σ

and

v_{0}

, such that

sup_{t \in [0, T]} E {| X_{t} - {\hat{x}}_{t} |}^{p} \leq c \cdot {(Δ t)}^{p / 4} .

Proof.

We have

\begin{matrix} X_{t} - {\hat{x}}_{t} & = (\frac{ρ κ}{σ} - \frac{1}{2}) \int_{0}^{t} (v_{u} - v_{η (u)}) d u + \sqrt{1 - ρ^{2}} \int_{0}^{t} (\sqrt{v_{u}} - \sqrt{v_{η (u)}}) d B_{u} . \end{matrix}

Assume without loss of generality that

p \geq 2

. Jensen’s inequality and the Burkholder-Davis-Gundy inequality now imply that there exists a constant

c > 0

, depending only on p, T, the parameters of the CIR process, and

v_{0}

, such that

\begin{matrix} E | X_{t} - {\hat{x}}_{t} |^{p} & \leq c \int_{0}^{t} E | v_{u} - v_{η (u)} |^{p} d u + c \int_{0}^{t} E {| \sqrt{v_{u}} - \sqrt{v_{η (u)}} |}^{p} d u . \end{matrix}

Since

| \sqrt{x} - \sqrt{y} | \leq \sqrt{| x - y |}

for

x, y \geq 0

, the assertion follows from Lemma 1. □

Straightforward calculations also yield the following

L^{p}

-smoothness result for the Euler-type scheme:

Lemma 7.

Let

p \geq 1

. There exists a constant

c > 0

, depending only on

p, T, ρ, κ, θ, σ

, and

v_{0}

, such that

E | {\hat{x}}_{t} - {\hat{x}}_{s} |^{p} \leq c \cdot {| t - s |}^{p / 2}

for all

s, t \in [0, T]

.

3.5. Properties of the Semi-Trapezoidal Rule

Recall that our semi-trapezoidal rule reads as

\begin{matrix} x_{k + 1} = x_{k} & + (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{1}{2} (v_{k + 1} + v_{k}) (t_{k + 1} - t_{k}) + \sqrt{1 - ρ^{2}} \sqrt{v_{k}} Δ_{k} B \\ = x_{k} & + (\frac{ρ κ}{σ} - \frac{1}{2}) v_{k} (t_{k + 1} - t_{k}) + \sqrt{1 - ρ^{2}} \sqrt{v_{k}} Δ_{k} B \\ + (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{1}{2} (v_{k + 1} - v_{k}) (t_{k + 1} - t_{k}) . \end{matrix}

Again, we write the scheme as a time-continuous process:

\begin{matrix} {\hat{x}}_{t} = x_{n (t)} & + (\frac{ρ κ}{σ} - \frac{1}{2}) v_{n (t)} (t - η (t)) + \sqrt{1 - ρ^{2}} \sqrt{v_{n (t)}} (B_{t} - B_{η (t)}) \\ + (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{1}{2} (v_{t} - v_{n (t)}) (t - η (t)) . \end{matrix}

Expanding the last term with Itō’s lemma, we obtain

\begin{matrix} {\hat{x}}_{t} = x_{n (t)} + \int_{η (t)}^{t} a_{s} d s + \int_{η (t)}^{t} b_{s} d B_{s} + \int_{η (t)}^{t} c_{s} d W_{s} \end{matrix}

with

\begin{matrix} a_{t} & : = (\frac{ρ κ}{σ} - \frac{1}{2}) (v_{n (t)} + \frac{1}{2} (t - η (t)) κ (θ - v_{t}) + \frac{1}{2} (v_{t} - v_{n (t)})), \\ b_{t} & : = \sqrt{1 - ρ^{2}} \sqrt{v_{n (t)}}, \\ c_{t} & : = (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{1}{2} (t - η (t)) σ \sqrt{v_{t}} . \end{matrix}

Here, we have set again

n (t) : = max \{n \in \{0, \dots, N\} : t_{n} \leq t\}

,

η (t) : = t_{n (t)}

,

v_{k} = V_{t_{k}}

, and we also write again

v_{t}

, instead of

V_{t}

, to unify the notation.
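The semi-trapezoidal recursion differs from the Euler recursion only by the correction term (ρκ/σ − 1/2) · ½ (v_{k+1} − v_k)(t_{k+1} − t_k). The following minimal sketch makes this explicit; it assumes exact CIR sampling via the noncentral chi-square transition law, and the names `cir_exact_path`, `semi_trapezoidal`, and all parameter values are hypothetical.

```python
import numpy as np

def cir_exact_path(v0, times, kappa, theta, sigma, n_paths, rng):
    """Exact CIR samples on the grid `times`; shape (len(times), n_paths)."""
    v = np.empty((len(times), n_paths))
    v[0] = v0
    df = 4.0 * kappa * theta / sigma**2
    for k in range(len(times) - 1):
        dt = times[k + 1] - times[k]
        c = sigma**2 * (1.0 - np.exp(-kappa * dt)) / (4.0 * kappa)
        v[k + 1] = c * rng.noncentral_chisquare(df, v[k] * np.exp(-kappa * dt) / c)
    return v

def semi_trapezoidal(x0, v, times, dB, rho, kappa, sigma):
    """x_{k+1} = x_k + drift * (v_k + v_{k+1}) / 2 * dt + sqrt(1 - rho^2) sqrt(v_k) dB_k."""
    drift = rho * kappa / sigma - 0.5
    dt = np.diff(times)[:, None]                       # shape (N, 1)
    steps = (drift * 0.5 * (v[:-1] + v[1:]) * dt
             + np.sqrt(1.0 - rho**2) * np.sqrt(v[:-1]) * dB)
    return x0 + steps.sum(axis=0)

# Hypothetical parameters; B-increments are drawn independently of V.
rng = np.random.default_rng(1)
kappa, theta, sigma, rho = 2.0, 0.04, 0.3, -0.7
times = np.linspace(0.0, 1.0, 21)
n_paths = 50_000
v = cir_exact_path(theta, times, kappa, theta, sigma, n_paths, rng)
dB = rng.normal(size=(len(times) - 1, n_paths)) * np.sqrt(np.diff(times))[:, None]
x_trap = semi_trapezoidal(0.0, v, times, dB, rho, kappa, sigma)
```

Passing the increments `dB` explicitly makes it possible to run the Euler recursion on the same randomness and to compare both schemes path by path.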

We need the following result on the Malliavin regularity of the semi-trapezoidal scheme:

Lemma 8.

Let

t \in [0, T]

and

\frac{2 κ θ}{σ^{2}} > 1

. Then, we have

{\hat{x}}_{t} \in D^{1, \infty}

and

\begin{matrix} D_{r}^{B} {\hat{x}}_{t} = \sqrt{1 - ρ^{2}} \sqrt{v_{η (r)}} 1_{[0, t]} (r) . \end{matrix}

Proof.

We already know that

v_{t} \in D^{1, \infty}

and

\sqrt{v_{t}} \in D^{1, \infty}

. We can write

{\hat{x}}_{t}

as

\begin{matrix} {\hat{x}}_{t} = {\hat{x}}_{η (t)} + \frac{1}{2} (\frac{ρ κ}{σ} - \frac{1}{2}) (v_{t} + v_{η (t)}) (t - η (t)) + \sqrt{1 - ρ^{2}} \int_{η (t)}^{t} \sqrt{v_{η (t)}} d B_{s} \end{matrix}

with

\begin{matrix} {\hat{x}}_{η (t)} = \frac{1}{2} (\frac{ρ κ}{σ} - \frac{1}{2}) \sum_{k = 0}^{n (t) - 1} (v_{k + 1} + v_{k}) (t_{k + 1} - t_{k}) + \sqrt{1 - ρ^{2}} \sum_{k = 0}^{n (t) - 1} \sqrt{v_{k}} (B_{t_{k + 1}} - B_{t_{k}}) . \end{matrix}

Following the steps of the proof of Lemma 3.5 from Altmayer and Neuenkirch (2017), we then also have

{\hat{x}}_{t} \in D^{1, \infty}

. The chain rule from Proposition 1 yields

\begin{matrix} D_{r}^{B} {\hat{x}}_{η (t)} = \sqrt{1 - ρ^{2}} \sum_{k = 0}^{n (t) - 1} \sqrt{v_{k}} 1_{(t_{k}, t_{k + 1}]} (r) \end{matrix}

and

\begin{matrix} D_{r}^{B} {\hat{x}}_{t} = D_{r}^{B} {\hat{x}}_{η (t)} + \sqrt{1 - ρ^{2}} \sqrt{v_{n (t)}} 1_{(t_{n (t)}, t]} (r) . \end{matrix}

□

Note that the partial Malliavin derivatives with respect to B of the Euler and the semi-trapezoidal scheme coincide. So, by calculations analogous to those for the Euler scheme, we obtain the following integration by parts result:

Lemma 9.

Let

t \in [0, T]

. Under the assumptions of Theorem 3 and

\frac{2 κ θ}{σ^{2}} > 1

, we have

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x x} (t, {\hat{x}}_{t}, v_{t})] = \frac{1}{t \sqrt{1 - ρ^{2}}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x} (t, {\hat{x}}_{t}, v_{t}) \int_{0}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r}] . \end{matrix}

By similar calculations as for the Euler scheme, we also have:

Lemma 10.

Let

p \geq 1

. There exists a constant

c > 0

, depending only on

p, T, ρ, κ, θ, σ

, and

v_{0}

, such that

sup_{t \in [0, T]} E {| X_{t} - {\hat{x}}_{t} |}^{p} \leq c \cdot {(Δ t)}^{p / 4} .

Lemma 11.

Let

p \geq 1

. There exists a constant

c > 0

, depending only on

p, T, ρ, κ, θ, σ

, and

v_{0}

, such that

E | {\hat{x}}_{t} - {\hat{x}}_{s} |^{p} \leq c \cdot {| t - s |}^{p / 2}

for all

s, t \in [0, T]

.

4. Proof of Theorem 1

We address both schemes and the different assumptions in separate subsections. Constants, which are, in particular, independent of the maximal stepsize

Δ t = max_{k = 1, \dots, N} | t_{k} - t_{k - 1} |,

and depend only on

f, T, ρ, κ, θ, σ

, and

v_{0}, x_{0}

, will be denoted by c, regardless of their value.

4.1. The Euler Scheme: Expanding the Error

Since

E [u (T, x_{N}, v_{N})] = E [f (x_{N}, v_{N})]

and

u (0, x_{0}, v_{0}) = E f (X_{T}, V_{T})

, the weak error is a telescoping sum of local errors:

\begin{matrix} |E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})]| = |\sum_{n = 1}^{N} E [u (t_{n}, x_{n}, v_{n}) - u (t_{n - 1}, x_{n - 1}, v_{n - 1})]| . \end{matrix}

With the Itō formula and the Kolmogorov backward PDE evaluated at

(t, {\hat{x}}_{t}, v_{t})

, we obtain

\begin{matrix} e_{n} : = & E [u (t_{n + 1}, x_{n + 1}, v_{n + 1}) - u (t_{n}, x_{n}, v_{n})] \\ = & \int_{t_{n}}^{t_{n + 1}} E [u_{t} (t, {\hat{x}}_{t}, v_{t}) + (\frac{ρ κ}{σ} - \frac{1}{2}) v_{n (t)} u_{x} (t, {\hat{x}}_{t}, v_{t}) + κ (θ - v_{t}) u_{v} (t, {\hat{x}}_{t}, v_{t}) \\ & + \frac{1}{2} v_{n (t)} (1 - ρ^{2}) u_{x x} (t, {\hat{x}}_{t}, v_{t}) + \frac{1}{2} v_{t} σ^{2} u_{v v} (t, {\hat{x}}_{t}, v_{t})] d t \\ = & \int_{t_{n}}^{t_{n + 1}} E [(\frac{ρ κ}{σ} - \frac{1}{2}) (v_{n (t)} - v_{t}) u_{x} (t, {\hat{x}}_{t}, v_{t}) + \frac{1}{2} (v_{n (t)} - v_{t}) (1 - ρ^{2}) u_{x x} (t, {\hat{x}}_{t}, v_{t})] d t . \end{matrix}

Since

\begin{matrix} v_{n (t)} - v_{t} = - \int_{η (t)}^{t} κ (θ - v_{s}) d s - σ \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}, \end{matrix}

we have

e_{n} = e_{n}^{(1)} + e_{n}^{(2)}

with

\begin{matrix} e_{n}^{(1)} & : = (\frac{ρ κ}{σ} - \frac{1}{2}) \int_{t_{n}}^{t_{n + 1}} E [(- \int_{η (t)}^{t} κ (θ - v_{s}) d s - σ \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}) u_{x} (t, {\hat{x}}_{t}, v_{t})] d t, \\ e_{n}^{(2)} & : = \frac{1}{2} (1 - ρ^{2}) \int_{t_{n}}^{t_{n + 1}} E [(- \int_{η (t)}^{t} κ (θ - v_{s}) d s - σ \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}) u_{x x} (t, {\hat{x}}_{t}, v_{t})] d t . \end{matrix}

By Theorem 3 and Corollary 1, we have that

u_{x}

and

u_{x x}

are bounded. So, Lemma 1 implies that

\int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} κ (θ - v_{s}) d s u_{x} (t, {\hat{x}}_{t}, v_{t})] d t = O ({(Δ t)}^{2})

and

\int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} κ (θ - v_{s}) d s u_{x x} (t, {\hat{x}}_{t}, v_{t})] d t = O ({(Δ t)}^{2}) .

Moreover, with the law of total expectation and the adaptedness of

{\hat{x}}_{η (t)}

and

v_{η (t)}

, we have

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)})] & = E [E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}) | F_{η (t)}]] \\ = E [u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}) E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} | F_{η (t)}]] \\ = 0, \end{matrix}

due to the martingale property of the Itō integral. Therefore, we can write

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x} (t, {\hat{x}}_{t}, v_{t})] = E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} (u_{x} (t, {\hat{x}}_{t}, v_{t}) - u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}))] \end{matrix}

and obtain

\begin{matrix} e_{n}^{(1)} = O ({(Δ t)}^{2}) - (ρ κ - \frac{σ}{2}) \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} (u_{x} (t, {\hat{x}}_{t}, v_{t}) - u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}))] d t . \end{matrix}

In the same way, we have

\begin{matrix} e_{n}^{(2)} & = O ({(Δ t)}^{2}) - \frac{σ}{2} (1 - ρ^{2}) \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} (u_{x x} (t, {\hat{x}}_{t}, v_{t}) - u_{x x} (t, {\hat{x}}_{η (t)}, v_{η (t)}))] d t . \end{matrix}

Summarizing this preliminary part, we have obtained

\begin{matrix} e_{n} = O ({(Δ t)}^{2}) + {\tilde{e}}_{n}^{(1)} + {\tilde{e}}_{n}^{(2)}, \end{matrix}

(11)

where

\begin{matrix} {\tilde{e}}_{n}^{(1)} & = - (ρ κ - \frac{σ}{2}) \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} (u_{x} (t, {\hat{x}}_{t}, v_{t}) - u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}))] d t, \end{matrix}

(12)

\begin{matrix} {\tilde{e}}_{n}^{(2)} & = - \frac{σ}{2} (1 - ρ^{2}) \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} (u_{x x} (t, {\hat{x}}_{t}, v_{t}) - u_{x x} (t, {\hat{x}}_{η (t)}, v_{η (t)}))] d t . \end{matrix}

(13)

4.2. The Euler Scheme: Case (i)

So, it remains to analyze

{\tilde{e}}_{n}^{(1)}

and

{\tilde{e}}_{n}^{(2)}

under the regularity of Theorem 3 (i). We start with

{\tilde{e}}_{n}^{(1)}

. The mean value theorem and the bound

| u_{x x} (t, x, v) | + | u_{x v} (t, x, v) | \leq c (1 + \frac{1}{v}), t \geq 0, x \in R, v > 0,

give

\begin{matrix} | u_{x} (t, {\hat{x}}_{t}, v_{t}) - u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}) | \\ \leq | {\hat{x}}_{t} - {\hat{x}}_{η (t)} | \int_{0}^{1} | u_{x x} (t, ξ {\hat{x}}_{t} + (1 - ξ) {\hat{x}}_{η (t)}, ξ v_{t} + (1 - ξ) v_{η (t)}) | d ξ \\ + | v_{t} - v_{η (t)} | \int_{0}^{1} | u_{x v} (t, ξ {\hat{x}}_{t} + (1 - ξ) {\hat{x}}_{η (t)}, ξ v_{t} + (1 - ξ) v_{η (t)}) | d ξ \\ \leq c (| {\hat{x}}_{t} - {\hat{x}}_{η (t)} | + | v_{t} - v_{η (t)} |) | (1 + \frac{1}{v_{t}} + \frac{1}{v_{η (t)}}) |, \end{matrix}

where we used

\begin{matrix} \frac{1}{ξ v_{1} + (1 - ξ) v_{2}} \leq \frac{1}{v_{1}} + \frac{1}{v_{2}}, v_{1}, v_{2} > 0 . \end{matrix}

With the Minkowski inequality and Lemma 1, it holds that

\begin{matrix} E {[{(\frac{1}{v_{t}} + \frac{1}{v_{η (t)}})}^{1 + δ}]}^{1 / (1 + δ)} \leq E {[{(\frac{1}{v_{t}})}^{1 + δ}]}^{1 / (1 + δ)} + E {[{(\frac{1}{v_{η (t)}})}^{1 + δ}]}^{1 / (1 + δ)} \leq c \end{matrix}

for

δ \in (0, \frac{2 κ θ}{σ^{2}} - 1)

, where c is in particular independent of

t \in [0, T]

. Finally, with the Cauchy-Schwarz inequality, the Minkowski inequality, the Burkholder-Davis-Gundy inequality, Lemma 1, and Lemma 7, we obtain for all

p \geq 1

that

\begin{matrix} E {[| \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} |^{p} {(| {\hat{x}}_{t} - {\hat{x}}_{η (t)} | + | v_{t} - v_{η (t)} |)}^{p}]}^{1 / p} \\ \leq E {[| \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} |^{2 p}]}^{1 / 2 p} E {[{(| {\hat{x}}_{t} - {\hat{x}}_{η (t)} | + | v_{t} - v_{η (t)} |)}^{2 p}]}^{1 / 2 p} \\ \leq c E {[| \int_{η (t)}^{t} v_{s} d s |^{p}]}^{1 / 2 p} (E {[| {\hat{x}}_{t} - {\hat{x}}_{η (t)} |^{2 p}]}^{1 / 2 p} + E {[| v_{t} - v_{η (t)} |^{2 p}]}^{1 / 2 p}) \\ \leq c {({(t - η (t))}^{p})}^{1 / 2 p} {({(t - η (t))}^{p})}^{1 / 2 p} \\ \leq c Δ t, \end{matrix}

where c is in particular independent of

t \in [0, T]

. The Hölder inequality then gives

\begin{matrix} {\tilde{e}}_{n}^{(1)} = O ({(Δ t)}^{2}) . \end{matrix}

For

{\tilde{e}}_{n}^{(2)}

, we will use the integration by parts rule to get rid of the second order derivative. Otherwise, direct estimation would only lead to weak order

1 / 2

. First, recall that, by Lemma 5, we have

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x x} (t, {\hat{x}}_{t}, v_{t})] = \frac{1}{t \sqrt{1 - ρ^{2}}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x} (t, {\hat{x}}_{t}, v_{t}) \int_{0}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r}] . \end{matrix}

Moreover, note that we also have

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}) \int_{0}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r}] = 0 \end{matrix}

(14)

and recall that

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x x} (t, {\hat{x}}_{η (t)}, v_{η (t)})] = 0 . \end{matrix}

Thus, we can write

\begin{matrix} {\tilde{e}}_{n}^{(2)} & = - \frac{σ \sqrt{1 - ρ^{2}}}{2} \int_{t_{n}}^{t_{n + 1}} \frac{1}{t} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} (u_{x} (t, {\hat{x}}_{t}, v_{t}) - u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)})) I_{t}^{B}] d t \end{matrix}

(15)

with

I_{t}^{B} = \int_{0}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r}

.

Before we analyze this expression further, it remains to show (14). Using the law of total expectation, the adaptedness of

{\hat{x}}_{η (t)}

,

v_{η (t)}

and of the Itō integrals, we have

\begin{matrix} E & [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}) \int_{0}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r}] \\ = & E [E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}) \int_{0}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r} | F_{η (t)}]] \\ = & E [u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}) E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} \int_{0}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r} | F_{η (t)}]] \\ = & E [u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}) (E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} \int_{0}^{η (t)} \frac{1}{\sqrt{v_{η (r)}}} d B_{r} | F_{η (t)}] \\ + E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} \int_{η (t)}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r} | F_{η (t)}])] \\ = E [u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}) \int_{0}^{η (t)} \frac{1}{\sqrt{v_{η (r)}}} d B_{r} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} | F_{η (t)}]] \\ + E [u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}) E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} \int_{η (t)}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r} | F_{η (t)}]] . \end{matrix}

Since

E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} | F_{η (t)}] = 0 = E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} \int_{η (t)}^{t} \frac{1}{\sqrt{v_{η (r)}}} d B_{r} | F_{η (t)}],

due to the properties of the Itō integral, Equation (14) follows.

Using the mean value theorem in Equation (15), we obtain

\begin{matrix} | u_{x} (t, {\hat{x}}_{t}, v_{t}) - u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}) | \leq c (| {\hat{x}}_{t} - {\hat{x}}_{η (t)} | + | v_{t} - v_{η (t)} |) | (1 + \frac{1}{v_{t}} + \frac{1}{v_{η (t)}}) | . \end{matrix}

Therefore,

\begin{matrix} | {\tilde{e}}_{n}^{(2)} | \leq & c \int_{t_{n}}^{t_{n + 1}} \frac{1}{\sqrt{t}} E [| \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} | (| {\hat{x}}_{t} - {\hat{x}}_{η (t)} | + | v_{t} - v_{η (t)} |) Θ_{t}] d t \end{matrix}

with

\begin{matrix} Θ_{t} = \frac{| I_{t}^{B} |}{\sqrt{t}} (1 + \frac{1}{v_{t}} + \frac{1}{v_{η (t)}}) . \end{matrix}

By Lemmas 1 and 2, it holds that

\begin{matrix} sup_{t \in [0, T]} E {[| \frac{I_{t}^{B}}{\sqrt{t}} |^{q}]}^{1 / q} & < \infty, \\ sup_{t \in [0, T]} E {[{(1 + \frac{1}{v_{t}} + \frac{1}{v_{η (t)}})}^{p}]}^{1 / p} & < \infty, \end{matrix}

for

q \in [2, \frac{4 κ θ}{σ^{2}})

and

p \in [0, \frac{2 κ θ}{σ^{2}})

. So, the Hölder inequality leads to

\begin{matrix} E [Θ_{t}^{1 + δ}] & = E [{(\frac{| I_{t}^{B} |}{\sqrt{t}})}^{1 + δ} {(1 + \frac{1}{v_{t}} + \frac{1}{v_{η (t)}})}^{1 + δ}] \\ & \leq E {[{({(\frac{| I_{t}^{B} |}{\sqrt{t}})}^{1 + δ})}^{3}]}^{1 / 3} E {[{({(1 + \frac{1}{v_{t}} + \frac{1}{v_{η (t)}})}^{1 + δ})}^{3 / 2}]}^{2 / 3} \\ & = E {[{(\frac{| I_{t}^{B} |}{\sqrt{t}})}^{3 (1 + δ)}]}^{1 / 3} E {[{(1 + \frac{1}{v_{t}} + \frac{1}{v_{η (t)}})}^{3 (1 + δ) / 2}]}^{2 / 3} \leq c \end{matrix}

with

δ \in (0, \frac{4 κ θ}{3 σ^{2}} - 1)

and c in particular independent of

t \in [0, T]

.

With the Cauchy-Schwarz, Burkholder-Davis-Gundy, and Minkowski inequalities for

p \geq 1

, it follows that

\begin{matrix} \frac{1}{\sqrt{t}} E {[| \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} |^{p} {(| {\hat{x}}_{t} - {\hat{x}}_{η (t)} | + | v_{t} - v_{η (t)} |)}^{p}]}^{1 / p} \\ \leq \frac{1}{\sqrt{t}} E {[| \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} |^{2 p}]}^{1 / 2 p} E {[{(| {\hat{x}}_{t} - {\hat{x}}_{η (t)} | + | v_{t} - v_{η (t)} |)}^{2 p}]}^{1 / 2 p} \\ \leq \frac{c}{\sqrt{t}} E {[| \int_{η (t)}^{t} v_{s} d s |^{p}]}^{1 / 2 p} {({(t - η (t))}^{p})}^{1 / 2 p} \\ \leq \frac{c}{\sqrt{t}} {({(t - η (t))}^{p})}^{1 / 2 p} {({(t - η (t))}^{p})}^{1 / 2 p} \\ = \frac{c}{\sqrt{t}} (t - η (t)) . \end{matrix}

With the Hölder inequality, we now have

\begin{matrix} | {\tilde{e}}_{n}^{(2)} | & \leq c \int_{t_{n}}^{t_{n + 1}} \frac{1}{\sqrt{t}} (t - η (t)) d t . \end{matrix}

Therefore,

\begin{matrix} | e_{n} | \leq c {(Δ t)}^{2} + c \int_{t_{n}}^{t_{n + 1}} \frac{1}{\sqrt{t}} (t - η (t)) d t, \end{matrix}

and, since

[0, T] ∋ t \to \frac{1}{\sqrt{t}} \in (0, \infty)

is Riemann-integrable, we obtain

\begin{matrix} |E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})]| = |\sum_{n = 0}^{N - 1} e_{n}| \leq c Δ t, \end{matrix}

which concludes the proof of this part.
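The last step uses that t − η(t) ≤ Δt, so that the sum of the local integrals is at most Δt ∫_0^T t^{-1/2} dt = 2 \sqrt{T} Δt. The sketch below checks this on uniform grids by evaluating each local integral in closed form; the function name `local_bound_sum` and the grid sizes are illustrative assumptions.

```python
import numpy as np

def local_bound_sum(times):
    """Sum over n of the exact integral of (t - t_n) / sqrt(t) over [t_n, t_{n+1}]."""
    a, b = times[:-1], times[1:]
    # An antiderivative of (t - a) / sqrt(t) is (2/3) t^{3/2} - 2 a t^{1/2}.
    return np.sum((2.0 / 3.0) * (b**1.5 - a**1.5) - 2.0 * a * (np.sqrt(b) - np.sqrt(a)))

T = 1.0
results = {}
for N in (10, 100, 1000):
    times = np.linspace(0.0, T, N + 1)
    dt = T / N
    # Store the sum together with the upper bound 2 * sqrt(T) * dt.
    results[N] = (local_bound_sum(times), 2.0 * np.sqrt(T) * dt)
```

In each case the computed sum stays below the bound and decreases linearly in Δt, in line with the weak order one obtained above despite the singular weight t^{-1/2} near zero.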

4.3. The Euler Scheme: Case (ii)

Starting from Equation (11) and using now the bounds of Corollary 1 for

u_{x x}

,

u_{x v}

,

u_{x x x}

, and

u_{x x v}

, the assertion follows from a direct application of the mean value theorem to (12) and (13), together with the Lemmata 1 and 7.

4.4. Semi-Trapezoidal Rule: Expanding the Error

We look again at the telescoping sum of local errors

\begin{matrix} |E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})]| = |\sum_{n = 1}^{N} E [u (t_{n}, x_{n}, v_{n}) - u (t_{n - 1}, x_{n - 1}, v_{n - 1})]| . \end{matrix}

Recall that

\begin{matrix} {\hat{x}}_{t} = x_{η (t)} + \int_{η (t)}^{t} a_{s} d s + \int_{η (t)}^{t} b_{s} d B_{s} + \int_{η (t)}^{t} c_{s} d W_{s} \end{matrix}

with

\begin{matrix} a_{t} & : = (\frac{ρ κ}{σ} - \frac{1}{2}) (v_{η (t)} + \frac{1}{2} (t - η (t)) κ (θ - v_{t}) + \frac{1}{2} (v_{t} - v_{η (t)})), \\ b_{t} & : = \sqrt{1 - ρ^{2}} \sqrt{v_{η (t)}}, \\ c_{t} & : = (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{1}{2} (t - η (t)) σ \sqrt{v_{t}} . \end{matrix}

With the Itō formula and the Kolmogorov backward PDE evaluated at

(t, {\hat{x}}_{t}, v_{t})

, we have

\begin{matrix} e_{n} : = & E [u (t_{n + 1}, x_{n + 1}, v_{n + 1}) - u (t_{n}, x_{n}, v_{n})] \\ = & \int_{t_{n}}^{t_{n + 1}} E [u_{t} (t, {\hat{x}}_{t}, v_{t}) + a_{t} u_{x} (t, {\hat{x}}_{t}, v_{t}) + κ (θ - v_{t}) u_{v} (t, {\hat{x}}_{t}, v_{t}) \\ + \frac{1}{2} (b_{t}^{2} + c_{t}^{2}) u_{x x} (t, {\hat{x}}_{t}, v_{t}) + c_{t} σ \sqrt{v_{t}} u_{x v} (t, {\hat{x}}_{t}, v_{t}) + \frac{1}{2} v_{t} σ^{2} u_{v v} (t, {\hat{x}}_{t}, v_{t})] d t \\ = & \int_{t_{n}}^{t_{n + 1}} E [(\frac{ρ κ}{σ} - \frac{1}{2}) (v_{η (t)} + \frac{1}{2} (t - η (t)) κ (θ - v_{t}) + \frac{1}{2} (v_{t} - v_{η (t)}) - v_{t}) u_{x} (t, {\hat{x}}_{t}, v_{t}) \\ + \frac{1}{2} ((1 - ρ^{2}) v_{η (t)} + {(\frac{ρ κ}{σ} - \frac{1}{2})}^{2} \frac{1}{4} {(t - η (t))}^{2} σ^{2} v_{t} - (1 - ρ^{2}) v_{t}) u_{x x} (t, {\hat{x}}_{t}, v_{t}) \\ + (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{1}{2} (t - η (t)) σ^{2} v_{t} u_{x v} (t, {\hat{x}}_{t}, v_{t})] d t \\ = & \int_{t_{n}}^{t_{n + 1}} E [(\frac{ρ κ}{σ} - \frac{1}{2}) (\frac{1}{2} (v_{η (t)} - v_{t}) + \frac{1}{2} (t - η (t)) κ (θ - v_{t})) u_{x} (t, {\hat{x}}_{t}, v_{t}) \\ + \frac{1}{2} ((1 - ρ^{2}) (v_{η (t)} - v_{t}) + {(\frac{ρ κ}{σ} - \frac{1}{2})}^{2} \frac{1}{4} {(t - η (t))}^{2} σ^{2} v_{t}) u_{x x} (t, {\hat{x}}_{t}, v_{t}) \\ + (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{1}{2} (t - η (t)) σ^{2} v_{t} u_{x v} (t, {\hat{x}}_{t}, v_{t})] d t . \end{matrix}

Using again

\begin{matrix} v_{η (t)} - v_{t} = - \int_{η (t)}^{t} κ (θ - v_{s}) d s - σ \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}, \end{matrix}

we obtain

e_{n} = e_{n}^{(1)} + e_{n}^{(2)} + e_{n}^{(3)}

with

\begin{matrix} e_{n}^{(1)} : = & (\frac{ρ κ}{σ} - \frac{1}{2}) \int_{t_{n}}^{t_{n + 1}} E [\frac{1}{2} (\int_{η (t)}^{t} κ (v_{s} - v_{t}) d s - σ \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}) u_{x} (t, {\hat{x}}_{t}, v_{t})] d t, \\ e_{n}^{(2)} : = & \frac{1}{2} (1 - ρ^{2}) \int_{t_{n}}^{t_{n + 1}} E [(- \int_{η (t)}^{t} κ (θ - v_{s}) d s - σ \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}) u_{x x} (t, {\hat{x}}_{t}, v_{t})] d t \\ + {(\frac{ρ κ}{σ} - \frac{1}{2})}^{2} \frac{σ^{2}}{8} \int_{t_{n}}^{t_{n + 1}} {(t - η (t))}^{2} E [v_{t} u_{x x} (t, {\hat{x}}_{t}, v_{t})] d t, \\ e_{n}^{(3)} : = & (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{σ^{2}}{2} \int_{t_{n}}^{t_{n + 1}} (t - η (t)) E [v_{t} u_{x v} (t, {\hat{x}}_{t}, v_{t})] d t . \end{matrix}

Since

v u_{x v} (t, x, v) \leq c (1 + v)

by Theorem 3, we have

e_{n}^{(3)} = O ({(Δ t)}^{2})

using Lemma 1. Moreover, since

u_{x}

and

u_{x x}

are bounded by Theorem 3 and Corollary 1 (i), we obtain, similarly to the calculations for the Euler scheme, that

\begin{matrix} e_{n}^{(1)} & = O ({(Δ t)}^{2}) - \frac{1}{2} (ρ κ - \frac{σ}{2}) \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x} (t, {\hat{x}}_{t}, v_{t})] d t \\ e_{n}^{(2)} & = O ({(Δ t)}^{2}) - \frac{σ}{2} (1 - ρ^{2}) \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x x} (t, {\hat{x}}_{t}, v_{t})] d t \end{matrix}

and

\begin{matrix} e_{n} = O ({(Δ t)}^{2}) + {\tilde{e}}_{n}^{(1)} + {\tilde{e}}_{n}^{(2)}, \end{matrix}

(16)

with

\begin{matrix} {\tilde{e}}_{n}^{(1)} & = - \frac{1}{2} (ρ κ - \frac{σ}{2}) \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} (u_{x} (t, {\hat{x}}_{t}, v_{t}) - u_{x} (t, {\hat{x}}_{η (t)}, v_{η (t)}))] d t, \end{matrix}

(17)

\begin{matrix} {\tilde{e}}_{n}^{(2)} & = - \frac{σ}{2} (1 - ρ^{2}) \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} (u_{x x} (t, {\hat{x}}_{t}, v_{t}) - u_{x x} (t, {\hat{x}}_{η (t)}, v_{η (t)}))] d t . \end{matrix}

(18)

4.5. Semi-Trapezoidal Rule: Case (i)

Since Lemma 8 gives

{\hat{x}}_{t} \in D^{1, \infty}

and

\begin{matrix} D_{r}^{B} {\hat{x}}_{t} = \sqrt{1 - ρ^{2}} \sqrt{v_{η (r)}} 1_{[0, t]} (r), \end{matrix}

we can proceed here in the same way as for the Euler scheme by using the Lemmata 9 and 11.

4.6. Semi-Trapezoidal Rule: Case (ii)

Starting from (16), the assertion of this case follows from a direct application of the mean value theorem to (17) and (18) using the regularity results from Corollary 1, together with the Lemmata 1 and 11.

5. Proof of Theorem 2

Now, we derive the error expansion under the regularity of Theorem 4 with

q = 4

, i.e., we have

u \in C_{p o l, T}^{4} (R \times R_{+}; R)

.

5.1. Euler Scheme: Preliminaries

By the Lemmata 1, 4, and 7, we have that

\sup_{t \in [0, T]} E | v_{t} |^{p} + \sup_{t \in [0, T]} E {| {\hat{x}}_{t} |}^{p} < \infty

and

E | v_{t} - v_{s} |^{p} + E | {\hat{x}}_{t} - {\hat{x}}_{s} |^{p} \leq c \cdot {| t - s |}^{p / 2}, s, t \in [0, T],

for all

p \geq 1

. Using the Burkholder-Davis-Gundy, Hölder, and Minkowski inequalities, we also have

\sup_{t \in [0, T]} E {| X_{t} |}^{p} < \infty

and

E | X_{t} - X_{s} |^{p} \leq c \cdot {| t - s |}^{p / 2}, s, t \in [0, T],

for all

p \geq 1

. We will use this in the following at several places without explicitly mentioning it.

Recall that we obtained

\begin{matrix} e_{n}^{(1)} & : = \int_{t_{n}}^{t_{n + 1}} E [(\frac{ρ κ}{σ} - \frac{1}{2}) (- \int_{η (t)}^{t} κ (θ - v_{s}) d s - σ \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}) u_{x} (t, {\hat{x}}_{t}, v_{t})] d t, \end{matrix}

(19)

\begin{matrix} e_{n}^{(2)} & : = \int_{t_{n}}^{t_{n + 1}} E [\frac{1 - ρ^{2}}{2} (- \int_{η (t)}^{t} κ (θ - v_{s}) d s - σ \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}) u_{x x} (t, {\hat{x}}_{t}, v_{t})] d t, \end{matrix}

(20)

in Section 4.1.

If higher derivatives of u are available, then we can analyze

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x} (t, {\hat{x}}_{t}, v_{t})] \end{matrix}

and

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x x} (t, {\hat{x}}_{t}, v_{t})] \end{matrix}

via another application of Itō’s lemma. So, let

k : [0, T] \times R \times [0, \infty) \to R

be a

C^{1, 2}

-function that fulfills the backward PDE (7). In particular, the partial derivatives of u up to order two are such functions. Itō’s formula and the Kolmogorov backward PDE (7) now give

\begin{matrix} k (t, {\hat{x}}_{t}, v_{t}) & = k (η (t), {\hat{x}}_{η (t)}, v_{η (t)}) \\ + \int_{η (t)}^{t} [(\frac{ρ κ}{σ} - \frac{1}{2}) k_{x} (s, {\hat{x}}_{s}, v_{s}) + \frac{(1 - ρ^{2})}{2} k_{x x} (s, {\hat{x}}_{s}, v_{s})] (v_{η (s)} - v_{s}) d s \\ + \int_{η (t)}^{t} k_{x} (s, {\hat{x}}_{s}, v_{s}) \sqrt{1 - ρ^{2}} \sqrt{v_{η (s)}} d B_{s} + \int_{η (t)}^{t} k_{v} (s, {\hat{x}}_{s}, v_{s}) σ \sqrt{v_{s}} d W_{s} . \end{matrix}

If

k_{x}

and

k_{v}

have polynomial growth, then an application of the Itō isometry and the martingale property of the Itō integral yield

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} k (t, {\hat{x}}_{t}, v_{t})] & = E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} k (η (t), {\hat{x}}_{η (t)}, v_{η (t)})] \\ + E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} \int_{η (t)}^{t} K (s, {\hat{x}}_{s}, v_{s}) (v_{η (s)} - v_{s}) d s] \\ + E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} \int_{η (t)}^{t} k_{x} (s, {\hat{x}}_{s}, v_{s}) \sqrt{1 - ρ^{2}} \sqrt{v_{η (s)}} d B_{s}] \\ + E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} \int_{η (t)}^{t} k_{v} (s, {\hat{x}}_{s}, v_{s}) σ \sqrt{v_{s}} d W_{s}] \\ = E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} \int_{η (t)}^{t} \int_{η (s)}^{s} K (s, {\hat{x}}_{s}, v_{s}) κ (v_{u} - θ) d u d s] \\ - σ E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} \int_{η (t)}^{t} K (s, {\hat{x}}_{s}, v_{s}) \int_{η (s)}^{s} \sqrt{v_{u}} d W_{u} d s] \\ + σ E [\int_{η (t)}^{t} v_{s} k_{v} (s, {\hat{x}}_{s}, v_{s}) d s], \end{matrix}

where

\begin{matrix} K (s, {\hat{x}}_{s}, v_{s}) = (\frac{ρ κ}{σ} - \frac{1}{2}) k_{x} (s, {\hat{x}}_{s}, v_{s}) + \frac{(1 - ρ^{2})}{2} k_{x x} (s, {\hat{x}}_{s}, v_{s}) . \end{matrix}

If

k_{x}

and

k_{x x}

have polynomial growth, then an application of Hölder’s inequality and the Itō isometry yield

E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} \int_{η (t)}^{t} \int_{η (s)}^{s} K (s, {\hat{x}}_{s}, v_{s}) κ (v_{u} - θ) d u d s] = O ({(Δ t)}^{5 / 2}),

and so it follows that

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} k (t, {\hat{x}}_{t}, v_{t})] & = σ E [\int_{η (t)}^{t} v_{s} k_{v} (s, {\hat{x}}_{s}, v_{s}) d s] \\ - σ E [\int_{η (t)}^{t} K (s, {\hat{x}}_{s}, v_{s}) \int_{η (t)}^{t} \sqrt{v_{u}} d W_{u} \int_{η (s)}^{s} \sqrt{v_{u}} d W_{u} d s] \\ + O ({(Δ t)}^{5 / 2}) . \end{matrix}

Since we have

\begin{matrix} E [\int_{η (t)}^{t} K (s, {\hat{x}}_{s}, v_{s}) \int_{η (t)}^{t} \sqrt{v_{u}} d W_{u} \int_{η (s)}^{s} \sqrt{v_{u}} d W_{u} d s] \\ = E [\int_{η (t)}^{t} K (s, {\hat{x}}_{s}, v_{s}) E [\int_{η (t)}^{t} \sqrt{v_{u}} d W_{u} \int_{η (s)}^{s} \sqrt{v_{u}} d W_{u} | F_{s}] d s] \\ = E [\int_{η (t)}^{t} K (s, {\hat{x}}_{s}, v_{s}) {(\int_{η (s)}^{s} \sqrt{v_{u}} d W_{u})}^{2} d s], \end{matrix}

again, by the properties of the Itō integral, we finally obtain by Hölder’s inequality and the Burkholder-Davis-Gundy inequality that

E [\int_{η (t)}^{t} K (s, {\hat{x}}_{s}, v_{s}) \int_{η (t)}^{t} \sqrt{v_{u}} d W_{u} \int_{η (s)}^{s} \sqrt{v_{u}} d W_{u} d s] = O ({(Δ t)}^{2}) .
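The conditioning step and the resulting bound can be spelled out as follows. This is only a sketch: the shorthand M_{a,b} is introduced here for illustration and does not appear in the original argument. It uses the martingale property of the Itō integral, the fact that η(s) = η(t) for s in [η(t), t], and that K (s, {\hat{x}}_{s}, v_{s}) has finite moments under the polynomial growth assumption.

```latex
% M_{a,b} is shorthand introduced for this sketch only.
M_{a,b} := \int_{a}^{b} \sqrt{v_{u}} \, d W_{u}, \qquad η(s) = η(t) \quad \text{for } s \in [η(t), t].

% Conditioning on F_s: M_{η(s),s} is F_s-measurable and E[M_{s,t} | F_s] = 0.
E [ M_{η(t), t} \, M_{η(s), s} \mid F_{s} ]
  = M_{η(s), s} \, E [ M_{η(t), s} + M_{s, t} \mid F_{s} ]
  = ( M_{η(s), s} )^{2} .

% Hölder and Burkholder-Davis-Gundy, then integration over an interval of length at most Δt:
E [ | K (s, {\hat{x}}_{s}, v_{s}) | \, ( M_{η(s), s} )^{2} ]
  \leq ( E | K (s, {\hat{x}}_{s}, v_{s}) |^{2} )^{1/2} \, ( E | M_{η(s), s} |^{4} )^{1/2}
  \leq c \, (s - η(s)) \leq c \, Δ t ,
\qquad
\int_{η(t)}^{t} c \, Δ t \, d s \leq c \, (Δ t)^{2} .
```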

Thus, we can conclude that

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} k (t, {\hat{x}}_{t}, v_{t})] & = σ E [\int_{η (t)}^{t} v_{s} k_{v} (s, {\hat{x}}_{s}, v_{s}) d s] + O ({(Δ t)}^{2}), \end{matrix}

(21)

for

k = u_{x}

and

k = u_{x x}

, if the derivatives up to order four of u have polynomial growth.

5.2. Euler Scheme: Conclusion

Setting now

k = u_{x}

in Equation (21), we have from (19) that

\begin{matrix} e_{n}^{(1)} & = - (\frac{ρ κ}{σ} - \frac{1}{2}) \int_{t_{n}}^{t_{n + 1}} \int_{η (t)}^{t} E [u_{x} (t, {\hat{x}}_{t}, v_{t}) κ (θ - v_{s})] d s d t \\ - σ^{2} (\frac{ρ κ}{σ} - \frac{1}{2}) \int_{t_{n}}^{t_{n + 1}} \int_{η (t)}^{t} E [v_{s} u_{x v} (s, {\hat{x}}_{s}, v_{s})] d s d t + O ({(Δ t)}^{3}) . \end{matrix}

Replacing now the function k by

u_{x x}

in Equation (21), we arrive from (20) at

\begin{matrix} e_{n}^{(2)} = & - \frac{1}{2} (1 - ρ^{2}) \int_{t_{n}}^{t_{n + 1}} \int_{η (t)}^{t} E [u_{x x} (t, {\hat{x}}_{t}, v_{t}) κ (θ - v_{s})] d s d t \\ - \frac{σ^{2}}{2} (1 - ρ^{2}) \int_{t_{n}}^{t_{n + 1}} \int_{η (t)}^{t} E [v_{s} u_{x x v} (s, {\hat{x}}_{s}, v_{s})] d s d t + O ({(Δ t)}^{3}) . \end{matrix}

Summarizing, we have shown that

\begin{matrix} e_{n} & = (\frac{1}{2} - \frac{ρ κ}{σ}) \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} κ (θ - v_{s}) u_{x} (t, {\hat{x}}_{t}, v_{t}) + σ^{2} v_{s} u_{x v} (s, {\hat{x}}_{s}, v_{s}) d s] d t \\ - \frac{(1 - ρ^{2})}{2} \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} κ (θ - v_{s}) u_{x x} (t, {\hat{x}}_{t}, v_{t}) + σ^{2} v_{s} u_{x x v} (s, {\hat{x}}_{s}, v_{s}) d s] d t \\ + O ({(Δ t)}^{3}) \end{matrix}

and

\begin{matrix} E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})] = \sum_{n = 0}^{N - 1} \int_{t_{n}}^{t_{n + 1}} \int_{t_{n}}^{t} E H (s, t, {\hat{x}}_{s}, {\hat{x}}_{t}, v_{s}, v_{t}) d s d t + O ({(Δ t)}^{2}) \end{matrix}

where

\begin{array}{l} H (s, t, {\hat{x}}_{s}, {\hat{x}}_{t}, v_{s}, v_{t}) & = (\frac{1}{2} - \frac{ρ κ}{σ}) (κ (θ - v_{s}) u_{x} (t, {\hat{x}}_{t}, v_{t}) + σ^{2} v_{s} u_{x v} (s, {\hat{x}}_{s}, v_{s})) \\ - \frac{(1 - ρ^{2})}{2} (κ (θ - v_{s}) u_{x x} (t, {\hat{x}}_{t}, v_{t}) + σ^{2} v_{s} u_{x x v} (s, {\hat{x}}_{s}, v_{s})) . \end{array}

An application of the mean value theorem, the polynomial growth of the derivatives of u, the Minkowski inequality, the Hölder inequality, and the Lemmata 1 and 6 yields for

s, t \in [t_{n}, t_{n + 1}]

that

\begin{array}{l} E [H (s, t, {\hat{x}}_{s}, {\hat{x}}_{t}, v_{s}, v_{t})] & = E [H (t_{n}, t_{n}, X_{t_{n}}, X_{t_{n}}, V_{t_{n}}, V_{t_{n}})] + O ({(Δ t)}^{1 / 4}) . \end{array}

Note here that

u \in C_{p o l, T}^{4} (R \times R_{+}; R)

implies that

u_{t x}

and

u_{t x x}

are well-defined, have polynomial growth, and are continuous.

Thus, for an equidistant discretization

t_{k} = k T / N

,

k = 0, \dots, N

, we have

\begin{matrix} E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})] = \frac{Δ t}{2} \sum_{n = 0}^{N - 1} E [H (t_{n}, t_{n}, X_{t_{n}}, X_{t_{n}}, V_{t_{n}}, V_{t_{n}})] Δ t + O ({(Δ t)}^{5 / 4}) . \end{matrix}
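The factor Δt/2 and the remainder order in the last display can be checked with a short calculation. The following is a sketch under the equidistant-grid assumption N Δt = T; the abbreviation H_n for H (t_{n}, t_{n}, X_{t_{n}}, X_{t_{n}}, V_{t_{n}}, V_{t_{n}}) is introduced here only for brevity.

```latex
% Area of the triangle { t_n <= s <= t <= t_{n+1} }:
\int_{t_{n}}^{t_{n + 1}} \int_{t_{n}}^{t} d s \, d t
  = \int_{t_{n}}^{t_{n + 1}} (t - t_{n}) \, d t
  = \frac{(Δ t)^{2}}{2} .

% Summing over the N = T / Δt steps; the earlier O((Δt)^2) remainder is absorbed:
\sum_{n = 0}^{N - 1} \frac{(Δ t)^{2}}{2} ( E [H_{n}] + O ((Δ t)^{1/4}) ) + O ((Δ t)^{2})
  = \frac{Δ t}{2} \sum_{n = 0}^{N - 1} E [H_{n}] \, Δ t + O ((Δ t)^{5/4}) .
```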

Since

\sum_{n = 0}^{N - 1} E [H (t_{n}, t_{n}, X_{t_{n}}, X_{t_{n}}, V_{t_{n}}, V_{t_{n}})] Δ t \to \int_{0}^{T} E [H (t, t, X_{t}, X_{t}, V_{t}, V_{t})] d t

for

Δ t \to 0

, this concludes the proof of Theorem 2 (i).

5.3. Semi-Trapezoidal Scheme: Preliminaries

By the Lemmata 1, 8, and 11, we have that

\sup_{t \in [0, T]} E | v_{t} |^{p} + \sup_{t \in [0, T]} E {| {\hat{x}}_{t} |}^{p} < \infty

and

E | v_{t} - v_{s} |^{p} + E | {\hat{x}}_{t} - {\hat{x}}_{s} |^{p} \leq c \cdot {| t - s |}^{p / 2}, s, t \in [0, T],

for all

p \geq 1

. Using the Burkholder-Davis-Gundy, Hölder, and Minkowski inequalities, we also have

\sup_{t \in [0, T]} E {| X_{t} |}^{p} < \infty

and

E | X_{t} - X_{s} |^{p} \leq c \cdot {| t - s |}^{p / 2}, s, t \in [0, T],

for all

p \geq 1

.

We will use this in the following at several places without explicitly mentioning it.

We now also take a closer look at the error of the semi-trapezoidal discretization for

u \in C_{p o l, T}^{4} (R \times R_{+}; R)

. Recall that

\begin{matrix} e_{n}^{(1)} : = & (\frac{ρ κ}{σ} - \frac{1}{2}) \int_{t_{n}}^{t_{n + 1}} E [\frac{1}{2} (- \int_{η (t)}^{t} κ (v_{t} - v_{s}) d s - σ \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}) u_{x} (t, {\hat{x}}_{t}, v_{t})] d t, \\ e_{n}^{(2)} : = & \int_{t_{n}}^{t_{n + 1}} E [\frac{1}{2} (1 - ρ^{2}) (- \int_{η (t)}^{t} κ (θ - v_{s}) d s - σ \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}) u_{x x} (t, {\hat{x}}_{t}, v_{t}) \\ + {(\frac{ρ κ}{σ} - \frac{1}{2})}^{2} \frac{σ^{2}}{8} {(t - η (t))}^{2} v_{t} u_{x x} (t, {\hat{x}}_{t}, v_{t})] d t, \\ e_{n}^{(3)} : = & (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{σ^{2}}{2} \int_{t_{n}}^{t_{n + 1}} E [(t - η (t)) v_{t} u_{x v} (t, {\hat{x}}_{t}, v_{t})] d t . \end{matrix}

We can again use the Itō formula and the Kolmogorov backward PDE (7) evaluated at

(s, {\hat{x}}_{s}, v_{s})

and obtain for a

C^{1, 2}

-function k, which fulfills the PDE (7), that

\begin{matrix} k (t, {\hat{x}}_{t}, v_{t}) - k (t_{n}, {\hat{x}}_{t_{n}}, v_{t_{n}}) \\ = \int_{t_{n}}^{t} k_{t} (s, {\hat{x}}_{s}, v_{s}) d s + \int_{t_{n}}^{t} k_{v} (s, {\hat{x}}_{s}, v_{s}) d v_{s} + \int_{t_{n}}^{t} k_{x} (s, {\hat{x}}_{s}, v_{s}) d {\hat{x}}_{s} \\ + \frac{1}{2} \int_{t_{n}}^{t} k_{x x} (s, {\hat{x}}_{s}, v_{s}) d {⟨ \hat{x} ⟩}_{s} + \int_{t_{n}}^{t} k_{x v} (s, {\hat{x}}_{s}, v_{s}) d {⟨ \hat{x}, v ⟩}_{s} + \frac{1}{2} \int_{t_{n}}^{t} k_{v v} (s, {\hat{x}}_{s}, v_{s}) d {⟨ v ⟩}_{s} \\ = \int_{t_{n}}^{t} (a_{s} - (\frac{ρ κ}{σ} - \frac{1}{2}) v_{s}) k_{x} (s, {\hat{x}}_{s}, v_{s}) d s + \frac{1}{2} \int_{t_{n}}^{t} (b_{s}^{2} + c_{s}^{2} - (1 - ρ^{2}) v_{s}) k_{x x} (s, {\hat{x}}_{s}, v_{s}) d s \\ + \int_{t_{n}}^{t} c_{s} σ \sqrt{v_{s}} k_{x v} (s, {\hat{x}}_{s}, v_{s}) d s + \int_{t_{n}}^{t} b_{s} k_{x} (s, {\hat{x}}_{s}, v_{s}) d B_{s} + \int_{t_{n}}^{t} c_{s} k_{x} (s, {\hat{x}}_{s}, v_{s}) d W_{s} \\ + \int_{t_{n}}^{t} σ \sqrt{v_{s}} k_{v} (s, {\hat{x}}_{s}, v_{s}) d W_{s} \end{matrix}

(22)

with

\begin{matrix} a_{t} & : = (\frac{ρ κ}{σ} - \frac{1}{2}) (v_{η (t)} + \frac{1}{2} (t - η (t)) κ (θ - v_{t}) + \frac{1}{2} (v_{t} - v_{η (t)})), \\ b_{t} & : = \sqrt{1 - ρ^{2}} \sqrt{v_{η (t)}}, \\ c_{t} & : = (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{1}{2} (t - η (t)) σ \sqrt{v_{t}} . \end{matrix}

Analogous calculations as for the Euler scheme yield that

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{τ}} d W_{τ} k (t, {\hat{x}}_{t}, v_{t})] & = σ E [\int_{η (t)}^{t} v_{τ} k_{v} (τ, {\hat{x}}_{τ}, v_{τ}) d τ] + O ({(Δ t)}^{2}) \end{matrix}

(23)

for

k = u_{x}

and

k = u_{x x}

under the assumption

u \in C_{p o l, T}^{4} (R \times R_{+}; R)

.

5.4. Semi-Trapezoidal Rule: Calculations for $e_{n}^{(1)}$ , $e_{n}^{(2)}$ , and $e_{n}^{(3)}$

Rewriting the terms of

e_{n}^{(1)}

using (23) for the last term gives

\begin{matrix} e_{n}^{(1)} = & - (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{1}{2} \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} κ (v_{t} - v_{s}) d s u_{x} (t, {\hat{x}}_{t}, v_{t})] d t \\ - (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{σ}{2} \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x} (t, {\hat{x}}_{t}, v_{t})] d t \\ = & - (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{1}{2} \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} κ (\int_{s}^{t} κ (θ - v_{u}) d u + σ \int_{s}^{t} \sqrt{v_{u}} d W_{u}) d s u_{x} (t, {\hat{x}}_{t}, v_{t})] d t \\ - (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{σ^{2}}{2} \int_{t_{n}}^{t_{n + 1}} \int_{η (t)}^{t} E [v_{s} u_{x v} (s, {\hat{x}}_{s}, v_{s})] d s d t + O ({(Δ t)}^{3}) . \end{matrix}

Applying, again, (23) with s instead of

η (t)

as the lower bound of the integral to the second summand of the first term, using the polynomial growth of the derivatives of u and Hölder’s inequality, we also have

\int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} κ (\int_{s}^{t} κ (θ - v_{u}) d u + σ \int_{s}^{t} \sqrt{v_{u}} d W_{u}) d s u_{x} (t, {\hat{x}}_{t}, v_{t})] d t = O ({(Δ t)}^{3})

and so

\begin{array}{l} e_{n}^{(1)} = & - (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{σ^{2}}{2} \int_{t_{n}}^{t_{n + 1}} \int_{η (t)}^{t} E [v_{s} u_{x v} (s, {\hat{x}}_{s}, v_{s})] d s d t + O ({(Δ t)}^{3}) . \end{array}

Adding

e_{n}^{(3)}

yields

\begin{matrix} e_{n}^{(1)} + e_{n}^{(3)} & = O ({(Δ t)}^{3}) + (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{σ^{2}}{2} \int_{t_{n}}^{t_{n + 1}} \int_{t_{n}}^{t} E [u_{x v} (t, {\hat{x}}_{t}, v_{t}) v_{t} - u_{x v} (s, {\hat{x}}_{s}, v_{s}) v_{s}] d s d t . \end{matrix}

However, Itō’s formula gives for sufficiently smooth

k : [0, T] \times R \times [0, \infty) \to R

that

\begin{matrix} k (t, {\hat{x}}_{t}, v_{t}) - k (s, {\hat{x}}_{s}, v_{s}) \\ = \int_{s}^{t} k_{t} (r, {\hat{x}}_{r}, v_{r}) d r + \int_{s}^{t} k_{v} (r, {\hat{x}}_{r}, v_{r}) d v_{r} + \int_{s}^{t} k_{x} (r, {\hat{x}}_{r}, v_{r}) d {\hat{x}}_{r} \\ + \frac{1}{2} \int_{s}^{t} k_{x x} (r, {\hat{x}}_{r}, v_{r}) d {⟨ \hat{x} ⟩}_{r} + \int_{s}^{t} k_{x v} (r, {\hat{x}}_{r}, v_{r}) d {⟨ \hat{x}, v ⟩}_{r} + \frac{1}{2} \int_{s}^{t} k_{v v} (r, {\hat{x}}_{r}, v_{r}) d {⟨ v ⟩}_{r} \end{matrix}

(24)

with

\begin{matrix} d v_{t} & = κ (θ - v_{t}) d t + σ \sqrt{v_{t}} d W_{t}, d {\hat{x}}_{t} = a_{t} d t + b_{t} d B_{t} + c_{t} d W_{t}, \end{matrix}

where

\begin{matrix} a_{t} & : = (\frac{ρ κ}{σ} - \frac{1}{2}) (v_{η (t)} + \frac{1}{2} (t - η (t)) κ (θ - v_{t}) + \frac{1}{2} (v_{t} - v_{η (t)})), \\ b_{t} & : = \sqrt{1 - ρ^{2}} \sqrt{v_{η (t)}}, \\ c_{t} & : = (\frac{ρ κ}{σ} - \frac{1}{2}) \frac{1}{2} (t - η (t)) σ \sqrt{v_{t}} \end{matrix}

and

\begin{matrix} d {⟨ \hat{x} ⟩}_{t} & = (b_{t}^{2} + c_{t}^{2}) d t, d {⟨ \hat{x}, v ⟩}_{t} = σ c_{t} \sqrt{v_{t}} d t, d {⟨ v ⟩}_{t} = σ^{2} v_{t} d t . \end{matrix}

Since

u \in C_{p o l, T}^{4} (R \times R_{+}; R)

, we can apply this to

k (t, x, v) = u_{x v} (t, x, v) v

and taking expectations gives then

E [u_{x v} (t, {\hat{x}}_{t}, v_{t}) v_{t} - u_{x v} (s, {\hat{x}}_{s}, v_{s}) v_{s}] = O (| t - s |) .

So, we end up with

\begin{matrix} e_{n}^{(1)} + e_{n}^{(3)} & = O ({(Δ t)}^{3}) . \end{matrix}
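The order of the remaining double integral follows from an elementary computation, sketched here under the assumption (used throughout the proof) that η(t) = t_{n} for t in [t_{n}, t_{n + 1}):

```latex
% An integrand of size O(|t - s|) over the triangle { t_n <= s <= t <= t_{n+1} }:
\int_{t_{n}}^{t_{n + 1}} \int_{t_{n}}^{t} (t - s) \, d s \, d t
  = \int_{t_{n}}^{t_{n + 1}} \frac{(t - t_{n})^{2}}{2} \, d t
  = \frac{(Δ t)^{3}}{6} ,
% so that
\int_{t_{n}}^{t_{n + 1}} \int_{t_{n}}^{t} O (| t - s |) \, d s \, d t = O ((Δ t)^{3}) .
```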

Looking at

e_{n}^{(2)}

, the last term is already of third order:

\begin{matrix} e_{n}^{(2)} = & \int_{t_{n}}^{t_{n + 1}} E [\frac{1}{2} (1 - ρ^{2}) (- \int_{η (t)}^{t} κ (θ - v_{s}) d s - σ \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}) u_{x x} (t, {\hat{x}}_{t}, v_{t}) \\ + {(\frac{ρ κ}{σ} - \frac{1}{2})}^{2} \frac{σ^{2}}{8} {(t - η (t))}^{2} v_{t} u_{x x} (t, {\hat{x}}_{t}, v_{t})] d t \\ = & \int_{t_{n}}^{t_{n + 1}} E [\frac{1}{2} (1 - ρ^{2}) (- \int_{η (t)}^{t} κ (θ - v_{s}) d s - σ \int_{η (t)}^{t} \sqrt{v_{s}} d W_{s}) u_{x x} (t, {\hat{x}}_{t}, v_{t})] d t \\ + O ({(Δ t)}^{3}) . \end{matrix}

Since

\begin{matrix} E [\int_{η (t)}^{t} \sqrt{v_{s}} d W_{s} u_{x x} (t, {\hat{x}}_{t}, v_{t})] & = σ E [\int_{η (t)}^{t} v_{s} u_{x x v} (s, {\hat{x}}_{s}, v_{s}) d s] + O ({(Δ t)}^{2}), \end{matrix}

by (23), it follows that

\begin{matrix} e_{n}^{(2)} = & - \frac{1}{2} (1 - ρ^{2}) \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} κ (θ - v_{s}) d s u_{x x} (t, {\hat{x}}_{t}, v_{t})] d t \\ - \frac{1}{2} (1 - ρ^{2}) σ^{2} \int_{t_{n}}^{t_{n + 1}} \int_{η (t)}^{t} E [v_{s} u_{x x v} (s, {\hat{x}}_{s}, v_{s})] d s d t + O ({(Δ t)}^{3}) . \end{matrix}

5.5. Semi-Trapezoidal Scheme: Conclusion

Summarizing, we have shown that

\begin{matrix} e_{n} = - \frac{(1 - ρ^{2})}{2} \int_{t_{n}}^{t_{n + 1}} E [\int_{η (t)}^{t} κ (θ - v_{s}) u_{x x} (t, {\hat{x}}_{t}, v_{t}) + σ^{2} v_{s} u_{x x v} (s, {\hat{x}}_{s}, v_{s}) d s] d t + O ({(Δ t)}^{3}) \end{matrix}

and

\begin{matrix} E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})] = \sum_{n = 0}^{N - 1} \int_{t_{n}}^{t_{n + 1}} \int_{t_{n}}^{t} E H (s, t, {\hat{x}}_{s}, {\hat{x}}_{t}, v_{s}, v_{t}) d s d t + O ({(Δ t)}^{2}) \end{matrix}

where

\begin{matrix} H (s, t, {\hat{x}}_{s}, {\hat{x}}_{t}, v_{s}, v_{t}) & = - \frac{(1 - ρ^{2})}{2} (κ (θ - v_{s}) u_{x x} (t, {\hat{x}}_{t}, v_{t}) + σ^{2} v_{s} u_{x x v} (s, {\hat{x}}_{s}, v_{s})) . \end{matrix}

An application of the mean value theorem, the polynomial growth of the derivatives of u, the Minkowski inequality, the Hölder inequality, and the Lemmata 1 and 10 yields for

s, t \in [t_{n}, t_{n + 1}]

that

\begin{matrix} E [H (s, t, {\hat{x}}_{s}, {\hat{x}}_{t}, v_{s}, v_{t})] & = E [H (t_{n}, t_{n}, X_{t_{n}}, X_{t_{n}}, V_{t_{n}}, V_{t_{n}})] + O ({(Δ t)}^{1 / 4}) . \end{matrix}

In particular, for an equidistant discretization

t_{k} = k T / N

,

k = 0, \dots, N

, we have

\begin{matrix} E [f (x_{N}, v_{N})] - E [f (X_{T}, V_{T})] = \frac{Δ t}{2} \sum_{n = 0}^{N - 1} E H (t_{n}, t_{n}, X_{t_{n}}, X_{t_{n}}, V_{t_{n}}, V_{t_{n}}) Δ t + O ({(Δ t)}^{5 / 4}) \end{matrix}

and the convergence

\sum_{n = 0}^{N - 1} E H (t_{n}, t_{n}, X_{t_{n}}, X_{t_{n}}, V_{t_{n}}, V_{t_{n}}) Δ t \to \int_{0}^{T} E H (t, t, X_{t}, X_{t}, V_{t}, V_{t}) d t

for

Δ t \to 0

concludes the proof of Theorem 2 (ii).
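An error expansion with a leading term proportional to Δt is what makes Richardson extrapolation possible: combining estimates at step sizes Δt and Δt/2 cancels the first-order term. The following Python sketch illustrates this on a purely hypothetical error model; the names `richardson` and `toy_estimate`, the constants, and in particular the assumption of a smooth second-order remainder are illustrative and go beyond the O((Δt)^{5/4}) remainder established above.

```python
def richardson(estimate_h, estimate_half_h):
    """Combine estimates for step sizes h and h/2 so that the O(h) error term cancels."""
    return 2.0 * estimate_half_h - estimate_h


def toy_estimate(h, exact=1.0, c1=0.3, c2=0.7):
    """Hypothetical biased estimator: exact value plus c1*h + c2*h^2 (assumed model)."""
    return exact + c1 * h + c2 * h * h


def toy_errors(h, exact=1.0):
    """Return the absolute errors of the plain and the extrapolated toy estimators."""
    plain = abs(toy_estimate(h, exact) - exact)
    extrapolated = abs(richardson(toy_estimate(h, exact), toy_estimate(h / 2, exact)) - exact)
    return plain, extrapolated


if __name__ == "__main__":
    for k in range(2, 7):
        h = 2.0 ** -k
        plain, extrapolated = toy_errors(h)
        print(f"h = 2^-{k}: plain error {plain:.3e}, extrapolated error {extrapolated:.3e}")
```

In this toy model the plain error decays linearly in h, while the extrapolated error equals c2 * h^2 / 2 and therefore decays quadratically. In a Monte Carlo setting the two estimates would additionally carry statistical error, which this sketch ignores.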

Author Contributions

All authors have contributed substantially to this manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

DFG Research Training Group 1953 “Statistical Modeling of Complex Systems”.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data sharing not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Alos, Elisa, and Christian-Oliver Ewald. 2008. Malliavin differentiability of the Heston volatility and applications to option pricing. Advances in Applied Probability 40: 144–62.
Altmayer, Martin. 2015. Quadrature of Discontinuous SDE Functionals Using Malliavin Integration by Parts. Ph.D. dissertation, University of Mannheim, Germany.
Altmayer, Martin, and Andreas Neuenkirch. 2015. Multilevel Monte Carlo quadrature of discontinuous payoffs in the generalized Heston model using Malliavin integration by parts. SIAM Journal on Financial Mathematics 6: 22–52.
Altmayer, Martin, and Andreas Neuenkirch. 2017. Discretising the Heston model: An analysis of the weak convergence rate. IMA Journal of Numerical Analysis 37: 1930–60.
Andersen, Leif. 2008. Simple and efficient simulation of the Heston stochastic volatility model. Journal of Computational Finance 11: 29–50.
Bally, Vlad, and Denis Talay. 1996. The law of the Euler scheme for stochastic differential equations. I: Convergence rate of the distribution function. Probability Theory and Related Fields 104: 43–60.
Bossy, Mireille, and Awa Diop. 2015. Weak convergence analysis of the symmetrized Euler scheme for one dimensional SDEs with diffusion coefficient |x|^a, a ∈ [1/2,1). arXiv, arXiv:1508.04573.
Briani, Maya, Lucia Caramellino, and Giulia Terenzi. 2018. Convergence rate of Markov chains and hybrid numerical schemes to jump-diffusions with application to the Bates model. arXiv, arXiv:1809.10545.
Broadie, Mark, and Özgür Kaya. 2006. Exact simulation of stochastic volatility and other affine jump diffusion processes. Operations Research 54: 217–31.
Coskun, Sema, and Ralf Korn. 2018. Pricing barrier options in the Heston model using the Heath–Platen estimator. Monte Carlo Methods and Applications 24: 29–41.
Cui, Zhenyu, Justin Kirkby, and Duy Nguyen. 2018. A general valuation framework for SABR and stochastic local volatility models. SIAM Journal on Financial Mathematics 9: 520–63.
Cui, Zhenyu, Justin Kirkby, and Duy Nguyen. 2020. Efficient simulation of generalized SABR and stochastic local volatility models based on Markov chain approximations. European Journal of Operational Research.
Feehan, Paul, and Camelia Pop. 2013. A Schauder approach to degenerate-parabolic partial differential equations with unbounded coefficients. Journal of Differential Equations 254: 4401–45.
Glasserman, Paul, and Kyoung-Kuk Kim. 2011. Gamma expansion of the Heston stochastic volatility model. Finance and Stochastics 15: 267–96.
Heston, Steven. 1993. A closed-form solution for options with stochastic volatility with applications to bond and currency options. The Review of Financial Studies 6: 327–43.
Hurd, Thomas, and Alexey Kuznetsov. 2008. Explicit formulas for Laplace transforms of stochastic integrals. Markov Processes and Related Fields 14: 277–90.
Kahl, Christian, and Peter Jäckel. 2006. Fast strong approximation Monte-Carlo schemes for stochastic volatility models. Quantitative Finance 6: 513–36.
Karatzas, Ioannis, and Steven Shreve. 1991. Brownian Motion and Stochastic Calculus, 2nd ed. New York: Springer.
Malham, Simon, and Anke Wiese. 2013. Chi-square simulation of the CIR process and the Heston model. International Journal of Theoretical and Applied Finance 16: 1350014.
Nualart, David. 1995. The Malliavin Calculus and Related Topics. New York: Springer.
Lord, Roger, Remmert Koekkoek, and Dick van Dijk. 2009. A comparison of biased simulation schemes for stochastic volatility models. Quantitative Finance 10: 177–94.
Smith, Robert. 2007. An almost exact simulation method for the Heston model. Journal of Computational Finance 11: 115–25.
Talay, Denis, and Luciano Tubaro. 1990. Expansion of the global error for numerical schemes solving stochastic differential equations. Stochastic Analysis and Applications 8: 483–509.
Zheng, Chao. 2017. Weak convergence rate of a time-discrete scheme for the Heston stochastic volatility model. SIAM Journal on Numerical Analysis 55: 1243–63.

Figure 1. Call Model 1.

Figure 2. Call Model 1.

Figure 3. Put Model 1.

Figure 4. Put Model 1.

Figure 5. Indicator Model 1.

Figure 6. Indicator Model 1.

Figure 7. Call Model 2.

Figure 8. Call Model 2.

Figure 9. Put Model 2.

Figure 10. Put Model 2.

Figure 11. Indicator Model 2.

Figure 12. Indicator Model 2.

Figure 13. Call Model 3.

Figure 14. Call Model 3.

Figure 15. Put Model 3.

Figure 16. Put Model 3.

Figure 17. Indicator Model 3.

Figure 18. Indicator Model 3.

Table 1. Measured convergence rates Model 1.

Method	Call	Put	Indicator
Euler	1.5252	0.9492	1.1870
Semi-Trapezoidal	2.0174	0.2857	1.8343
FTE	1.5205	1.5205	1.2847
Symmetrized Euler	0.3693	0.3659	0.3250
Trapezoidal	2.0283	1.1119	2.4544
Euler extrap.	2.3114	2.0172	1.9719
Semi-Trapezoidal extrap.	1.8687	1.9999	0.9834

Table 2. Measured convergence rates Model 2.

Method	Call	Put	Indicator
Euler	0.4335	1.2898	0.8565
Semi-Trapezoidal	1.3025	0.7810	0.9518
FTE	1.2050	1.1733	1.0546
Symmetrized Euler	0.3028	0.3021	0.2421
Trapezoidal	1.8925	2.1272	1.6324
Euler extrap.	0.9483	1.4393	1.5966
Semi-Trapezoidal extrap.	1.4840	1.0481	1.2744

Table 3. Measured convergence rates Model 3.

Method	Call	Put	Indicator
Euler	0.6977	0.5378	1.0695
Semi-Trapezoidal	1.6989	1.6551	1.6396
FTE	2.0091	1.7303	1.6008
Symmetrized Euler	1.0386	1.0426	0.9018
Trapezoidal	1.8682	1.6799	1.5219
Euler extrap.	1.1612	1.1857	2.2612
Semi-Trapezoidal extrap.	1.5660	1.0441	1.5979
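Convergence rates of this kind are commonly estimated as the slope of a least-squares line fitted to the logarithm of the observed error against the logarithm of the step size. The following Python sketch shows that calculation on synthetic data; it is an assumption that the rates in Tables 1 to 3 were obtained in this way, and the function name `estimate_rate` and the sample errors are hypothetical.

```python
import math


def estimate_rate(step_sizes, errors):
    """Least-squares slope of log2(error) versus log2(step size)."""
    xs = [math.log2(h) for h in step_sizes]
    ys = [math.log2(e) for e in errors]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


if __name__ == "__main__":
    # Synthetic errors that decay exactly like 0.5 * h, i.e. with rate one.
    steps = [2.0 ** -k for k in range(2, 8)]
    errors = [0.5 * h for h in steps]
    print(f"estimated rate: {estimate_rate(steps, errors):.4f}")
```

With Monte Carlo estimates the observed errors also contain statistical noise, so fitted slopes can deviate noticeably from the theoretical order, especially when the bias is already small.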

Table 4. Computational times (sec.) of the semi-exact schemes for 2⁶ time steps and 2 × 10⁷ paths.

Model	1	2	3
Euler	345.73	755.19	145.40
Semi-Trapezoidal	344.53	757.93	144.79
Trapezoidal	342.51	766.01	143.39
Euler extrap.	690.36	2335.94	307.62
Semi-Trapezoidal extrap.	686.55	2371.67	310.29

Table 5. Computational times (sec.) of Euler-type discretizations for 2⁶ time steps and 2 × 10⁷ paths.

Model	1	2	3
FTE	142.3	138.37	141.53
Symmetrized Euler	141.64	140.98	141.67

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
