Consistent Re-Calibration of the Discrete-Time Multifactor Vasiček Model

Harms, Philipp; Stefanovits, David; Teichmann, Josef; Wüthrich, Mario V.

doi:10.3390/risks4030018


Consistent Re-Calibration of the Discrete-Time Multifactor Vasiček Model ^†

by Philipp Harms ^1,*, David Stefanovits ^2, Josef Teichmann ^2 and Mario V. Wüthrich ^3,4

^1 Institute of Mathematics, Albert Ludwigs University of Freiburg, 79104 Freiburg, Germany

^2 Department of Mathematics, ETH Zurich, 8092 Zurich, Switzerland

^3 Department of Mathematics, RiskLab, ETH Zurich, 8092 Zurich, Switzerland

^4 Swiss Finance Institute SFI, Walchestrasse 9, 8006 Zurich, Switzerland

^* Author to whom correspondence should be addressed.

^† We thank Hansjörg Furrer for supporting this project.

Risks 2016, 4(3), 18; https://doi.org/10.3390/risks4030018

Submission received: 21 December 2015 / Revised: 12 May 2016 / Accepted: 8 June 2016 / Published: 23 June 2016


Abstract

The discrete-time multifactor Vasiček model is a tractable Gaussian spot rate model. Typically, two- or three-factor versions allow one to capture the dependence structure between yields with different times to maturity in an appropriate way. In practice, re-calibration of the model to the prevailing market conditions leads to model parameters that change over time. Therefore, the model parameters should be understood as being time-dependent or even stochastic. Following the consistent re-calibration (CRC) approach, we construct models as concatenations of yield curve increments of Hull–White extended multifactor Vasiček models with different parameters. The CRC approach provides attractive tractable models that preserve the no-arbitrage premise. As a numerical example, we fit Swiss interest rates using CRC multifactor Vasiček models.

Keywords:

interest rate model; re-calibration; HJM model; Vasiček model; Hull–White extension

1. Introduction

The tractability of affine models, such as the Vasiček [1] and the Cox–Ingersoll–Ross [2] models, has made them appealing for term structure modeling. Affine term structure models are based on a (multidimensional) factor process, which in turn describes the evolution of the spot rate and the bank account processes. No-arbitrage arguments then provide the corresponding zero-coupon bond prices, yield curves and forward rates. Prices in these models are calculated under an equivalent martingale measure for known static model parameters. However, model parameters typically vary over time as financial market conditions change. They may, for instance, be of a regime switching nature and need to be continually re-calibrated to the current financial market conditions. In practice, this re-calibration is done on a regular basis (as new information becomes available). This implies that model parameters are not static and, hence, may also be understood as stochastic processes. The re-calibration should preserve the no-arbitrage condition, which provides side constraints in the re-calibration. The aim of this work is to discuss these side constraints with the help of the discrete-time multifactor Vasiček interest rate model, which is a tractable, but also flexible model. We show that re-calibration under the side constraints naturally leads to Heath–Jarrow–Morton [3] models with stochastic parameters, which we call consistent re-calibration (CRC) models [4].

These models are attractive in financial applications for several reasons. In risk management and in the current regulatory framework [5], one needs realistic and tractable models of portfolio returns. Our approach provides tractable non-Gaussian models for multi-period returns on bond portfolios. Moreover, stress tests for risk management purposes can be implemented efficiently in our framework by selecting suitable models for the parameter process. While an in-depth market study of the performance of CRC models remains to be done, we provide in this paper some evidence of improved fits.

The paper is organized as follows. In Section 2, we introduce Hull–White extended discrete-time multifactor Vasiček models, which are the building blocks for CRC in this work. We define CRC of the Hull–White extended multifactor Vasiček model in Section 3. Section 4 specifies the market price of risk assumptions used to model the factor process under the real-world probability measure and the equivalent martingale measure, respectively. In Section 5, we deal with parameter estimation from market data. In Section 6, we fit the model to Swiss interest rate data, and in Section 7, we conclude. All proofs are presented in Appendix A.

2. Discrete-Time Multifactor Vasiček Model and Hull–White Extension

2.1. Setup and Notation

Choose a fixed grid size $Δ > 0$ and consider the discrete-time grid $\{0, Δ, 2 Δ, 3 Δ, \dots\} = N_{0} Δ$. For example, a daily grid corresponds to $Δ = 1 / 252$ if there are 252 business days per year. Choose a (sufficiently rich) filtered probability space $(Ω, F, F, P^{*})$ with discrete-time filtration $F = {(F (t))}_{t \in N_{0}}$, where $t \in N_{0}$ refers to time point $t Δ$. Assume that $P^{*}$ denotes an equivalent martingale measure for a (strictly positive) bank account numeraire ${(B (t))}_{t \in N_{0}}$. $B (t)$ denotes the value at time $t Δ$ of an investment of one unit of currency at Time 0 into the bank account (i.e., the risk-free rollover relative to Δ).

We use the following notation. Subscript indices refer to elements of vectors and matrices. Argument indices refer to time points. We denote the $n \times n$ identity matrix by $1 \in R^{n \times n}$. We also introduce the vectors $1 = {(1, \dots, 1)}^{⊤} \in R^{n}$ and $e_{1} = {(1, 0, \dots, 0)}^{⊤} \in R^{n}$.

2.2. Discrete-Time Multifactor Vasiček Model

We choose $n \in N$ fixed and introduce the n-dimensional $F$-adapted factor process:

X = {(X (t))}_{t \in N_{0}} = {(X_{1} (t), \dots, X_{n} (t))}_{t \in N_{0}}^{⊤},

which generates the spot rate and bank account processes as follows:

r (t) = 1^{⊤} X (t) and B (t) = exp \{Δ \sum_{s = 0}^{t - 1} r (s)\},

(1)

where $t \in N_{0}$; empty sums are set equal to zero. The factor process $X$ is assumed to evolve under $P^{*}$ according to:

X (t) = b + β X (t - 1) + Σ^{\frac{1}{2}} ε^{*} (t), t > 0,

(2)

with initial factor $X (0) \in R^{n}$, $b \in R^{n}$, $β \in R^{n \times n}$, $Σ^{\frac{1}{2}} \in R^{n \times n}$ and ${(ε^{*} (t))}_{t \in N} = {(ε_{1}^{*} (t), \dots, ε_{n}^{*} (t))}_{t \in N}^{⊤}$ being $F$-adapted. The following assumptions are in place throughout the paper.

Assumption 1.

We assume that the spectrum of matrix β is a subset of ${(- 1, 1)}^{n}$ and that matrix $Σ^{\frac{1}{2}}$ is non-singular. Moreover, for each $t \in N$, we assume that $ε^{*} (t)$ is independent of $F (t - 1)$ under $P^{*}$ and has standard normal distribution $ε^{*} (t) \overset{P^{*}}{\sim} N (0, 1)$.

Remark.

In Assumption 1, the condition on matrix β ensures that $1 - β$ is invertible and that the geometric series generated by β converges. The condition on $Σ^{\frac{1}{2}}$ ensures that $Σ = Σ^{\frac{1}{2}} {(Σ^{\frac{1}{2}})}^{⊤}$ is symmetric positive definite. Under Assumption 1, Equation (2) defines a stationary process; see [6], Section 11.3.

The model defined by Equations (1) and (2) is called the discrete-time multifactor Vasiček model. Under the above model assumptions, we have for $m > t$:

X (m) | F (t) \overset{P^{*}}{\sim} N ({(1 - β)}^{- 1} (1 - β^{m - t}) b + β^{m - t} X (t), \sum_{s = 0}^{m - t - 1} β^{s} Σ {(β^{⊤})}^{s}) .

(3)

Remark.

For $m > t$, the conditional distribution of $X (m)$, given $F (t)$, depends only on the value $X (t)$ at time $t Δ$ and on lag $m - t$. In other words, the factor process (2) is a time-homogeneous Markov process.
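The conditional moments in Equation (3) can be evaluated directly. The following is a minimal NumPy sketch; the function name `conditional_moments` and all parameter values are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def conditional_moments(b, beta, sigma, x_t, lag):
    """Mean and covariance of X(m) given X(t) = x_t, with lag = m - t >= 1."""
    n = len(b)
    eye = np.eye(n)
    beta_pow = np.linalg.matrix_power(beta, lag)
    # Mean: (1 - beta)^{-1} (1 - beta^lag) b + beta^lag x_t
    mean = np.linalg.solve(eye - beta, (eye - beta_pow) @ b) + beta_pow @ x_t
    # Covariance: sum_{s=0}^{lag-1} beta^s Sigma (beta^T)^s
    cov = np.zeros((n, n))
    power = np.eye(n)
    for _ in range(lag):
        cov += power @ sigma @ power.T
        power = power @ beta
    return mean, cov

# Illustrative two-factor parameters (assumed, not estimated from data).
b = np.array([0.0004, 0.0001])
beta = np.array([[0.95, 0.02], [0.00, 0.80]])   # eigenvalues 0.95 and 0.80
sigma_half = np.array([[0.002, 0.0], [0.001, 0.003]])
sigma = sigma_half @ sigma_half.T
x_t = np.array([0.010, 0.005])

mean_5, cov_5 = conditional_moments(b, beta, sigma, x_t, lag=5)
```

Because the result depends on `x_t` and `lag` only, the same call serves every pair $(t, m)$ with the same lag, which is the time-homogeneity noted in the remark.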

At time $t Δ$, the price of the zero-coupon bond (ZCB) with maturity date $m Δ > t Δ$ with respect to filtration $F$ and equivalent martingale measure $P^{*}$ is given by:

P (t, m) = E^{*} [\frac{B (t)}{B (m)}| F (t)] = E^{*} [exp \{- Δ \sum_{s = t}^{m - 1} 1^{⊤} X (s)\}| F (t)] .

For the proof of the following result, see Appendix A.

Theorem 2.

The ZCB prices in the discrete-time multifactor Vasiček model (1) and (2) with respect to filtration $F$ and equivalent martingale measure $P^{*}$ have an affine term structure:

P (t, m) = e^{A (t, m) - B {(t, m)}^{⊤} X (t)}, m > t,

with $A (m - 1, m) = 0$, $B (m - 1, m) = 1 Δ$ and for $m - 1 > t \geq 0$:

\begin{matrix} A (t, m) & = A (t + 1, m) - B {(t + 1, m)}^{⊤} b + \frac{1}{2} B {(t + 1, m)}^{⊤} Σ B (t + 1, m), \\ B (t, m) & = {(1 - β^{⊤})}^{- 1} (1 - {(β^{⊤})}^{m - t}) 1 Δ . \end{matrix}

In the discrete-time multifactor Vasiček model (1) and (2), the term structure of interest rates (yield curve) takes the following form at time $t Δ$ for maturity dates $m Δ > t Δ$:

Y (t, m) = - \frac{1}{(m - t) Δ} log P (t, m) = - \frac{A (t, m)}{(m - t) Δ} + \frac{B {(t, m)}^{⊤} X (t)}{(m - t) Δ},

(4)

with the spot rate at time $t Δ$ given by $Y (t, t + 1) = 1^{⊤} X (t) = r (t)$.
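As an illustration of Theorem 2 and Equation (4), the coefficients $A (t, m)$ and $B (t, m)$ and the resulting yield curve can be computed as sketched below. The function names (`vasicek_B`, `vasicek_A`, `vasicek_yield`), the monthly grid and the parameter values are assumptions made for this example only. Both coefficients depend on $t$ and $m$ only through the lag $m - t$, so the functions take the lag as argument.

```python
import numpy as np

def vasicek_B(beta, lag, delta):
    """B(t, m) of Theorem 2 for lag = m - t >= 1."""
    n = beta.shape[0]
    eye = np.eye(n)
    beta_t = beta.T
    rhs = (eye - np.linalg.matrix_power(beta_t, lag)) @ np.ones(n)
    return np.linalg.solve(eye - beta_t, rhs) * delta

def vasicek_A(b, beta, sigma, lag, delta):
    """A(t, m) for lag = m - t >= 1, accumulated from A(m - 1, m) = 0."""
    a = 0.0
    for j in range(1, lag):  # j is the lag of B(t + 1, m) in each recursion step
        B_next = vasicek_B(beta, j, delta)
        a += -B_next @ b + 0.5 * B_next @ sigma @ B_next
    return a

def vasicek_yield(b, beta, sigma, x, lag, delta):
    """Y(t, m) from Equation (4) for factor value x and lag = m - t."""
    A = vasicek_A(b, beta, sigma, lag, delta)
    B = vasicek_B(beta, lag, delta)
    return (-A + B @ x) / (lag * delta)

# Illustrative two-factor parameters on a monthly grid (assumed values).
delta = 1.0 / 12.0
b = np.array([0.0004, 0.0001])
beta = np.array([[0.95, 0.02], [0.00, 0.80]])
sigma_half = np.array([[0.002, 0.0], [0.001, 0.003]])
sigma = sigma_half @ sigma_half.T
x = np.array([0.010, 0.005])

# Yield curve for times to maturity of 1, ..., 120 months.
curve = np.array([vasicek_yield(b, beta, sigma, x, lag, delta)
                  for lag in range(1, 121)])
```

The first entry of `curve` reproduces the spot rate $1^{⊤} X (t)$, as stated after Equation (4).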

2.3. Hull–White Extended Discrete-Time Multifactor Vasiček Model

The possible shapes of the Vasiček yield curve (4) are restricted by the choice of the parameters $b \in R^{n}$, $β \in R^{n \times n}$ and $Σ \in R^{n \times n}$. These parameters are not sufficiently flexible to exactly calibrate the model to an arbitrary observed initial yield curve. Therefore, we consider the Hull–White extended version (see [7]) of the discrete-time multifactor Vasiček model. We replace the factor process defined in (2) as follows. For fixed $k \in N_{0}$, let $X^{(k)}$ satisfy:

X^{(k)} (t) = b + θ (t - k) e_{1} + β X^{(k)} (t - 1) + Σ^{\frac{1}{2}} ε^{*} (t), t > k,

(5)

with starting factor $X^{(k)} (k) \in R^{n}$, $e_{1} = {(1, 0, \dots, 0)}^{⊤} \in R^{n}$ and function $θ : N \to R$. Model assumption (5) corresponds to (2), where the first component of $b$ is replaced by the time-dependent coefficient ${(b_{1} + θ (i))}_{i \in N}$ and all other terms ceteris paribus. Without loss of generality, we choose the first component for this replacement. Note that parameter $b_{1}$ is redundant in this model specification, but for didactical reasons, it is used below. The time-dependent coefficient θ is called the Hull–White extension, and it is used to calibrate the model to a given yield curve at a given time point $k Δ$. The upper index $^{(k)}$ denotes that time point and corresponds to the time shift we apply to the Hull–White extension θ in Model (5). The factor process $X^{(k)}$ generates the spot rate process and the bank account process as in (1).

The model defined by (1) and (5) is called the Hull–White extended discrete-time multifactor Vasiček model. Under these model assumptions, we have for $m > t \geq k$:

X^{(k)} (m) | F (t) \overset{P^{*}}{\sim} N (\sum_{s = 0}^{m - t - 1} β^{s} (b + θ (m - s - k) e_{1}) + β^{m - t} X^{(k)} (t), \sum_{s = 0}^{m - t - 1} β^{s} Σ {(β^{⊤})}^{s}) .

Remark.

For $m > t \geq k$, the conditional distribution of $X^{(k)} (m)$, given $F (t)$, depends only on the factor $X^{(k)} (t)$ at time $t Δ$. In this case, factor process (5) is a time-inhomogeneous Markov process. Note that the upper index $^{(k)}$ in the notation is important since the conditional distribution depends explicitly on the lag $m - k$.

Theorem 3.

The ZCB prices in the Hull–White extended discrete-time multifactor Vasiček model (1) and (5) with respect to filtration $F$ and equivalent martingale measure $P^{*}$ have affine term structure:

P^{(k)} (t, m) = e^{A^{(k)} (t, m) - B {(t, m)}^{⊤} X^{(k)} (t)}, m > t \geq k,

with $B (t, m)$ as in Theorem 2, $A^{(k)} (m - 1, m) = 0$ and for $m - 1 > t \geq k$:

\begin{matrix} A^{(k)} (t, m) & = A^{(k)} (t + 1, m) - B {(t + 1, m)}^{⊤} (b + θ (t + 1 - k) e_{1}) \\ & \quad + \frac{1}{2} B {(t + 1, m)}^{⊤} Σ B (t + 1, m) . \end{matrix}

In the Hull–White extended discrete-time multifactor Vasiček model (1) and (5), the yield curve takes the following form at time $t Δ$ for maturity dates $m Δ > t Δ \geq k Δ$:

Y^{(k)} (t, m) = - \frac{1}{(m - t) Δ} log P^{(k)} (t, m) = - \frac{A^{(k)} (t, m)}{(m - t) Δ} + \frac{B {(t, m)}^{⊤} X^{(k)} (t)}{(m - t) Δ},

(6)

with spot rate at time $t Δ$ given by $Y^{(k)} (t, t + 1) = 1^{⊤} X^{(k)} (t)$.

Remark.

Note that the coefficient $B (t, m)$ in Theorem 3 is not affected by the Hull–White extension θ and depends solely on $m - t$, whereas the coefficient $A^{(k)} (t, m)$ depends explicitly on the Hull–White extension θ.

2.4. Calibration of the Hull–White Extended Model

We consider the term structure model defined by the Hull–White extended factor process $X^{(k)}$ and calibrate the Hull–White extension $θ \in R^{N}$ to a given yield curve at time point $k Δ$. We explicitly introduce the time index k in Model (5) because the CRC algorithm is a concatenation of multiple Hull–White extended models, which are calibrated at different time points $k Δ$; see Section 3 below.

Assume that there is a fixed final time to maturity date $M Δ$ and that we observe at time $k Δ$ the yield curve $\hat{y} (k) \in R^{M}$ for maturity dates $(k + 1) Δ, \dots, (k + M) Δ$. For these maturity dates, the Hull–White extended discrete-time multifactor Vasiček yield curve at time $k Δ$, given by Theorem 3, reads as:

y^{(k)} (k) = {(- \frac{1}{i Δ} A^{(k)} (k, k + i) + \frac{1}{i Δ} B {(k, k + i)}^{⊤} X^{(k)} (k))}_{i = 1, \dots, M}^{⊤} \in R^{M} .

For given starting factor $X^{(k)} (k) \in R^{n}$ and parameters $b \in R^{n}$, $β \in R^{n \times n}$ and $Σ \in R^{n \times n}$, our aim is to choose the Hull–White extension $θ \in R^{N}$ such that we get an exact fit at time $k Δ$ to the yield curve $\hat{y} (k)$, that is,

y^{(k)} (k) = \hat{y} (k) .

(7)

The following theorem provides an equivalent condition to (7), which allows one to calculate the Hull–White extension $θ \in R^{N}$ explicitly.

Theorem 4.

Denote by $y^{(k)} (k)$ the yield curve at time $k Δ$ obtained from the Hull–White extended discrete-time multifactor Vasiček model (1) and (5) for given starting factor $X^{(k)} (k) = x \in R^{n}$, parameters $b \in R^{n}$, $β \in R^{n \times n}$ and $Σ \in R^{n \times n}$ and Hull–White extension $θ \in R^{N}$. For given $y \in R^{M}$, identity $y^{(k)} (k) = y$ holds if and only if the Hull–White extension θ fulfills:

θ = C {(β)}^{- 1} z (b, β, Σ, x, y),

(8)

where $θ = {(θ_{i})}_{i = 1, \dots, M - 1}^{⊤} \in R^{M - 1}$, $C (β) = {(C_{i j} (β))}_{i, j = 1, \dots, M - 1} \in R^{(M - 1) \times (M - 1)}$ and $z (b, β, Σ, x, y) = {(z_{i} (b, β, Σ, x, y))}_{i = 1, \dots, M - 1}^{⊤} \in R^{M - 1}$ are defined by:

\begin{matrix} θ_{i} & = θ (i), \\ C_{i j} (β) & = B_{1} (k + j, k + i + 1) 1_{\{j \leq i\}}, \\ z_{i} (b, β, Σ, x, y) & = \sum_{s = k + 1}^{k + i} (\frac{1}{2} B {(s, k + i + 1)}^{⊤} Σ B (s, k + i + 1) - B {(s, k + i + 1)}^{⊤} b) \\ & \quad - 1^{⊤} (1 - β^{i + 1}) {(1 - β)}^{- 1} x Δ + (i + 1) y_{i + 1} (k) Δ, \end{matrix}

with $i, j = 1, \dots, M - 1$ and $B (\cdot, \cdot) = {(B_{1} (\cdot, \cdot), \dots, B_{n} (\cdot, \cdot))}^{⊤}$ given by Theorem 2.

Theorem 4 shows that the Hull–White extension can be calculated by inverting the $(M - 1) \times (M - 1)$ lower triangular positive definite matrix $C (β)$.
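The calibration in Theorem 4 amounts to one linear solve. The sketch below builds $C (β)$ and $z$ entry by entry and solves for θ; the names `B_coef` and `calibrate_theta`, the target curve `y_hat` and all parameter values are hypothetical choices for illustration. A general solver (`np.linalg.solve`) is used for simplicity, although the lower triangular structure would also permit forward substitution. The first target yield is set equal to $1^{⊤} x$, since the spot rate is fixed by the starting factor and cannot be adjusted through θ.

```python
import numpy as np

def B_coef(beta, lag, delta):
    """B(t, m) of Theorem 2 for lag = m - t >= 1."""
    n = beta.shape[0]
    rhs = (np.eye(n) - np.linalg.matrix_power(beta.T, lag)) @ np.ones(n)
    return np.linalg.solve(np.eye(n) - beta.T, rhs) * delta

def calibrate_theta(b, beta, sigma, x, y, delta):
    """Solve C(beta) theta = z for theta(1), ..., theta(M - 1), see Equation (8)."""
    M = len(y)
    C = np.zeros((M - 1, M - 1))
    z = np.zeros(M - 1)
    for i in range(1, M):                      # row index i = 1, ..., M - 1
        for j in range(1, i + 1):              # j <= i (lower triangular part)
            # B(k + j, k + i + 1) depends only on the lag i + 1 - j
            Bj = B_coef(beta, i + 1 - j, delta)
            C[i - 1, j - 1] = Bj[0]            # first component B_1
            z[i - 1] += 0.5 * Bj @ sigma @ Bj - Bj @ b
        # -1^T (1 - beta^{i+1}) (1 - beta)^{-1} x delta equals -B(k, k+i+1)^T x
        z[i - 1] += (i + 1) * y[i] * delta - B_coef(beta, i + 1, delta) @ x
    return np.linalg.solve(C, z)

# Illustrative inputs on a monthly grid (assumed values).
delta, M = 1.0 / 12.0, 24
b = np.array([0.0004, 0.0001])
beta = np.array([[0.95, 0.02], [0.00, 0.80]])
sigma_half = np.array([[0.002, 0.0], [0.001, 0.003]])
sigma = sigma_half @ sigma_half.T
x = np.array([0.010, 0.005])

# Hypothetical observed curve; its first entry equals the spot rate 1^T x.
tau = delta * np.arange(1, M + 1)
y_hat = x.sum() + 0.01 * (1.0 - np.exp(-(tau - delta)))

theta = calibrate_theta(b, beta, sigma, x, y_hat, delta)
```

Here `theta[i - 1]` holds $θ (i)$; plugging it into the recursion of Theorem 3 should reproduce `y_hat` up to floating-point error.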

3. Consistent Re-Calibration

The crucial extension now is the following: we let parameters $b$, β and Σ vary over time, and we re-calibrate the Hull–White extension in a consistent way at each time point, that is, according to the actual choice of the parameter values using Theorem 4. Below, we show that this naturally leads to a Heath–Jarrow–Morton [3] (HJM) approach to term structure modeling.

3.1. Consistent Re-Calibration Algorithm

Assume that ${(b (k))}_{k \in N_{0}}$, ${(β (k))}_{k \in N_{0}}$ and ${(Σ (k))}_{k \in N_{0}}$ are $F$-adapted parameter processes with $β (k)$ and $Σ (k)$ satisfying Assumption 1, $P^{*}$-a.s., for all $k \in N_{0}$. Based on these parameter processes, we define the n-dimensional $F$-adapted CRC factor process $X$, which evolves according to Steps 3.1.1–3.1.3 of the CRC algorithm described below. Thus, factor process $X$ will define a spot rate model similar to (1).

In the CRC algorithm, Steps 3.1.1–3.1.3 below are executed iteratively.

3.1.1. Initialization $k = 0$

Assume that the initial yield curve observation at Time 0 is given by $\hat{y} (0) \in R^{M}$. Let $θ^{(0)} \in R^{N}$ be an $F (0)$-measurable Hull–White extension, such that condition (7) is satisfied at Time 0 for initial factor $X (0) \in R^{n}$ and parameters $b (0)$, $β (0)$ and $Σ (0)$. By Theorem 4, the values $θ^{(0)} = {(θ^{(0)} (i))}_{i = 1, \dots, M - 1} \in R^{M - 1}$ are given by:

θ^{(0)} = C {(β (0))}^{- 1} z (b (0), β (0), Σ (0), X (0), \hat{y} (0)) .

This provides Hull–White extended Vasiček yield curve $y^{(0)} (0)$ identically equal to $\hat{y} (0)$ for given initial factor $X (0)$ and parameters $b (0)$, $β (0)$, $Σ (0)$.

3.1.2. Increments of the Factor Process from $k \to k + 1$

Assume factor $X (k)$, parameters $b (k)$, $β (k)$ and $Σ (k)$ and Hull–White extension $θ^{(k)}$ are given. Define the Hull–White extended model $X^{(k)} = {(X^{(k)} (t))}_{t \geq k}$ by:

X^{(k)} (t) = b (k) + θ^{(k)} (t - k) e_{1} + β (k) X^{(k)} (t - 1) + Σ {(k)}^{\frac{1}{2}} ε^{*} (t), t > k,

(9)

with starting value $X^{(k)} (k) = X (k)$, $F (k)$-measurable parameters $b (k)$, $β (k)$ and $Σ (k)$ and Hull–White extension $θ^{(k)}$. We update the factor process $X$ at time $(k + 1) Δ$ according to the $X^{(k)}$-dynamics, that is, we set:

X (k + 1) = X^{(k)} (k + 1) .

This provides $F (k + 1)$-measurable yield curve at time $(k + 1) Δ$ for maturity dates $m Δ > (k + 1) Δ$:

Y^{(k)} (k + 1, m) = - \frac{A^{(k)} (k + 1, m)}{(m - (k + 1)) Δ} + \frac{B^{(k)} {(k + 1, m)}^{⊤} X (k + 1)}{(m - (k + 1)) Δ},

with $A^{(k)} (m - 1, m) = 0$ and $B^{(k)} (m - 1, m) = Δ 1$, and recursively for $m - 1 > t \geq k$:

\begin{matrix} A^{(k)} (t, m) & = A^{(k)} (t + 1, m) - B^{(k)} {(t + 1, m)}^{⊤} (b (k) + θ^{(k)} (t + 1 - k) e_{1}) \\ & \quad + \frac{1}{2} B^{(k)} {(t + 1, m)}^{⊤} Σ (k) B^{(k)} (t + 1, m), \\ B^{(k)} (t, m) & = {(1 - β {(k)}^{⊤})}^{- 1} (1 - {(β {(k)}^{⊤})}^{m - t}) 1 Δ . \end{matrix}

This is exactly the no-arbitrage price under $P^{*}$ if the parameters $b (k)$, $β (k)$ and $Σ (k)$ and the Hull–White extension $θ^{(k)}$ remain constant for all $t > k$.

3.1.3. Parameter Update and Re-Calibration at $k + 1$

Assume that at time $(k + 1) Δ$, the parameters $(b (k), β (k), Σ (k))$ are updated to $(b (k + 1), β (k + 1), Σ (k + 1))$. We may think of this parameter update as a consequence of model selection after we observe a new yield curve at time $(k + 1) Δ$. This is discussed in more detail in Section 5 below. The no-arbitrage yield curve at time $(k + 1) Δ$ from the model with parameters $(b (k), β (k), Σ (k))$ and Hull–White extension $θ^{(k)}$ is given by:

y^{(k)} (k + 1) = {(Y^{(k)} (k + 1, k + 2), \dots, Y^{(k)} (k + 1, k + 1 + M))}^{⊤} \in R^{M} .

The parameter update $(b (k), β (k), Σ (k)) \mapsto (b (k + 1), β (k + 1), Σ (k + 1))$ requires re-calibration of the Hull–White extension; otherwise, arbitrage is introduced into the model. This re-calibration provides $F (k + 1)$-measurable Hull–White extension $θ^{(k + 1)} \in R^{N}$ at time $(k + 1) Δ$. The values $θ^{(k + 1)} = {(θ^{(k + 1)} (i))}_{i = 1, \dots, M - 1} \in R^{M - 1}$ are given by (see Theorem 4):

θ^{(k + 1)} = C {(β (k + 1))}^{- 1} z (b (k + 1), β (k + 1), Σ (k + 1), X (k + 1), y^{(k)} (k + 1)),

(10)

and the resulting yield curve $y^{(k + 1)} (k + 1)$ under the updated parameters is identically equal to $y^{(k)} (k + 1)$. Note that this CRC makes the upper index $(k)$ in the yield curve superfluous, because the Hull–White extension is re-calibrated to the new parameters, such that the resulting yield curve remains unchanged. Therefore, we write $Y (k, \cdot)$ in the sequel for the CRC yield curve with factor $X (k)$, parameters $b (k)$, $β (k)$, $Σ (k)$ and Hull–White extension $θ^{(k)}$.

(End of algorithm.)
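One pass through Steps 3.1.1–3.1.3 can be sketched as follows. The helper names (`B_coef`, `calibrate_theta`, `hw_yield_curve`), the initial curve and the two parameter sets are assumptions for illustration only. The curve is shortened by one maturity per step, which is one of the options mentioned in the remark on implementation that follows.

```python
import numpy as np

def B_coef(beta, lag, delta):
    """B(t, m) of Theorem 2 for lag = m - t >= 1."""
    n = beta.shape[0]
    rhs = (np.eye(n) - np.linalg.matrix_power(beta.T, lag)) @ np.ones(n)
    return np.linalg.solve(np.eye(n) - beta.T, rhs) * delta

def calibrate_theta(b, beta, sigma, x, y, delta):
    """Hull-White extension theta(1), ..., theta(len(y) - 1) from Theorem 4."""
    M = len(y)
    C, z = np.zeros((M - 1, M - 1)), np.zeros(M - 1)
    for i in range(1, M):
        for j in range(1, i + 1):
            Bj = B_coef(beta, i + 1 - j, delta)
            C[i - 1, j - 1] = Bj[0]
            z[i - 1] += 0.5 * Bj @ sigma @ Bj - Bj @ b
        z[i - 1] += (i + 1) * y[i] * delta - B_coef(beta, i + 1, delta) @ x
    return np.linalg.solve(C, z)

def hw_yield_curve(b, beta, sigma, theta, x, delta):
    """Yields of Theorem 3 for times to maturity 1, ..., len(theta) + 1."""
    y = np.zeros(len(theta) + 1)
    for i in range(1, len(theta) + 2):
        A = 0.0
        for s in range(1, i):
            Bs = B_coef(beta, i - s, delta)
            A += -Bs @ b - Bs[0] * theta[s - 1] + 0.5 * Bs @ sigma @ Bs
        y[i - 1] = (-A + B_coef(beta, i, delta) @ x) / (i * delta)
    return y

delta = 1.0 / 12.0
e1 = np.array([1.0, 0.0])
rng = np.random.default_rng(seed=0)

# Step 3.1.1: calibrate theta^(0) to an illustrative initial curve.
b0 = np.array([0.0004, 0.0001])
beta0 = np.array([[0.95, 0.02], [0.00, 0.80]])
sig_half0 = np.array([[0.002, 0.0], [0.001, 0.003]])
sigma0 = sig_half0 @ sig_half0.T
x0 = np.array([0.010, 0.005])
tau = delta * np.arange(1, 13)
y0 = x0.sum() + 0.01 * (1.0 - np.exp(-(tau - delta)))
theta0 = calibrate_theta(b0, beta0, sigma0, x0, y0, delta)

# Step 3.1.2: move the factor by one step under the model calibrated at k = 0,
# then price the curve at k = 1 with the old parameters (theta shifted by one).
x1 = b0 + theta0[0] * e1 + beta0 @ x0 + sig_half0 @ rng.standard_normal(2)
y1_old = hw_yield_curve(b0, beta0, sigma0, theta0[1:], x1, delta)

# Step 3.1.3: switch to new (hypothetical) parameters and re-calibrate theta.
b1 = np.array([0.0006, 0.0001])
beta1 = np.array([[0.93, 0.02], [0.00, 0.85]])
sig_half1 = 1.2 * sig_half0
sigma1 = sig_half1 @ sig_half1.T
theta1 = calibrate_theta(b1, beta1, sigma1, x1, y1_old, delta)
y1_new = hw_yield_curve(b1, beta1, sigma1, theta1, x1, delta)
```

Under these assumptions, `y1_new` should coincide with `y1_old` up to floating-point error: the parameter change is absorbed entirely by the re-calibrated Hull–White extension `theta1`.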

Remark.

For the implementation of the above algorithm, we need to consider the following issue. Assume we start the algorithm at Time 0 with initial yield curve $\hat{y} (0) \in R^{M}$. At times $k Δ$, for $k > 0$, calibration of $θ^{(k)} \in R^{M - 1}$ requires yields with times to maturity beyond $M Δ$. Either yields for these times to maturity are observable, and the length of $θ^{(k)}$ is reduced in every step of the CRC algorithm or an appropriate extrapolation method beyond the latest available maturity date is applied in every step.

3.2. Heath–Jarrow–Morton Representation

We analyze the yield curve dynamics ${(Y (k, \cdot))}_{k \in N_{0}}$ obtained by the CRC algorithm of Section 3.1. Due to re-calibration (10), the yield curve fulfills the following identity for $m > k + 1$:

\begin{matrix} Y (k + 1, m) & = - \frac{A^{(k)} (k + 1, m)}{(m - (k + 1)) Δ} + \frac{B^{(k)} {(k + 1, m)}^{⊤} X (k + 1)}{(m - (k + 1)) Δ} \\ & = - \frac{A^{(k + 1)} (k + 1, m)}{(m - (k + 1)) Δ} + \frac{B^{(k + 1)} {(k + 1, m)}^{⊤} X (k + 1)}{(m - (k + 1)) Δ}, \end{matrix}

(11)

where the first line is based on the $F (k)$-measurable parameters $(b (k), β (k), Σ (k))$ and Hull–White extension $θ^{(k)}$, and the second line is based on the $F (k + 1)$-measurable parameters and Hull–White extension $(b (k + 1), β (k + 1), Σ (k + 1), θ^{(k + 1)})$ after the re-calibration step of Section 3.1.3. Note that in the re-calibration only $(b (k + 1), β (k + 1), Σ (k + 1))$ can be chosen exogenously, and the Hull–White extension $θ^{(k + 1)}$ is used for consistency property (10). Our aim is to express $Y (k + 1, m)$ as a function of $X (k)$ and $Y (k, m)$. Using Equations (9) and (11), we have for $m > k + 1$:

\begin{matrix} Y (k + 1, m) (m - (k + 1)) Δ & = - A^{(k)} (k + 1, m) \\ & \quad + B^{(k)} {(k + 1, m)}^{⊤} (b (k) + θ^{(k)} (1) e_{1} + β (k) X (k) + Σ {(k)}^{\frac{1}{2}} ε^{*} (k + 1)) . \end{matrix}

(12)

This provides the following theorem; see Appendix A for the proof.

Theorem 5.

Under equivalent martingale measure $P^{*}$, the yield curve dynamics ${(Y (k, \cdot))}_{k \in N_{0}}$ obtained by the CRC algorithm of Section 3.1 has the following HJM representation for $m > k + 1$:

\begin{matrix} Y (k + 1, m) (m - (k + 1)) Δ & = Y (k, m) (m - k) Δ - Y (k, k + 1) Δ \\ & \quad + \frac{1}{2} B^{(k)} {(k + 1, m)}^{⊤} Σ (k) B^{(k)} (k + 1, m) \\ & \quad + B^{(k)} {(k + 1, m)}^{⊤} Σ {(k)}^{\frac{1}{2}} ε^{*} (k + 1), \end{matrix}

with $B^{(k)} (k + 1, m) = {(1 - β {(k)}^{⊤})}^{- 1} (1 - {(β {(k)}^{⊤})}^{m - k - 1}) 1 Δ$.

Key observation.

Observe that in Theorem 5, a remarkable simplification happens. Simulating the CRC algorithm (9) and (10) to future time points $k Δ > 0$ does not require the calculation of the Hull–White extensions ${(θ^{(k)})}_{k \in N_{0}}$ according to (10), but the knowledge of the parameter process ${(b (k), β (k), Σ (k))}_{k \in N_{0}}$ is sufficient. The Hull–White extensions are fully encoded in the yield curve process ${(Y (k, \cdot))}_{k \in N_{0}}$, and we can avoid the inversion of (potentially) high dimensional matrices ${(C (β (k)))}_{k \in N_{0}}$.
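The simplification can be made concrete with a short simulation sketch based on Theorem 5 alone: the curve is propagated directly, and neither θ nor $C (β (k))$ appears. The function name `crc_hjm_step`, the initial curve and the slowly drifting parameter path are hypothetical choices for illustration. Each step drops the shortest maturity, so the curve loses one entry per step.

```python
import numpy as np

def B_coef(beta, lag, delta):
    """B(t, m) of Theorem 2 for lag = m - t >= 1."""
    n = beta.shape[0]
    rhs = (np.eye(n) - np.linalg.matrix_power(beta.T, lag)) @ np.ones(n)
    return np.linalg.solve(np.eye(n) - beta.T, rhs) * delta

def crc_hjm_step(y, beta, sigma_half, eps, delta):
    """Map the curve Y(k, k+1), ..., Y(k, k+M) to Y(k+1, k+2), ..., Y(k+1, k+M)."""
    sigma = sigma_half @ sigma_half.T
    y_next = np.zeros(len(y) - 1)
    for j in range(1, len(y)):                 # j = m - (k + 1), new time to maturity
        Bj = B_coef(beta, j, delta)
        # Y(k, m) (m - k) delta - Y(k, k + 1) delta, with m - k = j + 1
        rhs = (y[j] * (j + 1) - y[0]) * delta
        # Convexity term and Gaussian shock of Theorem 5
        rhs += 0.5 * Bj @ sigma @ Bj + Bj @ sigma_half @ eps
        y_next[j - 1] = rhs / (j * delta)
    return y_next

rng = np.random.default_rng(seed=1)
delta = 1.0 / 12.0
# Illustrative initial curve for times to maturity of 1, ..., 60 months.
y = 0.01 + 0.002 * np.sqrt(delta * np.arange(1, 61))

for k in range(12):
    # Hypothetical parameter path: beta(k) and Sigma(k)^{1/2} drift slowly.
    beta_k = np.array([[0.95 - 0.001 * k, 0.02], [0.00, 0.80]])
    sigma_half_k = (1.0 + 0.05 * k) * np.array([[0.002, 0.0], [0.001, 0.003]])
    y = crc_hjm_step(y, beta_k, sigma_half_k, rng.standard_normal(2), delta)
```

Note that $b (k)$ does not enter the step under $P^{*}$; only $β (k)$ and $Σ (k)$ are needed, in line with the representation in Theorem 5.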

Further remarks.

CRC of the multifactor Vasiček spot rate model can be defined directly in the HJM framework assuming a stochastic dynamics for the parameters. However, solely from the HJM representation, one cannot see that the yield curve dynamics is obtained, in our case, by combining well-understood Hull–White extended multifactor Vasiček spot rate models using the CRC algorithm of Section 3; that is, the Hull–White extended multifactor Vasiček model gives an explicit functional form to the HJM representation.

The CRC algorithm of Section 3 does not rely directly on ${(ε^{*} (t))}_{t \in N}$ having independent and Gaussian components. The CRC algorithm is feasible as long as explicit formulas for ZCB prices in the Hull–White extended model are available. Therefore, one may replace the Gaussian innovations by other distributional assumptions, such as normal variance mixtures. This replacement is possible provided that conditional exponential moments can be calculated under the new innovation assumption. Under non-Gaussian innovations, however, the HJM representation will in general depend on the Hull–White extension $θ^{(k)} \in R^{N}$.

Interpretation of the parameter processes will be given in Section 5, below.

4. Real World Dynamics and Market Price of Risk

All previous derivations were done under an equivalent martingale measure $P^{*}$ for the bank account numeraire. In order to statistically estimate parameters from market data, we need to specify a Girsanov transformation to the real-world measure, which is denoted by $P$. We present a specific change of measure, which provides tractable spot rate dynamics under $P$. Assume that ${(λ (k))}_{k \in N_{0}}$ and ${(Λ (k))}_{k \in N_{0}}$ are $R^{n}$- and $R^{n \times n}$-valued $F$-adapted processes, respectively. Let ${(X (k))}_{k \in N_{0}}$ be the factor process obtained by the CRC algorithm of Section 3.1. Then, we assume that the n-dimensional $F$-adapted process ${(λ (k) + Λ (k) X (k))}_{k \in N_{0}}$ describes the market price of risk dynamics. We define the following $P^{*}$-density process ${(ξ (k))}_{k \in N_{0}}$:

ξ (k) = exp \{- \frac{1}{2} \sum_{s = 0}^{k - 1} {∥λ (s) + Λ (s) X (s)∥}_{2}^{2} - \sum_{s = 0}^{k - 1} {(λ (s) + Λ (s) X (s))}^{⊤} ε^{*} (s + 1)\}, k \in N_{0} .

The real-world probability measure $P$ is then defined by the Radon–Nikodym derivative:

{\frac{d P}{d P^{*}}|}_{F (k)} = ξ (k), k \in N_{0} .

(13)

An immediate consequence is that for $k \in N_{0}$:

ε (k + 1) = λ (k) + Λ (k) X (k) + ε^{*} (k + 1),

has a standard Gaussian distribution under $P$, conditionally on $F (k)$. This implies that under the real-world measure $P$, the factor process ${(X (k))}_{k \in N_{0}}$ is described by:

X (k + 1) = a (k) + α (k) X (k) + Σ {(k)}^{\frac{1}{2}} ε (k + 1),

(14)

where we define:

a (k) = b (k) + θ^{(k)} (1) e_{1} - Σ {(k)}^{\frac{1}{2}} λ (k) and α (k) = β (k) - Σ {(k)}^{\frac{1}{2}} Λ (k) .

(15)

As in Assumption 1, we require $Λ (k)$ to be such that the spectrum of $α (k)$ is a subset of ${(- 1, 1)}^{n}$. Formula (14) describes the dynamics of the factor process ${(X (k))}_{k \in N_{0}}$ obtained by the CRC algorithm of Section 3.1 under real-world measure $P$. The following corollary describes the yield curve dynamics obtained by the CRC algorithm under $P$, in analogy to Theorem 5.
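The mapping (15) from risk-neutral to real-world coefficients, together with the spectral condition on $α (k)$, can be checked numerically as sketched below. The function names and all numerical values, in particular the market price of risk parameters `lam` and `Lam`, are illustrative assumptions.

```python
import numpy as np

def real_world_parameters(b, beta, sigma_half, theta_1, lam, Lam):
    """a(k) and alpha(k) from Equation (15); theta_1 stands for theta^(k)(1)."""
    e1 = np.zeros(len(b))
    e1[0] = 1.0
    a = b + theta_1 * e1 - sigma_half @ lam
    alpha = beta - sigma_half @ Lam
    return a, alpha

def spectral_radius(matrix):
    """Largest absolute eigenvalue; a value below 1 means mean reversion."""
    return np.max(np.abs(np.linalg.eigvals(matrix)))

# Illustrative risk-neutral parameters and market price of risk (assumed values).
b = np.array([0.0004, 0.0001])
beta = np.array([[0.95, 0.02], [0.00, 0.80]])
sigma_half = np.array([[0.002, 0.0], [0.001, 0.003]])
theta_1 = 0.0002
lam = np.array([-0.10, 0.05])
Lam = np.array([[5.0, 0.0], [0.0, -10.0]])

a, alpha = real_world_parameters(b, beta, sigma_half, theta_1, lam, Lam)
radius = spectral_radius(alpha)

# The same shock seen under both measures: eps = lam + Lam x + eps_star.
x = np.array([0.010, 0.005])
eps_star = np.array([0.4, -0.7])
eps = lam + Lam @ x + eps_star
x_next_risk_neutral = b + theta_1 * np.array([1.0, 0.0]) + beta @ x + sigma_half @ eps_star
x_next_real_world = a + alpha @ x + sigma_half @ eps
```

Both update formulas describe the same realization of $X (k + 1)$; only the decomposition into drift and innovation differs between $P^{*}$ and $P$.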

Corollary 6.

Under real-world measure $P$ satisfying (13), the yield curve dynamics ${(Y (k, \cdot))}_{k \in N_{0}}$ obtained by the CRC algorithm of Section 3.1 has the following HJM representation for $m > k + 1$:

\begin{matrix} Y (k + 1, m) (m - (k + 1)) Δ & = Y (k, m) (m - k) Δ - Y (k, k + 1) Δ \\ & \quad + \frac{1}{2} B^{(k)} {(k + 1, m)}^{⊤} Σ (k) B^{(k)} (k + 1, m) \\ & \quad - B^{(k)} {(k + 1, m)}^{⊤} Σ {(k)}^{\frac{1}{2}} λ (k) \\ & \quad - B^{(k)} {(k + 1, m)}^{⊤} Σ {(k)}^{\frac{1}{2}} Λ (k) X (k) \\ & \quad + B^{(k)} {(k + 1, m)}^{⊤} Σ {(k)}^{\frac{1}{2}} ε (k + 1), \end{matrix}

with $B^{(k)} (k + 1, m) = {(1 - β {(k)}^{⊤})}^{- 1} (1 - {(β {(k)}^{⊤})}^{m - k - 1}) 1 Δ$.

Compared to Theorem 5, there are additional drift terms $- B^{(k)} {(k + 1, m)}^{⊤} Σ {(k)}^{\frac{1}{2}} λ (k)$ and $- B^{(k)} {(k + 1, m)}^{⊤} Σ {(k)}^{\frac{1}{2}} Λ (k) X (k)$, which are characterized by the market price of risk parameters $λ (k) \in R^{n}$ and $Λ (k) \in R^{n \times n}$.

5. Choice of Parameter Process

The yield curve dynamics obtained by the CRC algorithm of Section 3.1 require exogenous specification of the parameter process of the multifactor Vasiček model (1) and (2) and the market price of risk process, i.e., we need to model the process:

{(b (t), β (t), Σ (t), λ (t), Λ (t))}_{t \in N_{0}} .

(16)

By Equation (9), the one-step ahead development of the CRC factor process $X$ under $P$ reads as:

X (t + 1) = b (t) + θ^{(t)} (1) e_{1} - Σ {(t)}^{\frac{1}{2}} λ (t) + (β (t) - Σ {(t)}^{\frac{1}{2}} Λ (t)) X (t) + Σ {(t)}^{\frac{1}{2}} ε (t + 1),

(17)

with $F (t)$-measurable parameters $b (t)$, $β (t)$ and $Σ (t)$ and Hull–White extension $θ^{(t)}$. Thus, on the one hand, the factor process ${(X (t))}_{t \in N_{0}}$ evolves according to (17), and on the other hand, parameters ${(b (t), β (t), Σ (t), λ (t), Λ (t))}_{t \in N_{0}}$ evolve according to the financial market conditions. Note that the process ${(θ^{(t)})}_{t \in N_{0}}$ of Hull–White extensions is fully determined through CRC by (10). In order to distinguish the evolutions of ${(X (t))}_{t \in N_{0}}$ and ${(b (t), β (t), Σ (t), λ (t), Λ (t))}_{t \in N_{0}}$, respectively, we assume that process (16) changes at a slower pace than the factor process, and therefore, parameters can be assumed to be constant over a short time window. This assumption motivates the following approach to specifying a model for process (16). For each time point $t Δ$, we fit the multifactor Vasiček model (1) and (2) with fixed parameters $(b, β, Σ, λ, Λ)$ on observations from a time window $\{t - K + 1, \dots, t\}$ of length K. For estimation, we assume that we have yield curve observations ${(\hat{y} (k))}_{k = t - K + 1, \dots, t} = {(({\hat{y}}_{1} (k), \dots, {\hat{y}}_{M} (k)))}_{k = t - K + 1, \dots, t}$ for times to maturity $τ_{1} Δ < \dots < τ_{M} Δ$. Since yield curves are not necessarily observed on a regular time-to-maturity grid, we introduce the indices $τ_{1}, \dots, τ_{M} \in N$ to refer to the available times to maturity. Varying the time of estimation $t Δ$, we obtain time series for the parameters from historical data. Finally, we fit a stochastic model to these time series. In the following, we discuss the interpretation of the parameters and present two different estimation procedures. The two procedures are combined to obtain a full specification of the model parameters.

5.1. Interpretation of Parameters

5.1.1. Level and Speed of Mean Reversion

By Equation (3), we have under $P^{*}$ for $m > t$:

\begin{matrix} E^{*} [X (m) | F (t)] & = {(1 - β)}^{- 1} (1 - β^{m - t}) b + β^{m - t} X (t), \\ E^{*} [r (m) | F (t)] & = 1^{⊤} {(1 - β)}^{- 1} (1 - β^{m - t}) b + 1^{⊤} β^{m - t} X (t) . \end{matrix}

Thus, β determines the speed at which the factor process ${(X (t))}_{t \in N_{0}}$ and the spot rate process ${(r (t))}_{t \in N_{0}}$ return to their long-term means:

lim_{m \to \infty} E^{*} [X (m) | F (t)] = {(1 - β)}^{- 1} b and lim_{m \to \infty} E^{*} [r (m) | F (t)] = 1^{⊤} {(1 - β)}^{- 1} b .

A sensible choice of ${(β (t))}_{t \in N_{0}}$ adapts the speed of mean reversion to the prevailing financial market conditions at each time point $t Δ$.
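As a worked illustration of the speed of mean reversion, consider the one-factor case $n = 1$ with scalar $β \in (0, 1)$. The value $β = 0.99$ on a daily grid $Δ = 1 / 252$ is an assumption chosen for this example and is not taken from the paper. The conditional mean above then decays geometrically towards the long-term mean, and the number of steps $h$ after which half of the initial deviation remains follows from $β^{h} = 1 / 2$:

```latex
E^{*} [r (m) | F (t)] - \frac{b}{1 - β} = β^{m - t} \left( r (t) - \frac{b}{1 - β} \right),
\qquad
h = \frac{\log 2}{- \log β} \approx \frac{0.6931}{0.01005} \approx 69 \text{ steps for } β = 0.99 .
```

With 252 business days per year, 69 daily steps correspond to roughly 0.27 years. Values of β closer to one give slower mean reversion.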

5.1.2. Instantaneous Variance

By Equation (3), we have under $P^{*}$ for $t > 0$:

{Cov}^{*} [X (t) | F (t - 1)] = Σ, and {Var}^{*} [r (t) | F (t - 1)] = 1^{⊤} Σ 1 .

Thus, matrix Σ plays the role of the instantaneous covariance matrix of $X$, and it describes the instantaneous spot rate volatility.

5.2. State Space Modeling Approach

On each time window, we want to use yield curve observations to estimate the parameters of time-homogeneous Vasiček Models (1) and (2). In general, this model is not able to reproduce the yield curve observations exactly. One reason might be that the data are given in the form of parametrized yield curves, and the parametrization might not be compatible with the Vasiček model. For example, this is the case for the widely-used Svensson family [8]. Another reason might be that yield curve observations do not exactly represent risk-free zero-coupon bonds.

The discrepancy between the Vasiček model and the yield curve observations can be accounted for by adding a noise term to the Vasiček yield curves. This defines a state space model with the factor process as the hidden state variable. In this state space model, the parameters of the factor dynamics can be estimated using Kalman filter techniques in conjunction with maximum likelihood estimation ([9] Section 3.6.3). This is explained in detail in Sections 5.2.1–5.2.7 below.

5.2.1. Transition System

The evolution of the unobservable process

X

under

P

is assumed to be given on time window

{t - K + 1, \dots, t}

by:

X (k) = a + α X (k - 1) + Σ^{\frac{1}{2}} ε (k), k \in {t - K + 1, \dots, t},

with initial factor

X (t - K) \in R^{n}

and parameters

a = b - Σ^{\frac{1}{2}} λ

and

α = β - Σ^{\frac{1}{2}} Λ

. The initial factor

X (t - K)

is updated according to the output of the Kalman filter for the previous time window

{t - K, \dots, t - 1}

. The initial factor is set to zero for the first time window available.

Remark.

Parameters

(b, β, Σ, λ, Λ)

are assumed to be constant over the time window

{t - K + 1, \dots, t}

. Thus, we drop the index k compared to Equations (14) and (15). For estimation, we assume that the factor process evolves according to the time-homogeneous multifactor Vasiček Models (1) and (2) in that time window. The Hull–White extension is calibrated to the yield curve at time

t Δ

given the estimated parameter values of the time-homogeneous model.

5.2.2. Measurement System

We assume that the observations in the state space model are given by:

\hat{Y} (k) = d + D X (k) + S^{\frac{1}{2}} η (k), k \in {t - K, \dots, t},

(18)

where:

\begin{matrix} \hat{Y} (k) & = {(\hat{Y} (k, k + τ_{1}), \dots, \hat{Y} (k, k + τ_{M}))}^{⊤} \in R^{M}, \\ d & = {(- {(τ_{1} Δ)}^{- 1} A (k, k + τ_{1}), \dots, - {(τ_{M} Δ)}^{- 1} A (k, k + τ_{M}))}^{⊤} \in R^{M}, \\ D_{i j} & = {(τ_{i} Δ)}^{- 1} B_{j} (k, k + τ_{i}), 1 \leq i \leq M, 1 \leq j \leq n, \end{matrix}

with

A (\cdot, \cdot)

and

B (\cdot, \cdot) = {(B_{1} (\cdot, \cdot), \dots, B_{n} (\cdot, \cdot))}^{⊤}

given by Theorem 2 and M-dimensional

F (k)

-measurable noise term

S^{\frac{1}{2}} η (k)

for non-singular

S^{\frac{1}{2}} \in R^{M \times M}

. We assume that

η (k)

is independent of

F (k - 1)

and

ε (k)

under

P

and that

η (k) \overset{P}{\sim} N (0, 1)

. The error term

S^{\frac{1}{2}} η

describes the discrepancy between the yield curve observations and the model. For

S = 0

, we would obtain a yield curve in (18) that corresponds exactly to the multifactor Vasiček one.

Given the parameter and market price of risk values

(b, β, Σ, λ, Λ)

, we estimate the factor using the following iterative procedure. For

k \in {t - K, \dots, t}

and fixed t, we consider the σ-field

F^{\hat{Y}} (k) = σ (\hat{Y} (s) | t - K \leq s \leq k) \subset F (k)

and describe the estimation procedure in this state space model.

5.2.3. Anchoring

Fix initial factor

X (t - K) = x (t - K | t - K - 1)

, and initialize:

\begin{matrix} x (t - K + 1 | t - K) & = E [X (t - K + 1) | F^{\hat{Y}} (t - K)] = a + α x (t - K | t - K - 1), \\ Σ (t - K + 1 | t - K) & = Cov (X (t - K + 1) | F^{\hat{Y}} (t - K)) = Σ . \end{matrix}

5.2.4. Forecasting the Measurement System

At time

k \in {t - K + 1, \dots, t}

, we have:

\begin{matrix} y (k | k - 1) & = E [\hat{Y} (k) | F^{\hat{Y}} (k - 1)] = d + D x (k | k - 1), \\ F (k) & = Cov (\hat{Y} (k) | F^{\hat{Y}} (k - 1)) = D Σ (k | k - 1) D^{⊤} + S, \\ ζ (k) & = \hat{y} (k) - y (k | k - 1) . \end{matrix}

5.2.5. Bayesian Inference in the Transition System

The prediction error

ζ (k)

is used to update the unobservable factors.

\begin{matrix} x (k | k) & = E [X (k) | F^{\hat{Y}} (k)] = x (k | k - 1) + K (k) ζ (k), \\ Σ (k | k) & = Cov (X (k) | F^{\hat{Y}} (k)) = (1 - K (k) D) Σ (k | k - 1), \end{matrix}

where

K (k)

denotes the Kalman gain matrix given by:

K (k) = Cov (X (k) | F^{\hat{Y}} (k - 1)) D^{⊤} Cov {(\hat{Y} (k) | F^{\hat{Y}} (k - 1))}^{- 1} = Σ (k | k - 1) D^{⊤} F {(k)}^{- 1} .

5.2.6. Forecasting the Transition System

For the unobservable factor process, we have the following forecast:

\begin{matrix} x (k + 1 | k) & = E [X (k + 1) | F^{\hat{Y}} (k)] = a + α x (k | k), \\ Σ (k + 1 | k) & = Cov (X (k + 1) | F^{\hat{Y}} (k)) = α Σ (k | k) α^{⊤} + Σ . \end{matrix}

5.2.7. Likelihood Function

The Kalman filter procedure above allows one to infer factors

X

given the parameter and market price of risk values. Of course, in this section, we are interested in estimating these values in the first place. For this purpose, the procedure above can be used in conjunction with maximum likelihood estimation. For the underlying parameters

Θ = (b, β, Σ, a, α)

, we have the following likelihood function given the observations

{(\hat{y} (k))}_{k = t - K + 1, \dots, t}

:

L_{t} (Θ) = \prod_{k = t - K + 1}^{t} \frac{exp (- \frac{1}{2} ζ {(k)}^{⊤} F {(k)}^{- 1} ζ (k))}{{(2 π)}^{\frac{M}{2}} det F {(k)}^{\frac{1}{2}}} .

(19)

The maximum likelihood estimator (MLE)

{\hat{Θ}}^{MLE} = ({\hat{b}}^{MLE}, {\hat{β}}^{MLE}, {\hat{Σ}}^{MLE}, {\hat{a}}^{MLE}, {\hat{α}}^{MLE})

is found by maximizing the likelihood function

L_{t} (Θ)

over Θ, given the data. As in the EM (expectation maximization) algorithm, maximization of the likelihood function is alternated with Kalman filtering until convergence of the estimated parameters

{\hat{Θ}}^{MLE}

is achieved.
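The filtering steps of Sections 5.2.3–5.2.6 and the likelihood (19) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation used for the numerical example: the function name `kalman_loglik` and its argument layout are hypothetical, the measurement coefficients d and D are passed in as given arrays, and the outer maximization over Θ is left to any numerical optimizer.

```python
import numpy as np

def kalman_loglik(y_obs, x0, a, alpha, Sigma, d, D, S):
    """Return (log L_t, filtered factors) for one time window.

    y_obs : array (K, M), observed yields for k = t-K+1, ..., t
    x0    : array (n,), initial factor x(t-K | t-K-1)
    """
    n, M = len(x0), y_obs.shape[1]
    # Anchoring (Section 5.2.3).
    x_pred = a + alpha @ x0
    P_pred = Sigma.copy()
    loglik, filtered = 0.0, []
    for y in y_obs:
        # Forecasting the measurement system (Section 5.2.4).
        F = D @ P_pred @ D.T + S
        zeta = y - (d + D @ x_pred)
        # Bayesian inference in the transition system (Section 5.2.5).
        gain = P_pred @ D.T @ np.linalg.inv(F)
        x_filt = x_pred + gain @ zeta
        P_filt = (np.eye(n) - gain @ D) @ P_pred
        # Contribution of time k to the logarithm of (19).
        _, logdet = np.linalg.slogdet(F)
        quad = zeta @ np.linalg.solve(F, zeta)
        loglik += -0.5 * (M * np.log(2.0 * np.pi) + logdet + quad)
        # Forecasting the transition system (Section 5.2.6).
        x_pred = a + alpha @ x_filt
        P_pred = alpha @ P_filt @ alpha.T + Sigma
        filtered.append(x_filt)
    return loglik, np.array(filtered)
```

A possible use is to wrap `kalman_loglik` in a function of the free parameters and to pass its negative to a generic minimizer. The filtered factor at the end of the window can then serve as the initial factor of the next window.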

5.3. Estimation Motivated by Continuous Time Modeling

5.3.1. Rescaling the Time Grid

Assume factor process

{(X (t))}_{t \in N_{0}}

is given under

P

by

X (0) \in R^{n}

and for

t > 0

:

X (t) = a + α X (t - 1) + Σ^{\frac{1}{2}} ε (t),

where

a = b - Σ^{\frac{1}{2}} λ

and

α = β - Σ^{\frac{1}{2}} Λ

. Furthermore, assume that α is a diagonalizable matrix with

α = T D T^{- 1}

for

T \in R^{n \times n}

and diagonal matrix

D \in {(- 1, 1)}^{n \times n}

. Then, the transformed process

Z = {(T^{- 1} X (t))}_{t \in N_{0}}

evolves according to:

Z (t) = c + D Z (t - 1) + Ψ^{\frac{1}{2}} ε (t), t > 0,

where

c = T^{- 1} a

and

Ψ = T^{- 1} Σ {(T^{- 1})}^{⊤}

. For

d \in N_{+}

, the d-step ahead conditional distribution of

Z

under

P

is given by:

Z (t + d) | F (t) \overset{P}{\sim} N (μ + γ Z (t), Γ), t \geq 0,

where

μ = {(1 - D)}^{- 1} (1 - D^{d}) c

,

γ = D^{d}

and

Γ = \sum_{s = 0}^{d - 1} D^{s} Ψ D^{s}

. Suppose we have estimated

μ \in R^{n}

, the diagonal matrix

γ \in {(- 1, 1)}^{n}

and

Γ \in R^{n \times n}

on the time grid with size

d Δ

, for instance, using MLE, as explained in Section 5.2. We are interested in recovering the parameters

c

, D and Ψ of the dynamics on the refined time grid with size Δ from μ, γ and Γ.

The diagonal matrix D and vector

c

are reconstructed from the diagonal matrix γ as follows:

\begin{matrix} D & = γ^{\frac{1}{d}} = 1 + \frac{1}{d} log (γ) + o (\frac{1}{d}), as d \to \infty, \\ c & = {(1 - γ)}^{- 1} (1 - γ^{\frac{1}{d}}) μ = \frac{1}{d} {(1 - γ)}^{- 1} log (γ^{- 1}) μ + o (\frac{1}{d}), as d \to \infty, \end{matrix}

where logarithmic and power functions applied to diagonal matrices are defined on their diagonal elements. Note that for

i, j = 1, \dots, n

, we have:

Γ_{i j} = \sum_{s = 0}^{d - 1} γ_{i i}^{\frac{s}{d}} Ψ_{i j} γ_{j j}^{\frac{s}{d}} = Ψ_{i j} \sum_{s = 0}^{d - 1} {(γ_{i i}^{\frac{1}{d}} γ_{j j}^{\frac{1}{d}})}^{s} = Ψ_{i j} \frac{1 - γ_{i i} γ_{j j}}{1 - {(γ_{i i} γ_{j j})}^{\frac{1}{d}}} .

Therefore, we recover Ψ from γ and Γ as follows.

Ψ = \frac{1}{d} υ + o (\frac{1}{d}), as d \to \infty,

where

υ = {(- Γ_{i j} log (γ_{i i} γ_{j j}) {(1 - γ_{i i} γ_{j j})}^{- 1})}_{i, j = 1, \dots, n} \in R^{n \times n}

. Consider for

t > 0

the increments

D_{t} Z = Z (t) - Z (t - 1)

. From the formulas for

c

, D and Ψ, we observe that the

F_{t - 1}

-conditional mean of

D_{t} Z

:

c + (D - 1) Z (t - 1) = - \frac{1}{d} {(1 - γ)}^{- 1} log (γ) μ + \frac{1}{d} log (γ) Z (t - 1) + o (\frac{1}{d}),

and the

F_{t - 1}

-conditional volatility of

D_{t} Z

:

Ψ^{\frac{1}{2}} = \sqrt{\frac{1}{d}} υ^{\frac{1}{2}} + o (\sqrt{\frac{1}{d}}),

live on different scales as

d \to \infty

; in fact, volatility dominates for large d. Under

P

for

t > 0

, we have:

\begin{matrix} E [D_{t} Z {(D_{t} Z)}^{⊤} | F_{t - 1}] = Cov [D_{t} Z, D_{t} Z | F_{t - 1}] + E [D_{t} Z | F_{t - 1}] E {[D_{t} Z | F_{t - 1}]}^{⊤} \\ = Cov [Z (t), Z (t) | F_{t - 1}] + (E [Z (t) | F_{t - 1}] - Z (t - 1)) {(E [Z (t) | F_{t - 1}] - Z (t - 1))}^{⊤} \\ = Ψ + (c + (D - 1) Z (t - 1)) {(c + (D - 1) Z (t - 1))}^{⊤} . \end{matrix}

Therefore, setting

D_{t} X = X (t) - X (t - 1)

, we obtain as

d \to \infty

:

\begin{matrix} E [D_{t} X {(D_{t} X)}^{⊤} | F_{t - 1}] = T E [D_{t} Z {(D_{t} Z)}^{⊤} | F_{t - 1}] T^{⊤} \\ = T Ψ T^{⊤} + T (c + (D - 1) Z (t - 1)) {(c + (D - 1) Z (t - 1))}^{⊤} T^{⊤} \\ = \frac{1}{d} T υ T^{⊤} + o (\frac{1}{d}) = T Ψ T^{⊤} + o (\frac{1}{d}) = Σ + o (\frac{1}{d}) . \end{matrix}

(20)
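The exact (non-asymptotic) relations between the parameters on the coarse grid of size dΔ and on the refined grid of size Δ derived in this subsection can be checked numerically. The following Python sketch stores the diagonal matrices D and γ as vectors of their diagonal elements; the function names `aggregate` and `refine_grid` are hypothetical, and the sketch additionally assumes that all diagonal elements of γ lie in (0, 1), so that the d-th root is real.

```python
import numpy as np

def aggregate(c, D_diag, Psi, d):
    """Map (c, D, Psi) on grid size Delta to (mu, gamma, Gamma) on grid size d * Delta."""
    gamma = D_diag ** d
    mu = (1.0 - gamma) / (1.0 - D_diag) * c
    # Gamma_ij = sum_{s=0}^{d-1} D_ii^s Psi_ij D_jj^s
    Gamma = sum(np.outer(D_diag ** s, D_diag ** s) * Psi for s in range(d))
    return mu, gamma, Gamma

def refine_grid(mu, gamma, Gamma, d):
    """Recover (c, D, Psi) on grid size Delta from (mu, gamma, Gamma)."""
    D_diag = gamma ** (1.0 / d)                  # D = gamma^{1/d}, assumes gamma_ii > 0
    c = (1.0 - D_diag) / (1.0 - gamma) * mu      # c = (1 - gamma)^{-1} (1 - gamma^{1/d}) mu
    gg = np.outer(gamma, gamma)                  # gamma_ii * gamma_jj
    Psi = Gamma * (1.0 - gg ** (1.0 / d)) / (1.0 - gg)
    return c, D_diag, Psi
```

Applying `refine_grid` to the output of `aggregate` returns the original parameters up to floating-point error, which is a convenient sanity check before using the first-order approximations in 1/d.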

5.3.2. Longitudinal Realized Covariations of Yields

We consider the yield curve increments within the discrete-time multifactor Vasiček Models (1) and (2). The increments of the yield process

{(Y (t, t + τ))}_{t \in N_{0}}

for fixed time to maturity

τ Δ > 0

are given by:

\begin{matrix} D_{t, τ} Y & = Y (t, t + τ) - Y (t - 1, t - 1 + τ) \\ = \frac{1}{τ Δ} B {(t, t + τ)}^{⊤} (X (t) - X (t - 1)) = \frac{1}{τ Δ} B {(t, t + τ)}^{⊤} D_{t} X, \end{matrix}

where

D_{t} X | F (t - 1) \overset{P}{\sim} N (a + (α - 1) X (t - 1), Σ)

. For times to maturity

τ_{1} Δ, τ_{2} Δ > 0

, we get under

P

:

E [D_{t, τ_{1}} Y D_{t, τ_{2}} Y | F_{t - 1}] = \frac{1}{τ_{1} τ_{2} Δ^{2}} B {(t, t + τ_{1})}^{⊤} E [D_{t} X {(D_{t} X)}^{⊤} | F_{t - 1}] B (t, t + τ_{2}) .

By Equation (20) for small grid size Δ, we estimate the last expression by:

E [D_{t, τ_{1}} Y D_{t, τ_{2}} Y | F_{t - 1}] \approx \frac{1}{τ_{1} τ_{2}} 1^{⊤} (1 - β^{τ_{1}}) {(1 - β)}^{- 1} Σ {(1 - β^{⊤})}^{- 1} (1 - {(β^{⊤})}^{τ_{2}}) 1 .

(21)

Formula (21) is interesting for the following reasons:

It does not depend on the unobservable factors $X$.
It allows for direct cross-sectional estimation of β and Σ. That is, β and Σ can directly be estimated from market observations without knowing the market-price of risk.
It is helpful to determine the number of factors needed to fit the model to market yield curve increments. This can be analyzed by principal component analysis.
It can also be interpreted as a small-noise approximation for noisy measurement systems of the form (18).

Let

{\hat{y}}_{1} (k)

and

{\hat{y}}_{2} (k)

be market observations for times to maturity

τ_{1} Δ

and

τ_{2} Δ

and at times

k \in {t - K + 1, \dots, t}

, as specified in Section 5.2. Then, the expectation on the left-hand side of (21) can be estimated by the realized covariation:

\hat{RCov} (t, τ_{1}, τ_{2}) = \frac{1}{K} \sum_{k = t - K + 1}^{t} ({\hat{y}}_{1} (k) - {\hat{y}}_{1} (k - 1)) ({\hat{y}}_{2} (k) - {\hat{y}}_{2} (k - 1)) .

(22)

The quality of this estimator hinges on two crucial assumptions. First, higher order terms in (20) are negligible in comparison to Σ. Second, the noise term

S^{\frac{1}{2}} η

in (18) leads to a negligible distortion in the sense that observations

\hat{Y}

are reliable indicators for the underlying Vasiček yield curves.
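Estimator (22) and the model-implied right-hand side of (21) can be computed as in the following Python sketch. The function names are hypothetical, the input series are assumed to contain the K + 1 consecutive observations for k = t − K, …, t, and the times to maturity are assumed to be given as integer multiples τ of the grid size.

```python
import numpy as np

def realized_cov(y1, y2):
    """Realized covariation (22) from two yield series of length K + 1."""
    increments_1 = np.diff(np.asarray(y1, dtype=float))
    increments_2 = np.diff(np.asarray(y2, dtype=float))
    return float(np.mean(increments_1 * increments_2))

def model_cov(beta, Sigma, tau_1, tau_2):
    """Right-hand side of (21) for integer times to maturity tau_1, tau_2."""
    n = beta.shape[0]
    identity, one = np.eye(n), np.ones(n)

    def g(tau):
        # (1 - beta^T)^{-1} (1 - (beta^T)^tau) 1
        return np.linalg.solve(identity - beta.T,
                               (identity - np.linalg.matrix_power(beta.T, tau)) @ one)

    return float(g(tau_1) @ Sigma @ g(tau_2)) / (tau_1 * tau_2)
```

For τ₁ = τ₂ = τ, the square root of `realized_cov` is the realized volatility of the yield with time to maturity τΔ.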

5.3.3. Cross-Sectional Estimation of β and Σ

Realized covariation estimator (22) can be used in conjunction with asymptotic relation (21) to estimate parameters β and Σ at time

t Δ

in the following way. For given symmetric weights

w_{i j} = w_{j i} \geq 0

, we solve the least squares problem:

\begin{matrix} ({\hat{β}}^{RCov}, {\hat{Σ}}^{RCov}) = {arg min}_{β, Σ} \sum_{i, j = 1}^{M} w_{i j} [\hat{RCov} (t, τ_{i}, τ_{j}) \\ - \frac{1}{τ_{i} τ_{j}} 1^{⊤} (1 - β^{τ_{i}}) {(1 - β)}^{- 1} Σ {(1 - β^{⊤})}^{- 1} (1 - {(β^{⊤})}^{τ_{j}}) 1]^{2}, \end{matrix}

(23)

where we optimize over β and Σ satisfying Assumption 1.
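The objective function of the least squares problem (23) can be sketched in Python as follows, here for the special case of a diagonal matrix β (stored as a vector) and with Σ parametrized by a lower-triangular factor so that it is automatically symmetric and positive semi-definite. The function names, this parametrization and the choice to leave the minimization to a generic optimizer are assumptions made for illustration; the constraints of Assumption 1 are not enforced in the sketch.

```python
import numpy as np

def model_cov(beta_diag, Sigma, tau_i, tau_j):
    """Right-hand side of (21) for diagonal beta."""
    g_i = (1.0 - beta_diag ** tau_i) / (1.0 - beta_diag)
    g_j = (1.0 - beta_diag ** tau_j) / (1.0 - beta_diag)
    return float(g_i @ Sigma @ g_j) / (tau_i * tau_j)

def rcov_objective(beta_diag, chol, taus, rcov, weights):
    """Weighted sum of squared differences in (23), with Sigma = chol @ chol.T."""
    Sigma = chol @ chol.T
    total = 0.0
    for i, tau_i in enumerate(taus):
        for j, tau_j in enumerate(taus):
            resid = rcov[i, j] - model_cov(beta_diag, Sigma, tau_i, tau_j)
            total += weights[i, j] * resid ** 2
    return total
```

Here `rcov[i, j]` holds the realized covariation (22) for the pair of times to maturity `taus[i]` and `taus[j]`. The objective vanishes when the realized covariations coincide with the model-implied ones.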

5.4. Inference on Market Price of Risk

Finally, we aim at determining parameters λ and Λ of the change of measure specified in Section 4. For this purpose, we combine MLE estimation (Section 5.2) with estimation from realized covariations of yields (Section 5.3). First, we estimate β and Σ by

{\hat{β}}^{RCov}

and

{\hat{Σ}}^{RCov}

as in Section 5.3. Second, we estimate

a

,

b

and α by maximizing the log-likelihood:

log L_{t} (b, β, Σ, a, α) = - \frac{1}{2} \sum_{k = t - K + 1}^{t} log (det F (k)) - \frac{1}{2} \sum_{k = t - K + 1}^{t} ζ {(k)}^{⊤} F {(k)}^{- 1} ζ (k) + const,

for fixed β and Σ over

b \in R^{n}

,

a \in R^{n}

and

α \in R^{n \times n}

with spectrum in

{(- 1, 1)}^{n}

, i.e.,

({\hat{b}}^{MLE}, {\hat{a}}^{MLE}, {\hat{α}}^{MLE}) = {arg max}_{b, a, α} log L_{t} (b, {\hat{β}}^{RCov}, {\hat{Σ}}^{RCov}, a, α) .

(24)

The constraint on the matrix α ensures that the factor process is stationary under the real-world measure

P

. From Equation (15), we have

λ = Σ^{- \frac{1}{2}} (b - a)

and

Λ = Σ^{- \frac{1}{2}} (β - α)

. This motivates the inference of λ by:

\hat{λ} = {({\hat{Σ}}^{RCov})}^{- \frac{1}{2}} ({\hat{b}}^{MLE} - {\hat{a}}^{MLE}),

(25)

and the inference of Λ by:

\hat{Λ} = {({\hat{Σ}}^{RCov})}^{- \frac{1}{2}} ({\hat{β}}^{RCov} - {\hat{α}}^{MLE}) .

(26)

We stress the importance of estimating as many parameters as possible from the realized covariations of yields prior to using maximum likelihood estimation. The MLE procedure of Section 5.2 is computationally intensive and generally does not work well to estimate volatility parameters.
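Relations (25) and (26) amount to solving two linear systems. A minimal Python sketch, assuming that the square root of Σ is taken to be its lower-triangular Cholesky factor (the function name is hypothetical):

```python
import numpy as np

def market_price_of_risk(b, a, beta, alpha, Sigma):
    """Infer (lambda, Lambda) from a = b - Sigma^{1/2} lambda, alpha = beta - Sigma^{1/2} Lambda."""
    root = np.linalg.cholesky(Sigma)          # lower-triangular Sigma^{1/2} (assumption)
    lam = np.linalg.solve(root, b - a)        # (25)
    Lam = np.linalg.solve(root, beta - alpha)  # (26)
    return lam, Lam
```

Solving the triangular systems avoids forming the inverse of the square root explicitly, which can matter when the estimated Σ is close to singular.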

6. Numerical Example for Swiss Interest Rates

6.1. Description and Selection of Data

We choose

Δ = 1 / 252

, which corresponds to a daily time grid (assuming that a financial year has 252 business days). For the Swiss currency (CHF), we consider as yield observations the Swiss Average Rate (SAR), the London InterBank Offered Rate (LIBOR) and the Swiss Confederation Bond (SWCNB). See Figure 1 and Figure 2.

Short times to maturity. The SAR is an ongoing volume-weighted average rate calculated by the Swiss National Bank (SNB) based on repo transactions between financial institutions. It is used for short times to maturity of at most three months. For SAR, we have the Over-Night SARON that corresponds to a time to maturity of Δ (one business day) and the SAR Tomorrow-Next (SARTN) for time to maturity $2 Δ$ (two business days). The latter is not completely correct, because SARON is a collateral over-night rate and tomorrow-next is a call money rate for receiving money tomorrow, which has to be paid back the next business day. Moreover, we have the SAR for times to maturity of one week (SAR1W), two weeks (SAR2W), one month (SAR1M) and three months (SAR3M); see also [10].
Short to medium times to maturity. The LIBOR reflects times to maturity, which correspond to one month (LIBOR1M), three months (LIBOR3M), six months (LIBOR6M) and 12 months (LIBOR12M) in the London interbank market.
Medium to long times to maturity. The SWCNB is based on Swiss government bonds, and it is used for times to maturity, which correspond to two years (SWCNB2Y), three years (SWCNB3Y), four years (SWCNB4Y), five years (SWCNB5Y), seven years (SWCNB7Y), 10 years (SWCNB10Y), 20 years (SWCNB20Y) and 30 years (SWCNB30Y).

These data are available from 8 December 1999, and we set 15 September 2014 to be the last observation date. Of course, SAR, LIBOR and SWCNB do not exactly model risk-free zero-coupon bonds, and these different classes of instruments are not completely consistent, because prices are determined slightly differently for each class. In particular, this can be seen during the 2008–2009 financial crisis. However, these data are in many cases the best approximation to CHF risk-free zero-coupon yields that is available. For the longest times to maturity of SWCNB, one may also raise issues about the liquidity of these instruments, because insurance companies typically run a buy-and-hold strategy for long-term bonds.

In Figure 3, Figure 4, Figure 5 and Figure 6, we compute the realized volatility

\hat{RCov} {(t, τ, τ)}^{\frac{1}{2}}

of yield curve observations

{({\hat{y}}_{τ} (k))}_{k = t - K + 1, \dots, t}

for different times to maturity

τ Δ

and window length K; see Equation (22). In Figure 2 and Figure 6, we observe that SAR fits SWCNB better than LIBOR after the financial crisis of 2008. For this reason, we decide to drop LIBOR and build daily yield curves from SAR and SWCNB, only. The mismatch between LIBOR, SAR and SWCNB is attributable to differences in liquidity and the credit risk of the underlying instruments.

6.2. Model Selection

In this numerical example, we restrict ourselves to multifactor Vasiček models with β and α of diagonal form:

β = diag (β_{11}, \dots, β_{n n}), and α = diag (α_{11}, \dots, α_{n n}),

where

- 1 < β_{11}, \dots, β_{n n}, α_{11}, \dots, α_{n n} < 1

. In the following, we explain exactly how to perform the delicate task of parameter estimation in the multifactor Vasiček Models (1) and (2) using the procedure explained in Section 5.

6.2.1. Discussion of Identification Assumptions

We select short times to maturity (SAR) to estimate parameters

b

, β, Σ,

a

and α. This is reasonable because these parameters describe the dynamics of the factor process and, thus, of the spot rate. As we are working on a small (daily) time grid, asymptotic Formulas (20) and (21) are expected to give good approximations. Additionally, it is reasonable to assume that the noise covariance matrix S in data-generating Model (18) is negligible compared to (21). Therefore, we can estimate the left hand side of (21) by the realized covariation of observed yields; see estimator (22). Then, we determine the Hull–White extension θ in order to match the prevailing yield curve interpolated from SAR and SWCNB.

6.2.2. Determination of the Number of Factors

We need to determine the appropriate number of factors n. The more factors we use, the better we can fit the model to the data. However, the dimensionality of the estimation problem increases quadratically in the number of factors, and the model may become over-parametrized. Therefore, we look for a trade-off between the accuracy of the model and the number of parameters used. In Figure 7, we determine

β_{11}, \dots, β_{n n}

and Σ by solving optimization (23) numerically for three observation dates and

n = 2, 3

. A three-factor model is able to capture rather accurately the dependence on the time to maturity τ. In Figure 8, Figure 9 and Figure 10, we compare the realized volatility of the numerical solution of (23) to the market realized volatility for all observation dates. We observe that in several periods, the two-factor model is not able to fit the SAR realized volatilities accurately for all times to maturity. The three-factor model achieves an accurate fit for most observation dates. The model exhibits small mismatches in 2001, 2008–2009 and 2011–2012. These are periods characterized by a sharp reduction in interest rates in response to financial crises. In September 2011, following strong appreciation of the Swiss Franc with respect to the Euro, the SNB pledged to no longer tolerate Euro-Franc exchange rates below the minimum rate of

1.20

, effectively enforcing a currency floor for more than three years. As a consequence of the European sovereign debt crisis and the intervention of the SNB starting from 2011, we have a long period of very low (even negative) interest rates.

6.2.3. Determination of Vasiček Parameters

Considering the results of Figure 8, Figure 9 and Figure 10, we restrict ourselves from now on to three-factor Vasiček models with parameters

a, b \in R^{3}

and:

\begin{matrix} β = diag (β_{11}, β_{22}, β_{33}), α = diag (α_{11}, α_{22}, α_{33}), Σ^{\frac{1}{2}} = (\begin{matrix} Σ_{11}^{\frac{1}{2}} & 0 & 0 \\ Σ_{21}^{\frac{1}{2}} & Σ_{22}^{\frac{1}{2}} & 0 \\ Σ_{31}^{\frac{1}{2}} & Σ_{32}^{\frac{1}{2}} & Σ_{33}^{\frac{1}{2}} \end{matrix}), \end{matrix}

where

- 1 < β_{11}, β_{22}, β_{33}, α_{11}, α_{22}, α_{33} < 1

,

Σ_{11}^{\frac{1}{2}}, Σ_{22}^{\frac{1}{2}}, Σ_{33}^{\frac{1}{2}} > 0

and

Σ_{21}^{\frac{1}{2}}, Σ_{31}^{\frac{1}{2}}, Σ_{32}^{\frac{1}{2}} \in R

.

In Figure 11, Figure 12 and Figure 13, we plot the numerical solutions of optimizations (23) and (24) for all observation dates. The parameters are reasonable for most of the observation dates. We observe that the estimates of

β_{11}

are close to one for all observation dates. Our values for the speed of mean reversion are reasonable on a daily time grid. Note that β scales as

β^{d}

on a d-days time grid; see Section 5.3. The speeds of mean reversion of

X_{2}

and

X_{3}

are higher than that of

X_{1}

for most of the observation dates. We also see that the volatility of

X_{1}

is lower than that of

X_{2}

and

X_{3}

. In 2011, we observe large spikes in the factor volatilities. Starting from 2011, we have a period with strong correlations among the factors. From these results, we conclude that the three-factor Vasiček model is reasonable for Swiss interest rates. Particularly challenging for the estimation is the period 2011–2014 of low interest rates following the European sovereign debt crisis and the SNB intervention. In Figure 11 (rhs), we observe that the difference in the speeds of mean-reversion under the risk-neutral and real-world measures is negligible. The difference between

b

and

a

is considerable in certain time periods. From the estimation results, we conclude that a constant market price of risk assumption is reasonable and set from now on

Λ = 0

. In Figure 14, we compute the objective function of optimization (24) for

(b, β, Σ, a, α) = (0, {\hat{β}}^{RCov}, {\hat{Σ}}^{RCov}, 0, {\hat{β}}^{RCov})

and compare it to the numerical solution

({\hat{b}}^{MLE}, {\hat{β}}^{RCov}, {\hat{Σ}}^{RCov}, {\hat{a}}^{MLE}, {\hat{β}}^{RCov})

. We observe that in 2003–2005 and 2010–2014, the parameter configuration

(0, {\hat{β}}^{RCov}, {\hat{Σ}}^{RCov}, 0, {\hat{β}}^{RCov})

is nearly optimal. In these periods, we have very low interest rates, and therefore, estimates of

b

and

a

close to zero are reasonable. Given the estimated parameters, we calibrate the Hull–White extension by Equation (10) to the full yield curve interpolated from SAR and SWCNB; see Figure 15. We point out that our fitting method is not a purely statistical procedure; rather, it is a combination of estimation and calibration in accordance with the paradigm of robust calibration, as explained in [4].

6.2.4. Selection of a Model for the Vasiček Parameters

In the following, we use the CRC approach to construct a modification of the Vasiček model with stochastic volatility. We model the process

{(Σ (t))}_{t \in N_{0}}

by a Heston-like [11] approach. We assume deterministic correlations among the factors and stochastic volatility given by:

(\begin{matrix} Σ_{11} (t) \\ Σ_{22} (t) \\ Σ_{33} (t) \end{matrix}) = φ + ϕ (\begin{matrix} Σ_{11} (t - 1) \\ Σ_{22} (t - 1) \\ Σ_{33} (t - 1) \end{matrix}) + (\begin{matrix} \sqrt{Σ_{11} (t - 1)} & 0 & 0 \\ 0 & \sqrt{Σ_{22} (t - 1)} & 0 \\ 0 & 0 & \sqrt{Σ_{33} (t - 1)} \end{matrix}) Φ^{\frac{1}{2}} \tilde{ε} (t),

where

φ \in R_{+}^{3}

,

ϕ = diag (ϕ_{11}, ϕ_{22}, ϕ_{33}) \in R^{3 \times 3}

,

Φ^{\frac{1}{2}} \in R^{3 \times 3}

non-singular, and for each

t \in N

,

\tilde{ε} (t)

has a standard Gaussian distribution under

P

, conditionally given

F (t - 1)

. Moreover, we assume that

(ε (t), \tilde{ε} (t))

is multivariate Gaussian under

P

, conditionally given

F (t - 1)

. Note that

ε (t)

and

\tilde{ε} (t)

are allowed to be correlated. The matrix valued process

{(Σ (t))}_{t \in N_{0}}

is constructed combining this stochastic volatility model with fixed correlation coefficients. This model is able to capture the stylized fact that volatility appears to be more noisy in high volatility periods; see Figure 12.

We use the volatility time series of Figure 12 to specify φ, ϕ and Φ. We rewrite the equation for the evolution of the volatility as:

\frac{Σ_{i i} (t)}{\sqrt{Σ_{i i} (t - 1)}} = \frac{φ_{i}}{\sqrt{Σ_{i i} (t - 1)}} + ϕ_{i i} \sqrt{Σ_{i i} (t - 1)} + {(Φ^{\frac{1}{2}} \tilde{ε} (t))}_{i}, i = 1, 2, 3,

and use least squares regression to estimate φ, ϕ and Φ. From the regression residuals, we estimate the correlations between

ε (t)

and

\tilde{ε} (t)

. Figure 16, Figure 17 and Figure 18 show the estimates of φ, ϕ and Φ.
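The regression step for a single component i can be sketched in Python as follows. The rewritten volatility equation is a linear regression without intercept of Σ_ii(t)/√Σ_ii(t−1) on the two regressors 1/√Σ_ii(t−1) and √Σ_ii(t−1). The function name and return layout are hypothetical, and the input is assumed to be a strictly positive time series of estimated Σ_ii values.

```python
import numpy as np

def fit_vol_dynamics(sigma_series):
    """Least squares estimates of (phi_i, phi_ii) and the regression residuals."""
    s = np.asarray(sigma_series, dtype=float)
    prev, curr = s[:-1], s[1:]
    root = np.sqrt(prev)
    regressors = np.column_stack([1.0 / root, root])
    response = curr / root
    coef, *_ = np.linalg.lstsq(regressors, response, rcond=None)
    residuals = response - regressors @ coef   # estimates of (Phi^{1/2} eps~(t))_i
    intercept, persistence = coef
    return float(intercept), float(persistence), residuals
```

Stacking the residual series of the three components and taking their empirical covariance matrix gives one possible estimate of Φ; their empirical correlations with the filtered innovations of the factor process give the correlations between ε(t) and the volatility innovations.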

6.3. Simulation and Back-Testing

Section 6.2 provides a full specification of the three-factor Vasiček CRC model under the risk-neutral and real-world probability measures. Various model quantities of interest in applications can then be calculated by simulation.

6.3.1. Simulation

The CRC approach has the remarkable property that yield curve increments can be simulated accurately and efficiently using Theorem 5 and Corollary 6. In contrast, spot rate models with stochastic volatility without CRC have serious computational drawbacks. In such models, the calculation of the prevailing yield curve for given state variables requires Monte Carlo simulation. Therefore, the simulation of future yield curves requires nested simulations.

6.3.2. Back-Testing

We backtest properties of the monthly returns of a buy and hold portfolio investing equal proportions of wealth in the zero-coupon bonds with times to maturity of 2, 3, 4, 5, 6 and 9 months and 1, 2, 3, 5, 7 and 10 years. We divide the sample into disjoint monthly periods and calculate the monthly return of this portfolio assuming that at the beginning of each period, we invest in the bonds with these times to maturity in equal proportions of wealth. The returns and some summary statistics are shown in Figure 19. We observe that the returns are positively skewed, leptokurtic and have heavier tails than the Gaussian distribution. These stylized facts are essential in applications.
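The return calculation and the summary statistics used in this back-test can be sketched in Python as follows. The sketch assumes continuously compounded yields, so that a zero-coupon bond with time to maturity τ (in years) and yield y has price exp(−yτ); the function names and the input layout are hypothetical.

```python
import numpy as np

def buy_and_hold_return(yields_start, yields_end, maturities, holding):
    """Return over one holding period of an equally weighted zero-coupon bond portfolio.

    yields_start[i] : yield at the start of the period for maturity maturities[i]
    yields_end[i]   : yield at the end of the period for maturity maturities[i] - holding
    """
    y0 = np.asarray(yields_start, dtype=float)
    y1 = np.asarray(yields_end, dtype=float)
    tau = np.asarray(maturities, dtype=float)
    price_start = np.exp(-y0 * tau)
    price_end = np.exp(-y1 * (tau - holding))
    # Equal proportions of wealth: average the gross returns of the individual bonds.
    return float(np.mean(price_end / price_start) - 1.0)

def skewness_and_excess_kurtosis(returns):
    """Sample skewness and excess kurtosis based on central moments."""
    r = np.asarray(returns, dtype=float)
    centered = r - r.mean()
    m2 = np.mean(centered ** 2)
    skew = np.mean(centered ** 3) / m2 ** 1.5
    excess_kurt = np.mean(centered ** 4) / m2 ** 2 - 3.0
    return float(skew), float(excess_kurt)
```

A Gaussian sample has skewness and excess kurtosis close to zero, so positive values of both statistics indicate the positively skewed, leptokurtic behavior described above.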

For each monthly period, we select a three-factor Vasiček model and its CRC counterpart with stochastic volatility. Then, we simulate for each period realizations of the returns of the test portfolio. By construction, the Vasiček model generates Gaussian log-returns and is unable to reproduce the stylized facts of the sample; see Table 1 and Table 2 and Figure 20. Increasing the number of factors does not help much, because the log-returns remain Gaussian. On the other hand, CRC of the Vasiček model with stochastic volatility provides additional modeling flexibility. In particular, we can see from the statistics in Table 2 and the confidence intervals in Figure 20 that the model matches the return distribution better than the Vasiček model. As explained in Figure 20, statistical tests assuming the independence of disjoint monthly periods show that the difference between the Vasiček model and its CRC counterpart is statistically significant. We conclude that the three-factor CRC Vasiček model is a parsimonious and tractable alternative that provides reasonable results.

6.3.3. Regulatory Framework

The type of analysis that was performed in the previous section is an integral component of the present regulatory framework for risk management. In the Basel framework [5], the capital charge for the trading book is based on quantile risk measures. Under the internal model approach ([5], Section 2.VI.D), a bank calculates quantiles for the distribution of possible 10-day losses based on recent market data under the assumption that the trading book portfolio is held fixed over the time period. The approach relies on accurate modeling of the distribution of portfolio returns over holding periods of multiple days. A similar analysis is required by the Basel ([5], Section 2.VI.D) regulatory framework for model validation and stress testing: model validation is performed by backtesting the historical performance of the model, and stress tests are carried out using the same methodology by calibrating the model to historical periods of significant financial stress.

These tasks can be accomplished using the CRC approach by selecting suitable classes of affine models and parameter processes. The approach is fairly general, since there are few restrictions on the parameter processes. In particular, it allows for stochastic volatility and can be used to create realistic non-Gaussian distributions of multi-period bond returns (see Section 6.3.2). Nevertheless, computing these bond return distributions does not require nested simulations. This is crucial for reasons of efficiency. Moreover, the flexibility in the specification of the parameter processes makes the CRC approach well suited for stress testing, because it allows one to freely select and specify stress scenarios.

7. Conclusions

Flexibility and tractability. Consistent re-calibration of the multifactor Vasiček model provides a tractable extension that allows parameters to follow stochastic processes. The additional flexibility can lead to better fits of yield curve dynamics and return distributions, as we demonstrated in our numerical example. Nevertheless, the model remains tractable. In particular, yield curves can be simulated efficiently using Theorem 5 and Corollary 6. This allows one to efficiently calculate model quantities of interest in risk management, forecasting and pricing.
Model selection. CRC models are selected from the data in accordance with the robust calibration principle of [4]. First, historical parameters, market prices of risk and Hull–White extensions are inferred using a combination of volatility estimation, MLE and calibration to the prevailing yield curve via Formulas (23)–(26) and (10). The only choices in this inference procedure are the number of factors of the Vasiček model and the window length K. Then, as a second step, the time series of estimated historical parameters are used to select a model for the parameter evolution. This results in a complete specification of the CRC model under the real-world and the pricing measure.
Application to modeling of Swiss interest rates. We fitted a three-factor Vasiček CRC model with stochastic volatility to Swiss interest rate data. The model achieves a reasonably good fit in most time periods. The tractability of CRC allowed us to compute several model quantities by simulation. We looked at the historical performance of a representative buy and hold portfolio of Swiss bonds and concluded that a multifactor Vasiček model is unable to describe the returns of this portfolio accurately. In contrast, the CRC version of the model provides the necessary flexibility for a good fit.

Acknowledgments

We gratefully acknowledge support by the ETH Foundation and SNF Grant 149879.

Author Contributions

All authors contributed equally to this article.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CRC	Consistent re-calibration
HJM	Heath–Jarrow–Morton
ZCB	zero-coupon bond

Appendix A Proofs

Proof of Theorem 2.

We prove Theorem 2 by induction as in ([9] Theorem 3.16) where ZCB prices are derived under the assumption that β and Σ are diagonal matrices. We have

P (m - 1, m) = exp (- 1^{⊤} X (m - 1) Δ)

, which proves the claim for

t = m - 1

. Assume that Theorem 2 holds for

t + 1 \in {1, \dots, m - 1}

. We verify that it also holds for

t \in {0, \dots, m - 2}

. Under equivalent martingale measure

P^{*}

, we have using the tower property for conditional expectations and the induction assumption:

\begin{matrix} P (t, m) & = exp \{- 1^{⊤} X (t) Δ\} E^{*} [E^{*} [exp \{- Δ \sum_{s = t + 1}^{m - 1} 1^{⊤} X (s)\} | F (t + 1)] | F (t)] \\ = exp \{- 1^{⊤} X (t) Δ\} E^{*} [P (t + 1, m) | F (t)] \\ = exp \{- 1^{⊤} X (t) Δ\} E^{*} [exp \{A (t + 1, m) - B {(t + 1, m)}^{⊤} X (t + 1)\} | F (t)] \\ = exp \{- 1^{⊤} X (t) Δ + A (t + 1, m) - B {(t + 1, m)}^{⊤} (b + β X (t)) + \frac{1}{2} B {(t + 1, m)}^{⊤} Σ B (t + 1, m)\} \\ = exp \{A (t + 1, m) - B {(t + 1, m)}^{⊤} b + \frac{1}{2} B {(t + 1, m)}^{⊤} Σ B (t + 1, m) - (B {(t + 1, m)}^{⊤} β + 1^{⊤} Δ) X (t)\} . \end{matrix}

This proves the following recursive formula for

m - 1 > t \geq 0

:

\begin{matrix} A (t, m) & = A (t + 1, m) - B {(t + 1, m)}^{⊤} b + \frac{1}{2} B {(t + 1, m)}^{⊤} Σ B (t + 1, m), \\ B (t, m) & = β^{⊤} B (t + 1, m) + 1 Δ . \end{matrix}

Finally, note that the recursive formula for

B (\cdot, \cdot)

implies:

B (t, m) = \sum_{s = 0}^{m - t - 1} {(β^{⊤})}^{s} 1 Δ = {(1 - β^{⊤})}^{- 1} (1 - {(β^{⊤})}^{m - t}) 1 Δ .

This concludes the proof. ☐
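The recursion in this proof can be checked numerically. The following Python sketch iterates the formulas for $A$ and $B$ backwards from maturity and compares $B$ with the closed-form sum. It uses the fact that, for constant parameters, $A (t, m)$ and $B (t, m)$ depend only on the time to maturity $n = m - t$. The function names and the two-factor parameter values are illustrative assumptions and are not taken from the article.

```python
import math

def dot(u, v):
    """Euclidean inner product of two vectors given as lists."""
    return sum(ui * vi for ui, vi in zip(u, v))

def mat_t_vec(mat, vec):
    """Return mat^T vec for a square matrix given as a list of rows."""
    dim = len(vec)
    return [sum(mat[i][j] * vec[i] for i in range(dim)) for j in range(dim)]

def quad_form(vec, mat):
    """Return vec^T mat vec."""
    return dot(vec, [dot(row, vec) for row in mat])

def zcb_coefficients(b, beta, sigma, delta, n_max):
    """Backward recursion for A and B, indexed by time to maturity n = m - t.

    A[n] and B[n] stand for A(t, t + n) and B(t, t + n); index 0 is padding.
    """
    dim = len(b)
    A = [None, 0.0]               # A(m - 1, m) = 0
    B = [None, [delta] * dim]     # B(m - 1, m) = 1 * delta
    for n in range(2, n_max + 1):
        B_next = B[n - 1]         # plays the role of B(t + 1, m)
        A.append(A[n - 1] - dot(B_next, b) + 0.5 * quad_form(B_next, sigma))
        B.append([v + delta for v in mat_t_vec(beta, B_next)])
    return A, B

def b_closed_form(beta, delta, n):
    """Sum_{s=0}^{n-1} (beta^T)^s 1 delta, accumulated term by term."""
    term = [delta] * len(beta)
    total = list(term)
    for _ in range(1, n):
        term = mat_t_vec(beta, term)
        total = [t + u for t, u in zip(total, term)]
    return total

def zcb_price(A_n, B_n, x):
    """P(t, t + n) = exp(A - B^T x) for factor value x."""
    return math.exp(A_n - dot(B_n, x))

# Hypothetical two-factor parameters on a daily grid (not from the article).
b = [0.0001, 0.0]
beta = [[0.99, 0.01], [0.0, 0.95]]
sigma = [[1e-6, 2e-7], [2e-7, 4e-6]]
delta = 1.0 / 252
A, B = zcb_coefficients(b, beta, sigma, delta, n_max=10)
```

Storing the coefficients by time to maturity avoids recomputing the recursion for every pair $(t, m)$. This shortcut is only valid while the parameters are constant.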

Proof of Theorem 3.

The proof goes by induction as the proof of Theorem 2. ☐

Proof of Theorem 4.

First, observe that the condition $y^{(k)} (k) = y$ imposes conditions only on the values $θ (1), \dots, θ (M - 1)$. Secondly, note that the vector θ, such that the condition is satisfied, can be calculated recursively in the following way.

First component $θ_{1}$. We have $A^{(k)} (k + 1, k + 2) = 0$, $B (k + 1, k + 2) = 1 Δ$ and:

$A^{(k)} (k, k + 2) = - 1^{⊤} b Δ - θ_{1} Δ + \frac{1}{2} 1^{⊤} Σ 1 Δ^{2},$

see Theorem 3. Solving the last equation for $θ_{1}$, we have:

$θ_{1} = \frac{1}{2} 1^{⊤} Σ 1 Δ - 1^{⊤} b - A^{(k)} (k, k + 2) Δ^{- 1} .$

From (6) and the equation for $B$ in Theorem 2, we obtain:

$A^{(k)} (k, k + 2) = 1^{⊤} (1 - β^{2}) {(1 - β)}^{- 1} x Δ - 2 y_{2} Δ .$

This is equivalent to:

$θ_{1} = \frac{1}{2} 1^{⊤} Σ 1 Δ - 1^{⊤} b - 1^{⊤} (1 - β^{2}) {(1 - β)}^{- 1} x + 2 y_{2} .$

(27)

Recursion $i \to i + 1$. Assume we have determined $θ_{1}, \dots, θ_{i}$ for $i = 1, \dots, M - 2$. We want to determine $θ_{i + 1}$. We have $A^{(k)} (k + i + 1, k + i + 2) = 0$, and iteration of the recursive formula for $A^{(k)}$ in Theorem 3 implies:

$A^{(k)} (k, k + i + 2) = - \sum_{s = k + 1}^{k + i + 1} B {(s, k + i + 2)}^{⊤} (b + θ (s - k) e_{1}) + \frac{1}{2} \sum_{s = k + 1}^{k + i + 1} B {(s, k + i + 2)}^{⊤} Σ B (s, k + i + 2) .$

Solving the last equation for $θ_{i + 1}$ and using $B (k + i + 1, k + i + 2) = 1 Δ$, we have:

$\begin{matrix} θ_{i + 1} = & - \frac{1}{Δ} A^{(k)} (k, k + i + 2) - \frac{1}{Δ} \sum_{s = k + 1}^{k + i} B {(s, k + i + 2)}^{⊤} (b + θ (s - k) e_{1}) - 1^{⊤} b \\ + \frac{1}{2 Δ} \sum_{s = k + 1}^{k + i + 1} B {(s, k + i + 2)}^{⊤} Σ B (s, k + i + 2) . \end{matrix}$

From (6) and the equation for $B$ in Theorem 2, we obtain:

$A^{(k)} (k, k + i + 2) = 1^{⊤} (1 - β^{i + 2}) {(1 - β)}^{- 1} x Δ - (i + 2) y_{i + 2} Δ .$

This is equivalent to:

$\begin{matrix} θ_{i + 1} & = (i + 2) y_{i + 2} - 1^{⊤} (1 - β^{i + 2}) {(1 - β)}^{- 1} x - \frac{1}{Δ} \sum_{s = k + 1}^{k + i} B {(s, k + i + 2)}^{⊤} (b + θ_{s - k} e_{1}) \\ - 1^{⊤} b + \frac{1}{2 Δ} \sum_{s = k + 1}^{k + i + 1} B {(s, k + i + 2)}^{⊤} Σ B (s, k + i + 2) \\ = (i + 2) y_{i + 2} - 1^{⊤} (1 - β^{i + 2}) {(1 - β)}^{- 1} x - \frac{1}{Δ} \sum_{s = k + 1}^{k + i + 1} B {(s, k + i + 2)}^{⊤} b \\ - \frac{1}{Δ} \sum_{s = k + 1}^{k + i} B_{1} (s, k + i + 2) θ_{s - k} + \frac{1}{2 Δ} \sum_{s = k + 1}^{k + i + 1} B {(s, k + i + 2)}^{⊤} Σ B (s, k + i + 2) . \end{matrix}$

(28)

This recursion allows one to determine the components of θ. Note that Equation (28) can be written as:

{(C (β) θ)}_{i + 1} = z_{i + 1} (b, β, Σ, x, y), i = 1, \dots, M - 2 .

Observe that the lower triangular matrix $C (β)$ is invertible since $\det C (β) = Δ^{M - 1} > 0$. Hence, Equations (27) and (28) prove (8). ☐
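The recursive construction of θ amounts to forward substitution in a lower triangular system with diagonal entries Δ. The following Python sketch implements it under the notation used in this proof: the target value of $A^{(k)} (k, k + j)$ is taken from the relation $A^{(k)} (k, k + j) = B {(k, k + j)}^{⊤} x - j y_{j} Δ$ displayed above, and the remaining terms come from the iterated recursion for $A^{(k)}$. All function names and the numerical inputs are illustrative assumptions. The example yield curve is chosen so that $y_{1} = 1^{⊤} x$.

```python
def dot(u, v):
    """Euclidean inner product of two vectors given as lists."""
    return sum(ui * vi for ui, vi in zip(u, v))

def quad_form(vec, mat):
    """Return vec^T mat vec."""
    return dot(vec, [dot(row, vec) for row in mat])

def b_by_maturity(beta, delta, n_max):
    """B[n] = B(t, t + n) from B(t, m) = beta^T B(t + 1, m) + 1 delta."""
    dim = len(beta)
    B = [None, [delta] * dim]
    for n in range(2, n_max + 1):
        prev = B[n - 1]
        B.append([sum(beta[i][j] * prev[i] for i in range(dim)) + delta
                  for j in range(dim)])
    return B

def calibrate_theta(b, beta, sigma, x, y, delta):
    """Forward substitution for theta_1, ..., theta_{M-1}.

    y[j - 1] is the observed yield with time to maturity j grid steps.
    The short yield y[0] is not used; it is assumed to equal sum(x).
    """
    M = len(y)
    B = b_by_maturity(beta, delta, M)
    theta = []                      # theta[s - 1] stores theta_s
    for j in range(2, M + 1):
        # Value of A^{(k)}(k, k + j) implied by the observed yield y_j.
        target = dot(B[j], x) - j * y[j - 1] * delta
        # Terms of the iterated recursion that do not involve theta_{j-1}.
        known = sum(0.5 * quad_form(B[j - s], sigma) - dot(B[j - s], b)
                    for s in range(1, j))
        known -= sum(B[j - s][0] * theta[s - 1] for s in range(1, j - 1))
        # A = known - delta * theta_{j-1}, since B_1(k + j - 1, k + j) = delta.
        theta.append((known - target) / delta)
    return theta

def model_yields(b, beta, sigma, x, theta, delta):
    """Reprice y_2, ..., y_M with the backward recursion for A^{(k)}."""
    M = len(theta) + 1
    B = b_by_maturity(beta, delta, M)
    yields = []
    for j in range(2, M + 1):
        A = 0.0                     # A^{(k)}(k + j - 1, k + j) = 0
        for s in range(j - 1, 0, -1):
            B_s = B[j - s]          # B(k + s, k + j)
            drift = list(b)
            drift[0] += theta[s - 1]    # b + theta_s e_1
            A += 0.5 * quad_form(B_s, sigma) - dot(B_s, drift)
        yields.append((dot(B[j], x) - A) / (j * delta))
    return yields

# Hypothetical inputs (not from the article).
b = [0.0001, 0.0]
beta = [[0.99, 0.01], [0.0, 0.95]]
sigma = [[1e-6, 2e-7], [2e-7, 4e-6]]
delta = 1.0 / 252
x = [0.01, 0.005]
y = [0.015, 0.0151, 0.0153, 0.0156, 0.016]
theta = calibrate_theta(b, beta, sigma, x, y, delta)
```

Repricing with `model_yields` offers a simple consistency check: the returned values should agree with `y[1:]` up to floating-point error.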

Proof of Theorem 5.

We add and subtract $- A^{(k)} (k, m) + B^{(k)} {(k, m)}^{⊤} X (k)$ to the right hand side of Equation (12) and obtain:

\begin{matrix} Y (k + 1, m) (m - (k + 1)) Δ & = A^{(k)} (k, m) - A^{(k)} (k + 1, m) - A^{(k)} (k, m) \\ + B^{(k)} {(k, m)}^{⊤} X (k) - B^{(k)} {(k, m)}^{⊤} X (k) \\ + B^{(k)} {(k + 1, m)}^{⊤} (b (k) + θ^{(k)} (1) e_{1} + β (k) X (k) + Σ {(k)}^{\frac{1}{2}} ε^{*} (k + 1)) . \end{matrix}

(29)

We have the following two identities from Section 3.1.2:

\begin{matrix} - A^{(k)} (k, m) + B^{(k)} {(k, m)}^{⊤} X (k) & = Y (k, m) (m - k) Δ, \\ A^{(k)} (k, m) - A^{(k)} (k + 1, m) & = - B^{(k)} {(k + 1, m)}^{⊤} (b (k) + θ^{(k)} (1) e_{1}) + \frac{1}{2} B^{(k)} {(k + 1, m)}^{⊤} Σ (k) B^{(k)} (k + 1, m) . \end{matrix}

(30)

Therefore, the right hand side of (29) is rewritten as:

\begin{matrix} Y (k + 1, m) (m - (k + 1)) Δ & = Y (k, m) (m - k) Δ + (B^{(k)} {(k + 1, m)}^{⊤} β (k) - B^{(k)} {(k, m)}^{⊤}) X (k) \\ + \frac{1}{2} B^{(k)} {(k + 1, m)}^{⊤} Σ (k) B^{(k)} (k + 1, m) + B^{(k)} {(k + 1, m)}^{⊤} Σ {(k)}^{\frac{1}{2}} ε^{*} (k + 1) . \end{matrix}

(31)

Observe that:

\begin{matrix} B^{(k)} {(k + 1, m)}^{⊤} β (k) = {(\sum_{s = 0}^{m - k - 2} {(β^{⊤} (k))}^{s} 1)}^{⊤} β (k) Δ = 1^{⊤} \sum_{s = 1}^{m - k - 1} β {(k)}^{s} Δ = B^{(k)} {(k, m)}^{⊤} - 1^{⊤} Δ, \end{matrix}

and that $Y (k, k + 1) = 1^{⊤} X (k)$. This proves the claim. ☐
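Substituting the last two observations into (31) gives the yield increment $Y (k + 1, m) (m - (k + 1)) Δ = Y (k, m) (m - k) Δ - Y (k, k + 1) Δ + \frac{1}{2} B^{⊤} Σ B + B^{⊤} Σ^{\frac{1}{2}} ε^{*} (k + 1)$ with $B = B^{(k)} (k + 1, m)$. The following Python sketch checks this identity numerically for a single step. To stay short, it assumes the special case $θ = 0$ (a plain Vasiček step with constant parameters) and takes the shock vector $Σ^{\frac{1}{2}} ε^{*} (k + 1)$ as a given input. All names and numbers are illustrative assumptions.

```python
def dot(u, v):
    """Euclidean inner product of two vectors given as lists."""
    return sum(ui * vi for ui, vi in zip(u, v))

def quad_form(vec, mat):
    """Return vec^T mat vec."""
    return dot(vec, [dot(row, vec) for row in mat])

def coefficients(b, beta, sigma, delta, n_max):
    """A[n], B[n] for time to maturity n (constant parameters, theta = 0)."""
    dim = len(b)
    A = [None, 0.0]
    B = [None, [delta] * dim]
    for n in range(2, n_max + 1):
        prev = B[n - 1]
        A.append(A[n - 1] - dot(prev, b) + 0.5 * quad_form(prev, sigma))
        B.append([sum(beta[i][j] * prev[i] for i in range(dim)) + delta
                  for j in range(dim)])
    return A, B

def scaled_yield(A_n, B_n, x):
    """Y(t, t + n) * n * delta = -A + B^T x."""
    return -A_n + dot(B_n, x)

def factor_step(b, beta, x, shock):
    """X(k + 1) = b + beta X(k) + shock, where shock = Sigma^{1/2} eps."""
    return [bi + dot(row, x) + si for bi, row, si in zip(b, beta, shock)]

def increment_rhs(A, B, sigma, x, shock, n, delta):
    """Right-hand side of the increment for time to maturity n at time k."""
    B_next = B[n - 1]               # B(k + 1, m) with m - k = n
    return (scaled_yield(A[n], B[n], x)
            - sum(x) * delta        # Y(k, k + 1) * delta
            + 0.5 * quad_form(B_next, sigma)
            + dot(B_next, shock))

# Hypothetical inputs (not from the article).
b = [0.0001, 0.0]
beta = [[0.99, 0.01], [0.0, 0.95]]
sigma = [[1e-6, 2e-7], [2e-7, 4e-6]]
delta = 1.0 / 252
x = [0.01, 0.005]
shock = [0.0007, -0.0012]
A, B = coefficients(b, beta, sigma, delta, n_max=10)
x_new = factor_step(b, beta, x, shock)

# Left-hand side: yield at k + 1 computed directly from the new factor value.
lhs = {n: scaled_yield(A[n - 1], B[n - 1], x_new) for n in range(2, 11)}
rhs = {n: increment_rhs(A, B, sigma, x, shock, n, delta) for n in range(2, 11)}
```

The dictionaries `lhs` and `rhs` should agree for every time to maturity, which is the content of the increment formula in this special case.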

References

1. O. Vasiček. “An equilibrium characterization of the term structure.” J. Financ. Econ. 5 (1977): 177–188.
2. J.C. Cox, J.E. Ingersoll, and S.A. Ross. “A theory of the term structure of interest rates.” Econometrica 53 (1985): 385–407.
3. D. Heath, R. Jarrow, and A. Morton. “Bond pricing and the term structure of interest rates: A new methodology for contingent claim valuation.” Econometrica 60 (1992): 77–105.
4. P. Harms, D. Stefanovits, J. Teichmann, and M.V. Wüthrich. “Consistent re-calibration of yield curve models.” 2015. Available online: arxiv.org/abs/1502.02926 (accessed on 1 May 2016).
5. Bank for International Settlements (BIS), and Basel Committee on Banking Supervision (BCBS). “Basel II: International convergence of capital measurement and capital standards. A revised framework—Comprehensive version.” 2006. Available online: http://www.bis.org/publ/bcbs128.htm (accessed on 1 May 2016).
6. P.J. Brockwell, and R.A. Davis. Time Series: Theory and Methods. Berlin/Heidelberg, Germany: Springer, 1991.
7. J. Hull, and A. White. “Branching out.” Risk 7 (1994): 34–37.
8. L.E. Svensson. Estimating and Interpreting Forward Interest Rates: Sweden 1992–1994. Cambridge, MA, USA: National Bureau of Economic Research, 1994.
9. M.V. Wüthrich, and M. Merz. Financial Modeling, Actuarial Valuation and Solvency in Insurance. Berlin/Heidelberg, Germany: Springer, 2013.
10. T.J. Jordan. “SARON—An innovation for the financial markets.” Launch event for Swiss Reference Rates, Zurich, 25 August 2009. Available online: https://www.snb.ch/en/mmr/speeches/id/ref_20090825_tjn_1 (accessed on 1 May 2016).
11. S.L. Heston. “A closed-form solution for options with stochastic volatility with applications to bond and currency options.” Rev. Financ. Stud. 6 (1993): 327–343.

Figure 1. Yield rates: (lhs) Swiss Average Rate (SAR) and (rhs) London InterBank Offered Rate (LIBOR) from 8 December 1999 until 15 September 2014.

Figure 2. Yield rates: (lhs) Swiss Confederation Bond (SWCNB) and (rhs) a selection of SAR, LIBOR and SWCNB from 8 December 1999 until 15 September 2014. Note that LIBOR looks rather different from SAR and SWCNB after the financial crisis of 2008.

Figure 3. SAR realized volatility $\hat{RCov} {(t, τ, τ)}^{\frac{1}{2}}$ for $τ = 1, 2, 5, 10, 21, 63$, window length $K = 21$ (lhs) and $K = 126$ (rhs).

Figure 4. LIBOR realized volatility $\hat{RCov} {(t, τ, τ)}^{\frac{1}{2}}$ for $τ = 21, 63, 126, 252$, window length $K = 21$ (lhs) and $K = 126$ (rhs).

Figure 5. SWCNB realized volatility $\hat{RCov} {(t, τ, τ)}^{\frac{1}{2}}$ for $τ / 252 = 2, 3, 4, 5, 7, 10, 20, 30$, window length $K = 21$ (lhs) and $K = 126$ (rhs).

Figure 6. A selection of SAR, LIBOR and SWCNB realized volatility $\hat{RCov} {(t, τ, τ)}^{\frac{1}{2}}$ for $τ = 1, 63, 252, 504$, window length $K = 21$ (lhs) and $K = 126$ (rhs). Note that LIBOR looks rather different from SAR and SWCNB after the financial crisis of 2008.

Figure 7. SAR realized volatility $\hat{RCov} {(t, τ, τ)}^{\frac{1}{2}}$ for $K = 126$, $τ = 1, 2, 5, 10, 21, 63$ and three observation dates compared to the realized volatility of the two- (lhs) and three-factor (rhs) Vasiček model fitted by optimization (23) for $M = 6$, $τ_{1} = 1$, $τ_{2} = 2$, $τ_{3} = 5$, $τ_{4} = 10$, $τ_{5} = 21$, $τ_{6} = 63$ and $w_{i j} = 1_{\{i = j\}}$. The three-factor model achieves an accurate fit.

Figure 8. SAR realized volatility $\hat{RCov} {(t, τ, τ)}^{\frac{1}{2}}$ for $K = 126$, $τ = 1$ (lhs), $τ = 2$ (rhs) and all observation dates compared to the realized volatility of the two- and three-factor Vasiček models fitted by optimization (23) for $M = 6$, $τ_{1} = 1$, $τ_{2} = 2$, $τ_{3} = 5$, $τ_{4} = 10$, $τ_{5} = 21$, $τ_{6} = 63$ and $w_{i j} = 1_{\{i = j\}}$.

Figure 9. SAR realized volatility $\hat{RCov} {(t, τ, τ)}^{\frac{1}{2}}$ for $K = 126$, $τ = 5$ (lhs), $τ = 10$ (rhs) and all observation dates compared to the realized volatility of the two- and three-factor Vasiček models fitted by optimization (23) for $M = 6$, $τ_{1} = 1$, $τ_{2} = 2$, $τ_{3} = 5$, $τ_{4} = 10$, $τ_{5} = 21$, $τ_{6} = 63$ and $w_{i j} = 1_{\{i = j\}}$.

Figure 10. SAR realized volatility $\hat{RCov} {(t, τ, τ)}^{\frac{1}{2}}$ for $K = 126$, $τ = 21$ (lhs), $τ = 63$ (rhs) and all observation dates compared to the realized volatility of the two- and three-factor Vasiček models fitted by optimization (23) for $M = 6$, $τ_{1} = 1$, $τ_{2} = 2$, $τ_{3} = 5$, $τ_{4} = 10$, $τ_{5} = 21$, $τ_{6} = 63$ and $w_{i j} = 1_{\{i = j\}}$.

Figure 11. Estimation of $β_{11}$, $β_{22}$ and $β_{33}$ (lhs) and ${(Σ^{\frac{1}{2}} Λ)}_{11} = β_{11} - α_{11}$, ${(Σ^{\frac{1}{2}} Λ)}_{22} = β_{22} - α_{22}$ and ${(Σ^{\frac{1}{2}} Λ)}_{33} = β_{33} - α_{33}$ (rhs) by optimizations (23) and (24) in the three-factor model for $K = 126$, $M = 6$, $τ_{1} = 1$, $τ_{2} = 2$, $τ_{3} = 5$, $τ_{4} = 10$, $τ_{5} = 21$, $τ_{6} = 63$, $w_{i j} = 1_{\{i = j\}}$ and $S = 10^{- 5} \cdot 1$. The values determine the speed of mean reversion of the factors. Since we are considering a daily time grid, values close to one (slow mean reversion) are reasonable. We observe that the difference in the speed of mean-reversion under the risk-neutral and real-world measures is negligible.

Figure 12. Estimation of $Σ_{11}$, $Σ_{22}$ and $Σ_{33}$ (lhs) and correlations $ρ_{21}$, $ρ_{31}$ and $ρ_{32}$ (rhs) by optimization (23) in the three-factor model for $K = 126$, $M = 6$, $τ_{1} = 1$, $τ_{2} = 2$, $τ_{3} = 5$, $τ_{4} = 10$, $τ_{5} = 21$, $τ_{6} = 63$ and $w_{i j} = 1_{\{i = j\}}$. We observe large spikes in the volatilities and strong correlations among the factors during the European sovereign debt crisis and after the SNB intervention in 2011.

Figure 13. Estimation of $b_{1}$, $b_{2}$ and $b_{3}$ (lhs) and ${(Σ^{\frac{1}{2}} λ)}_{1} = b_{1} - a_{1}$, ${(Σ^{\frac{1}{2}} λ)}_{2} = b_{2} - a_{2}$ and ${(Σ^{\frac{1}{2}} λ)}_{3} = b_{3} - a_{3}$ (rhs) by optimizations (23) and (24) in the three-factor model for $K = 126$, $M = 6$, $τ_{1} = 1$, $τ_{2} = 2$, $τ_{3} = 5$, $τ_{4} = 10$, $τ_{5} = 21$, $τ_{6} = 63$, $w_{i j} = 1_{\{i = j\}}$ and $S = 10^{- 5} \cdot 1$. The difference between $b$ and $a$ is considerable in 2000–2002 and 2006–2009.

Figure 14. Objective function $\log L_{t}$ (lhs) and ${(Σ^{\frac{1}{2}} λ)}_{1} = b_{1} - a_{1}$, ${(Σ^{\frac{1}{2}} λ)}_{2} = b_{2} - a_{2}$ and ${(Σ^{\frac{1}{2}} λ)}_{3} = b_{3} - a_{3}$ (rhs) given by optimization (24) in the three-factor model for $K = 126$, $M = 6$, $τ_{1} = 1$, $τ_{2} = 2$, $τ_{3} = 5$, $τ_{4} = 10$, $τ_{5} = 21$, $τ_{6} = 63$, $w_{i j} = 1_{\{i = j\}}$ and $S = 10^{- 5} \cdot 1$. We compare the value of the objective function for $(b, β, Σ, a, α) = (0, β^{RCov}, Σ^{RCov}, 0, α^{RCov})$ and the numerical solution of the optimization. The configuration $(0, β^{RCov}, Σ^{RCov}, 0, α^{RCov})$ is almost optimal in low interest rate times.

Figure 15. Three-factor Hull–White extended Vasiček yield curve (lhs) and Hull–White extension θ (rhs) as of 29 September 2006. The parameters are estimated as in Figure 11, Figure 12 and Figure 13. The initial factors are obtained from the Kalman filter for the estimated parameters. The calibration of the Hull–White extension requires yields on a time to maturity grid of size Δ. These are interpolated from SAR and SWCNB using cubic splines.

Figure 16. Estimation of $φ_{1}$, $φ_{2}$ and $φ_{3}$ by least squares regression (two different scales). We use a time window of 252 observations for the regression.

Figure 17. Estimation of $ϕ_{11}$, $ϕ_{22}$ and $ϕ_{33}$ (lhs) and $Φ_{11}$, $Φ_{22}$ and $Φ_{33}$ (rhs) by least squares regression. We use a time window of 252 observations for the regression.

Figure 18. Estimation of correlations ${\tilde{ρ}}_{21}$, ${\tilde{ρ}}_{31}$ and ${\tilde{ρ}}_{32}$ (lhs) and correlations $Cor [ε (t), \tilde{ε} (t) ∣ F (t - 1)]$ (rhs). We use a time window of 252 observations for the regression. The residuals ε are calculated using the parameter estimates of Figure 11, Figure 12 and Figure 13.

Figure 19. Logarithmic monthly returns of a buy and hold portfolio investing in equal wealth proportions in the zero-coupon bonds with times to maturity of 2, 3, 4, 5, 6 and 9 months and 1, 2, 3, 5, 7 and 10 years. For each monthly period, we calculate the logarithmic return of this portfolio assuming that at the beginning of each period, we are invested in the bonds with these times to maturity in equal proportions of wealth.

Figure 20. Confidence intervals computed from $10^{4}$ simulations of the test portfolio returns in the Vasiček model and its CRC counterpart with stochastic volatility. For each monthly period, we check if the market return lies in the confidence interval. This is more often the case for the CRC than for the standard Vasiček model. A one-sided binomial test assuming the independence of monthly periods shows that the difference is statistically significant ($p = 0.0013$ for the $25 \%$ and $p = 0.00017$ for the $5 \%$ quantiles). The result remains significant if every second month is discarded to account for dependencies ($p \approx 0.01$). This suggests that the CRC Vasiček model is able to match the return distribution better than its counterpart with constant parameters.

Table 1. Statistics computed from simulations of the test portfolio returns for some of the monthly periods in the Vasiček model. For each monthly period, we simulate $10^{4}$ realizations.

Table 2. Statistics computed from the simulations of the test portfolio returns for some of the monthly periods in the consistent re-calibration (CRC) counterpart of the Vasiček model with stochastic volatility. For each monthly period, we simulate $10^{4}$ realizations.

© 2016 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Harms, P.; Stefanovits, D.; Teichmann, J.; Wüthrich, M.V. Consistent Re-Calibration of the Discrete-Time Multifactor Vasiček Model. Risks 2016, 4, 18. https://doi.org/10.3390/risks4030018

AMA Style

Harms P, Stefanovits D, Teichmann J, Wüthrich MV. Consistent Re-Calibration of the Discrete-Time Multifactor Vasiček Model. Risks. 2016; 4(3):18. https://doi.org/10.3390/risks4030018

Chicago/Turabian Style

Harms, Philipp, David Stefanovits, Josef Teichmann, and Mario V. Wüthrich. 2016. "Consistent Re-Calibration of the Discrete-Time Multifactor Vasiček Model" Risks 4, no. 3: 18. https://doi.org/10.3390/risks4030018

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers.
