1. Introduction
In mathematical finance, principal component analysis (PCA) is used to reduce dimensionality in the explanation of a vector of asset returns; see, for instance, Alexander (2001) for discrete-time model applications. The methodology has also been used in continuous-time stochastic processes for financial applications; see Escobar et al. (2010) and Escobar and Olivares (2013) for its usage in collateralized debt obligations (CDOs) and the pricing of exotic financial derivatives, as well as Escobar and Gschnaidtner (2018) and, more generally, De Col et al. (2013) for applications of factor and PC analyses to foreign exchange data.
When modeling financial or any complex data, one can focus on capturing the stylized facts reported in the literature. The best-known features of financial data are fat tails, changing volatilities and correlations, the leverage effect and co-volatility movements. A more refined feature is the smile and smirk patterns of the implied volatility surface. To capture them, Christoffersen et al. (2009) proposed a PCA-inspired stochastic covariance (SC) model using the popular Heston (1993) stochastic volatility model as the underlying component.
These features can be captured by rich SC models with proper marginal structures. SC models have received significant attention in the literature; the best-known representatives are the stochastic Wishart family, see Da Fonseca et al. (2007) and Gouriéroux (2006), and the Ornstein–Uhlenbeck (OU) family, see Muhle-Karbe et al. (2012), as well as general linear-quadratic jump-diffusions, see Cheng and Scaillet (2007). Even though these models show realistic advantages over the classical Black–Scholes model, they lose their tractability as model dimensions increase (i.e., with the number of parameters and simulation paths). This is commonly known as the curse of dimensionality, and PCA is a viable method to control it. Inspired by this, principal component stochastic volatility (PCSV) models are built from a linear combination of tractable one-dimensional counterparts. Their applications have been studied in a series of papers since 2010; see, for example, De Col et al. (2013); Escobar (2018); Escobar et al. (2010); Escobar-Anel and Moreno-Franco (2019).
Current PCSV models rely on Heston SV for the components, also known as the 1/2 model. A new model for single assets, namely, the 4/2 volatility process, was masterfully presented in Grasselli (2016). Notably, the Heston model (the 1/2 model) predicts that the implied volatility skew will flatten when the instantaneous volatility increases (e.g., during financial crises), while another embedded structure, the 3/2 model, see Platen (1997), predicts steeper skews. The author argues that the two processes complement each other in better explaining the implied volatility surface. There are many interesting recent generalizations of the 4/2 model; see, for example, Cui et al. (2021) and Kirkby and Nguyen (2020) and the literature therein.
The works presented above mostly target the equity market and are built based on geometric Brownian motion (GBM)-type processes; hence, they are not suitable for commodities and volatility indexes. These asset classes display mean-reverting and spillover effects, both of which are stylized facts not seen in equities. Mean-reverting effects capture the stationary behavior of prices, which tend to go back to a long-term mean. On the other hand, spillover refers to the impact of one asset on the trends (drift) of other assets (i.e., the impact of one asset on the long-term average, or "stationary price", of a second asset). Our modeling in this paper will ensure that these two facts are captured.
Our modeling is inspired by a recent paper by Cheng et al. (2019) that introduces a generalized multivariate mean-reverting 4/2 factor analysis (FA) model, which uses the one-dimensional mean-reverting 4/2 stochastic volatility process proposed by Escobar-Anel and Gong (2020) as the underlying model. They obtained an analytical representation of the characteristic function (c.f.) of a vector of asset prices, as well as a second conditional c.f. for non-mean-reverting nested cases. Thus, FFT-based option pricing methods, for example Carr and Madan (1999), can be used, and exact simulation is possible. The authors further identified a set of conditions that not only produces well-defined changes of measure, but also avoids local martingales for risk-neutral pricing purposes.
In this paper, we make several contributions to the literature:
We study in detail a multivariate mean-reverting 4/2 stochastic volatility model based on PCA, inspired by the general framework of Cheng et al. (2019). The SC in the new model is decomposed into constant eigenvectors that capture the correlation among assets and a diagonal eigenvalue matrix whose entries are modeled by 4/2 processes.
The PCA structure allows us to find a semi-closed-form c.f. for the vector of returns. It permits the extension to multiple dimensions of simple but accurate approximation approaches, first introduced in Escobar-Anel and Gong (2020) for one dimension, to find closed-form approximations to the c.f., which are proven to be accurate for realistic parameter settings.
We use the estimation approaches developed in Escobar-Anel and Gong (2020) to estimate the parameters for special cases of the proposed model. Here, we use two pairs of bivariate time series capturing both the asset and its variance. Estimation of multidimensional processes is rare in the literature, and our work demonstrates that many, but not all, of the parameters are statistically significant, confirming stylized facts of commodity prices and volatility indexes such as stochastic correlation and spillover effects.
A risk management application, based on a portfolio of constant proportion strategies, see, for example, Merton (1975) and DeMiguel et al. (2009), demonstrates the accuracy of the approximation.
The rest of the paper is organized as follows: in Section 2, we define the model and derive two sub-models for two parametric constructions. Then, in Section 3, we expand the theoretical results for the c.f.s obtained in Cheng et al. (2019) with approximations. In Section 4, we focus on estimation for the multivariate mean-reverting 4/2 stochastic volatility model. The estimation method considered is based on the method developed in Escobar-Anel and Gong (2020) with the introduction of a new scaling factor. Thereafter, in Section 5, we construct a portfolio with two risky assets and one risk-free cash account, and we subsequently calculate the value-at-risk (VaR) at a 95% level using various techniques. Finally, Section 6 concludes the paper.
2. Model Definition
In this section, we define the general model. We first introduce a model with spillover effects, and we later cover models with separable spillover effects and no spillover effects as special cases. We also study the implications of the model on the covariance process.
2.1. General Model Setup
Suppose is a vector of assets. The dynamics for each asset under the so-called historical measure is defined as
where and are independent Brownian motions if , and they are correlated if , that is, the quadratic variation , where is constant. The parameters for each process are positive and satisfy the Feller condition: . Moreover, we assume that the mean-reverting level of decreases as j increases; that is, , for . This last feature is intended to sort the eigenvalues in order of importance, and the ’s measure the “weight” of the 3/2 components.
This model is unlike a traditional mean-reverting model, as it accounts for spillover effects in the drift, which appear in the form of . We mentioned above that spillover effects show the impact of one asset on others; these impacts should not be confused with correlations. Correlations are reflected in the price trends of both assets, capturing co-movements between assets. Spillover effects describe the impact on the mean-reverting level of one asset by others (i.e., the shift in the long-term mean due to the movements of other assets). The concept of spillover effects can be understood as how much, for example, the demand curve of one good shifts according to changes in the factors of other goods. In addition, although this paper does not delve into risk-neutral pricing, we should highlight that changes of measure are feasible on each principal component along the lines of Proposition 4 in Escobar-Anel and Gong (2020), that is, changes of the type: , .
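For concreteness, the following is a sketch, in our own notation and under our reading of Escobar-Anel and Gong (2020), of the one-dimensional mean-reverting 4/2 building block that drives each component; the symbols are illustrative and need not match the original display.

```latex
% Illustrative sketch (our notation): a one-dimensional mean-reverting
% 4/2 process driven by a CIR factor v_j.
\begin{align*}
  dZ_j(t) &= \alpha_j\bigl(L_j - Z_j(t)\bigr)\,dt
             + \Bigl(a_j\sqrt{v_j(t)} + \frac{b_j}{\sqrt{v_j(t)}}\Bigr)\,dB_j(t),\\
  dv_j(t) &= \kappa_j\bigl(\theta_j - v_j(t)\bigr)\,dt
             + \xi_j\sqrt{v_j(t)}\,dW_j(t),
  \qquad d\langle B_j, W_j\rangle_t = \rho_j\,dt,
\end{align*}
% with 2\kappa_j\theta_j \ge \xi_j^2 (Feller) and b_j weighting the 3/2 part.
```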
Equation (1) can be written in matrix form as follows:
where is a vector of independent standard Brownian motions; and .
We first assume that the eigenvalues of the matrix are all negative; this is similar to the literature, see Langetieg (1980) and Larsen (2010). This assumption will be used to explain some of the estimation results. Here, captures the spillover effects, while contains risk premiums associated with the assets, and the long-term average for the assets is determined by .
We next assume a principal component decomposition on the instantaneous covariance matrix : , where is an orthogonal matrix with constant entries, and it captures the correlations among assets. We craft the matrix in such a way that allows for analytical c.f. approximations; that is, , where and denotes the Hadamard product of . The dynamics of log price is as follows:
Based on the applications, our model can be reduced to three subcases for which we are able to approximate the c.f. with analytic functions:
: This is a generalization of Escobar et al. (2010) to multivariate mean-reverting asset classes. If , we get the model considered in Benth (2011).
: This case applies to assets whose price series demonstrates an abnormal increase or decrease, but no leverage effect is observed for the assets of interest. The term “leverage effect” was first defined and studied in Black (1976); it describes the negative correlation between an asset’s volatility and its return.
: This case can be generated by either of the two previous cases. It applies better to assets that exhibit mild behavior in their price series while, at the same time, no leverage effect is identified.
We explain how to approximate the c.f. with analytic functions for these three cases in Section 3.
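To make the decomposition concrete, here is a minimal numerical sketch, with made-up values, of how an instantaneous covariance matrix of this type is assembled from a constant orthogonal matrix and diagonal 4/2-type entries:

```python
import numpy as np

# Hypothetical numbers throughout: covariance built as Sigma = A H A',
# with A a constant orthogonal (eigenvector) matrix and H diagonal with
# squared 4/2-type entries (a_j sqrt(v_j) + b_j / sqrt(v_j))^2.
A = np.array([[0.8, -0.6],
              [0.6,  0.8]])                    # orthogonal: A @ A.T = I
v = np.array([0.04, 0.02])                     # current CIR factor levels
a, b = np.array([1.0, 1.0]), np.array([0.01, 0.0])

H = np.diag((a * np.sqrt(v) + b / np.sqrt(v)) ** 2)
Sigma = A @ H @ A.T                            # instantaneous covariance

print(np.allclose(A @ A.T, np.eye(2)))         # True
print(np.all(np.linalg.eigvalsh(Sigma) >= 0))  # True: positive semidefinite
```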
2.1.1. Separable Spillover Effect
In this section, we assume a convenient structure in the spillover matrix to obtain the c.f., and by doing so, we obtain another solvable case. Here, we further simplify the model by rewriting it in terms of n independent processes. We demonstrate this procedure by first writing Equation (3) in matrix form:
Multiplying both sides of Equation (4) by , we get
Suppose the matrix can be written as follows: , where is a diagonal matrix (i.e., its entries are the eigenvalues of ). Using this result, and applying a simple transformation , we arrive at a new mean-reverting process with diagonal matrix :
Each element of is a mean-reverting 4/2 stochastic volatility process, as in Escobar-Anel and Gong (2020). That is,
where , and are the entries of . Furthermore, is also a vector of independent processes.
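A small numerical check of this reduction, assuming (as our reading of the separable structure suggests) that the spillover matrix shares the orthogonal eigenvector matrix A, with all values made up:

```python
import numpy as np

# All values made up. If beta = A D A' with A orthogonal and D diagonal
# (negative eigenvalues, per the assumption above), then Z = A'Y has a
# decoupled, component-wise mean-reverting drift.
A = np.array([[0.8, -0.6],
              [0.6,  0.8]])
D = np.diag([-0.5, -2.0])
beta = A @ D @ A.T                            # separable spillover matrix

Y = np.array([1.0, 2.0])                      # current state, made up
Z = A.T @ Y                                   # transformed coordinates
print(np.allclose(A.T @ (beta @ Y), D @ Z))   # True: diagonal dynamics
```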
2.1.2. Model with No Spillover Effects
In this section, we assume no spillover effects among the assets (i.e., matrix
is diagonal). This further simplifies our model to
The corresponding matrix representation has the same form:
with
. The dynamics of log price
are then
2.2. Properties of the Variance Vector
We devote this subsection to exploring the properties of the variance vector. This is important for understanding the instantaneous volatilities implied by our model. Recall that empirical data is related to these volatilities; therefore, one should ensure that these implied processes reflect the stylized facts of the data they cater to.
Let denote the variance vector; by definition, this is . As defined before, is a vector of 4/2 processes (the sum of 1/2 and 3/2 processes). Therefore, can be written in terms of a linear combination of these two processes:
This model for the variance can be interpreted as a factor model with n 4/2 factors. Due to the popularity of factor models for explaining asset classes, it stands to reason that volatility indexes (these variances) can also be expressed in terms of factors, which could reflect intrinsic and systemic economic movements.
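Under the PCA structure above, this factor reading can be written, in our own notation, as:

```latex
% Our rendering of the factor structure of the variance vector, using
% \Sigma_t = A H_t A' with orthogonal A:
\begin{equation*}
  V_i(t) = (\Sigma_t)_{ii}
  = \sum_{j=1}^{n} A_{ij}^{2}
    \Bigl(a_j\sqrt{v_j(t)} + \frac{b_j}{\sqrt{v_j(t)}}\Bigr)^{2},
  \qquad i = 1,\dots,n .
\end{equation*}
```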
One can obtain the dynamics of more explicitly:
From the above stochastic differential equation (SDE), we are able to obtain the variance and covariance of the vector via quadratic variations. Note that these can be interpreted as the volatility of variance and the correlation among variances (co-volatility movement), respectively:
Equations (10) and (11) suggest that the instantaneous variance and covariance of are locally stochastic (i.e., driven by the same Brownian motions as the underlying).
3. Characteristic Functions and Approximations
In this section, we derive the c.f.s for the previously presented cases involving spillover effects and no spillover effects, using Proposition 2 and Corollary 1 from Cheng et al. (2019), in line with the approximation approach from Escobar-Anel and Gong (2020). In Escobar-Anel and Gong (2020), the authors obtained analytical approximations of the c.f.s for the special cases ; using results from Grasselli (2016). Taking advantage of the principal component structure of the model, we demonstrate that the c.f. representations boil down to a multiplication of one-dimensional approximations.
Next, we show the c.f. for the general model and its submodels described in Section 2, namely the model with general spillover effects (Section 3.1) and the model with separable spillover effects (Section 3.2). Then, in Section 3.3, we present the principle used to approximate the c.f.s.
3.1. Characteristic Function for Model with Spillover Effects
Let us first define such that is a matrix exponential; then is represented as
For convenience, we use as the -th component of the matrix . Note that is no longer a mean-reverting process; rather, it now involves time-dependent coefficients.
Corollary 1. Let evolve according to the model in Equation (12). The c.f. is then given as follows:
where , , and . Moreover, is a one-dimensional generalization of the c.f. from Grasselli (2016) provided in Lemma A1 of Cheng et al. (2019).
The proof follows as a direct application of the proof of Proposition 2 in Cheng et al. (2019).
3.2. Characteristic Function for Model with Separable Spillover Effects
To derive the c.f. of , we perform the transformation , recognizing that the c.f. of has been derived in Escobar-Anel and Gong (2020). Hence, the c.f. of is a product of the corresponding c.f.s of . The result is summarized in the following corollary.
Corollary 2. Let denote the characteristic function provided in Proposition 2.1 in Escobar-Anel and Gong (2020); then, the characteristic function of is given by the following equation:
where is a new vector of real numbers with element .
The proof is straightforward; using the relationship , we know that each individual process is a linear combination of , and therefore , processes. The product can be further written in terms of :
The independence property of random variables leads to Equation (13).
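In our own notation, the product representation described in this proof sketch is:

```latex
% Product representation (our notation): with Y(t) = A Z(t), A orthogonal,
% \tilde{u} = A'u, and the Z_j's independent,
\begin{equation*}
  \Phi_{Y(T)}(u) = \mathbb{E}\bigl[e^{\,i u' Y(T)}\bigr]
  = \mathbb{E}\bigl[e^{\,i (A'u)' Z(T)}\bigr]
  = \prod_{j=1}^{n} \Phi_{Z_j(T)}\bigl(\tilde{u}_j\bigr).
\end{equation*}
```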
3.3. Approximation Principle and Results
We have learned that the c.f. can be written in terms of a product of the c.f.s of n independent one-dimensional processes thanks to the principal component decomposition. These one-dimensional processes differ only in the structure of the matrix exponential term (i.e., ), which is deterministic, and they resemble the same process seen in Escobar-Anel and Gong (2020). Therefore, the principles used to approximate follow those adopted in Escobar-Anel and Gong (2020). In other words, we only need to calculate an approximation to the individual c.f. , and the approximation can be realized under the three scenarios described in Section 2: ; and the trivial case of .
cannot be solved in closed form due to the lack of a representation of the moment-generating function of an integrated Cox–Ingersoll–Ross (CIR) process with time-dependent integrands. Therefore, we propose an analytic function that approximates the unsolvable conditional expectation: for some complex constants m and n: , , and , for . We propose the following two approximations:
Midpoint: , .
Average: , .
The approximated conditional expectation is solvable, as it fits the framework of Grasselli (2016). We summarize the results in the following corollary for the general model with spillover effects, which includes separable spillover effects as a special case.
Corollary 3. Given deterministic functions and , defined in Lemma A1 in Cheng et al. (2019), and , defined in Corollary 1 for , can be approximated by analytic functions for constants and satisfying
under three scenarios:
: Given , and , . If , then
: Given and , . If , , then
Corollary 3 follows directly from Propositions 2.2 and 2.3 in Escobar-Anel and Gong (2020). The approximation for the c.f. when there are no spillover effects follows the same procedure as presented in Corollary 3. For the case when the spillover effects are separable, we obtain a sum of independent mean-reverting 4/2 stochastic volatility processes, as indicated in Equation (6). As a result, Propositions 2.2 and 2.3 in Escobar-Anel and Gong (2020) can be directly applied to approximate the c.f.s for these processes.
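The common idea behind both approximations can be stated generically, in our own notation: the deterministic, time-dependent integrand inherited from the matrix exponential is frozen at a constant so that the expectation falls within Grasselli's (2016) solvable framework.

```latex
% Generic statement of both approximations (our notation): the deterministic
% integrand f(s), inherited from the matrix exponential, is frozen at m.
\begin{equation*}
  \mathbb{E}\Bigl[\exp\Bigl(\int_t^T f(s)\, g\bigl(v(s)\bigr)\,ds\Bigr)\Bigr]
  \;\approx\;
  \mathbb{E}\Bigl[\exp\Bigl(m \int_t^T g\bigl(v(s)\bigr)\,ds\Bigr)\Bigr],
\end{equation*}
\begin{equation*}
  \text{Midpoint: } m = f\Bigl(\frac{t+T}{2}\Bigr), \qquad
  \text{Average: } m = \frac{1}{T-t}\int_t^T f(s)\,ds .
\end{equation*}
```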
4. Estimation
In this section, we consider the model with separable spillover effects as the underlying model for estimation. In this way, on the one hand, we fulfill the purpose of studying spillover effects among assets, and on the other hand, we avoid the complexity of a matrix exponential1. Recall that the model with separable spillover effects can be expressed as follows:
where is constructed such that it can be decomposed into a product of three matrices: . In this case, is a linear combination of independent processes , that is, with
For simplicity, we focus on two dimensions, hence studying pairs of assets with their respective volatility indexes, for example, VIX (VVIX) and VSTOXX (VVSTOXX), USO (OVX) and GLD (GVZ), or USO (OVX) and SLV (VXSLV). Then, we follow the same estimation procedure outlined in Escobar-Anel and Gong (2020), splitting the parameters into two groups: the volatility group and the drift group.
After a data description in Section 4.1, Section 4.2 estimates the volatility group parameters. Under the model with separable spillover effects, we first need to estimate the covariance matrix () from asset data as a long-term average of the SC matrix (). This permits us to produce an estimate of the constant eigenvectors, denoted as . With the estimated eigenvectors, we decompose our original asset processes into a sum of independent mean-reverting 4/2 models. The volatility group then consists of the parameters of the underlying CIR processes driving the principal components: . Section 4.3 tackles the estimation of the drift group parameters using least squares.
Volatility indexes are functions of implied volatility; they are model-free and directly calculated from option prices in the market. Here, we use volatility index data as a convenient proxy for instantaneous volatility. In fact, instantaneous volatility is nearly impossible to capture from empirical data, even with high-frequency data, as it requires instantaneous periods rather than the available discrete periods. On the other hand, once a model is specified, volatility indexes can be used to represent instantaneous volatility with some multiplicative (scaling) adjustment or factor; see, for example, Luo and Zhang (2012), Zhang and Zhu (2006) and references therein. The relationship between the instantaneous volatility and a volatility index, for example VIX, can be expressed in terms of a closed-form equation, where the difference between the two lies in a multiplicative factor. In a recent paper, Lin et al. (2017), the authors determine the connection between instantaneous and implied volatility, assuming Grasselli’s 4/2 model (Grasselli 2016) with jumps.
Due to the short horizon of volatility indexes (21-day options), the multiplicative factor could be close to one in a region of the parametric space, which implies that volatility indexes are almost equal to instantaneous volatilities regardless of the structural choice of the underlying model. As a precaution, and inspired by these pioneering works, we introduce scale parameters to adjust empirical volatility index data when estimating instantaneous volatilities. This is done such that the empirical means of the observed variance series () match the corresponding long-term asset variances. These new scaling parameters are estimated at an early stage and are methodologically independent of other parameters.
4.1. Data Description
We consider the following pairs of assets and volatilities: the first study is on VIX (S&P 500 Volatility Index) and VSTOXX (Euro STOXX 50 Volatility Index); here, we also use the volatility indexes VVIX and VVSTOXX, respectively. The second group comprises USO (Oil ETF) and GLD (Gold ETF), with OVX and GVZ as the respective volatility indexes. The third and final group is made up of SLV (Silver ETF) and GLD, with VXSLV and GVZ as the respective volatility indexes. All data sets are daily and cover the period from late 2010 to early 2020.
Our estimation method consists of two stages: we first estimate the parameters in the assets’ volatilities (called the volatility group), and then the parameters in the assets’ drifts (called the drift group). The sample size of the raw data differs across the assets and volatility indexes. Hence, we must further process the data to better suit our estimation purposes, in particular ensuring that we use only the trading days on which both the assets and their volatilities are observed.
Figure 1, Figure 2 and Figure 3 depict the pairs of asset data and their volatility indexes. Note that the volatility index data is quoted as annualized volatility multiplied by 100. When we use a volatility index for estimation, we transform it to daily volatility by dividing by .
4.2. Estimation of Volatility Group Parameters
The model used for estimation of the “volatility group” parameters in this section is the model with separable spillover effects, described next for completeness.
Recall that Equation (6) gives us the representation of each principal component of our mean-reverting 4/2 stochastic volatility model; that is, , with defined as
The estimation procedure for this model setup can be summarized as follows. We first transform the data using matrix to produce the process, following the relationships among , and . Then, we use the estimation method developed in Escobar-Anel and Gong (2020) for each process. Finally, we recover the parameters for each process.
4.2.1. Estimation of Matrix and the Scaling Parameters S
In the next sections, we estimate the parameters in the volatility group. The empirical results are summarized in Table 1. The first step is to estimate matrix , as it connects the log asset prices and the principal components . Recall that is an orthogonal matrix comprising the eigenvectors of the covariance matrix . Given daily data, we estimate by first calculating the empirical covariance matrix and then applying an eigenvalue decomposition:
where is a vector of the eigenvalues of , and is the estimate of matrix . In Table 1, we include the results for , and the eigenvalues from the empirical data. Note that is not unique, in that the signs of each element in the matrix can be manipulated such that the column vectors are still eigenvectors for the corresponding eigenvalues, while preserves its orthogonality.
As mentioned at the beginning of this section, volatility indexes are a useful proxy for instantaneous volatility; however, they may require a scaling adjustment. Let denote the squared observed volatility index data for n assets. We introduce a scale parameter to bridge the observed volatility index series to theoretical variances via the following relationship:
where is a diagonal matrix with diagonal vector . In theory, is a function of , see Luo and Zhang (2012) and Zhang and Zhu (2006), some of which fall into the volatility group and are to be estimated. Hence, devising an estimate that does not depend on these parameters is crucial. We propose an estimate that matches the long-run first empirical moment of both sides of Equation (16). The long-term average of the left-hand side of Equation (16) can be directly calculated from squared volatility index data. The long-term average of the right-hand side may not seem as straightforward, as it deals with a stochastic process. In our definition, is the diagonal of the covariance matrix and refers to the instantaneous variance process. Therefore, in the long run, the expectation of the variance process should converge to the variance of the underlying asset. Let denote the empirical long-term variance of asset i, and let denote the long-term average of the corresponding squared volatility index data. We estimate as
Substituting Equation (18) and back into the right-hand side of Equation (17), the long-term average is
which matches the left-hand side of Equation (17) in terms of the long-term average, as is the eigenvalue of and converges to in the long run.
Table 1 also displays the results for , and .
4.2.2. Estimation of Volatility Group
Let , and let the j-th eigenvalue of be defined as . Then, is represented by according to Equation (17). Suppose is a series of squared volatility indexes for asset i observed on ; then, at time , we have .
In theory, we expect the series to be non-negative, since it is related to the series of instantaneous variances for the underlying assets. In practice, however, we observe inconsistencies in some cases. For example, as Figure 4a illustrates, has a number of negative values (labeled “V2”), which are non-negligible. Next, we perform a preliminary analysis to locate the root of the issue.
To solve the problem of inconsistency without modifying our model, we treat the negative values as if they were missing values. We thus replace the negative values by the weekly averages centered on those negative values. Figure 4b illustrates the series of and after this modification. Furthermore, Figure 5 presents two series: and after transforming the original OVX and GVZ data. In this case, we do not observe the inconsistency shown in Figure 4b, which means that the data supports our model. The figure also illustrates the expected trend, with the first principal component generating the largest variation () in asset price compared to the second principal component ().
In Figure 6a, we also observe some inconsistency in the silver ETF (SLV) and gold ETF (GLD) data between 2011 and 2012. Given that the correlation between SLV and GLD is large, the series stays close to 0, which implies that the two assets are likely driven by the same random factor. Since the negative values do not appear as often as in Figure 4a and are close to 0, we simply take the absolute value of the negative values and show the modified series in Figure 6b.
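The two data-cleaning fixes described above can be sketched as follows (our rendering; the window convention is an assumption):

```python
import numpy as np

def fill_negatives_weekly(series, half_window=2):
    """Treat negative entries as missing and replace each by the average of
    a centered 5-day window (our rendering of the 'weekly average' fix)."""
    out = np.asarray(series, dtype=float).copy()
    for t in np.where(out < 0)[0]:
        lo, hi = max(0, t - half_window), min(len(out), t + half_window + 1)
        window = np.delete(out[lo:hi], t - lo)   # drop the bad point itself
        out[t] = window[window >= 0].mean()      # mean of valid neighbors
    return out

def fill_negatives_abs(series):
    """For rare negatives close to 0 (the SLV/GLD case): absolute value."""
    return np.abs(np.asarray(series, dtype=float))
```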
Now that we have prepared all the data for estimation, we apply the estimation method developed in Escobar-Anel and Gong (2020) to and to estimate and . Note that, in all three scenarios, the minimum of is approximately zero, which implies that is 0, as seen from Figure 4b, Figure 5 and Figure 6. Therefore, it is sufficient to model as a CIR (1/2) process instead of a 4/2 process. On the other hand, the “spikes” that occur frequently in (labeled “V1” in the figures) are signals that should be a 4/2 process for all three pairs of asset-and-volatility-index data. Since we assume that follows a CIR process, we estimate using maximum likelihood.
Table 2, Table 3 and Table 4 list the estimated parameters and their standard errors (s.e.s) for the volatility group parameters with the chosen data sets.
The inference on the parameters (asymptotic mean and variance) is performed via parametric bootstrap. In other words, we simulate the corresponding processes with the estimated parameters 1000 times and repeat the estimation procedure for each simulation. In the end, we obtain a pool of 1000 sets of estimates. The law of large numbers suggests that the means calculated from this pool are the asymptotic means of each estimator. It is interesting to see that the first principal component not only accounts for most of the variation, but also absorbs the complexity of the problem. In other words, the tables show that the first component requires the advanced 4/2 modeling (i.e., ), while the second component can be better explained with the simpler 1/2 model ().
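A minimal sketch of this estimation-and-bootstrap step, assuming the standard noncentral chi-square transition law of the CIR process (our code, not the authors'):

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import ncx2

def cir_neg_loglik(params, v, dt):
    """Exact CIR transition likelihood via the noncentral chi-square law."""
    kappa, theta, xi = params
    if min(kappa, theta, xi) <= 0:
        return np.inf
    c = xi ** 2 * (1 - np.exp(-kappa * dt)) / (4 * kappa)
    df = 4 * kappa * theta / xi ** 2
    nc = v[:-1] * np.exp(-kappa * dt) / c
    return -(ncx2.logpdf(v[1:] / c, df, nc) - np.log(c)).sum()

def fit_cir(v, dt=1.0 / 252, start=(2.0, 0.04, 0.3)):
    return minimize(cir_neg_loglik, start, args=(v, dt),
                    method="Nelder-Mead").x

def bootstrap_se(params, n_obs, dt=1.0 / 252, n_boot=1000, seed=0):
    """Parametric bootstrap: simulate with fitted parameters, re-fit,
    repeat (1000 draws in the text; reduce for a quick run)."""
    rng = np.random.default_rng(seed)
    kappa, theta, xi = params
    c = xi ** 2 * (1 - np.exp(-kappa * dt)) / (4 * kappa)
    df = 4 * kappa * theta / xi ** 2
    draws = []
    for _ in range(n_boot):
        v = np.empty(n_obs)
        v[0] = theta
        for t in range(1, n_obs):   # exact transition simulation
            v[t] = c * rng.noncentral_chisquare(df, v[t - 1] * np.exp(-kappa * dt) / c)
        draws.append(fit_cir(v, dt))
    return np.asarray(draws).std(axis=0)
```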
4.3. Estimation of Drift Group
Similarly, we use the least-squares approach to estimate the parameters in the drift group. Table 5, Table 6 and Table 7 display the results. Some parameters are assessed to be non-significant based on their p-values. We decided to keep all the parameters because our sample sizes are not large enough to draw concrete conclusions on the significance of the parameters.
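As a rough illustration of this least-squares step, one can regress the discretized increments of a transformed component on its level; this toy sketch ignores the 4/2 drift corrections and is only indicative:

```python
import numpy as np

def fit_drift_ols(Z, dt=1.0 / 252):
    """Toy least squares for one transformed component: discretize
    dZ = (c + d Z) dt + (vol) dB and regress dZ on (1, Z); d plays the
    role of the (negative) eigenvalue of the spillover matrix."""
    dZ = np.diff(Z)
    X = np.column_stack([np.ones(len(Z) - 1), Z[:-1]]) * dt
    (c_hat, d_hat), *_ = np.linalg.lstsq(X, dZ, rcond=None)
    return c_hat, d_hat
```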
Note that the estimated parameters reported in the tables are for the parameters of the and processes. We can recover the estimates for the original parameters using the relationships we defined earlier; that is, , , and . The estimates for the original parameters are reported in Table 8. The diagonal entries in the matrices provide information on the mean-reverting speed for all the assets. Note that for the pair SLV and GLD, one of the eigenvalues () of does not satisfy the assumption imposed on the eigenvalues of , which is a sign that the data does not support this particular model. The correlation coefficients are not included because they are not affected by the transformation.
It is worth noting that in Table 8 does not reflect the actual mean-reverting level. Therefore, to determine the mean-reverting level for each asset, we must go back to Equation (4) and rewrite it in the following format:
We can now see that the mean-reverting level is plus a random component, which we define as . The long-term mean indicated by the model is basically . We report these estimates in Table 9 and compare them with the averages calculated from empirical data.
As Table 9 shows, the estimated mean-reverting levels (MRLs) are close to the empirical log-price averages, except for the USO case, where the estimated mean is smaller than the empirical mean. This last point might be due to the impact of the initial value on the stationary value of a 4/2 process. Moreover, the VIX and VSTOXX pair has the largest mean-reverting speed compared to the two commodity ETF pairs. This is not a surprise, as evidenced by the empirical data: volatility indexes tend to return to the mean faster due to economic cycles, whilst commodities normally take longer to revert to their mean level due to scarcity, demand and supply.
5. Application to Risk Measures
Risk measures in financial risk management are used to determine the minimum amount of capital to be kept in reserve for worst-case scenarios as a way of protecting financial institutions. There are many risk measures in the literature, see, for example, Artzner et al. (1999) and McNeil et al. (2005), one of which is considered fundamental: value-at-risk (VaR), a distribution-based risk measure. In other words, a VaR calculation takes into account the distribution of the underlying (VaR is in fact a quantile), making it more robust to outliers than the mean and variance.
In this section, we compute the VaR of a portfolio consisting of two assets and a cash account, in line with the previous estimation section. We must first find the distribution of this portfolio, which might not be available due to, in particular, the correlations among the underlyings. In the language of mathematical statistics, we must find the joint distribution of USO and GLD to compute VaR. In general, finding closed-form expressions for the joint distribution of two non-Gaussian stochastic processes is theoretically difficult. In fact, USO and GLD have complex distribution functions under our multidimensional 4/2 model setting. Fortunately, this is feasible in our model, as we can express the joint distribution at any given date of USO and GLD in terms of two independent random variables, which simplifies our problem significantly and allows for the use of c.f.s to compute the properties of the portfolio distribution.
5.1. Portfolio Setup
Suppose that we have a portfolio consisting of two assets and :
where and represent the weights of and in the portfolio, and is a cash account with interest rate r. Over a short period of time, we can also write the problem using the self-financing condition and relative portfolio weights , and ; that is, the proportions allocated to the assets and the cash account; see Campbell et al. (1997):
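A standard self-financing relation of the kind cited here, written in our own notation, reads:

```latex
% Standard self-financing dynamics under relative weights (our notation):
\begin{equation*}
  \frac{dV(t)}{V(t)}
  = \pi_1\,\frac{dS_1(t)}{S_1(t)} + \pi_2\,\frac{dS_2(t)}{S_2(t)} + \pi_B\,r\,dt,
  \qquad \pi_1 + \pi_2 + \pi_B = 1 .
\end{equation*}
```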
Constant allocations will be considered (i.e., constant ), as they constitute the most popular investment strategy in the market, supported by Merton (1975). From the process , we can easily obtain by using Ito’s lemma. When comparing and , we observe that only the drift term is adjusted, while the diffusion terms stay the same. Assuming that are modeled by Equation (2), the log prices then have the SDE specified in Equation (4) under the PCSV framework. Moreover, we can also write in terms of :
Hence, we rewrite Equation (20) as follows:
It is known that and are linear combinations of two independent stochastic processes or random variables and :
We now substitute and in Equation (21) with Equations (22) and (23):
From Equation (24), we can conclude that is also a linear combination of and , with an adjustment to the drift terms, which does not affect the independence of and . We organize Equation (24) into the following expression:
where and are independent, with
where , , , , and .
Note that, for convenience, we included the growth rate of the cash account in the long-term average of . From a mathematical perspective, is constructed based on , which is the first principal component and determines most of the variation among the assets. As a result, affects the performance of more than financially. For this reason, the effect of the growth rate on the portfolio can also be interpreted as impacting the long-term average of . Now, we apply Ito’s lemma to Equation (25) to obtain the dynamics of :
where
It is straightforward to find the characteristic function of using the above-mentioned results.
5.2. The Density Function of the Portfolio
From Equation (26), our portfolio now essentially contains two new “assets” that are independent of each other. Thanks to this independence, we can derive the characteristic function as well as the density function of our portfolio. Since our goal is to calculate the VaR, it is convenient to use a density function and integrate numerically. In this section, we list two approaches to obtain such a density function.
5.2.1. Density Function via Convolution
One way to obtain the conditional density function for is via the convolution of the two conditional density functions of and . In probability, if two random variables X and Y are independent, with density functions and , respectively, then the density of , , can be found via convolution; that is,
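The textbook identity being invoked is:

```latex
% Convolution of densities of independent random variables:
\begin{equation*}
  f_{X+Y}(z) \;=\; \int_{-\infty}^{\infty} f_X(x)\, f_Y(z - x)\, dx .
\end{equation*}
```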
If X and Y have analytical density functions, then the convolution method is straightforward. In our case, we obtain the conditional c.f. first. The Fourier inversion of the conditional c.f. theoretically gives the density function. However, due to the structure of our original c.f. and the approximations, we need to invert both the original c.f. and the approximated c.f.s numerically to obtain the corresponding density functions. For and , we can obtain their corresponding conditional density functions and by inverting the c.f.s of and :
We can now write Equation (32) as a convolution of Equations (28) and (29):
A challenging part of this method is that we must first invert the semi-closed c.f.s to obtain the density functions for and (artificial assets), which involves approximations. As we have well-developed approximation approaches for , we can apply the results to obtain an analytic function as an approximation of the c.f. of each individual artificial asset and then find the density via Fourier inversion. Then, we can use Equation (30) to obtain the density of the portfolio. The approximation approaches work well in a parametric region, as demonstrated in Escobar-Anel and Gong (2020), for three scenarios (); the goodness of the approximations depends on .
5.2.2. Density Function via Fourier Inversion
Another, more direct means of obtaining the density function is to apply an inverse Fourier transform to the characteristic function. Before we provide the formula for the characteristic function, we consider the transformation . By Ito’s lemma, we have
The next corollary explains how to derive the characteristic function.
Corollary 4. Let denote the characteristic function provided in Proposition 2.1 in Escobar-Anel and Gong (2020); then, the characteristic function of is given by the following equation:
where ,
Let p denote all possible values in the domain of ; the density function then follows from
It can be seen that the c.f. of the portfolio does not have a closed-form representation, since it is a product of semi-closed c.f.s (). As a result, we could use the approximations developed in Section 3.3.
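A generic numerical-inversion sketch (our code; any callable characteristic function can stand in for the portfolio c.f.):

```python
import numpy as np

def density_from_cf(cf, p_grid, u_max=50.0, n_u=4096):
    """Numerical Fourier inversion f(p) = (1/2pi) Int e^{-iup} cf(u) du,
    via the trapezoidal rule; cf is any callable characteristic function."""
    u = np.linspace(-u_max, u_max, n_u)
    du = u[1] - u[0]
    integrand = np.exp(-1j * np.outer(p_grid, u)) * cf(u)
    f = (integrand[:, :-1] + integrand[:, 1:]).sum(axis=1) * du / 2
    return f.real / (2 * np.pi)

# Sanity check on a known case: standard normal c.f. exp(-u^2 / 2).
p = np.linspace(-4, 4, 81)
f = density_from_cf(lambda u: np.exp(-u ** 2 / 2), p)
print(np.max(np.abs(f - np.exp(-p ** 2 / 2) / np.sqrt(2 * np.pi))))  # tiny
```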
5.2.3. Numerical Implementation of Selected Method
The Fourier inversion method and the convolution method theoretically yield the same density function for the portfolio. Moreover, for a portfolio that consists of only two (artificial) risky assets, both methods are straightforward to implement. We implement the convolution method with partial simulation for applications with only two assets. However, it would be more efficient to use the Fourier inversion method when the portfolio has a large pool of assets (e.g., over 100). To see this, note that, for n assets, the convolution method involves the simulation of n processes together with the associated convolution integrations; in contrast, the Fourier inversion method reduces the number of integrations to just one.
We summarize the numerical implementation to compute the conditional density function of the portfolio in the following steps:
Step 1: Simulate two CIR processes and and compute for and .
Step 2: Invert the c.f.s obtained in Step 1 to obtain and .
Step 3: Numerically integrate the product of the conditional densities of and to obtain the conditional density function of .
Even though we use partial simulation to obtain the density function of the portfolio, partial simulation is not time-consuming, as efficient methods exist to simulate CIR processes; see, for example, Andersen (2007). Moreover, both the convolution method and the direct Fourier inversion method require either fewer simulations or no simulations (via approximations) compared to a full simulation approach, which would require the simulation of four processes. In the case where semi-closed c.f.s are involved, we would only need to simulate at most processes ( for process), instead of simulating all the processes together ( processes). Most importantly, thanks to the PCA decomposition, we would likely need only m such volatility drivers to explain the SC of n assets, with m < n. This means a substantial reduction in computational complexity (partial simulations, integrations or approximations).
In summary, under a PCSV framework, partial simulation is a good choice in terms of efficiency. The PCA reduces computational complexity, as fewer diffusions may be required to explain the variation of all assets. Our approximations further improve the efficiency for computing the c.f.s with analytic functions.
In the next application section, we apply the convolution method from Section 5.2.1 to compute the VaR at the popular quantile ().
5.3. The VaR for a Portfolio of USO and GLD
In this section, we consider a pair of risky assets, USO and GLD, and we study under an investment strategy of equally weighted risky assets only ()2. This has been proven to be a robust and reliable strategy in the seminal work of DeMiguel et al. (2009). In Table 10, we report the values and their s.e.s. We consider a well-known asymptotic result for quantiles to calculate the s.e.s for , as derived in Stuart et al. (1994):
where is the quantile of the portfolio distribution (in this case, it is ); n is the sample size, and is the probability density function evaluated at .
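The classical result being referred to, in our own notation for the sample q-quantile with density f, is:

```latex
% Asymptotic standard error of the sample q-quantile \hat{x}_q,
% with density f and sample size n (our notation):
\begin{equation*}
  \mathrm{s.e.}\left(\hat{x}_q\right)
  \;\approx\; \sqrt{\frac{q\,(1-q)}{n\, f(x_q)^{2}}}\,,
\end{equation*}
% with q = 0.05 when VaR at the 95% level is read off the lower tail
% of the portfolio distribution.
```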
Case:
In this case, we use the information from Table 1b, Table 3 and Table 6 to obtain parameters that generate or . Figure 7 and Figure 8 confirm that the density functions from theory are in line with the simulations. We compute and compare values from four sources: simulation of the portfolio (Simulation), the density function without approximation (Density w/o Approximation), the approximated density function using the midpoint approach (Approx. Density (M)) and the approximated density function using the average approach (Approx. Density (A)).
We use linear interpolation here to calculate the quantile if falls between two critical levels calculated from the histogram and density functions. Standard errors are reported in parentheses.
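A minimal sketch of this quantile calculation from a numerically obtained density, with the interpolation step made explicit (our code, hypothetical grids):

```python
import numpy as np

def var_from_density(p_grid, density, q=0.05):
    """q-quantile of the portfolio read off a density evaluated on a grid,
    inverting the numerical CDF with linear interpolation."""
    cdf = np.cumsum(density) * (p_grid[1] - p_grid[0])
    cdf /= cdf[-1]                       # normalize numerical mass to 1
    return np.interp(q, cdf, p_grid)     # linear interpolation between levels

def quantile_se(q, n, f_at_q):
    """Asymptotic s.e. of the sample q-quantile (Stuart et al. 1994-type)."""
    return np.sqrt(q * (1 - q) / n) / f_at_q
```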
6. Conclusions
This paper studied the properties of a multivariate mean-reverting 4/2 stochastic volatility model based on principal component decomposition. In particular, we studied the variance and covariance processes as well as several submodels of interest to the industry (e.g., separable or no spillover effects, multivariate mean-reverting Heston models). We also obtained an expectation representation for the c.f. of the asset prices with respect to the paths of the stochastic volatility process. Two closed-form approximations to the c.f. are presented in Section 3; these are the first efficient calculations of c.f.s for multivariate mean-reverting stochastic covariance models. In Section 4, we implemented a two-step estimation methodology on three sets of data involving two asset classes, commodities and volatility indexes. The study confirms that stylized facts commonly attributed to commodities, like spillover effects, are also observed in the joint dynamics of volatility indexes, which has not been previously reported; it also displays the role of, and the need for, scaling parameters between instantaneous variances and volatility indexes in a multidimensional setting.
In Section 5, we further tested our approximation methods in a risk management setting by computing one of the most popular risk measures, namely, VaR. Since VaR is a distribution-based risk measure, our analysis confirms the effectiveness of the c.f. approximations in a multidimensional setting for a portfolio of advanced stochastic processes. Although our analysis was in two dimensions, the methodology is transferable to any dimension, e.g., a portfolio with a large number of underlying assets. In such a case, the average-based approximation can greatly reduce the time needed to calculate distribution-based risk measures; the alternative is the Monte Carlo simulation of a high number of continuous-time processes, with the subsequent loss in precision.