Comparing Two Different Option Pricing Methods

Alessandro Bondi; Dragana Radojičić; Thorsten Rheinländer

doi:10.3390/risks8040108

¹ Classe di Scienze, Scuola Normale Superiore di Pisa, 56126 Pisa, Italy

² TU Wien, Institute of Statistics and Mathematical Methods in Economics, 22180 Vienna, Austria

* Author to whom correspondence should be addressed.

Risks 2020, 8(4), 108; https://doi.org/10.3390/risks8040108

This article belongs to the Special Issue Interplay between Financial and Actuarial Mathematics

Abstract

Motivated by new financial markets where there is no canonical choice of a risk-neutral measure, we compared two different methods for pricing options: calibration with an entropic penalty term and valuation by the Esscher measure. The main aim of this paper is to contrast the outcomes of those two methods with real-traded call option prices in a liquid market such as the NASDAQ stock exchange, using data from the period 2019–2020. Although the Esscher measure method slightly underperforms the calibration method in terms of absolute values of the percentage difference between real and model prices, it could be the only feasible choice if there are not many liquidly traded derivatives in the market.

Keywords:

geometric Esscher measure; calibration with entropic penalty term; financial markets; option pricing

1. Introduction

Empirical studies show that real financial markets are incomplete: not every derivative security, such as an option, can be replicated by some initial capital plus the value process of a self-financing trading strategy, so these securities carry unhedgeable and undiversifiable risks. The second fundamental theorem of asset pricing implies that in incomplete markets there typically exist several martingale measures consistent with the no-arbitrage principle that can be chosen as pricing measures (see, e.g., Jeanblanc et al. 2009; Shreve 2004). In fact, the range of option prices spans the whole interval between the infimum and the supremum of the expected values with respect to martingale measures, and is therefore too large for practical purposes; see (Eberlein and Hammerstein 2004, Theorem 11.55).

Therefore, it makes sense to select a particular martingale measure for pricing purposes. In this study, we introduce, discuss, and compare two methods for the simulation of European call option prices: one based on the Esscher martingale measure and the other on the calibration to real-traded securities with an entropic penalty term. We implemented them in the programming language R and contrasted the simulated derivative prices with real data retrieved from NASDAQ option chains. The objective of the paper is to provide a good fit between real and simulated contingent claim prices, thereby suggesting a choice of risk-neutral pricing measure in incomplete markets. To this end, we constructed a model rich enough to describe stock prices, yet feasible in practice and connected to option prices. In this context, it is natural to exogenously specify different, suitable underlying asset price dynamics on a case-by-case basis.

The Esscher transform, invented by the Swedish actuary F. Esscher and introduced in Esscher (1932), is a time-honored tool in actuarial science for approximating the distribution of the aggregate claims of a portfolio. It consists of an exponential tilting procedure. More recently, it has been used as a premium calculation principle, see Van Heerwaarden et al. (1989), in option pricing, see Gerber and Shiu (1994), and also in pricing defaultable assets, see Lee and Rheinländer (2012). In our financial setting, the agent has the option to invest in a financial market via admissible strategies. Hereby we exclude strategies leading to arbitrage opportunities. The corresponding theory has been developed by Kallsen and Shiryaev, see Kallsen and Shiryaev (2002). In Benth and Schmeck (2014), the authors used an Esscher transform incorporated in a two-factor model to derive futures prices in electricity markets. Furthermore, pricing methodology for weather derivatives in a specific two-factor model is established in Hell et al. (2012).

The idea of calibration is to consider a set of liquid option prices, and to find a market martingale measure such that the expected values of the option payoffs are as close as possible to the observed prices. This approximation problem is typically ill-posed, and one has to include a regularization term; we opted for an entropic penalty term in this regard. A general exposition of the calibration method can be found in Cont and Tankov (2004a).

Section 2 and Section 3 are each divided into a theoretical and a numerical part. The second section is devoted to the geometric Esscher measure and Laplace cumulant processes. In this part, we model the log–prices of the stock with a COGARCH process (introduced in Klüppelberg et al. (2004)) and present a sufficient condition for the existence of such a measure. Assuming the driving Lévy process to have an NIG (normal inverse Gaussian) distribution, which is infinitely divisible and thus corresponds to a Lévy process, we explain the original procedure, relying on the canonical representation theorem for semimartingales, that we follow to simulate the paths of the stock prices under the martingale measure. We point out that, as Lemma 1 shows, the choice of the NIG distribution in the COGARCH model is crucial, since it allows for generating the risk-neutral trajectories of the underlying asset price.

In the third section, the calibration method is explained; using the fundamental concepts of relative entropy of distributions and Lévy processes on the Skorohod space (presented in Appendix E), we chose a martingale measure under which the dynamics of the stock prices follow an exponential Lévy process. In this part we mainly follow Cont and Tankov (2004a), even if the mathematical structure was refined and generalized, and the numerical approximation machinery was tailored to our goals.

In order to make the readers familiar with the theory and the notation behind the main concepts of these two methods, in the appendix we also present a construction of the NIG distribution and some essential results on Lévy processes, highlighting the connection with the characteristics of semimartingales.

The main point of the present paper is as follows: if there are many liquid options around, like plain vanilla calls and puts on liquid stocks, we would expect the calibration method to perform best; therefore, we chose it as the benchmark method. However, in new financial markets, like insurance derivatives, cryptocurrencies, energy, or electricity markets, there are often only a few derivatives available, or none at all, or the underlying (like electricity) is difficult to trade since it is not storable. In these cases the calibration method is not implementable, and there is not much choice other than to use the Esscher method, which has been done, e.g., in the book by Benth, Benth, and Koekebakker about electricity and related markets, see Benth et al. (2008). Besides that, there are few, if any, studies about the practical implementation of hedging strategies in incomplete markets; therefore, our study fills a gap in this direction. Our main result is thus that, while the pricing method based on the Esscher martingale measure does underperform the calibration method in a liquid market, as is to be expected, it does so by less than 5%. So it might be a feasible choice of method in new financial markets.

2. Esscher Measure Method

In this section, the dynamics of the stock prices are introduced and sufficient conditions for the existence of the Esscher measure, which is used to simulate European call option prices, are established. We refer to Appendix D for a brief summary of the theory that is necessary to construct the geometric Esscher measure. In the following, we use notation that can be found, e.g., in the books (Rheinländer and Sexton 2011; Shiryaev and Jacod 2003), as well as in the references therein.

2.1. The Model

On a probability space $(Ω, F, P)$ we introduce a driving, $R$-valued Lévy process $L = {\{L_{t}\}}_{t}$ with generating triplet $(σ^{2}, ν, γ_{h})$ with respect to a truncation function h (see Appendix B and Appendix C). We endow this space with $F$, the augmented filtration of L. Define the dynamics of the spot prices by the process $S = {\{S_{t}\}}_{t}$, where

S : = S_{0} exp (G), S_{0} \in R^{+},

with the log-prices $G = {\{G_{t}\}}_{t}$, which follow a COGARCH process, introduced in Klüppelberg et al. (2004):

d G_{t} = σ_{t -} d L_{t}, G_{0} = 0 .

(1)

The volatility process

σ^{2} = {\{σ_{t}^{2}\}}_{t}

is given by

σ_{t}^{2} : = (k \int_{0}^{t} e^{X_{s}} d s + σ_{0}^{2}) e^{- X_{t -}}, t \geq 0,

(2)

with

σ_{0}^{2} \in R^{+}

and an adapted, right-continuous with left limits (RCLL) process

X = {\{X_{t}\}}_{t}

defined by

X_{t} : = η t - \sum_{0 < s \leq t} log [1 + Φ {(Δ L_{s})}^{2}], t \geq 0,

where $k > 0, η > 0, Φ \geq 0$. We recall that a process is said to be RCLL if it has, almost surely, right-continuous trajectories with left limits. Although $σ^{2}$ is a left-continuous process, as shown by (2), we write $σ_{-}$ to highlight that we are working with a process which is adapted and left-continuous, and therefore predictable.
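As a rough illustration of (1) and (2), the following Python sketch evaluates the variance and the log-price at the jump times of the driver. It assumes, purely for illustration, that L has finitely many jumps on the time horizon and no drift or Gaussian part (e.g., a compound Poisson approximation), and that $X_{t} = η t - \sum_{0 < s \leq t} log [1 + Φ {(Δ L_{s})}^{2}]$. Under these assumptions, (2) gives $d σ_{t}^{2} = (k - η σ_{t}^{2}) d t$ between jumps, and $σ^{2}$ is multiplied by $1 + Φ {(Δ L)}^{2}$ right after a jump. The function name and its inputs are hypothetical and are not taken from the R implementation used in this paper.

```python
import math


def cogarch_at_jumps(jump_times, jump_sizes, k, eta, phi, sigma0_sq):
    """Evaluate (t, sigma_t^2, G_t) at each jump time of the driver.

    Assumption: the driver has only the listed jumps (increasing times),
    so the log-price G moves only at those times.
    """
    var = sigma0_sq      # current variance sigma^2
    t_prev = 0.0
    log_price = 0.0      # G_0 = 0
    path = []
    for t, jump in zip(jump_times, jump_sizes):
        # Between jumps: d sigma^2 = (k - eta * sigma^2) dt, solved exactly.
        var = k / eta + (var - k / eta) * math.exp(-eta * (t - t_prev))
        # Delta G_t = sigma_{t-} * Delta L_t, with the left-continuous variance.
        log_price += math.sqrt(var) * jump
        path.append((t, var, log_price))
        # Right after the jump the variance is scaled by 1 + phi * (Delta L)^2.
        var *= 1.0 + phi * jump ** 2
        t_prev = t
    return path


# Illustrative (made-up) inputs: two jumps of size +0.1 and -0.1.
example = cogarch_at_jumps([1.0, 2.0], [0.1, -0.1],
                           k=0.04, eta=2.0, phi=0.5, sigma0_sq=0.02)
```

The spot price at a jump time is then recovered as $S_{0} exp (G_{t})$. For an infinite-activity driver such as the NIG process, a time grid or a small-jump truncation would be needed instead.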

For an adapted process

r = {\{r_{t}\}}_{t}

modeling the interest rate, the discounted spot price process is given by

\tilde{S} = {\{{\tilde{S}}_{t}\}}_{t}

, where

{\tilde{S}}_{t} : = exp (- \int_{0}^{t} r_{s} d s) S_{t}, t \geq 0

. Thus, we consider the process

G^{'} = {\{G_{t}^{'}\}}_{t}

, where

G_{t}^{'} : = - \int_{0}^{t} r_{s} d s + G_{t}

for every

t \geq 0

, whose dynamics follow

d G_{t}^{'} = - r_{t} d t + d G_{t}

. At this point, we can express

\tilde{S} = S_{0} exp (G^{'})

and we want to find a process

θ \in L (G^{'})

such that

\tilde{S}

is a

P^{θ}

–local martingale:

P^{θ}

denotes the geometric Esscher measure, or Esscher martingale transform, for the exponential process

\tilde{S}

(see Appendix D).

Remark 1.

From (1), it is clear that G jumps at the same times as L does, and

Δ G_{t} = σ_{t -} Δ L_{t}

for every

t \geq 0

. For

t \in R^{+}

and

ω \in Ω

, the measure associated with its jumps is

\begin{matrix} μ^{G} (ω; (0, t] \times A) & = \sum_{s \leq t} 1_{\{x \neq 0\} \cap A} (Δ G_{s} (ω)) = \sum_{s \leq t} 1_{\{x \neq 0\}} (Δ L_{s} (ω)) 1_{{(σ_{s -} (ω))}^{- 1} A} (Δ L_{s} (ω)) \\ = μ^{L} (ω; (0, t] \times {(σ_{-} (ω))}^{- 1} A), A \in B (R) . \end{matrix}

Furthermore

μ^{G^{'}} = μ^{G}

, hence

F_{(t, ω)}^{G^{'}} (d y) = ν {(σ_{t -} (ω) \cdot)}^{- 1} (d y)

, meaning that

F_{(t, ω)}^{G^{'}} (A) = \int_{R} 1_{A} (σ_{t -} (ω) y) ν (d y), A \in B (R) .

Theorem 1.

Let

θ \in L (G^{'})

be such that

θ \cdot G^{'}

is exponentially special and

Z^{θ}

is a uniformly integrable martingale. If also

(θ + 1) \cdot G^{'}

is exponentially special and θ satisfies

(θ_{t} + \frac{1}{2}) σ_{t -}^{2} σ^{2} - r_{t} + σ_{t -} γ_{h} + \int_{R} (e^{(θ_{t} + 1) σ_{t -} y} - e^{θ_{t} σ_{t -} y} - σ_{t -} h (y)) ν (d y) = 0

(3)

for every

t \geq 0

, then

\tilde{S}

is a

P^{θ}

-local martingale.

Note that

G^{'}

is a 1-dimensional process, thus the geometric Esscher measure is unique.

Proof.

Let

\tilde{G^{'}} {(h)}_{t} : = \sum_{s \leq t} [Δ G_{s}^{'} - h (Δ G_{s}^{'})] = {((x - h (x)) 🟉 μ^{G})}_{t}, t \geq 0

and

G^{'} {(h)}_{t} : = G_{t}^{'} - \tilde{G^{'}} {(h)}_{t}, t \geq 0

. By Remark 1, we get

\begin{matrix} d G^{'} {(h)}_{t} & = d G_{t}^{'} - d \tilde{G^{'}} {(h)}_{t} = - r_{t} d t + d G_{t} - (x - h (x)) μ^{G} (d t, d x) \\ = - r_{t} d t + σ_{t -} d L_{t} - (σ_{t -} y - h (σ_{t -} y)) μ^{L} (d t, d y) . \end{matrix}

Taking into account Lemma A1 in Appendix D, which provides

δ^{L} {(σ_{-})}_{t} = σ_{t -} γ_{h} + \int_{R} σ_{t -} (y - h (y)) ν (d y), t \geq 0,

and considering that

{(σ_{-} \cdot L)}^{c} = σ_{-} \cdot L^{c}

and

⟨σ_{-} \cdot L^{c}, σ_{-} \cdot L^{c}⟩ = \int σ_{-}^{2} σ^{2} d t

, we obtain the characteristics of

G^{'}

under P with respect to h:

\{\begin{matrix} b_{t}^{G^{'}} = - r_{t} + σ_{t -} γ_{h} + \int_{R} (h (σ_{t -} y) - σ_{t -} h (y)) ν (d y) \\ c_{t}^{G^{'}} = σ_{t -}^{2} σ^{2} \\ F_{t}^{G^{'}} (d y) = ν {(σ_{t -} \cdot)}^{- 1} (d y) \end{matrix}, t \geq 0 .

(4)

Thus, if

θ

satisfies

\tilde{κ} (θ + 1) - \tilde{κ} (θ) = 0

, thanks to Theorem A2 we can conclude that

\tilde{S}

is a

P^{θ}

-local martingale. In fact, using (A11) we have

\begin{matrix} \tilde{κ} {(θ + 1)}_{t} - \tilde{κ} {(θ)}_{t} \\ = b_{t}^{G^{'}} + \frac{1}{2} σ_{t -}^{2} σ^{2} + θ_{t} σ_{t -}^{2} σ^{2} + \int_{R} (e^{(θ_{t} + 1) x} - e^{θ_{t} x} - h (x)) F_{t}^{G^{'}} (d x) \\ = (θ_{t} + \frac{1}{2}) σ_{t -}^{2} σ^{2} - r_{t} + σ_{t -} γ_{h} + \int_{R} (e^{(θ_{t} + 1) σ_{t -} y} - e^{θ_{t} σ_{t -} y} - σ_{t -} h (y)) ν (d y) \\ = 0, t \geq 0, \end{matrix}

where the last equality holds by (3). □

The next lemma provides us with the candidate solutions to Equation (3) when L follows an NIG distribution (see Appendix A).

Lemma 1.

Let $L = {\{L_{t}\}}_{t}$ be an NIG-distributed Lévy process with parameters $(α, β, μ, δ)$ and define the truncation function

h (x) : = x 1_{D} (x), x \in R .

If a process

θ = {\{θ_{t}\}}_{t}

such that

(σ_{t -} θ_{t}) (ω), σ_{t -} (θ_{t} + 1) (ω) \in [- α - β, α - β], t \in R_{0}^{+}, ω \in Ω

(5)

fulfills Equation (3), then for every

t \geq 0

we have

\begin{matrix} θ_{t}^{1, 2} = \frac{1}{2 (R_{t}^{2} + δ^{2} σ_{t -}^{2}) σ_{t -}} ( & - δ^{2} σ_{t -}^{3} - 2 β δ^{2} σ_{t -}^{2} - R_{t}^{2} (σ_{t -} + 2 β) \\ \pm \sqrt{4 α^{2} δ^{2} R_{t}^{2} σ_{t -}^{2} + 4 R_{t}^{4} α^{2} - R_{t}^{2} δ^{2} σ_{t -}^{4} - \frac{R_{t}^{6}}{δ^{2}} - 2 σ_{t -}^{2} R_{t}^{4}}), \end{matrix}

(6)

with

R_{t} : = - r_{t} + μ σ_{t -}

.

Proof.

Since

L_{1} \sim N I G (α, β, μ, δ)

, we apply (A6) and (A9) (see Appendix A and Appendix B) to obtain

E [e^{z L_{1}}] = exp [μ z + γ_{1} z + \int_{R} (e^{z x} - 1 - z x 1_{D} (x)) ν (d x)], z \in [- α - β, α - β],

where

γ_{1} : = \frac{2 δ α}{π} \int_{0}^{1} sinh (β x) K_{1} (α x) d x

and

ν (d x) = \frac{δ α}{π |x|} e^{β x} K_{1} (α |x|) d x .

Using (A5), we get

δ (\sqrt{α^{2} - β^{2}} - \sqrt{α^{2} - {(β + z)}^{2}}) = γ_{1} z + \int_{R} (e^{z x} - 1 - z x 1_{D} (x)) ν (d x)

(7)

for

z \in [- α - β, α - β]

. Proceeding as in the proof of Theorem 1, we have

\begin{matrix} 0 & = \tilde{κ} {(θ + 1)}_{t} - \tilde{κ} {(θ)}_{t} \\ = R_{t} + σ_{t -} γ_{1} + \int_{R} [e^{(θ_{t} + 1) σ_{t -} x} - 1 - σ_{t -} (θ_{t} + 1) x 1_{D} (x)] ν (d x) \\ - \int_{R} (e^{θ_{t} σ_{t -} x} - 1 - σ_{t -} θ_{t} x 1_{D} (x)) ν (d x), t \geq 0 . \end{matrix}

(8)

Since

σ_{-} γ_{1} = σ_{-} [(θ + 1) - θ] γ_{1}

, we expand on the chain of equalities in (8):

\begin{matrix} 0 & = R_{t} + [σ_{t -} (θ_{t} + 1) γ_{1} + \int_{R} (e^{(θ_{t} + 1) σ_{t -} x} - 1 - σ_{t -} (θ_{t} + 1) x 1_{D} (x)) ν (d x)] \\ - [σ_{t -} θ_{t} γ_{1} + \int_{R} (e^{θ_{t} σ_{t -} x} - 1 - σ_{t -} θ_{t} x 1_{D} (x)) ν (d x)] \\ = R_{t} + δ (\sqrt{α^{2} - {(β + σ_{t -} θ_{t})}^{2}} - \sqrt{α^{2} - {(β + σ_{t -} (θ_{t} + 1))}^{2}}), t \geq 0, \end{matrix}

where the last equality is obtained by combining (5) and (7). At this point it remains to find the possible solutions for every

t \geq 0

to the equation

R_{t} + δ (\sqrt{α^{2} - {(β + σ_{t -} θ_{t})}^{2}} - \sqrt{α^{2} - {(β + σ_{t -} (θ_{t} + 1))}^{2}}) = 0,

which can be written as

\sqrt{α^{2} - {(β + σ_{t -} (θ_{t} + 1))}^{2}} = \frac{R_{t}}{δ} + \sqrt{α^{2} - {(β + σ_{t -} θ_{t})}^{2}} .

Squaring both sides we get

2 \frac{R_{t}}{δ} \sqrt{α^{2} - {(β + σ_{t -} θ_{t})}^{2}} = - (σ_{t -}^{2} + 2 β σ_{t -} + \frac{R_{t}^{2}}{δ^{2}}) - 2 σ_{t -}^{2} θ_{t},

(9)

and repeating once again the same operation of squaring we end up with an equation of second degree in

θ

:

4 σ_{t -}^{2} (σ_{t -}^{2} + \frac{R_{t}^{2}}{δ^{2}}) θ_{t}^{2} + 4 σ_{t -} (σ_{t -}^{3} + 2 β σ_{t -}^{2} + \frac{R_{t}^{2}}{δ^{2}} σ_{t -} + 2 β \frac{R_{t}^{2}}{δ^{2}}) θ_{t} + c_{t} = 0,

(10)

where

c_{t} : = {(σ_{t -}^{2} + 2 β σ_{t -} + \frac{R_{t}^{2}}{δ^{2}})}^{2} - 4 \frac{R_{t}^{2}}{δ^{2}} (α^{2} - β^{2})

. Computing algebraically the solutions of (10) we get the expressions in (6). □

We have empirically tested both possible branches of solutions in (6), and we have chosen

\begin{matrix} θ_{t} = \frac{1}{2 (R_{t}^{2} + δ^{2} σ_{t -}^{2}) σ_{t -}} ( & - δ^{2} σ_{t -}^{3} - 2 β δ^{2} σ_{t -}^{2} - R_{t}^{2} (σ_{t -} + 2 β) \\ - \sqrt{4 α^{2} δ^{2} R_{t}^{2} σ_{t -}^{2} + 4 R_{t}^{4} α^{2} - R_{t}^{2} δ^{2} σ_{t -}^{4} - \frac{R_{t}^{6}}{δ^{2}} - 2 σ_{t -}^{2} R_{t}^{4}}) \end{matrix}

(11)

to run the simulation because it prevents the risk-neutral dynamics from “exploding”, i.e., from skyrocketing to unrealistically high prices in a short time.
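The two branches in (6) can be compared numerically at a single time point. The sketch below, in Python, evaluates both candidates and plugs them into the unsquared equation $R_{t} + δ (\sqrt{α^{2} - {(β + σ_{t -} θ_{t})}^{2}} - \sqrt{α^{2} - {(β + σ_{t -} (θ_{t} + 1))}^{2}}) = 0$ from the proof of Lemma 1. The function names and the parameter values are illustrative assumptions, not estimates from the data used in this paper.

```python
import math


def esscher_theta_candidates(sigma, R, alpha, beta, delta):
    """Return (theta_plus, theta_minus), the two branches of formula (6)."""
    denom = 2.0 * (R ** 2 + delta ** 2 * sigma ** 2) * sigma
    lead = (-delta ** 2 * sigma ** 3
            - 2.0 * beta * delta ** 2 * sigma ** 2
            - R ** 2 * (sigma + 2.0 * beta))
    radicand = (4.0 * alpha ** 2 * delta ** 2 * R ** 2 * sigma ** 2
                + 4.0 * R ** 4 * alpha ** 2
                - R ** 2 * delta ** 2 * sigma ** 4
                - R ** 6 / delta ** 2
                - 2.0 * sigma ** 2 * R ** 4)
    root = math.sqrt(radicand)  # raises ValueError if the radicand is negative
    return (lead + root) / denom, (lead - root) / denom


def martingale_residual(theta, sigma, R, alpha, beta, delta):
    """Left-hand side of the unsquared equation; zero for a true solution."""
    first = math.sqrt(alpha ** 2 - (beta + sigma * theta) ** 2)
    second = math.sqrt(alpha ** 2 - (beta + sigma * (theta + 1.0)) ** 2)
    return R + delta * (first - second)


# Made-up inputs: sigma_{t-} = 0.3, R_t = 0.01, NIG parameters (20, -2, ., 0.5).
theta_plus, theta_minus = esscher_theta_candidates(0.3, 0.01, 20.0, -2.0, 0.5)
res_plus = martingale_residual(theta_plus, 0.3, 0.01, 20.0, -2.0, 0.5)
res_minus = martingale_residual(theta_minus, 0.3, 0.01, 20.0, -2.0, 0.5)
```

For these illustrative inputs (with $R_{t} > 0$), the residual of the minus branch vanishes up to rounding, while that of the plus branch does not: the latter solves only the squared equation (10).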

Remark 2.

Unfortunately, we are not able to prove analytically that either candidate solution in (6) actually solves (3), i.e., that it is a true solution. The simulations suggest that only the process in (11) is indeed a solution of (3), and the numerical experiments support this conclusion. In fact, it turns out that both processes $θ^{1, 2}$ satisfy Condition (5). However, $θ^{1}$ makes the right-hand side in (9) negative, which implies that it does not solve (3). By contrast, the process $θ^{2}$, the one we pick in (11), makes this term positive, hence it does solve (3).

2.1.1. Simulation of the $P^{θ}$ -Dynamics

The goal of this section is to carefully describe a procedure to generate the risk-neutral dynamics of the underlying asset price. This allows us to compute the simulated European call option prices with the Monte Carlo method.

Let $r \in R^{+}$ represent the constant annual interest rate. We assume that the driving Lévy process L is NIG-distributed and that the hypotheses of Theorem 1 hold (with respect to the truncation function $\bar{h} (x) = x 1_{D} (x), x \in R$). Using the R-package yuima (see Iacus 2011) we estimate the parameters of the model from the time series of the underlying asset log-prices. Next, we simulate a sufficiently large number of paths of the variance process $σ^{2}$ with the estimated parameters: by (11), from each of them we get a trajectory of $θ$. The issue reduces to generating the paths of G, but note that, after the change of measure, L is no longer a Lévy process because $F_{t}^{θ} (d x) = e^{θ_{t} x} ν (d x), t \geq 0$, which varies with t. Consequently, G is not a COGARCH process under the martingale measure. Therefore, in order to simulate its trajectories we use the canonical representation for semimartingales (Shiryaev and Jacod (2003), Chapter II, Theorem 2.34): for a d-dimensional semimartingale $X = {\{X_{t}\}}_{t}$ with characteristics $(B, C, ν^{X})$ relative to the truncation function h, it holds:

X = X_{0} + X^{c} + B + h (x) 🟉 (μ^{X} - ν^{X}) + (x - h (x)) 🟉 μ^{X} .

(12)

The symbol 🟉 denotes the integration with respect to a random measure: we refer to (Shiryaev and Jacod (2003), Chapter II, Section 1) for an extensive study of this topic. For the reader’s convenience, we provide the basic definition.

Definition 1.

Let μ be a random measure on $R_{0}^{+} \times R$ and let $W : Ω \times R_{0}^{+} \times R \to R$ be a measurable function with respect to the product σ-algebra $O \otimes B (R)$, where $O$ denotes the optional σ-algebra on $Ω \times R_{0}^{+}$. The integral process $W 🟉 μ$ is defined by

W 🟉 μ_{t} (ω) : = \{\begin{matrix} \int_{[0, t] \times R} W (ω; s, x) μ (ω; d s, d x), & if \int_{[0, t] \times R} |W (ω; s, x)| μ (ω; d s, d x) < \infty \\ \infty, & otherwise \end{matrix} .

For an arbitrarily chosen

ϵ \leq 1

, fix the truncation function

h (x) : = x 1_{\{|z| < ϵ\}} (x)

for

x \in R

, and let

(0, ν, γ_{h})

be the corresponding generating triplet of L. From the proof of Theorem 1, specifically from (4), we readily get the characteristics

(b_{t}, 0, ν {(σ_{t -} \cdot)}^{- 1} (d x))

of G under P, and invoking (A12) and Remark A2 in Appendix D we find them under

P^{θ}

:

\{\begin{matrix} b_{t}^{θ} = b_{t} + \int_{R} h (x) (e^{θ_{t} x} - 1) F_{t} (d x) \\ c_{t}^{θ} = 0 \\ F_{t}^{θ} (d x) = e^{θ_{t} x} ν {(σ_{t -} \cdot)}^{- 1} (d x) \end{matrix}, t \geq 0 .

(13)

We now represent G by (12), noting that

G_{0} = 0

and

G^{c} = 0

since

c^{θ} = 0

. As regards the term

B = {\{B_{t}\}}_{t}

, we expand on the computations in (13), getting

\begin{matrix} b_{t}^{θ} = σ_{t -} γ_{h} + \int_{|x| < ϵ / σ_{t -}} σ_{t -} y \frac{δ α}{π |y|} e^{(θ_{t} σ_{t -} + β) y} K_{1} (α |y|) d y \\ - \int_{(- ϵ, ϵ)} σ_{t -} y \frac{δ α}{π |y|} e^{β y} K_{1} (α |y|) d y, t \geq 0, \end{matrix}

and obtain the values of the process B from

B_{t} = \int_{(0, t)} b_{s}^{θ} d s, t \geq 0 .

At this point we need to approximate a trajectory of the process

h (x) 🟉 (μ^{G} - ν^{G})

. In order to do so, we neglect the contribution of the jumps of G with absolute value smaller than

ϵ

, focusing just on the term

\begin{matrix} h (x) 🟉 (μ^{G} - ν^{G}) & \approx - h (x) 🟉 ν^{G} = - \int_{(0, t)} d s \int_{R} x 1_{\{|z| < ϵ\}} (x) F_{s}^{θ} (d x) \\ = - \int_{(0, t)} d s \int_{|x| < ϵ / σ_{s -}} σ_{s -} y \frac{δ α}{π |y|} e^{(θ_{s} σ_{s -} + β) y} K_{1} (α |y|) d y, t \geq 0, \end{matrix}

which in turn cancels out with one of the addends defining B. Heuristically speaking, in this approach we are assuming that the small jumps of the underlying asset price do not contribute significantly to the option value. We refer to Asmussen and Rosiński (2001) for a more sophisticated procedure that takes into account the variation of such small jumps in the case of Lévy processes.

Finally we concentrate on the term

(x - h (x)) 🟉 μ^{G}

: we simulate its paths similarly to the Lévy–Itô decomposition theorem (see Sato 1999, Theorem 19.2). According to this result, if

\tilde{L}

is a Lévy process with generating triplet

(A, ν, γ)

and

D_{a, \infty} : = \{x \in R^{d} : a < |x| < \infty\}

for

a > 0

, then

{\{{((x - x 1_{D} (x)) 🟉 μ^{\tilde{L}})}_{t}\}}_{t}

is a compound Poisson process with constant

ν (D_{1, \infty})

and distribution

ϕ (B) : = \{\begin{matrix} 0, & if B \subset D \\ \frac{ν (B \cap D_{1, \infty})}{ν (D_{1, \infty})}, & otherwise \end{matrix}, B \in B (R^{d}) .

Thus, we take a nonhomogeneous Poisson process with time-varying intensity

\begin{matrix} λ_{t} & : = F_{t}^{θ} (\{|x| > ϵ\}) = \int_{R} e^{θ_{t} σ_{t -} y} 1_{D_{ϵ, \infty}} (σ_{t -} y) \frac{δ α}{π |y|} e^{β y} K_{1} (α |y|) d y \\ = \int_{|x| > ϵ / σ_{t -}} \frac{δ α}{π |y|} e^{(θ_{t} σ_{t -} + β) y} K_{1} (α |y|) d y, t \geq 0 \end{matrix}

and let the time-varying jump sizes be given by

c_{t} : = \int_{R} x \frac{1_{D_{ϵ, \infty}} (x)}{λ_{t}} F_{t}^{θ} (d x) = \frac{1}{λ_{t}} \int_{|x| > ϵ / σ_{t -}} σ_{t -} y \frac{δ α}{π |y|} e^{(θ_{t} σ_{t -} + β) y} K_{1} (α |y|) d y, t \geq 0 .

The jump times of this process have been simulated by the thinning algorithm described in Lewis and Shedler (1979).
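The thinning idea can be sketched in a few lines of Python: candidate times are drawn from a homogeneous Poisson process whose rate dominates the target intensity, and each candidate is kept with probability equal to the ratio of the two. The sketch assumes a deterministic intensity function and a known upper bound `lam_max`; in the setting above, $λ_{t}$ depends on the simulated paths of $σ$ and $θ$, so both inputs here are hypothetical stand-ins.

```python
import math
import random


def thinning_jump_times(intensity, lam_max, horizon, rng):
    """Jump times on (0, horizon] of a Poisson process with rate intensity(t).

    Assumption: 0 <= intensity(t) <= lam_max for all t in [0, horizon].
    """
    times = []
    t = 0.0
    while True:
        # Candidate from a homogeneous Poisson process with rate lam_max.
        t += rng.expovariate(lam_max)
        if t > horizon:
            return times
        # Keep the candidate with probability intensity(t) / lam_max.
        if rng.random() * lam_max < intensity(t):
            times.append(t)


# Usage with an illustrative intensity bounded above by 3.
rng = random.Random(0)
jump_times = thinning_jump_times(lambda t: 2.0 + math.sin(t), 3.0, 10.0, rng)
```

A tighter bound `lam_max` rejects fewer candidates, which matters when the intensity has to be recomputed by numerical integration at every candidate time.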

Denote by N the number of iterations we run and by $S^{i}$, for $i = 1, \dots, N$, the corresponding simulated trajectories of the spot price under the pricing measure $P^{θ}$. We obtain the value of a European call option with strike K and maturity T by the Monte Carlo method, i.e., by computing the sample mean of the vector with components

e^{- r T} {(S^{i} (T) - K)}^{+}, i = 1, \dots, N .
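The Monte Carlo estimator above amounts to averaging discounted payoffs. A minimal Python sketch follows, assuming the terminal prices $S^{i} (T)$ have already been simulated under the pricing measure; the function name and the sample values are hypothetical.

```python
import math


def mc_call_price(terminal_prices, strike, r, maturity):
    """Sample mean of exp(-r T) * max(S_T - K, 0) over the simulated paths."""
    payoffs = [max(s - strike, 0.0) for s in terminal_prices]
    return math.exp(-r * maturity) * sum(payoffs) / len(payoffs)


# Two made-up terminal prices, strike 110, zero interest rate:
# the payoffs are 0 and 10, so the estimate is 5.
price = mc_call_price([100.0, 120.0], strike=110.0, r=0.0, maturity=1.0)
```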

2.1.2. Empirical Results

We have empirically tested the Esscher method on the prices of call options on Apple Inc. stock (ticker symbol: AAPL) with fixed strike at $320. In Figure 1a, the simulated option prices are shown as a function of their maturities. The average of the absolute values of the percentage difference is 3.1646%.

Figure 1. Numerical experiments for the Esscher method.

Furthermore, we applied this method to the prices of call options on Microsoft Corporation stock (ticker symbol: MSFT) with strike $190: we plot the outcomes in Figure 1b. In this case, the mean of the absolute values of the percentage difference is 5.0518%.

Finally, we report the results of a test on call options with strike $910 with Tesla, Inc. (ticker symbol: TSLA) as underlying stock. In Figure 1c we can notice a big discrepancy between the real price and the predicted value for the shortest maturity taken into account, the latter being 37.2755% smaller than the former. However, if we neglect this first term we recover a result in line with the previous experiments, namely an average of the absolute values of the percentage difference of 4.0555%.

In all three cases, we set the number of Monte Carlo iterations to $N = 10^{4}$ and used the time series of the log-prices for the trading year before 19 February 2020 to estimate the parameters of the COGARCH model (see Section 2.1.1). Therefore, the simulations were run prior to Apple’s stock split (on 28 August 2020) and Tesla’s stock split (on 31 August 2020).
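The error measure reported above can be computed as follows. This Python sketch assumes that the percentage difference is taken relative to the real (market) price, which is how we read the TSLA comparison; the function name and the prices in the usage line are made up for illustration and are not data from our experiments.

```python
def mean_abs_pct_diff(real_prices, model_prices):
    """Average of |model - real| / real over all maturities, in percent.

    Assumption: the real price is the reference in the denominator.
    """
    ratios = [abs(model - real) / real
              for real, model in zip(real_prices, model_prices)]
    return 100.0 * sum(ratios) / len(ratios)


# Made-up prices: both model prices are off by 10% of the real price.
error = mean_abs_pct_diff([10.0, 20.0], [9.0, 22.0])
```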

Remark 3.

The real data was obtained from https://old.nasdaq.com, where American options are traded. Although our approximation generates the prices in a European model, it can be accepted as meaningful since:

the analyzed derivatives have short-term maturities, so we are allowed to ignore the dividend yield (in any case, TSLA does not pay dividends);
the time values of the options were always positive, implying that it is more convenient to sell the option than to exercise the call right.

3. Calibration with Entropic Penalty Term Method

In the Esscher method, the density process is completely defined by the spot prices: this is an intuitive drawback in liquid markets, because it does not use all the available information (e.g., the time series of derivative prices). This is a motivation to consider the Calibration Method, which consists in modeling the spot prices dynamics directly under a martingale measure “chosen” by the market.

3.1. The Model

We start the practical analysis by taking into account a driving,

R

-valued Lévy process

L = {\{L_{t}\}}_{t}

with generating triplet

(σ^{2}, ν, γ)

on a probability space

(Ω, F, Q)

endowed with the augmented filtration of L, which satisfies the usual conditions and is denoted by

G

. We describe the stock prices

S = {\{S_{t}\}}_{t}

with an exponential Lévy model:

S_{t} : = S_{0} exp (r t + L_{t}), t \geq 0,

with

S_{0} \in R^{+}

and

r \in R^{+}

representing the constant annual interest rate. The discounting process

R = {\{R_{t}\}}_{t}

reduces to the deterministic function

R_{t} : = exp (- r t), t \geq 0 .

Since we need Q to be a martingale measure for S we require:

(i): there exists a $t > 0$ such that $E^{Q} [exp (L_{t})] < \infty$ ;
(ii): $Ψ (1) = \frac{1}{2} σ^{2} + γ + \int_{R} (e^{x} - 1 - x 1_{D} (x)) ν (d x) = 0$ .

With these assumptions, Remark A1 in Appendix B proves that

{\{exp (L_{t})\}}_{t}

is a martingale with

E^{Q} [exp (L_{t})] = 1

for every

t > 0

. Considering the discounted spot prices process

\tilde{S} = {\{{\tilde{S}}_{t}\}}_{t}

, defined by

{\tilde{S}}_{t} : = R_{t} S_{t} = S_{0} exp (L_{t}), t \geq 0

, we can state that it is a Q-martingale with constant expectation equal to

S_{0}

.

In our discussion the driving Lévy process L will be a pure jump process, so that $σ^{2} = 0$ in its generating triplet. In particular, assumption (ii) yields the relation:

γ = - \int_{R} (e^{x} - 1 - x 1_{D} (x)) ν (d x) .

(14)
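For a Lévy measure concentrated on finitely many points, relation (14) reduces to a finite sum. The Python sketch below computes the drift and checks that $Ψ (1) = 0$ then holds. It assumes that D is the closed unit ball $\{|x| \leq 1\}$ and that the measure is given by hypothetical lists of atoms and masses; neither the function names nor the numbers come from our implementation.

```python
import math


def compensated_exponent(points, masses):
    """Sum of (e^y - 1 - y * 1_D(y)) * nu({y}) over the atoms y.

    Assumption: D = {|y| <= 1}.
    """
    return sum(m * (math.exp(y) - 1.0 - (y if abs(y) <= 1.0 else 0.0))
               for y, m in zip(points, masses))


def martingale_drift(points, masses):
    """Drift gamma from relation (14) for a pure jump process."""
    return -compensated_exponent(points, masses)


def psi_one(points, masses, gamma):
    """Psi(1) with sigma^2 = 0; it must vanish under the martingale measure."""
    return gamma + compensated_exponent(points, masses)


# Made-up discrete Levy measure with three atoms.
atoms = [-1.5, -0.2, 0.4]
weights = [0.3, 2.0, 1.1]
gamma = martingale_drift(atoms, weights)
```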

Assume that there are N European call options with fixed maturity T available in the market, and for each

j = 1, \dots, N

let

K_{j}

and

C^{j}

be the strike price and the observed price of the j-th derivative, respectively. It is convenient to consider the quantities

k_{j} = log (K_{j}), j = 1, \dots, N

. In this setting, S is the price process of the underlying asset. Since Q is a martingale measure for S, we can use it to price options:

C_{ν}^{j} : = exp (- r T) E^{Q} [{(S_{T} - exp (k_{j}))}^{+}], j = 1, \dots, N

. Denoting by

Q_{L_{T}}

the pushforward distribution on

R

generated by

L_{T}

we have

C_{ν}^{j} = exp (- r T) \int_{R} {(S_{0} e^{r T + y} - e^{k_{j}})}^{+} Q_{L_{T}} (d y), j = 1, \dots, N .

Thus, knowing the Lévy measure

ν

, the whole generating triplet could be recovered and therefore the option prices computed.

The idea behind the calibration is to choose $ν$ such that $C_{ν}^{j}$ is “close” to $C^{j}$ for every $j = 1, \dots, N$. Hence we pick the Lévy measure of the model by a least-squares procedure:

\bar{ν} = arg min_{ν} \{\sum_{j = 1}^{N} {|C_{ν}^{j} - C^{j}|}^{2}\} .

(15)

Since the problem expressed in (15) is ill-posed (see Cont and Tankov 2004a, Chapter 13), we introduce a regularization term. Let $L^{0} = {\{L_{t}^{0}\}}_{t}$ be a pure jump driving Lévy process with generating triplet $(0, ν^{0}, γ_{0})$ statistically estimated from the time series of the underlying asset price. Furthermore, let $Q^{L^{0}}$ and $Q^{L}$ be the distributions on the Skorohod space $(D, F_{D}; F)$ generated by $L^{0}$ and L, respectively, which are supposed to be equivalent on $F_{t}$ for every $t > 0$ (see Appendix E). We use the relative entropy as a measure of the distance from $Q^{L^{0}}$, so that by Remark A3 the issue is reduced to solving the optimization problem:

\bar{ν} = arg min_{ν \in Q} \{\sum_{j = 1}^{N} {|C_{ν}^{j} - C^{j}|}^{2} + α \int_{R} (\frac{d ν}{d ν^{0}} log \frac{d ν}{d ν^{0}} + 1 - \frac{d ν}{d ν^{0}}) d ν^{0}\},

(16)

for

Q : = \{ν : Q^{L} |_{F_{t}} \sim {Q^{L^{0}}|}_{F_{t}}, t > 0\}

. The term

α

is a regularization parameter: the higher it is, the more we trust the initial distribution and the less importance we give to calibration. The existence of a solution to (16) has been studied, among others, in the paper Cont and Tankov (2004b). We are going to relax the assumptions in (16), considering the minimization in the set of the Lévy measures equivalent to

ν^{0}

.

3.1.1. Numerical Approximation

The goal of this section is to describe in detail a discretization procedure that allows us to tackle the optimization problem in (16). To this end, it is necessary to express the objective functional in (16) as a function of the masses of the discretized Lévy measures. The steps of our argument are the following:

I.: estimate the parameters of the prior Lévy process $L^{0}$ from the time series of log–returns;
II.: introduce a discretization grid for the prior Lévy measure $ν^{0}$ and the driving Lévy measure $ν$ (in what follows, the points of such a grid are denoted by $y_{1} < \dots < y_{N_{d}}$ ). Their discretized versions are denoted by $ν_{d}^{0}$ and $ν_{d}$ , respectively;
III.: compute the discrete version of the entropy term in (16) as a function of the masses of $ν_{d}$ ;
IV.: use an approach based on the Fourier inversion theorem to get an approximation of the modified time values of the options at the log–strikes under scrutiny. This allows us to obtain (23), a discretized version of the objective functional in (16);
V.: calculate explicitly the derivatives of the discretized objective functional to speed up the simulations;
VI.: choose the regularization parameter $α$ .

Fix a maturity T and define the grid $x_{j} \in R^{+}$, $j = 1, \dots, N$, with $x_{1} < x_{2} < \dots < x_{N}$: these points represent the log–strike prices of the options available on the market. Moreover, from the time series of the log–returns we obtain the parameters of the historical, NIG-distributed Lévy process $L^{0} = {\{L_{t}^{0}\}}_{t}$ by a maximum likelihood procedure using the R-package ghyp. Note that, in this case, one can select any pure jump, infinitely divisible distribution: we opt for the NIG by analogy with the Esscher method, and our choice is satisfactory, as the final goodness of fit between real data and model prices shows. First we introduce a discretization grid consisting of the points $y_{h}, h = 1, \dots, N_{d}$, with $y_{1} < y_{2} < \dots < y_{N_{d}}$, which constitutes a partition of the interval $[- M, M]$ for some $M > 0$, and then we approximate the Lévy measure of $L^{0}$ with the discrete version $ν_{d}^{0} (d y) = \sum_{h = 1}^{N_{d}} ν_{h}^{0} δ_{(y_{h})} (d y)$, where $δ_{(\bar{a})}$ denotes the Dirac measure at a point $\bar{a}$ and

ν_{1}^{0} = \int_{(- \infty, y_{1}]} d ν^{0}; ν_{h}^{0} = \int_{(y_{h - 1}, y_{h}]} d ν^{0}, h = 2, \dots, N_{d} - 1; ν_{N_{d}}^{0} = \int_{(y_{N_{d} - 1}, \infty)} d ν^{0} .

Now we take into account another measure which has the same mass points as the previous one:

ν_{d} (d y) = \sum_{h = 1}^{N_{d}} ν_{h} δ_{(y_{h})} (d y),

with

ν_{h} \in R^{+}

for every

h = 1, \dots, N_{d}

. We note that the calibrated measures

ν_{d}

and

ν_{d}^{0}

are equivalent, but

ν_{d}

and the prior

ν^{0}

are not. We understand the measure

ν_{d}

as the discretized Lévy measure of the driving, pure jump Lévy process

L = {\{L_{t}\}}_{t}

. We now want to compute the Radon–Nikodym derivative

\frac{d ν_{d}}{d ν_{d}^{0}}

with the aim to explicitly express the entropy term of (16) in the discrete version

\int_{R} (\frac{d ν_{d}}{d ν_{d}^{0}} log \frac{d ν_{d}}{d ν_{d}^{0}} + 1 - \frac{d ν_{d}}{d ν_{d}^{0}}) d ν_{d}^{0} .

(17)

Set

y_{0} = - \infty

and define the function

f (y) : = \{\begin{matrix} \frac{ν_{h}}{ν_{h}^{0}}, & if y_{h - 1} < y \leq y_{h}, h = 1, \dots, N_{d} \\ 0, & otherwise \end{matrix} .

For every

A \in B (R)

we have

\begin{matrix} \int_{R} f 1_{A} d ν_{d}^{0} & = \sum_{h = 1}^{N_{d}} \frac{ν_{h}}{ν_{h}^{0}} \int_{(y_{h - 1}, y_{h}]} 1_{A} (y) ν_{d}^{0} (d y) = \sum_{h = 1}^{N_{d}} \frac{ν_{h}}{ν_{h}^{0}} ν_{d}^{0} ((y_{h - 1}, y_{h}] \cap A) \\ = \sum_{h = 1}^{N_{d}} \frac{ν_{h}}{ν_{h}^{0}} ν_{h}^{0} 1_{A} (y_{h}) = ν_{d} (A), \end{matrix}

as

ν_{d}^{0} ((y_{h - 1}, y_{h}] \cap A) = ν_{h}^{0} 1_{A} (y_{h}), h = 1, \dots, N_{d}

. Therefore,

\frac{d ν_{d}}{d ν_{d}^{0}} = f

. Moving back to the integral (17), we get:

\begin{matrix} \int_{R} (\frac{d ν_{d}}{d ν_{d}^{0}} log \frac{d ν_{d}}{d ν_{d}^{0}} + 1 - \frac{d ν_{d}}{d ν_{d}^{0}}) d ν_{d}^{0} = \int_{(- \infty, y_{N_{d}}]} (\frac{d ν_{d}}{d ν_{d}^{0}} log \frac{d ν_{d}}{d ν_{d}^{0}} + 1 - \frac{d ν_{d}}{d ν_{d}^{0}}) d ν_{d}^{0} \\ = \sum_{h = 1}^{N_{d}} [\frac{ν_{h}}{ν_{h}^{0}} log \frac{ν_{h}}{ν_{h}^{0}} + 1 - \frac{ν_{h}}{ν_{h}^{0}}] ν_{d}^{0} ((y_{h - 1}, y_{h}]) = \sum_{h = 1}^{N_{d}} [ν_{h} (log ν_{h} - log ν_{h}^{0}) + ν_{h}^{0} - ν_{h}] . \end{matrix}
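The last identity reduces the entropy term to a finite sum over the masses. A minimal sketch of that sum is given here; the helper name `discrete_entropy` and the sample masses are illustrative assumptions, not values from our calibration.

```python
import math

def discrete_entropy(nu, nu0):
    """Sum over h of nu_h*(log nu_h - log nu0_h) + nu0_h - nu_h.

    Hypothetical helper: assumes strictly positive masses of equal length.
    """
    if len(nu) != len(nu0):
        raise ValueError("nu and nu0 must have the same length")
    return sum(v * (math.log(v) - math.log(v0)) + v0 - v
               for v, v0 in zip(nu, nu0))

# Illustrative masses only (not calibrated values).
nu0 = [0.2, 0.5, 0.3]
nu = [0.25, 0.4, 0.35]
print(discrete_entropy(nu0, nu0))  # the penalty vanishes at the prior
print(discrete_entropy(nu, nu0))   # and is positive away from it
```

Each summand is nonnegative and vanishes only when the two masses coincide, which is why the term acts as a penalty for moving away from the prior.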

Following the Carr and Madan approach (see Carr and Madan 1999, Section 3.2, for further details), we are able to express the quantity

\sum_{j = 1}^{N} {|C_{ν}^{j} - C^{j}|}^{2}

as a function of

ν_{1}, \dots, ν_{N_{d}}

. The option price

C_{ν} (k)

is not an integrable function, as a swift application of Lebesgue’s convergence theorem shows that it converges to

S_{0}

as

k \to - \infty

. Therefore, we focus on the modified time value

z_{T}

, defined by

z_{T} (k) : = C_{ν} (k) - {(S_{0} - e^{k - r T})}^{+}, k \in R .

We suppose that

z_{T}

and its inverse Fourier transform

ζ_{T}

are integrable, so by inversion (see, e.g., Rudin 1987, Theorem 9.11) we have

z_{T} (k) = \frac{1}{\sqrt{2 π}} \int_{R} e^{- i k u} ζ_{T} (u) d u a . e .

(18)

Fix

k \in R

; we first analyze the term

\begin{matrix} {(S_{0} - e^{k - r T})}^{+} & = (S_{0} - e^{k - r T}) 1_{\{z \leq log S_{0} + r T\}} (k) \\ = e^{- r T} \int_{R} (e^{log S_{0} + r T} - e^{k}) 1_{\{z \leq log S_{0} + r T\}} (k) Q_{L_{T}} (d y) . \end{matrix}

In a similar way we get

C_{ν} (k) = e^{- r T} \int_{R} (e^{y + log S_{0} + r T} - e^{k}) 1_{\{z \leq y + log S_{0} + r T\}} (k) Q_{L_{T}} (d y) .

This provides the following expression for

z_{T} (k)

:

\begin{matrix} z_{T} (k) = e^{- r T} \int_{R} [(S_{0} e^{r T + y} - e^{k}) 1_{\{z \geq k - log S_{0} - r T\}} (y) - (S_{0} e^{r T} - e^{k}) 1_{\{z \leq log S_{0} + r T\}} (k)] Q_{L_{T}} (d y) . \end{matrix}

The assumption (ii) ensures that

\int_{R} e^{y} Q_{L_{T}} (d y) = 1,

so we finally obtain

\begin{matrix} z_{T} (k) = e^{- r T} \int_{R} [(S_{0} e^{r T + y} - e^{k}) (1_{\{z \geq k - log S_{0} - r T\}} (y) - 1_{\{z \leq log S_{0} + r T\}} (k))] Q_{L_{T}} (d y), k \in R . \end{matrix}

The insight here is to use the Fourier transform and its inverse to estimate the values

z_{T} (x_{j})

for every

j = 1, \dots, N

. Hence we fix a point

u \in R

and introduce the inverse Fourier transform

ζ_{T} (u) : = \frac{1}{\sqrt{2 π}} \int_{R} e^{i u k} z_{T} (k) d k .

Under suitable conditions which allow us to switch the order of integration we have

\begin{matrix} ζ_{T} (u) & = - \frac{e^{- r T}}{\sqrt{2 π}} \int_{(- \infty, 0]} [\int_{(y + log S_{0} + r T, log S_{0} + r T)} e^{i u k} (S_{0} e^{r T + y} - e^{k}) d k] Q_{L_{T}} (d y) \\ + \frac{e^{- r T}}{\sqrt{2 π}} \int_{(0, \infty)} [\int_{(log S_{0} + r T, y + log S_{0} + r T)} e^{i u k} (S_{0} e^{r T + y} - e^{k}) d k] Q_{L_{T}} (d y) . \end{matrix}

(19)

We turn our attention to the computation of the first term of the sum (19). An explicit calculation of the inner integral (with respect to the Lebesgue measure) of this addend, denoted by

I_{1}

, gives

\begin{matrix} I_{1} (y) & = S_{0} \frac{e^{r T + y + i u (log S_{0} + r T)}}{i u (i u + 1)} [i u (1 - e^{- y}) + 1 - e^{i u y}] \\ = S_{0} \frac{e^{r T + i u (log S_{0} + r T)}}{i u + 1} (e^{y} - 1) + S_{0} \frac{e^{r T + i u (log S_{0} + r T)}}{i u (i u + 1)} e^{y} - S_{0} \frac{e^{i u log S_{0}} e^{(i u + 1) r T}}{i u (i u + 1)} e^{y + i u y}, y \in (- \infty, 0] . \end{matrix}

The same strategy allows us to compute the inner integral

I_{2}

of the second addend in (19), as well. Precisely, for every

y \in (0, \infty)

we get

I_{2} (y) = - S_{0} \frac{e^{r T + i u (log S_{0} + r T)}}{i u + 1} (e^{y} - 1) - S_{0} \frac{e^{r T + i u (log S_{0} + r T)}}{i u (i u + 1)} e^{y} + S_{0} \frac{e^{i u log S_{0}} e^{(i u + 1) r T}}{i u (i u + 1)} e^{y + i u y} .

Therefore we are able to represent the inverse Fourier transform as

ζ_{T} (u) = \frac{S_{0}}{\sqrt{2 π}} \frac{e^{i u (log S_{0} + r T)}}{i u + 1} [\int_{R} (1 - e^{y}) Q_{L_{T}} (d y) - \frac{1}{i u} \int_{R} e^{y} Q_{L_{T}} (d y) + \frac{1}{i u} \int_{R} e^{y + i u y} Q_{L_{T}} (d y)] .

Combining (A9) in Appendix B with assumption (i), we conclude that

ζ_{T} (u) = \frac{S_{0}}{\sqrt{2 π}} \frac{e^{i u (log S_{0} + r T)}}{i u (i u + 1)} [e^{T Ψ (i u + 1)} - 1], u \in R .

(20)

By the definition of

Ψ

in (A8) (see Appendix B), substituting the discrete version

ν_{d}

for the Lévy measure

ν

associated to L, as a consequence of (14) we get, for every

u \in R

,

Ψ (i u + 1) ≃ \int_{R} (e^{(i u + 1) y} - i u e^{y} - e^{y} + i u) ν_{d} (d y) = \sum_{h = 1}^{N_{d}} e^{y_{h}} (e^{i u y_{h}} - 1) ν_{h} + i u \sum_{h = 1}^{N_{d}} (1 - e^{y_{h}}) ν_{h} .

(21)

Plugging (21) into (20) we obtain, for

u \in R

, the final approximation

ζ_{T} (u) ≃ \frac{S_{0}}{\sqrt{2 π}} \frac{e^{i u (log S_{0} + r T)}}{i u (i u + 1)} [exp (T \sum_{h = 1}^{N_{d}} e^{y_{h}} (e^{i u y_{h}} - 1) ν_{h} + i u T \sum_{h = 1}^{N_{d}} (1 - e^{y_{h}}) ν_{h}) - 1] .

(22)
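To make (22) concrete, the following sketch evaluates the approximation of the transform at a single point from the grid points and the masses. The function name `zeta_T`, its argument order and the numerical inputs are our own illustrative assumptions; the point u = 0 is excluded because the prefactor in (22) has i u in the denominator.

```python
import cmath
import math

def zeta_T(u, y, nu, S0, r, T):
    """Approximation (22) of zeta_T(u) for u != 0 (hypothetical helper).

    y  : grid points y_1 < ... < y_{N_d}
    nu : masses nu_1, ..., nu_{N_d} of the discretized Levy measure
    """
    if u == 0:
        raise ValueError("u must be nonzero: (22) has i*u in the denominator")
    iu = 1j * u
    # g is the argument of the exponential in (22).
    g = T * sum(math.exp(yh) * (cmath.exp(iu * yh) - 1.0) * vh
                for yh, vh in zip(y, nu))
    g += iu * T * sum((1.0 - math.exp(yh)) * vh for yh, vh in zip(y, nu))
    prefactor = (S0 / math.sqrt(2.0 * math.pi)
                 * cmath.exp(iu * (math.log(S0) + r * T)) / (iu * (iu + 1.0)))
    return prefactor * (cmath.exp(g) - 1.0)

# Illustrative inputs only.
y_grid = [-0.1, 0.1]
masses = [1.0, 1.0]
print(zeta_T(1.3, y_grid, masses, 100.0, 0.01, 1.0))
```

Two quick sanity checks follow from the formula itself: the value is zero when all masses vanish, and evaluating at -u gives the complex conjugate of the value at u.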

With the aim to approximate

z_{T}

at the points

x_{j}

, for

j = 1, \dots, N

, we construct a new, uniform grid with mesh d containing the log-strike one. Specifically, for a fixed

\tilde{N} \in N

, we set

{\tilde{x}}_{n} : = \frac{2 π n}{\tilde{N} Δ}, n = - \tilde{N}, \dots, - 1, 0, 1, \dots, \tilde{N},

for

Δ : = \frac{A}{\tilde{N}}

, with

A : = \frac{2 π}{d}

, which is the size of the discretization interval. We carry out our construction so that for every

j = 1, \dots, N

and

h = 1, \dots, N_{d}

there exists a

n_{h j} \in \{- \tilde{N} + 1, \dots, \tilde{N} - 1\}

such that

x_{j} - y_{h} = {\tilde{x}}_{n_{h j}}

. We eventually introduce the points of the discretization grid as

u_{k} : = - \frac{A}{2} + k Δ, k = 0, \dots, \tilde{N} .

We then compute

\begin{matrix} z_{T} ({\tilde{x}}_{n}) & ≃ \frac{1}{\sqrt{2 π}} \int_{(- A / 2, A / 2)} e^{- i u {\tilde{x}}_{n}} ζ_{T} (u) d u ≃ \frac{1}{\sqrt{2 π}} \frac{A}{\tilde{N}} \sum_{k = 0}^{\tilde{N} - 1} e^{- i u_{k} {\tilde{x}}_{n}} {\tilde{w}}_{k} ζ_{T} (u_{k}) \\ = \frac{1}{\sqrt{2 π}} \frac{A}{\tilde{N}} e^{i \frac{A}{2} {\tilde{x}}_{n}} \sum_{k = 0}^{\tilde{N} - 1} exp (- i \frac{2 π n}{\tilde{N}} k) {\tilde{w}}_{k} ζ_{T} (u_{k}), n = 0, \dots, \tilde{N} - 1, \end{matrix}

where

{\tilde{w}}_{k}

are chosen by the trapezoidal rule, i.e.,

{\tilde{w}}_{k} : = \{\begin{matrix} \frac{1}{2}, & if k = 0, \tilde{N} - 1 \\ 1, & if k = 1, \dots, \tilde{N} - 2 \end{matrix} .
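The factorization used in the last display can be checked numerically. The sketch below evaluates the quadrature sum twice, once with the nodes u_k written out and once in the factored form whose inner sum has the structure of a discrete Fourier transform. It uses a direct O(Ñ²) loop for transparency; in practice a library FFT routine would replace the inner sum. The function names and the input values are illustrative assumptions.

```python
import cmath
import math

def trapezoid_weights(n_points):
    """Weights 1/2 at both ends and 1 elsewhere."""
    return [0.5 if k in (0, n_points - 1) else 1.0 for k in range(n_points)]

def z_factored(zeta_vals, d):
    """Phase e^{i A x_n / 2} times a DFT-type sum, for n = 0, ..., N-1."""
    N = len(zeta_vals)
    A = 2.0 * math.pi / d
    w = trapezoid_weights(N)
    out = []
    for n in range(N):
        x_n = n * d  # equals 2*pi*n/(N*Delta), since N*Delta = A = 2*pi/d
        dft = sum(cmath.exp(-2j * math.pi * n * k / N) * w[k] * zeta_vals[k]
                  for k in range(N))
        out.append(cmath.exp(0.5j * A * x_n) * dft
                   * (A / N) / math.sqrt(2.0 * math.pi))
    return out

def z_direct(zeta_vals, d):
    """Same quadrature with the nodes u_k = -A/2 + k*Delta written out."""
    N = len(zeta_vals)
    A = 2.0 * math.pi / d
    delta_u = A / N
    w = trapezoid_weights(N)
    out = []
    for n in range(N):
        x_n = n * d
        s = sum(cmath.exp(-1j * (-A / 2.0 + k * delta_u) * x_n)
                * w[k] * zeta_vals[k] for k in range(N))
        out.append(s * delta_u / math.sqrt(2.0 * math.pi))
    return out

# Arbitrary stand-in values for zeta_T(u_k); not model output.
sample = [complex(k + 1, -0.5 * k) for k in range(8)]
gap = max(abs(a - b) for a, b in zip(z_factored(sample, 0.1),
                                     z_direct(sample, 0.1)))
print(gap)  # difference between the two forms, at rounding level
```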

Therefore, knowing

ζ_{T} (u_{k})

for

k = 0, \dots, \tilde{N} - 1

, we can use a fast Fourier transform (FFT) to estimate the values

z_{T} ({\tilde{x}}_{n})

, for

n = 0, \dots, \tilde{N} - 1

, and, due to the symmetry of the grid, an inverse discrete Fourier transform to get

z_{T} ({\tilde{x}}_{n})

for

n = - \tilde{N} + 1, \dots, - 1

. Thus, we reduce the optimization problem (16) to the minimization in

{(R^{+})}^{N_{d}}

of the objective functional:

F (ν_{1}, \dots, ν_{N_{d}}) = \sum_{j = 1}^{N} {|z_{T} (x_{j}) + {(S_{0} - e^{x_{j} - r T})}^{+} - C^{j}|}^{2} + α \sum_{h = 1}^{N_{d}} [ν_{h} (log ν_{h} - log ν_{h}^{0}) + ν_{h}^{0} - ν_{h}] .

(23)

The optimization is carried out with the L-BFGS-B algorithm, so we compute the derivatives of

F

to speed it up. We focus on differentiating the first addend in (23), since the derivative of the entropic term is straightforward. Denote by

g (ν_{1}, \dots, ν_{N_{d}})

the argument of the exponential in (22) and define

C_{u} : = \frac{S_{0}}{\sqrt{2 π}} \frac{e^{i u (log S_{0} + r T)}}{i u (i u + 1)}, u \in R .

Under suitable integrability conditions, for every

u \in R

and

h = 1, \dots, N_{d}

we have

\begin{matrix} \frac{\partial ζ_{T} (u)}{\partial ν_{h}} & (ν_{1}, \dots, ν_{N_{d}}) = C_{u} T e^{g (ν_{1}, \dots, ν_{N_{d}})} [e^{y_{h}} (e^{i u y_{h}} - 1) + i u (1 - e^{y_{h}})] \\ = \frac{S_{0}}{\sqrt{2 π}} \frac{T}{i u + 1} e^{i u (log S_{0} + r T)} (1 - e^{y_{h}}) e^{g (ν_{1}, \dots, ν_{N_{d}})} + T e^{y_{h}} ζ_{T} (u) (e^{i u y_{h}} - 1) + T e^{y_{h}} C_{u} (e^{i u y_{h}} - 1) . \end{matrix}

Direct calculation from (18) leads, for every

k \in R

and

h = 1, \dots, N_{d}

, to

\begin{matrix} \frac{\partial z_{T} (k)}{\partial ν_{h}} (ν_{1}, \dots, ν_{N_{d}}) = \frac{1}{\sqrt{2 π}} \int_{R} e^{- i u k} \frac{\partial ζ_{T} (u)}{\partial ν_{h}} (ν_{1}, \dots, ν_{N_{d}}) d u \\ = \frac{S_{0}}{2 π} T (1 - e^{y_{h}}) \int_{R} \frac{e^{- i u k}}{i u + 1} e^{i u (log S_{0} + r T)} e^{g (ν_{1}, \dots, ν_{N_{d}})} d u \\ + T e^{y_{h}} [z_{T} (k - y_{h}) + {(S_{0} - e^{k - r T - y_{h}})}^{+} - z_{T} (k) - {(S_{0} - e^{k - r T})}^{+}] . \end{matrix}

(24)

Taking

k = {\tilde{x}}_{n}

, for

n = 0, \dots, \tilde{N} - 1

, we can approximate the first addend in (24) with an FFT:

\int_{R} \frac{e^{- i u {\tilde{x}}_{n}}}{i u + 1} e^{i u (r T + log S_{0})} e^{g (ν_{1}, \dots, ν_{N_{d}})} d u ≃ \frac{A}{\tilde{N}} \sum_{k = 0}^{\tilde{N} - 1} e^{- i u_{k} {\tilde{x}}_{n}} {\tilde{w}}_{k} f (u_{k}) = \frac{A}{\tilde{N}} e^{i \frac{A}{2} {\tilde{x}}_{n}} \sum_{k = 0}^{\tilde{N} - 1} exp (- i \frac{2 π n}{\tilde{N}} k) {\tilde{w}}_{k} f (u_{k}),

where

f (u) : = \frac{\sqrt{2 π}}{S_{0}} i u ζ_{T} (u) + \frac{e^{i u (log S_{0} + r T)}}{i u + 1}, u \in R .

Summarizing, we can numerically calculate the derivatives of

F

:

\frac{\partial F}{\partial ν_{h}} (ν_{1}, \dots, ν_{N_{d}}) = 2 \sum_{j = 1}^{N} [z_{T} (x_{j}) + {(S_{0} - e^{x_{j} - r T})}^{+} - C^{j}] \frac{\partial z_{T} (x_{j})}{\partial ν_{h}} (ν_{1}, \dots, ν_{N_{d}}) + α (log ν_{h} - log ν_{h}^{0})

for any

h = 1, \dots, N_{d}

.
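The structure of this gradient (twice the residuals times the sensitivities of the model prices, plus α times the log-ratio of the masses) can be verified against finite differences. The sketch below does so with a toy linear pricing map standing in for the model prices; this map, the matrix `a_toy` and all numerical values are assumptions made purely for illustration and have nothing to do with the Fourier-based prices of this section.

```python
import math

def objective(nu, nu0, alpha, model, target):
    """Squared pricing error plus alpha times the discrete entropy term."""
    res = [m - c for m, c in zip(model(nu), target)]
    entropy = sum(v * (math.log(v) - math.log(v0)) + v0 - v
                  for v, v0 in zip(nu, nu0))
    return sum(e * e for e in res) + alpha * entropy

def gradient(nu, nu0, alpha, model, jacobian, target):
    """dF/dnu_h = 2*sum_j res_j*J[j][h] + alpha*(log nu_h - log nu0_h)."""
    res = [m - c for m, c in zip(model(nu), target)]
    jac = jacobian(nu)  # jac[j][h] = d(model price j)/d(nu_h)
    return [2.0 * sum(res[j] * jac[j][h] for j in range(len(res)))
            + alpha * (math.log(nu[h]) - math.log(nu0[h]))
            for h in range(len(nu))]

# Toy linear stand-in for the model prices (hypothetical, illustration only).
a_toy = [[1.0, 0.5], [0.2, 2.0], [0.3, 0.3]]
toy_model = lambda nu: [sum(r * v for r, v in zip(row, nu)) for row in a_toy]
toy_jacobian = lambda nu: a_toy

nu, nu0, alpha, target = [0.6, 0.9], [0.5, 1.0], 0.3, [1.0, 1.5, 0.4]
analytic = gradient(nu, nu0, alpha, toy_model, toy_jacobian, target)

# Central finite differences as an independent check.
eps = 1e-6
numeric = []
for h in range(len(nu)):
    up = list(nu); up[h] += eps
    dn = list(nu); dn[h] -= eps
    numeric.append((objective(up, nu0, alpha, toy_model, target)
                    - objective(dn, nu0, alpha, toy_model, target)) / (2 * eps))
print(analytic, numeric)
```

Supplying an analytic gradient of this form is what lets a quasi-Newton method such as L-BFGS-B avoid one extra objective evaluation per coordinate at every iteration.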

The regularization parameter

α

remains to be determined; to this end, we follow a procedure loosely inspired by the Morozov discrepancy principle (see the classical reference Morozov (1966)). We start off by minimizing the quadratic pricing error (15), ignoring the entropy term and using the discretized Lévy measure of

L^{0}

as starting point. The value

ϵ_{0}

of the functional at the found minimum

\bar{ν}

is interpreted as the distance between the market and the selected model class. After that we consider the bid–ask spread of the options on the market and denote by

ϵ

its Euclidean norm. The absolute value of the difference between these two quantities is then allowed to be slightly greater, although it must maintain the same order of magnitude. This leads to the introduction of another term

\tilde{ϵ} : = c |ϵ_{0} - ϵ|

, where c is a positive constant to be chosen. At this point the functional in (16) is strictly increasing in

α

, so we choose the regularization parameter as follows:

α : = \tilde{ϵ} {(\int_{R} (\frac{d \bar{ν}}{d ν^{0}} log \frac{d \bar{ν}}{d ν^{0}} + 1 - \frac{d \bar{ν}}{d ν^{0}}) d ν^{0})}^{- 1} .

For our applications, taking

c \approx 2

has proven to be a satisfactory choice in terms of goodness of fit.

3.1.2. Empirical Results

The final step is to apply this method empirically to the prices of real-traded options. In particular, we tested it on derivatives written on the same stocks as those considered for the Esscher method. To estimate the parameters of the historical Lévy process, we used the same time series of log–prices as for the Esscher method (see Section 2.1.2). All the call options under scrutiny expire in January 2021, 11 months from the time of simulation.

In the case of call options on Apple Inc. stock (AAPL) we obtained an average of the absolute values of percentage difference of

3.5794 %

: Figure 2a displays the results of this implementation. For call options on Microsoft Corporation stock (MSFT), the average of the absolute values of percentage difference is

1.6425 %

and the results are shown in Figure 2b. Finally we calibrated our method to the prices of call options on Tesla, Inc. stock (TSLA): Figure 2c shows the outcomes. Here the mean of absolute values of percentage difference settles down at

0.9148 %

.
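For reference, the goodness-of-fit figure quoted for each stock is an average of absolute percentage differences. A minimal sketch of such a computation follows; we assume here that each difference is taken relative to the real price, and the function name and the sample prices are illustrative, not market data.

```python
def mean_abs_pct_diff(real_prices, model_prices):
    """Average of |model - real| / real, expressed in percent.

    Assumption: differences are normalized by the real (market) price.
    """
    if len(real_prices) != len(model_prices) or not real_prices:
        raise ValueError("need two non-empty price lists of equal length")
    total = sum(abs(m - p) / p for p, m in zip(real_prices, model_prices))
    return 100.0 * total / len(real_prices)

# Made-up prices for illustration only.
print(mean_abs_pct_diff([100.0, 200.0], [101.0, 198.0]))
```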

Figure 2. Numerical experiments for the calibration method.

This time we retrieved the real prices from https://finance.yahoo.com, but the same considerations as Remark 3 apply.

4. Conclusions

The main purpose of this research is to quantify the performances of two option pricing methods in a liquid market like NASDAQ. The first approach is based on the Esscher measure, the second one on the calibration to real-traded call option prices with an entropic penalty term.

Our experimental analysis shows that, from a computational point of view, the Esscher method with

10^{4}

Monte Carlo iterations is far faster than the calibration method: the table in Figure 3 reports the exact execution times that we have obtained with an Asus ZenBook UX31A. This fact is especially noticeable when we need to simulate the prices of options with different maturities and same strike. On the other hand, the calibration method allows generating the entire option chain for a fixed maturity with great precision. Moreover, it is more stable than the previous one, as it provides results closer to real data for a larger range of stock prices.

Figure 3. Computational times (in seconds) of both methods (with an Asus ZenBook UX31A).

Comparing the averages of the absolute values of the percentage difference between real and simulated prices, the calibration procedure offers better outcomes than the Esscher method, with the only exception of AAPL call options, where the latter outperforms the former by approximately 42 basis points. Nevertheless, the Esscher method only needs the historical time series of the underlying asset price to be applied, so it is also feasible in markets with few derivatives. This study shows that the Esscher measure is an appealing and efficient solution to price call options in illiquid, or low-liquidity, financial markets.

5. Future Research

As for future work on the subject, one direction worth investigating is the Linear Esscher measure method. This approach is important because of its intrinsic link to the minimal entropy Hellinger martingale measure, as indicated by Rheinländer and Sexton (2011, Chp. 9) and, in much more detail, by Choulli and Stricker (2006). Moreover, it would be interesting to extend our study to other asset classes like metal futures, and to compare it with the results obtained in Chen (2011).

Author Contributions

Conceptualization, A.B. and T.R.; methodology, T.R.; software, A.B.; validation, A.B., D.R. and T.R.; formal analysis, A.B., D.R. and T.R.; investigation, A.B., D.R. and T.R.; resources, A.B., D.R. and T.R.; data curation, A.B., D.R.; writing—original draft preparation, A.B., D.R.; writing—review and editing, A.B., D.R. and T.R.; visualization, A.B., D.R.; supervision, T.R.; project administration, T.R.; funding acquisition, T.R. All authors have read and agreed to the published version of the manuscript.

Funding

Open Access Funding by TU Wien. This research received no external funding.

Acknowledgments

We would like to thank Dan Chen for valuable comments. The authors acknowledge TU Wien Bibliothek for financial support through its Open Access Funding Program.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Normal Inverse Gaussian Distribution

In our work, we extensively use the

N I G

distribution, because it provides a good fit for the analyzed data. A probability measure

μ

on

R

is said to be

N I G

with parameters

(α, β, μ, δ)

, where

0 \leq |β| < α, μ \in R

and

δ > 0

, if it has the following density:

f_{N I G} (x; α, β, μ, δ) = \frac{α}{π} exp (δ \sqrt{α^{2} - β^{2}} + β (x - μ)) \frac{K_{1} (α δ \sqrt{1 + {(\frac{x - μ}{δ})}^{2}})}{\sqrt{1 + {(\frac{x - μ}{δ})}^{2}}},

(A1)

where

K_{1}

denotes the modified Bessel function of the third kind with index 1 and

x \in R

. For a definition of

K_{1}

we refer to Abramowitz and Stegun (1970). The next procedure is the classical construction of such a distribution.

A probability measure

μ

on

(R^{+}, B (R^{+}))

is said to be a generalized inverse Gaussian (GIG) distribution with parameters

ν \in R, δ > 0, γ > 0

if its density with respect to the Lebesgue measure is

f_{G I G} (x; ν, δ, γ) = {(\frac{γ}{δ})}^{ν} \frac{1}{2 K_{ν} (γ δ)} x^{ν - 1} exp [- \frac{1}{2} (δ^{2} x^{- 1} + γ^{2} x)], x > 0 .

(A2)

In order to show that

f_{G I G}

is a density function, we need the following representation for

K_{ν}

(Watson 1966, Formula (8), p. 182), the modified Bessel function of the third kind with index

ν

:

K_{ν} (x) = \frac{1}{2} \int_{0}^{\infty} y^{ν - 1} exp [- \frac{x}{2} (y + \frac{1}{y})] d y, x > 0 .

(A3)

The computation of the next integral allows us to find the normalization constant in (A2):

\begin{matrix} \int_{0}^{\infty} x^{ν - 1} e^{- \frac{1}{2} (δ^{2} x^{- 1} + γ^{2} x)} d x = \int_{0}^{\infty} x^{ν - 1} e^{- \frac{1}{2} γ δ (\frac{δ}{γ} x^{- 1} + \frac{γ}{δ} x)} d x \\ = {(\frac{δ}{γ})}^{ν} \int_{0}^{\infty} y^{ν - 1} e^{- \frac{1}{2} γ δ (y + \frac{1}{y})} d y = 2 {(\frac{δ}{γ})}^{ν} K_{ν} (γ δ), \end{matrix}

where in the second equality we made the substitution

y = \frac{γ}{δ} x

. Since

K_{ν} > 0

in

R^{+}

from (A3), we can conclude that

f_{G I G}

is actually a density function on

R^{+}

.

Figure A1.

K_{0}, K_{1}, K_{2}

are the modified Bessel functions of the first three, nonnegative, integer orders, respectively.

Let us fix two other parameters

α, β \in R

such that

0 \leq |β| < α

. For any

μ \in R, y \in R^{+}

we denote by

f_{N} (\cdot; μ, β, y)

the density of the probability measure on

(R, B (R))

generated by a random variable

X \sim N (μ + β y, y)

. It is possible to introduce a new distribution on

(R, B (R))

considering the following function:

\begin{matrix} f_{G H} (x; ν, α, β, μ, δ) : = \int_{0}^{\infty} f_{N} (x; μ, β, y) f_{G I G} (y; ν, δ, \sqrt{α^{2} - β^{2}}) d y \\ = \frac{1}{\sqrt{2 π}} {(\frac{\sqrt{α^{2} - β^{2}}}{δ})}^{ν} \frac{1}{2 K_{ν} (δ \sqrt{α^{2} - β^{2}})} \\ \int_{0}^{\infty} \frac{1}{\sqrt{y}} e^{- \frac{1}{2 y} {(x - μ - β y)}^{2}} y^{ν - 1} e^{- \frac{1}{2} (δ^{2} y^{- 1} + (α^{2} - β^{2}) y)} d y \\ = \frac{1}{\sqrt{2 π}} {(\frac{\sqrt{α^{2} - β^{2}}}{δ})}^{ν} \frac{1}{2 K_{ν} (δ \sqrt{α^{2} - β^{2}})} e^{β (x - μ)} \\ \int_{0}^{\infty} y^{ν - 1 - \frac{1}{2}} exp \{- \frac{1}{2} [\frac{1}{y} (δ^{2} + {(x - μ)}^{2}) + α^{2} y]\} d y \\ = \frac{1}{\sqrt{2 π}} {(\frac{\sqrt{α^{2} - β^{2}}}{δ})}^{ν} \frac{1}{2 K_{ν} (δ \sqrt{α^{2} - β^{2}})} e^{β (x - μ)} \\ {(\frac{\sqrt{δ^{2} + {(x - μ)}^{2}}}{α})}^{ν - \frac{1}{2}} \int_{0}^{\infty} z^{(ν - \frac{1}{2}) - 1} exp [- \frac{1}{2} (z^{- 1} + z) α \sqrt{δ^{2} + {(x - μ)}^{2}}] d z \\ = \frac{{(\sqrt{α^{2} - β^{2}})}^{ν}}{\sqrt{2 π} δ^{ν} α^{ν - \frac{1}{2}} K_{ν} (δ \sqrt{α^{2} - β^{2}})} e^{β (x - μ)} \frac{K_{ν - \frac{1}{2}} (α \sqrt{δ^{2} + {(x - μ)}^{2}})}{{(\sqrt{δ^{2} + {(x - μ)}^{2}})}^{\frac{1}{2} - ν}}, x \in R, \end{matrix}

(A4)

where in the last but one equality we made the substitution

z = \frac{α}{\sqrt{δ^{2} + {(x - μ)}^{2}}} y

and in the last we used (A3).

A straightforward application of Tonelli’s theorem shows that

f_{G H} (\cdot; ν, α, β, μ, δ)

is a density function: the corresponding probability measure on

(R, B (R))

is called Generalized Hyperbolic distribution.

We recover the

N I G

density (A1) taking

ν = - \frac{1}{2}

in (A4). In fact, for every

ν \in R

, we have

K_{- ν} = K_{ν}

and the representation (Abramowitz and Stegun (1970), Formula (

9.6 . 23

))

K_{ν} (x) = \frac{\sqrt{π} {(\frac{1}{2} x)}^{ν}}{Γ (ν + \frac{1}{2})} \int_{1}^{\infty} e^{- x t} {(t^{2} - 1)}^{ν - \frac{1}{2}} d t, x > 0,

which holds for every

ν > - \frac{1}{2}

, enables us to conclude

\begin{matrix} K_{- \frac{1}{2}} (x) = K_{\frac{1}{2}} (x) = - \sqrt{\frac{π x}{2}} \frac{1}{x} {[e^{- x t}]}_{1}^{\infty} = \sqrt{\frac{π}{2}} \frac{e^{- x}}{\sqrt{x}}, x > 0 . \end{matrix}

As a particular case of Generalized Hyperbolic distribution, the

N I G

probability measure is infinitely divisible (see, e.g., Barndorff-Nielsen and Halgreen 1977; Eberlein and Hammerstein 2004). The moment generating function for a random variable

X \sim N I G (α, β, μ, δ)

is given by

E [e^{z X}] = exp [μ z + δ (\sqrt{α^{2} - β^{2}} - \sqrt{α^{2} - {(β + z)}^{2}})], - α - β \leq z \leq α - β,

(A5)

and its generating triplet (see Appendix B for the definition of this concept) is given by

\{\begin{matrix} σ^{2} = 0 \\ ν (d x) = \frac{δ α}{π |x|} e^{β x} K_{1} (α |x|) d x \\ γ = μ + \frac{2 δ α}{π} \int_{0}^{1} sinh (β x) K_{1} (α x) d x \end{matrix} .

(A6)

For an explicit calculation, we refer to the paper Barndorff-Nielsen (1998).
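The formulas of this appendix can be cross-checked numerically. The sketch below evaluates the Bessel function through the integral over [0, ∞) of cosh(νt) exp(−x cosh t), which follows from (A3) with the substitution y = e^t, and then verifies that the density (A1) integrates to one and reproduces the moment generating function (A5). All function names, the quadrature settings and the parameter values are illustrative assumptions; the plain trapezoidal rule is chosen for simplicity, not efficiency.

```python
import math

def bessel_k(nu, x, h=0.05, t_max=12.0):
    """K_nu(x) as the integral of cosh(nu*t)*exp(-x*cosh(t)) over t >= 0,
    approximated by the trapezoidal rule (simple, unoptimized sketch)."""
    total = 0.5 * math.exp(-x)  # endpoint t = 0 carries weight 1/2
    for i in range(1, int(t_max / h) + 1):
        t = i * h
        total += math.cosh(nu * t) * math.exp(-x * math.cosh(t))
    return h * total

def nig_pdf(x, alpha, beta, mu, delta):
    """NIG density as written in (A1)."""
    s = math.sqrt(1.0 + ((x - mu) / delta) ** 2)
    gamma = math.sqrt(alpha ** 2 - beta ** 2)
    return (alpha / math.pi * math.exp(delta * gamma + beta * (x - mu))
            * bessel_k(1.0, alpha * delta * s) / s)

def nig_mgf(z, alpha, beta, mu, delta):
    """Moment generating function (A5), for -alpha-beta <= z <= alpha-beta."""
    return math.exp(mu * z + delta * (math.sqrt(alpha ** 2 - beta ** 2)
                                      - math.sqrt(alpha ** 2 - (beta + z) ** 2)))

def integrate_against_pdf(weight, params, lo=-20.0, hi=30.0, step=0.05):
    """Trapezoidal approximation of the integral of weight(x)*f_NIG(x)."""
    n = int(round((hi - lo) / step))
    total = 0.0
    for i in range(n + 1):
        x = lo + i * step
        w = 0.5 if i in (0, n) else 1.0
        total += w * weight(x) * nig_pdf(x, *params)
    return step * total

params = (2.0, 0.5, 0.0, 1.0)  # illustrative (alpha, beta, mu, delta)
mass = integrate_against_pdf(lambda x: 1.0, params)
mgf_numeric = integrate_against_pdf(lambda x: math.exp(0.5 * x), params)
print(mass, mgf_numeric, nig_mgf(0.5, *params))
```

The same `bessel_k` helper also reproduces the closed form of the Bessel function of index 1/2 derived above, which gives an independent check of the quadrature.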

Appendix B. Cumulant Function of Lévy Processes

Fix a probability space

(Ω, F, P)

. It is well known that, if

X = {\{X_{t}\}}_{t}

is a Lévy process, then

X_{t}

is infinitely divisible for every

t \in R_{0}^{+}

. On the other hand, for every infinitely divisible probability measure

μ

on

R^{d}

, there exists a Lévy process

X = {\{X_{t}\}}_{t}

, which is unique up to identity in law, such that

X_{1} \sim μ

.

The Lévy–Khintchine representation theorem states that, given an infinitely divisible distribution

μ

on

R^{d}

, its characteristic function

\hat{μ}

can be written as:

\hat{μ} (z) = exp [- \frac{1}{2} ⟨ z, A z ⟩ + i ⟨ γ, z ⟩ + \int_{R^{d}} (e^{i ⟨ z, x ⟩} - 1 - i ⟨ z, x ⟩ 1_{D} (x)) ν (d x)], z \in R^{d},

(A7)

where

D : = \{x \in R^{d} : |x| \leq 1\}

,

γ \in R^{d}

,

ν

is a measure on

R^{d}

satisfying

ν (\{0\}) = 0, \int_{R^{d}} ({|x|}^{2} \land 1) ν (d x) < \infty

and A is a symmetric, positive semidefinite

d \times d

matrix. The representation (A7) of

\hat{μ}

by

(A, ν, γ)

is unique and the triplet

(A, ν, γ)

is called generating triplet of the distribution

μ

. The generating triplet of a Lévy process

X = {\{X_{t}\}}_{t}

is the generating triplet of

X_{1}

.

Note that it is not necessary to take

1_{D}

to have integrability in (A7). In fact, let

h : R^{d} \to R^{d}

be a bounded function, with

h (x) = x

in a neighborhood of 0: we call it a truncation function. Since, for every

z \in R^{d}

, in a neighborhood of 0 we have

|e^{i ⟨ z, x ⟩} - 1 - i ⟨ z, h (x) ⟩| \leq \frac{1}{2} {|z|}^{2} {|x|}^{2}

and, further,

|e^{i ⟨ z, x ⟩} - 1 - i ⟨ z, h (x) ⟩| \leq 2 + C |z|, x \in R^{d}

, for some positive constant C such that

|h| \leq C

in

R^{d}

, if we put

γ_{h} : = γ + \int_{R^{d}} (h (x) - x 1_{D} (x)) ν (d x)

componentwise, then the Lévy–Khintchine formula with respect to h becomes:

\hat{μ} (z) = exp [- \frac{1}{2} ⟨ z, A z ⟩ + i ⟨ γ_{h}, z ⟩ + \int_{R^{d}} (e^{i ⟨ z, x ⟩} - 1 - i ⟨ z, h (x) ⟩) ν (d x)], z \in R^{d} .

We note that only

γ_{h}

depends on the choice of the truncation function.

It is possible to “extend” the argument of the exponential function in (A7), called characteristic exponent and denoted by

ψ

, to a subset of

C^{d}

. Precisely, taking into account the set

C : = \{c \in R^{d} : \int_{|x| > 1} e^{⟨ c, x ⟩} ν (d x) < \infty\},

we can define the function

Ψ : \tilde{D} \to C

as follows:

Ψ (w) : = \frac{1}{2} ⟨ w, A w ⟩ + ⟨ γ, w ⟩ + \int_{R^{d}} (e^{⟨ w, x ⟩} - 1 - ⟨ w, x ⟩ 1_{D} (x)) ν (d x), w \in \tilde{D},

(A8)

where

\tilde{D} : = \{w \in C^{d} : Re (w) \in C\}

. We readily note that

i z \in \tilde{D}

for every

z \in R^{d}

and the following equality holds:

Ψ (i z) = ψ (z), z \in R^{d}

. In this sense

Ψ

extends

ψ

to a subset of

C^{d}

. We point out that, in this paper, we set

⟨ u, w ⟩ : = \sum_{j = 1}^{d} u_{j} w_{j}, u, w \in C^{d}

, so

⟨ \cdot, \cdot ⟩

is not the

C^{d}

-Hermitian inner product. Theorem

25.17

in Sato (1999) states that, if

X = {\{X_{t}\}}_{t}

is a Lévy process generated by

(A, ν, γ)

, then

E [e^{⟨ w, X_{t} ⟩}] = e^{t Ψ (w)} for any w \in \tilde{D}, t > 0 :

(A9)

we call

Ψ

the cumulant function of X. As we have previously done, we can introduce a truncation function h and express

Ψ = Ψ_{h}

in

\tilde{D}

, where

Ψ_{h} (w) : = \frac{1}{2} ⟨ w, A w ⟩ + ⟨ γ_{h}, w ⟩ + \int_{R^{d}} (e^{⟨ w, x ⟩} - 1 - ⟨ w, h (x) ⟩) ν (d x), w \in \tilde{D} .

Remark A1.

Let

X = {\{X_{t}\}}_{t}

be a

R^{d}

-valued Lévy process defined on a probability space

(Ω, F, P)

, Ψ be its cumulant function and

θ \in R^{d}

be such that

E [exp (⟨ θ, X_{t} ⟩)] < \infty

for some

t > 0

. By Theorem

25.3

in Sato (1999), this is equivalent to requiring that

θ \in C

. We introduce the random variables

M_{t} : = exp (⟨ θ, X_{t} ⟩ - t Ψ (θ)), t \geq 0 .

Note that

M = {\{M_{t}\}}_{t}

is integrable, by assumption and the fact that

t Ψ (θ)

is constant in Ω for every

t \geq 0

. Let us construct the minimal augmented filtration

F = {(F_{t})}_{t \geq 0}

of X, i.e.,

F_{t} = σ (N ⋃ F_{t}^{0})

for any

t \geq 0

, where

{(F_{t}^{0})}_{t}

is the natural filtration of the process and

N

the collection of

F

-negligible sets; obviously, M is

F

-adapted. According to (Protter 2005, Chapter I, Theorem 31)

F

is right-continuous, so it satisfies the usual conditions, as well. If we fix

t > s \geq 0

, then using (A9) and the properties of the Lévy-increments we see that M is a martingale with mean 1:

\begin{matrix} E [M_{t} | F_{s}] & = E [e^{⟨ θ, X_{t} ⟩ - t Ψ (θ)} | F_{s}] \overset{a . s .}{=} e^{⟨ θ, X_{s} ⟩ - s Ψ (θ)} E [e^{⟨ θ, X_{t} - X_{s} ⟩} | F_{s}] e^{- (t - s) Ψ (θ)} \\ \overset{a . s .}{=} M_{s} E [e^{⟨ θ, X_{t - s} ⟩}] e^{- (t - s) Ψ (θ)} = M_{s} . \end{matrix}

Definition A1.

Fix

T > 0

and let

θ \in C

. The probability measure

P^{θ}

on

F_{T}

, with

P^{θ} \sim P

on

F_{T}

, defined by

\frac{d P^{θ}}{d P} : = M_{T}

is called the Esscher transform of P with respect to θ. The density process is given by

\frac{d {P^{θ}|}_{F_{t}}}{d {P|}_{F_{t}}} = M_{t}, t \in [0, T] .

Appendix C. Characteristics of Semimartingales

The aim of this section is to generalize the concept of generating triplet of a Lévy process to semimartingales. Here we mainly follow (Shiryaev and Jacod 2003, Chapter II).

We start off by fixing a stochastic basis

(Ω, F, P; F)

, with

F

which satisfies the usual conditions. Given two stopping times

S, T

, the stochastic interval is the random set

〚 S, T 〛 : = \{(t, ω) \in R_{0}^{+} \times Ω : S (ω) \leq t \leq T (ω)\},

and

〚 T 〛 : = 〚 T, T 〛

.

Definition A2.

A random set A is called thin if it is of the form

A = ⋃_{n} 〚 T_{n} 〛,

where

{(T_{n})}_{n}

is a sequence of stopping times.

It is important to observe that the sections

\{t \in R_{0}^{+} : (t, ω) \in A\}

, for

ω \in Ω

, are at most countable when A is a thin set.

Theorem A1.

If

X = {\{X_{t}\}}_{t}

is a RCLL, adapted process, then the random set

\{Δ X \neq 0\}

is thin.

We refer to (He et al. 2018, Theorem 3.32) for a proof. Thanks to this result we can easily introduce the next concept.

Definition A3.

Let

X = {\{X_{t}\}}_{t}

be an adapted, RCLL,

R^{d}

-valued process. The measure

μ^{X}

on

R_{0}^{+} \times R^{d}

defined by

μ^{X} (ω; d t, d x) : = \sum_{s} 1_{\{x \neq 0\}} ({Δ X}_{s} (ω)) δ_{(s, Δ X_{s} (ω))} (d t, d x), ω \in Ω,

where

δ_{(\bar{a})}

denotes the Dirac measure at a point

\bar{a}

, is called measure associated with its jumps.

We now consider a process

X = {\{X_{t}\}}_{t}

which is a d-dimensional semimartingale, highlighting that we restrict our attention to RCLL and

F

-adapted semimartingales. Let h be a truncation function; using the measure

μ^{X}

it is possible to derive the characteristics of X (see Shiryaev and Jacod 2003, Chp. II, Definition 2.6), which we denote by the triplet

(B, C, ν^{X})

. They are defined up to a P-null set and only B depends on the choice of h.

Every Lévy process—once we endow the probability space in which it is defined with its minimal augmented filtration—is a

P I I S

process (a RCLL, adapted process which starts at 0 and has independent and stationary increments), which in turn is a semimartingale. Thus, considering a Lévy process

L = {\{L_{t}\}}_{t}

with generating triplet

(A, ν, γ_{h})

relative to a truncation function h, we can apply (Shiryaev and Jacod 2003, Chp. II, Corollary 4.19) together with the uniqueness of the Lévy–Khintchine representation to express its characteristics as

(γ_{h} t, A t, d t \otimes ν (d x)) .

Appendix D. Laplace Cumulant and Geometric Esscher Measure

On a stochastic basis

(Ω, F, P; F)

, with the filtration

F

which satisfies the usual conditions of right-continuity and completeness, we define a d–dimensional semimartingale

X = (X^{1}, \dots, X^{d})

, i.e., a process which can be decomposed as

X = M + A

, a sum of a local martingale M and a process A of locally finite variation which is called drift, or additive compensator, of the semimartingale, and with characteristics

(B, C, ν^{X})

relative to a truncation function h (see Appendix B for the definition). In particular, a special semimartingale is a semimartingale where its finite variation part is predictable; such a special semimartingale has a unique semimartingale decomposition. In view of (Shiryaev and Jacod 2003, Chp. II, Proposition 2.9) we can assume, without loss of generality, that

ν^{X} (\{t\} \times R^{d}) \leq 1

identically. An exposition of semimartingale characteristics is outside the scope of this paper; for this we refer to Shiryaev and Jacod (2003).

Now we introduce the stochastic logarithm and the stochastic exponential, denoted by

L

and

E

, respectively. The symbol · denotes stochastic integration; its exact meaning can vary according to the context. Let X be a semimartingale; the set of stochastic processes which are integrable with respect to X is denoted by

L (X)

. This notion is quite intricate, and it is not the purpose of this article to give an overview on the theory of stochastic integration. For this, we refer to (Protter 2005, Chp. IV, Section 2). The solution of the stochastic integral equation

Y = 1 + Y_{-} \cdot X

is denoted as the stochastic exponential

E (X)

of the semimartingale X. Here, the process

Y_{-} = {\{Y_{-} (t)\}}_{t}

is defined by

Y_{-} (t) : = Y_{t -} = {lim}_{s \to t^{-}} Y_{s}

for

t > 0

, with

Y_{-} (0) : = Y_{0}

. Conversely, if Y is a semimartingale such that both Y and

Y_{-}

do not vanish, then the process

X = \frac{1}{Y_{-}} \cdot Y,

denoted by

L (Y)

, is called the stochastic logarithm of Y, and is the unique semimartingale such that

Y = Y_{0} E (X) .

For a detailed exposition of these concepts we refer to (Shiryaev and Jacod 2003, Chp. II, Section 8).
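To make the two definitions concrete, the following minimal sketch works on a discrete time grid, where the equation Y = 1 + Y_{-} · X becomes the recursion Y_k = Y_{k-1} (1 + ΔX_k) and the stochastic logarithm becomes the sum of the relative increments of Y. The function names and the simulated driving path (a discretized Brownian motion with drift, with hypothetical parameters) are illustrative assumptions and are not taken from the paper.

```python
import math
import random


def stochastic_exponential(increments):
    """Discrete analogue of Y = 1 + Y_- . X: Y_k = Y_{k-1} * (1 + dX_k)."""
    path = [1.0]
    for dx in increments:
        path.append(path[-1] * (1.0 + dx))
    return path


def stochastic_logarithm(path):
    """Discrete analogue of X = (1 / Y_-) . Y: dX_k = (Y_k - Y_{k-1}) / Y_{k-1}."""
    x = [0.0]
    for prev, cur in zip(path, path[1:]):
        x.append(x[-1] + (cur - prev) / prev)
    return x


# Hypothetical driving path: increments of a Brownian motion with drift on [0, 1].
rng = random.Random(0)
n_steps = 1000
dt = 1.0 / n_steps
increments = [0.05 * dt + 0.2 * math.sqrt(dt) * rng.gauss(0.0, 1.0) for _ in range(n_steps)]

x_path = [0.0]
for dx in increments:
    x_path.append(x_path[-1] + dx)

y_path = stochastic_exponential(increments)
x_back = stochastic_logarithm(y_path)

# The stochastic logarithm of E(X) recovers X (up to rounding), mirroring Y = Y_0 E(X).
max_err = max(abs(a - b) for a, b in zip(x_path, x_back))

# Discrete version of the integral equation: Y_n = 1 + sum_k Y_{k-1} dX_k.
integral = sum(y_prev * dx for y_prev, dx in zip(y_path, increments))
equation_gap = abs(y_path[-1] - (1.0 + integral))
```

On the grid, the requirement that Y and Y_{-} do not vanish corresponds to 1 + ΔX_k ≠ 0 at every step, which holds here because the increments are small.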

Definition A4.

Let

θ \in L (X)

be an admissible strategy such that

θ \cdot X

is exponentially special, i.e.,

exp (θ \cdot X)

is a special semimartingale. The Laplace cumulant

{\tilde{K}}^{X} (θ)

of X at θ is the additive compensator of the real-valued, special semimartingale

L (exp (θ \cdot X)) .

The modified Laplace cumulant

K^{X} (θ)

of X at θ is the process

K^{X} (θ) : = log (E ({\tilde{K}}^{X} (θ))) .

(A10)

Note that the additive compensator of the process

L (exp (θ \cdot X)) = \frac{1}{{(exp (θ \cdot X))}_{-}} \cdot exp (θ \cdot X)

is predictable because

exp (θ \cdot X)

is special and

\frac{1}{{(exp (θ \cdot X))}_{-}}

is predictable. Therefore,

L (exp (θ \cdot X))

is a special semimartingale and the definition of

{\tilde{K}}^{X} (θ)

is well posed. As far as

K^{X} (θ)

is concerned, recalling (Shiryaev and Jacod 2003, Chp. III, Theorem 7.4) we have

Δ {\tilde{K}}^{X} {(θ)}_{t} = \int_{R^{d}} (e^{⟨ θ_{t}, x ⟩} - 1) ν^{X} (\{t\} \times d x) > - 1, t \geq 0 .

Thus, the stochastic exponential in (A10) is strictly positive and the process

K^{X} (θ)

is well defined, as well.

If

θ \cdot X

is exponentially special, then

K^{X} (θ)

is its exponential compensator, see (Shiryaev and Jacod 2003, Chp. III, Theorem 7.14). This means that the process

Z^{θ} = {\{Z_{t}^{θ}\}}_{t}

, defined by

Z_{t}^{θ} : = exp (θ \cdot X_{t} - K^{X} {(θ)}_{t}), t \geq 0

, is a local martingale starting at

Z_{0}^{θ} = 1

. We now recall the concept of uniform integrability.

Definition A5.

A non-empty set Φ of real-valued random variables defined on a probability space

(Ω, F, P)

is uniformly integrable if

lim_{n \to \infty} sup_{X \in Φ} E [|X| 1_{\{|X| \geq n\}}] = 0 .

Supposing that

Z^{θ}

is a uniformly integrable martingale, then we can set

P^{θ} (d ω) : = Z_{\infty}^{θ} P (d ω)

, where

Z_{t}^{θ} \to Z_{\infty}^{θ}

a.s. and in

L^{1}

as

t \to \infty

. It is straightforward to show that

Z^{θ}

is the density of

P^{θ}

relative to P; besides, these two distributions are locally equivalent as

Z_{t}^{θ} > 0, t \geq 0

.
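As a numerical illustration, consider the special case where X is a one-dimensional Lévy process and θ is constant. Then the Laplace cumulant is continuous and deterministic, K^{X}(θ)_{t} = t κ(θ) with κ(θ) = log E[e^{θ X_1}], and the density process should have expectation 1 at every fixed time. The sketch below checks this by Monte Carlo for a jump-diffusion with Gaussian jumps; the model and all parameter values are hypothetical choices made for the example, not the NIG model used in the paper.

```python
import math
import random

# Hypothetical parameters of X_t = MU * t + SIGMA * W_t + compound Poisson jumps.
MU, SIGMA = 0.05, 0.2          # drift and Brownian volatility
LAM, M, S = 1.0, -0.1, 0.15    # jump intensity, jump mean, jump standard deviation
THETA, T = 0.5, 1.0            # constant strategy and time horizon


def kappa(u):
    """Cumulant of X_1, i.e. log E[exp(u * X_1)], for the jump-diffusion above."""
    jump_part = LAM * (math.exp(M * u + 0.5 * S**2 * u**2) - 1.0)
    return MU * u + 0.5 * SIGMA**2 * u**2 + jump_part


def sample_x(t, rng):
    """Draw one sample of X_t."""
    # Count Poisson arrivals in [0, t] via exponential inter-arrival times.
    n_jumps = 0
    clock = rng.expovariate(LAM)
    while clock <= t:
        n_jumps += 1
        clock += rng.expovariate(LAM)
    jumps = sum(rng.gauss(M, S) for _ in range(n_jumps))
    return MU * t + SIGMA * math.sqrt(t) * rng.gauss(0.0, 1.0) + jumps


def density_sample(rng):
    """One sample of Z_T = exp(THETA * X_T - T * kappa(THETA))."""
    return math.exp(THETA * sample_x(T, rng) - T * kappa(THETA))


rng = random.Random(42)
n_paths = 20000
mean_density = sum(density_sample(rng) for _ in range(n_paths)) / n_paths
print(f"Monte Carlo estimate of E[Z_T]: {mean_density:.4f}")
```

The estimate fluctuates around 1 with a standard error of order 10^{-3} for this sample size; the check concerns a fixed time T and does not by itself establish uniform integrability.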

The following result (Shiryaev and Jacod 2003, Chp. III, Theorem 7.18) establishes a necessary and sufficient condition for the process

S^{i} : = S_{0}^{i} exp (X^{i}), S_{0}^{i} \in R^{+}

to be a

P^{θ}

-local martingale for every

i = 1, \dots, d

.

Theorem A2.

Let

θ \in L (X)

be such that

θ \cdot X

is exponentially special and

Z^{θ}

is a uniformly integrable martingale. Define

θ_{j}^{(i)} : = \{\begin{matrix} θ_{j}, & j \neq i \\ θ_{i} + 1, & j = i \end{matrix} .

Then the processes

S^{i} = S_{0}^{i} exp (X^{i})

are

P^{θ}

-local martingales if and only if

θ^{(i)} \cdot X

is exponentially special and

K^{X} (θ^{(i)}) = K^{X} (θ)

up to evanescence for every

i = 1, \dots, d

.

We call

P^{θ}

geometric Esscher measure, or Esscher martingale transform for exponential processes. For

d = 1

, in case the geometric Esscher measure exists, (Kallsen and Shiryaev 2002, Theorem 4.2) guarantees its uniqueness: this means that, if we find another process

\tilde{θ} \in L (X)

such that

S = S_{0} exp (X)

is a

P^{\tilde{θ}}

-local martingale, then

P^{\tilde{θ}} = P^{θ}

.
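For a one-dimensional Lévy process X and a constant θ, the condition of Theorem A2 reduces to the scalar equation κ(θ + 1) = κ(θ), where κ is the cumulant function of X_1. The following sketch solves this equation by bisection for an NIG process, assuming the standard parametrization (α, β, δ, μ) with cumulant κ(u) = μ u + δ (√(α² − β²) − √(α² − (β + u)²)), a zero interest rate, and purely hypothetical parameter values. It is meant as an illustration of the martingale condition, not as the procedure used by the authors.

```python
import math

# Hypothetical NIG parameters (standard parametrization, assumed for this example).
ALPHA, BETA, DELTA, MU = 15.0, -2.0, 0.02, 0.001


def kappa(u):
    """NIG cumulant log E[exp(u * X_1)], defined for |BETA + u| <= ALPHA."""
    inner = max(ALPHA**2 - (BETA + u) ** 2, 0.0)  # guard against rounding at the boundary
    return MU * u + DELTA * (math.sqrt(ALPHA**2 - BETA**2) - math.sqrt(inner))


def martingale_gap(theta):
    """kappa(theta + 1) - kappa(theta); it vanishes at the Esscher parameter."""
    return kappa(theta + 1.0) - kappa(theta)


def esscher_parameter(tol=1e-12):
    """Bisection on the interval where both kappa(theta) and kappa(theta + 1) are finite."""
    lo, hi = -ALPHA - BETA, ALPHA - BETA - 1.0
    if martingale_gap(lo) > 0.0 or martingale_gap(hi) < 0.0:
        raise ValueError("no Esscher parameter in the admissible interval")
    # The gap is increasing in theta because kappa is convex.
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if martingale_gap(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


theta_star = esscher_parameter()
print(f"Esscher parameter: {theta_star:.6f}, residual: {martingale_gap(theta_star):.2e}")
```

Since κ is strictly convex, the gap is strictly increasing in θ, so the root is unique whenever it exists, in line with the uniqueness statement for d = 1.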

We conclude with some technical results. Let

θ \in L (X)

be such that

θ \cdot X

is exponentially special. By (Shiryaev and Jacod 2003, Chp. III, Theorem 7.4) and (Shiryaev and Jacod 2003, Chp. II, Proposition 2.9) we can write

{\tilde{K}}^{X} (θ) = \tilde{κ} (θ) \cdot A

, where

\tilde{κ} {(θ)}_{t} : = ⟨ θ_{t}, b_{t} ⟩ + \frac{1}{2} ⟨ θ_{t}, c_{t} θ_{t} ⟩ + \int_{R^{d}} (e^{⟨ θ_{t}, x ⟩} - 1 - ⟨ θ_{t}, h (x) ⟩) F_{t} (d x), t \geq 0 .

(A11)

The next lemma (Kallsen and Shiryaev 2002, Lemma 2.11) allows us to express the drift process as a function of the characteristics of the semimartingale.

Lemma A1.

Let

θ \in L (X)

be such that

θ \cdot X

is a special semimartingale. Then its drift process

D^{X} (θ) = δ (θ) \cdot A

, where

δ {(θ)}_{t} : = ⟨ θ_{t}, b_{t} ⟩ + \int_{R^{d}} ⟨ θ_{t}, x - h (x) ⟩ F_{t} (d x), t \geq 0 .

Finally, we use Girsanov’s theorem to compute the characteristics

(B^{θ}, C^{θ}, {ν^{X}}^{θ})

of X under

P^{θ}

, provided that it exists, relative to the same h:

\{\begin{matrix} {B^{θ}}^{i} = B^{i} + c^{i \cdot} θ \cdot A + h^{i} (x) (\frac{e^{⟨ θ_{t}, x ⟩}}{1 + \hat{W} {(θ)}_{t}} - 1) \ast ν^{X}, i = 1, \dots, d \\ C^{θ} = C \\ {ν^{X}}^{θ} (d t, d x) = \frac{e^{⟨ θ_{t}, x ⟩}}{1 + \hat{W} {(θ)}_{t}} ν^{X} (d t, d x) \end{matrix},

(A12)

where

\hat{W} {(θ)}_{t} : = \int_{R^{d}} (e^{⟨ θ_{t}, x ⟩} - 1) ν^{X} (\{t\} \times d x), t \geq 0

. Note that

\hat{W} {(θ)}_{t} = Δ {\tilde{K}}^{X} {(θ)}_{t} > - 1, t \geq 0 .

Remark A2.

For

ω \in Ω, t \geq 0

and

G \in B (R^{d})

, we have:

\begin{matrix} ν^{X} (ω; \{t\} \times G) & = \int_{R_{0}^{+}} d A_{s} (ω) \int_{R^{d}} 1_{\{t\} \times G} (s, x) F_{(s, ω)} (d x) \\ & = \int_{\{t\}} d A_{s} (ω) \int_{R^{d}} 1_{G} (x) F_{(t, ω)} (d x) = F_{(t, ω)} (G) \int_{\{t\}} d A_{s} (ω) . \end{matrix}

Moreover, as

\int_{\{t\}} d A_{s} (ω) = A_{t} (ω) - A_{t -} (ω),

if the function

A_{\cdot} (ω)

is continuous in t, then

ν^{X} (ω; \{t\} \times d x)

is the null measure on

B (R^{d})

, which implies that

\hat{W} {(θ)}_{t} (ω) = 0

.
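When A is continuous, as for Lévy processes with A_t = t, the remark gives \hat{W}(θ) = 0, and the third line of (A12) becomes a plain exponential tilting of the jump measure: the Lévy measure ν(dx) is replaced by e^{θ x} ν(dx). As a hedged illustration, take a compound Poisson jump part with intensity λ and N(m, s²) jump sizes (a hypothetical choice, not the paper's model). Completing the square shows that the tilted measure is again of the same type, with intensity λ exp(θ m + θ² s² / 2) and jump law N(m + θ s², s²); the sketch below verifies this identity pointwise.

```python
import math

# Hypothetical jump specification: intensity LAM and N(M, S^2) jump sizes.
LAM, M, S = 1.0, -0.1, 0.15


def normal_pdf(x, mean, std):
    """Density of the normal distribution N(mean, std^2) at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def tilted_levy_density(x, theta):
    """Density of e^{theta x} nu(dx) for nu(dx) = LAM * N(M, S^2)(dx)."""
    return math.exp(theta * x) * LAM * normal_pdf(x, M, S)


def tilted_parameters(theta):
    """Intensity, jump mean and jump std of the tilted compound Poisson part."""
    new_lam = LAM * math.exp(theta * M + 0.5 * theta**2 * S**2)
    return new_lam, M + theta * S**2, S


theta = -1.5
new_lam, new_mean, new_std = tilted_parameters(theta)

# Compare both sides of the identity on a small grid of jump sizes.
grid = [-0.6 + 0.1 * k for k in range(13)]
max_gap = max(
    abs(tilted_levy_density(x, theta) - new_lam * normal_pdf(x, new_mean, new_std))
    for x in grid
)
```

A negative θ shifts the jump mean downwards and, for these values, raises the jump intensity, which is the mechanism by which the measure change removes excess drift from exp(X).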

Appendix E. Lévy Processes on Skorohod Space and Relative Entropy of Distributions

Let

(Ω, F, P)

be a probability space which carries a

R^{d}

-valued, additive process

{\{X_{t}\}}_{t}

with system of generating triplets

{\{(A_{t}, ν_{t}, γ_{t})\}}_{t}

. For every

t \geq 0

we define

x_{t} : D \to R^{d}

, where

D

is the Skorohod Space, as

x_{t} (ξ) : = ξ (t), ξ \in D,

and we introduce the

σ

-algebra

F_{D} : = σ (\{x_{t}, t \geq 0\})

. Now we are in the position to define a natural filtration

F : = {(F_{t})}_{t \geq 0}

on the measurable space

(D, F_{D})

, where

F_{t} : = σ (\{x_{s}, 0 \leq s \leq t\}), t \geq 0

. Since

{\{X_{t}\}}_{t}

is an RCLL process, we set the map

ϕ : Ω \to D

as

ϕ (ω) : = X_{\cdot} (ω), ω \in Ω

, where

X_{\cdot} (ω) : [0, \infty) \to R^{d}

, with

X_{\cdot} (ω) (t) : = X_{t} (ω)

for any

t \geq 0

. The function

ϕ

is

F / F_{D}

measurable, since

F_{D} = σ (\{x_{t}, t \geq 0\}) = σ (\{x_{t}^{- 1} (B), B \in B (R^{d}), t \geq 0\})

and, for every

B \in B (R^{d})

and

t \geq 0

, we have

\begin{matrix} ϕ^{- 1} (x_{t}^{- 1} (B)) & = \{ω \in Ω : ϕ (ω) \in x_{t}^{- 1} (B)\} = \{ω \in Ω : X_{\cdot} (ω) \in x_{t}^{- 1} (B)\} \\ & = \{ω \in Ω : X_{t} (ω) \in B\} = X_{t}^{- 1} (B) \in F . \end{matrix}

This enables us to construct the pushforward measure

P^{D}

on

(D, F_{D})

, that is,

P^{D} (A) : = P ϕ^{- 1} (A) = P (ϕ^{- 1} (A)), A \in F_{D} .

(A13)

We now focus on the stochastic process

{\{x_{t}\}}_{t}

defined on the probability space

(D, F_{D}, P^{D})

. Fix a cylinder set

C \in F_{D}

, i.e.,

C = \{ξ \in D : ξ (t_{1}) \in B_{1}, \dots, ξ (t_{n}) \in B_{n}\},

for some

t_{1} < t_{2} < \dots < t_{n}, B_{1}, \dots, B_{n} \in B (R^{d})

and

n \in N

. By (A13), we have

\begin{matrix} P^{D} (x_{t_{1}} \in B_{1}, \dots, x_{t_{n}} \in B_{n}) = P^{D} (C) = P (ϕ^{- 1} (C)) = P (X_{t_{1}} \in B_{1}, \dots, X_{t_{n}} \in B_{n}) . \end{matrix}

Thus,

{\{x_{t}\}}_{t}

and

{\{X_{t}\}}_{t}

are identical in law, whence

{\{x_{t}\}}_{t}

is an additive process with the same system of generating triplets as

{\{X_{t}\}}_{t}

. Specifically, if

{\{X_{t}\}}_{t}

were a Lévy process, then

{\{x_{t}\}}_{t}

would inherit the temporal homogeneity, so it would be a Lévy process, as well.

Consider two Lévy processes

({\{x_{t}\}}_{t}, P)

and

({\{x_{t}\}}_{t}, P^{'})

on the Skorohod space

(D, F_{D})

endowed with the filtration

F

. The next theorem (Sato 1999, Theorems 33.1 & 33.2) provides us with conditions which ensure

{P|}_{F_{t}} \sim {P^{'}|}_{F_{t}}

for any

t > 0

.

Theorem A3.

Let

({\{x_{t}\}}_{t}, P)

,

({\{x_{t}\}}_{t}, P^{'})

be Lévy processes on

R^{d}

with generating triplets

(A, ν, γ)

and

(A^{'}, ν^{'}, γ^{'})

, respectively. Then the following properties are equivalent:

(a): ${P|}_{F_{t}} \sim {P^{'}|}_{F_{t}}$ for every $t > 0$ ;
(b): the generating triplets satisfy: $A = A^{'}$ , $ν \sim ν^{'}$ .

In addition, condition (b) requires that the function

ϕ : R^{d} \to R

defined by

ϕ : = log (\frac{d ν^{'}}{d ν})

satisfies

\int_{R^{d}} {(e^{ϕ (x) / 2} - 1)}^{2} ν (d x) < \infty, γ^{'} - γ - \int_{|x| \leq 1} x (ν^{'} - ν) (d x) \in \{A y, y \in R^{d}\} .

In this case, choosing

η \in R^{d}

such that

γ^{'} - γ - \int_{|x| \leq 1} x (ν^{'} - ν) (d x) = A η

, there exists a process

U = {\{U_{t}\}}_{t}

defined on

D

which satisfies the following properties:

(i): U is a P-Lévy process on $R$ with generating triplet

$\{\begin{matrix} σ_{U}^{2} = ⟨ η, A η ⟩ \\ ν_{U} = {ν ϕ^{- 1}|}_{R \setminus \{0\}} \\ γ_{U} = - \frac{1}{2} ⟨ η, A η ⟩ - \int_{R} (e^{y} - 1 - y 1_{D} (y)) ν ϕ^{- 1} (d y) \end{matrix};$
(ii): $E^{P} [e^{U_{t}}] = E^{P^{'}} [e^{- U_{t}}] = 1$ for every $t \geq 0$ ;
(iii): $e^{U_{t}} = \frac{d P^{'} |_{F_{t}}}{d P |_{F_{t}}}$ P-a.s. for every $t > 0$ .

An explicit expression of the process U, which is unique up to identity in law, can be retrieved in (Sato 1999, Theorem 33.2).

Definition A6.

Given two probability measures

P, P^{'}

on a measurable space

(Ω, F)

, the relative entropy

H (P, P^{'})

of P with respect to

P^{'}

is defined by

H (P, P^{'}) : = \{\begin{matrix} \int_{Ω} log (\frac{d P}{d P^{'}} (ω)) P (d ω), & if P ≪ P^{'} \\ \infty, & otherwise \end{matrix} .

Recall that given two distributions

P, P^{'}

on

(Ω, F)

we have

H (P, P^{'}) \geq 0

, with equality if and only if

P = P^{'}

.
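For distributions on a finite set, Definition A6 reduces to a finite sum, which makes the two properties just recalled easy to check. The sketch below is a minimal illustration; the function name and the example distributions are hypothetical.

```python
import math


def relative_entropy(p, q):
    """H(P, P') for finite distributions given as lists of probabilities.

    Returns infinity when P is not absolutely continuous with respect to P'.
    """
    total = 0.0
    for p_i, q_i in zip(p, q):
        if p_i == 0.0:
            continue  # convention 0 * log 0 = 0
        if q_i == 0.0:
            return math.inf  # P charges a point that P' does not
        total += p_i * math.log(p_i / q_i)
    return total


p = [0.5, 0.3, 0.2]
q = [0.4, 0.4, 0.2]
r = [0.5, 0.5, 0.0]

h_pq = relative_entropy(p, q)   # strictly positive since p != q
h_pp = relative_entropy(p, p)   # zero
h_pr = relative_entropy(p, r)   # infinite: p is not absolutely continuous w.r.t. r
h_rp = relative_entropy(r, p)   # finite: r is absolutely continuous w.r.t. p
```

The last two values show that the relative entropy is not symmetric in its arguments, which is why the order of P and P' in (A14) matters.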

We finally present a theorem which explicitly computes the relative entropy of two equivalent Lévy processes in the Skorohod space as a function of their generating triplets.

Theorem A4.

Let

({\{x_{t}\}}_{t}, P)

,

({\{x_{t}\}}_{t}, P^{'})

be Lévy processes on

R^{d}

defined on

(D, F_{D})

with generating triplets

(A, ν, γ)

and

(A^{'}, ν^{'}, γ^{'})

, respectively. Suppose that

{P|}_{F_{t}} \sim {P^{'}|}_{F_{t}}

for every

t > 0

and choose

η \in R^{d}

such that

γ^{'} - γ - \int_{|x| \leq 1} x (ν^{'} - ν) (d x) = A η .

Assume also that

E^{P} [g (U_{t})] < \infty

for some

t > 0

, where

g (x) : = (|x| \lor 1) e^{|x|}, x \in R

. Then for every

T > 0

we have

H ({P^{'}|}_{F_{T}}, {P|}_{F_{T}}) = \frac{T}{2} ⟨ η, A η ⟩ + T \int_{R^{d}} (\frac{d ν^{'}}{d ν} log \frac{d ν^{'}}{d ν} + 1 - \frac{d ν^{'}}{d ν}) d ν .

(A14)

Proof.

Let us fix a finite time horizon

T > 0

. For any

z \in (0, 1)

, by assumption we have

E^{P} [e^{z U_{T}}] = \int_{R} e^{z x} P_{U_{T}} (d x) \leq \int_{R} g (x) P_{U_{T}} (d x) < \infty .

We introduce the moment generating function

M_{U_{T}} (z) : = E^{P} [e^{z U_{T}}] = e^{T Ψ (z)}, z \in (0, 1),

where

Ψ

is the cumulant function of the Lévy process U, i.e.,

Ψ (z) = \frac{1}{2} σ_{U}^{2} z^{2} + γ_{U} z + \int_{R} (e^{z x} - 1 - z x 1_{D} (x)) ν_{U} (d x), z \in (0, 1),

(A15)

and the last equality is ensured by (A9) in Appendix B. Actually

M_{U_{T}}

is well defined for

z = 1

too, with

M_{U_{T}} (1) = E^{P} [e^{U_{T}}] = e^{T Ψ (1)} = 1

, by (ii) in Theorem A3. We can see that

M_{U_{T}}

is differentiable in

(0, 1)

with

{M_{U_{T}}}^{'} (\cdot) = E^{P} [U_{T} e^{\cdot U_{T}}]

. Indeed, for every

z \in (0, 1)

,

M_{U_{T}} (z) = \int_{R} e^{z x} P_{U_{T}} (d x)

and we can differentiate under the integral sign since

|x| e^{z x} \leq g (x) \in L^{1} (P_{U_{T}}), z \in (0, 1), x \in R .

At this point the dominated convergence theorem readily shows that

lim_{z \to 1^{-}} {M_{U_{T}}}^{'} (z) = \int_{R} x e^{x} P_{U_{T}} (d x) = E^{P} [U_{T} e^{U_{T}}] .

On the other hand, we introduce the function

f (z) : = e^{T Ψ (z)}, z \in (0, 1) .

Here too, f is differentiable in its domain, with derivative given by

f^{'} = T e^{T Ψ} Ψ^{'}

. In particular the following equality is true:

Ψ^{'} (z) = σ_{U}^{2} z + γ_{U} + \int_{R} (x e^{z x} - x 1_{D} (x)) ν_{U} (d x), z \in (0, 1) .

In fact, we can differentiate under the integral sign in (A15) since, for every

z \in (0, 1)

, we have

|x e^{z x} - x| \leq 1 + e

,

x \in D

, with

x e^{z x} - x \leq C x^{2}

in a neighborhood of 0 for some constant

C > 0

which is independent of z. Moreover,

|x| e^{z x} \leq g (x)

for

|x| > 1

, with

\int_{|x| > 1} |x| e^{|x|} ν_{U} (d x) < \infty

by (Sato 1999, Theorem 25.3). Applying the dominated convergence theorem once more, we arrive at

lim_{z \to 1^{-}} {M_{U_{T}}}^{'} (z) = T e^{T Ψ (1)} (σ_{U}^{2} + γ_{U} + \int_{R} (x e^{x} - x 1_{D} (x)) ν_{U} (d x)) .

Therefore

\begin{matrix} E^{P} [U_{T} e^{U_{T}}] & = T e^{T Ψ (1)} (σ_{U}^{2} + γ_{U} + \int_{R} (x e^{x} - x 1_{D} (x)) ν_{U} (d x)) \\ & = \frac{T}{2} ⟨ η, A η ⟩ + T \int_{R^{d}} (\frac{d ν^{'}}{d ν} log \frac{d ν^{'}}{d ν} + 1 - \frac{d ν^{'}}{d ν}) d ν, \end{matrix}

using the expression of

(σ_{U}^{2}, ν_{U}, γ_{U})

in (i) of Theorem A3. Since

\frac{{d P^{'}|}_{F_{t}}}{{d P|}_{F_{t}}} = e^{U_{t}}

for every

t > 0

(see (iii) in Theorem A3) and

H ({P^{'}|}_{F_{T}}, {P|}_{F_{T}}) = \int_{D} \frac{{d P^{'}|}_{F_{T}}}{{d P|}_{F_{T}}} log \frac{{d P^{'}|}_{F_{T}}}{{d P|}_{F_{T}}} d P = E^{P} [U_{T} e^{U_{T}}],

we obtain (A14) and complete the proof. □
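As a sanity check of the argument, consider the simplest case d = 1 with no jumps, ν = ν' = 0, and A = σ² > 0, so that η = (γ' − γ)/σ². This special case is chosen here only for illustration. By (i) of Theorem A3 the process U is a Brownian motion with drift, and every step of the proof can be carried out in closed form:

```latex
\begin{align*}
\sigma_U^2 &= \sigma^2 \eta^2, \qquad
\gamma_U = -\tfrac{1}{2} \sigma^2 \eta^2, \qquad
\nu_U = 0, \\
\Psi(z) &= \tfrac{1}{2} \sigma^2 \eta^2 z^2 - \tfrac{1}{2} \sigma^2 \eta^2 z
         = \tfrac{1}{2} \sigma^2 \eta^2 \, z (z - 1), \qquad \Psi(1) = 0, \\
\Psi'(1) &= \sigma_U^2 + \gamma_U = \tfrac{1}{2} \sigma^2 \eta^2, \\
E^{P}\big[ U_T e^{U_T} \big] &= T e^{T \Psi(1)} \Psi'(1)
         = \frac{T}{2} \sigma^2 \eta^2
         = \frac{T (\gamma' - \gamma)^2}{2 \sigma^2} .
\end{align*}
```

The last expression is the relative entropy between the Gaussian laws N(γ'T, σ²T) and N(γT, σ²T), in agreement with (A14) when the integral term vanishes.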

Remark A3.

In the case of two

R

-valued Lévy processes

({\{x_{t}\}}_{t}, P)

,

({\{x_{t}\}}_{t}, P^{'})

with generating triplets

(σ^{2}, ν, γ)

and

({σ^{2}}^{'}, ν^{'}, γ^{'})

, respectively, and under the hypotheses of the previous theorem, (A14) reduces to

H ({P^{'}|}_{F_{T}}, {P|}_{F_{T}}) = \frac{T}{2} σ^{2} η^{2} + T \int_{R} (\frac{d ν^{'}}{d ν} log \frac{d ν^{'}}{d ν} + 1 - \frac{d ν^{'}}{d ν}) d ν .

If we are dealing with pure jump processes (e.g.,

NIG

processes), the first term in the sum of the right-hand side is 0; if instead

σ^{2} > 0

, then we have

\frac{T}{2} σ^{2} η^{2} = \frac{T}{2 σ^{2}} {(γ^{'} - γ - \int_{|x| \leq 1} x (ν^{'} - ν) (d x))}^{2},

recovering (Cont and Tankov 2004a, Proposition 9.10).
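The pure-jump case can be checked numerically in a simple setting. Assume, for illustration only, that under P and P' the process is a Poisson process without drift, with unit jumps and intensities λ and λ', so that ν = λ δ_1, ν' = λ' δ_1 and dν'/dν = λ'/λ. The formula then gives T λ (φ log φ + 1 − φ) with φ = λ'/λ. In this example the density on F_T depends on the path only through the number of jumps up to T, so the path-space entropy coincides with the relative entropy between the Poisson laws with means λ'T and λT, which the sketch computes by direct summation. All names and parameter values are hypothetical.

```python
import math

# Hypothetical intensities under P and P', and time horizon.
LAM, LAM_PRIME, T = 2.0, 3.0, 1.5


def entropy_formula(lam, lam_prime, horizon):
    """Right-hand side of the entropy formula for nu = lam * delta_1, nu' = lam_prime * delta_1."""
    ratio = lam_prime / lam  # Radon-Nikodym derivative d nu' / d nu
    return horizon * lam * (ratio * math.log(ratio) + 1.0 - ratio)


def log_poisson_pmf(k, mean):
    """Logarithm of the Poisson probability mass function at k."""
    return k * math.log(mean) - mean - math.lgamma(k + 1)


def entropy_numeric(lam, lam_prime, horizon, k_max=200):
    """Relative entropy of Poisson(lam_prime * T) with respect to Poisson(lam * T)."""
    total = 0.0
    for k in range(k_max + 1):
        log_p_prime = log_poisson_pmf(k, lam_prime * horizon)
        log_p = log_poisson_pmf(k, lam * horizon)
        total += math.exp(log_p_prime) * (log_p_prime - log_p)
    return total


closed_form = entropy_formula(LAM, LAM_PRIME, T)
numeric = entropy_numeric(LAM, LAM_PRIME, T)
print(f"closed form: {closed_form:.10f}, direct summation: {numeric:.10f}")
```

The truncation at k_max = 200 is harmless here because the Poisson means are small; for larger intensities or horizons the cutoff would need to grow accordingly.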

References

Abramowitz, Milton, and Irene A. Stegun. 1970. Handbook of Mathematical Functions: With Formulas, Graphs, and Mathematical Tables. Washington, DC: US Government Printing Office, vol. 55.
Asmussen, Søren, and Jan Rosiński. 2001. Approximations of Small Jumps of Lévy Processes with a View towards Simulation. Journal of Applied Probability 38: 482–93.
Barndorff-Nielsen, Ole E. 1998. Processes of Normal Inverse Gaussian Type. Finance and Stochastics 2: 41–68.
Barndorff-Nielsen, Ole, and Christian Halgreen. 1977. Infinite Divisibility of the Hyperbolic and Generalized Inverse Gaussian Distributions. Probability Theory and Related Fields 38: 309–12.
Benth, Fred Espen, and Maren Diane Schmeck. 2014. Pricing futures and options in electricity markets. In The Interrelationship Between Financial and Energy Markets. Berlin/Heidelberg: Springer, pp. 233–60.
Benth, Fred Espen, Jurate Saltyte Benth, and Steen Koekebakker. 2008. Stochastic Modelling of Electricity and Related Markets. Singapore: World Scientific, vol. 11.
Carr, Peter, and Dilip Madan. 1999. Option Valuation using the Fast Fourier Transform. Journal of Computational Finance 2: 61–73.
Chen, Dan. 2011. Three Essays on Pricing and Hedging in Incomplete Markets. Ph.D. thesis, The London School of Economics and Political Science (LSE), London.
Choulli, Tahir, and Christophe Stricker. 2006. More on minimal entropy–Hellinger martingale measure. Mathematical Finance 16: 1–19.
Cont, Rama, and Peter Tankov. 2004a. Financial Modelling with Jump Processes. Boca Raton: Chapman and Hall/CRC.
Cont, Rama, and Peter Tankov. 2004b. Nonparametric calibration of jump-diffusion option pricing models. Journal of Computational Finance, Incisive Media 7: 1–49.
Eberlein, Ernst, and Ernst August V. Hammerstein. 2004. Generalized hyperbolic and inverse Gaussian distributions: Limiting cases and approximation of processes. In Seminar on Stochastic Analysis, Random Fields and Applications IV. Berlin: Springer, pp. 221–64.
Esscher, F. 1932. On the Probability Function in the Collective Theory of Risk. Skandinavisk Aktuarietidskrift 15: 175–95.
Gerber, Hans U., and Elias SW Shiu. 1994. Option Pricing by Esscher Transforms. Transactions of the Society of Actuaries 46: 99–191.
Hell, Philipp, Thilo Meyer-Brandis, and Thorsten Rheinländer. 2012. Consistent factor models for temperature markets. International Journal of Theoretical and Applied Finance 15: 1250027.
He, Sheng-wu, Jia-gang Wang, and Jia-an Yan. 2018. Semimartingale Theory and Stochastic Calculus. New York: Routledge.
Iacus, Stefano Maria. 2011. Option Pricing and Estimation of Financial Models with R. New York: John Wiley and Sons.
Jeanblanc, Monique, Marc Yor, and Marc Chesney. 2009. Mathematical Methods for Financial Markets. Berlin: Springer.
Kallsen, Jan, and Albert N. Shiryaev. 2002. The Cumulant Process and Esscher’s Change of Measure. Finance and Stochastics 6: 397–428.
Klüppelberg, Claudia, Alexander Lindner, and Ross Maller. 2004. A Continuous Time GARCH Process Driven by a Lévy Process: Stationarity and Second Order Behaviour. Journal of Applied Probability 41: 601–622.
Lee, Young, and Thorsten Rheinländer. 2012. Optimal martingale measures for defaultable assets. Stochastic Processes and their Applications 122: 2870–84.
Lewis, PA W., and Gerald S. Shedler. 1979. Simulation of Nonhomogeneous Poisson Processes by Thinning. Naval Research Logistics Quarterly 26: 403–13.
Morozov, Vladimir Alekseevich. 1966. On the solution of functional equations by the method of regularization. Doklady Akademii Nauk 167: 510–12.
Protter, P. E. 2005. Stochastic Integration and Differential Equations, 2nd ed. Stochastic Modelling and Applied Probability. Berlin: Springer, vol. 21.
Rheinländer, Thorsten, and Jenny Sexton. 2011. Hedging Derivatives. Singapore: World Scientific, vol. 15.
Rudin, Walter. 1987. Real and Complex Analysis, 3rd ed. Mathematics Series; New York: McGraw-Hill Higher Education.
Sato, Ken-Iti. 1999. Lévy Processes and Infinitely Divisible Distributions. Cambridge: Cambridge University Press.
Shiryaev, Albert, and Jean J. Jacod. 2003. Limit Theorems for Stochastic Processes. A Series of Comprehensive Studies in Mathematics; Berlin: Springer, vol. 288.
Shreve, Steven E. 2004. Stochastic Calculus for Finance II (Continuous–Time Models). Springer Finance Textbooks. Berlin: Springer, vol. 11.
Van Heerwaarden, Angela E., Rob Kaas, and Marc J. Goovaerts. 1989. Properties of the Esscher premium calculation principle. Insurance Mathematics and Economics 8.4 335: 261–67.
Watson, George Neville. 1966. A Treatise on the Theory of Bessel Functions, 2nd ed. Cambridge: Cambridge University Press.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
