Dynamic Programming for Designing and Valuing Two-Dimensional Financial Derivatives

Ben-Abdellatif, Malek; Ben-Ameur, Hatem; Chérif, Rim; Rémillard, Bruno

doi:10.3390/risks12120183

Open AccessFeature PaperArticle

Dynamic Programming for Designing and Valuing Two-Dimensional Financial Derivatives

¹

Department of Finance, School of Business, ESLSCA University, Giza 12511, Egypt

²

Department of Decision Sciences, HEC Montréal, Montréal, QC H3T 2A7, Canada

³

Department of Management, School of Business, The American University of Cairo, New Cairo 11835, Egypt

^*

Author to whom correspondence should be addressed.

Risks 2024, 12(12), 183; https://doi.org/10.3390/risks12120183

Submission received: 22 October 2024 / Revised: 16 November 2024 / Accepted: 18 November 2024 / Published: 21 November 2024

Download

Browse Figures

Versions Notes

Abstract

We use dynamic programming, finite elements, and parallel computing to design and evaluate two-dimensional financial derivatives. Our dynamic program is flexible, as it divides the evaluation process into two components: one related to the dynamics of the underlying process and the other to the characteristics of the financial derivative. It is efficient as it uses local polynomials at each step of the backward recursion to approximate the option value function, while it assumes only a numerical (but not a statistical) error and a state (but not a time) discretization. Parallel computing is used to speed up the model resolution and enhance its overall efficiency. To support our construction, we evaluate American options, which are subject to market risk, and exchangeable bonds, which are subject to default risk.

Keywords:

dynamic programming; finite elements; parallel computing; two-dimensional American options; exchangeable bonds

1. Introduction

This paper aims to design a flexible dynamic program (DP) and efficiently evaluate two-dimensional financial derivatives. This methodology is highly flexible because it relies on known transition parameters of the underlying process and known payoff functions of financial derivatives. Moreover, it is efficient because it uses local polynomials to approximate the value functions at each step of the backward recursion. Our model assumes only a numerical (but not a statistical) error and a state (but not a time) discretization. As an illustration, we evaluate American options and exchangeable bonds.

DP and finite elements (local approximations) have been efficiently used for valuing one-dimensional financial derivatives. See Ben-Ameur et al. (2016) and Ayadi et al. (2016) for methods in valuing American-style options and corporate securities. Since this approach is not feasible in high-dimensional state spaces, DP is usually combined with the Monte Carlo simulation and spectral analysis (global approximations) to approximate the value functions at each step of the backward recursion. In this case, the numerical error is amplified by a statistical error underlying the Monte Carlo approach, which can be seen as an extra cost to overcome the curse of dimensionality.

The modeling process assumes a constant search for the best compromise between accuracy and computing time. Local approximations are much more accurate than global approximations, but they are more time-consuming. Parallel computing is used to speed up the model resolution at each step of the backward recursion and to enhance the overall efficiency of the numerical procedure. This paper supports the idea that DP combined with finite elements remains effective for valuing low-dimensional financial derivatives. Methods for model-dimensional reduction can help to reach this objective (Reisinger and Wittum 2007).

American-style financial derivatives cannot be valued in closed form and must be approximated in some way. The literature reports three main backward pricing methodologies, namely, the lattice approach, finite differences, and dynamic programming. They all compete for valuing low-dimensional financial derivatives, but DP is the main methodology used in high-dimensional state spaces since it naturally combines with the Monte Carlo simulation.

The lattice approach assumes a discrete model that usually converges to a continuous counterpart when the time and space increments tend to zero. It runs in two steps. The first step builds the lattice forward, while the second step evaluates financial derivatives backward in time (Boyle 1988; Boyle et al. 1989; Kamrad and Ritchken 1991). The discrete approach and the continuous approach can be combined to improve the efficiency of the lattice construction (Bayer et al. 2022; Bungartz et al. 2012; Kargin 2005; Tanaka 2014).

Finite differences assume state and time discretizations of continuous models and numerically solve partial differential equations that characterize the (holding) value function of an option contract at each evaluation node (Hartley 2000). Berridge and Schumacher (2008) and Dockendorf and Paxson (2015) considered European exotic options, specifically focusing on the construction of the evaluation grid nodes. Pettersson et al. (2008) and Milovanović and Von Sydow (2018) used spectral analysis, while Wang et al. (2023) and Glau and Wunderlich (2022) used neural networks for value-function approximations at each step of their numerical procedures. Multiple methods have been used to address the weakness of the differential operator near the option exercise frontier (Attipoe and Tambue 2022; Heo et al. 2019; Kim et al. 2016). Methods for model dimension reduction are used to overcome the curse of dimensionality (Caldana et al. 2016; Hanbali and Linders 2019).

The main challenge in valuing high-dimensional American options in continuous models, which combines dynamic programming and Monte Carlo simulation, is twofold. On the one hand, if the sole evaluation node is at inception, forward DP considers sub-optimal stopping policies, given that the true option exercise strategy is unknown (Andersen and Broadie 2004; Boyle et al. 1997; Broadie and Detemple 1997; Del Moral et al. 2012; Ibáñez and Zapatero 2004; Liu and Hong 2009). On the other hand, if the evaluation process is achieved along the random sample paths of the underlying asset vector, backward DP solves the model recursively from the option maturity down to the origin. At each step of the backward recursion and each evaluation node, a poor Monte Carlo simulation of size one is used to estimate the option (holding) value since the underlying trajectories never intersect. The literature reports several remedies. The bundling-based approach assumes that sample paths in the same neighborhood have the same evaluation root (Barraquand and Martineau 1995; Raymar and Zwecher 1997; Bally and Pages 2003a, 2003b; Bally and Printems 2005; Jin et al. 2007, 2013). This approach goes back to Tilley (1993) and Raymar and Zwecher (1997) for valuing one-dimensional American options. The least squares-based approach adjusts the poor Monte Carlo estimates of size one at each step of the backward recursion via global approximations, such as linear, local, and robust regressions (Carrière 1996; Longstaff and Schwartz 2001; Tompaidis and Yang 2014; Tsitsiklis and Van Roy 1999) and neural networks (Chen and Wan 2021; Kohler et al. 2010). The simulated tree-based approach augments the number of simulated paths at each evaluation node (Broadie and Glasserman 1997). Forward and backward DP create a dual approach for valuing multivariate American options (Broadie and Glasserman 1997; Haugh and Kogan 2004; Rogers 2002). These methodologies inherit statistical errors from the generation of random paths and numerical errors from multiple approximations. Variance reduction techniques can be employed to enhance the overall efficiency of the numerical experiment (Dang et al. 2015; Giles 2015).

We propose a two-dimensional backward DP approach, which assumes only numerical (but not statistical) errors and state (but not time) discretization. Accurate local polynomials are employed to approximate the option value function at each step of the backward recursion, with parallel computing utilized to enhance the overall efficiency of the procedure.

The rest of this paper is organized as follows. Section 2 presents our model for valuing two-dimensional American options, Section 3 focuses on exchangeable bonds, and Section 4 concludes the paper.

2. Designing and Valuing American Options

We consider a frictionless market in which two stocks,

S^{1}

and

S^{2}

, are traded continuously and move according to a bivariate lognormal process. The risk-free rate,

r_{f}

, is assumed to be constant. This market is known to be arbitrage-free and complete. Thus, there exists a unique risk-neutral probability measure

Q

under which the state process

(S^{1}, S^{2})

moves according to the following stochastic differential equation:

\frac{d S_{t}^{i}}{S_{t}^{i}} = (r_{f} - d_{i}) d t + σ_{i} d W_{t}^{i}, for i = 1, 2,

(1)

where

d_{i}

denotes the continuous dividend rate of stock i,

σ_{i}

denotes its log-return volatility, and

(W^{1}

,

W^{2})

denotes a bivariate correlated Brownian motion with the following:

Cor (W_{t}^{1}, W_{t}^{2}) = ρ, for all t > 0 .

The analytical solution of (1) is as follows:

S_{u}^{i} = S_{t}^{i} exp [(r_{f} - d_{i} - \frac{σ_{i}^{2}}{2}) (u - t) + σ_{i} (W_{u}^{i} - W_{t}^{i})], for 0 \leq t \leq u .

(2)

An American option on

(S^{1}, S^{2})

with maturity T is defined by its cash-flow process,

κ (t, x, y) \geq 0

, where

x = S_{t}^{1}

and

y = S_{t}^{2}

denote the levels of the underlying stocks at time

t \in [0, T]

. This is the option exercise value

v_{t}^{e} (x, y) = κ (t, x, y)

. Examples include the exchange option, as follows:

κ (t, x, y) = max (x - y, 0),

the call-on-max option, as follows:

κ (t, x, y) = max (max (x, y) - K, 0),

and the put-on-min option, as follows:

κ (t, x, y) = max (K - min (x, y), 0),

where K denotes the option strike price. The exchange option gives the option holder the right to exchange

S^{2}

for

S^{1}

; the call-on-max option gives its holder the right to purchase the higher-priced asset at the strike price K; and the put-on-min gives the right to sell the lower-priced asset at the strike K. Stulz (1982) and Johnson (1987) derived closed-form solutions for their European counterparts, characterized by the following:

κ (t, x, y) = 0, for 0 \leq t < T .

We herein consider Bermudan options with

N + 1

regular exercise opportunities, that is,

t_{0} = 0, t_{1}, \dots, t_{N} = T

, where

t_{n + 1} - t_{n} = Δ t

. No-arbitrage pricing gives the following:

v_{t_{n}}^{h} (x, y) = E^{Q} [e^{- r_{f} Δ t} v_{t_{n + 1}} (S_{t_{n + 1}}^{1}, S_{t_{n + 1}}^{2}) ∣ (S_{t_{n}}^{1}, S_{t_{n}}^{2}) = (x, y)],

(3)

where

v_{t_{n}}^{h} (x, y)

and

v_{t_{n}} (x, y) = max (v_{t_{n}}^{h} (x, y), v_{t_{n}}^{e} (x, y))

denote the option holding value and overall option value functions at

(t_{n}, x, y)

. We set

v_{t_{N}}^{h} (x, y) = 0

for all

x > 0

and

y > 0

.

The expectation in Equation (3) cannot be computed in closed form and has to be approximated in some way. Valuing American options can be interpreted as an optimal Markov decision process (stochastic dynamic program) since the option value function is forward-looking and known at maturity.

Let

G

be a set of grid points

{(a_{1}, b_{1}), (a_{1}, b_{2}), \dots, (a_{p}, b_{q})}

such that

max (Δ a_{k}, Δ b_{l}) \to 0

and

Q [(S_{t}^{1}, S_{t}^{2}) \in [a_{p}, \infty) \times R_{+}^{*} \cup R_{+}^{*} \times [b_{q}, \infty)] \to 0

, when p and

q \to \infty

. We set

a_{0} = b_{0} = 0

and

a_{p + 1} = b_{q + 1} = \infty

. The rectangle

[a_{i}, a_{i + 1}) \times [b_{j}, b_{j + 1})

is designated by

R_{i j}

.

Define the transition tables

T^{00}, T^{10}, T^{01}

, and

T^{11}

as follows:

T_{k l i j}^{ν μ} = E^{Q} [{(S_{t_{n + 1}}^{1})}^{ν} {(S_{t_{n + 1}}^{2})}^{μ} I ((S_{t_{n + 1}}^{1}, S_{t_{n + 1}}^{2}) \in R_{i j}) ∣ (S_{t_{n}}^{1}, S_{t_{n}}^{2}) = (a_{k}, b_{l})], for ν and μ \in {0, 1} .

(4)

For example,

T_{k l i j}^{00}

represents the transition probability that the Markov process

(S^{1}, S^{2})

moves from

(a_{k}, b_{l})

at

t_{n}

and visits the rectangle

R_{i j}

at

t_{n + 1}

. The rest of the transition tables represent truncated first-order direct and cross-moments of the state process

(S^{1}, S^{2})

at

t_{n + 1}

given the current position

(a_{k}, b_{l})

at

t_{n}

. The computation of these transition parameters, which are at the heart of our DP approach, can be treated as a fixed cost, provided that the Markov state process is homogeneous,

t_{n + 1} - t_{n}

is a positive constant, and the grid points,

G

, are fixed over time. We derive them in closed form in Appendix A.

Assume that an approximation of the option value function is available at a future decision date

t_{n + 1}

on

G

, as indicated by

{\tilde{v}}_{t_{n + 1}} (a_{k}, b_{l})

, for

k = 1, \dots, p

and

l = 1, \dots, q

. This is not really a strong assumption since the option value function is known at maturity in closed form, that is,

{\tilde{v}}_{t_{N}} = v_{t_{N}} = v_{t_{N}}^{e}

. DP acts as follows:

Use a bilinear piecewise polynomial and interpolate the option value function ${\tilde{v}}_{t_{n + 1}}$ at $t_{n + 1}$ from $G$ to the overall state space ${[0, \infty)}^{2}$ by setting the following:

${\hat{v}}_{t_{n + 1}} (x, y) = \sum_{i = 0}^{p} \sum_{j = 0}^{q} (α_{i j}^{n + 1} + β_{i j}^{n + 1} x + γ_{i j}^{n + 1} y + δ_{i j}^{n + 1} x y) \times I ((x, y) \in R_{i j}),$

(5)

where the local coefficients $α_{i j}^{n + 1}$ , $β_{i j}^{n + 1}$ , $γ_{i j}^{n + 1}$ , and $δ_{i j}^{n + 1}$ for $i = 1, \dots, p - 1$ and $j = 1, \dots, q - 1$ are derived in closed form by solving a system of linear equations:

$\begin{matrix} \{\begin{matrix} {\hat{v}}_{t_{n + 1}} (a_{i}, b_{j}) = {\tilde{v}}_{t_{n + 1}} (a_{i}, b_{j}) \\ {\hat{v}}_{t_{n + 1}} (a_{i + 1}, b_{j}) = {\tilde{v}}_{t_{n + 1}} (a_{i + 1}, b_{j}) \\ {\hat{v}}_{t_{n + 1}} (a_{i}, b_{j + 1}) = {\tilde{v}}_{t_{n + 1}} (a_{i}, b_{j + 1}) \\ {\hat{v}}_{t_{n + 1}} (a_{i + 1}, b_{j + 1}) = {\tilde{v}}_{t_{n + 1}} (a_{i + 1}, b_{j + 1}) \end{matrix} \end{matrix}$

(6)

and the rest are set to their adjacent counterparts;
Use no-arbitrage pricing and approximate the option holding value function at $t_{n}$ on $G$ :

$\begin{matrix} {\tilde{v}}_{t_{n}}^{h} (a_{k}, b_{l}) & = & E^{Q} [e^{- r_{f} Δ t} {\hat{v}}_{t_{n + 1}} (S_{t_{n + 1}}^{1}, S_{t_{n + 1}}^{2}) ∣ (S_{t_{n}}^{1}, S_{t_{n}}^{2}) = (a_{k}, b_{l})] \\ = & e^{- r Δ t} \sum_{i, j} (α_{i j}^{n + 1} T_{k l i j}^{00} + β_{i j}^{n + 1} T_{k l i j}^{10} + γ_{i j}^{n + 1} T_{k l i j}^{01} + δ_{i j}^{n + 1} T_{k l i j}^{11}); \end{matrix}$

(7)
Approximate the option value function at $t_{n}$ on $G$ :

$\begin{matrix} {\tilde{v}}_{t_{n}} (a_{k}, b_{l}) & = & max (v_{t_{n}}^{e} (a_{k}, b_{l}), {\tilde{v}}_{t_{n}}^{h} (a_{k}, b_{l})); \end{matrix}$

(8)
Go to step 1 and repeat until $t_{n} = 0$ .

Equation (7) splits the option holding value into two parts: The local coefficients are related to the option contract and the transition parameters to the dynamics of the state process. Overall, the option holding value is calculated by summing local future value components, each multiplied by their associated transition parameters, and then discounted back at the risk-free rate. The same equation shows that DP assumes a space discretization, but not time discretization, and does respect the true dynamics of the state process. Note that the time increment

Δ t

does not need to be small as it is required by the lattice approach and by finite differences. Our numerical procedure can be designed to stop and evaluate option contracts only at decision dates since the transition parameters are derived in closed form for any positive time increment

Δ t

. For a European option, set

Δ t

at the option maturity and run DP in one step; for a Bermudan option, set

Δ t

at the time interval between two decision dates and run DP in multiple steps. For example, we can fix

Δ t

at the time interval between two coupon dates for options embedded in bonds that can be exercised only at payment dates. Finally, Equation (5) shows that DP ends up with an interpolation

{\hat{v}}_{t_{n}} (x, y)

of

v_{t_{n}} (x, y)

, for all

t_{n}, x > 0

and

y > 0

. Thus, the first and second derivatives of

v_{t_{n}} (x, y)

become available at all

(t_{n}, x, y)

, among other sensitivity coefficients. For example, the approximated deltas at

(t_{0}, x_{0}, y_{0})

are as follows:

\frac{\partial {\hat{v}}_{t_{0}} (x_{0}, y_{0})}{\partial x} = β_{i j}^{0} + δ_{i j}^{0} y_{0} and \frac{\partial {\hat{v}}_{t_{0}} (x_{0}, y_{0})}{\partial y} = γ_{i j}^{0} + δ_{i j}^{0} x_{0},

given that

(x_{0}, y_{0}) \in R_{i j}

. Higher-order local approximations are more accurate but more time-consuming.

At each step

t_{n}

of the backward recursion, the computational effort underlying Equation (7) can be conducted simultaneously for

k = 1, \dots, p

and

l = 1, \dots, q

. We used parallel computing to improve the overall efficiency of our DP procedure. The code lines are written in C and compiled with GCC. Parallel computing was performed using the MPI library. We used the supercomputer Briarée managed by Calcul Québec and Compute Canada.1 Briarée has 8064 CPUs (cores), each running at the speed of 2.667 GHz. See Appendix B for further details.

Our numerical experiment focuses on the put-on-min option contract. Table 1 and Table 2 compare DP to Boyle (1988), which uses a two-dimensional binomial tree for valuing European vs. American put-on-min options. The closed-form solution for the European contract is given in Stulz (1982). We set

S_{0}^{1} = S_{0}^{2} = 40

,

d_{1} = d_{2} = 0

,

σ_{1} = 0.2

,

σ_{2} = 0.3

,

ρ = 0.5

,

r_{f} = 5 %

(effective)

\equiv 0.04879

(continuously compounded), and

T = 7

months

\equiv 0.58333

years.

As explained above, DP does not need time discretization. For comparison purposes, we run DP with the same number of time steps as Boyle (1988). When the number of time steps is low, DP performs almost perfectly, while the binomial tree method is less accurate. As expected, with a higher number of time steps, the binomial tree converges and achieves accuracy comparable to that of DP. Boyle (1988) does not report computing times. Each DP’s CPU time (in seconds) can be split into a fixed cost, associated with the transition parameters, and a linear cost, associated with the backward recursion. The fixed cost accounts for a sizable portion of the total CPU time. The relevant DP’s computing time is the linear CPU time since the transition parameters can be computed only once or twice a day, following the model estimation step.

Table 3 compares DP to alternative methodologies based on the Monte Carlo simulation in the context of the dual approach of Rogers (2002) and the bundling approach by Jin et al. (2007). Their random samples are of sizes

10, 000

and

60, 000

, respectively. We report their respective

95 %

confidence intervals. Hartley (2000) used finite differences. The parameters are

K = 100

,

d_{1} = d_{2} = 0

,

ρ = 0

,

σ_{1} = σ_{2} = 0.6

,

r = 0.06

, and

T = 0.5

. DP values, obtained with

p = q = 300

, almost always belong to their associated

95 %

confidence intervals and compare extremely well with Hartley’s (2000) values, which were described by Rogers (2002) as extremely accurate. Figure 1 plots the exercise region of a put-on-min option at the fourth of ten decision dates, where

K = 100

. For example, it is optimal to hold the option when

(S_{t_{4}}^{1}, S_{t_{4}}^{2}) = (60, 60)

, even though exercising the option has value.

3. Designing and Valuing Exchangeable Bonds

An exchangeable bond allows bondholders the discretion to convert their holdings into shares of a company other than the issuer. This instrument is subject to both the credit risk of the issuing company and the market risk of the underlying stock.

Exchangeable bonds have been offered since the early 1970s. About 14% of convertible bonds were exchangeable in the US, according to Grimwood and Hodges (2002). The issuance of exchangeable bonds is mainly motivated in the literature as a tax-saving strategy (Jones and Mason 1986) and/or a divesting policy (Barber 1993).

We consider a public company with a debt portfolio made of a senior straight bond and a junior exchangeable bond. This (issuing) firm is assumed to hold the shares underlying the exchangeable bond, which are pledged to junior bondholders who have priority on the pledged shares under default.

The balance-sheet equality (BSE) of the issuing firm depends on whether the exchange option was already exercised or not, which results in the following:

a + s \times I (f_{t_{n - 1}} = 0) + {TB}_{t_{n}} (a, s, f) - {BC}_{t_{n}} (a, s, f) = D_{t_{n}}^{s} (a, s, f) + D_{t_{n}}^{j} (a, s, f) + E_{t_{n}} (a, s, f),

(9)

where

s = S_{t_{n}}

denotes the value of the shares underlying the exchangeable bond,

a + s = A_{t_{n}} + S_{t_{n}}

denotes the value of the issuing firm’s assets;

f_{t_{n - 1}} = 1

if the exchange option was exercised before or at

t_{n - 1}

, and 0 otherwise (held until

t_{n - 1}

), with

f = f_{t_{n - 1}}

. We assume that the evaluation date

t_{n}

belongs to the coupon/capital payment dates

P = {t_{0}, t_{1}, \dots, t_{N}}

. The couple

(A, S)

is modeled as a lognormal process, as described in Equation (1). This two-dimensional structural setting builds on the work of Ayadi et al. (2016), who considered junior and senior debt portfolios without embedded options.

The value functions

{TB}_{t_{n}} (a, s, f), {BC}_{t_{n}} (a, s, f), D_{t_{n}}^{s} (a, s, f), D_{t_{n}}^{j} (a, s, f),

and

E_{t_{n}} (a, s, f)

represent the (net present) values of tax benefits, bankruptcy costs, the senior straight bond, the junior exchangeable bond, and equity of the issuing firm at

(t_{n}, a, s, f)

, respectively. In particular,

D_{t_{n}}^{j} (a, s, 1) = 0

, for all

a > 0

and

s > 0

. Each value function is characterized by two components, namely, its current cash flows and future potentialities, as shown in Table 4, Table 5, Table 6 and Table 7.

The firm is committed to paying

d_{n} = d_{n}^{s} + d_{n}^{j}

at

t_{n}

to its creditors, where

d_{n}^{s} = C_{n}^{s} + P_{n}^{s}

and

d_{n}^{j} = C_{n}^{j} + P_{n}^{j}

denote the regular interest and principal payments to senior and junior bondholders, respectively. The current cash flow of

{TB}_{t_{n}}

is

{tb}_{n} = r_{c} (C_{n}^{s} + C_{n}^{j}) = r_{c} C_{n}

under survival at

t_{n}

, where

r_{c}

denotes the corporate tax rate. The current cash flow of

{BC}_{t_{n}}

is proportional to the remaining firm’s asset value

w a = w A_{t_{n}}

, under default at

t_{n}

, where w denotes the bankruptcy cost parameter.

DP starts the resolution at maturity, where the value functions of the corporate securities in Equation (9) are known. Table 4 presents these value functions under the condition of exercise before

t_{N}

(

f_{t_{N - 1}} = 1

), which is consistent with Ayadi et al. (2016). The junior debt and pledged shares vanish from the BSE at

t_{N}

. Table 5 reports six events under the assumption that the exchange option was held until

t_{N - 1}

(

f_{t_{N - 1}} = 0

), three of which happen under solvency, and the rest under financial stress. For example, event (3) reports the case of a solvent firm, where the junior bond is exchanged against the pledged shares, resulting in a default. The junior bondholders are paid

s + (1 - w) a - d_{N}^{s}

or s, depending on whether the senior bondholders are fully or partially paid. It could happen that

s + max ((1 - w) a - d_{N}^{s}, 0) < d_{N}^{j}

, in which case junior bondholders are better off holding. For simplicity, we ignore this unlikely event. Event (5) reports the case of a stressed firm, where the exchange option is exercised, resulting in survival. Exercising the embedded option can lead a solvent company to default or a stressed company to survive.

Assuming the model has been solved backward in time from

t_{N}

to

t_{n + 1}

, the potential future values of a generic value function

v_{t_{n}} (a, s, f)

for corporate security are given by the following equations:

{\bar{v}}_{t_{n}} (a, s, 1) = E^{Q} [ρ_{n} v_{t_{n + 1}} (A_{t_{n + 1}}, S_{t_{n + 1}}, 1) | (A_{t_{n}}, S_{t_{n}}, f_{t_{n - 1}}) = (a, s, 1)],

under the condition of exercising before

t_{n}

and, therefore, exercising before

t_{n + 1}

,

\begin{matrix} {\bar{v}}_{t_{n}} (a, s, 0) & = E^{Q} [ρ_{n} v_{t_{n + 1}} (A_{t_{n + 1}}, S_{t_{n + 1}}, 1) | (A_{t_{n}}, S_{t_{n}}, f_{t_{n - 1}}) = (a, s, 0)], \end{matrix}

when holding until

t_{n - 1}

and exercising at

t_{n}

, and

\begin{matrix} {\bar{\bar{v}}}_{t_{n}} (a, s, 0) & = E^{Q} [ρ_{n} v_{t_{n + 1}} (A_{t_{n + 1}}, S_{t_{n + 1}}, 0) | (A_{t_{n}}, S_{t_{n}}, f_{t_{n - 1}}) = (a, s, 0)], \end{matrix}

when holding until

t_{n}

, where

ρ_{n} = e^{- r_{f} (t_{n + 1} - t_{n})}

denotes the risk-free discount factor over

[t_{n}, t_{n + 1}]

. Thus,

\bar{v}

is inferred from Table 6 at

t_{n + 1}

, while

\bar{\bar{v}}

is inferred from Table 7 at

t_{n + 1}

. It is important to note that

{\bar{v}}_{t_{n}} (a, s, 1)

is a function of

(t_{n}, a)

only (Ayadi et al. 2016). As explained in Section 2, DP alternates between an interpolation step and an integration step to solve the model at time

t_{n}

. Table 6 and Table 7 exhibit the value functions of corporate securities at

(t_{n}, a, s, f)

. The rest comes by backward induction. It is important to note that Table 4 and Table 5 are consistent with Table 6 and Table 7, given that the future potentialities of

{TB}_{t_{N}}

,

{BC}_{t_{N}}

,

D_{t_{N}}^{s}

, and

D_{t_{N}}^{j}

are null, while

{\bar{E}}_{t_{N}} = a

and

{\bar{\bar{E}}}_{t_{N}} = a + s

. Section 3 shows that DP is a flexible alternative for designing and evaluating complex financial derivatives.

4. Conclusions

We propose a dynamic programming approach for designing and valuing two-dimensional financial derivatives. Examples include American options and exchangeable bonds. Our dynamic program splits the evaluation process into two components related to the dynamics of the underlying process and the option contract. This results in high flexibility in the model-design step and efficiency in the model-resolution step. We use local polynomials and parallel computing to enhance the efficiency of the overall numerical procedure.

Future research avenues include the extension of our construction to alternative underlying processes and/or higher-dimensional state spaces, possibly by combining dynamic programming with methodologies for model-dimensional reduction. It is worth noting that the computing time of our numerical procedure drastically increases with the dimension of the state space. However, as briefly explained in the introduction, this paper is a step forward in designing and solving dynamic programs in intermediate dimensional state spaces, say one, two, and three. For higher dimensional state spaces, our strategy involves using principal component analysis and reducing the nominal dimension of the state space to a lower effective dimension. This is often relevant in finance, as explained by Wang and Sloan (2005). Thus, option contracts can be valued assuming only a numerical error but not a statistical (sampling) error. The relative efficiency of our dynamic program with respect to traditional multivariate pricing algorithms, such as the LSMC procedure, must be analyzed further.

Author Contributions

Conceptualization, H.B.-A. and B.R.; methodology, H.B.-A. and B.R.; software, M.B.-A. and R.C.; validation, H.B.-A., M.B.-A. and R.C.; formal analysis, H.B.-A., M.B.-A. and R.C.; investigation, M.B.-A. and R.C.; resources, H.B.-A., M.B.-A. and R.C.; writing—original draft preparation, H.B.-A., M.B.-A.; writing—review and editing, H.B.-A., M.B.-A. and R.C.; visualization, M.B.-A. and R.C.; supervision, H.B.-A.; project administration, H.B.-A.; funding acquisition, H.B.-A. All authors have read and agreed to the published version of the manuscript.

Funding

This paper was supported by two research grants received by the second author: Canadian Statistical Sciences Institute (32-153-300-22-R2491) and Natural Sciences and Engineering Research Council (R55).

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Transition Parameters

The transition parameters

T_{k l i j}^{ν μ}

for

ν

and

μ \in (0, 1)

,

k \in {1, \dots, p}

,

l \in {1, \dots, q}

,

i \in {0, \dots, p}

, and

j \in {0, \dots, q}

are calculated as follows:

\begin{matrix} T_{k l i j}^{00} = & E^{Q} [I ((S_{t_{n + 1}}^{1}, S_{t_{n + 1}}^{2}) \in R_{i j}) ∣ (S_{t_{n}}^{1}, S_{t_{n}}^{2}) = (a_{k}, b_{l})] \\ = & Q [(S_{t_{n + 1}}^{1}, S_{t_{n + 1}}^{2}) \in R_{i j} ∣ (S_{t_{n}}^{1}, S_{t_{n}}^{2}) = (a_{k}, b_{l})] \\ = & \int_{x_{k, i}}^{x_{k, i + 1}} \int_{y_{l, j}}^{y_{l, j + 1}} ϕ (z_{1}, z_{2}, ρ) d z_{1} d z_{2} \\ = & Φ (x_{k, i + 1}, y_{l, j + 1}, ρ) - Φ (x_{k, i}, y_{l, j + 1}, ρ) - \\ Φ (x_{k, i + 1}, y_{l, j}, ρ) + Φ (x_{k, i}, y_{l, j}, ρ), \end{matrix}

where

\begin{matrix} x_{k, i} = & (log (a_{i} / a_{k}) - (r - δ_{1} - σ_{1}^{2} / 2) Δ t) / (σ_{1} \sqrt{Δ t}) \\ y_{l, j} = & (log (b_{j} / b_{l}) - (r - δ_{2} - σ_{2}^{2} / 2) Δ t) / (σ_{2} \sqrt{Δ t}) . \end{matrix}

The functions

ϕ (\cdot, \cdot, ρ)

and

Φ (\cdot, \cdot, ρ)

are, respectively, the density and the cumulative density functions of the bivariate standard normal distribution with correlation coefficient

ρ

. The function

Φ (\cdot, \cdot, ρ)

is computed according to Genz (2004).

\begin{matrix} T_{k l i j}^{10} = & E^{Q} [S_{t_{n + 1}}^{1} I ((S_{t_{n + 1}}^{1}, S_{t_{n + 1}}^{2}) \in R_{i j}) ∣ (S_{t_{n}}^{1}, S_{t_{n}}^{2}) = (a_{k}, b_{l})] \\ = & \int_{x_{k, i}}^{x_{k, i + 1}} \int_{y_{l, j}}^{y_{l, j + 1}} a_{k} exp ((r - d_{1} - σ_{1}^{2} / 2) Δ t + σ_{1} \sqrt{Δ t} z_{1}) \times \\ ϕ (z_{1}, z_{2}, ρ) d z_{1} d z_{2} \\ = & w_{k}^{1} \int_{x_{k, i} - σ_{1} \sqrt{Δ t}}^{x_{k, i + 1} - σ_{1} \sqrt{Δ t}} \int_{y_{l, j} - ρ σ_{1} \sqrt{Δ t}}^{y_{l, j + 1} - ρ σ_{1} \sqrt{Δ t}} ϕ (u_{1}, u_{2}, ρ) d u_{1} d u_{2} \\ = & w_{k}^{1} [Φ (x_{k, i + 1} - σ_{1} \sqrt{Δ t}, y_{l, j + 1} - ρ σ_{1} \sqrt{Δ t}, ρ) - \\ Φ (x_{k, i} - σ_{1} \sqrt{Δ t}, y_{l, j + 1} - ρ σ_{1} \sqrt{Δ t}, ρ) - \\ Φ (x_{k, i + 1} - σ_{1} \sqrt{Δ t}, y_{l, j} - ρ σ_{1} \sqrt{Δ t}, ρ) + \\ Φ (x_{k, i} - σ_{1} \sqrt{Δ t}, y_{l, j} - ρ σ_{1} \sqrt{Δ t}, ρ)], \end{matrix}

where

w_{k}^{1} = a_{k} exp ((r - d_{1} - σ_{1}^{2} / 2) Δ t + σ_{1}^{2} Δ t / 2)

.

\begin{matrix} T_{k l i j}^{01} = & E^{Q} [S_{t_{n + 1}}^{2} I ((S_{t_{n + 1}}^{1}, S_{t_{n + 1}}^{2}) \in R_{i j}) ∣ (S_{t_{n}}^{1}, S_{t_{n}}^{2}) = (a_{k}, b_{l})] \\ = & \int_{x_{k, i}}^{x_{k, i + 1}} \int_{y_{l, j}}^{y_{l, j + 1}} b_{l} exp ((r - d_{2} - σ_{2}^{2} / 2) Δ t + σ_{2} \sqrt{Δ t} z_{2}) \times \\ ϕ (z_{1}, z_{2}, ρ) d z_{1} d z_{2} \\ = & w_{l}^{2} \int_{x_{k, i} - ρ σ_{2} \sqrt{Δ t}}^{x_{k, i + 1} - ρ σ_{2} \sqrt{Δ t}} \int_{y_{l, j} - σ_{2} \sqrt{Δ t}}^{y_{l, j + 1} - σ_{2} \sqrt{Δ t}} ϕ (u_{1}, u_{2}, ρ) d u_{1} d u_{2} \\ = & w_{l}^{2} [Φ (x_{k, i + 1} - ρ σ_{2} \sqrt{Δ t}, y_{l, j + 1} - σ_{2} \sqrt{Δ t}, ρ) - \\ Φ (x_{k, i} - ρ σ_{2} \sqrt{Δ t}, y_{l, j + 1} - σ_{2} \sqrt{Δ t}, ρ) - \\ Φ (x_{k, i + 1} - ρ σ_{2} \sqrt{Δ t}, y_{l, j} - σ_{2} \sqrt{Δ t}, ρ) + \\ Φ (x_{k, i} - ρ σ_{2} \sqrt{Δ t}, y_{l, j} - σ_{2} \sqrt{Δ t}, ρ)], \end{matrix}

where

w_{l}^{2} = b_{l} exp ((r - d_{2} - σ_{2}^{2} / 2) Δ t + σ_{2}^{2} Δ t / 2)

.

\begin{matrix} T_{k l i j}^{11} = & E^{Q} [S_{t_{n + 1}}^{1} S_{t_{n + 1}}^{2} I ((S_{t_{n + 1}}^{1}, S_{t_{n + 1}}^{2}) \in R_{i j}) ∣ (S_{t_{n}}^{1}, S_{t_{n}}^{2}) = (a_{k}, b_{l})] \\ = & \int_{x_{k, i}}^{x_{k, i + 1}} \int_{y_{l, j}}^{y_{l, j + 1}} a_{k} exp ((r - d_{1} - σ_{1}^{2} / 2) Δ t + σ_{1} \sqrt{Δ t} z_{1}) \times \\ b_{l} exp ((r - d_{2} - σ_{2}^{2} / 2) Δ t + σ_{2} \sqrt{Δ t} z_{2}) ϕ (z_{1}, z_{2}, ρ) d z_{1} d z_{2} \\ = & w_{k}^{1} w_{l}^{2} exp (ρ σ_{1} σ_{2} Δ t) \times \\ \int_{x_{k, i} - (σ_{1} + ρ σ_{2}) \sqrt{Δ t}}^{x_{k, i + 1} - (σ_{1} + ρ σ_{2}) \sqrt{Δ t}} \int_{y_{l, j} - (ρ σ_{1} + σ_{2}) \sqrt{Δ t}}^{y_{l, j + 1} - (ρ σ_{1} + σ_{2}) \sqrt{Δ t}} ϕ (u_{1}, u_{2}, ρ) d u_{1} d u_{2} \\ = & w_{k}^{1} w_{l}^{2} exp (ρ σ_{1} σ_{2} Δ t) \times \\ [Φ (x_{k, i + 1} - (σ_{1} + ρ σ_{2}) \sqrt{Δ t}, y_{l, j + 1} - (ρ σ_{1} + σ_{2}) \sqrt{Δ t}, ρ) - \\ Φ (x_{k, i} - (σ_{1} + ρ σ_{2}) \sqrt{Δ t}, y_{l, j + 1} - (ρ σ_{1} + σ_{2}) \sqrt{Δ t}, ρ) - \\ Φ (x_{k, i + 1} - (σ_{1} + ρ σ_{2}) \sqrt{Δ t}, y_{l, j} - (ρ σ_{1} + σ_{2}) \sqrt{Δ t}, ρ) + \\ Φ (x_{k, i} - (σ_{1} + ρ σ_{2}) \sqrt{Δ t}, y_{l, j} - (ρ σ_{1} + σ_{2}) \sqrt{Δ t}, ρ)] . \end{matrix}

Appendix B. Parallel Computing

Parallel computing uses multiple central processing units (CPUs) simultaneously to speed up complex computations. For C programming, there are two libraries used for parallel computing: MPI and OpenMP.

The Message Passing Interface (MPI) library allows the computing process to exchange information between the running CPU environments in order to achieve a given job. Each CPU has access to a certain memory space. MPI requires case-sensitive programming changes from the serial code to its parallel version.

Parallel computing can also run when all CPUs share the same memory space. Open Multi-Processing (OpenMP) is a library that allows one to implement parallel computing with minimal change to the serial code. However, shared-memory supercomputers are extremely expensive and, thus, somewhat inaccessible.

MPI and OpenMP are compatible with Fortran and C languages. Parallel computing is also feasible under other software packages, e.g., graphics processing unit (GPU) for Matlab and R.

The easiest way to parallelize DP at step

t_{n}

is to submit the computation tasks associated with a given grid point

(a_{k}, b_{l})

to a single CPU, for

k = 1, \dots, p

and

l = 1, \dots, q

. Our parallel code acts as follows:

This single CPU computes once and locally stores the overall grid points $(a_{i}, b_{j})$ and exercise values $κ (a_{i}, b_{j})$ , for $i = 1, \dots, p$ and $j = 1, \dots, q$ .
Following Equation (4), it computes and locally stores the $4 \times (p + 1) (q + 1)$ transition parameters $T_{k l i j}^{00}$ , $T_{k l i j}^{10}$ , $T_{k l i j}^{01}$ , and $T_{k l i j}^{11}$ , for $i = 0, \dots, p$ and $j = 0, \dots, q$ .
Following Equations (5) and (6), it computes and stores the local coefficients $α_{i j}^{n + 1}$ , $β_{i j}^{n + 1}$ , $γ_{i j}^{n + 1}$ , and $δ_{i j}^{n + 1}$ for $i = 0, \dots, p$ and $j = 0, \dots, q$ at step $n + 1$ .
Following Equations (7) and (8), it computes and stores the option’s holding value ${\tilde{v}}_{t_{n}}^{h} (a_{k}, b_{l})$ at step n, and then the overall value ${\tilde{v}}_{n} (a_{k}, b_{l})$ .
The same CPU exports ${\tilde{v}}_{t_{n}} (a_{k}, b_{l})$ to a selected CPU, the so-called master CPU.
The master CPU collects ${\tilde{v}}_{t_{n}} (a_{k}, b_{l})$ for $k = 1, \dots, p$ and $l = 1, \dots, q$ , and sends them back to all running CPUs.
Go to step 3 and repeat until $n = 0$ .

Since the number of CPUs available to the analyst is usually less than the grid size

p q

, we allocate an equal number of grid points to each CPU. Determining this allocation for each grid size

p q

is a matter of efficiency.

Assume that the same program is run twice with n and

k n

CPUs, where n and

k \in N^{*}

. Let

τ_{1}

and

τ_{2}

be the computing times of the first and second runs, respectively. In the best-case scenario, the expected computing time declines by the same factor k, that is,

E [τ_{2}] = \frac{E [τ_{1}]}{k},

which results in the following relative efficiency ratio:

\frac{E [τ_{1}] / E [τ_{2}]}{k} = 1 .

In fact, this ratio is usually lower than one, since the CPUs exchange information during the computing process, as in steps 5–6, causing the parallel code to behave partially like the serial code, as in step 1. A relative efficiency ratio higher than

75 %

is highly desirable.

We used the supercomputer Briarée, managed by Calcul Québec and Compute Canada, which is equipped with 8064 CPUs (cores). These 8064 cores were divided into 672 computing nodes, each equipped with two six-core processors running at a speed of 2.667 GHz. Thus, each computing node included 12 cores. The number of computing nodes,

\bar{n}

, required for parallel computing must be specified by the programmer (

\bar{n} \leq 672

), resulting in

12 \times \bar{n}

cores. Briarée has a total memory space of 26.72 TB, split between the computing nodes. Given the architecture of Briarée’s hardware (Figure A1), the number of grid points submitted to each core is as follows:

\frac{p q}{12 \times \bar{n}} \in N^{*} .

The code lines were written in C and compiled with GCC. We used the MPI library to access parallel computing:

Figure A1. Briarée’s architecture.

Note

1	The operation of this supercomputer was funded by the Canada Foundation for Innovation (CFI), the ministère de l’Économie, de la Science et de l’Innovation du Québec (MESI), and the Fonds de recherche du Québec—Nature et technologies (FRQ-NT).

References

Andersen, Leif, and Mark Broadie. 2004. Primal-dual simulation algorithm for pricing multidimensional American options. Management Science 50: 1222–34. [Google Scholar] [CrossRef]
Attipoe, David Sena, and Antoine Tambue. 2022. Novel numerical techniques based on mimetic finite difference method for pricing two-dimensional options. Results in Applied Mathematics 13: 100229. [Google Scholar] [CrossRef]
Ayadi, Mohamed A., Hatem Ben-Ameur, and Tarek Fakhfakh. 2016. A dynamic program for valuing corporate securities. European Journal of Operational Research 249: 751–70. [Google Scholar] [CrossRef]
Bally, Vlad, and Gilles Pages. 2003a. A quantization algorithm for solving multidimensional discrete-time optimal stopping problems. Bernoulli 9: 1003–49. [Google Scholar] [CrossRef]
Bally, Vlad, and Gilles Pages. 2003b. Error analysis of the optimal quantization algorithm for obstacle problems. Stochastic Processes and their Applications 106: 1–40. [Google Scholar] [CrossRef]
Bally, Vlad, and Jacques Printems. 2005. A quantization tree method for pricing and hedging multidimensional American options. Mathematical Finance 15: 119–68. [Google Scholar] [CrossRef]
Barber, Brad M. 1993. Exchangeable debt. Financial Management 22: 48–60. [Google Scholar] [CrossRef]
Barraquand, Jérôme, and Didier Martineau. 1995. Numerical valuation of high dimensional multivariate American securities. Journal of Financial and Quantitative Analysis 30: 383–405. [Google Scholar] [CrossRef]
Bayer, Christian, Chiheb Ben Hammouda, and Raúl Tempone. 2022. Numerical smoothing with hierarchical adaptive sparse grids and quasi-Monte Carlo methods for efficient option pricing. Quantitative Finance 23: 209–27. [Google Scholar] [CrossRef]
Ben-Ameur, Hatem, Rim Chérif, and Bruno Rémillard. 2016. American-style options in jump-diffusion models: Estimation and evaluation. Quantitative Finance 16: 1313–24. [Google Scholar] [CrossRef]
Berridge, Steffan John, and Johannes Maria Schumacher. 2008. An irregular grid approach for pricing high-dimensional American options. Journal of Computational and Applied Mathematics 222: 94–111. [Google Scholar] [CrossRef]
Boyle, Phelim P. 1988. A lattice framework for option pricing with two state variables. Journal of Financial and Quantitative Analysis 23: 1–12. [Google Scholar] [CrossRef]
Boyle, Phelim P., Jeremy Evnine, and Stephen Gibbs. 1989. Numerical evaluation of multivariate contingent claims. Review of Financial Studies 2: 241–50. [Google Scholar] [CrossRef]
Boyle, Phelim P., Mark Broadie, and Paul Glasserman. 1997. Monte Carlo methods for security pricing. Journal of Economic Dynamics and Control 21: 1267–321. [Google Scholar] [CrossRef]
Broadie, Marc, and Jérôme Detemple. 1997. The valuation of American options on multiple assets. Mathematical Finance 7: 241–86. [Google Scholar] [CrossRef]
Broadie, Marc, and Paul Glasserman. 1997. Pricing American-style securities using simulation. Journal of Economic Dynamics and Control 21: 1323–52. [Google Scholar] [CrossRef]
Bungartz, Hans-Joachim, Alexander Heinecke, Dirk Pflüger, and Stefanie Schraufstetter. 2012. Option pricing with a direct adaptive sparse grid approach. Journal of Computational and Applied Mathematics 236: 3741–50. [Google Scholar] [CrossRef]
Caldana, Ruggero, Gianluca Fusai, Alessandro Gnoatto, and Martino Grasselli. 2016. General closed-form basket option pricing bounds. Quantitative Finance 16: 535–54. [Google Scholar] [CrossRef]
Carrière, Jacques F. 1996. Valuation of the early-exercise price for options using simulations and nonparametric regression. Insurance: Mathematics and Economics 19: 19–30. [Google Scholar] [CrossRef]
Chen, Yangang, and Justin W. L. Wan. 2021. Deep neural network framework based on backward stochastic differential equations for pricing and hedging American options in high dimensions. Quantitative Finance 21: 45–67. [Google Scholar] [CrossRef]
Dang, Duy-Minh, Qifan Xu, and Shangzhe Wu. 2015. Multilevel dimension reduction Monte Carlo simulation for high-dimensional stochastic models in finance. Procedia Computer Science 51: 1583–92. [Google Scholar] [CrossRef]
Del Moral, Pierre, Bruno Rémillard, and Sylvain Rubenthaler. 2012. Monte Carlo approximations of American options that preserve monotonicity and convexity. In Numerical Methods in Finance. New York: Springer, pp. 115–43. [Google Scholar]
Dockendorf, Jörg, and Dean A. Paxson. 2015. Sequential real rainbow options. The European Journal of Finance 21: 867–92. [Google Scholar] [CrossRef]
Genz, Alan. 2004. Numerical computation of rectangular bivariate and trivariate normal and t probabilities. Statistics and Computing 14: 251–60. [Google Scholar] [CrossRef]
Giles, Michael B. 2015. Multilevel Monte Carlo methods. Acta Numerica 24: 259–328. [Google Scholar] [CrossRef]
Glau, Kathrin, and Linus Wunderlich. 2022. The deep parametric PDE method and applications to option pricing. Applied Mathematics and Computation 432: 127355. [Google Scholar] [CrossRef]
Grimwood, Russell, and Stewart Hodges. 2002. The Valuation of Convertible Bonds: A Study of Alternative Pricing Models. Working Paper. Warwick: Warwick Finance Research Institute. [Google Scholar]
Hanbali, Hamza, and Daniel Linders. 2019. American-type basket option pricing: A simple two-dimensional partial differential equation. Quantitative Finance 19: 1689–704. [Google Scholar] [CrossRef]
Hartley, Peter M. 2000. Pricing a Multi-asset American Option. Working Paper. Bath: University of Bath. [Google Scholar]
Haugh, Martin B., and Leonid Kogan. 2004. Pricing American options: A duality approach. Operations Research 52: 258–70. [Google Scholar] [CrossRef]
Heo, Youngjin, Hyunsoo Han, Hanbyeol Jang, Yongho Choi, and Junseok Kim. 2019. Finite difference method for the two-dimensional Black-Scholes equation with a hybrid boundary condition. Journal of the Korean Society for Industrial and Applied Mathematics 23: 19–30. [Google Scholar]
Ibáñez, Alfredo, and Fernando Zapatero. 2004. Monte Carlo valuation of American options through computation of the optimal exercise frontier. Journal of Financial and Quantitative Analysis 39: 253–75. [Google Scholar] [CrossRef]
Johnson, Herb. 1987. Options on the maximum or the minimum of several assets. Journal of Financial and Quantitative Analysis 22: 277–83. [Google Scholar] [CrossRef]
Jones, E. Philip, and Scott P. Mason. 1986. Equity-linked debt. Midland Corporate Finance Journal 3: 47–57. [Google Scholar]
Jin, Xing, Hwee Huat Tan, and Junhua Sun. 2007. A state-space partitioning method for pricing high-dimensional American-style options. Mathematical Finance 17: 399–426. [Google Scholar] [CrossRef]
Jin, Xing, Xun Li, Hwee Huat Tan, and Zhenyu Wu. 2013. A computationally efficient state-space partitioning approach to pricing high-dimensional American options via dimension reduction. European Journal of Operational Research 231: 362–70. [Google Scholar] [CrossRef]
Kamrad, Bardia, and Peter Ritchken. 1991. Multinomial approximating models for options with k state variables. Management Science 37: 1640–52. [Google Scholar] [CrossRef]
Kargin, Vladislav. 2005. Lattice option pricing by multidimensional interpolation. Mathematical Finance: An International Journal of Mathematics, Statistics and Financial Economics 15: 635–47. [Google Scholar] [CrossRef]
Kim, Junseok, Taekkeun Kim, Jaehyun Jo, Yongho Choi, Seunggyu Lee, Hyeongseok Hwang, Minhyun Yoo, and Darae Jeong. 2016. A practical finite difference method for the three-dimensional Black–Scholes equation. European Journal of Operational Research 252: 183–90. [Google Scholar] [CrossRef]
Kohler, Michael, Adam Krzyżak, and Nebojsa Todorovic. 2010. Pricing of High-Dimensional American Options by Neural Networks. Mathematical Finance: An International Journal of Mathematics, Statistics and Financial Economics 20: 383–410. [Google Scholar] [CrossRef]
Liu, Guangwu, and L. Jeff Hong. 2009. Revisit of stochastic mesh method for pricing American options. Operations Research Letters 37: 411–14. [Google Scholar] [CrossRef][Green Version]
Longstaff, Francis A., and Eduardo S. Schwartz. 2001. Valuing American options by simulation: A simple least-squares approach. The Review of Financial Studies 14: 113–47. [Google Scholar] [CrossRef]
Milovanović, Slobodan, and Lina Von Sydow. 2018. Radial basis function generated finite differences for option pricing problems. Computers & Mathematics with Applications 75: 1462–81. [Google Scholar]
Pettersson, Ulrika, Elisabeth Larsson, Gunnar Marcusson, and Jonas Persson. 2008. Improved radial basis function methods for multi-dimensional option pricing. Journal of Computational and Applied Mathematics 222: 82–93. [Google Scholar] [CrossRef]
Raymar, Steven B., and Michael J. Zwecher. 1997. A Monte Carlo valuation of American call options on the maximum of several stocks. Journal of Derivatives 5: 7–23. [Google Scholar] [CrossRef]
Reisinger, Christoph, and Gabriel Wittum. 2007. Efficient hierarchical approximation of high-dimensional option pricing problems. SIAM Journal on Scientific Computing 29: 440–58. [Google Scholar] [CrossRef]
Rogers, Leonard C. G. 2002. Monte Carlo valuation of American options. Mathematical Finance 12: 271–86. [Google Scholar] [CrossRef]
Stulz, René M. 1982. Options on the minimum or the maximum of two risky assets: Analysis and applications. Journal of Financial Economics 10: 161–85. [Google Scholar] [CrossRef]
Tanaka, Hideyuki. 2014. Higher-order interpolated lattice schemes for multidimensional option pricing problems. Journal of Computational and Applied Mathematics 255: 313–33. [Google Scholar] [CrossRef]
Tilley, James A. 1993. Valuing American options in a path simulation model. Transactions of the Society of Actuaries 45: 83–104. [Google Scholar]
Tompaidis, Stathis, and Chunyu Yang. 2014. Pricing American-Style Options by Monte Carlo Simulation: Alternatives to Ordinary Least Squares. Journal of Computational Finance 18: 121–43. [Google Scholar] [CrossRef]
Tsitsiklis, John N., and Benjamin Van Roy. 1999. Optimal stopping of Markov processes: Hilbert space theory, approximation algorithms, and an application to pricing high-dimensional financial derivatives. IEEE Transactions on Automatic Control 44: 1840–51. [Google Scholar] [CrossRef]
Wang, Xiang, Jessica Li, and Jichun Li. 2023. A Deep Learning Based Numerical PDE Method for Option Pricing. Computational Economics 62: 149–64. [Google Scholar] [CrossRef]
Wang, Xiaoqun, and Ian H. Sloan. 2005. Why are high-dimensional finance problems often of low effective dimension? SIAM Journal on Scientific Computing 27: 159–83. [Google Scholar] [CrossRef]

Figure 1. Optimal policy for a put-on-min option.

Table 1. European put-on-min options—DP vs. Boyle (1988).

	DP with a Grid Size $pq$
	$72^{2}$	$144^{2}$	$300^{2}$	Boyle	Closed form
$K = 35$	1.411	1.392	1.388	1.425	1.387
40	3.837	3.805	3.800	3.778	3.798
45	7.543	7.508	7.501	7.475	7.500
Number of cores	576	1728	1800
Total CPU time	(0.93)	(5.40)	(34.48)
Linear CPU time	(0.65)	(4.11)	(11.99)	10 time steps
$K = 35$	1.504	1.410	1.391	1.392	1.387
40	3.970	3.832	3.805	3.795	3.798
45	7.694	7.537	7.507	7.499	7.500
Number of cores	576	1728	1800
Total CPU time	(2.14)	(10.29)	(67.01)
Linear CPU time	(1.83)	(8.85)	(46.91)	50 time steps

Table 2. American put-on-min option—DP vs. Boyle (1988).

	DP with a Grid Size $pq$
	$72^{2}$	$144^{2}$	$300^{2}$	Boyle
$K = 35$	1.436	1.416	1.413	1.450
40	3.918	3.887	3.881	3.870
45	7.713	7.678	7.671	7.645
Number of cores	576	1728	1800
Total CPU time	(1.01)	(5.79)	(36.63)
Linear CPU time	(0.67)	(4.51)	(13.75)	10 time steps
$K = 35$	1.535	1.440	1.422	1.423
40	4.064	3.926	3.899	3.892
45	7.880	7.727	7.697	7.689
Number of cores	576	1728	1800
Total CPU time	(1.96)	(9.94)	(66.45)
Linear CPU time	(1.71)	(8.46)	(46.43)	50 time steps

Table 3. American put-on-min options—DP vs. Rogers (2002); Jin et al. (2007), and Hartley (2000).

	DP with a Grid Size $pq$
$(S_{0}^{1}$ , $S_{0}^{2})$	$72^{2}$	$144^{2}$	$300^{2}$	Rogers	Jin et al.	Hartley
(80, 80)	37.94	37.42	37.31	[37.35, 37.65]	[37.10, 37.40]	37.30
(80, 100)	32.78	32.20	32.09	[32.12, 32.26]	[31.84, 32.14]	32.08
(80, 120)	29.83	29.26	29.15	[29.18, 29.32]	[28.89, 29.24]	29.14
(100, 100)	25.89	25.20	25.07	[24.93, 25.23]	[24.83, 25.16]	25.06
(100, 120)	21.78	21.07	20.92	[20.89, 21.09]	[20.68, 20.99]	20.91
(120,120)	16.86	16.10	15.94	[15.99, 16.19]	[15.67, 16.00]	15.92
Number of cores	576	1728	1800
Total CPU time	(1.66)	(8.70)	(46.72)	(180)	(24)
Linear CPU time	(1.60)	(8.41)	(43.58)	51 time steps

Table 4. Value functions at

(t_{N}, a, s, 1)

.

Table 4. Value functions at

(t_{N}, a, s, 1)

.

	Survival	Default
BSE	$a > d_{N}^{s} -$ ${tb}_{N}^{s}$	$a \leq d_{N}^{s} -$ ${tb}_{N}^{s}$
$+ a$	a	a
+TB	${tb}_{N}^{s}$	0
−BC	0	$- w a$
=	=	=
$+ D^{s}$	$d_{N}^{s}$	$(1 - w) a$
$+ E$	$a - (d_{N}^{s} -$ ${tb}_{N}^{s})$	0

Table 5. Value functions at

(t_{N}, a, s, 0)

.

Table 5. Value functions at

(t_{N}, a, s, 0)

.

	Solvent company: $a + s > d_{N} -$ ${tb}_{N}$
	Holding	Exercising
	$s \leq P_{N}^{j}$	$s > P_{N}^{j}$
	(1) Survival	(2) Survival	(3) Default
BSE		$a > d_{N}^{s} + C_{N}^{j} -$ ${tb}_{N}$	$a \leq d_{N}^{s} + C_{N}^{j} -$ ${tb}_{N}$
$+ a$	a	a	a
$+ s$	s	s	s
+TB	${tb}_{N}$	${tb}_{N}$	0
−BC	0	0	$- w a$
=	=	=	=
$+ D^{s}$	$d_{N}^{s}$	$d_{N}^{s}$	$min (d_{N}^{s}, (1 - w) a)$
$+ D^{j}$	$d_{N}^{j}$	$s + C_{N}^{j}$	$s + max ((1 - w) a - d_{N}^{s}, 0)$
$+ E$	$a + s - (d_{N} -$ ${tb}_{N})$	$a - (d_{N}^{s} + C_{N}^{j} -$ ${tb}_{N})$	0
	Stressed company: $a + s \leq d_{N} -$ ${tb}_{N}$
	Holding	Exercising
	$(1 - w) a \geq d_{N}^{s}$	$(1 - w) a < d_{N}^{s}$
	(4) Default	(5) Survival	(6) Default
BSE		$a > d_{N}^{s} + C_{N}^{j} -$ ${tb}_{N}$	$a \leq d_{N}^{s} + C_{N}^{j} -$ ${tb}_{N}$
$+ a$	a	a	a
$+ s$	s	s	s
+TB	0	${tb}_{N}$	0
−BC	$- w a$	0	$- w a$
=	=	=	=
$+ D^{s}$	$d_{N}^{s}$	$d_{N}^{s}$	$(1 - w) a$
$+ D^{j}$	$s + (1 - w) a - d_{N}^{s}$	$s + C_{N}^{j}$	s
$+ E$	0	$a - (d_{N}^{s} + C_{N}^{j} -$ ${tb}_{N})$	0

Table 6. Value functions at

(t_{n}, a, s, 1)

.

Table 6. Value functions at

(t_{n}, a, s, 1)

.

	Survival	Default
BSE	$\bar{E} > d_{n}^{s} -$ ${tb}_{n}^{s}$	$\bar{E} \leq d_{n}^{s} -$ ${tb}_{n}^{s}$
$+ a$	a	a
+TB	$\bar{TB} +$ ${tb}_{n}^{s}$	0
−BC	$- \bar{BC}$	$- w a$
=	=	=
$+ D^{s}$	${\bar{D}}^{s} + d_{n}^{s}$	$(1 - w) a$
$+ E$	$\bar{E} - (d_{n}^{s} -$ ${tb}_{n}^{s})$	0

Table 7. Value functions at

(t_{n}, a, s, 0)

.

Table 7. Value functions at

(t_{n}, a, s, 0)

.

	Solvent company: $\bar{\bar{E}} > d_{n} -$ ${tb}_{n}$
	Holding	Exercising
	$s \leq {\bar{\bar{D}}}^{j} + P_{n}^{j}$	$s > {\bar{\bar{D}}}^{j} + P_{n}^{j}$
	Survival	Survival	Default
BSE		$\bar{E} > d_{n}^{s} + C_{n}^{j} -$ ${tb}_{n}$	$\bar{E} \leq d_{n}^{s} + C_{n}^{j} -$ ${tb}_{n}$
$+ a$	a	a	a
$+ s$	s	s	s
+TB	$\bar{\bar{TB}} +$ ${tb}_{n}$	$\bar{TB} +$ ${tb}_{n}$	0
−BC	$- \bar{\bar{BC}}$	$- \bar{BC}$	$- w a$
=	=	=	=
$+ D^{s}$	${\bar{\bar{D}}}^{s} + d_{n}^{s}$	${\bar{D}}^{s} + d_{n}^{s}$	$min ({\bar{D}}^{s} + d_{n}^{s}, (1 - w) a)$
$+ D^{j}$	${\bar{\bar{D}}}^{j} + d_{n}^{j}$	$s + C_{n}^{j}$	$s + max ((1 - w) a - {\bar{D}}^{s} - d_{n}^{s}, 0)$
$+ E$	$\bar{\bar{E}} - (d_{n} -$ ${tb}_{n})$	$\bar{E} - (d_{n}^{s} + C_{n}^{j} -$ ${tb}_{n})$	0
	Stressed company $\bar{\bar{E}} \leq d_{n} -$ ${tb}_{n}$
	Holding	Exercising
	$(1 - w) a \geq {\bar{\bar{D}}}^{s} + d_{n}^{s}$	$(1 - w) a < {\bar{\bar{D}}}^{s} + d_{n}^{s}$
	Default	Survival	Default
BSE		$a > {\bar{D}}^{s} + d_{n}^{s} + C_{n}^{j} -$ ${tb}_{n}$	$a \leq {\bar{D}}^{s} + d_{n}^{s} + C_{n}^{j} -$ ${tb}_{n}$
$+ a$	a	a	a
$+ s$	s	s	s
+TB	0	$\bar{TB} +$ ${tb}_{n}$	0
−BC	$- w a$	$- \bar{BC}$	$- w a$
=	=	=	=
$+ D^{s}$	${\bar{\bar{D}}}^{s} + d_{n}^{s}$	${\bar{D}}^{s} + d_{n}^{s}$	$(1 - w) a$
$+ D^{j}$	$s + (1 - w) a - {\bar{\bar{D}}}^{s} - d_{n}^{s}$	$s + C_{n}^{j}$	s
$+ E$	0	$\bar{E} - (d_{n}^{s} + C_{n}^{j} -$ ${tb}_{n})$	0

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ben-Abdellatif, M.; Ben-Ameur, H.; Chérif, R.; Rémillard, B. Dynamic Programming for Designing and Valuing Two-Dimensional Financial Derivatives. Risks 2024, 12, 183. https://doi.org/10.3390/risks12120183

AMA Style

Ben-Abdellatif M, Ben-Ameur H, Chérif R, Rémillard B. Dynamic Programming for Designing and Valuing Two-Dimensional Financial Derivatives. Risks. 2024; 12(12):183. https://doi.org/10.3390/risks12120183

Chicago/Turabian Style

Ben-Abdellatif, Malek, Hatem Ben-Ameur, Rim Chérif, and Bruno Rémillard. 2024. "Dynamic Programming for Designing and Valuing Two-Dimensional Financial Derivatives" Risks 12, no. 12: 183. https://doi.org/10.3390/risks12120183

APA Style

Ben-Abdellatif, M., Ben-Ameur, H., Chérif, R., & Rémillard, B. (2024). Dynamic Programming for Designing and Valuing Two-Dimensional Financial Derivatives. Risks, 12(12), 183. https://doi.org/10.3390/risks12120183

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Dynamic Programming for Designing and Valuing Two-Dimensional Financial Derivatives

Abstract

1. Introduction

2. Designing and Valuing American Options

3. Designing and Valuing Exchangeable Bonds

4. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Transition Parameters

Appendix B. Parallel Computing

Note

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI