1. Introduction
One fundamental problem faced by all traders is to determine when to sell an asset or financial derivative over a given trading horizon. The optimal trading decision depends crucially on the trader’s subjective belief of the distribution of the asset price in the future and the observed price fluctuations. By monitoring the asset price evolution over time, the trader decides whether to sell the asset or derivative at the prevailing market price, or continue to wait until a later time. In this paper, we tackle this problem by constructing models that reflect the two major features: (i) the trader’s market view on the terminal asset price distribution and the uninformative noise embedded in the asset price dynamics, and (ii) the timing option that gives rise to an optimal stopping problem corresponding to the trader’s asset or derivative liquidation.
In order to describe the asset price evolution, we present a randomized Brownian bridge (rBb) model, whereby the log-price of the asset follows a Brownian bridge with a randomized endpoint representing the random terminal log-price. In turn, the trader’s prior belief and learning mechanism can be encoded in the drift of the log-price process. Our model is adapted from the novel work by
Cartea et al. (
2016).
They model the asset’s mid-price as a randomized Brownian bridge, and solve a trading problem that maximizes the expected trading revenue by placing either limit or market orders while penalizing the running inventory. In comparison, the underlying asset price in our model is an exponential rBb with different randomized endpoints, with examples including discrete, normal, and double exponential distribution. In addition, we introduce an optimal stopping approach to the trader’s liquidation problem, which is applicable not only to selling the underlying asset, but also options written on it.
To solve the trader’s optimal stopping problem, we devise and apply a number of analytical tools. We first define the optimal liquidation premium that represents the additional value from optimally waiting to sell, as opposed to immediate liquidation. Indeed, as soon as this premium vanishes, it is optimal for the trader to sell. On the other hand, if this premium is always strictly positive, then the trader finds it optimal to wait through maturity. Therefore, the optimal liquidation premium provides not only new financial interpretations, but also another avenue of analytical investigation of the optimal stopping problem. Among our results, we identify the conditions under which it is optimal to immediately liquidate, or hold the asset/option position through expiration. Furthermore, we prove that the optimal strategy to liquidate a long-call–short-put position is identical to the optimal strategy to sell the underlying stock under the same trading horizon. This timing parity holds for any distribution of the randomized endpoint. Moreover, we derive the variational inequality associated with the optimal stopping problem, and present a finite-difference method to solve for the optimal trading boundaries.
In the literature, Brownian bridges have been used to represent market uncertainty or uninformative noise (see e.g.,
Brody et al. (
2009),
Hughston and Macrina (
2012), and
Macrina (
2014)). The randomized Brownian bridge model in this paper belongs to the information-based approach to pricing and trading. Among related studies,
Brody et al. (
2008) and
Filipović et al. (
2012) study information-based models where the asset prices are computed via conditional expectation with respect to an information process.
Randomized Brownian bridges, or their variations, have great potential applicability in a number of finanical applications. For example, while futures prices are supposed to be equal to the spot price in theory, it is observed that some commodity futures prices do not exactly converge to the corresponding spot prices upon maturity (see
Guo and Leung (
2017)).
Brennan and Schwartz (
1990) and
Dai et al. (
2011) investigate arbitrage strategies on stock index futures where the index arbitrage basis is assumed to follow a Brownian Bridge. One can also look for more potential applications in index tracking and exchange-traded funds (see
Leung and Santoli (
2016)). The randomized endpoint provides added flexibility in modeling random shocks to the asset price on a future date, which is applicable to events such as Federal Reserve announcements, and earnings surprises (see e.g.,
Johannes and Dubinsky (
2006) and
Leung and Santoli (
2014)).
As for optimal stopping problems involving Brownian bridges,
Ekström and Wanntorp (
2009) consider optimal single stopping of a Brownian bridge or odd powers of a Brownian bridge, without discounting.
Ekström and Vaicenavicius (
2017) use an optimal stopping approach to maximize the expected value of a Brownian bridge with an unknown pinning point. The optimal double problem is further studied in
Baurdoux et al. (
2015), where they maximize the expected spread between the payoffs at the entry and exit times where the underlying follows a Brownian bridge.
In related works on optimal stopping problems for securities trading under a finite horizon setting,
Leung and Ludkovski (
2011,
2012) introduce the concept of delayed purchase premium and analyze the problem of purchasing equity European and American options in an incomplete market, where the investor’s belief on asset price dynamics may differ from prevalent market views.
Leung and Liu (
2012) analyze the delayed purchase premium associated with credit derivatives under a multi-factor intensity-based default risk framework, and derive the optimal trading strategies. In
Leung and Shirai (
2015), the authors study the optimal timing to sell an asset or option subject to a path-dependent risk penalty.
The rest of the paper is structured as follows. In
Section 2, we present the randomized Brownian bridge model for the underlying asset price, show how different prior beliefs are encapsulated in the stochastic differential equation corresponding to the price dynamics, and illustrate the price behaviors through Monte Carlo simulation. In
Section 3, we formulate and analyze the optimal liquidation problem and present numerical results to illustrate the optimal trading strategies. In
Section 4, we summarize the numerical algorithm used for solving the optimal stopping problems under different settings in this paper. A number of proofs are collected in
Section 6.
2. Prior Belief and Price Dynamics
The model consists of a single asset whose positive price process is denoted by
. The trader in our model specifies a prior belief on the future distribution of the stock price at a fixed future time
T. We denote
to be the log-price of the asset, with
being the stock’s initial log-price. The trader’s belief is described by a real-valued random variable
D to be realized at time
T so that the terminal log-price is given by
To avoid arbitrage, we require that D have finite second moment and , where r is positive risk-free interest rate, under the historical probability measure .
As time progresses, new information arrives in the form of changes in asset price. The price fluctuation can also be viewed as noise prior to the realization of the terminal log-price
D. We model the log-price process as a randomized Brownian bridge. To this end, we first let
be a standard Brownian bridge
where
is a standard Brownian motion. The process
starts at 0 and ends at 0. It can be viewed as uninformative noise, which vanishes both at times 0 and
T. We assume that the processes
B and thus
are independent of the random variable
D.
Then, the log-price process is given by
where
is a constant parameter. The trader observes the price evolution of the asset, and the corresponding filtration
is generated by the log-price process
X. In other words, the trader does not directly observe the standard Brownian bridge
over time. Mathematically, this means that
is not adapted to
. The financial interpretation is that
reflects the uninformative noise in the markets such as market sentiments and rumors. Without direct observation of
, the trader is unable to decompose
into
and
D at any time
. At time
T, the trader observes the realization of
D and thus
. The parameter
allows us to control effect of
on the log-price fluctuation. By inspecting Equation (
3), the ratio
can be viewed as the rate at which the value of
D is revealed, going linearly from fully hidden at time 0 to fully revealed at time
T.
By Proposition 1 of
Cartea et al. (
2016), the log-price process satisfies the stochastic differential equation (SDE)
for
, where
is an
-adapted standard Brownian motion under the probability measure
. The drift term is
where
and
is the cumulative distribution function of the random variable
D. We refer to Appendix A of
Cartea et al. (
2016) for the derivation of Equation (
4). The
-Brownian motion
appearing in (
4) can be considered as the market information accessible by the trader. Unlike
, the value of
contains real information relevant to the risky asset price. Equations (
4)–(
6) represent how the price innovations reflect the probability distribution of
D and market information flow. The incomplete information in (
3) is reformulated as a complete information by projecting the log-price innovations onto the observable filtration. In that sense, the information-based approach is more flexible to add additional interpretation and intuition based on observable price process, as we present in the following sessions of the paper.
To understand the conditional expectation in (
6), we start with the definition
where
is the conditional probability density or mass function for the random variable
D defined by
Using Bayes formula, the conditional probability density is given by
where
denotes the probability density or mass function for
D, and
denotes the conditional density function for the random variable
given
. According to (
3), for any fixed
,
given
is Gaussian, i.e.,
Therefore, we can express the conditional probability density for
as
Substituting this expression into (
8), we have
Finally, Equation (
6) follows after we substitute (
9) into (
7).
The asset price is an exponential rBb defined by
. By Ito’s lemma, we obtain the SDE
In this paper, we will consider three different distributions for D: two-point discrete distribution, normal distribution, and double exponential distribution. The first one is discrete while the other two are continuous distributions. Compared to the normal distribution, the double exponential distribution is capable of generating two heavier tails that can be symmetric or asymmetric. Next, let us illustrate the effect of the distribution of D on the asset price dynamics.
Example 1. Suppose that the trader’s prior belief on the future log-price follows a two-point discrete distribution defined bywith mean and variance, respectively, Then, the drift term of in (4) is given bywhere A special case of Example 1 is when
D is a constant
. Then, the log-price process (
4) is simply a Brownian bridge that starts at
and ends at
. Its SDE is given by
Example 2. Suppose that the trader’s prior belief on the future log-price follows a normal distribution with mean μ and variance , i.e., Note that is different from σ in (4). The drift of in (4) is given by As an alternative continuous distribution to the normal distribution, we now consider the double exponential distribution, which has been used to model random jumps in asset prices (see e.g.,
Kou (
2002)).
Example 3. Suppose that the trader’s prior belief on the future log-price is represented by a double exponential random variable with the pdfwhere and . The mean and variance of are, respectively, The drift of in (4) is given bywhereandand is the standard normal cumulative distribution function. We present the computations in Section 6.2. The paths of the log-price process
X can be simulated over discrete times using the standard Euler–Maruyama method. Denote
as the discretization step, and set
, for
. Then, the path of
X, starting at
at time 0, can be simulated iteratively as follows:
where
is a sequence of independent and identically distributed
random variables. This method does not require the simulation of the random variable
D directly because the distributional characteristics of
D are encapsulated in the drift function
(see (
14), (
19) and (
23)). The procedure takes in the current value of
at each time step to compute the drift
, and each path evolves without the knowledge of the Brownian bridge
or the realization of the random variable
D as both
and
D are never simulated. This is consistent with the fact that the trader cannot observe
D prior to
T, and only has knowledge of the log-price process
X over time but not the Brownian bridge
. The paths will end up with the terminal distribution resembling that of
D. In Example 1, under two-point discrete distribution for
D, simulating according to (24) will generate a path ending up at either
or
, each with probability
and
, respectively. For instance, in
Figure 1a, we have
, meaning that about half of the paths will end up at either
or
.
As seen in Examples 1–3, the specification of the terminal log-price distribution directly affects the drift of
. To better observe the different structures of the drift
under different distributions for
D, we plot in
Figure 2 the function
at
(panel (a)) and
(panel (b)) with the three distributions sharing the same mean and variance. Under the normal distribution,
is linear in
x. In contrast, it is neither linear nor monotone under the two-point discrete and double exponential distributions. Moreover, under the two-point distribution,
A is positive when the stock price is low and negative when the stock price is relatively high, meaning that the asset price tends to have positive drift when prices are low and negative drift when prices are high. However, the opposite is observed in
A under the normal and double exponential distributions.
3. Optimal Liquidation Problems
With the asset price dynamics given above, we now consider a trader who holds the underlying asset S or an option written on S, and seeks to maximize the expected value from selling the security. Let be a generic reward function, representing the value received from the security sale at time t at log-price x. We assume a constant interest rate , which is also the discount rate used by the trader.
In order to determine the optimal timing to sell, the trader solves the optimal stopping problem
where
is the set of all stopping times with respect to
taking values between
t and
, with
. Here,
is the trading deadline which can come before the expiration date of the option
T. For all securities considered herein, the associated reward function
is defined through
T.
This problem can be represented in an alternative probabilistic form. To this end, we first define the process
By (
4) and Ito’s formula, we obtain the SDE
where we have denoted
,
,
, and
The function
, called the drive function (see
Leung and Shirai (
2015) for the terminology), determines the sign of the drift of the SDE for the discounted reward process
.
Integrating (27) and substituting in (
26), the value function can be expressed as
Rearranging the terms in (29), we define the difference between the value function
and the reward function
to be the delayed liquidation premium, i.e.,
For every position held, there is an embedded timing option to sell. The delayed liquidation premium quantifies the value of optimally waiting to exercise this timing option. Since for all , this follows from (30) that for all , meaning that the delayed liquidation premium is always positive.
By standard optimal stopping theory (
Karatzas and Shreve 1998, Theorem D.12), the optimal liquidation time, associated with
or
, is given by
In other words, it is optimal for the trader to exercise the timing option and close out the position as soon as the optimal liquidation premium
L vanishes. Accordingly, the trader’s optimal liquidation strategy can be described by the exercise region
and continuation region
, namely,
On the other hand, if the delayed liquidation premium is always strictly positive, then the trader finds it optimal to wait through the end of the trading horizon. In particular, we may have for all x, so we interpret as never exercising at all. Therefore, we can now identify the conditions under which it is optimal to immediately liquidate, or hold the asset/option position through .
Proposition 1. Let be the current time. Then, we have
- 1.
, ,
- 2.
, .
Proof. From (29), if is positive (resp. negative) , then the discounted reward function is maximized at the longest (resp. shortest) stopping time, i.e., (resp. ). ☐
In addition to the two extremal cases, we can also address other cases and solve for the trader’s nontrivial trading strategies. To do so, let us write down the variational inequality associated with the value function
(see (25)) with a general reward function
. First, define the differential operator
The optimal stopping problem
V is solved from the variational inequality:
where
, with the terminal condition
for all
. We refer to (
Leung and Shirai 2015, sct. 7) for a detailed proof for the existence and uniqueness of a strong solution to variational inequalities of this form. A more comprehensive reference is
Bensoussan and Lions (
1982). We will discuss in
Section 4 our numerical scheme to solve for the optimal trading strategies.
Next, we examine the trading strategies for stock and options, and study the varying effects of the trader’s belief encoded in the random variable D. For each security type, we will derive the corresponding drive function. It will then be the inputs for the variational inequality (35), which will be solved numerically for the optimal trading strategies. We consider several combinations in securities and beliefs to see any major differences in the strategies.
3.1. Stocks
For selling the stock
S, the reward function is simply
. Then, the drive function, denoted by
, for selling the stock can be computed via (28). Precisely, we get
for
. As seen in
Figure 2, the function
and thus drive function depend heavily on the prescribed distribution of
D, and may be nonlinear. In general, it is difficult to pinpoint the behavior of the drive function
. Nevertheless, under the normal distribution for
D, we obtain the following properties for
in
x and in
t, respectively.
Proposition 2. Suppose that the trader’s belief on log-price follows a normal distribution as in Example 2. Then, the drive function given in (36) is
(i) downward-sloping and concave in x ifor (ii) downward-sloping and convex in x if (iii) upward-sloping and concave in x if (iv) upward-sloping and convex in x iforwhere
Proposition 3. Suppose that the trader’s belief on log-price follows a normal distribution as in Example 2. Then, the drive function in (36) is
- (i)
downward-sloping and concave in t if - (ii)
downward-sloping and convex in t if - (iii)
upward-sloping and concave in t if - (iv)
upward-sloping and convex in t if
The proof is provided in
Section 6.3. In addition, if
, then the drive function is independent of
t. Under the normal distribution for
D,
is upward-slopping and convex if
, and downward-slopping and concave if
. However, for two-point discrete distribution and double exponential distribution, the properties of the drive function are not explicitly available, but the function can be numerically computed instantly using (36).
Figure 3 presents the optimal trading strategies for selling a stock with beliefs corresponding to the two-point discrete distribution, normal distribution and double exponential distribution. Typically, as seen in panels (a), (b) and (d), we observe a decreasing boundary over time. This means that the trader tends to sell the stock when the stock price is sufficiently high, but is willing to liquidate at a lower price as the trading deadline approaches. However,
Figure 3b shows that the optimal boundary is increasing in time under the two-point discrete distribution for
D (see Example 1). Note that, in
Figure 3b, the trader has a more divergent belief in the sense that the terminal stock price will end up either very high or very low. Under our model, this suggests that the trader will hold the stock if the price is high since the trader believes the price will likely increase further, and the trader will sell it if the price is low because the price is believed to go even lower under the trader’s belief.
Figure 3a,b represent the trader’s two contrasting beliefs. The magnitude of the parameter
guides the trader’s reaction to new price information and ultimately the trading strategy. This highlights that the behavior of the boundary can be significantly changed by the choice of distribution of
D and associated parameters.
In
Figure 4, we illustrate the sensitivities of the optimal strategies for selling stock with respect to a number of parameters under three different distributions for
D. As expected, the boundary increases as the mean (left panels) of
D increases and variance decreases (right panels).
Due to the nonlinearity of the drive functions under certain distributions, the trader may need to adopt more complicated trading strategies. As we have shown in
Figure 5, if the initial stock price is in the continuation region, then the trader does not sell until the price reaches the exercise boundaries. However, the continuation region is sandwiched by two upper and lower disconnected exercise regions. This means that not only when the price is sufficiently high but also sufficiently low, the trader will sell the stock. We also notice another continuation region when the stock price is very low. The intuition surrounding this continuation region is that when the current stock price is so low, the trader speculates that the price will rebound and chooses to hold onto the stock.
Figure 6a suggests a different trading strategy under the double exponential distribution for
D. Since the lower continuation region is relatively narrow, for most starting prices, the trader will find it optimal to sell the stock immediately. It is clear that trader’s prior belief impacts her trading strategies. In
Figure 6b, the parameters
and
are relatively small, and the mean and variance of
D are 0 and variance
(see (
21) and (
22)). Compared to the market view with the same mean but smaller variance of value 0.09 in
Figure 6a, the upper continuation region is smaller (and vanishes for small
t) and lower continuation region becomes larger in
Figure 6b.
If the trader adopts the normal distribution for
D with mean zero and variance
, then she will never update the log-price dynamics because the drift of
X is zero, i.e.,
. Other than this special degenerate case, the trader can take normal priors with different variances. We notice that the continuation/exercise regions are disconnected under the two-point discrete distribution and double exponential distribution in
Figure 5 and
Figure 6, but there is a unique exercise boundary under normal distribution.
3.2. Call and Put Options
Given that the stock price follows SDE (
4) under the historical measure
, the risk-neutral counterpart follows the geometric Brownian motion
where
is a standard Brownian motion under the risk-neutral measure
. Therefore, the no-arbitrage prices of European call and put options are found from the Black–Scholes pricing formulae.
The price of a European call with strike price
K and expiration date
T is given by
for
, where
Substituting the above into (28), we obtain the drive function
where
for the two-point, normal, double exponential cases are respectively computed by (
14), (
19) and (
23).
In
Figure 7a, we illustrate the optimal trading strategies for selling a European call under the normal distribution for
D. As we can see, the trader tends to sell the call option when the stock price is high but is willing to sell at a lower price as time approaches maturity. The green (light) line shows where
, and the continuation region contains
. Recall that
is the integrand in the expression for
. Naturally, if
, it is optimal for the trader to continue to hold the call because she can obtain positive infinitesimal premium by waiting for an infinitesimally small amount of time. The same argument also applies to the cases with a stock and a put option.
Next, we consider the liquidation of a European put option. The Black–Scholes put price is given by
where
K is the strike price,
T is the expiration date, and
and
are given by (39) and (40). Using (28), the corresponding drive function
is
Figure 7b presents the optimal timing strategies for selling an European put under normal distribution. The figure shows that the trader tends to sell the put option when the stock price is low (i.e., when the put option price is high), but is willing to sell at a higher stock price (i.e., lower option price) as time approaches maturity. As a result, the exercise region expands and the continuation region shrinks as time progresses.
By direct calculation using the drive functions (41) and (43) for a call and a put, and comparing with the drive function (36) for the underlying asset, we arrive at the following parity result.
Lemma 1. (Call–Put Parity) Consider a pair of European call and put options with the same underlying S, strike K and maturity T. Under model (4) with the same distribution for D, the associated drive functions satisfyfor all . Proposition 4. Under the same distribution of D, consider European-style call and put options with the same strike and maturity. The optimal strategy to liquidate a long-call–short-put position is identical to the optimal strategy to sell the underlying stock under the same trading horizon.
Proof. The delayed liquidation premium associated with the long-call–short-put position is given by
The last term is delayed liquidation premium associated with selling the stock
S. ☐
As an interesting consequence of Proposition 4, consider a pair of call and put with strike with maturity , and another pair with strike and maturity , with . The respective long-call–short-put positions will yield the same optimal timing strategy. This strategy is identical to that of selling the underlying stock S, which is independent of strike and maturity.
4. Numerical Implementation
We now summarize the numerical scheme used to solve the variational inequality satisfied by the value function for the optimal trading problem (25). A finite difference scheme is used to solve for the optimal liquidation boundaries.
The numerical solution of the system of variational inequalities can be obtained by applying a finite-difference scheme with the use of the Projected-Successive-Over-Relaxation (PSOR) method. The solution of the resulting equations for value functions are solved by the Successive Over Relaxation (SOR) method. We refer to
Wilmott et al. (
1995, chp. 9) for a detailed discussion on the projected SOR method. In each SOR iterative step in finding the numerical approximation of the value functions, we simply take the maximum value between the approximated function value and asset payoff. Then, the variational inequality (35) admits the linear complimentarity form:
We now consider the discretization of the partial differential equation
, over a bounded uniform grid with discretizations in time (
), and space (
, where
and
are the upper and lower bounds on the values of
x on the grid. Applying the Crank–Nicolson method for
x-derivatives and the backward difference for
t-derivatives on the resulting equation leads to the grid equation:
where the coefficients are given by
for
and
. The system to be solved backward in time is
where the right-hand side is
and
This leads to a sequence of stationary complementarity problems. Since the trader can close her position anytime prior to expiry, the value function
must satisfy the constraint
Correspondingly, the discrete scheme can be written as
Hence, at each time step
, we need to solve the linear complementarity problem
In particular, our algorithm enforces the constraint of dominating the payoff function as follows:
The projected SOR method is used to solve the linear system. Notice that the constraint is enforced at the same time as
is calculated in each iteration. Hence, at each time step
j, the PSOR algorithm is to iterate (on
k) the equations
where
k is the iteration counter and
is the overrelaxation parameter. The iterative scheme starts from an initial point
and proceeds until a convergence criterion is met, such as
where
is a tolerance parameter. The optimal boundary is identified by locating the grid points separating the exercise (or sell) region and continuation region, respectively defined by
and
.