Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework

Lauria, Davide; Lindquist, W. Brent; Rachev, Svetlozar T.; Hu, Yuan

doi:10.3390/jrfm18050230

Open AccessArticle

Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework

¹

Department of Economics, Statistics & Finance, University of Calabria, 87036 Calabria, Italy

²

Department of Mathematics & Statistics, Texas Tech University, Lubbock, TX 79409-1042, USA

³

Independent Researcher, Rockville, MD 20852, USA

^*

Author to whom correspondence should be addressed.

^†

Current address: Department of Management, University of Bergamo, 24127 Bergamo, Italy.

J. Risk Financial Manag. 2025, 18(5), 230; https://doi.org/10.3390/jrfm18050230

Submission received: 14 February 2025 / Revised: 22 April 2025 / Accepted: 22 April 2025 / Published: 25 April 2025

(This article belongs to the Special Issue Featured Papers in Mathematics and Finance, 2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

:

We introduce a binary tree for pricing contingent claims when the underlying security prices exhibit history dependence. We apply the model to the specific cases of moving-average and autoregressive behavior that are characteristic of price histories induced by market microstructure behavior. Our model is market-complete and arbitrage-free. When passing to the risk-neutral measure, the model preserves all parameters governing the natural-world price dynamics, including the instantaneous mean of the asset return and the instantaneous probabilities for the direction of asset price movement. This preservation holds for arbitrarily small, but non-zero, time increments characteristic of market microstructure transactions. In the (unrealistic) limit of continuous trading, the model reduces to continuous diffusion price processes, with the concomitant loss of the microstructure information.

Keywords:

binary trees; asset pricing; option pricing; market microstructure; technical analysis

1. Introduction

“At the level of transactions prices, …, the random walk conjecture is …a hypothesis that is very easy to reject in most markets even in small data samples. In microstructure, the question is not ‘whether’ transactions prices differ from a random walk, but rather ‘how much’ and ‘why?”’
(Hasbrouck, 1996)

Dynamic asset pricing theory, as introduced1 by Black and Scholes (1973) and Merton (1973) (BSM), is based on the concepts of no-arbitrage opportunity and replicating portfolios, along with a set of assumptions that can be classified into two groups. The first group of assumptions concerns the microstructure of the market: the rules under which trades are performed; the impact of transaction and timing costs; the role of information and its disclosure; discovery and formation of prices; volatility; liquidity; and market-maker and investor behavior. Under the assumptions of the BSM model, any trade is executed without taxes, transaction costs, and amount restriction (the market is frictionless); traders are price takers with symmetric information (a perfectly competitive market) and are able to trade any amount (no liquidity constraints) over any infinitesimally small time interval (continuous trading); and the market is assumed to be efficient (all relevant information is embedded in the market price), liquid (every order is executed instantaneously at the current equilibrium price), and free of arbitrage opportunity. The second group of assumptions is related to the choice of geometric Brownian motion (GBM) as the stochastic process describing the price dynamics of the security underlying the option contract. The assumption of GBM invokes a strong set of restrictions, including constant volatility, the normal distribution of log returns, and absence of long memory.2

The BSM model provides analytic solutions and an elegant machinery for computing the price of a European option; however, many of the hypotheses upon which it is rooted have been shown to be too restrictive. It is well known that many of the empirical properties3 of stock price returns are not consistent with the assumption of GBM. Consequently, a range of alternative models have been proposed to include various stylized facts. Another problem with the original BSM model is that it does not provide solutions for more complex contingent claims, such as those with a path-dependent pay-off (e.g., American options) or those whose underlying risk is not fully priced in financial markets, leading to market incompleteness. The most baffling result of the BSM model is that the option price does not depend on the drift of the underlying security. This puzzle was clarified in the subsequent work of Cox and Ross (1976) and Merton (1976), and then reformulated in terms of the risk-neutral measure by Harrison and Kreps (1979) and Harrison and Pliska (1981). The concept of the risk-neutral price was subsequently accepted and continuous-time models have proliferated.

Subsequent developments to the solution of the continuous-time pricing problem can be interpreted as improvements in one of two directions. The first is directed to the stochastic process driving the price dynamics in order to incorporate more statistical features of real price processes. This direction has produced the following strong result, known as the general version of the fundamental theorem of asset pricing (Delbaen & Schachermayer, 1994): “if a stochastic price process

S

is a bounded, real-valued semimartingale, there is an equivalent martingale measure (EMM) for

S

if and only if

S

satisfies the condition of “no free lunch with vanishing risk” (NFLVR), where NFLVR is a generalization of the no-arbitrage condition.4^,5 One consequence of this continuous-time fundamental theorem is the necessity to work within the confines of stochastic integration theory. A more critical consequence is that the general version of the fundamental theorem of asset pricing does not guarantee a unique EMM. In fact, there can be uncountably many EMMs, allowing for all possible option pricings within natural bounds. In incomplete markets, where perfect replication is not possible, hedgers seek alternative approaches to mitigate risk in a cost-efficient manner. Several alternative strategies exist, each with their own advantages and drawbacks, but they are still not entirely attractive due to their costs and limitations (Dolinsky & Neufeld, 2018; El Karoui & Quenez, 1995; Karatzas, 1997; Löhne & Rudloff, 2014; Rouge & El Karoui, 2000).

The second direction is to develop methods for solving pricing problems having no known analytic solution, due either to the complexity of the stochastic process or the complexity of the pay-off function associated with a contingent claim. The binomial option pricing model proposed by Cox et al. (1979) (CRR) was the first approach to pricing American options without sacrificing the intellectual machinery developed under the BSM model. CRR utilized a discrete-time, binomial lattice graph to describe the evolution of the price process of the underlying security. The discrete process was designed to converge to GBM as the time interval between two successive trades converged to zero. There was no intention in the CRR model to use the discrete setting to incorporate other stylized facts of asset returns. Other discrete models—utilizing binomial or trinomial lattices, or binary trees—have been developed to numerically price contingent claims under more complex assumptions, such as stochastic volatility or jump processes (see, e.g., Boyle, 1986; Derman et al., 1996; Rubinstein, 1994, 1998). Again, these discrete models have been designed to converge to a solution of a continuous-time stochastic process. This is usually ensured by setting moment-matching conditions in order to apply Donsker’s invariance principle (Billingsley, 2013). Using discrete models avoids working explicitly with stochastic integration theory.6

As noted above, the BSM model is very restrictive with regards to its incorporation of the details of market microstructure.7 In seminal work, Roll (1984) showed that, in an efficient market, the effective bid–ask spread can be measured by

spread = 2 \sqrt{- cov}

, where

cov

is the first-order serial covariance of price changes. Crucially, Roll’s reasoning was based upon analysis of a discrete-time model. We briefly recapitulate Roll’s model to indicate the connection with our work.8 Roll considered a martingale-efficient price

m_{t}

for an asset evolving as

m_{t} = m_{t - 1} + u_{t}

, where

u_{t}

represents independent, identically distributed, mean-zero random variables. A trade (buy or sell) at time t through a dealer results in a transaction price

p_{1} = m_{t} + q_{t} c,

where

q_{t} = - 1

denotes a sale to a dealer, while

q_{t} = 1

denotes a purchase from a dealer. The value

2 c

is the bid–ask spread. Under the buy–sell assumptions

E [q_{t_{1}} q_{t_{2}}] = 0

and

E [u_{t_{1}} q_{t_{2}}] = 0

for all

t_{1} \neq t_{2}

, the variance and first-order covariance of

Δ p_{t} = p_{t} - p_{t - 1}

are

Var [Δ p_{t}] : = γ_{0} = σ_{u}^{2} + 2 c^{2}, cov [Δ p_{t} Δ p_{t - 1}] : = γ_{1} = - c^{2} .

Solving for c produces Roll’s spread formula

c = 2 \sqrt{- γ_{1}}

. Solving for

σ_{u}^{2}

results in

σ_{u}^{2} = γ_{0} + 2 γ_{1}

. The values

γ_{0}

and

γ_{1}

can be estimated from historical transaction prices to obtain estimates for the model parameters c and

σ_{u}

.

In his treatise on market microstructure, Hasbrouck (2007) describes several discrete-time empirical market microstructure models which build upon Roll’s bid–ask model. The models are designed to capture, in various ways, the price formation process, incorporating the sequence of actions and reactions between market makers and traders. Using the binary-tree model developed in this paper, in Section 7 we develop a base model of binary white noise (BWN). Upon this, we build two binary-tree models which capture, successively, moving-average-of-order-one (MA(1)) and autoregressive-of-order-one (AR(1)) risky-asset price behaviors.

The works of Kim et al. (2016, 2019) and Hu et al. (2020a, 2020b) have shown that binomial pricing trees have sufficient flexibility to capture some of the stylized facts of price dynamics for option pricing in complete discrete-time markets (enabling a unique hedging strategy). These include the preservation, from the natural world to the risk-neutral valuation, of the probabilities of the natural-world stock-price directions; the mean and higher moments of returns; and the effects of noisy, informed, and misinformed traders. However, binomial trees are too simplistic to accommodate either the autoregressive or moving-average behavior of asset prices. Our thesis in this paper is that binary pricing trees are crucial for developing dynamic asset pricing models that incorporate such phenomena.

To further clarify the need for binary pricing trees, recall the fundamental pricing model in continuous time for a market consisting of a single bond and stock. The continuous-time bond price dynamics are given by

d β_{t}^{(cts)} = r_{t}^{(cts)} β_{t}^{(cts)} d t, t \in [0, T],

(1)

where

β_{0}^{(cts)} = β_{0} > 0

and

r_{t}^{(cts)}

is a continuous-time riskless rate (Duffie, 2001, p. 102). The stock’s log-price dynamics

L_{t}^{(cts)} = ln (S_{t}^{(cts)})

, with

S_{0}^{(cts)} = S_{0} > 0

, follow a continuous diffusion determined by the Itô process (Aït-Sahalia & Jacod, 2014, Chapter 1):

\begin{matrix} d L_{t}^{(cts)} = μ_{t} d t + σ_{t} d B_{t}, t \in [0, T], \end{matrix}

(2)

where

B_{t}

,

t \in [0, T]

is a standard Brownian motion whose trajectories generate a canonical filtered probability space (Duffie, 2001, Chapter 5 and Appendix E):

(Ω, F^{(cts)} = \{F^{(cts)} = σ (B_{u}, 0 \leq u \leq t)\}, P) .

Inclusion of microstructure features modifies the stock log-price dynamics, which can be written in discrete form as

L_{t_{n}}^{(obs)} = L_{t_{n}}^{(cts)} + ϵ_{t_{n}}^{(micro)}, n = 1, \dots, m - 1,

(3)

where

t_{n} = t_{1}, \dots, t_{m - 1}

indicate the times at which the microstructure features associated with a particular market “actor” (such as a trader) are realized.9 The microstructure dynamics

ϵ_{t_{n}}^{(micro)}

,

n = 1, \dots, m - 1

, are determined by (for example) a moving-average process MA(q) of order

q = 0, 1, \dots

(Mills, 2019, Chapter 3):

ϵ_{t_{n}}^{(micro)} = \sum_{k = 0}^{q} ϕ_{k} ζ_{t_{n - k}}, ϕ_{0} = 1, ϕ_{k} \in R, ϕ_{k} \neq 0, n = 1, \dots, m - 1 .

(4)

Here,

ζ_{t_{n - k}} = 0

when

k > n

, and

ζ_{t_{k}}

,

k = 0, \dots, m - 1

are independent, identically distributed random variables with zero mean and specified variance. As can be seen, for example, as shown by O’Hara (1999, Figure 1), when general microstructure features are included in the observed log-prices, the recombining binomial pricing tree is no longer an appropriate model for the stock-price dynamics and an extension to a binary (i.e., non-recombining) pricing model must be introduced.

The fundamental asset pricing theorem of Delbaen and Schachermayer (1994) requires the ability to trade in continuous time. But market microstructure phenomena occur at discrete times. The resultant observed process (3), being a combination of a semimartingale plus discrete-time microstructure noise, is therefore not a semimartingale, and the fundamental asset pricing theorem cannot be applied. Cheridito (2003) showed that fractional Brownian motion can be used as a price process and still maintain NFLVR by “introducing a minimal amount of time

h > 0

that must lie between two consecutive transactions”. Jarrow et al. (2009) extended this work to show that arbitrage-free price processes can be obtained without reliance on semimartingales provided continuous-time trading is not allowed (although the finite time intervals can be arbitrarily small).

We therefore adopt the discrete-time perspectives of Cheridito and Jarrow10 and present a discrete, binary-tree approach that is general enough to reproduce the statistical properties of real prices and encompass a class of models that are used in microstructure theory (see Easley & O’Hara, 1995, 2003; Hasbrouck, 2007; Fan et al., 2016; Aït-Sahalia & Jacod, 2014, Chapter 2). To this end, we apply the general approach of Hu et al. (2020a, 2020b) to binary-tree option pricing models.11 In Section 2, we develop a discrete binary tree (binary information tree) supporting random walks. In Section 3 and Section 4, we develop discrete-time pricing on this tree for a riskless rate, a riskless bank account, and a risky asset. Using a self-financing portfolio formulation, computation of the risk-neutral measure and pricing of options is discussed in Section 5.

In general, a random walk on a non-recombined binary tree (which is a particular case of an arbitrary branching process) will converge to a measure-valued diffusion (Daley & Vere-Jones, 2003, 2008; Mitov et al., 2009; Skorokhod, 1997). In Section 6, we show that, under the well-known Donsker–Prokhorov invariance principle (for constant instantaneous mean return and variance) or the Davydov–Rotar invariance principle (for time-dependent instantaneous mean return and variance), the restrictions of these invariance principles unfortunately require that a non-recombined random walk approach a classical random walk, which, in the continuum limit, produces price processes such as as GBM (under the Donsker–Prokhorov invariance principle) or continuous diffusion (under the Davydov–Rotar invariance principle), resulting in the concomitant loss of the microstructure information.

The principal contributions of this theoretical paper are the following. We develop a binary-tree framework that supports market-complete, arbitrage-free, option pricing models having path-dependent price processes (Section 2, Section 3, Section 4 and Section 5). We show (Section 6) that the attempt to consider the continuous-time limit of this binary-tree framework is self-defeating, as critical natural world parameters are lost in the continuum limit. We apply (Section 7) this general framework to construct models in which a component driving the random price behavior is described by BWN—mimicking market microstructure. We further incorporate this underlying microstructure into two path models, in which the risky-asset prices display either MA(1) or AR(1) path dependence. In Section 8, we discuss the simplifications inherent in these three models that reduce the complexity of computing option prices. We show that, as a result of partial recombination, the computational complexity of the MA(1) (and consequently of the BWN) model is

O (n^{2})

rather than the

O (2^{n})

expected of non-recombining binomial trees.

As the price-direction probabilities are preserved in the risk-neutral dynamics of our framework, we delve (Section 9) into a technical analysis of the probabilities governing sequences of price changes. We compute sequence probabilities empirically, based upon daily closing prices of an ETF and for stocks comprising the Dow Jones Industrial Average. This analysis provides a test of the efficient market hypothesis for daily data. By categorizing price-change sequences, our results indicate that the efficient market hypothesis operates within categories, but not across categories.

A brief consideration of future directions is provided in Section 10.

2. A Binary Information Tree for Pricing

Binomial (recombining binary) trees are described by simple lattice notation. Each node on the binomial lattice is labeled by a time and price-state index. If the lattice has

m + 1

time steps,

n = 0, 1, \dots, m

, then time step n has

n + 1

price states. Each node p, corresponding to a price state on a binomial tree, has the unique label

p_{n}^{k}

,

n = 0, \dots, m

,

k = 0, \dots, n

. Employing graph theory terminology, there are

(\binom{n}{k})

unique paths from the root to node

p_{n}^{k}

, characterized by the number of unique sequences of length n created using k values of “up” and

n - k

values of “down”. However, the price at node

p_{n}^{k}

depends only on the numbers k and

n - k

and not on the individual sequences. In describing a binomial tree, it is common practice to provide only the single time interval

n = {0, 1}

and refer to the

n = 1

price states as “up” and “down” (along with the formula for computing each).

For binary trees,12 the situation is more complex. Time step n now has

2^{n}

price states (nodes), and one would anticipate that the labeling

p_{n}^{k}

,

n = 0, \dots, m

,

k = 0, \dots, 2^{n} - 1

would be sufficient. However, non-recombining binary trees have the property that there is a unique path between the root (

n = 0

) and each node of the tree. Each path is distinguished by a unique sequence of “up” and “down” movements on the tree, and the price at each node generally depends on that unique sequence. We have chosen a notation that makes path labeling possible through a recorded sequence of local up (1) and down (0) movements. This notation also supports the flexibility for parameters, such as the probabilities governing price direction change, to vary along the tree.

Panel (a) in Figure 1 illustrates our labeling for an

m = 3

binary tree. At time step n, each price state (node) on the tree is labeled with a “level” index

l_{i}

,

i = 1, \dots, 2^{n - 1}

,

n = 1, \dots, m

,

l_{0} = 0

, and a local state index

ϵ_{n}^{(l_{n})} = {1, 0}

,

n = 1, \dots, m

, with

ϵ_{0}^{(0)} = 0

. For

n = 2

(Figure 1a) the price states are grouped into two levels, the lower two price states belonging to level

l_{2} = 1

and the upper two to

l_{2} = 2

. In the

l_{2} = 1

level, the two price states (nodes) are labeled

ϵ_{2}^{(1)} = 0

(“down”) and

ϵ_{2}^{(1)} = 1

(“up”). Analogous labeling occurs for the two

l_{2} = 2

nodes.

Panel (b) in Figure 1 illustrates path labeling for an

m = 3

binary tree. Each level

l_{n}

is reached via a uniquely labeled path through other levels. For example, the

l_{3} = 4

level (which contains two leaf nodes) is reached via the “level sequence”

L_{3} = (l_{0} = 0, l_{1} = 1, l_{2} = 2, l_{3} = 4) : = (0, 1, 2, 4)

. Each node

ϵ_{n}^{(l_{n})}

is reached via a uniquely labeled “node sequence” of local up and down price changes. For example, the node

ϵ_{3}^{(4)} = 0

is reached via the node sequence

M_{3} = (ϵ_{0}^{(0)} = 0, ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1, ϵ_{3}^{(4)} = 0) : = (0, 1, 1, 0)

. We note that, given a specific node sequence

M_{n}

, it is possible to reconstruct its corresponding level sequence

L_{n}

. Specifically, for

M_{n} = (0, m_{1}, m_{2}, \dots, m_{n})

, then

L_{n} = (l_{0}, l_{1}, l_{2}, \dots, l_{n})

, with

\begin{matrix} l_{0} = 0, l_{1} & = 1, l_{2} = {(m_{1})}_{10} + 1, l_{3} = {(m_{1} m_{2})}_{10} + 1, \\ \dots, l_{n} = {(m_{1} m_{2} \dots m_{n - 1})}_{10} + 1, \end{matrix}

(5)

where

{(m_{1} m_{2} \dots m_{k})}_{10}

denotes the decimal value of the binary integer

m_{1} m_{2} \dots m_{k}

.13 Note, however, that the reverse is not possible. Given the node sequence,

L_{n} = (l_{0}, l_{1}, l_{2}, \dots, l_{n})

, it is possible only to compute

m_{1}

through

m_{n - 1}

, leaving

m_{n}

unknown. For simplicity, when employed we will give both the relevant

M_{n}

and

L_{n}

sequences.

With this discussion of the indexing on tree levels, nodes, and paths, we proceed with the development of the dynamics on the tree. We define a discrete-time-filtered probability space (discrete-time stochastic basis)

{ST}^{(d)} = (Ω, F^{(d)}, P)

with the discrete filtration

F^{(d)} = \{F^{(n)}, n \in N_{0}\}

,

N_{0} \overset{def}{=} \{0, 1, \dots\}

, where

F^{(0)} = \{\emptyset, Ω\}

. Define

K_{n} = 2^{n - 1}

. The sigma fields,

F^{(n)} = σ (ϵ_{n}^{(k)}, k = 1, 2, \dots, K_{n}, n \in N)

, are generated by a sequence of dependent binary random variables

ϵ_{n}^{(k)}

,

k = 1, 2, \dots, K_{n}

, such that

P (ϵ_{n}^{(k)} = 1) = 1 - P (ϵ_{n}^{(k)} = 0) \in (0, 1)

. The triangular array

E_{N} = (ϵ_{n}^{(k)}, k = 1, 2, \dots, K_{n}, n \in N)

of binary random variables

ϵ_{n}^{(k)}

is defined on a probability space

(Ω, F, P)

, with probability laws

\begin{matrix} p_{(ϵ_{n}^{(1)}, \dots, ϵ_{n}^{(K_{n})})}^{(m_{n}^{(1)}, \dots, m_{n}^{(K_{n})})} & = P (ϵ_{n}^{(1)} = m_{n}^{(1)}, \dots, ϵ_{n}^{(K_{n})} = m_{n}^{(K_{n})}), \\ m_{n}^{(k)} \in {0, 1}, k = 1, \dots, K_{n}, n \in N, \end{matrix}

(6)

satisfying Kolmogorov’s extension theorem (Oksendal, 2013, Theorem 2.1.5, p. 11).

The probability space

(Ω, F, P)

is a standard probability space; without loss of generality, we can assume it is the Lebesgue probability space

Ω = [0, 1]

,

F = B ([0, 1])

,

P = Leb ([0, 1])

.14 We define the probability law for

\{ϵ_{n}^{(k)}, k = 1, \dots, K_{n}, n \in N\}

sequentially in time, defining the dynamics of

E_{N}

.

For

n = 0

, set

ϵ_{0}^{(0)} = 0

,

E_{0} \overset{def}{=} \{ϵ_{0}^{(0)}\}

, and

F^{(0)} = σ (E_{0}) = {\emptyset, Ω}

.

For

n = 1

, set

E_{1} \overset{def}{=} \{ϵ_{0}^{(0)}, ϵ_{1}^{(1)}\} = \{E_{0}, ϵ_{1}^{(1)}\}

. Then,

F^{(1)} = σ (E_{1})

with

\begin{matrix} P (ϵ_{1}^{(1)} = 1) & = P (ϵ_{1}^{(1)} = 1 |ϵ_{0}^{(0)} = 0) P (ϵ_{0}^{(0)} = 0) \\ P (ϵ_{1}^{(1)} = 0) & = P (ϵ_{1}^{(1)} = 0 |ϵ_{0}^{(0)} = 0) P (ϵ_{0}^{(0)} = 0) \end{matrix}\}, where P (ϵ_{0}^{(0)} = 0) = 1 .

The conditional probabilities satisfy

\begin{matrix} P (ϵ_{1}^{(1)} = 1 |ϵ_{0}^{(0)} = 0) & \in (0, 1), \\ P (ϵ_{1}^{(1)} = 0 |ϵ_{0}^{(0)} = 0) & = 1 - P (ϵ_{1}^{(1)} = 1 |ϵ_{0}^{(0)} = 0) . \end{matrix}

Thus,

\begin{matrix} P (ϵ_{1}^{(1)} = 1) : = p_{1}^{((0, l_{1} = 1), (0, ϵ^{(1)} = 1))} : = p_{1}^{((0, 1), (0, 1))} & \in (0, 1), \\ P (ϵ_{1}^{(1)} = 0) : = p_{1}^{((0, l_{1} = 1), (0, ϵ^{(1)} = 0))} : = p_{1}^{((0, 1), (0, 0))} & = 1 - p_{1}^{((0, 1), (0, 1))} . \end{matrix}

For

n > 1

, the general case is as follows. (For additional clarity, the sequential definitions for

n = 2

and 3 are provided in Appendix A). Set

E_{n} \overset{def}{=} \{E_{n - 1}, (ϵ_{n}^{(1)}, \dots, ϵ_{n}^{(K_{n})})\}

.

E_{n}

is the triangular array of binary random variables

ϵ_{l}^{(k_{l})}

,

l = 1, \dots, n

,

k_{l} = 1, \dots, K_{l}

, with

ϵ_{0}^{(0)} = 0

. Then,

F^{(n)} = σ (E_{n}), n = 0, 1, \dots, F^{(0)} = σ (E_{0}) = {\emptyset, Ω},

(7)

and

\begin{matrix} P (ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})}, 1) & = P (ϵ_{n}^{(l_{n})} = 1 |E_{n - 1} = (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})})) P (E_{n - 1} = (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})})) \\ = P (ϵ_{n}^{(l_{n})} = 1 |E_{n - 1} = (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})})) p_{n - 1}^{((0, l_{1}, \dots, l_{n - 1}), (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{2}^{(l_{n - 1})}))}, \\ P (ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})}, 0) & = P (ϵ_{n}^{(l_{n})} = 0 |E_{n - 1} = (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})})) P (E_{n - 1} = (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})})) \\ = P (ϵ_{n}^{(l_{n})} = 0 |E_{n - 1} = (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})})) p_{n - 1}^{((0, l_{1}, \dots, l_{n - 1}), (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{2}^{(l_{n - 1})}))}, \end{matrix}

(8)

where

l_{1} = 1

and

l_{n} = {(ϵ_{1}^{(l_{1})} \dots ϵ_{n - 1}^{(l_{n - 1})})}_{10} + 1

for

n > 1

. The

E_{n}

-conditional probabilities satisfy

\begin{matrix} P (ϵ_{n}^{(l_{n})} = 1 |E_{n - 1} = (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})})) & \in (0, 1), \\ P (ϵ_{n}^{(l_{n})} = 0 |E_{n - 1} = (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})})) & = 1 - P (ϵ_{n}^{(l_{n})} = 1 |E_{n - 1} = (0, ϵ_{1}^{(1)}, \dots, ϵ_{n - 1}^{(l_{n - 1})})) . \end{matrix}

(9)

The unconditional probabilities satisfy

\begin{matrix} P (ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})}, 1) & : = p_{n}^{((0, l_{1}, \dots, l_{n}), (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})}, 1))} \in (0, p_{n - 1}^{((0, l_{1}, \dots, l_{n - 1}), (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{2}^{(l_{n - 1})}))}), \\ P (ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})}, 0) & : = p_{n}^{((0, l_{1}, \dots, l_{n}), (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})}, 0))} \\ = p_{n - 1}^{((0, l_{1}, \dots, l_{n - 1}), (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{2}^{(l_{n - 1})}))} - p_{n}^{((0, l_{1}, \dots, l_{n}), (0, ϵ_{1}^{(l_{1})}, \dots, ϵ_{n - 1}^{(l_{n - 1})}, 1))} . \end{matrix}

(10)

The level sequences

L_{n} = (0, l_{1}, \dots, l_{j}, \dots, l_{n}), l_{j} = 1, \dots, K_{j}, j = 1, \dots, n,

(11)

together with the discrete filtration

F^{(n)}

and the conditional probabilities (9) define a particular type of binary random tree which we designate as a binary information tree to level n (

{BIT}_{n}

).15 Each node sequence

M_{n} = (0, m_{1}, \dots, m_{j}, \dots, m_{n}), m_{j} = {0, 1}, j = 1, \dots, n,

(12)

defines a unique event

E_{n}^{(L_{n}, M_{n})} = E_{n}^{((0, l_{1}, \dots, l_{j}, \dots, l_{n}), (0, m_{1}, \dots, m_{j}, \dots, m_{n}))}

(13)

on

{BIT}_{n}

, where in (13)

l_{j} = {(m_{0} m_{1} \dots m_{j - 1})}_{10} + 1

,

j = 1, \dots, n

,

m_{0} = 0

. Event

E_{n}^{(L_{n}, M_{n})}

occurs with probability16

P (E_{n}^{(L_{n}, M_{n})}) = p_{n}^{(L_{n}, M_{n})} .

(14)

Panel (c) in Figure 1 provides an illustration of two specific probabilities of the form

p_{3}^{(L_{3}, M_{3})}

.

To develop a random-price time series simulating microstructure timing, for any given

m \in N \overset{def}{=} {1, 2, \dots}

we associate the levels of

{BIT}_{m}

with a sequence of time instances

0 = t_{0} < t_{1} < \dots < t_{m - 1} < t_{m} = T

over the finite period

[0, T]

,

T < \infty

. In our application to option pricing, the current time is

t_{0} = 0

(corresponding to the root event of

{BIT}_{m}

), while

t_{m} = T

is the terminal time (corresponding to the leaf events of

{BIT}_{m}

). Trades of assets occur only at the times

t_{1} < \dots < t_{m - 1}

. These trading instances are fixed and known at time

t_{0}

.17 The time intervals

[0, t_{1})

,

(t_{n}, t_{n + 1})

,

n = 1, \dots, m - 2

, and

(t_{m - 1}, T]

, over which no trades occur, are denoted inter-trade periods.18 We define

Δ t_{n} \overset{def}{=} t_{n} - t_{n - 1}

,

n = 1, \dots ., m

.

Associated with

{BIT}_{m}

, for all

t \in [0, T]

we can recursively define the càdlàg node paths

E_{n, t}^{(L_{n})}

,

n = 0, \dots, m - 1

, as follows:

\begin{matrix} E_{0, t}^{(L_{0})} & = ϵ_{0}^{(0)} = 0 & for t \in [t_{0} = 0, t_{1}), \\ E_{n, t}^{(L_{n})} & = E_{n - 1, t}^{(L_{n - 1})}, ϵ_{n}^{(l_{n})} & for t \in [t_{n}, t_{n + 1}), 1 \leq n \leq m - 2, \\ E_{m - 1, t}^{(L_{m})} & = E_{m - 2, t}^{(L_{m - 2})}, ϵ_{m - 1}^{(l_{m - 1})} & for t \in [t_{m - 1}, t_{m} = T] . \end{matrix}

(15)

We define the market information flow

{IF}_{m; [0, T]}

as

{IF}_{m; [0, T]} = \{E_{n, t}^{(L_{n})}; n = 0, \dots, m - 1, t \in [0, T], L_{n} \in L_{n}\},

(16)

where

L_{n}

is the set of all (

n + 1

)-tuples,

L_{n}

.

{IF}_{m; [0, T]}

generates the

{BIT}_{m}

stochastic basis

{ST}_{(m; [0, T])}^{(d)} = (Ω, F_{m; [0, T]}^{(d)}, P)

on

[0, T]

, where the filtration is defined by

F_{m; [0, T]}^{(d)} = \{F_{0; [0, T]}^{(d)} = \{\emptyset, Ω\}, F_{n; [0, T]}^{(d)} = σ \{E_{n, t}^{(L_{n})}, L_{n} \in L_{n}\}, n = 0, \dots, m - 1\} .

(17)

Given

{BIT}_{m}

, specification of

M_{m} = (0, m_{1}, \dots, m_{n}, m_{n + 1}, \dots, m_{m}) = (M_{n}, m_{n + 1}, \dots, m_{m})

,

n = 0, \dots, m - 1

, defines a unique, nested set of càdlàg event paths

E_{n, t}^{(L_{n}, M_{n})}

,

n = 0, \dots, m - 1

,

t \in [0, T]

, corresponding to the nested node sequences

L_{n} = (0, l_{1}, \dots, l_{n})

, with

l_{j} = {(m_{0}, m_{1} \dots m_{j - 1})}_{10} + 1

,

j = 1, \dots, n

,

m_{0} = 0

, and

ϵ_{1}^{(l_{1})} = m_{1}, \dots, ϵ_{n}^{(l_{n})} = m_{n}

. From (15), this unique set of event paths is

\begin{matrix} E_{0, t}^{(L_{0}, M_{0})} & = 0 & for t \in [t_{0} = 0, t_{1}), \\ E_{1, t}^{(L_{1}, M_{1})} & = 0, m_{1} & for t \in [t_{1}, t_{2}), \\ E_{n, t}^{(L_{n}, M_{n})} & = E_{n - 1, t}^{(L_{n - 1}, M_{n - 1})}, m_{n} & for t \in [t_{n}, t_{n + 1}), 2 \leq n \leq m - 2, \\ E_{m - 1, t}^{(L_{m - 1}, M_{m - 1})} & = E_{m - 2, t}^{(L_{m - 2}, M_{m - 2})}, m_{m - 1} & for t \in [t_{m - 1}, t_{m} = T] . \end{matrix}

(18)

Event

E_{n}^{(L_{n}, M_{n})}

and event path

E_{n, t}^{(L_{n}, M_{n})}

labels on

{BIT}_{4}

are illustrated in Figure 2.

The

E_{n - 1}

-conditional probabilities along path

E_{n, t}^{(L_{n}, M_{n})}

are

P (ϵ_{j}^{(l_{j})} = m_{j} |ϵ_{1}^{(l_{1})} = m_{1}, \dots, ϵ_{j - 1}^{(l_{j - 1})} = m_{j - 1}), j = 1, \dots, n .

(19)

The unconditional probability

p_{n}^{(L_{n}, M_{n})}

for event

E_{n}^{(L_{n}, M_{n})}

on path

E_{n, t}^{(L_{n}, M_{n})}

is determined by the sequence of conditional probabilities (19).

As there is no trade at

t_{m} = T

, there are no new events at time

t_{m}

on

{BIT}_{m}

; the last events occur at

t_{m - 1}

. Similarly, there is no event at

t_{0} = 0

, the first events occur at

t_{1}

. Thus, on

{BIT}_{m}

the events are labeled from

E_{1}^{(L_{1}, M_{1})}

to

E_{m - 1}^{(L_{m - 1}, M_{m - 1})}

. From (13) and (18), we have the event-path equivalence

E_{n}^{(L_{n}, M_{n})} \overset{EP}{\sim} E_{n, t_{n}}^{(L_{n}, M_{n})}, n = 1, \dots, m - 1,

where

\overset{EP}{\sim}

means that event

E_{n}^{(L_{n}, M_{n})}

occurs at time point

t_{n}

on path

E_{n, t}^{(L_{n}, M_{n})}

. To preserve uniformity of notation on

{BIT}_{m}

, we define the pseudo-event

E_{0}^{(L_{0}, M_{0})} = E_{0}^{(0, 0)}

for the root node, leading to the equivalence

E_{0}^{(0, 0)} \overset{EP}{\sim} E_{0, t_{0}}^{(0, 0)} .

As the “leaf” nodes at

t = t_{m}

are path termination points, the terminating pseudo-events will be labeled using path notation

E_{m - 1, t_{m}}^{(L_{m - 1}, M_{m - 1})}

. We shall refer to all events, pseudo or otherwise, simply as events.

For every fixed

m \in N

, the set of all

\sum_{n = 0}^{m - 1} 2^{n}

paths

E_{n, t}^{(L_{n}, M_{n})}

,

n = 0, \dots, m - 1

, defines an

F^{(d)}

-adapted

{BIT}_{m}

, which we denote

{BT}_{m}

. The probabilities

p_{n}^{(L_{n}, M_{n})}

will represent the natural probabilities for the direction of stock movements.

{BT}_{m}

provides the stochastic dynamics of the market information based on the time instances

0 = t_{0} < t_{1} < \dots < t_{m - 1} < t_{m} = T

.

Estimation of Probabilities

In our discrete market setting (Section 3, Section 4 and Section 5),

p_{n}^{(L_{n}, M_{n})}

will represent the probability of the direction of price changes at

t_{n}

. For example, given price

S_{t_{n - 1}}

corresponding to event

E_{n - 1}^{(L_{n - 1}, M_{n - 1})}

, then

p_{n}^{(L_{n}, M_{n})}

, with

L_{n} = (L_{n - 1}, l_{n})

,

M_{n} = (M_{n - 1}, 1)

, and

l_{n} = {(m_{1} \dots m_{n - 1} 1)}_{10} + 1

represents the probability of a price increase

S_{t_{n}} - S_{t_{n - 1}} > 0

, while

p_{n}^{(L_{n}, M_{n})}

with

M_{n} = (M_{n - 1}, 0)

and

l_{n} = {(m_{1} \dots m_{n - 1} 0)}_{10} + 1

represents the probability of a price decrease

S_{t_{n}} - S_{t_{n - 1}} < 0

.

Assuming that a sufficient history (

t < 0

) of stock prices is available at

t = 0

, one can utilize the historical frequency

{\hat{p}}_{1}^{((0, 1), (0, 1); Δ t_{1})}

of positive price changes over trading periods of size

Δ t_{1}

as an estimator for

p_{1}^{((0, 1), (0, 1))}

. For example, if

Δ t_{1} = 5

min, then

{\hat{p}}_{1}^{((0, 1), (0, 1); Δ t_{1})}

is the proportion of positive stock returns observed in a historical sample of 5-min returns.19 For

n > 1

, one can use the historical frequency

{\hat{p}}_{n}^{(L_{n}, M_{n}; Δ t_{1, n})}

as an estimator for

p_{n}^{(L_{n}, M_{n})}

, where

Δ t_{1, n} \overset{def}{=} Δ t_{1}, \dots, Δ t_{n}

. As an example of the computation of

{\hat{p}}_{n}^{(L_{n}, M_{n}; Δ t_{1, n})}

, assume equally spaced time intervals,

Δ t_{1} = \dots . = Δ t_{n} = Δ t

, where

Δ t = 1

day. Consider a data set of

T

historical daily returns partitioned into V non-overlapping time periods, each period consisting of n days. In each period, the succession of signs of the n daily returns is compared to the succession of signs implied by the

M_{n}

indexing of the path

E_{n}^{(L_{n}, M_{n})}

. If these two sign successions agree, the trial period is marked as a “success” (otherwise a “failure”). Thus, the partitioning of the data provides a set of V Bernoulli trials from which to compute

{\hat{p}}_{n}^{(L_{n}, M_{n}; Δ t_{1, n})} = \frac{number of successes in V trials}{V} .

(20)

In Section 9, we empirically investigate this procedure for computing the required probabilities. Our results indicate that a prohibitively extensive history

T = V n

would be required to ensure adequate sampling for values of

n ≳ 6

. In Section 9, we consider the use of bootstrap resampling to provide adequate samples.

3. Dynamics of the Riskless Rate and Bank Account Value on ${BT}_{m}$

We consider first the path-dependent dynamics of the riskless rate

r_{t}^{(d, f)}

and

B_{t}

on

{BT}_{m}

. Corresponding to the discrete times

t_{n}

,

n = 1, \dots, m

, we consider the

F^{(n - 1)}

-measurable riskless rates

r_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d, f)}

and, without loss of generality, set

r_{t_{0}}^{(d, f)} = 0

.20 For any time

t \in [0, T]

, we define the

{ST}_{m; [0, T]}^{(d)}

-adapted riskless rate

r_{t}^{(d, f)}

as follows.

For

t \in [0, t_{1})

,

r_{t}^{(d, f)} = r_{t_{0}}^{(d, f)} = 0

.

For

t \in [t_{1}, t_{2})

,

r_{t}^{(d, f)} = r_{t_{1}; E_{0}^{(0, 0)}}^{(d, f)} \equiv r_{t_{1}; (ϵ_{0}^{(0)} = 0)}^{(d, f)}

.

For

t \in [t_{2}, t_{3})

,

r_{t}^{(d, f)} = \{\begin{matrix} r_{t_{2}; E_{1}^{((0, 1), (0, 1))}}^{(d, f)} \equiv r_{t_{2}; (ϵ_{1}^{(1)} = 1)}^{(d, f)} w . p . p_{1}^{((0, 1), (0, 1))}, \\ r_{t_{2}; E_{1}^{((0, 1), (0, 0))}}^{(d, f)} \equiv r_{t_{2}; (ϵ_{1}^{(1)} = 0)}^{(d, f)} w . p . p_{1}^{((0, 1), (0, 0))} . \end{matrix}

For

t \in [t_{3}, t_{4})

,

r_{t}^{(d, f)} = \{\begin{matrix} r_{t_{3}; E_{2}^{((0, 1, 2), (0, 1, 1))}}^{(d, f)} \equiv r_{t_{3}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1)}^{(d, f)} w . p . p_{2}^{((0, 1, 2), (0, 1, 1))}, \\ r_{t_{3}; E_{2}^{((0, 1, 2), (0, 1, 0))}}^{(d, f)} \equiv r_{t_{3}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0)}^{(d, f)} w . p . p_{2}^{((0, 1, 2), (0, 1, 0))}, \\ r_{t_{3}; E_{2}^{((0, 1, 1), (0, 0, 1))}}^{(d, f)} \equiv r_{t_{3}; (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 1)}^{(d, f)} w . p . p_{2}^{((0, 1, 1), (0, 0, 1))}, \\ r_{t_{3}; E_{2}^{((0, 1, 1), (0, 0, 0))}}^{(d, f)} \equiv r_{t_{3}; (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 0)}^{(d, f)} w . p . p_{2}^{((0, 1, 1), (0, 0, 0))} . \end{matrix}

In general, for path

E_{n - 1, t}^{(L_{n - 1}, M_{n - 1})}

and

t \in [t_{n}, t_{n + 1})

,

n = 2, \dots, m - 1

,

r_{t}^{(d, f)} = r_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d, f)} \equiv r_{t_{n}; (ϵ_{1}^{(l_{1})} = m_{1}, \dots, ϵ_{n - 1}^{(l_{n - 1})} = m_{n - 1})}^{(d, f)} w . p . p_{n - 1}^{(L_{n - 1}, M_{n - 1})},

(21)

with the understanding that, when

n = m - 1

, for any path

E_{m - 2, t}^{(L_{m - 2}, M_{m - 2})}

, (21) holds for the closed time interval

t \in [t_{m - 1}, T]

. Figure 3 illustrates the path dependence of the riskless rates

r_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d, f)}

.

For

t \in [t_{n}, t_{n + 1})

, the riskless rate

r_{t}^{(d, f)}

is

F^{(n - 1)}

-measurable. This definition is consistent with the definition of the riskless rate (short rate) dynamics in continuous time (Duffie, 2001, p. 102). Without loss of generality, we can define the path-dependent, instantaneous riskless rate

r_{t}^{(d, f, inst)} > 0

,

t \in [t_{n}, t_{n + 1})

by the relation

r_{t}^{(d, f)} = r_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d, f)} = r_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d, f, inst)} Δ t_{n} .

(22)

Having determined the dynamics of the riskless rates

r_{t}^{(d, f)}

,

t \in [0, T]

, we now turn our attention to the dynamics of the riskless bank account

B

. The

{ST}_{m; [0, T]}^{(d)}

-adapted bank account price

β_{t}^{(d)}

,

t \in [0, T]

, is defined as follows.

For

t \in [0, t_{1})

,

β_{t}^{(d)} = β_{t_{0}}^{(d)} = β_{0} .

For

t \in [t_{1}, t_{2}),

β_{t}^{(d)} = β_{t_{1}}^{(d)} = β_{0} (1 + r_{t_{1}}^{(d, f)}) .

For

t \in [t_{2}, t_{3}),

β_{t}^{(d)} = β_{t_{2}}^{(d)} = β_{t_{1}}^{(d)} (1 + r_{t_{2}}^{(d, f)}) \{\begin{matrix} β_{t_{2}; (ϵ_{1}^{(1)} = 1)}^{(d)} = β_{t_{1}}^{(d)} (1 + r_{t_{2}; (ϵ_{1}^{(1)} = 1)}^{(d, f)}) w . p . p_{1}^{((0, 1), (0, 1))}, \\ β_{t_{2}; (ϵ_{1}^{(1)} = 0)}^{(d)} = β_{t_{1}}^{(d)} (1 + r_{t_{2}; (ϵ_{1}^{(1)} = 0)}^{(d, f)}) w . p . p_{1}^{((0, 1), (0, 0))} . \end{matrix}

For

t \in [t_{3}, t_{4})

,

β_{t}^{(d)} = β_{t_{3}}^{(d)} = β_{t_{2}}^{(d)} (1 + r_{t_{3}}^{(d, f)}) = \{\begin{matrix} \begin{matrix} β_{t_{3}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1)}^{(d)} = β_{t_{1}}^{(d)} (1 + r_{t_{2}; (ϵ_{1}^{(1)} = 1)}^{(d, f)}) & (1 + r_{t_{3}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1)}^{(d, f)}) \\ w . p . p_{2}^{((0, 1, 2), (0, 1, 1))}, \end{matrix} \\ \begin{matrix} β_{t_{3}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0)}^{(d)} = β_{t_{1}}^{(d)} (1 + r_{t_{2}; (ϵ_{1}^{(1)} = 1)}^{(d, f)}) & (1 + r_{t_{3}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0)}^{(d, f)}) \\ w . p . p_{2}^{((0, 1, 2), (0, 1, 0))}, \end{matrix} \\ \begin{matrix} β_{t_{3}; (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 1)}^{(d)} = β_{t_{1}}^{(d)} (1 + r_{t_{2}; (ϵ_{1}^{(1)} = 0)}^{(d, f)}) & (1 + r_{t_{3}; (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 1)}^{(d, f)}) \\ w . p . p_{2}^{((0, 1, 1), (0, 0, 1))}, \end{matrix} \\ \begin{matrix} β_{t_{3}; (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 0)}^{(d)} = β_{t_{1}}^{(d)} (1 + r_{t_{2}; (ϵ_{1}^{(1)} = 0)}^{(d, f)}) & (1 + r_{t_{3}; (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 0)}^{(d, f)}) \\ w . p . p_{2}^{((0, 1, 1), (0, 0, 0))} . \end{matrix} \end{matrix}

For

t \in [t_{n}, t_{n + 1})

,

n = 2, \dots, m - 1

, given path

E_{n - 1, t}^{(L_{n - 1}, M_{n - 1})}

, the value of the bank account

β_{t}^{(d)} = β_{t_{n}}^{(d)}

,

t \in [t_{n}, t_{n + 1})

is

\begin{matrix} β_{t}^{(d)} = β_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} & = β_{t_{n}; \{ϵ_{1}^{(l_{1})} = m_{1}, \dots, ϵ_{n - 1}^{(l_{n - 1})} = m_{n - 1}\}}^{(d)} \\ = β_{t_{n - 1}; \{ϵ_{1}^{(l_{1})} = m_{1}, \dots, ϵ_{n - 2}^{(l_{n - 2})} = m_{n - 2}\}}^{(d)} [1 + r_{t_{n}; \{ϵ_{1}^{(l_{1})} = m_{1}, \dots, ϵ_{n - 1}^{(l_{n - 1})} = m_{n - 1}\}}^{(d, f)}] \\ = β_{t_{1}}^{(d)} \prod_{k = 2}^{n} [1 + r_{t_{k}; (ϵ_{1}^{(l_{1})} = m_{1}, \dots, ϵ_{k - 1}^{(l_{k - 1})} = m_{k - 1})}^{(d, f)}] w . p . p_{n - 1}^{(L_{n - 1}, M_{n - 1})}, \end{matrix}

(23)

with the understanding that, when

n = m - 1

, (23) holds for the closed time interval

t \in [t_{m - 1}, T]

for any path

E_{m - 2, t}^{(L_{m - 2}, M_{m - 2})}

.

For

[t_{n}, t_{n + 1})

, the bank account value

β_{t}^{(d)}

is

F^{(n - 1)}

-measurable. This definition is consistent with the definition of the riskless asset price dynamics

β_{t}, t \in [0, T]

in continuous time. More precisely, as shown in Section 6, in continuous time the riskless asset price

β_{t}, t \in [0, T]

is defined on the filtered probability space

(Ω, F_{[0, T]} = {F_{t} = σ (B_{u}, 0 \leq u \leq t)}_{t \in [0, T]}, P)

, where

B_{t}

,

t \in [0, T]

, is a standard Brownian motion on

[0, T]

, and its price dynamics are determined by

d β_{t} = r_{t} β_{t} d t

,

β_{0} > 0

, where the short rate

r_{t} \geq 0

is

F_{t}

-measurable with

P ({sup}_{t \in [0, T]} \{r_{t} + 1 / r_{t}\} < \infty) = 1

. Thus,

β_{t}

is instantaneously riskless; in other words,

β_{u} = β_{t}

for

u \in [t, t + d t)

. This definition of the riskless asset pricing in continuous time was the motivation to define the riskless bank account valuation in discrete time by Equation (23).

4. Stock Price Dynamics on ${BT}_{m}$

The price dynamics

S_{t}^{(d)}, t \in [0, T]

of

S

on the stochastic basis

{ST}_{m; t \in [0, T]}^{(d)}

is an

F_{m; t \in [0, T]}^{(d)}

-adapted process. We define

S_{t}^{(d)}

,

t \in [0, T]

on

{ST}_{m; t \in [0, T]}^{(d)}

as follows.

For

t \in [0, t_{1})

,

S_{t}^{(d)} = S_{t_{0}}^{(d)} = S_{0} > 0 .

For

t \in [t_{1}, t_{2}),

S_{t}^{(d)} = S_{t_{1}}^{(d)} = \{\begin{matrix} S_{t_{1}; (ϵ_{1}^{(1)} = 1)}^{(d)} = S_{0} s_{t_{1}; (ϵ_{1}^{(1)} = 1)} w . p . p_{1}^{((0, 1), (0, 1))}, \\ S_{t_{1}; (ϵ_{1}^{(1)} = 0)}^{(d)} = S_{0} s_{t_{1}; (ϵ_{1}^{(1)} = 0)} w . p . p_{1}^{((0, 1), (0, 0))}, \end{matrix}

(24)

for price ratio values

s_{t_{1}; (ϵ_{1}^{(1)})} > 0

. Let

r_{t_{1}}^{(d)} = \frac{S_{t_{1}}^{(d)} - S_{0}}{S_{0}} = \{\begin{matrix} s_{t_{1}; (ϵ_{1}^{(1)} = 1)} - 1 w . p . P (ϵ_{1}^{(1)} = 1), \\ s_{t_{1}; (ϵ_{1}^{(1)} = 0)} - 1 w . p . P (ϵ_{1}^{(1)} = 0), \end{matrix}

(25)

be the discrete (arithmetic) return of the stock at

t_{1}

. We assume that the mean

E [r_{t_{1}}^{(d)}]

and variance

Var [r_{t_{1}}^{(d)}]

of the return,

\begin{matrix} E [r_{t_{1}}^{(d)}] & = μ_{t_{1}}^{(r)} Δ t_{1}, & for some μ_{t_{1}}^{(r)} > r_{t_{1}}^{(d, f, inst)}, \\ Var [r_{t_{1}}^{(d)}] & = {(σ_{t_{1}}^{(r)})}^{2} Δ t_{1}, & for some σ_{t_{1}}^{(r)} > 0, \end{matrix}

(26)

are known. In (26), we interpret

μ_{t_{1}}^{(r)}

as the instantaneous mean return and

{(σ_{t_{1}}^{(r)})}^{2}

as the instantaneous variance at

t_{1}

. The moment conditions (26) imply that the price ratios

s_{t_{1}; (ϵ_{1}^{(1)})}

in (25) are determined by

\begin{matrix} s_{t_{1}; (ϵ_{1}^{(1)} = 1)} & = 1 + μ_{t_{1}}^{(r)} Δ t_{1} + σ_{t_{1}}^{(r)} \sqrt{\frac{P (ϵ_{1}^{(1)} = 0)}{P (ϵ_{1}^{(1)} = 1)} Δ t_{1}}, \\ s_{t_{1}; (ϵ_{1}^{(1)} = 0)} & = 1 + μ_{t_{1}}^{(r)} Δ t_{1} - σ_{t_{1}}^{(r)} \sqrt{\frac{P (ϵ_{1}^{(1)} = 1)}{P (ϵ_{1}^{(1)} = 0)} Δ t_{1}} . \end{matrix}

For

t \in [t_{2}, t_{3})

,

S_{t}^{(d)} = S_{t_{2}}^{(d)} = \{\begin{matrix} S_{t_{2}; E_{2}^{((0, 1, 2), (0, 1, 1))}}^{(d)} = S_{t_{2}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1)}^{(d)} w . p . p_{1}^{((0, 1, 2), (0, 1, 1))}, \\ S_{t_{2}; E_{2}^{((0, 1, 2), (0, 1, 0))}}^{(d)} = S_{t_{2}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0)}^{(d)} w . p . p_{1}^{((0, 1, 2), (0, 1, 0))}, \\ S_{t_{2}; E_{2}^{((0, 1, 1), (0, 0, 1))}}^{(d)} = S_{t_{2}; (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 1)}^{(d)} w . p . p_{1}^{((0, 1, 1), (0, 0, 1))}, \\ S_{t_{2}; E_{2}^{((0, 1, 1), (0, 0, 0))}}^{(d)} = S_{t_{2}; (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 0)}^{(d)} w . p . p_{1}^{((0, 1, 1), (0, 0, 0))} . \end{matrix}

(27)

Note that

S_{t_{2}; (ϵ_{1}^{(1)} = m_{1}, ϵ_{2}^{(2)} = m_{2})}^{(d)} \equiv S_{t_{2}; E_{2}^{((0, l_{1}, l_{2}), (0, m_{1}, m_{2}))}}^{(d)}

denotes the stock price along the

[t_{2}, t_{3})

segment of the path

E_{2, t}^{((0, l_{1}, l_{2}), (0, m_{1}, m_{2}))}

. Figure 4 illustrates the path dependence of the stock prices

S_{t_{n}; E_{n - 1}^{(L_{n}, M_{n})}}^{(d)}

up to

t_{3}

. In contrast,

S_{t_{2}}^{(d)}

denotes the state space of prices over the interval

[t_{2}, t_{3})

. We introduce the conditional notation

S_{t_{2} |(ϵ_{1}^{(1)} = 1)}^{(d)}

to designate the two state prices determined by the condition

ϵ_{1}^{(1)} = 1

, while

S_{t_{2} |(ϵ_{1}^{(1)} = 0)}^{(d)}

designates the two state prices determined by the condition

ϵ_{1}^{(1)} = 0

.

From (27), (24), and (A1),

S_{t_{2}}^{(d)} = \{\begin{matrix} S_{t_{1}; (ϵ_{1}^{(1)} = 1)} s_{t_{2}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1)} w . p . P (ϵ_{2}^{(2)} = 1 | ϵ_{1}^{(1)} = 1) p_{1}^{((0, 1), (0, 1))}, \\ S_{t_{1}; (ϵ_{1}^{(1)} = 1)} s_{t_{2}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0)} w . p . P (ϵ_{2}^{(2)} = 0 | ϵ_{1}^{(1)} = 1) p_{1}^{((0, 1), (0, 1))}, \\ S_{t_{1}; (ϵ_{1}^{(1)} = 0)} s_{t_{2}; (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 1)} w . p . P (ϵ_{2}^{(2)} = 1 | ϵ_{1}^{(1)} = 0) p_{1}^{((0, 1), (0, 0))}, \\ S_{t_{1}; (ϵ_{1}^{(1)} = 0)} s_{t_{2}; (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 0)} w . p . P (ϵ_{2}^{(2)} = 0 | ϵ_{1}^{(1)} = 0) p_{1}^{((0, 1), (0, 0))}, \end{matrix}

(28)

for price ratios

s_{t_{2}; (ϵ_{1}^{(1)}, ϵ_{2}^{(2)})} > 0

. Let

\begin{matrix} r_{t_{2} |(ϵ_{1}^{(1)} = 1)}^{(d)} = \frac{S_{t_{2} |(ϵ_{1}^{(1)} = 1)}^{(d)} - S_{t_{1}; (ϵ_{1}^{(1)} = 1)}^{(d)}}{S_{t_{1}; (ϵ_{1}^{(1)} = 1)}^{(d)}} & = \{\begin{matrix} \frac{S_{t_{1}; (ϵ_{1}^{(1)} = 1)}^{(d)} s_{t_{2}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1)} - S_{t_{1}; (ϵ_{1}^{(1)} = 1)}^{(d)}}{S_{t_{1}; (ϵ_{1}^{(1)} = 1)}^{(d)}} \\ \frac{S_{t_{1}; (ϵ_{1}^{(1)} = 1)}^{(d)} s_{t_{2}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0)} - S_{t_{1}; (ϵ_{1}^{(1)} = 1)}^{(d)}}{S_{t_{1}; (ϵ_{1}^{(1)} = 1)}^{(d)}} \end{matrix} \\ = \{\begin{matrix} s_{t_{2}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1)} - 1 w . p . P (ϵ_{2}^{(2)} = 1 | ϵ_{1}^{(1)} = 1), \\ s_{t_{2}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0)} - 1 w . p . P (ϵ_{2}^{(2)} = 0 | ϵ_{1}^{(1)} = 1), \end{matrix} \end{matrix}

be the conditional arithmetic return at

t_{2}

given that

ϵ_{1}^{(1)} = 1

. We assume that the conditional mean and conditional variance,

\begin{matrix} E [r_{t_{2} |(ϵ_{1}^{(1)} = k)}^{(d)}] & = μ_{t_{2} |(ϵ_{1}^{(1)} = k)}^{(r)} Δ t_{2}, & μ_{t_{2} |(ϵ_{1}^{(1)} = k)}^{(r)} > r_{t_{2}; (ϵ_{1}^{(1)} = k)}^{(d, f, inst)}, \\ Var [r_{t_{2} |(ϵ_{1}^{(1)} = k)}^{(d)}] & = {(σ_{t_{2} |(ϵ_{1}^{(1)} = k)}^{(r)})}^{2} Δ t_{2}, & σ_{t_{2} |(ϵ_{1}^{(1)} = k)}^{(r)} > 0, \end{matrix}

(29)

are known. In (29),

μ_{t_{2} |(ϵ_{1}^{(1)} = k)}^{(r)}

is the instantaneous conditional mean return and

{(σ_{t_{2} |(ϵ_{1}^{(1)} = k)}^{(r)})}^{2}

is the instantaneous conditional variance of the return at

t_{2}

given

ϵ_{1}^{(1)} = k

. The moment conditions (29) imply that, in (28),

\begin{matrix} s_{t_{2}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1)} & = 1 + μ_{t_{2} |(ϵ_{1}^{(1)} = 1)}^{(r)} Δ t_{2} + σ_{t_{2} |(ϵ_{1}^{(1)} = 1)}^{(r)} \sqrt{\frac{P (ϵ_{2}^{(2)} = 0 | ϵ_{1}^{(1)} = 1)}{P (ϵ_{2}^{(2)} = 1 | ϵ_{1}^{(1)} = 1)} Δ t_{2}}, \\ s_{t_{2}; (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0)} & = 1 + μ_{t_{2} |(ϵ_{1}^{(1)} = 1)}^{(r)} Δ t_{2} - σ_{t_{2} |(ϵ_{1}^{(1)} = 1)}^{(r)} \sqrt{\frac{P (ϵ_{2}^{(2)} = 1 | ϵ_{1}^{(1)} = 1)}{P (ϵ_{2}^{(2)} = 0 | ϵ_{1}^{(1)} = 1)} Δ t_{2}}, \\ s_{t_{2}; (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 1)} & = 1 + μ_{t_{2} |(ϵ_{1}^{(1)} = 0)}^{(r)} Δ t_{2} + σ_{t_{2} |(ϵ_{1}^{(1)} = 0)}^{(r)} \sqrt{\frac{P (ϵ_{2}^{(2)} = 0 | ϵ_{1}^{(1)} = 0)}{P (ϵ_{2}^{(2)} = 1 | ϵ_{1}^{(1)} = 0)} Δ t_{2}}, \\ s_{t_{2}; (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 0)} & = 1 + μ_{t_{2} |(ϵ_{1}^{(1)} = 0)}^{(r)} Δ t_{2} - σ_{t_{2} |(ϵ_{1}^{(1)} = 0)}^{(r)} \sqrt{\frac{P (ϵ_{2}^{(2)} = 1 | ϵ_{1}^{(1)} = 0)}{P (ϵ_{2}^{(2)} = 0 | ϵ_{1}^{(1)} = 0)} Δ t_{2}} . \end{matrix}

For

t \in [t_{n}, t_{n + 1})

,

n = 2, \dots, m - 1

, given the path

E_{n - 1, t}^{(L_{n - 1}, M_{n - 1})}

, the stock price is

\begin{matrix} S_{t}^{(d)} = S_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} \\ = \{\begin{matrix} S_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 1)}^{(d)} = S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} s_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 1)}, \\ S_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 0)}^{(d)} = S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} s_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 0)}, \end{matrix} \end{matrix}

(30)

for price ratios

s_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = k)}

,

k = 0, 1

. Again, it is understood that, when

n = m - 1

, for a given path

E_{m - 2, t}^{(L_{m - 2}, M_{m - 2})}

(30) holds for the closed time interval

t \in [t_{m - 1}, T]

. Let

\begin{matrix} r_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} \\ \begin{matrix} = \frac{S_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} - S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}}{S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}} \\ = \{\begin{matrix} \frac{S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} s_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 1)} - S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}}{S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}} : = r_{t_{n}; E_{n}^{(L_{n}, M_{n} = (M_{n - 1}, 1))}}^{(d)} \\ \frac{S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} s_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 0)} - S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}}{S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}} : = r_{t_{n}; E_{n}^{(L_{n}, M_{n} = (M_{n - 1}, 0))}}^{(d)} \end{matrix} \\ = \{\begin{matrix} s_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 1)} - 1 w . p . P (ϵ_{n}^{(l_{n}))} = 1 |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}), \\ s_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 0)} - 1 w . p . P (ϵ_{n}^{(l_{n}))} = 0 |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}), \end{matrix} \end{matrix} \end{matrix}

be the conditional arithmetic return at

t_{n}

given the path

E_{n - 1, t}^{(L_{n - 1}, M_{n - 1})}

.21 We assume the conditional mean and conditional variance of

r_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}

,

\begin{matrix} E [r_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}] & = μ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)} Δ t_{n}, \\ Var [r_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}] & = {(σ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)})}^{2} Δ t_{n}, \end{matrix}

(31)

are known for some

μ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1}))}}^{(r)} > r_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d, f, inst)}, σ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)} > 0 .

Conditions (31) imply that, in (30)

\begin{matrix} s_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 1)} = 1 & + μ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)} Δ t_{n} \\ + σ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)} \sqrt{\frac{P (ϵ_{n}^{(l_{n})} = 0 |E_{n - 1}^{(L_{n - 1}, M_{n - 1})})}{P (ϵ_{n}^{(l_{n})} = 1 |E_{n - 1}^{(L_{n - 1}, M_{n - 1})})} Δ t_{n}}, \\ s_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 0)} = 1 & + μ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)} Δ t_{n} \\ - σ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)} \sqrt{\frac{P (ϵ_{n}^{(l_{n})} = 1 |E_{n - 1}^{(L_{n - 1}, M_{n - 1})})}{P (ϵ_{n}^{(l_{n})} = 0 |E_{n - 1}^{(L_{n - 1}, M_{n - 1})})} Δ t_{n}} . \end{matrix}

(32)

Note the notational equivalences

s_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = m_{n})} \equiv s_{t_{n}; E_{n}^{(L_{n}, M_{n})}}, S_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = m_{n})} \equiv S_{t_{n}; E_{n}^{(L_{n}, M_{n})}} .

The first form in each equality is used when we wish to specify the value of

m_{n}

, the second form refers to the respective price at an arbitratry event in

{BIT}_{m}

.

The riskless rate

r_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d, f)}

, bank account value

β_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}

, stock price

S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)}

, price ratio

s_{t_{n}; E_{n}^{(L_{n}, M_{n})}}

, return

r_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)}

, and the conditional moments

μ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)}

and

σ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)}

, corresponding to the

[t_{n}, t_{n + 1})

segments of each of the two event paths

E_{n, t}^{(L_{n}, (M_{n - 1}, m_{n} = 1))}

and

E_{n, t}^{(L_{n}, (M_{n - 1}, m_{n} = 0))}

, are illustrated in Figure 5. The bank account value, riskless rate, and the two moments are the same for each of the two path segments.

Stock price dynamics in the natural world are determined by (30) and (31), which contain the following sets of model parameters:

$I^{(P)}$ —the probabilities for stock upward direction, $P (ϵ_{n}^{(l_{n})} = 1 |E_{n - 1}^{(L_{n - 1}, M_{n - 1})})$ ;
$I^{(μ)}$ —the conditional means $μ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)}$ ;
$I^{(σ)}$ —the conditional variances $σ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)}$ ;

on each node of the pricing tree. In Section 5, we show that the risk-neutral tree dynamics of the stock preserves

I^{(P)}, I^{(μ)}, I^{(σ)}

. This is important given that the information on

I^{(P)}

and

I^{(μ)}

is lost in passing to the continuous-time limit in the natural world and subsequent use of BSM for risk-neutral valuation. Using a discrete pricing tree rather than a continuous-time pricing model allows us to introduce richer, more flexible models for the price dynamics to accommodate market microstructure features in option pricing.

5. Risk-Neutral Dynamics on ${BT}_{m}$ : Option Pricing

The option

C

has discrete price dynamics

f_{t_{n}}^{(d)} = f (S_{t_{n}}^{(d)}, t_{n})

,

n = 0, \dots, m - 1

, on

{BT}_{m}

for some

f (x, t) \in R

,

x > 0

,

t \in [0, T]

, with terminal time

T = t_{m}

and terminal value

f_{T} = g (S_{T})

for some

g (x) \in R

,

x > 0

.22 At event

E_{n}^{(L_{n}, M_{n})}

, the option has the price

f (S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)}, t_{n}) \equiv f_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)} .

(33)

Consider the replicating portfolio satisfying

f_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)} = D_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)} S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)} + β_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}

(34)

at event

E_{n}^{(L_{n}, M_{n})}

, where

D_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)}

is the delta position. Requiring the usual discrete-time, no-arbitrage conditions, we have

\begin{matrix} \begin{matrix} 0 & = D_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)} S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)} + β_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} - f_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)} \\ = D_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)} S_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 1)}^{(d)} + β_{t_{n + 1}; E_{n}^{(L_{n}, M_{n})}}^{(d)} - f_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 1)}^{(d)} \\ = D_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)} S_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 0)}^{(d)} + β_{t_{n + 1}; E_{n}^{(L_{n}, M_{n})}}^{(d)} - f_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 0)}^{(d)} . \end{matrix} \end{matrix}

(35)

Figure 6 illustrates these path-dependent, no-arbitrage conditions on the binary tree at

E_{n}^{(L_{n}, M_{n})}

.

From the second two equations in (35), the delta position is given by

\begin{matrix} \begin{matrix} D_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)} & = \frac{f_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 1)}^{(d)} - f_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 0)}^{(d)}}{S_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 1)}^{(d)} - S_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 0)}^{(d)}} \\ = [\frac{f_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 1)}^{(d)} - f_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 0)}^{(d)}}{S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)} σ_{t_{n + 1} |E_{n}^{(L_{n}, M_{n})}}^{(r)} \sqrt{Δ t_{n + 1}}}] \end{matrix} \end{matrix}

\times \sqrt{P (ϵ_{n + 1}^{(l_{n + 1})} = 1 | E_{n}^{(L_{n}, M_{n})}) P (ϵ_{n + 1}^{(l_{n + 1})} = 0 | E_{n}^{(L_{n}, M_{n})})},

(36)

where the final equality in (36) is obtained using (30) and (32). From the first equation in (35), and using (36), (23), and (22), we obtain the recurrence relation for the risk-neutral option value at event

E_{n}^{(L_{n}, M_{n})}

:

\begin{matrix} \begin{matrix} f_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)} & = \frac{1}{(1 + r_{t_{n + 1}; E_{n}^{(L_{n}, M_{n})}}^{(d, f, inst)} Δ t_{n + 1})} \\ \times \{q_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 1)}^{(d)} f_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 1)}^{(d)} \end{matrix} \\ + q_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 0)}^{(d)} f_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 0)}^{(d)}\}, \end{matrix}

(37)

where the conditional risk-neutral probabilities are given by

\begin{matrix} q_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 1)}^{(d)} & = P (ϵ_{n + 1}^{(l_{n + 1})} = 1 |E_{n}^{(L_{n}, M_{n})}) \\ - θ_{t_{n + 1} |E_{n}^{(L_{n}, M_{n})}} \sqrt{P (ϵ_{n + 1}^{(l_{n + 1})} = 0 |E_{n}^{(L_{n}, M_{n})}) P (ϵ_{n + 1}^{(l_{n + 1})} = 1 |E_{n}^{(L_{n}, M_{n})}) Δ t_{n + 1}}, \\ q_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 0)}^{(d)} & = 1 - q_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{l_{n + 1})} = 1)}^{(d)} \\ = P (ϵ_{n + 1}^{(l_{n + 1})} = 0 |E_{n}^{(L_{n}, M_{n})}) \\ + θ_{t_{n + 1} |E_{n}^{(L_{n}, M_{n})}} \sqrt{P (ϵ_{n + 1}^{(l_{n + 1})} = 0 |E_{n}^{(L_{n}, M_{n})}) P (ϵ_{n + 1}^{(l_{n + 1})} = 1 |E_{n}^{(L_{n}, M_{n})}) Δ t_{n + 1}}, \end{matrix}

(38)

with

θ_{t_{n + 1} |E_{n}^{(L_{n}, M_{n})}} = \frac{μ_{t_{n + 1} |E_{n}^{(L_{n}, M_{n})}}^{(r)} - r_{t_{n + 1}; E_{n}^{(L_{n}, M_{n})}}^{(d, f, inst)}}{σ_{t_{n + 1} |E_{n}^{(L_{n}, M_{n})}}^{(r)}}

being the market price of risk. Equations (34) through (38) hold for all events

E_{n}^{(L_{n}, M_{n})}

on

{BT}_{m}

when

n = 0, \dots, m - 2

. As

S_{t_{m - 1}; E_{m - 1}^{(L_{m - 1}, M_{m - 1})}}^{(d)}

has a constant value

S_{T; E_{m - 1}^{(L_{m - 1}, M_{m - 1})}}

over the time interval

[t_{m - 1}, t_{m} = T]

, the terminal value

g (S_{T; E_{m - 1}^{(L_{m - 1}, M_{m - 1})}})

determines each option price

f_{t_{m - 1}; E_{m - 1}^{(L_{m - 1}, M_{m - 1})}}^{(d)}

at the events

E_{m - 1}^{(L_{m - 1}, M_{m - 1})}

. These values provide the “initial conditions” for the recurrence relation (37).

Note that (36)–(38) each have an overall form that is familiar from binomial tree models. This should not be too surprising as a binomial tree is a particular form of a binary tree. The path dependence in our model therefore arises not through a change in form of these equations but from the fact that the variables (prices, probabilities, conditional moments) are explicitly path-dependent.

We further note that the natural-world conditional probabilities, and the mean and variance of the return dynamics, of

S

are retained in the risk-neutral price dynamics of

C

. This represents the tremendous advantage of discrete binary option pricing over its limiting continuous-time model; under the latter, information about the probabilities and the mean of the return process is lost (Section 6).

The extension to the pricing of an American option follows from the classical approach for valuation of an American option on a binomial tree (see, e.g., Hull, 2006, Section 12.5). The market value of the American option at event

E_{n}^{(L_{n}, M_{n})}

is given by

f_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d; American)} = max (f_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)}, S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d; exercise)}),

(39)

where

S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d; exercise)}

is the exercise value of the stock, which is known at event

E_{n}^{(L_{n}, M_{n})}

. We emphasize that

S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d; exercise)}

is the market value of the stock and not its fair-holding value

S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d; fair-holding)}

. The risk-neutral tree stock dynamics

S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d; fair-holding)}

,

n = 1, \dots, m

, are determined via the risk-neutral recursion (37),

\begin{matrix} \begin{matrix} S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d; fair-holding)} & = \frac{1}{(1 + r_{t_{n + 1}; E_{n}^{(L_{n}, M_{n})}}^{(d, f, inst)} Δ t_{n + 1})} \\ \times \{q_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 1)}^{(d)} S_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 1)}^{(d; fair-holding)} \end{matrix} \\ + q_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 0)}^{(d)} S_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 0)}^{(d; fair-holding)}\}, \end{matrix}

(40)

n = 0, \dots, m - 2

, based on the terminal value23

S_{T}^{(d; fair-holding)} = S_{t_{m - 1}; E_{m - 1}^{(L_{m - 1}, M_{m - 1})}}^{(d; fair-holding)} = S_{T; E_{m - 1}^{(L_{m - 1}, M_{m - 1})}}^{(d)} .

In contrast to (39), Breen (1991) uses

S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d; fair-holding)}

to define the fair value of the American option at the node

E_{n}^{(L_{n}, M_{n})}

,

f_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d; American; fair)} = max (f_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)}, S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d; fair-holding)}) .

(41)

A trader in search of statistical arbitrage opportunities relative to an American option could compare (39) with (41) when seeking potential mispricing in the market value of the option.

6. Limiting Dynamics of Binary Pricing Trees

In this section, we investigate the limiting dynamics of the binary-tree pricing by assuming that the lengths of trading intervals uniformly vanish at a rate

N^{- 1}

as

N \to \infty

. To support this, we generalize our notation for the trading times as follows. For any given

N \in N = \{1, 2, \dots\}

, we consider the fixed time instances

0 = t_{0, N} < t_{1, N} < \dots < t_{n_{N} - 1, N} < t_{n_{N}, N} = T < \infty

. The current time is

t_{0, N} = 0

, the terminal time is

t_{n_{N}, N} = T

and, as previously, trades of

S

and

B

occur only at the times

t_{1, N} < \dots < t_{n_{N} - 1, N}

. The time intervals are denoted

Δ t_{n, N} = t_{n, N} - t_{n - 1, N}

,

n = 1, \dots, n_{N}

. It is straightforward to adapt the results of Section 2, Section 3, Section 4 and Section 5 to this notational change for t. The resultant

F^{(d)}

-adapted binary information tree is now denoted

{BT}_{n_{N}}

. We impose the restriction

Δ_{N} = max {Δ t_{n, N}, n = 1, \dots, n_{N}} = O (\frac{1}{N}) .

(42)

To determine the continuum limit behavior, we apply the Donsker–Prokhorov invariance principle (DPIP) for continuous diffusions.24 To apply the DPIP we assume that, for each

n = 1, \dots, n_{N}

, the random variables

ϵ_{n}^{(1)}, \dots, ϵ_{n}^{(K_{n})}

determining the probabilities (14) and (19) are independent. Therefore, the probabilistic structure of the triangular array

E_{N} = (ϵ_{n}^{(k)}, k = 1, \dots, K_{n}, n \in N)

is determined by the probability laws,

p_{(ϵ_{n}^{(1)}, \dots, ϵ_{n}^{(K_{n})})}^{(m_{n}^{(1)}, \dots, m_{n}^{(K_{n})})} = P (ϵ_{n}^{(1)} = m_{n}^{(1)}, \dots, ϵ_{n}^{(K_{n})} = m_{n}^{(K_{n})}) = \prod_{k = 1}^{K_{n}} P (ϵ_{n}^{(k)} = m_{n}^{(k)}),

for

m_{n}^{(k)} \in \{0, 1\}

,

k = 1, \dots, K_{n}

,

n \in N

. This assumption of independence results in simplified expressions for (14) and (19):

\begin{matrix} \begin{matrix} P (ϵ_{n}^{(k)} = 1) & \overset{def}{=} p_{n}^{(k; 1)}, \\ P (ϵ_{n}^{(k)} = 0) & \overset{def}{=} p_{n}^{(k; 0)} = 1 - p_{n}^{(k; 1)}, \end{matrix}\} k = 1, \dots, K_{n}, \end{matrix}

(43)

\begin{matrix} P (E_{n}^{(L_{n}, M_{n})}) = p_{n}^{(L_{n}, M_{n})} = \prod_{k = 1}^{n} P (ϵ_{k}^{(l_{k})} = m_{k}) = \prod_{k = 1}^{n} p_{k}^{(l_{k}; m_{k})}, \end{matrix}

(44)

\begin{matrix} P (ϵ_{j}^{(l_{j})} = m_{j} |(ϵ_{1}^{(l_{1})} = m_{1}, \dots, ϵ_{j - 1}^{(l_{j - 1})} = m_{j - 1})) = P (ϵ_{j}^{(l_{j})} = m_{j}) = p_{j}^{(l_{j}; m_{j})} . \end{matrix}

(45)

As

ϵ_{n}^{(k)} \overset{d}{=} Bernoulli (p_{n}^{(k; 1)})

, then

δ_{n}^{(k)} \overset{def}{=} \frac{ϵ_{n}^{(k)} - p_{n}^{(k; 1)}}{\sqrt{p_{n}^{(k; 0)}}}, k = 1, \dots, K_{n}, n = 1, \dots, n_{N},

has

E [δ_{n}^{(k)}] = 0

and

Var [δ_{n}^{(k)}] = 1

. We now view the filtration (7) as generated by the triangular series

δ_{n}^{(1)}, \dots, δ_{n}^{(n)}, n \in N

; that is,

F^{(n)} = σ (δ_{1}^{(1)}, (δ_{2}^{(1)}, δ_{2}^{(2)}), \dots, (δ_{n}^{(1)}, \dots, δ_{n}^{(n)})), n = 1, \dots, n_{N} .

Next, for a given sequence

L = (l_{1}, \dots, l_{n}, \dots)

,

l_{n} = 1, \dots, n

,

n = 1, \dots, n_{N}

, we consider the random walk

δ_{n}^{(L)} = \sum_{k = 1}^{n} δ_{k}^{(k)}

. By the DPIP, the sequence of

D [0, \infty)

processes

B_{n}^{(L; [0, \infty))} = \{B_{t; n}^{L} = \frac{δ_{⌊ n t ⌋}^{(L)}}{\sqrt{n}}, t \geq 0\}

converges to a standard Brownian motion

B_{[0, \infty)} = {B_{t}, t \geq 0}

in the Skorokhod J1-topology.25 By denoting the canonical filtration

F^{B_{[0, \infty)}} = {σ (B_{u}, 0 \leq u \leq t), t \geq 0}

, we can assume, by the Skorokhod embedding theorem (see, e.g., Kallenberg, 1997, Chapter 14) that

F^{(d)} \subset F^{B_{[0, \infty)}}

and the triangular series

ϵ_{1}^{(1)}, (ϵ_{2}^{(1)}, ϵ_{2}^{(2)}), \dots, (ϵ_{2}^{(1)}, \dots, ϵ_{n}^{(n)}), \dots

are in the same stochastic basis space

(Ω, F^{B_{[0, \infty)}}, P)

.

For the time interval

[0, T]

, we now consider the limiting behavior of the discrete riskless rates

r_{t_{n, N}}^{(d, f)} = r_{t_{n, N}}^{(d, f, inst)} Δ t_{n, N}

in (22). We assume that the discrete instantaneous riskless rate process

\begin{matrix} r_{[0, T]; N}^{(d, f, inst)} = \{r_{t, N}^{(d, f, inst)} = r_{t_{n - 1, N}}^{(d, f, inst)}, t \in [t_{n - 1, N}, t_{n, N}), n = 1, \dots, n_{N}, \\ r_{T, N}^{(d, f, inst)} = r_{t_{n_{N}, N}}^{(d, f, inst)}\} \end{matrix}

converges uniformly to a continuous-time instantaneous riskless rate

r_{[0, T]}^{(f)} = \{r_{t}^{(f)}, t \in [0, T]\}

, where26

$r_{[0, T]}^{(f)}$ has strictly positive continuous trajectories on $[0, T]$ ;
$sup \{| r_{t}^{(f)} - r_{t, N}^{(d, f, inst)} |, t \in [0, T]\} = O (\frac{1}{N})$ .

Then, the discrete bank account value

β_{t, N}^{(d)}

,

t \in [0, T]

in (23) converges uniformly to the continuous-time riskless asset dynamics

β_{t} = β_{0} e^{\int_{0}^{t} r_{u}^{(f)} d u}, t \in [0, T],

where the deterministic instantaneous riskless-rate (short-rate) process

r_{[0, T]}^{(f)}

is

F^{B_{[0, T]}}

-adapted.

Consider the discrete mean and volatility processes for

S

:

\begin{matrix} μ_{[0, T]; N} & = \{\begin{matrix} μ_{t, N} = μ_{t_{n, N} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}, t \in [t_{n, N}, t_{n + 1, N}), n = 1, \dots, n_{N} - 1, \\ μ_{T, N} = μ_{t_{n_{N} - 1, N} |E_{n_{N} - 2}^{(L_{n_{N} - 2}, M_{n_{N} - 2})}}, \end{matrix} \\ σ_{[0, T]; N} & = \{\begin{matrix} σ_{t, N} = σ_{t_{n, N} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}, t \in [t_{n, N}, t_{n + 1, N}), n = 1, \dots, n_{N} - 1, \\ σ_{T; N} = σ_{t_{n_{N} - 1, N} |E_{n_{N} - 2}^{(L_{n_{N} - 2}, M_{n_{N} - 2})}} . \end{matrix} \end{matrix}

Assume that

μ_{[0, T], N}

and

σ_{[0, T], N}

converge uniformly on

[0, T]

to

μ_{t}

and

σ_{t}

, respectively, such that

sup {| μ_{t, N} - μ_{t} | + | σ_{t, N} - σ_{t} |, t \in [0, T]} = O (\frac{1}{N}),

Further, assume that

μ_{t}

and

σ_{t}

are

F^{B_{[0, T]}}

-adapted, with

μ_{[0, T]} = {μ_{t}, t \in [0, T]}

and

σ_{[0, T]} = {σ_{t}, t \in [0, T]}

having continuous trajectories on

[0, T]

.27 We define the

D [0, T]

price process

S_{[0, T]; N} = \{\begin{matrix} S_{t, N} = S_{t_{n, N}}^{(d)} = S_{t_{n, N}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}, t \in [t_{n, N}, t_{n + 1, N}), n = 1, \dots, n_{N} - 2, \\ S_{t, N} = S_{t_{n_{N} - 1, N}}^{(d)} = S_{t_{n_{N} - 1, N}; E_{n_{N} - 1}^{(L_{n_{N} - 1}, M_{n_{N} - 1})}}^{(d)}, t \in [t_{n_{N} - 1, N}, t_{n_{N}, N} = T] . \end{matrix}

(46)

As in Hu et al. (2020a), a non-standard invariance principle (Davydov & Rotar, 2008) can be used to show that (46) converges weakly in

D [0, T]

topology (Skorokhod, 2005) to a process

S_{[0, T]} = {S_{t}, t \in [0, T]}

governed by a cumulative return process

R_{[0, T]} = {R_{t}, t \in [0, T]}

satisfying

d R_{t} = μ_{t} d t + σ_{t} d B_{t}, R_{0} = 0,

(47)

where

B_{t}

is a standard Brownian motion and

d S_{t} = S_{t} d R_{t}

(see Duffie (2001, Appendix 6D)). By (47),

S_{[0, T]}

is a continuous diffusion and, if

μ_{t}

and

σ_{t}

are constant,

S_{[0, T]}

is a GBM. In the risk-neutral world, the limiting cumulative return process obeys (47) with

μ_{t}

replaced by

r_{t}^{(f)}

. We note that the discrete model is much more informative than the continuous-time model as it preserves the path-dependent probabilities

p_{n}^{(L_{n}, M_{n})}

, with no assumption on their (in)dependence.

7. Stock Pricing—Special Cases

The binary-tree pricing model presented above encompasses several time-series processes that have been proposed to model stock prices. We consider three such processes, all of which assume constant time spacing,

Δ t_{n} = Δ t

,

n = 1, \dots, m

.28 These examples are BWN and the cases in which the first difference of the price process is assumed to display either MA(1) or AR(1) behavior. These latter two time-discrete models are of particular importance in microstructure theory. In the Introduction, we mentioned the analogous models of Hasbrouck (1988), which build upon the seminal Roll model (Roll, 1984). See Hasbrouck (2007, Chapter 8) for an overview and discussion of other closely related models.

7.1. Asymmetric Binary White Noise

Consider the case in which the conditional probabilities at each node of

{BIT}_{m}

are the same:

\begin{matrix} \begin{matrix} P (ϵ_{j}^{(l_{j})} = 1 |E_{j - 1} = (0, ϵ_{j}^{(l_{1})} = m_{1}, \dots, ϵ_{j - 1}^{(l_{j - 1})} = m_{j - 1})) & = p \\ P (ϵ_{j}^{(l_{j})} = 0 |E_{j - 1} = (0, ϵ_{j}^{(l_{1})} = m_{1}, \dots, ϵ_{j - 1}^{(l_{j - 1})} = m_{j - 1})) & = 1 - p \end{matrix}\}, \end{matrix}

l_{j} = 1, \dots, K_{j}, j = 1, \dots, m .

(48)

We develop a white noise process (Hamilton, 2020, Chapter 3.2) on

{BIT}_{m}

where each node takes on one of only two possible values. This asymmetric binary white noise (ABWN) on

{BIT}_{m}

is the

F^{(d)}

-adapted process

\begin{matrix} z_{0}^{(d)} & = z_{t_{0}}^{(d)} = 0, \\ z_{t}^{(d)} & = z_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} = \{\begin{matrix} z_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 1)}^{(d)} : = z_{u} w . p . p, \\ z_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 0)}^{(d)} : = z_{d} w . p . 1 - p, \end{matrix} n = 1, \dots, m - 1 . \end{matrix}

(49)

As the process (49) on

{BIT}_{m}

is path-independent, the ABWN notation can be simplified to

z_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} \equiv z_{t_{n}}^{(l_{n})}

,

l_{n} = 1, \dots, K_{n}

, as indicated in (50).

Using (48) and the white noise moments

E [z_{t_{n}}^{(l_{n})}] = 0

and

Var [z_{t_{n}}^{(l_{n})}] = σ_{z}^{2} Δ t

with

σ_{z}^{2}

being finite, (49) becomes

\begin{matrix} z_{t}^{(d)} & = z_{0}^{(0)} = 0, & t \in [0, t_{1} = Δ t), \\ z_{t}^{(d)} & = z_{t_{n}}^{(l_{n})} = \{\begin{matrix} z_{u} = σ_{z} \sqrt{\frac{1 - p_{1}}{p_{1}} Δ t} w . p . p, \\ z_{d} = - σ_{z} \sqrt{\frac{1 - p_{1}}{p_{1}} Δ t} w . p . 1 - p, \end{matrix} & t \in [t_{n}, t_{n + 1} = t_{n} + Δ t), \\ l_{n} = 1, \dots, K_{n}, n = 1, \dots, m - 1 . \end{matrix}

(50)

It is straightforward to show that the ABWN process

z_{t}^{(d)}

has the first-order autocorrelation,

E [z_{t_{n}}^{(l_{n})} z_{t_{n - 1}}^{(l_{n - 1})}] = {(E [z_{t_{n}}^{(l_{n})}])}^{2} = 0

a property of a white noise process.

7.2. Binary Moving Average of Order 1

We define the random process

Δ S_{t}^{(d)} = S_{t}^{(d)} - S_{t - Δ t}^{(d)}

to be a binary moving-average process of order one (BMA(1)) by setting

Δ S_{t}^{(d)} = Δ S_{t_{n}; E_{n}^{((L_{n - 1}, l_{n}), (M_{n - 1}, m_{n}))}}^{(d)} = \{\begin{matrix} c + θ z_{t_{n - 1}}^{(l_{n - 1})} + z_{u}, if m_{n} = 1, \\ c + θ z_{t_{n - 1}}^{(l_{n - 1})} + z_{d}, if m_{n} = 0, \end{matrix}

(51)

for

t \in [t_{n}, t_{n + 1})

,

n = 1, \dots, m - 1

, and values

c, θ \in R^{+}

. In (51), the local index

m_{n} = 1

occurs with probability p,

m_{n} = 0

occurs with probability

1 - p

, and

z_{t_{n - 1}}^{(l_{n - 1})}

is the ABWN process (50). The expected value, variance, and first-order auto-covariance of

Δ S_{t}^{(d)}

follow the usual MA(1) process:

E [Δ S_{t_{n}}^{(d)}] = c, Var [Δ S_{t_{n}}^{(d)}] = (1 + θ^{2}) σ_{z}^{2}, cov {[Δ S_{t_{k}}^{(d)} Δ S_{t_{n}}^{(d)}]}_{k < n} = \{\begin{matrix} θ σ_{z}^{2}, if k = n - 1, \\ 0, otherwise, \end{matrix}

(52)

giving the familiar MA(1) autocorrelation coefficient:

ρ_{1} = \frac{cov [Δ S_{t_{n - 1}}^{(d)} Δ S_{t_{n}}^{(d)}]}{Var [Δ S_{t_{n}}^{(d)}]} = \frac{θ}{1 + θ^{2}} .

In practice, the coefficients

θ

and

σ_{z}

of the BMA(1) process can be estimated from historical price differences. Let

{\hat{γ}}_{0}

and

{\hat{γ}}_{1}

denote, respectively, the empirical variance and first-order auto-covariance of the risky-asset price differences. Setting

{\hat{γ}}_{0} = (1 + θ^{2}) σ_{z}^{2}, {\hat{γ}}_{1} = θ σ_{z}^{2},

(53)

and solving for

θ

and

σ_{z}^{2}

gives the solution pairs

θ_{+}, σ_{z +}^{2}

and

θ_{-}, σ_{z -}^{2}

,

\begin{matrix} θ_{+} = \frac{{\hat{γ}}_{0} + \sqrt{{\hat{γ}}_{0}^{2} - 4 {\hat{γ}}_{1}^{2}}}{2 {\hat{γ}}_{1}}, σ_{z +}^{2} = \frac{{\hat{γ}}_{0} - \sqrt{{\hat{γ}}_{0}^{2} - 4 {\hat{γ}}_{1}^{2}}}{2}, \\ θ_{-} = \frac{{\hat{γ}}_{0} - \sqrt{{\hat{γ}}_{0}^{2} - 4 {\hat{γ}}_{1}^{2}}}{2 {\hat{γ}}_{1}}, σ_{z -}^{2} = \frac{{\hat{γ}}_{0} + \sqrt{{\hat{γ}}_{0}^{2} - 4 {\hat{γ}}_{1}^{2}}}{2}, \end{matrix}

(54)

which have the following properties:

θ_{+} θ_{-} = 1

;

σ_{z +}^{2} σ_{z -}^{2} = {\hat{γ}}_{1}^{2}

;

θ_{+} {\hat{γ}}_{1} = σ_{z -}^{2}

; and

θ_{-} {\hat{γ}}_{1} = σ_{z +}^{2}

. To guarantee

σ_{z \pm}^{2} \in R

with

σ_{z \pm}^{2} > 0

, Equation (54) require

| {\hat{γ}}_{0} / {\hat{γ}}_{1} | \geq 2

. Addition of the constraint

| θ | < 1

to guarantee invertibility of the BMA(1) process restricts the solution of (53) to the pair

θ_{-}, σ_{z -}^{2}

.

The choice of the BMA(1) tree is motivated by the fact that it represents a generalization (see, e.g., Hasbrouck, 2007, Chapters 4.2 and 8) of the Roll (1984) microstructure model discussed in the Introduction.

7.3. Autoregressive of Order 1

The ABWN process can also be used as a basis to model the first difference of the price process as binary autoregressive of the first-order (BAR(1)). Let

\begin{matrix} Δ S_{t}^{(d)} & = Δ S_{t_{1}; E_{1}^{(l_{1}, m_{1})}}^{(d)} & = \{\begin{matrix} c + z_{u}, if m_{1} = 1, \\ c + z_{d}, if m_{1} = 0, \end{matrix} for t \in [t_{1}, t_{2}), \\ Δ S_{t}^{(d)} & = Δ S_{t_{n}; E_{n}^{((L_{n - 1}, l_{n}), (M_{n - 1}, m_{n}))}}^{(d)} & = \{\begin{matrix} c + ϕ Δ S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} + z_{u}, if m_{n} = 1, \\ c + ϕ Δ S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} + z_{d}, if m_{n} = 0, \end{matrix} \\ for t \in [t_{n}, t_{n + 1}), n = 2, \dots, m - 1, \end{matrix}

(55)

where

c, ϕ \in R

. In (55), the local index

m_{n} = 1

occurs with probability p, while

m_{n} = 0

occurs with probability

1 - p

. Requiring

ϕ \in (- 1, 1)

ensures that the process has MA(∞) representation. The expected value, variance, and auto-covariance of order h for

Δ S_{t_{n}}

are

\begin{matrix} E [Δ S_{t_{n}}] & = c (1 + ϕ + ϕ^{2} + \dots + ϕ^{n - 1}), \\ Var [Δ S_{t_{n}}] & = σ_{z}^{2} Δ t \sum_{i = 0}^{n - 1} ϕ^{2 i} = σ_{z}^{2} Δ t (\frac{1 - ϕ^{2 n}}{1 - ϕ^{2}}), \\ cov [Δ S_{t_{k}} Δ S_{t_{n}}] & = σ_{z}^{2} Δ t ϕ^{n - k} \sum_{i = 0}^{k - 1} ϕ^{2 i} = σ_{z}^{2} Δ t ϕ^{n - k} (\frac{1 - ϕ^{2 k}}{1 - ϕ^{2}}), for k < n, \end{matrix}

giving

ρ_{h} = lim_{n \to \infty; k = n - h} (\frac{cov [Δ S_{t_{k}} Δ S_{t_{n}}]}{Var [Δ S_{t_{n}}]}) = ϕ^{h}

for a finite lag h.

8. Computational Simplifications

Computations on a non-recombining binary tree are well known to be exponentially expensive, both in terms of memory requirements as well as execution time.29 Common methods to reduce the number of time intervals,

0, t_{1}, \dots, t_{m} = T

, involved in binary-tree option pricing computations include using either a common large time step,

Δ t_{n} = Δ t > 1

,

n = 1, \dots, m

, or a graduated increase in time intervals

Δ t_{n} > Δ t_{n - 1}

,

n = 1, \dots, m

. However, the following features of the three special-case models reduce the computational and storage complexity.

8.1. ABWN Model

As stated by (49) and illustrated in Figure 7, every level

l_{n}

on the ABWM binary tree has exactly the same two state values. As all conditional probabilities (48) reduce to either p or

1 - p

, the ABWM model reduces to a simple model on a (recombining) binomial tree.

8.2. BMA(1) Model

As a result of (51), the triangular configuration illustrated in Figure 7 represents the fundamental replicating unit for

Δ S

in the state space of the BMA(1) tree. By iterating (51) to obtain prices, it can be shown that, at time step n,

n = 1, \dots, m

, there are

2 n

unique price states, each of the form

S_{n, i} = S_{0} + n c + α_{1, i} z_{u} + α_{2, i} 2 z_{d} + θ (δ_{1, i} z_{u} + δ_{2, i} z_{d}), i = 1, \dots, 2 n .

(56)

In (56),

S_{0}

is the initial price. The coefficients

α_{j, i}

and

δ_{j, i}

are listed in Table 1. The coefficients satisfy

α_{1, i} + α_{2, i} = n

and

δ_{1, i} + δ_{2, i} = n - 1

,

i = 1, \dots, 2 n

, with

α_{1, i} = α_{2, 2 n - i + 1}

and

δ_{1, i} = δ_{2, 2 n - i + 1}

reflecting the

z_{u} \leftrightarrow z_{d}

symmetry of the binary tree.

The binary-tree description of the BMA(1) model in Section 7.2 uses the path notation

(L_{n}, M_{n})

, with

M_{n} = {0, m_{1}, m_{2}, \dots, m_{n}}

, to designate a specific node at time n. The partial recombining of the binary tree (from

2^{n}

distinct price states to

2 n

price states at time step n) suggests the simpler node (price) labeling of

(n, i)

,

i = 1, \dots, 2 n

, of (56). The connection between the two is as follows. Let

δ_{1}

denote the number of elements

m_{i} = 1

in the binary string

m_{1} m_{2} \dots m_{n - 1}

and

α_{1}

denote the number of elements

m_{i} = 1

in the binary string

m_{1} m_{2} \dots m_{n}

. Then, node

(L_{n}, M_{n})

corresponds to the node

(n, i)

, with

i = α_{1} + δ_{1} + 1

having the price given by the corresponding value of i in Table 1.

As a result of the reduction of the number of nodes, from

2^{n}

to

2 n

at time n, the BMA(1) model becomes computationally tractable for large values of n (competitive with trinomial models, which have

2 n + 1

nodes at time n). With prices (and therefore nodes) indexed as in (56), each price is uniquely specified by the vector of values

(n α_{1} α_{2} δ_{1} δ_{2})

(we ignore the value

S_{0}

, which is common to all nodes). Using this vector notation, Figure 8 illustrates the prices on a

{BIT}_{5}

BMA(1) tree. The figure also presents the vector denoting the change in the price

Δ S_{(n - 1, i_{n - 1}), (n, i_{n})}

along each indicated branch of the tree. It is a feature of the BMA(1) tree that only four price-change vectors occur in the tree:

(11010)

,

(10110)

,

(11001)

, and

(10101)

. (This is implicitly stated in Figure 7.) Furthermore, these four vectors always occur in this repetitive sequence in the transition from time n to

n + 1

.

The BMA(1) model supports the following further simplifications. For simplicity, we express these using the

(L_{n}, M_{n})

path notation of Section 2.

{BIT}_{m}

: As the BMA(1) model utilizes ABWN, from (48) we have constant conditional probabilities p and

1 - p

.

Riskless rate and bank account dynamics: For all n, if we assume

r_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d, f, inst)} = r

, a constant, then, from (22),

r_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d, f)} = r Δ t

. Consequently, from (23)

β_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} = β_{0}^{(d)} {(1 + r Δ t)}^{n} w . p . p_{n - 1}^{(L_{n - 1}, M_{n - 1})}, n = 1, \dots, m, p_{0}^{(L_{0}, M_{0})} = 1 .

Risky asset: We have the unconditional expectation (52)

E [Δ S_{t_{n}}^{(d)}] = c

,

n = 1, \dots, m

. From (31),

\begin{matrix} E [r_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}] & = μ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)} Δ t = \{\begin{matrix} \frac{θ z_{u} + c}{S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}} if m_{n - 1} = 1, \\ \frac{θ z_{d} + c}{S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}} if m_{n - 1} = 0, \end{matrix} \\ Var [r_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)}] & = {(σ_{t_{n} |E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(r)})}^{2} Δ t = \frac{σ_{z}^{2} Δ t}{{(S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)})}^{2}} . \end{matrix}

(57)

Risk-neutral dynamics: Equation (38) simplifies to

\begin{matrix} q_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 1)}^{(d)} & = p - θ_{t_{n + 1} |E_{n}^{(L_{n}, M_{n})}} \sqrt{p (1 - p) Δ t}, \\ q_{t_{n + 1}; (E_{n}^{(L_{n}, M_{n})}, ϵ_{n + 1}^{(l_{n + 1})} = 0)}^{(d)} & = (1 - p) + θ_{t_{n + 1} |E_{n}^{(L_{n}, M_{n})}} \sqrt{p (1 - p) Δ t} \end{matrix}

(58)

with

θ_{t_{n + 1} |E_{n}^{(L_{n}, M_{n})}} = \{\begin{matrix} \frac{θ z_{u} + c - r S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)}}{σ_{z}} if m_{n + 1} = 1, \\ \frac{θ z_{d} + c - r S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)}}{σ_{z}} if m_{n + 1} = 0 . \end{matrix}

8.3. BAR(1) Model

To understand the simplification of the BAR(1) model, rewrite (55) as

Δ S_{t}^{(d)} = Δ S_{t_{n}; E_{n}^{((L_{n - 1}, l_{n}), (M_{n - 1}, m_{n}))}}^{(d)} = \{\begin{matrix} ϕ Δ S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} + U, if m_{n} = 1, \\ ϕ Δ S_{t_{n - 1}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d)} + D, if m_{n} = 0, \end{matrix}

(59)

for

n = 1, \dots, m - 1

, where

U = c + z_{u}

,

D = c + z_{d}

, and

Δ S_{t_{0}; E_{0}^{(L_{0}, M_{0})}}^{(d)} = 0

. By iterating (59), it is straightforward to develop an analytic formula for the

2^{n}

possible price states at level n.

Let

U D_{n}

denote a naturally ordered set of all possible binary sequences of length n, where the binary choice for each element of a sequence is either U or D. Let

U D_{n, k}

denote the k-th sequence in the set

U D_{n}

. The natural ordering in

U D_{n}

arises from the iteration of (59). Specifically

U D_{n + 1, 2 k} = (U D_{n, k}, U)

and

U D_{n + 1, 2 k - 1} = (U D_{n, k}, D)

. For example,

\begin{matrix} U D_{2} & = \{(D, D), (D, U), (U, D), (U, U)\} where U D_{2, 3} = (U, D), \\ U D_{3} & = {(D, D, D), (D, D, U), (D, U, D), (D, U, U), (U, D, D), (U, D, U), \\ (U, U, D), (U, U, U)} . \end{matrix}

Thus,

U D_{3, 6} = (U, D, U) = (U D_{2, 3}, U)

and

U D_{3, 5} = (U, D, D) = (U D_{2, 3}, D)

.

Considering the sequence

U D_{n, k}

to be a vector, at level n each of the

2^{n}

possible states can be written as the scalar product:

Φ_{n - 1} \cdot U D_{n, k}, k = 1, \dots, 2^{n},

(60)

where

Φ_{n}

is the vector

(ϕ^{n}, ϕ^{n - 1}, \dots, ϕ, 1)

. For example, the state corresponding to

U D_{3, 6}

is

Φ_{2} \cdot U D_{3, 6} = ϕ^{2} U + ϕ D + U = c (ϕ^{2} + ϕ + 1) + ϕ^{2} z_{u} + ϕ z_{d} + z_{u} .

However, under the restriction

ϕ \in (- 1, 1)

, for a large enough n,

ϕ^{n}

becomes smaller than the machine precision (machine epsilon) and finite-precision computation of the higher-order terms in sums such as

1 + ϕ + ϕ^{2} + \dots + ϕ^{n} + ϕ^{n + 1} + \dots ϕ^{n + k}

becomes meaningless. Thus, beyond a certain value of n, the numerical prices no longer change (the price of each child node remains equal to the price of the parent node). The value of n for which this occurs depends on the magnitudes

| ϕ |

,

| z_{u} |

, and

| z_{d} |

.30

9. Technical Analyses of the Probability Estimates

In Section Estimation of Probabilities, we noted the desirability of estimating the direction-of-price-change probabilities

p_{n}^{(L_{n}, M_{n}; Δ t_{1, n})}

from historical data using (20) on a set of V non-overlapping binomial sequences, each of length n. However, we also noted that, even for relatively small values of n, a prohibitively extensive history of returns would be required to ensure an adequate sample to determine each of the

2^{n}

probabilities for a given value of n. Table 2 codifies this problem using daily return data for the SPDR Dow Jones Industrial Average ETF Trust (DIA) covering the period of prices from 20 January 1998 through 5 May 2023. This data set provides 6365 return values; that is, a sequence of 6365 values of 0 (down) or 1 (up) price changes.

From the table it is evident that by

n = 6

, V is too small to ensure an adequate sample size for determining each of the

2^{6}

possible sequences. It is not clear the sample size is adequate even for

n = 5

.

There are models that can be applied to a historical return time series to generate “mimicking” time series. Of course, all such models add additional model error to the process. One approach is to fit the historical return time series to an ARMA(

l, m

)-GARCH(

p, q

) model combined with a distribution model for the ARMA residuals, and then use the ARMA-GARCH-distribution model with fitted parameters to generate adequate numbers, V, of mimicking return series from which to compute the required probabilities. Such an approach involves fitting a significant number of required parameters. It generates return time series, which is a step removed from the

0, 1

sequences required by (20). We prefer to utilize bootstrap resampling directly on the

0, 1

sequence determined by the historical data set. Bootstrapping has the advantage of constantly resampling the historical sequence.31 For a given value of n, we required

V \geq 10, 000 \times 2^{n}

bootstrapped samples (i.e., each of the

2^{n}

probabilities is determined based upon a expectation of 10,000 occurrences for each of the

2^{n}

possible sequences).

We compared our bootstrap resampled results against those computed from the historical time series with no resampling (i.e., with expected number of sequence occurrences given in Table 2). For plotting convenience, we have developed the following labeling system for each of the

2^{n}

possible

0, 1

sequences of length n. We illustrate the general notation using specific examples. Consider the

n = 5

sequences. They can be mapped to the

2^{5} = 32

values

x = - 15.5, - 14.5, \dots, - 0.5, 0.5, \dots, 14.5, 15.5

. For example, 01101 is mapped to

x = {(01101)}_{10} + 1 - (2^{5} + 1) / 2 = - 2.5

; 00000 is mapped to

x = {(00000)}_{10} + 1 - (2^{5} + 1) / 2 = - 15.5

; and 11111 to

x = {(11111)}_{10} + 1 - (2^{5} + 1) / 2 = 15.5

. This labeling has the property that

- x

and x label binary complement sequences (i.e., 01101 corresponds to

x = - 2.5

and 10010 to

x = 2.5

). A positive value of x indicates a binary string beginning with 1, while a negative value of x indicates a binary string beginning with 0.

Using the values x to represent the sequences corresponding to the various events

E_{n}^{(L_{n}, M_{n})}

with

Δ t_{1, n} = Δ t = 1

, Figure 9 and Figure 10 compare the results obtained for the probabilities

p_{n}^{(x)}

computed via (20) for the DIA data set without and with bootstrap resampling for

n = 4, 6, and 8

.

There is reasonable agreement between the results without and with bootstrap resampling for

n = 4

; significant differences develop for

n = 6

, which are then clearly revealed for

n = 8

. In particular, without bootstrap resampling, when

n = 8

the paucity of data results in the probabilities

p_{8}^{(x)}

taking on only nine possible values. The 95% confidence intervals in Figure 9 and Figure 10 are based upon the results in Figure 11 that the probabilities

p_{n}^{(x)}

obtained from bootstrap resampling are well described by a normal distribution.

Analysis of the probabilities of individual sequences falls within the area of technical pattern analysis (Lo et al., 2000). We continue this analysis by examining the highest- and lowest-probability paths. Specifically, for fixed n we consider whether the highest-probability sequences specify paths that are “closely grouped” (with a similar statement for the lowest-probability paths). If the highest-probability paths occur randomly, this would provide further confirmation of the efficient market hypothesis. However, clustered paths suggest the presence of pronounced patterns, as argued by Lo et al. (2000). A related consideration is whether the highest-probability path for

n = n_{1}

is a projection of the highest-probability path for

n = n_{2} > n_{1}

.

Figure 12 displays the observed results for the grouping of paths. For the

2^{n}

sequences of length n, we plot the highest-observed-probability path (colored gold), and the next

n - 1

-highest-probability paths (colored blue). Similarly, we plot the lowest-observed-probability path (colored green), and the next

n - 1

-lowest-probability paths (colored red). We consider

n = {4, 6, 8}

and plot results for the data without and with bootstrap resampling. Due to the data limitations with no resampling, there is a more “random” distribution of high- and low-probability paths. (Note in particular the highest- and lowest-probability paths for

n = 6

in the case with no resampling.) With bootstrap resampling improving sample sizes, there is a more distinct grouping of the high- and low-probability paths, with the high-probability paths characterized by more consistent price increases and the low-probability paths characterized by more consistent price decreases.

Figure 13 displays the observed results for the projections of the highest- and lowest-probability paths. The highest-probability path for

n = {4, 5, 6, 7, 8, 10}

are each plotted on the same graph. Similarly for the respective lowest-probability path for each value of n. It is clear that, over this range of values of n, the lowest-probability path for

n = n_{1}

is simply the projection (truncation) of the highest-probability path for

n = n_{2} > n_{1}

. In the case of the highest-probability path, there is a “discontinuity” in the projection. For

n < 6

, the highest-probability path is a truncation of that for

n = 6

. For

n = 7, 8

, the highest-probability path is almost a truncation of that for

n = 10

(with a slight difference occurring for

n = 7

at

t_{2}

and

n = 8

at

t_{3}

).

To test the dynamic stability of such estimates, we reperformed the probability estimation procedure for the DIA data set using a rolling window of length 15 years (3780 trading days). This generated 2586 windows. For each window, sequence probabilities were computed for

n = {2, 4, 6}

, using bootstrap resampling to ensure adequate sample sizes. (To speed-up the computation, we employed

1000 \times 2^{n}

bootstrapped samples in each window.) For each choice of n, the rolling windows produced an empirical distribution of probability estimates for each of the

2^{n}

sequences. These distributions are summarized as box–whisker plots in Figure 14. Figure 9 and Figure 14 show very similar structures (for

n = 4

and 6), indicating relative stability between the rolling window and global estimates of the sequence probabilities.

We now address the substructure that is apparent in Figure 14 (and in Figure 9 and Figure 10). Figure 15 replots the

n = {4, 6}

box–whisker plots with the sequences placed in categories according to the number of zeros (price downturns; equivalently the number of ones (price upturns)) each contains. Within each category, the sequences are still labeled from smallest to largest numerical label, x, as indicated in the top plot of Figure 15. The substructure seen in Figure 14 has largely vanished from Figure 15, indicating that the number of price downturns (equivalently upturns) is the major driver of a sequence’s probabilities. The uniformity of the ranges of the probability distributions within a category is indicative of an efficient market hypothesis operating within each category. It is in the difference in the ranges of the probability distributions between categories that market inefficiencies are seen.32 For

n = 4

, there is no overlap between the range of the empirical probability distribution for the sequence 0000 and the range of any distribution for sequences containing two or more upturns. For

n = 6

, there is no overlap between the range of the empirical distribution for the sequence 000000 and the range of any other distribution. Furthermore, the range of any distribution for a sequence containing five or six downturns has no overlap with the range of any distribution for a sequence containing four or more upturns.

We compare the sequence probability estimates among different assets using the 30 components comprising (as of 31 August 2020) the Dow Jones Industrial Average (DJIA) index. Price data were used for the period 3 January 2000 through 26 August 2022, with the exceptions of Visa (price data beginning 18 March 2008) and Dow (price data beginning 29 February 2019). This provided 5699 return values for 28 of the assets (3637 returns for Visa and 868 returns for Dow). Sequence probabilities were computed for these assets for

n = {1, 2, 3}

. For these small values of n, the probabilities were computed from the data without bootstrap resampling. The estimated probabilities for sequences of length

n = 1, 2

are presented in Table A1 in Appendix B. The probabilities for sequences of length

n = 3

are presented in Table A2. Significant p-values obtained from the one-sided z-test are also indicated. For comparison, sequence probabilities for the DIA data are also provided in these tables.

For

n = 1

, for all 31 assets, the probability of a negative return is smaller than the probability of a positive return. For 24 of these assets, this relationship is significant at a level ≤

5 %

. For assets where the probabilities of the sequences 00 (

n = 2

) or 000 (

n = 3

) are significant at the level ≤

5 %

, these sequences have the smallest probability. This holds for 26 of 31 assets for

n = 2

and 21 of 31 assets for

n = 3

. For the sequences 11 and 111, the results are not as strong. These sequences are significant at the level ≤

5 %

in 11 of 31 assets for

n = 2

and 9 of 31 assets for

n = 3

. However, these sequences represent, respectively, the highest probability for only 10 of the 11, for

n = 2

, and 7 of the 9, for

n = 3

, of these assets. If, instead, we consider sequences that contain at most a single negative return (a single “0” value), then the highest probability sequence that has significance at the

\leq 5 %

level occurs for 15 of 31, for

n = 2

, and 20 of 31, for

n = 3

, assets. These observations are consistent with the DIA results in Figure 13 for

n = {4, 5, 6, 7, 8, 10}

.

10. Discussion

This work contains several points that we wish to emphasize.

The most significant is that modeling of market microstructure cannot be achieved using (existing) continuous-time stochastic theory. We provide two critical observations in the Introduction to support this statement. The first is that inclusion of microstructure effects results in price processes that are not semimartingales. The second is to note the work of Jarrow et al. (2009) (and Cheridito (2003)) that shows finite intervals between transaction times are required to maintain arbitrage-free price processes in such cases. (While Roll’s approximation

s p r e a d = 2 \sqrt{- c o v}

resembles a continuous-time statement, it is crucial to recognize that

c o v

is the first-order serial covariance of price changes, which are not instantaneous, but occur over finite market time intervals.) We argue that discrete, binary-tree models are appropriate models to capture microstructure behavior.

Our major theoretical result in this paper is the development a market-complete, arbitrage-free, binary-tree option pricing framework that is capable of capturing price processes exhibiting path-dependent behavior. Consistent with the observations in the previous paragraph, we note that any attempt to consider the continuous-time limit of this binary-tree model is self-defeating, as critical microstructure detail is lost in the continuum limit.

By applying the framework to build specific moving-average MA(1) and autoregressive AR(1) models, we have demonstrated that this binary-tree framework can be used to model market microstructure. In the case of the MA(1) model, we show that the binary price tree is sufficiently recombining that the computational complexity of the model is only

O (n^{2})

. Both the MA(1) and AR(1) models require empirical testing in order to demonstrate their applicability as well as to provide comparison against other models. As this theoretical paper is already lengthy, we made the decision to leave such critical considerations for future investigation.

As it relates directly to the efficient market hypothesis, we have devoted a section of this work to a technical analysis of the direction-of-price-change probabilities that appear as parameters in the binary-tree model. Analysis of price histories for the 30 assets in the Dow Jones Average leads us to conclude that, when fixed-length sequences of daily price changes are categorized according to the number of price downturns in the sequence, it appears that the efficient market hypothesis operates within each category. However, market inefficiencies are evident between categories.

Author Contributions

Conceptualization, S.T.R.; methodology, W.B.L. and S.T.R.; software, D.L. and Y.H.; validation, D.L., W.B.L. and Y.H.; formal analysis, W.B.L. and S.T.R.; investigation, D.L., W.B.L. and Y.H.; data curation, D.L.; writing—original draft preparation, D.L.; writing—review and editing, W.B.L.; visualization, W.B.L.; supervision, S.T.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available as follows. Stock and ETF price data provided through Bloomberg Professional Services and used under license. Code for the computations in Section 9 are available upon request to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Sequential Definition of the Probability Law on the BIT for n = 2 and 3

For

n = 2

, set

E \overset{def}{=} \{ϵ_{0}^{(0)}, ϵ_{1}^{(1)}, (ϵ_{2}^{(1)}, ϵ_{2}^{(2)})\} = \{E_{1}, (ϵ_{2}^{(1)}, ϵ_{2}^{(2)})\}

. Then,

F^{(2)} = σ (E_{2})

with

\begin{matrix} P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1) & = P (ϵ_{2}^{(2)} = 1 |ϵ_{1}^{(1)} = 1) P (ϵ_{1}^{(1)} = 1) \\ = P (ϵ_{2}^{(2)} = 1 |ϵ_{1}^{(1)} = 1) p_{1}^{((0, 1), (0, 1))}, \\ P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0) & = P (ϵ_{2}^{(2)} = 0 |ϵ_{1}^{(1)} = 1) P (ϵ_{1}^{(1)} = 1) \\ = P (ϵ_{2}^{(2)} = 0 |ϵ_{1}^{(1)} = 1) p_{1}^{((0, 1), (0, 1))}, \\ P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 1) & = P (ϵ_{2}^{(2)} = 1 |ϵ_{1}^{(1)} = 0) P (ϵ_{1}^{(1)} = 0) \\ = P (ϵ_{2}^{(2)} = 1 |ϵ_{1}^{(1)} = 0) p_{1}^{((0, 1), (0, 0))}, \\ P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 0) & = P (ϵ_{2}^{(2)} = 0 |ϵ_{1}^{(1)} = 0) P (ϵ_{1}^{(1)} = 0) \\ = P (ϵ_{2}^{(2)} = 0 |ϵ_{1}^{(1)} = 0) p_{1}^{((0, 1), (0, 0))} . \end{matrix}

(A1)

The

E_{1}

-conditional probabilities satisfy

\begin{matrix} P (ϵ_{2}^{(2)} = 1 |ϵ_{1}^{(1)} = 1) & \in (0, 1), \\ P (ϵ_{2}^{(2)} = 0 |ϵ_{1}^{(1)} = 1) & = 1 - P (ϵ_{2}^{(2)} = 1 |ϵ_{1}^{(1)} = 1), \\ P (ϵ_{2}^{(1)} = 1 |ϵ_{1}^{(1)} = 0) & \in (0, 1), \\ P (ϵ_{2}^{(1)} = 0 |ϵ_{1}^{(1)} = 0) & = 1 - P (ϵ_{2}^{(1)} = 1 |ϵ_{1}^{(1)} = 0) . \end{matrix}

Thus, the unconditional probabilities are

\begin{matrix} P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1) & \overset{def}{=} p_{2}^{((0, 1, 2), (0, 1, 1))} \in (0, p_{1}^{((0, 1), (0, 1))}), \\ P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0) & \overset{def}{=} p_{1}^{((0, 1, 2), (0, 1, 0))} = p_{1}^{((0, 1), (0, 1))} - p_{2}^{((0, 1, 2), (0, 1, 1))}, \\ P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1) & \overset{def}{=} p_{2}^{((0, 1, 1), (0, 0, 1))} \in (0, p_{1}^{((0, 1), (0, 0))}), \\ P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0) & \overset{def}{=} p_{2}^{((0, 1, 1), (0, 0, 0))} = p_{1}^{((0, 1), (0, 0))} - p_{2}^{((0, 1, 1), (0, 0, 1))} . \end{matrix}

To estimate

p_{2}^{((0, 1, 2), (0, 1, 1))}

, we use the historical frequency

{\hat{p}}_{1}^{((0, 1), (0, 1); Δ t_{1}, Δ t_{2})}

of observing “a positive price change over a trading period of size

Δ t_{1}

followed by a positive price change over a trading period of size

Δ t_{2}

”. Estimates for the remaining three probabilities of two-step stock movements are computed analogously.

For

n = 3

, set

E_{3} \overset{def}{=} \{ϵ_{0}^{(0)}, ϵ_{1}^{(1)}, (ϵ_{2}^{(1)}, ϵ_{2}^{(2)}), (ϵ_{3}^{(1)}, ϵ_{3}^{(2)}, ϵ_{3}^{(3)}, ϵ_{3}^{(4)})\} = \{E_{2}, (ϵ_{3}^{(1)}, ϵ_{3}^{(2)}, ϵ_{3}^{(3)}, ϵ_{3}^{(4)})\} .

Then,

F^{(3)} = σ (E_{3})

, with

\begin{matrix} P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1, ϵ_{3}^{(4)} = 1) & = P (ϵ_{3}^{(4)} = 1 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1) P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1) \\ = P (ϵ_{3}^{(4)} = 1 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1) p_{2}^{((0, 1, 2), (0, 1, 1))}, \\ P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1, ϵ_{3}^{(4)} = 0) & = P (ϵ_{3}^{(4)} = 0 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1) P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1) \\ = P (ϵ_{3}^{(4)} = 0 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1) p_{2}^{((0, 1, 2), (0, 1, 1))}, \\ P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0, ϵ_{3}^{(3)} = 1) & = P (ϵ_{3}^{(3)} = 1 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0) P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0) \\ = P (ϵ_{3}^{(3)} = 1 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0) p_{2}^{((0, 1, 2), (0, 1, 0))}, \\ P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0, ϵ_{3}^{(3)} = 0) & = P (ϵ_{3}^{(3)} = 0 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0) P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0) \\ = P (ϵ_{3}^{(3)} = 0 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0) p_{2}^{((0, 1, 2), (0, 1, 0))}, \\ P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1, ϵ_{3}^{(2)} = 1) & = P (ϵ_{3}^{(2)} = 1 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1) P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1) \\ = P (ϵ_{3}^{(2)} = 1 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1) p_{2}^{((0, 1, 1), (0, 0, 1))}, \\ P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1, ϵ_{3}^{(2)} = 0) & = P (ϵ_{3}^{(2)} = 0 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1) P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1) \\ = P (ϵ_{3}^{(2)} = 0 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1) p_{2}^{((0, 1, 1), (0, 0, 1))}, \\ P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0, ϵ_{3}^{(1)} = 1) & = P (ϵ_{3}^{(1)} = 1 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0) P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0) \\ = P (ϵ_{3}^{(1)} = 1 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0) p_{1}^{((0, 1, 1), (0, 0, 0))}, \\ P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0, ϵ_{3}^{(1)} = 0) & = P (ϵ_{3}^{(1)} = 0 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0) P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0) \\ = P (ϵ_{3}^{(1)} = 0 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0) p_{1}^{((0, 1, 1), (0, 0, 0))} . \end{matrix}

The

E_{2}

-conditional probabilities satisfy

\begin{matrix} P (ϵ_{3}^{(4)} = 1 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1) & \in (0, 1), \\ P (ϵ_{3}^{(4)} = 0 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1) & = 1 - P (ϵ_{3}^{(4)} = 1 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1), \\ P (ϵ_{3}^{(3)} = 1 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0) & \in (0, 1), \\ P (ϵ_{3}^{(3)} = 0 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0) & = 1 - P (ϵ_{3}^{(3)} = 1 |ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0), \\ P (ϵ_{3}^{(2)} = 1 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1) & \in (0, 1), \\ P (ϵ_{3}^{(2)} = 0 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1) & = 1 - P (ϵ_{3}^{(2)} = 1 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 1), \\ P (ϵ_{3}^{(1)} = 1 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0) & \in (0, 1), \\ P (ϵ_{3}^{(1)} = 0 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0) & = 1 - P (ϵ_{3}^{(1)} = 1 |ϵ_{1}^{(1)} = 0, ϵ_{2}^{(2)} = 0) . \end{matrix}

(A2)

Thus, the unconditional probabilities are

\begin{matrix} P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1, ϵ_{3}^{(4)} = 1) & \overset{def}{=} p_{3}^{((0, 1, 2, 4), (0, 1, 1, 1))} \in (0, p_{2}^{((0, 1, 2), (0, 1, 1))}), \\ P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 1, ϵ_{3}^{(4)} = 0) & \overset{def}{=} p_{3}^{((0, 1, 2, 4), (0, 1, 1, 0))} \\ = p_{2}^{((0, 1, 2), (0, 1, 1))} - p_{3}^{((0, 1, 2, 4), (0, 1, 1, 1))}, \\ P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0, ϵ_{3}^{(3)} = 1) & \overset{def}{=} p_{3}^{((0, 1, 2, 3), (0, 1, 0, 1))} \in (0, p_{2}^{((0, 1, 2), (0, 1, 0))}), \\ P (ϵ_{1}^{(1)} = 1, ϵ_{2}^{(2)} = 0, ϵ_{3}^{(3)} = 0) & \overset{def}{=} p_{3}^{((0, 1, 2, 3), (0, 1, 0, 0))} \\ = p_{2}^{((0, 1, 2), (0, 1, 0))} - p_{3}^{((0, 1, 2, 3), (0, 1, 0, 1))}, \\ P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1, ϵ_{3}^{(2)} = 1) & \overset{def}{=} p_{3}^{((0, 1, 1, 2), (0, 0, 1, 1))} \in (0, p_{2}^{((0, 1, 1), (0, 0, 1))}), \\ P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 1, ϵ_{3}^{(2)} = 0) & \overset{def}{=} p_{3}^{((0, 1, 1, 2), (0, 0, 1, 0))} \\ = p_{2}^{((0, 1, 1), (0, 0, 1))} - p_{3}^{((0, 1, 1, 2), (0, 0, 1, 1))}, \\ P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0, ϵ_{3}^{(1)} = 1) & \overset{def}{=} p_{3}^{((0, 1, 1, 1), (0, 0, 0, 1))} \in (0, p_{1}^{((0, 1, 1), (0, 0, 0))}), \\ P (ϵ_{1}^{(1)} = 0, ϵ_{2}^{(1)} = 0, ϵ_{3}^{(1)} = 0) & \overset{def}{=} p_{3}^{((0, 1, 1, 1), (0, 0, 0, 0))} \\ = p_{2}^{((0, 1, 1), (0, 0, 0))} - p_{3}^{((0, 1, 1, 1), (0, 0, 0, 1))} . \end{matrix}

(A3)

Note that the unconditional probabilities (A3) can be written concisely as

\begin{matrix} P (ϵ_{1}^{(l_{1})}, ϵ_{2}^{(l_{2})}, 1) & = p_{3}^{((0, l_{1}, l_{2}, l_{3}), (0, ϵ_{1}^{(l_{1})}, ϵ_{2}^{(l_{2})}, 1))} \in (0, p_{2}^{((0, l_{1}, l_{2}), (0, ϵ_{1}^{(l_{1})}, ϵ_{2}^{(l_{2})}))}), \\ P (ϵ_{1}^{(l_{1})}, ϵ_{2}^{(l_{2})}, 0) & = p_{3}^{((0, l_{1}, l_{2}, l_{3}), (0, ϵ_{1}^{(l_{1})}, ϵ_{2}^{(l_{2})}, 0))} \\ = p_{2}^{((0, l_{1}, l_{2}), (0, ϵ_{1}^{(l_{1})}, ϵ_{2}^{(l_{2})}))} - p_{3}^{((0, l_{1}, l_{2}, l_{3}), (0, ϵ_{1}^{(l_{1})}, ϵ_{2}^{(l_{2})}, 1))}, \end{matrix}

where

l_{1} = 1

,

l_{2} = {(ϵ_{1}^{(l_{1})})}_{10} + 1

,

l_{3} = {(ϵ_{1}^{(l_{1})} ϵ_{2}^{(l_{2})})}_{10} + 1

. Similarly, the conditional probabilities (A2) can be written concisely as

\begin{matrix} P (ϵ_{3}^{(l_{3})} = 1 |E_{2} = (0, ϵ_{1}^{(l_{1})}, ϵ_{2}^{(l_{2})})) & \in (0, 1), \\ P (ϵ_{3}^{(l_{3})} = 0 |E_{2} = (0, ϵ_{1}^{(l_{1})}, ϵ_{2}^{(l_{2})})) & = 1 - P (ϵ_{3}^{(l_{3})} = 1 |E_{2} = (0, ϵ_{1}^{(l_{1})}, ϵ_{2}^{(l_{2})})) . \end{matrix}

Appendix B. Path Probability Tables for Assets in the DJIA Index

Table A1. Path probabilities for sequences of length (columns 2 and 3)

n = 1

and (columns 4 to 7)

n = 2

for assets in the DJIA index and for DIA. Also indicated are the probabilities having significant p-values from the one-sided z-test. *, ** and *** refer to the standard significance levels of 5%, 1% and 0.1%, respectively.

Table A1. Path probabilities for sequences of length (columns 2 and 3)

n = 1

and (columns 4 to 7)

n = 2

for assets in the DJIA index and for DIA. Also indicated are the probabilities having significant p-values from the one-sided z-test. *, ** and *** refer to the standard significance levels of 5%, 1% and 0.1%, respectively.

Path Label:	$- 0.5$	$0.5$	$- 1.5$	$- 0.5$	$0.5$	$1.5$
Symbol	Path Probability		Path Probability
AAPL	0.477 ***	0.523 ***	0.230 **	0.248	0.246	0.276 ***
AMGN	0.499	0.501	0.245	0.256	0.253	0.246
AXP	0.490	0.510	0.232 *	0.260	0.257	0.251
BA	0.486 *	0.514 *	0.229 **	0.260	0.254	0.257
CAT	0.486 *	0.514 *	0.239	0.247	0.248	0.266 *
CRM	0.487 *	0.513 *	0.222 ***	0.271 **	0.257	0.249
CSCO	0.484 *	0.516 *	0.229 **	0.262	0.247	0.261
CVX	0.475 ***	0.525 ***	0.221 ***	0.250	0.257	272 **
DIS	0.487 *	0.513 *	0.233 *	0.258	0.251	0.259
DOW	0.486	0.514	0.239	0.268	0.225	0.268
GS	0.488 *	0.512 *	0.229 **	0.256	0.262	0.253
HD	0.480 **	0.520 **	0.229 **	0.248	0.255	0.268 *
HON	0.477 ***	0.523 ***	0.219 ***	0.265 *	0.250	0.266 *
IBM	0.488 *	0.512 *	0.232 *	0.265 *	0.248	0.255
INTC	0.488 *	0.512 *	0.232 *	0.258	0.256	0.255
JNJ	0.486 *	0.514 *	0.225 **	0.265 *	0.257	0.253
JPM	0.493	0.507	0.233 *	0.254	0.267 *	0.246
KO	0.482 **	0.518 **	0.230 **	0.244	0.260	0.266 *
MCD	0.464 ***	0.536 ***	0.207 ***	0.262	0.252	0.279 ***
MMM	0.475 ***	0.525 ***	0.215 ***	0.259	0.261	0.265 *
MRK	0.494	0.506	0.246	0.242	0.253	0.258
MSFT	0.486 *	0.514 *	0.225 **	0.263	0.258	0.253
NKE	0.483 **	0.517 **	0.224 ***	0.253	0.264	0.258
PG	0.481 **	0.519 **	0.225 **	0.262	0.250	0.263
TRV	0.477 ***	0.523 ***	0.216 ***	0.254	0.268 *	0.262
UNH	0.477 ***	0.523 ***	0.221 ***	0.260	0.251	0.267 *
V	0.462 ***	0.538 ***	0.195 ***	0.275 **	0.259	0.270 *
VZ	0.493	0.517	0.234 *	0.264	0.255	0.248
WBA	0.499	0.501	0.244	0.258	0.251	0.246
WMT	0.487 *	0.513 *	0.224 ***	0.263	0.264	0.250
DIA	0.455 ***	0.545 ***	0.201 ***	0.262	0.246	0.291 ***

Table A2. Path probabilities for sequences of length

n = 3

for assets in the DJIA index and for DIA. Also indicated are the probabilities having significant p-values from the one-sided z-test. *, ** and *** refer to the standard significance levels of 5%, 1% and 0.1%, respectively.

Table A2. Path probabilities for sequences of length

n = 3

for assets in the DJIA index and for DIA. Also indicated are the probabilities having significant p-values from the one-sided z-test. *, ** and *** refer to the standard significance levels of 5%, 1% and 0.1%, respectively.

Path Label:	$- 3.5$	$- 2.5$	$- 1.5$	$- 0.5$	$0.5$	$1.5$	$2.5$	$3.5$
Symbol	Path Probability
AAPL	0.102 **	0.125	0.122	0.122	0.128	0.123	0.132	0.146 **
AMGN	0.112	0.119	0.148 *	0.122	0.131	0.121	0.121	0.125
AXP	0.118	0.121	0.132	0.131	0.114	0.132	0.121	0.131
BA	0.111 *	0.134	0.113	0.142 *	0.118	0.130	0.121	0.131
CAT	0.119	0.114	0.124	0.131	0.121	0.126	0.128	0.137
CRM	0.110 *	0.116	0.127	0.127	0.122	0.145 *	0.130	0.124
CSCO	0.095 ***	0.139 *	0.136	0.114	0.128	0.114	0.132	0.142 *
CVX	0.101 ***	0.114	0.128	0.132	0.118	0.144 **	0.128	0.136
DIS	0.112	0.102 **	0.134	0.139 *	0.138	0.115	0.121	0.139 *
DOW	0.115	0.137	0.077 *	0.132	0.150	0.124	0.128	0.137
GS	0.111 *	0.126	0.129	0.132	0.116	0.133	0.125	0.129
HD	0.104 **	0.131	0.120	0.129	0.124	0.129	0.122	0.141 *
HON	0.102 **	0.107 *	0.129	0.140 *	0.122	0.138	0.129	0.132
IBM	0.112 *	0.116	0.123	0.131	0.133	0.125	0.130	0.131
INTC	0.115	0.123	0.126	0.127	0.118	0.126	0.134	0.131
JNJ	0.112	0.135	0.125	0.122	0.117	0.132	0.114	0.144 **
JPM	0.116	0.114	0.131	0.150 ***	0.121	0.127	0.122	0.119
KO	0.116	0.115	0.123	0.130	0.115	0.131	0.132	0.138 *
MCD	0.095 ***	0.108 *	0.107 *	0.132	0.129	0.147 **	0.139 *	0.142 *
MMM	0.102 **	0.112 *	0.118	0.138 *	0.131	0.130	0.129	0.140 *
MRK	0.118	0.132	0.118	0.126	0.130	0.130	0.111 *	0.134
MSFT	0.103 **	0.113	0.136	0.136	0.126	0.134	0.129	0.123
NKE	0.107 *	0.122	0.118	0.132	0.123	0.135	0.136	0.127
PG	0.096 ***	0.122	0.126	0.121	0.135	0.140 *	0.128	0.132
TRV	0.099 ***	0.122	0.124	0.135	0.119	0.148 **	0.120	0.132
UNH	0.101 ***	0.124	0.126	0.121	0.115	0.142 *	0.137	0.135
V	0.079 ***	0.111	0.126	0.144 *	0.119	0.156 ***	0.139	0.125
VZ	0.106 **	0.122	0.136	0.141 *	0.126	0.128	0.122	0.118
WBA	0.111 *	0.133	0.132	0.119	0.132	0.123	0.128	0.122
WMT	0.103 **	0.121	0.127	0.118	0.126	0.146 **	0.139 *	0.119
DIA	0.090 ***	0.109 *	0.118	0.138	0.125	0.129	0.124	0.167 ***

Notes

1	See also the seminal works of Bachelier (1900).
2	For the definition of a long-memory process, we refer the interested reader to Mandelbrot (2001) and Beran (2017).
3	It is common to define these properties as “stylized facts”. These include: volatility clustering; returns with heavy tailed distributions; tail dependence; leverage effects; and long-term memory. See Mittnik et al. (2007) for a comprehensive exposition on the topic.
4	It is worth noting that all Lévy processes are semimartingales, and many well-studied models in finance assume that the asset log-returns follow Lévy processes, as seen in Eberlein and Prause (2000); Rachev et al. (2011); Schoutens (2003).
5	See Samura et al. (2013) for conditions under which real-valued, cadlag processes that satisfy NFVLR must be semimartingales.
6	However, answering questions related to the existence and uniqueness of solutions to the continuous-time stochastic PDE that a numerical model may be intending to approximate does require the full machinery of stochastic integration theory.
7	This term originates with the seminal paper by Garman (1976). See O’Hara (1997) for an extensive overview of market microstructure studies.
8	We follow the description in Hasbrouck (2007, Chapter 3.4).
9	As noted in Section 2, we reserve the time points $t_{0}$ and $t_{m}$ for the current time and the terminal time of an option, respectively.
10	Neither work by Cheridito (2003) nor Jarrow et al. (2009) considered option pricing. Thus, our work provides a critical extension.
11	We refer the interested reader to Dzhaparidze and van Zuijlen (1996), Shiryaev (1999, Section II), and Cordero et al. (2014). These papers extend the Cox et al. (1979) and Jarrow and Rudd (1982) binomial models to general binary pricing models, without preserving the information on upward stock price probability or mean stock returns. As discussed in Hu et al. (2020a, 2020b), this is a considerable drawback when option trading is performed in discrete times.
12	In this paper, our interest is only in fully (perfectly) balanced binary trees. For brevity, we shall continue to refer to them simply as binary trees.
13	Thus, the example node sequence $M_{3} = (0, 1, 1, 0)$ corresponds to the level sequence $L_{3} = (0, 1, l_{2}, l_{3})$ with $l_{2} = {(1)}_{10} + 1 = 2$ , and $l_{3} = {(11)}_{10} + 1 = 4$ .
14	See “Standard probability space”, Encyclopedia of Mathematics, EMS Press (2001).
15	Binary information trees are nested, ${BIT}_{n} \in {BIT}_{m}$ for $n < m$ .
16	The family of probabilities $p_{n}^{(L_{n}, M_{n})}$ , $L_{n} = (0, l_{1}, \dots, l_{j}, \dots, l_{n})$ , $l_{j} = 1, \dots, 2^{j - 1}$ , $M_{n} = (0, m_{1}, \dots, m_{j}, \dots, m_{n})$ , $m_{j} = 0, 1$ , $j = 1, \dots, n$ , should satisfy Kolmogorov’s extension theorem (see Oksendal, 2013, Theorem 2.1.5, p. 11), as illustrated in the cases for $n = 2, 3$ in Appendix A.
17	While it may be of interest to view the trading times $t_{1} < \dots < t_{m - 1}$ as stopping times, this is beyond the scope of the current paper. However, such an extension can be done by introducing binary pricing models with dynamics following discrete-time semimartingales that are contaminated by noise occurring at random time instances. See, for example, Jacod and Protter (2012, Chapter 16) and (Aït-Sahalia & Jacod, 2014, Chapter 9).
18	Note the assumption that no trade occurs at $t_{0} = 0$ or at the terminal time $t_{m} = T$ (i.e., there is no new market information at times 0 and T).
19	Hung and Swallow (1999) provide a robust test for sample proportions when the Bernoulli trials are dependent. Applying robust estimates for $p_{1}^{((0, 1), (0, 1))}$ does not make a significant difference in the numerical examples we consider in Section 9 because our sample size is relatively large and the dependence between the Bernoulli trials is weak.
20	The first trading date is $t_{1}$ . Thus, the first opportunity for the trader to deposit or withdraw from the bank account is $t_{1}$ . The value of $r_{t_{0}; E_{0}}^{(d, f)}$ is irrelevant for the trader.
21	We emphasize the equivalent notations $\begin{matrix} S_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 1)}^{(d)} & = S_{t_{n}; E_{n}^{(L_{n}, M_{n} = (M_{n - 1}, 1))}}^{(d)}, & s_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 1)} = s_{t_{n}; E_{n}^{(L_{n}, M_{n} = (M_{n - 1}, 1))}}, \\ r_{t_{n}; (E_{n - 1}^{(L_{n - 1}, M_{n - 1})}, ϵ_{n}^{(l_{n})} = 1)}^{(d)} & = r_{t_{n}; E_{n}^{(L_{n}, M_{n} = (M_{n - 1}, 1))}}^{(d)}, & etc . \end{matrix}$
22	The functions f on $(0, \infty) \times [0, T]$ and g on $(0, \infty)$ satisfy the usual regularity conditions; see Duffie (2001, Chapter 5). These conditions will only be needed when we consider the limiting option price process as max $(Δ t_{n}) \to 0$ .
23	Recall discussion immediately following (30).
24	The DPIP is also known as the Functional Limit Theorem. We will apply the DPIP for continuous diffusions only, see Davydov and Rotar (2008). Extensions to more general DPIP, where the limiting price process is a semimartingale, are known; see Cherny et al. (2003); Duan et al. (2006) and Hu et al. (2020a). It will be of interest to study DPIP when the limiting pricing process is a semimartingale plus noise. These types of DPIP could be obtained by applying limiting results as studied in Jacod and Protter (2012), but that line of research is beyond the scope of this paper. Unfortunately, as pointed out in Hu et al. (2020a, 2020b), the limiting stock price dynamics erases important information contained in the discrete pricing model; specifically the probabilities for the direction of stock price moment and, in the case of option pricing, the mean return of the stock. For this critical reason we view this section on the continuum limit of the discrete dynamics mainly as an extension to the classical CRR and Jarrow and Rudd (1982) option pricing models. As these limiting results reveal, incorporation of market microstructure features requires full use of the discrete binary-tree pricing model.
25	See, e.g., Jacod and Protter (2012, p. 49), and Cherny et al. (2003) and the references therein.
26	The limiting riskless rate is also assumed to satisfy $P ({sup}_{t \in [0, T]} \{r_{t} + \frac{1}{r_{t}}\} < \infty) = 1$ . See Duffie (2001, p. 102) for the extension to a stochastic short rate $r_{[0, T]}^{(f)}$ under additional regularity conditions.
27	Relaxing the assumptions on $μ_{[0, T]}$ and $σ_{[0, T]}$ requires an extension of the non-standard DPIP by Davydov and Rotar (2008) for general continuous diffusions, which is beyond the scope of the current work. The reason we do not pay significant attention to the limiting behavior of the binary asset pricing dynamics is that the continuous dynamics of the return process $R_{[0, T]} = R_{t}$ , $t \in [0, T]$ loses the important information regarding the probabilities for the direction of stock price movements $p_{n}^{(L_{n}, M_{n})} = \prod_{j = 1}^{n} p_{j}^{(l_{j}, m_{j})}$ , $n = 0, \dots, n_{N}$ , $N \in N$ . Even worse, when passing to risk-neutral continuous dynamics, the extremely valuable information about the mean stock returns $μ_{t_{n, N} \| E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}$ will also be lost. When discussing market microstructure option pricing models, losing information on $p_{n}^{(L_{n}, M_{n})}$ and $μ_{t_{n, N} \| E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}$ hardly seems justifiable. Thus, in this work we concentrate our attention on (discrete) binary asset pricing, and pass to the limit as $Δ_{N} = O (\frac{1}{N}), N ↑ \infty$ , only to provide a comparison with the classical BSM asset pricing continuous-time dynamics.
28	This constant time spacing can be relaxed at the cost of greater computational complexity.
29	A simple estimate serves to illustrate the issue. Consider a reasonably standard desktop computer with 128 GBytes ( $2^{37}$ bytes) of RAM. Storing a single floating point value at each node of balanced, binary tree with m levels requires $2^{m + 1} - 1$ floats. Assuming single precision ( $4 = 2^{2}$ bytes) floats, this would naively suggest that an $m = 34$ level tree could be accommodated. In practice, the memory must be shared with the operating system and other executing programs. In addition, cache sizes are much smaller, which significantly degrade computational time. Thus, values of $m \sim 30$ appear more achievable. Advanced computational hardware, such as GPUs and high performance computing clusters can make larger values of m achievable.
30	For $ϕ = 1 / 2$ , machine epsilon is reached by $n = 24$ (single precision) or $n = 53$ (double precision).
31	Specifically we employed the R program ts_boot() using block resampling with block lengths having a geometric distribution with mean length n.
32	We note that these results are based upon daily closing prices. We make no inferences for returns based upon other price intervals.

References

Aït-Sahalia, Y., & Jacod, J. (2014). High-frequency financial econometrics. Princeton University Press. [Google Scholar]
Bachelier, L. (1900). Théorie de la spéculation. Annales Scientifiques de l’École Normale Supérieure, 3(17), 21–86. [Google Scholar] [CrossRef]
Beran, J. (2017). Statistics for long-memory processes. Chapman & Hall/CRC. [Google Scholar]
Billingsley, P. (2013). Convergence of probability measures. John Wiley & Sons. [Google Scholar]
Black, F., & Scholes, M. (1973). The pricing of options and corporate liabilities. Journal of Political Economy, 81(3), 637–654. [Google Scholar] [CrossRef]
Boyle, P. P. (1986). Option Valuation Using a Three Jump Process. International Options Journal, 3, 7–12. [Google Scholar]
Breen, R. (1991). The accelerated binomial option pricing model. Journal of Financial and Quantitative Analysis, 26(2), 153–164. [Google Scholar] [CrossRef]
Cheridito, P. (2003). Arbitrage in fractional Brownian motion models. Finance and Stochastics, 7, 533–553. [Google Scholar] [CrossRef]
Cherny, A. S., Shiryaev, A. N., & Yor, M. (2003). Limit behavior of the “horizontal-vertical” random walk and some extensions of the Donsker-Prokhorov invariance principle. Theory of Probability & Its Applications, 47(3), 377–394. [Google Scholar]
Cordero, F., Klein, I., & Perez-Ostafe, L. (2014). Binary markets under transaction costs. International Journal of Theoretical and Applied Finance, 17(5), 1450030. [Google Scholar] [CrossRef]
Cox, J. C., & Ross, S. A. (1976). The valuation of options for alternative stochastic processes. Journal of Financial Economics, 3(1), 145–166. [Google Scholar] [CrossRef]
Cox, J. C., Ross, S. A., & Rubinstein, M. (1979). Option pricing: A simplified approach. Journal of Financial Economics, 7(3), 229–263. [Google Scholar] [CrossRef]
Daley, D. J., & Vere-Jones, D. (2003). An introduction to the theory of point processes, Volume I: Elementary theory and methods. Springer. [Google Scholar]
Daley, D. J., & Vere-Jones, D. (2008). An introduction to the theory of point processes, Volume II: General theory and structure. Springer. [Google Scholar]
Davydov, Y., & Rotar, V. (2008). On a non-classical invariance principle. Statistics & Probability Letters, 78(14), 2031–2038. [Google Scholar]
Delbaen, F., & Schachermayer, W. (1994). A general version of the fundamental theorem of asset pricing. Mathematische Annalen, 300(1), 463–520. [Google Scholar] [CrossRef]
Derman, E., Kani, I., & Chriss, N. (1996). Implied trinomial trees of the volatility smile (Quantitative Strategies Research Notes). Goldman Sachs. [Google Scholar]
Dolinsky, Y., & Neufeld, A. (2018). Super-replication in fully incomplete markets. Mathematical Finance, 28(2), 483–515. [Google Scholar] [CrossRef]
Duan, J.-C., Ritchken, P., & Sun, Z. (2006). Approximating GARCH-jump models, jump-diffusion processes, and option pricing. Mathematical Finance, 16(1), 21–52. [Google Scholar] [CrossRef]
Duffie, D. (2001). Dynamic asset pricing theory. Princeton University Press. [Google Scholar]
Dzhaparidze, K. O., & van Zuijlen, M. C. A. (1996). Introduction to option pricing in a securities market I: Binary models. CWI Quarterly, 9(4), 319–355. [Google Scholar]
Easley, D., & O’Hara, M. (1995). Market microstructure. Handbooks in Operations Research and Management Science, 9, 357–383. [Google Scholar]
Easley, D., & O’Hara, M. (2003). Microstructure and asset pricing. Handbook of the Economics of Finance, 1, 1021–1051. [Google Scholar]
Eberlein, E., & Prause, K. (2000). The generalized hyperbolic model: Financial derivatives and risk measures. In H. Geman, D. Madan, S. R. Pliska, & T. Vorst (Eds.), Mathematical finance—Bachelier Congress 2000 (pp. 245–267). Springer. [Google Scholar]
El Karoui, N., & Quenez, M.-C. (1995). Dynamic programming and pricing of contingent claims in an incomplete market. SIAM Journal on Control and Optimization, 31(1), 29–66. [Google Scholar] [CrossRef]
Fan, J., Imerman, M. B., & Dai, W. (2016). What does the volatility risk premium say about liquidity provision and demand for hedging tail risk? Journal of Business & Economic Statistics, 34(4), 519–535. [Google Scholar]
Garman, M. B. (1976). Market microstructure. Journal of Financial Economics, 3(3), 257–275. [Google Scholar] [CrossRef]
Hamilton, J. D. (2020). Time series analysis. Princeton University Press. [Google Scholar]
Harrison, J. M., & Kreps, D. M. (1979). Martingales and arbitrage in multiperiod securities markets. Journal of Economic Theory, 20(3), 381–408. [Google Scholar] [CrossRef]
Harrison, J. M., & Pliska, S. R. (1981). Martingales and stochastic integrals in the theory of continuous trading. Stochastic Processes and Their Applications, 11(3), 215–260. [Google Scholar] [CrossRef]
Hasbrouck, J. (1988). Trades, quotes, inventories, and information. Journal of Financial Economics, 22(2), 229–252. [Google Scholar] [CrossRef]
Hasbrouck, J. (1996). Modeling market microstructure time series. In G. S. Maddala, & C. R. Rao (Eds.), Statistical methods in finance (Vol. 14, pp. 647–692). Elsevier. [Google Scholar]
Hasbrouck, J. (2007). Empirical market microstructure: The institutions, economics, and econometrics of securities trading. Oxford University Press. [Google Scholar]
Hu, Y., Shirvani, A., Lindquist, W. B., Fabozzi, F. J., & Rachev, S. T. (2020a). Option pricing incorporating factor dynamics in complete markets. Journal of Risk and Financial Management, 13(12), 321. [Google Scholar] [CrossRef]
Hu, Y., Shirvani, A., Stoyanov, S., Kim, Y. S., Fabozzi, F. J., & Rachev, S. T. (2020b). Option pricing in markets with informed traders. International Journal of Theoretical and Applied Finance, 23(6), 2050037. [Google Scholar] [CrossRef]
Hull, J. (2006). Options, futures, & other derivatives: Solutions manual. Prentice Hall International. [Google Scholar]
Hung, M., & Swallow, W. H. (1999). Robustness of group testing in the estimation of proportions. Biometrics, 55(1), 231–237. [Google Scholar] [CrossRef]
Jacod, J., & Protter, P. (2012). Discretization of processes (Vol. 67). Springer. [Google Scholar]
Jarrow, R. A., Protter, P., & Sayit, H. (2009). No arbitrage without semimartingales. The Journal of Applied Probability, 19(2), 596–616. [Google Scholar] [CrossRef]
Jarrow, R. A., & Rudd, A. (1982). Approximate option valuation for arbitrary stochastic processes. Journal of Financial Economics, 10(3), 347–369. [Google Scholar] [CrossRef]
Kallenberg, O. (1997). Foundations of modern probability. Springer. [Google Scholar]
Karatzas, I. (1997). Lectures in the mathematics of finance. American Mathematical Society. [Google Scholar]
Kim, Y. S., Stoyanov, S., Rachev, S., & Fabozzi, F. (2016). Multi-purpose binomial model: Fitting all moments to the underlying geometric Brownian motion. Economics Letters, 145, 225–229. [Google Scholar] [CrossRef]
Kim, Y. S., Stoyanov, S., Rachev, S., & Fabozzi, F. J. (2019). Enhancing binomial and trinomial equity option pricing models. Finance Research Letters, 28, 185–190. [Google Scholar] [CrossRef]
Lo, A. W., Mamaysky, H., & Wang, J. (2000). Foundations of technical analysis: Computational algorithms, statistical inference, and empirical implementation. The Journal of Finance, 55(4), 1705–1765. [Google Scholar] [CrossRef]
Löhne, A., & Rudloff, B. (2014). An algorithm for calculating the set of superhedging portfolios in markets with transaction costs. International Journal of Theoretical and Applied Finance, 17(2), 1450012. [Google Scholar] [CrossRef]
Mandelbrot, B. B. (2001). Stochastic volatility, power laws and long memory. Quantitative Finance, 1(6), 558–559. [Google Scholar] [CrossRef]
Merton, R. C. (1973). Theory of rational option pricing. The Bell Journal of Economics and Management Science, 4(1), 141–183. [Google Scholar] [CrossRef]
Merton, R. C. (1976). Option pricing when underlying stock returns are discontinuous. Journal of Financial Economics, 3(1), 125–144. [Google Scholar] [CrossRef]
Mills, T. C. (2019). Applied time series analysis: A practical guide to modeling and forecasting. Academic Press. [Google Scholar]
Mitov, G. K., Rachev, S. T., Kim, Y. S., & Fabozzi, F. J. (2009). Barrier option pricing by branching processes. International Journal of Theoretical and Applied Finance, 12(7), 1055–1073. [Google Scholar] [CrossRef]
Mittnik, S., Fabozzi, F. J., Focardi, S. M., Rachev, S. T., & Jašić, T. (2007). Financial econometrics: From basics to advanced modeling techniques. John Wiley & Sons. [Google Scholar]
O’Hara, M. (1997). Market microstructure theory. John Wiley & Sons. [Google Scholar]
O’Hara, M. (1999). Making market microstructure matter. Financial Management, 28(2), 83–90. [Google Scholar] [CrossRef]
Oksendal, B. (2013). Stochastic differential equations: An introduction with applications. Springer. [Google Scholar]
Rachev, S. T., Kim, Y. S., Bianchi, M. L., & Fabozzi, F. J. (2011). Financial models with Lévy processes and volatility clustering. John Wiley & Sons. [Google Scholar]
Roll, R. (1984). A simple implicit measure of the effective bid-ask spread in an efficient market. The Journal of Finance, 39(4), 1127–1139. [Google Scholar]
Rouge, R., & El Karoui, N. (2000). Pricing via utility maximization and entropy. Mathematical Finance, 10(2), 259–276. [Google Scholar] [CrossRef]
Rubinstein, M. (1994). Implied binomial trees. The Journal of Finance, 49(3), 771–818. [Google Scholar] [CrossRef]
Rubinstein, M. (1998). Edgeworth binomial trees. Journal of Derivatives, 5(3), 20–27. [Google Scholar] [CrossRef]
Samura, S. K., Mao, J., & Yao, D. (2013). Semimartingale property and its connection to arbitrage. Journal of Mathematical Finance, 3, 237–241. [Google Scholar] [CrossRef]
Schoutens, W. (2003). Lévy processes in finance: Pricing financial derivatives. John Wiley & Sons. [Google Scholar]
Shiryaev, A. N. (1999). Essentials of stochastic finance: Facts, models, theory (O. E. Barndorff-Nielsen, Ed.; Vol. 3). World Scientific. [Google Scholar]
Skorokhod, A. V. (1997). Measure-valued diffusion. Ukranian Mathematical Journal, 49, 506–513. [Google Scholar] [CrossRef]
Skorokhod, A. V. (2005). Basic principles and applications of probability theory. Springer. [Google Scholar]

Figure 1. Nomenclature used to label the binary information tree

{BIT}_{4}

. (a) Level values

l_{n}

and local indices

ϵ_{n}^{(l_{n})}

. (b) Trajectory labels,

L_{n}

and

M_{n}

,

n = 0, \dots, 3

. (c) An illustration of two of the probabilities

p_{3}^{(L_{3}, M_{3})}

.

Figure 1. Nomenclature used to label the binary information tree

{BIT}_{4}

. (a) Level values

l_{n}

and local indices

ϵ_{n}^{(l_{n})}

. (b) Trajectory labels,

L_{n}

and

M_{n}

,

n = 0, \dots, 3

. (c) An illustration of two of the probabilities

p_{3}^{(L_{3}, M_{3})}

.

Figure 2. Event

E_{n}^{(L_{n}, M_{n})}

and event path

E_{n, t_{n}}^{(L_{n}, M_{n})}

labeling for

{BIT}_{4}

. To preserve figure clarity, only half of the events are labeled at

t_{3}

.

Figure 2. Event

E_{n}^{(L_{n}, M_{n})}

and event path

E_{n, t_{n}}^{(L_{n}, M_{n})}

labeling for

{BIT}_{4}

. To preserve figure clarity, only half of the events are labeled at

t_{3}

.

Figure 3. Illustration of the path dependence of the riskless rates

r_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d, f)}

. For figure clarity, each riskless rate is indicated by the shortened form

r_{t_{n}; (ϵ_{1}^{l_{1}} = m_{1}, \dots, ϵ_{n}^{l_{n}} = m_{n})}

.

Figure 3. Illustration of the path dependence of the riskless rates

r_{t_{n}; E_{n - 1}^{(L_{n - 1}, M_{n - 1})}}^{(d, f)}

. For figure clarity, each riskless rate is indicated by the shortened form

r_{t_{n}; (ϵ_{1}^{l_{1}} = m_{1}, \dots, ϵ_{n}^{l_{n}} = m_{n})}

.

Figure 4. Illustration of the path dependence of the stock prices

S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)}

. For figure clarity, each price is indicated by the shortened form

S_{t_{n}; (ϵ_{1}^{l_{1}} = m_{1}, \dots, ϵ_{n}^{l_{n}} = m_{n})}

.

Figure 4. Illustration of the path dependence of the stock prices

S_{t_{n}; E_{n}^{(L_{n}, M_{n})}}^{(d)}

. For figure clarity, each price is indicated by the shortened form

S_{t_{n}; (ϵ_{1}^{l_{1}} = m_{1}, \dots, ϵ_{n}^{l_{n}} = m_{n})}

.

Figure 5. Illustration of the stock price

S^{(d)}

, price ratio s, return

r^{(d)}

, riskless bank account value

β^{(d)}

, riskless rate

r^{(d, f)}

, and moments

μ^{(r)}

and

σ^{(r)}

corresponding to the

[t_{n}, t_{n + 1})

segments of each of the event paths

E_{n, t}^{(L_{n}, (M_{n - 1}, m_{n} = 1))}

and

E_{n, t}^{(L_{n}, (M_{n - 1}, m_{n} = 0))}

. Note that the r,

β

,

μ

, and

σ

values are common to both path segments.

Figure 5. Illustration of the stock price

S^{(d)}

, price ratio s, return

r^{(d)}

, riskless bank account value

β^{(d)}

, riskless rate

r^{(d, f)}

, and moments

μ^{(r)}

and

σ^{(r)}

corresponding to the

[t_{n}, t_{n + 1})

segments of each of the event paths

E_{n, t}^{(L_{n}, (M_{n - 1}, m_{n} = 1))}

and

E_{n, t}^{(L_{n}, (M_{n - 1}, m_{n} = 0))}

. Note that the r,

β

,

μ

, and

σ

values are common to both path segments.

Figure 6. Illustration of the path-dependent, no-arbitrage conditions (35) on the binary tree at an event

E_{n}^{(L_{n}, M_{n})}

.

Figure 6. Illustration of the path-dependent, no-arbitrage conditions (35) on the binary tree at an event

E_{n}^{(L_{n}, M_{n})}

.

Figure 7. Each node in the ABWN model has the structure presented on the left. The triangular unit presented on the right is replicated everywhere on the BMA(1) tree. The value of

z_{t_{n - 2}}^{(l_{n - 2})}

is either

z_{u}

or

z_{d}

.

Figure 7. Each node in the ABWN model has the structure presented on the left. The triangular unit presented on the right is replicated everywhere on the BMA(1) tree. The value of

z_{t_{n - 2}}^{(l_{n - 2})}

is either

z_{u}

or

z_{d}

.

Figure 8. Illustration of the prices on a

{BIT}_{5}

BMA(1) tree. The bold numbers represent the vector,

n α_{1} α_{2} δ_{1} δ_{2}

, of coefficients in (56) used to compute each price. The italicized numbers represent the coefficient vector denoting the price change along the indicated branch of the tree.

Figure 8. Illustration of the prices on a

{BIT}_{5}

BMA(1) tree. The bold numbers represent the vector,

n α_{1} α_{2} δ_{1} δ_{2}

, of coefficients in (56) used to compute each price. The italicized numbers represent the coefficient vector denoting the price change along the indicated branch of the tree.

Figure 9. The probabilities

p_{n}^{(x)}

obtained from the DIA data set without and with bootstrap resampling for (left plots)

n = 4

and (right plots)

n = 6

. The dashed horizontal line denotes

2^{- n}

. The red horizontal lines denote the 95% confidence interval.

Figure 9. The probabilities

p_{n}^{(x)}

obtained from the DIA data set without and with bootstrap resampling for (left plots)

n = 4

and (right plots)

n = 6

. The dashed horizontal line denotes

2^{- n}

. The red horizontal lines denote the 95% confidence interval.

Figure 10. The probabilities

p_{n}^{(x)}

obtained from the DIA data set without and with bootstrap resampling for

n = 8

. The dashed horizontal line denotes

2^{- n}

. The red horizontal lines denote the 95% confidence interval.

Figure 10. The probabilities

p_{n}^{(x)}

obtained from the DIA data set without and with bootstrap resampling for

n = 8

. The dashed horizontal line denotes

2^{- n}

. The red horizontal lines denote the 95% confidence interval.

Figure 11. Normal probability plots (solid dots) for the DIA probabilities computed using bootstrap resampling for (left)

n = 4

and (right)

n = 10

. The dotted line and equation are the results of the linear fit to the data.

Figure 11. Normal probability plots (solid dots) for the DIA probabilities computed using bootstrap resampling for (left)

n = 4

and (right)

n = 10

. The dotted line and equation are the results of the linear fit to the data.

Figure 12. Plotted for sequences of length n are the n highest- (blue, gold) and n lowest- (red, green) probability paths for the data (top) without and (bottom) with bootstrap resampling. The highest-probability path is colored gold, the lowest-probability path is green.

Figure 13. The highest- and lowest-probability paths plotted for

n = {4, 5, 6, 7, 8, 10}

. Note that the color used for a smaller-n path obscures the color used for a larger-n path if both have a segment occurring on the same branch of the tree.

Figure 13. The highest- and lowest-probability paths plotted for

n = {4, 5, 6, 7, 8, 10}

. Note that the color used for a smaller-n path obscures the color used for a larger-n path if both have a segment occurring on the same branch of the tree.

Figure 14. Box–whisker summaries of the computed distributions of probability estimates for each of the

2^{n}

sequences for

n = {2, 4, 6}

. The empirical distributions were obtained from the DIA data set using a rolling window of 15 years.

Figure 14. Box–whisker summaries of the computed distributions of probability estimates for each of the

2^{n}

sequences for

n = {2, 4, 6}

. The empirical distributions were obtained from the DIA data set using a rolling window of 15 years.

Figure 15. The box–whisker summaries of Figure 14 reordered into categories based on the number of price up- and downturns occurring in the sequence.

Table 1. Values of the coefficients

α_{j, i}

and

δ_{j, i}

,

j = 1, 2

, in (56).

Table 1. Values of the coefficients

α_{j, i}

and

δ_{j, i}

,

j = 1, 2

, in (56).

	n Even				n Odd
i	$α_{1, i}$	$α_{2, i}$	$δ_{1, i}$	$δ_{2, i}$	$α_{1, i}$	$α_{2, i}$	$δ_{1, i}$	$δ_{2, i}$
$2 n$	n	0	$n - 1$	0	n	0	$n - 1$	0
$2 n - 1$	$n - 1$	1	$n - 1$	0	$n - 1$	1	$n - 1$	0
$2 n - 2$	$n - 1$	1	$n - 2$	1	$n - 1$	1	$n - 2$	1
$2 n - 3$	$n - 2$	2	$n - 2$	1	$n - 2$	2	$n - 2$	1
⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮
$n + 1$	$\frac{n}{2}$	$\frac{n}{2}$	$\frac{n}{2}$	$\frac{n}{2} - 1$	$\frac{n + 1}{2}$	$\frac{n - 1}{2}$	$\frac{n - 1}{2}$	$\frac{n - 1}{2}$
n	$\frac{n}{2}$	$\frac{n}{2}$	$\frac{n}{2} - 1$	$\frac{n}{2}$	$\frac{n - 1}{2}$	$\frac{n + 1}{2}$	$\frac{n - 1}{2}$	$\frac{n - 1}{2}$
⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮
4	2	$n - 2$	1	$n - 2$	2	$n - 2$	1	$n - 2$
3	1	$n - 1$	1	$n - 2$	1	$n - 1$	1	$n - 2$
2	1	$n - 1$	0	$n - 1$	1	$n - 1$	0	$n - 1$
1	0	n	0	$n - 1$	0	n	0	$n - 1$

Table 2. Expected number of occurrences of each sequence in patterns of length n in the DIA return data set covering the period of prices from 20 January 1998 through 5 May 2023.

Pattern	Number of	Number of	Expected Number of
Length	Sequences	Non-Overlapping	Pattern Sequences in
( $n$ )	in Pattern	Intervals	$V$ Non-Overlapping
	( $2^{n}$ )	( $V$ )	Intervals
4	16	1591	99.4
5	32	1273	39.8
6	64	1060	16.6
8	256	795	3.1
10	1024	636	0.6

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lauria, D.; Lindquist, W.B.; Rachev, S.T.; Hu, Y. Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework. J. Risk Financial Manag. 2025, 18, 230. https://doi.org/10.3390/jrfm18050230

AMA Style

Lauria D, Lindquist WB, Rachev ST, Hu Y. Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework. Journal of Risk and Financial Management. 2025; 18(5):230. https://doi.org/10.3390/jrfm18050230

Chicago/Turabian Style

Lauria, Davide, W. Brent Lindquist, Svetlozar T. Rachev, and Yuan Hu. 2025. "Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework" Journal of Risk and Financial Management 18, no. 5: 230. https://doi.org/10.3390/jrfm18050230

APA Style

Lauria, D., Lindquist, W. B., Rachev, S. T., & Hu, Y. (2025). Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework. Journal of Risk and Financial Management, 18(5), 230. https://doi.org/10.3390/jrfm18050230

Article Menu

Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework

Abstract

1. Introduction

2. A Binary Information Tree for Pricing

Estimation of Probabilities

3. Dynamics of the Riskless Rate and Bank Account Value on ${BT}_{m}$

4. Stock Price Dynamics on ${BT}_{m}$

5. Risk-Neutral Dynamics on ${BT}_{m}$ : Option Pricing

6. Limiting Dynamics of Binary Pricing Trees

7. Stock Pricing—Special Cases

7.1. Asymmetric Binary White Noise

7.2. Binary Moving Average of Order 1

7.3. Autoregressive of Order 1

8. Computational Simplifications

8.1. ABWN Model

8.2. BMA(1) Model

8.3. BAR(1) Model

9. Technical Analyses of the Probability Estimates

10. Discussion

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Sequential Definition of the Probability Law on the BIT for n = 2 and 3

Appendix B. Path Probability Tables for Assets in the DJIA Index

Notes

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework

Abstract

1. Introduction

2. A Binary Information Tree for Pricing

Estimation of Probabilities

3. Dynamics of the Riskless Rate and Bank Account Value on BT m

4. Stock Price Dynamics on BT m

5. Risk-Neutral Dynamics on BT m : Option Pricing

6. Limiting Dynamics of Binary Pricing Trees

7. Stock Pricing—Special Cases

7.1. Asymmetric Binary White Noise

7.2. Binary Moving Average of Order 1

7.3. Autoregressive of Order 1

8. Computational Simplifications

8.1. ABWN Model

8.2. BMA(1) Model

8.3. BAR(1) Model

9. Technical Analyses of the Probability Estimates

10. Discussion

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Sequential Definition of the Probability Law on the BIT for n = 2 and 3

Appendix B. Path Probability Tables for Assets in the DJIA Index

Notes

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3. Dynamics of the Riskless Rate and Bank Account Value on ${BT}_{m}$

4. Stock Price Dynamics on ${BT}_{m}$

5. Risk-Neutral Dynamics on ${BT}_{m}$ : Option Pricing