All articles published by MDPI are made immediately available worldwide under an open access license. No special
permission is required to reuse all or part of the article published by MDPI, including figures and tables. For
articles published under an open access Creative Common CC BY license, any part of the article may be reused without
permission provided that the original article is clearly cited. For more information, please refer to
https://www.mdpi.com/openaccess.
Feature papers represent the most advanced research with significant potential for high impact in the field. A Feature
Paper should be a substantial original Article that involves several techniques or approaches, provides an outlook for
future research directions and describes possible research applications.
Feature papers are submitted upon individual invitation or recommendation by the scientific editors and must receive
positive feedback from the reviewers.
Editor’s Choice articles are based on recommendations by the scientific editors of MDPI journals from around the world.
Editors select a small number of articles recently published in the journal that they believe will be particularly
interesting to readers, or important in the respective research area. The aim is to provide a snapshot of some of the
most exciting work published in the various research areas of the journal.
We introduce a binary tree for pricing contingent claims when the underlying security prices exhibit history dependence. We apply the model to the specific cases of moving-average and autoregressive behavior that are characteristic of price histories induced by market microstructure behavior. Our model is market-complete and arbitrage-free. When passing to the risk-neutral measure, the model preserves all parameters governing the natural-world price dynamics, including the instantaneous mean of the asset return and the instantaneous probabilities for the direction of asset price movement. This preservation holds for arbitrarily small, but non-zero, time increments characteristic of market microstructure transactions. In the (unrealistic) limit of continuous trading, the model reduces to continuous diffusion price processes, with the concomitant loss of the microstructure information.
“At the level of transactions prices, …, the random walk conjecture is …a hypothesis that is very easy to reject in most markets even in small data samples. In microstructure, the question is not ‘whether’ transactions prices differ from a random walk, but rather ‘how much’ and ‘why?”’
Dynamic asset pricing theory, as introduced1 by Black and Scholes (1973) and Merton (1973) (BSM), is based on the concepts of no-arbitrage opportunity and replicating portfolios, along with a set of assumptions that can be classified into two groups. The first group of assumptions concerns the microstructure of the market: the rules under which trades are performed; the impact of transaction and timing costs; the role of information and its disclosure; discovery and formation of prices; volatility; liquidity; and market-maker and investor behavior. Under the assumptions of the BSM model, any trade is executed without taxes, transaction costs, and amount restriction (the market is frictionless); traders are price takers with symmetric information (a perfectly competitive market) and are able to trade any amount (no liquidity constraints) over any infinitesimally small time interval (continuous trading); and the market is assumed to be efficient (all relevant information is embedded in the market price), liquid (every order is executed instantaneously at the current equilibrium price), and free of arbitrage opportunity. The second group of assumptions is related to the choice of geometric Brownian motion (GBM) as the stochastic process describing the price dynamics of the security underlying the option contract. The assumption of GBM invokes a strong set of restrictions, including constant volatility, the normal distribution of log returns, and absence of long memory.2
The BSM model provides analytic solutions and an elegant machinery for computing the price of a European option; however, many of the hypotheses upon which it is rooted have been shown to be too restrictive. It is well known that many of the empirical properties3 of stock price returns are not consistent with the assumption of GBM. Consequently, a range of alternative models have been proposed to include various stylized facts. Another problem with the original BSM model is that it does not provide solutions for more complex contingent claims, such as those with a path-dependent pay-off (e.g., American options) or those whose underlying risk is not fully priced in financial markets, leading to market incompleteness. The most baffling result of the BSM model is that the option price does not depend on the drift of the underlying security. This puzzle was clarified in the subsequent work of Cox and Ross (1976) and Merton (1976), and then reformulated in terms of the risk-neutral measure by Harrison and Kreps (1979) and Harrison and Pliska (1981). The concept of the risk-neutral price was subsequently accepted and continuous-time models have proliferated.
Subsequent developments to the solution of the continuous-time pricing problem can be interpreted as improvements in one of two directions. The first is directed to the stochastic process driving the price dynamics in order to incorporate more statistical features of real price processes. This direction has produced the following strong result, known as the general version of the fundamental theorem of asset pricing (Delbaen & Schachermayer, 1994): “if a stochastic price process is a bounded, real-valued semimartingale, there is an equivalent martingale measure (EMM) for if and only if satisfies the condition of “no free lunch with vanishing risk” (NFLVR), where NFLVR is a generalization of the no-arbitrage condition.4,5 One consequence of this continuous-time fundamental theorem is the necessity to work within the confines of stochastic integration theory. A more critical consequence is that the general version of the fundamental theorem of asset pricing does not guarantee a unique EMM. In fact, there can be uncountably many EMMs, allowing for all possible option pricings within natural bounds. In incomplete markets, where perfect replication is not possible, hedgers seek alternative approaches to mitigate risk in a cost-efficient manner. Several alternative strategies exist, each with their own advantages and drawbacks, but they are still not entirely attractive due to their costs and limitations (Dolinsky & Neufeld, 2018; El Karoui & Quenez, 1995; Karatzas, 1997; Löhne & Rudloff, 2014; Rouge & El Karoui, 2000).
The second direction is to develop methods for solving pricing problems having no known analytic solution, due either to the complexity of the stochastic process or the complexity of the pay-off function associated with a contingent claim. The binomial option pricing model proposed by Cox et al. (1979) (CRR) was the first approach to pricing American options without sacrificing the intellectual machinery developed under the BSM model. CRR utilized a discrete-time, binomial lattice graph to describe the evolution of the price process of the underlying security. The discrete process was designed to converge to GBM as the time interval between two successive trades converged to zero. There was no intention in the CRR model to use the discrete setting to incorporate other stylized facts of asset returns. Other discrete models—utilizing binomial or trinomial lattices, or binary trees—have been developed to numerically price contingent claims under more complex assumptions, such as stochastic volatility or jump processes (see, e.g., Boyle, 1986; Derman et al., 1996; Rubinstein, 1994, 1998). Again, these discrete models have been designed to converge to a solution of a continuous-time stochastic process. This is usually ensured by setting moment-matching conditions in order to apply Donsker’s invariance principle (Billingsley, 2013). Using discrete models avoids working explicitly with stochastic integration theory.6
As noted above, the BSM model is very restrictive with regards to its incorporation of the details of market microstructure.7 In seminal work, Roll (1984) showed that, in an efficient market, the effective bid–ask spread can be measured by , where is the first-order serial covariance of price changes. Crucially, Roll’s reasoning was based upon analysis of a discrete-time model. We briefly recapitulate Roll’s model to indicate the connection with our work.8 Roll considered a martingale-efficient price for an asset evolving as , where represents independent, identically distributed, mean-zero random variables. A trade (buy or sell) at time t through a dealer results in a transaction price
where denotes a sale to a dealer, while denotes a purchase from a dealer. The value is the bid–ask spread. Under the buy–sell assumptions and for all , the variance and first-order covariance of are
Solving for c produces Roll’s spread formula . Solving for results in . The values and can be estimated from historical transaction prices to obtain estimates for the model parameters c and .
In his treatise on market microstructure, Hasbrouck (2007) describes several discrete-time empirical market microstructure models which build upon Roll’s bid–ask model. The models are designed to capture, in various ways, the price formation process, incorporating the sequence of actions and reactions between market makers and traders. Using the binary-tree model developed in this paper, in Section 7 we develop a base model of binary white noise (BWN). Upon this, we build two binary-tree models which capture, successively, moving-average-of-order-one (MA(1)) and autoregressive-of-order-one (AR(1)) risky-asset price behaviors.
The works of Kim et al. (2016, 2019) and Hu et al. (2020a, 2020b) have shown that binomial pricing trees have sufficient flexibility to capture some of the stylized facts of price dynamics for option pricing in complete discrete-time markets (enabling a unique hedging strategy). These include the preservation, from the natural world to the risk-neutral valuation, of the probabilities of the natural-world stock-price directions; the mean and higher moments of returns; and the effects of noisy, informed, and misinformed traders. However, binomial trees are too simplistic to accommodate either the autoregressive or moving-average behavior of asset prices. Our thesis in this paper is that binary pricing trees are crucial for developing dynamic asset pricing models that incorporate such phenomena.
To further clarify the need for binary pricing trees, recall the fundamental pricing model in continuous time for a market consisting of a single bond and stock. The continuous-time bond price dynamics are given by
Inclusion of microstructure features modifies the stock log-price dynamics, which can be written in discrete form as
where indicate the times at which the microstructure features associated with a particular market “actor” (such as a trader) are realized.9 The microstructure dynamics , , are determined by (for example) a moving-average process MA(q) of order (Mills, 2019, Chapter 3):
Here, when , and , are independent, identically distributed random variables with zero mean and specified variance. As can be seen, for example, as shown by O’Hara (1999, Figure 1), when general microstructure features are included in the observed log-prices, the recombining binomial pricing tree is no longer an appropriate model for the stock-price dynamics and an extension to a binary (i.e., non-recombining) pricing model must be introduced.
The fundamental asset pricing theorem of Delbaen and Schachermayer (1994) requires the ability to trade in continuous time. But market microstructure phenomena occur at discrete times. The resultant observed process (3), being a combination of a semimartingale plus discrete-time microstructure noise, is therefore not a semimartingale, and the fundamental asset pricing theorem cannot be applied. Cheridito (2003) showed that fractional Brownian motion can be used as a price process and still maintain NFLVR by “introducing a minimal amount of time that must lie between two consecutive transactions”. Jarrow et al. (2009) extended this work to show that arbitrage-free price processes can be obtained without reliance on semimartingales provided continuous-time trading is not allowed (although the finite time intervals can be arbitrarily small).
We therefore adopt the discrete-time perspectives of Cheridito and Jarrow10 and present a discrete, binary-tree approach that is general enough to reproduce the statistical properties of real prices and encompass a class of models that are used in microstructure theory (see Easley & O’Hara, 1995, 2003; Hasbrouck, 2007; Fan et al., 2016; Aït-Sahalia & Jacod, 2014, Chapter 2). To this end, we apply the general approach of Hu et al. (2020a, 2020b) to binary-tree option pricing models.11 In Section 2, we develop a discrete binary tree (binary information tree) supporting random walks. In Section 3 and Section 4, we develop discrete-time pricing on this tree for a riskless rate, a riskless bank account, and a risky asset. Using a self-financing portfolio formulation, computation of the risk-neutral measure and pricing of options is discussed in Section 5.
In general, a random walk on a non-recombined binary tree (which is a particular case of an arbitrary branching process) will converge to a measure-valued diffusion (Daley & Vere-Jones, 2003, 2008; Mitov et al., 2009; Skorokhod, 1997). In Section 6, we show that, under the well-known Donsker–Prokhorov invariance principle (for constant instantaneous mean return and variance) or the Davydov–Rotar invariance principle (for time-dependent instantaneous mean return and variance), the restrictions of these invariance principles unfortunately require that a non-recombined random walk approach a classical random walk, which, in the continuum limit, produces price processes such as as GBM (under the Donsker–Prokhorov invariance principle) or continuous diffusion (under the Davydov–Rotar invariance principle), resulting in the concomitant loss of the microstructure information.
The principal contributions of this theoretical paper are the following. We develop a binary-tree framework that supports market-complete, arbitrage-free, option pricing models having path-dependent price processes (Section 2, Section 3, Section 4 and Section 5). We show (Section 6) that the attempt to consider the continuous-time limit of this binary-tree framework is self-defeating, as critical natural world parameters are lost in the continuum limit. We apply (Section 7) this general framework to construct models in which a component driving the random price behavior is described by BWN—mimicking market microstructure. We further incorporate this underlying microstructure into two path models, in which the risky-asset prices display either MA(1) or AR(1) path dependence. In Section 8, we discuss the simplifications inherent in these three models that reduce the complexity of computing option prices. We show that, as a result of partial recombination, the computational complexity of the MA(1) (and consequently of the BWN) model is rather than the expected of non-recombining binomial trees.
As the price-direction probabilities are preserved in the risk-neutral dynamics of our framework, we delve (Section 9) into a technical analysis of the probabilities governing sequences of price changes. We compute sequence probabilities empirically, based upon daily closing prices of an ETF and for stocks comprising the Dow Jones Industrial Average. This analysis provides a test of the efficient market hypothesis for daily data. By categorizing price-change sequences, our results indicate that the efficient market hypothesis operates within categories, but not across categories.
A brief consideration of future directions is provided in Section 10.
2. A Binary Information Tree for Pricing
Binomial (recombining binary) trees are described by simple lattice notation. Each node on the binomial lattice is labeled by a time and price-state index. If the lattice has time steps, , then time step n has price states. Each node p, corresponding to a price state on a binomial tree, has the unique label , , . Employing graph theory terminology, there are unique paths from the root to node , characterized by the number of unique sequences of length n created using k values of “up” and values of “down”. However, the price at node depends only on the numbers k and and not on the individual sequences. In describing a binomial tree, it is common practice to provide only the single time interval and refer to the price states as “up” and “down” (along with the formula for computing each).
For binary trees,12 the situation is more complex. Time step n now has price states (nodes), and one would anticipate that the labeling , , would be sufficient. However, non-recombining binary trees have the property that there is a unique path between the root () and each node of the tree. Each path is distinguished by a unique sequence of “up” and “down” movements on the tree, and the price at each node generally depends on that unique sequence. We have chosen a notation that makes path labeling possible through a recorded sequence of local up (1) and down (0) movements. This notation also supports the flexibility for parameters, such as the probabilities governing price direction change, to vary along the tree.
Panel (a) in Figure 1 illustrates our labeling for an binary tree. At time step n, each price state (node) on the tree is labeled with a “level” index , , , , and a local state index , , with . For (Figure 1a) the price states are grouped into two levels, the lower two price states belonging to level and the upper two to . In the level, the two price states (nodes) are labeled (“down”) and (“up”). Analogous labeling occurs for the two nodes.
Panel (b) in Figure 1 illustrates path labeling for an binary tree. Each level is reached via a uniquely labeled path through other levels. For example, the level (which contains two leaf nodes) is reached via the “level sequence” . Each node is reached via a uniquely labeled “node sequence” of local up and down price changes. For example, the node is reached via the node sequence . We note that, given a specific node sequence , it is possible to reconstruct its corresponding level sequence . Specifically, for , then , with
where denotes the decimal value of the binary integer .13 Note, however, that the reverse is not possible. Given the node sequence, , it is possible only to compute through , leaving unknown. For simplicity, when employed we will give both the relevant and sequences.
With this discussion of the indexing on tree levels, nodes, and paths, we proceed with the development of the dynamics on the tree. We define a discrete-time-filtered probability space (discrete-time stochastic basis) with the discrete filtration , , where . Define . The sigma fields, , are generated by a sequence of dependent binary random variables , , such that . The triangular array of binary random variables is defined on a probability space , with probability laws
The probability space is a standard probability space; without loss of generality, we can assume it is the Lebesgue probability space , , .14 We define the probability law for sequentially in time, defining the dynamics of .
For , set , , and .
For , set . Then, with
The conditional probabilities satisfy
Thus,
For , the general case is as follows. (For additional clarity, the sequential definitions for and 3 are provided in Appendix A). Set . is the triangular array of binary random variables , , , with . Then,
and
where and for . The -conditional probabilities satisfy
The unconditional probabilities satisfy
The level sequences
together with the discrete filtration and the conditional probabilities (9) define a particular type of binary random tree which we designate as a binary information tree to level n ().15 Each node sequence
defines a unique event
on , where in (13) , , . Event occurs with probability16
Panel (c) in Figure 1 provides an illustration of two specific probabilities of the form .
To develop a random-price time series simulating microstructure timing, for any given we associate the levels of with a sequence of time instances over the finite period , . In our application to option pricing, the current time is (corresponding to the root event of ), while is the terminal time (corresponding to the leaf events of ). Trades of assets occur only at the times . These trading instances are fixed and known at time .17 The time intervals , , , and , over which no trades occur, are denoted inter-trade periods.18 We define , .
Associated with , for all we can recursively define the càdlàg node paths , , as follows:
We define the market information flow as
where is the set of all ()-tuples, . generates the stochastic basis on , where the filtration is defined by
Given , specification of , , defines a unique, nested set of càdlàg event paths , , , corresponding to the nested node sequences , with , , , and . From (15), this unique set of event paths is
Event and event path labels on are illustrated in Figure 2.
The -conditional probabilities along path are
The unconditional probability for event on path is determined by the sequence of conditional probabilities (19).
As there is no trade at , there are no new events at time on ; the last events occur at . Similarly, there is no event at , the first events occur at . Thus, on the events are labeled from to . From (13) and (18), we have the event-path equivalence
where means that event occurs at time point on path . To preserve uniformity of notation on , we define the pseudo-event for the root node, leading to the equivalence
As the “leaf” nodes at are path termination points, the terminating pseudo-events will be labeled using path notation . We shall refer to all events, pseudo or otherwise, simply as events.
For every fixed , the set of all paths , , defines an -adapted , which we denote . The probabilities will represent the natural probabilities for the direction of stock movements. provides the stochastic dynamics of the market information based on the time instances .
Estimation of Probabilities
In our discrete market setting (Section 3, Section 4 and Section 5), will represent the probability of the direction of price changes at . For example, given price corresponding to event , then , with , , and represents the probability of a price increase , while with and represents the probability of a price decrease .
Assuming that a sufficient history () of stock prices is available at , one can utilize the historical frequency of positive price changes over trading periods of size as an estimator for . For example, if min, then is the proportion of positive stock returns observed in a historical sample of 5-min returns.19 For , one can use the historical frequency as an estimator for , where . As an example of the computation of , assume equally spaced time intervals, , where day. Consider a data set of historical daily returns partitioned into V non-overlapping time periods, each period consisting of n days. In each period, the succession of signs of the n daily returns is compared to the succession of signs implied by the indexing of the path . If these two sign successions agree, the trial period is marked as a “success” (otherwise a “failure”). Thus, the partitioning of the data provides a set of V Bernoulli trials from which to compute
In Section 9, we empirically investigate this procedure for computing the required probabilities. Our results indicate that a prohibitively extensive history would be required to ensure adequate sampling for values of . In Section 9, we consider the use of bootstrap resampling to provide adequate samples.
3. Dynamics of the Riskless Rate and Bank Account Value on
We consider first the path-dependent dynamics of the riskless rate and on . Corresponding to the discrete times , , we consider the -measurable riskless rates and, without loss of generality, set .20 For any time , we define the -adapted riskless rate as follows.
For , .
For , .
For ,
For ,
In general, for path and , ,
with the understanding that, when , for any path , (21) holds for the closed time interval . Figure 3 illustrates the path dependence of the riskless rates .
For , the riskless rate is -measurable. This definition is consistent with the definition of the riskless rate (short rate) dynamics in continuous time (Duffie, 2001, p. 102). Without loss of generality, we can define the path-dependent, instantaneous riskless rate , by the relation
Having determined the dynamics of the riskless rates , , we now turn our attention to the dynamics of the riskless bank account . The -adapted bank account price , , is defined as follows.
For ,
For
For
For ,
For , , given path , the value of the bank account , is
with the understanding that, when , (23) holds for the closed time interval for any path .
For , the bank account value is -measurable. This definition is consistent with the definition of the riskless asset price dynamics in continuous time. More precisely, as shown in Section 6, in continuous time the riskless asset price is defined on the filtered probability space , where , , is a standard Brownian motion on , and its price dynamics are determined by , , where the short rate is -measurable with . Thus, is instantaneously riskless; in other words, for . This definition of the riskless asset pricing in continuous time was the motivation to define the riskless bank account valuation in discrete time by Equation (23).
4. Stock Price Dynamics on
The price dynamics of on the stochastic basis is an -adapted process. We define , on as follows.
For ,
For
for price ratio values . Let
be the discrete (arithmetic) return of the stock at . We assume that the mean and variance of the return,
are known. In (26), we interpret as the instantaneous mean return and as the instantaneous variance at . The moment conditions (26) imply that the price ratios in (25) are determined by
For ,
Note that denotes the stock price along the segment of the path . Figure 4 illustrates the path dependence of the stock prices up to . In contrast, denotes the state space of prices over the interval . We introduce the conditional notation to designate the two state prices determined by the condition , while designates the two state prices determined by the condition .
be the conditional arithmetic return at given that . We assume that the conditional mean and conditional variance,
are known. In (29), is the instantaneous conditional mean return and is the instantaneous conditional variance of the return at given . The moment conditions (29) imply that, in (28),
For , , given the path , the stock price is
for price ratios , . Again, it is understood that, when , for a given path (30) holds for the closed time interval . Let
be the conditional arithmetic return at given the path .21 We assume the conditional mean and conditional variance of ,
The first form in each equality is used when we wish to specify the value of , the second form refers to the respective price at an arbitratry event in .
The riskless rate , bank account value , stock price , price ratio , return , and the conditional moments and , corresponding to the segments of each of the two event paths and , are illustrated in Figure 5. The bank account value, riskless rate, and the two moments are the same for each of the two path segments.
Stock price dynamics in the natural world are determined by (30) and (31), which contain the following sets of model parameters:
—the probabilities for stock upward direction, ;
—the conditional means ;
—the conditional variances ;
on each node of the pricing tree. In Section 5, we show that the risk-neutral tree dynamics of the stock preserves . This is important given that the information on and is lost in passing to the continuous-time limit in the natural world and subsequent use of BSM for risk-neutral valuation. Using a discrete pricing tree rather than a continuous-time pricing model allows us to introduce richer, more flexible models for the price dynamics to accommodate market microstructure features in option pricing.
5. Risk-Neutral Dynamics on : Option Pricing
The option has discrete price dynamics , , on for some , , , with terminal time and terminal value for some , .22 At event , the option has the price
Consider the replicating portfolio satisfying
at event , where is the delta position. Requiring the usual discrete-time, no-arbitrage conditions, we have
Figure 6 illustrates these path-dependent, no-arbitrage conditions on the binary tree at .
From the second two equations in (35), the delta position is given by
where the final equality in (36) is obtained using (30) and (32). From the first equation in (35), and using (36), (23), and (22), we obtain the recurrence relation for the risk-neutral option value at event :
where the conditional risk-neutral probabilities are given by
with
being the market price of risk. Equations (34) through (38) hold for all events on when . As has a constant value over the time interval , the terminal value determines each option price at the events . These values provide the “initial conditions” for the recurrence relation (37).
Note that (36)–(38) each have an overall form that is familiar from binomial tree models. This should not be too surprising as a binomial tree is a particular form of a binary tree. The path dependence in our model therefore arises not through a change in form of these equations but from the fact that the variables (prices, probabilities, conditional moments) are explicitly path-dependent.
We further note that the natural-world conditional probabilities, and the mean and variance of the return dynamics, of are retained in the risk-neutral price dynamics of . This represents the tremendous advantage of discrete binary option pricing over its limiting continuous-time model; under the latter, information about the probabilities and the mean of the return process is lost (Section 6).
The extension to the pricing of an American option follows from the classical approach for valuation of an American option on a binomial tree (see, e.g., Hull, 2006, Section 12.5). The market value of the American option at event is given by
where is the exercise value of the stock, which is known at event . We emphasize that is the market value of the stock and not its fair-holding value . The risk-neutral tree stock dynamics , , are determined via the risk-neutral recursion (37),
In contrast to (39), Breen (1991) uses to define the fair value of the American option at the node ,
A trader in search of statistical arbitrage opportunities relative to an American option could compare (39) with (41) when seeking potential mispricing in the market value of the option.
6. Limiting Dynamics of Binary Pricing Trees
In this section, we investigate the limiting dynamics of the binary-tree pricing by assuming that the lengths of trading intervals uniformly vanish at a rate as . To support this, we generalize our notation for the trading times as follows. For any given , we consider the fixed time instances . The current time is , the terminal time is and, as previously, trades of and occur only at the times . The time intervals are denoted , . It is straightforward to adapt the results of Section 2, Section 3, Section 4 and Section 5 to this notational change for t. The resultant -adapted binary information tree is now denoted . We impose the restriction
To determine the continuum limit behavior, we apply the Donsker–Prokhorov invariance principle (DPIP) for continuous diffusions.24 To apply the DPIP we assume that, for each , the random variables determining the probabilities (14) and (19) are independent. Therefore, the probabilistic structure of the triangular array is determined by the probability laws,
for , , . This assumption of independence results in simplified expressions for (14) and (19):
As , then
has and . We now view the filtration (7) as generated by the triangular series ; that is,
Next, for a given sequence , , , we consider the random walk . By the DPIP, the sequence of processes
converges to a standard Brownian motion in the Skorokhod J1-topology.25 By denoting the canonical filtration , we can assume, by the Skorokhod embedding theorem (see, e.g., Kallenberg, 1997, Chapter 14) that and the triangular series are in the same stochastic basis space .
For the time interval , we now consider the limiting behavior of the discrete riskless rates in (22). We assume that the discrete instantaneous riskless rate process
converges uniformly to a continuous-time instantaneous riskless rate , where26
has strictly positive continuous trajectories on ;
.
Then, the discrete bank account value , in (23) converges uniformly to the continuous-time riskless asset dynamics
where the deterministic instantaneous riskless-rate (short-rate) process is -adapted.
Consider the discrete mean and volatility processes for :
Assume that and converge uniformly on to and , respectively, such that
Further, assume that and are -adapted, with and having continuous trajectories on .27 We define the price process
As in Hu et al. (2020a), a non-standard invariance principle (Davydov & Rotar, 2008) can be used to show that (46) converges weakly in topology (Skorokhod, 2005) to a process governed by a cumulative return process satisfying
where is a standard Brownian motion and (see Duffie (2001, Appendix 6D)). By (47), is a continuous diffusion and, if and are constant, is a GBM. In the risk-neutral world, the limiting cumulative return process obeys (47) with replaced by . We note that the discrete model is much more informative than the continuous-time model as it preserves the path-dependent probabilities , with no assumption on their (in)dependence.
7. Stock Pricing—Special Cases
The binary-tree pricing model presented above encompasses several time-series processes that have been proposed to model stock prices. We consider three such processes, all of which assume constant time spacing, , .28 These examples are BWN and the cases in which the first difference of the price process is assumed to display either MA(1) or AR(1) behavior. These latter two time-discrete models are of particular importance in microstructure theory. In the Introduction, we mentioned the analogous models of Hasbrouck (1988), which build upon the seminal Roll model (Roll, 1984). See Hasbrouck (2007, Chapter 8) for an overview and discussion of other closely related models.
7.1. Asymmetric Binary White Noise
Consider the case in which the conditional probabilities at each node of are the same:
We develop a white noise process (Hamilton, 2020, Chapter 3.2) on where each node takes on one of only two possible values. This asymmetric binary white noise (ABWN) on is the -adapted process
As the process (49) on is path-independent, the ABWN notation can be simplified to , , as indicated in (50).
Using (48) and the white noise moments and with being finite, (49) becomes
It is straightforward to show that the ABWN process has the first-order autocorrelation, a property of a white noise process.
7.2. Binary Moving Average of Order 1
We define the random process to be a binary moving-average process of order one (BMA(1)) by setting
for , , and values . In (51), the local index occurs with probability p, occurs with probability , and is the ABWN process (50). The expected value, variance, and first-order auto-covariance of follow the usual MA(1) process:
giving the familiar MA(1) autocorrelation coefficient:
In practice, the coefficients and of the BMA(1) process can be estimated from historical price differences. Let and denote, respectively, the empirical variance and first-order auto-covariance of the risky-asset price differences. Setting
and solving for and gives the solution pairs and ,
which have the following properties: ; ; ; and . To guarantee with , Equation (54) require . Addition of the constraint to guarantee invertibility of the BMA(1) process restricts the solution of (53) to the pair .
The choice of the BMA(1) tree is motivated by the fact that it represents a generalization (see, e.g., Hasbrouck, 2007, Chapters 4.2 and 8) of the Roll (1984) microstructure model discussed in the Introduction.
7.3. Autoregressive of Order 1
The ABWN process can also be used as a basis to model the first difference of the price process as binary autoregressive of the first-order (BAR(1)). Let
where . In (55), the local index occurs with probability p, while occurs with probability . Requiring ensures that the process has MA(∞) representation. The expected value, variance, and auto-covariance of order h for are
giving
for a finite lag h.
8. Computational Simplifications
Computations on a non-recombining binary tree are well known to be exponentially expensive, both in terms of memory requirements as well as execution time.29 Common methods to reduce the number of time intervals, , involved in binary-tree option pricing computations include using either a common large time step, , , or a graduated increase in time intervals , . However, the following features of the three special-case models reduce the computational and storage complexity.
8.1. ABWN Model
As stated by (49) and illustrated in Figure 7, every level on the ABWM binary tree has exactly the same two state values. As all conditional probabilities (48) reduce to either p or , the ABWM model reduces to a simple model on a (recombining) binomial tree.
8.2. BMA(1) Model
As a result of (51), the triangular configuration illustrated in Figure 7 represents the fundamental replicating unit for in the state space of the BMA(1) tree. By iterating (51) to obtain prices, it can be shown that, at time step n, , there are unique price states, each of the form
In (56), is the initial price. The coefficients and are listed in Table 1. The coefficients satisfy and , , with and reflecting the symmetry of the binary tree.
The binary-tree description of the BMA(1) model in Section 7.2 uses the path notation , with , to designate a specific node at time n. The partial recombining of the binary tree (from distinct price states to price states at time step n) suggests the simpler node (price) labeling of , , of (56). The connection between the two is as follows. Let denote the number of elements in the binary string and denote the number of elements in the binary string . Then, node corresponds to the node , with having the price given by the corresponding value of i in Table 1.
As a result of the reduction of the number of nodes, from to at time n, the BMA(1) model becomes computationally tractable for large values of n (competitive with trinomial models, which have nodes at time n). With prices (and therefore nodes) indexed as in (56), each price is uniquely specified by the vector of values (we ignore the value , which is common to all nodes). Using this vector notation, Figure 8 illustrates the prices on a BMA(1) tree. The figure also presents the vector denoting the change in the price along each indicated branch of the tree. It is a feature of the BMA(1) tree that only four price-change vectors occur in the tree: , , , and . (This is implicitly stated in Figure 7.) Furthermore, these four vectors always occur in this repetitive sequence in the transition from time n to .
The BMA(1) model supports the following further simplifications. For simplicity, we express these using the path notation of Section 2.
: As the BMA(1) model utilizes ABWN, from (48) we have constant conditional probabilities p and .
Riskless rate and bank account dynamics: For all n, if we assume , a constant, then, from (22), . Consequently, from (23)
Risky asset: We have the unconditional expectation (52) , . From (31),
Risk-neutral dynamics: Equation (38) simplifies to
with
8.3. BAR(1) Model
To understand the simplification of the BAR(1) model, rewrite (55) as
for , where , , and . By iterating (59), it is straightforward to develop an analytic formula for the possible price states at level n.
Let denote a naturally ordered set of all possible binary sequences of length n, where the binary choice for each element of a sequence is either U or D. Let denote the k-th sequence in the set . The natural ordering in arises from the iteration of (59). Specifically and . For example,
Thus, and .
Considering the sequence to be a vector, at level n each of the possible states can be written as the scalar product:
where is the vector . For example, the state corresponding to is
However, under the restriction , for a large enough n, becomes smaller than the machine precision (machine epsilon) and finite-precision computation of the higher-order terms in sums such as becomes meaningless. Thus, beyond a certain value of n, the numerical prices no longer change (the price of each child node remains equal to the price of the parent node). The value of n for which this occurs depends on the magnitudes , , and .30
9. Technical Analyses of the Probability Estimates
In Section Estimation of Probabilities, we noted the desirability of estimating the direction-of-price-change probabilities from historical data using (20) on a set of V non-overlapping binomial sequences, each of length n. However, we also noted that, even for relatively small values of n, a prohibitively extensive history of returns would be required to ensure an adequate sample to determine each of the probabilities for a given value of n. Table 2 codifies this problem using daily return data for the SPDR Dow Jones Industrial Average ETF Trust (DIA) covering the period of prices from 20 January 1998 through 5 May 2023. This data set provides 6365 return values; that is, a sequence of 6365 values of 0 (down) or 1 (up) price changes.
From the table it is evident that by , V is too small to ensure an adequate sample size for determining each of the possible sequences. It is not clear the sample size is adequate even for .
There are models that can be applied to a historical return time series to generate “mimicking” time series. Of course, all such models add additional model error to the process. One approach is to fit the historical return time series to an ARMA()-GARCH() model combined with a distribution model for the ARMA residuals, and then use the ARMA-GARCH-distribution model with fitted parameters to generate adequate numbers, V, of mimicking return series from which to compute the required probabilities. Such an approach involves fitting a significant number of required parameters. It generates return time series, which is a step removed from the sequences required by (20). We prefer to utilize bootstrap resampling directly on the sequence determined by the historical data set. Bootstrapping has the advantage of constantly resampling the historical sequence.31 For a given value of n, we required bootstrapped samples (i.e., each of the probabilities is determined based upon a expectation of 10,000 occurrences for each of the possible sequences).
We compared our bootstrap resampled results against those computed from the historical time series with no resampling (i.e., with expected number of sequence occurrences given in Table 2). For plotting convenience, we have developed the following labeling system for each of the possible sequences of length n. We illustrate the general notation using specific examples. Consider the sequences. They can be mapped to the values . For example, 01101 is mapped to ; 00000 is mapped to ; and 11111 to . This labeling has the property that and x label binary complement sequences (i.e., 01101 corresponds to and 10010 to ). A positive value of x indicates a binary string beginning with 1, while a negative value of x indicates a binary string beginning with 0.
Using the values x to represent the sequences corresponding to the various events with , Figure 9 and Figure 10 compare the results obtained for the probabilities computed via (20) for the DIA data set without and with bootstrap resampling for .
There is reasonable agreement between the results without and with bootstrap resampling for ; significant differences develop for , which are then clearly revealed for . In particular, without bootstrap resampling, when the paucity of data results in the probabilities taking on only nine possible values. The 95% confidence intervals in Figure 9 and Figure 10 are based upon the results in Figure 11 that the probabilities obtained from bootstrap resampling are well described by a normal distribution.
Analysis of the probabilities of individual sequences falls within the area of technical pattern analysis (Lo et al., 2000). We continue this analysis by examining the highest- and lowest-probability paths. Specifically, for fixed n we consider whether the highest-probability sequences specify paths that are “closely grouped” (with a similar statement for the lowest-probability paths). If the highest-probability paths occur randomly, this would provide further confirmation of the efficient market hypothesis. However, clustered paths suggest the presence of pronounced patterns, as argued by Lo et al. (2000). A related consideration is whether the highest-probability path for is a projection of the highest-probability path for .
Figure 12 displays the observed results for the grouping of paths. For the sequences of length n, we plot the highest-observed-probability path (colored gold), and the next -highest-probability paths (colored blue). Similarly, we plot the lowest-observed-probability path (colored green), and the next -lowest-probability paths (colored red). We consider and plot results for the data without and with bootstrap resampling. Due to the data limitations with no resampling, there is a more “random” distribution of high- and low-probability paths. (Note in particular the highest- and lowest-probability paths for in the case with no resampling.) With bootstrap resampling improving sample sizes, there is a more distinct grouping of the high- and low-probability paths, with the high-probability paths characterized by more consistent price increases and the low-probability paths characterized by more consistent price decreases.
Figure 13 displays the observed results for the projections of the highest- and lowest-probability paths. The highest-probability path for are each plotted on the same graph. Similarly for the respective lowest-probability path for each value of n. It is clear that, over this range of values of n, the lowest-probability path for is simply the projection (truncation) of the highest-probability path for . In the case of the highest-probability path, there is a “discontinuity” in the projection. For , the highest-probability path is a truncation of that for . For , the highest-probability path is almost a truncation of that for (with a slight difference occurring for at and at ).
To test the dynamic stability of such estimates, we reperformed the probability estimation procedure for the DIA data set using a rolling window of length 15 years (3780 trading days). This generated 2586 windows. For each window, sequence probabilities were computed for , using bootstrap resampling to ensure adequate sample sizes. (To speed-up the computation, we employed bootstrapped samples in each window.) For each choice of n, the rolling windows produced an empirical distribution of probability estimates for each of the sequences. These distributions are summarized as box–whisker plots in Figure 14. Figure 9 and Figure 14 show very similar structures (for and 6), indicating relative stability between the rolling window and global estimates of the sequence probabilities.
We now address the substructure that is apparent in Figure 14 (and in Figure 9 and Figure 10). Figure 15 replots the box–whisker plots with the sequences placed in categories according to the number of zeros (price downturns; equivalently the number of ones (price upturns)) each contains. Within each category, the sequences are still labeled from smallest to largest numerical label, x, as indicated in the top plot of Figure 15. The substructure seen in Figure 14 has largely vanished from Figure 15, indicating that the number of price downturns (equivalently upturns) is the major driver of a sequence’s probabilities. The uniformity of the ranges of the probability distributions within a category is indicative of an efficient market hypothesis operating within each category. It is in the difference in the ranges of the probability distributions between categories that market inefficiencies are seen.32 For , there is no overlap between the range of the empirical probability distribution for the sequence 0000 and the range of any distribution for sequences containing two or more upturns. For , there is no overlap between the range of the empirical distribution for the sequence 000000 and the range of any other distribution. Furthermore, the range of any distribution for a sequence containing five or six downturns has no overlap with the range of any distribution for a sequence containing four or more upturns.
We compare the sequence probability estimates among different assets using the 30 components comprising (as of 31 August 2020) the Dow Jones Industrial Average (DJIA) index. Price data were used for the period 3 January 2000 through 26 August 2022, with the exceptions of Visa (price data beginning 18 March 2008) and Dow (price data beginning 29 February 2019). This provided 5699 return values for 28 of the assets (3637 returns for Visa and 868 returns for Dow). Sequence probabilities were computed for these assets for . For these small values of n, the probabilities were computed from the data without bootstrap resampling. The estimated probabilities for sequences of length are presented in Table A1 in Appendix B. The probabilities for sequences of length are presented in Table A2. Significant p-values obtained from the one-sided z-test are also indicated. For comparison, sequence probabilities for the DIA data are also provided in these tables.
For , for all 31 assets, the probability of a negative return is smaller than the probability of a positive return. For 24 of these assets, this relationship is significant at a level ≤. For assets where the probabilities of the sequences 00 () or 000 () are significant at the level ≤, these sequences have the smallest probability. This holds for 26 of 31 assets for and 21 of 31 assets for . For the sequences 11 and 111, the results are not as strong. These sequences are significant at the level ≤ in 11 of 31 assets for and 9 of 31 assets for . However, these sequences represent, respectively, the highest probability for only 10 of the 11, for , and 7 of the 9, for , of these assets. If, instead, we consider sequences that contain at most a single negative return (a single “0” value), then the highest probability sequence that has significance at the level occurs for 15 of 31, for , and 20 of 31, for , assets. These observations are consistent with the DIA results in Figure 13 for .
10. Discussion
This work contains several points that we wish to emphasize.
The most significant is that modeling of market microstructure cannot be achieved using (existing) continuous-time stochastic theory. We provide two critical observations in the Introduction to support this statement. The first is that inclusion of microstructure effects results in price processes that are not semimartingales. The second is to note the work of Jarrow et al. (2009) (and Cheridito (2003)) that shows finite intervals between transaction times are required to maintain arbitrage-free price processes in such cases. (While Roll’s approximation resembles a continuous-time statement, it is crucial to recognize that is the first-order serial covariance of price changes, which are not instantaneous, but occur over finite market time intervals.) We argue that discrete, binary-tree models are appropriate models to capture microstructure behavior.
Our major theoretical result in this paper is the development a market-complete, arbitrage-free, binary-tree option pricing framework that is capable of capturing price processes exhibiting path-dependent behavior. Consistent with the observations in the previous paragraph, we note that any attempt to consider the continuous-time limit of this binary-tree model is self-defeating, as critical microstructure detail is lost in the continuum limit.
By applying the framework to build specific moving-average MA(1) and autoregressive AR(1) models, we have demonstrated that this binary-tree framework can be used to model market microstructure. In the case of the MA(1) model, we show that the binary price tree is sufficiently recombining that the computational complexity of the model is only . Both the MA(1) and AR(1) models require empirical testing in order to demonstrate their applicability as well as to provide comparison against other models. As this theoretical paper is already lengthy, we made the decision to leave such critical considerations for future investigation.
As it relates directly to the efficient market hypothesis, we have devoted a section of this work to a technical analysis of the direction-of-price-change probabilities that appear as parameters in the binary-tree model. Analysis of price histories for the 30 assets in the Dow Jones Average leads us to conclude that, when fixed-length sequences of daily price changes are categorized according to the number of price downturns in the sequence, it appears that the efficient market hypothesis operates within each category. However, market inefficiencies are evident between categories.
Author Contributions
Conceptualization, S.T.R.; methodology, W.B.L. and S.T.R.; software, D.L. and Y.H.; validation, D.L., W.B.L. and Y.H.; formal analysis, W.B.L. and S.T.R.; investigation, D.L., W.B.L. and Y.H.; data curation, D.L.; writing—original draft preparation, D.L.; writing—review and editing, W.B.L.; visualization, W.B.L.; supervision, S.T.R. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The data that support the findings of this study are available as follows. Stock and ETF price data provided through Bloomberg Professional Services and used under license. Code for the computations in Section 9 are available upon request to the corresponding author.
Conflicts of Interest
The authors declare no conflicts of interest.
Appendix A. Sequential Definition of the Probability Law on the BIT for n = 2 and 3
For , set . Then, with
The -conditional probabilities satisfy
Thus, the unconditional probabilities are
To estimate , we use the historical frequency of observing “a positive price change over a trading period of size followed by a positive price change over a trading period of size ”. Estimates for the remaining three probabilities of two-step stock movements are computed analogously.
For , set
Then, , with
The -conditional probabilities satisfy
Thus, the unconditional probabilities are
Note that the unconditional probabilities (A3) can be written concisely as
where , , . Similarly, the conditional probabilities (A2) can be written concisely as
Appendix B. Path Probability Tables for Assets in the DJIA Index
Table A1.
Path probabilities for sequences of length (columns 2 and 3) and (columns 4 to 7) for assets in the DJIA index and for DIA. Also indicated are the probabilities having significant p-values from the one-sided z-test. *, ** and *** refer to the standard significance levels of 5%, 1% and 0.1%, respectively.
Table A1.
Path probabilities for sequences of length (columns 2 and 3) and (columns 4 to 7) for assets in the DJIA index and for DIA. Also indicated are the probabilities having significant p-values from the one-sided z-test. *, ** and *** refer to the standard significance levels of 5%, 1% and 0.1%, respectively.
Path Label:
Symbol
Path Probability
Path Probability
AAPL
0.477 ***
0.523 ***
0.230 **
0.248
0.246
0.276 ***
AMGN
0.499
0.501
0.245
0.256
0.253
0.246
AXP
0.490
0.510
0.232 *
0.260
0.257
0.251
BA
0.486 *
0.514 *
0.229 **
0.260
0.254
0.257
CAT
0.486 *
0.514 *
0.239
0.247
0.248
0.266 *
CRM
0.487 *
0.513 *
0.222 ***
0.271 **
0.257
0.249
CSCO
0.484 *
0.516 *
0.229 **
0.262
0.247
0.261
CVX
0.475 ***
0.525 ***
0.221 ***
0.250
0.257
272 **
DIS
0.487 *
0.513 *
0.233 *
0.258
0.251
0.259
DOW
0.486
0.514
0.239
0.268
0.225
0.268
GS
0.488 *
0.512 *
0.229 **
0.256
0.262
0.253
HD
0.480 **
0.520 **
0.229 **
0.248
0.255
0.268 *
HON
0.477 ***
0.523 ***
0.219 ***
0.265 *
0.250
0.266 *
IBM
0.488 *
0.512 *
0.232 *
0.265 *
0.248
0.255
INTC
0.488 *
0.512 *
0.232 *
0.258
0.256
0.255
JNJ
0.486 *
0.514 *
0.225 **
0.265 *
0.257
0.253
JPM
0.493
0.507
0.233 *
0.254
0.267 *
0.246
KO
0.482 **
0.518 **
0.230 **
0.244
0.260
0.266 *
MCD
0.464 ***
0.536 ***
0.207 ***
0.262
0.252
0.279 ***
MMM
0.475 ***
0.525 ***
0.215 ***
0.259
0.261
0.265 *
MRK
0.494
0.506
0.246
0.242
0.253
0.258
MSFT
0.486 *
0.514 *
0.225 **
0.263
0.258
0.253
NKE
0.483 **
0.517 **
0.224 ***
0.253
0.264
0.258
PG
0.481 **
0.519 **
0.225 **
0.262
0.250
0.263
TRV
0.477 ***
0.523 ***
0.216 ***
0.254
0.268 *
0.262
UNH
0.477 ***
0.523 ***
0.221 ***
0.260
0.251
0.267 *
V
0.462 ***
0.538 ***
0.195 ***
0.275 **
0.259
0.270 *
VZ
0.493
0.517
0.234 *
0.264
0.255
0.248
WBA
0.499
0.501
0.244
0.258
0.251
0.246
WMT
0.487 *
0.513 *
0.224 ***
0.263
0.264
0.250
DIA
0.455 ***
0.545 ***
0.201 ***
0.262
0.246
0.291 ***
Table A2.
Path probabilities for sequences of length for assets in the DJIA index and for DIA. Also indicated are the probabilities having significant p-values from the one-sided z-test. *, ** and *** refer to the standard significance levels of 5%, 1% and 0.1%, respectively.
Table A2.
Path probabilities for sequences of length for assets in the DJIA index and for DIA. Also indicated are the probabilities having significant p-values from the one-sided z-test. *, ** and *** refer to the standard significance levels of 5%, 1% and 0.1%, respectively.
For the definition of a long-memory process, we refer the interested reader to Mandelbrot (2001) and Beran (2017).
3
It is common to define these properties as “stylized facts”. These include: volatility clustering; returns with heavy tailed distributions; tail dependence; leverage effects; and long-term memory. See Mittnik et al. (2007) for a comprehensive exposition on the topic.
4
It is worth noting that all Lévy processes are semimartingales, and many well-studied models in finance assume that the asset log-returns follow Lévy processes, as seen in Eberlein and Prause (2000); Rachev et al. (2011); Schoutens (2003).
5
See Samura et al. (2013) for conditions under which real-valued, cadlag processes that satisfy NFVLR must be semimartingales.
6
However, answering questions related to the existence and uniqueness of solutions to the continuous-time stochastic PDE that a numerical model may be intending to approximate does require the full machinery of stochastic integration theory.
7
This term originates with the seminal paper by Garman (1976). See O’Hara (1997) for an extensive overview of market microstructure studies.
In this paper, our interest is only in fully (perfectly) balanced binary trees. For brevity, we shall continue to refer to them simply as binary trees.
13
Thus, the example node sequence corresponds to the level sequence with , and .
14
See “Standard probability space”, Encyclopedia of Mathematics, EMS Press (2001).
While it may be of interest to view the trading times as stopping times, this is beyond the scope of the current paper. However, such an extension can be done by introducing binary pricing models with dynamics following discrete-time semimartingales that are contaminated by noise occurring at random time instances. See, for example, Jacod and Protter (2012, Chapter 16) and (Aït-Sahalia & Jacod, 2014, Chapter 9).
18
Note the assumption that no trade occurs at or at the terminal time (i.e., there is no new market information at times 0 and T).
19
Hung and Swallow (1999) provide a robust test for sample proportions when the Bernoulli trials are dependent. Applying robust estimates for does not make a significant difference in the numerical examples we consider in Section 9 because our sample size is relatively large and the dependence between the Bernoulli trials is weak.
20
The first trading date is . Thus, the first opportunity for the trader to deposit or withdraw from the bank account is . The value of is irrelevant for the trader.
21
We emphasize the equivalent notations
22
The functions f on and g on satisfy the usual regularity conditions; see Duffie (2001, Chapter 5). These conditions will only be needed when we consider the limiting option price process as max.
The DPIP is also known as the Functional Limit Theorem. We will apply the DPIP for continuous diffusions only, see Davydov and Rotar (2008). Extensions to more general DPIP, where the limiting price process is a semimartingale, are known; see Cherny et al. (2003); Duan et al. (2006) and Hu et al. (2020a). It will be of interest to study DPIP when the limiting pricing process is a semimartingale plus noise. These types of DPIP could be obtained by applying limiting results as studied in Jacod and Protter (2012), but that line of research is beyond the scope of this paper. Unfortunately, as pointed out in Hu et al. (2020a, 2020b), the limiting stock price dynamics erases important information contained in the discrete pricing model; specifically the probabilities for the direction of stock price moment and, in the case of option pricing, the mean return of the stock. For this critical reason we view this section on the continuum limit of the discrete dynamics mainly as an extension to the classical CRR and Jarrow and Rudd (1982) option pricing models. As these limiting results reveal, incorporation of market microstructure features requires full use of the discrete binary-tree pricing model.
The limiting riskless rate is also assumed to satisfy . See Duffie (2001, p. 102) for the extension to a stochastic short rate under additional regularity conditions.
27
Relaxing the assumptions on and requires an extension of the non-standard DPIP by Davydov and Rotar (2008) for general continuous diffusions, which is beyond the scope of the current work. The reason we do not pay significant attention to the limiting behavior of the binary asset pricing dynamics is that the continuous dynamics of the return process , loses the important information regarding the probabilities for the direction of stock price movements , , . Even worse, when passing to risk-neutral continuous dynamics, the extremely valuable information about the mean stock returns will also be lost. When discussing market microstructure option pricing models, losing information on and hardly seems justifiable. Thus, in this work we concentrate our attention on (discrete) binary asset pricing, and pass to the limit as , only to provide a comparison with the classical BSM asset pricing continuous-time dynamics.
28
This constant time spacing can be relaxed at the cost of greater computational complexity.
29
A simple estimate serves to illustrate the issue. Consider a reasonably standard desktop computer with 128 GBytes ( bytes) of RAM. Storing a single floating point value at each node of balanced, binary tree with m levels requires floats. Assuming single precision ( bytes) floats, this would naively suggest that an level tree could be accommodated. In practice, the memory must be shared with the operating system and other executing programs. In addition, cache sizes are much smaller, which significantly degrade computational time. Thus, values of appear more achievable. Advanced computational hardware, such as GPUs and high performance computing clusters can make larger values of m achievable.
30
For , machine epsilon is reached by (single precision) or (double precision).
31
Specifically we employed the R program ts_boot() using block resampling with block lengths having a geometric distribution with mean length n.
32
We note that these results are based upon daily closing prices. We make no inferences for returns based upon other price intervals.
References
Aït-Sahalia, Y., & Jacod, J. (2014). High-frequency financial econometrics. Princeton University Press. [Google Scholar]
Bachelier, L. (1900). Théorie de la spéculation. Annales Scientifiques de l’École Normale Supérieure, 3(17), 21–86. [Google Scholar] [CrossRef]
Beran, J. (2017). Statistics for long-memory processes. Chapman & Hall/CRC. [Google Scholar]
Billingsley, P. (2013). Convergence of probability measures. John Wiley & Sons. [Google Scholar]
Black, F., & Scholes, M. (1973). The pricing of options and corporate liabilities. Journal of Political Economy, 81(3), 637–654. [Google Scholar] [CrossRef]
Boyle, P. P. (1986). Option Valuation Using a Three Jump Process. International Options Journal, 3, 7–12. [Google Scholar]
Breen, R. (1991). The accelerated binomial option pricing model. Journal of Financial and Quantitative Analysis, 26(2), 153–164. [Google Scholar] [CrossRef]
Cheridito, P. (2003). Arbitrage in fractional Brownian motion models. Finance and Stochastics, 7, 533–553. [Google Scholar] [CrossRef]
Cherny, A. S., Shiryaev, A. N., & Yor, M. (2003). Limit behavior of the “horizontal-vertical” random walk and some extensions of the Donsker-Prokhorov invariance principle. Theory of Probability & Its Applications, 47(3), 377–394. [Google Scholar]
Cordero, F., Klein, I., & Perez-Ostafe, L. (2014). Binary markets under transaction costs. International Journal of Theoretical and Applied Finance, 17(5), 1450030. [Google Scholar] [CrossRef]
Cox, J. C., & Ross, S. A. (1976). The valuation of options for alternative stochastic processes. Journal of Financial Economics, 3(1), 145–166. [Google Scholar] [CrossRef]
Cox, J. C., Ross, S. A., & Rubinstein, M. (1979). Option pricing: A simplified approach. Journal of Financial Economics, 7(3), 229–263. [Google Scholar] [CrossRef]
Daley, D. J., & Vere-Jones, D. (2003). An introduction to the theory of point processes, Volume I: Elementary theory and methods. Springer. [Google Scholar]
Daley, D. J., & Vere-Jones, D. (2008). An introduction to the theory of point processes, Volume II: General theory and structure. Springer. [Google Scholar]
Davydov, Y., & Rotar, V. (2008). On a non-classical invariance principle. Statistics & Probability Letters, 78(14), 2031–2038. [Google Scholar]
Delbaen, F., & Schachermayer, W. (1994). A general version of the fundamental theorem of asset pricing. Mathematische Annalen, 300(1), 463–520. [Google Scholar] [CrossRef]
Derman, E., Kani, I., & Chriss, N. (1996). Implied trinomial trees of the volatility smile (Quantitative Strategies Research Notes). Goldman Sachs. [Google Scholar]
Dolinsky, Y., & Neufeld, A. (2018). Super-replication in fully incomplete markets. Mathematical Finance, 28(2), 483–515. [Google Scholar] [CrossRef]
Duffie, D. (2001). Dynamic asset pricing theory. Princeton University Press. [Google Scholar]
Dzhaparidze, K. O., & van Zuijlen, M. C. A. (1996). Introduction to option pricing in a securities market I: Binary models. CWI Quarterly, 9(4), 319–355. [Google Scholar]
Easley, D., & O’Hara, M. (1995). Market microstructure. Handbooks in Operations Research and Management Science, 9, 357–383. [Google Scholar]
Easley, D., & O’Hara, M. (2003). Microstructure and asset pricing. Handbook of the Economics of Finance, 1, 1021–1051. [Google Scholar]
Eberlein, E., & Prause, K. (2000). The generalized hyperbolic model: Financial derivatives and risk measures. In H. Geman, D. Madan, S. R. Pliska, & T. Vorst (Eds.), Mathematical finance—Bachelier Congress 2000 (pp. 245–267). Springer. [Google Scholar]
El Karoui, N., & Quenez, M.-C. (1995). Dynamic programming and pricing of contingent claims in an incomplete market. SIAM Journal on Control and Optimization, 31(1), 29–66. [Google Scholar] [CrossRef]
Fan, J., Imerman, M. B., & Dai, W. (2016). What does the volatility risk premium say about liquidity provision and demand for hedging tail risk? Journal of Business & Economic Statistics, 34(4), 519–535. [Google Scholar]
Garman, M. B. (1976). Market microstructure. Journal of Financial Economics, 3(3), 257–275. [Google Scholar] [CrossRef]
Hamilton, J. D. (2020). Time series analysis. Princeton University Press. [Google Scholar]
Harrison, J. M., & Kreps, D. M. (1979). Martingales and arbitrage in multiperiod securities markets. Journal of Economic Theory, 20(3), 381–408. [Google Scholar] [CrossRef]
Harrison, J. M., & Pliska, S. R. (1981). Martingales and stochastic integrals in the theory of continuous trading. Stochastic Processes and Their Applications, 11(3), 215–260. [Google Scholar] [CrossRef]
Hasbrouck, J. (1988). Trades, quotes, inventories, and information. Journal of Financial Economics, 22(2), 229–252. [Google Scholar] [CrossRef]
Hasbrouck, J. (1996). Modeling market microstructure time series. In G. S. Maddala, & C. R. Rao (Eds.), Statistical methods in finance (Vol. 14, pp. 647–692). Elsevier. [Google Scholar]
Hasbrouck, J. (2007). Empirical market microstructure: The institutions, economics, and econometrics of securities trading. Oxford University Press. [Google Scholar]
Hu, Y., Shirvani, A., Lindquist, W. B., Fabozzi, F. J., & Rachev, S. T. (2020a). Option pricing incorporating factor dynamics in complete markets. Journal of Risk and Financial Management, 13(12), 321. [Google Scholar] [CrossRef]
Hu, Y., Shirvani, A., Stoyanov, S., Kim, Y. S., Fabozzi, F. J., & Rachev, S. T. (2020b). Option pricing in markets with informed traders. International Journal of Theoretical and Applied Finance, 23(6), 2050037. [Google Scholar] [CrossRef]
Hull, J. (2006). Options, futures, & other derivatives: Solutions manual. Prentice Hall International. [Google Scholar]
Hung, M., & Swallow, W. H. (1999). Robustness of group testing in the estimation of proportions. Biometrics, 55(1), 231–237. [Google Scholar] [CrossRef]
Jacod, J., & Protter, P. (2012). Discretization of processes (Vol. 67). Springer. [Google Scholar]
Jarrow, R. A., Protter, P., & Sayit, H. (2009). No arbitrage without semimartingales. The Journal of Applied Probability, 19(2), 596–616. [Google Scholar] [CrossRef]
Jarrow, R. A., & Rudd, A. (1982). Approximate option valuation for arbitrary stochastic processes. Journal of Financial Economics, 10(3), 347–369. [Google Scholar] [CrossRef]
Kallenberg, O. (1997). Foundations of modern probability. Springer. [Google Scholar]
Karatzas, I. (1997). Lectures in the mathematics of finance. American Mathematical Society. [Google Scholar]
Kim, Y. S., Stoyanov, S., Rachev, S., & Fabozzi, F. (2016). Multi-purpose binomial model: Fitting all moments to the underlying geometric Brownian motion. Economics Letters, 145, 225–229. [Google Scholar] [CrossRef]
Kim, Y. S., Stoyanov, S., Rachev, S., & Fabozzi, F. J. (2019). Enhancing binomial and trinomial equity option pricing models. Finance Research Letters, 28, 185–190. [Google Scholar] [CrossRef]
Lo, A. W., Mamaysky, H., & Wang, J. (2000). Foundations of technical analysis: Computational algorithms, statistical inference, and empirical implementation. The Journal of Finance, 55(4), 1705–1765. [Google Scholar] [CrossRef]
Löhne, A., & Rudloff, B. (2014). An algorithm for calculating the set of superhedging portfolios in markets with transaction costs. International Journal of Theoretical and Applied Finance, 17(2), 1450012. [Google Scholar] [CrossRef]
Mandelbrot, B. B. (2001). Stochastic volatility, power laws and long memory. Quantitative Finance, 1(6), 558–559. [Google Scholar] [CrossRef]
Merton, R. C. (1973). Theory of rational option pricing. The Bell Journal of Economics and Management Science, 4(1), 141–183. [Google Scholar] [CrossRef]
Merton, R. C. (1976). Option pricing when underlying stock returns are discontinuous. Journal of Financial Economics, 3(1), 125–144. [Google Scholar] [CrossRef]
Mills, T. C. (2019). Applied time series analysis: A practical guide to modeling and forecasting. Academic Press. [Google Scholar]
Mitov, G. K., Rachev, S. T., Kim, Y. S., & Fabozzi, F. J. (2009). Barrier option pricing by branching processes. International Journal of Theoretical and Applied Finance, 12(7), 1055–1073. [Google Scholar] [CrossRef]
Mittnik, S., Fabozzi, F. J., Focardi, S. M., Rachev, S. T., & Jašić, T. (2007). Financial econometrics: From basics to advanced modeling techniques. John Wiley & Sons. [Google Scholar]
O’Hara, M. (1997). Market microstructure theory. John Wiley & Sons. [Google Scholar]
O’Hara, M. (1999). Making market microstructure matter. Financial Management, 28(2), 83–90. [Google Scholar] [CrossRef]
Oksendal, B. (2013). Stochastic differential equations: An introduction with applications. Springer. [Google Scholar]
Rachev, S. T., Kim, Y. S., Bianchi, M. L., & Fabozzi, F. J. (2011). Financial models with Lévy processes and volatility clustering. John Wiley & Sons. [Google Scholar]
Roll, R. (1984). A simple implicit measure of the effective bid-ask spread in an efficient market. The Journal of Finance, 39(4), 1127–1139. [Google Scholar]
Rouge, R., & El Karoui, N. (2000). Pricing via utility maximization and entropy. Mathematical Finance, 10(2), 259–276. [Google Scholar] [CrossRef]
Rubinstein, M. (1994). Implied binomial trees. The Journal of Finance, 49(3), 771–818. [Google Scholar] [CrossRef]
Rubinstein, M. (1998). Edgeworth binomial trees. Journal of Derivatives, 5(3), 20–27. [Google Scholar] [CrossRef]
Samura, S. K., Mao, J., & Yao, D. (2013). Semimartingale property and its connection to arbitrage. Journal of Mathematical Finance, 3, 237–241. [Google Scholar] [CrossRef]
Schoutens, W. (2003). Lévy processes in finance: Pricing financial derivatives. John Wiley & Sons. [Google Scholar]
Shiryaev, A. N. (1999). Essentials of stochastic finance: Facts, models, theory (O. E. Barndorff-Nielsen, Ed.; Vol. 3). World Scientific. [Google Scholar]
Skorokhod, A. V. (1997). Measure-valued diffusion. Ukranian Mathematical Journal, 49, 506–513. [Google Scholar] [CrossRef]
Skorokhod, A. V. (2005). Basic principles and applications of probability theory. Springer. [Google Scholar]
Figure 1.
Nomenclature used to label the binary information tree . (a) Level values and local indices . (b) Trajectory labels, and , . (c) An illustration of two of the probabilities .
Figure 1.
Nomenclature used to label the binary information tree . (a) Level values and local indices . (b) Trajectory labels, and , . (c) An illustration of two of the probabilities .
Figure 2.
Event and event path labeling for . To preserve figure clarity, only half of the events are labeled at .
Figure 2.
Event and event path labeling for . To preserve figure clarity, only half of the events are labeled at .
Figure 3.
Illustration of the path dependence of the riskless rates . For figure clarity, each riskless rate is indicated by the shortened form .
Figure 3.
Illustration of the path dependence of the riskless rates . For figure clarity, each riskless rate is indicated by the shortened form .
Figure 4.
Illustration of the path dependence of the stock prices . For figure clarity, each price is indicated by the shortened form .
Figure 4.
Illustration of the path dependence of the stock prices . For figure clarity, each price is indicated by the shortened form .
Figure 5.
Illustration of the stock price , price ratio s, return , riskless bank account value , riskless rate , and moments and corresponding to the segments of each of the event paths and . Note that the r, , , and values are common to both path segments.
Figure 5.
Illustration of the stock price , price ratio s, return , riskless bank account value , riskless rate , and moments and corresponding to the segments of each of the event paths and . Note that the r, , , and values are common to both path segments.
Figure 6.
Illustration of the path-dependent, no-arbitrage conditions (35) on the binary tree at an event .
Figure 6.
Illustration of the path-dependent, no-arbitrage conditions (35) on the binary tree at an event .
Figure 7.
Each node in the ABWN model has the structure presented on the left. The triangular unit presented on the right is replicated everywhere on the BMA(1) tree. The value of is either or .
Figure 7.
Each node in the ABWN model has the structure presented on the left. The triangular unit presented on the right is replicated everywhere on the BMA(1) tree. The value of is either or .
Figure 8.
Illustration of the prices on a BMA(1) tree. The bold numbers represent the vector, , of coefficients in (56) used to compute each price. The italicized numbers represent the coefficient vector denoting the price change along the indicated branch of the tree.
Figure 8.
Illustration of the prices on a BMA(1) tree. The bold numbers represent the vector, , of coefficients in (56) used to compute each price. The italicized numbers represent the coefficient vector denoting the price change along the indicated branch of the tree.
Figure 9.
The probabilities obtained from the DIA data set without and with bootstrap resampling for (left plots) and (right plots) . The dashed horizontal line denotes . The red horizontal lines denote the 95% confidence interval.
Figure 9.
The probabilities obtained from the DIA data set without and with bootstrap resampling for (left plots) and (right plots) . The dashed horizontal line denotes . The red horizontal lines denote the 95% confidence interval.
Figure 10.
The probabilities obtained from the DIA data set without and with bootstrap resampling for . The dashed horizontal line denotes . The red horizontal lines denote the 95% confidence interval.
Figure 10.
The probabilities obtained from the DIA data set without and with bootstrap resampling for . The dashed horizontal line denotes . The red horizontal lines denote the 95% confidence interval.
Figure 11.
Normal probability plots (solid dots) for the DIA probabilities computed using bootstrap resampling for (left) and (right) . The dotted line and equation are the results of the linear fit to the data.
Figure 11.
Normal probability plots (solid dots) for the DIA probabilities computed using bootstrap resampling for (left) and (right) . The dotted line and equation are the results of the linear fit to the data.
Figure 12.
Plotted for sequences of length n are the n highest- (blue, gold) and n lowest- (red, green) probability paths for the data (top) without and (bottom) with bootstrap resampling. The highest-probability path is colored gold, the lowest-probability path is green.
Figure 12.
Plotted for sequences of length n are the n highest- (blue, gold) and n lowest- (red, green) probability paths for the data (top) without and (bottom) with bootstrap resampling. The highest-probability path is colored gold, the lowest-probability path is green.
Figure 13.
The highest- and lowest-probability paths plotted for . Note that the color used for a smaller-n path obscures the color used for a larger-n path if both have a segment occurring on the same branch of the tree.
Figure 13.
The highest- and lowest-probability paths plotted for . Note that the color used for a smaller-n path obscures the color used for a larger-n path if both have a segment occurring on the same branch of the tree.
Figure 14.
Box–whisker summaries of the computed distributions of probability estimates for each of the sequences for . The empirical distributions were obtained from the DIA data set using a rolling window of 15 years.
Figure 14.
Box–whisker summaries of the computed distributions of probability estimates for each of the sequences for . The empirical distributions were obtained from the DIA data set using a rolling window of 15 years.
Figure 15.
The box–whisker summaries of Figure 14 reordered into categories based on the number of price up- and downturns occurring in the sequence.
Figure 15.
The box–whisker summaries of Figure 14 reordered into categories based on the number of price up- and downturns occurring in the sequence.
Table 1.
Values of the coefficients and , , in (56).
Table 1.
Values of the coefficients and , , in (56).
n Even
n Odd
i
n
0
0
n
0
0
1
0
1
0
1
1
1
1
2
1
2
1
⋮
⋮
⋮
⋮
⋮
⋮
⋮
⋮
n
⋮
⋮
⋮
⋮
⋮
⋮
⋮
⋮
4
2
1
2
1
3
1
1
1
1
2
1
0
1
0
1
0
n
0
0
n
0
Table 2.
Expected number of occurrences of each sequence in patterns of length n in the DIA return data set covering the period of prices from 20 January 1998 through 5 May 2023.
Table 2.
Expected number of occurrences of each sequence in patterns of length n in the DIA return data set covering the period of prices from 20 January 1998 through 5 May 2023.
Pattern
Number of
Number of
Expected Number of
Length
Sequences
Non-Overlapping
Pattern Sequences in
()
in Pattern
Intervals
Non-Overlapping
()
()
Intervals
4
16
1591
99.4
5
32
1273
39.8
6
64
1060
16.6
8
256
795
3.1
10
1024
636
0.6
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Lauria, D.; Lindquist, W.B.; Rachev, S.T.; Hu, Y.
Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework. J. Risk Financial Manag.2025, 18, 230.
https://doi.org/10.3390/jrfm18050230
AMA Style
Lauria D, Lindquist WB, Rachev ST, Hu Y.
Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework. Journal of Risk and Financial Management. 2025; 18(5):230.
https://doi.org/10.3390/jrfm18050230
Chicago/Turabian Style
Lauria, Davide, W. Brent Lindquist, Svetlozar T. Rachev, and Yuan Hu.
2025. "Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework" Journal of Risk and Financial Management 18, no. 5: 230.
https://doi.org/10.3390/jrfm18050230
APA Style
Lauria, D., Lindquist, W. B., Rachev, S. T., & Hu, Y.
(2025). Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework. Journal of Risk and Financial Management, 18(5), 230.
https://doi.org/10.3390/jrfm18050230
Article Metrics
No
No
Article Access Statistics
For more information on the journal statistics, click here.
Multiple requests from the same IP address are counted as one view.
Lauria, D.; Lindquist, W.B.; Rachev, S.T.; Hu, Y.
Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework. J. Risk Financial Manag.2025, 18, 230.
https://doi.org/10.3390/jrfm18050230
AMA Style
Lauria D, Lindquist WB, Rachev ST, Hu Y.
Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework. Journal of Risk and Financial Management. 2025; 18(5):230.
https://doi.org/10.3390/jrfm18050230
Chicago/Turabian Style
Lauria, Davide, W. Brent Lindquist, Svetlozar T. Rachev, and Yuan Hu.
2025. "Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework" Journal of Risk and Financial Management 18, no. 5: 230.
https://doi.org/10.3390/jrfm18050230
APA Style
Lauria, D., Lindquist, W. B., Rachev, S. T., & Hu, Y.
(2025). Bridging Asset Pricing and Market Microstructure: Option Valuation in Roll’s Framework. Journal of Risk and Financial Management, 18(5), 230.
https://doi.org/10.3390/jrfm18050230