High-Frequency Quote Volatility Measurement Using a Change-Point Intensity Model

Zhicheng Li; Haipeng Xing

doi:10.3390/math10040634

and

¹

Center for Economics, Finance and Management Studies, Hunan University, Changsha 410006, China

²

Department of Applied Math, Stony Brook University, Stony Brook, New York, NY 11790, USA

^*

Author to whom correspondence should be addressed.

Mathematics2022, 10(4), 634;https://doi.org/10.3390/math10040634

This article belongs to the Special Issue Mathematical and Statistical Methods Applications in Finance

Version Notes

Order Reprints

Abstract

Quote volatility is important in determining the cost of demand in a high frequency (HF) order market. This paper proposes a new model to measure quote volatility based on the point process and price-change duration. Specifically, we built a change-point intensity (CPI) model to describe the dynamics of price-change events for a given level of threshold. The instantaneous volatility of quote price can be calculated at any time according to price-change intensities. Based on this, we can quantify the cost of demanding liquidity for traders with different trading latency by using integrated variances. Furthermore, we use the autoregressive conditional intensity (ACI) model proposed by Russell (1999) as a benchmark comparison. The results suggest that our model has better performance of both in-sample fitness and out-of-sample prediction.

Keywords:

quote volatility; price duration; change-point model

JEL Classification:

C58; G10; G17

1. Introduction

Currently, Limit Order Book (LOB) is widely used in financial markets to facilitate traders to manage their orders and to implement transactions. In the LOB, the existing limit orders for a financial asset can be viewed as the up to time liquidity provision in the market, while the issuing of market orders from any traders is an instant demand of that liquidity. A liquidity demander who wants to buy shares (or sell shares) immediately will take the price at the best ask (or at the best bid) to complete transactions. However, because of the trading latency, traders may suffer price uncertainty (risks) of transactions as the quoted price at the best bid and the best ask would fluctuate from the time of issuing orders to the time that the order is matched. The problem is critical for traders with relatively high trading latency and becomes severe in the periods when the quote price changes quickly.

Because of the development of algorithmic trading and high-frequency trading in the last two decades, some traders have a speed advantage in order to execute their orders. The volatility of the quoted price and the latency of trades determine the uncertainty of the purchasing (selling) price for those liquidity demanders. Hasbrouck [1] has pointed out that low-latency traders have the advantage of having lower cost to demand orders, while in the paper, he assumes that the price path is already given at the time of order submission. However, the dynamic of the quote price is quite unpredictable with some periods being much more volatile than others. If a trader arrives at a time when the quote price is more volatile, she will encounter a higher risk of transaction pricing.

Different traders have different latency, from microseconds for high-frequency traders to dozens of seconds for ‘slow’ traders; see [2]. To provide a framework that is suitable for us to quantify quote volatility and the cost of demanding liquidity for different traders, we resort to the instantaneous volatility of quote price at any time point and further construct an integrated variance for different time horizons. However, the estimation of instantaneous volatility becomes particularly difficult for irregular LOB data in HF.

In the development of information technology and electronic trading, the updating frequency of order events has reached the level of microseconds, while there is also silent time where LOB stays unchanged for a couple of seconds. To deal with irregular data, the theory of point process is employed. According to it, the arrivals of a certain type of market events can be viewed as ordered points occurring in the time space. In the past, a number of econometric models were built to describe the occurrence of limit order submission, trade arrival, price change, etc. [3,4], and many of them directly study the financial durations between those market events [5,6,7].

Recently, Yang et al. [8] applied the autoregressive conditional duration (ACD) model by [5] and Markov-switching multifractal inter-trade duration (MSMD) model by [9] to study LOB and find their limitations in fitting HF transaction data. Abergel and Jedidi [10] introduced the Hawkes process to model LOB, in which past events can influence the occurrence intensity of a current event. Then, Swishchuk and Huffman [11] constructed general compound Hawkes processes and investigated their properties in LOB. Later, Morariu-Patrichi and Pakkanen [12] applied state-dependent Hawkes processes to HF LOB data and built a novel model that captures the feedback loop between the order flow and the shape of the LOB. In addition, Li et al. [13] used a time-varying Markov regime switching model to study the arrival time of trades in LOB and captured the bimodal distribution of intertrade durations.

Although the point process has been extensively exploited to model HF financial data, short-term volatility measurement using price duration is not common in the literature. Cho and Frees [14] initiated a discussion of using price duration as volatility measurement, and Gerhard and Hautsch [15] formally proposed a volatility estimator based on price durations. Tse and Yang [16] developed duration-based variance estimators using ACD specifications, and recently, Hong et al. [17] proposed a non-parametric duration-based estimator and concluded that the duration-based volatility estimators are more efficient than noise-robust realized volatility estimators. Furthermore, some papers focus on modeling the intensity function itself. By defining the intensity in continuous time and allowing the intensity process to be updated whenever required, the instantaneous volatility can be calculated by the inverse of intensity. Russell [18] first proposed a univariate dynamic intensity model, the autoregressive conditional intensity (ACI) model, which follows an autoregressive structure that is updated at the time of new market events. Then, the ACI model was extended to the stochastic conditional intensity processes and multivariate process [19,20,21].

In this paper, we propose a new change-point model to measure the quote volatility. The seminal work on change-point models can be traced back to Box and Tiao [22], who want to solve a common modeling problem for the time series for which its parameters may undergo occasional changes. It arises in many applications, e.g., engineering, econometrics, and biomedicine. However, the generalized statistic method for estimation was developed until Lai et al. [23], who used BCMIX (bounded complexity mixture) to reduce the complexity of computation. In our framework, we view the jump of the intensity of the quote price movement as a change point. Moreover, the domain set for renewing the intensity is infinite, and the renewing distribution is continuous. Thus, our model can generate a much larger space than the traditional ACI model. The empirical results demonstrate that our model performs much better in terms of fitting current HF data. Moreover, the change-point intensity model can not only measure the cost of demanding liquidity for traders with different latency but also can be used to test volatility jumps in HF environments.

The paper is organized as follows. Section 2 introduces the method of volatility measurement using the price-change duration and our change-point intensity model. Section 3 provides the estimation procedure and the simulation results. Section 4 presents the data we used. Section 5 shows the in-sample fitness of our model and the measurement of HF quote volatility, with a benchmark comparison with the ACI model. Section 6 further implements the out-of-sample test and evaluates the model’s predictive power. Section 7 provides the conclusions.

2. Volatility Measurement Using Price Duration

2.1. Instantaneous Volatility Measurement Use Price Duration

According to [5], the instantaneous volatility can be measured by the conditional instantaneous variance of returns, which is defined as follows:

σ^{2} (t) : = lim_{Δ ↓ 0} E [\frac{1}{Δ} {(\frac{p (t + Δ) - p (t)}{p (t)})}^{2}| F_{t}],

where

{p (t), t \geq 0}

is a price process of a financial security, and

F_{t}

denotes the information setup, including t. Following [15,16], duration-based variance estimators rely on a relationship between the conditional intensity function and the conditional instantaneous variance of a point process. Specifically for price volatility, we can consider the price-change process as a point process. We define

δ

as the threshold of the price-change event, and

{t_{i}^{δ}}_{i = 1, 2, \dots, n}

are the times when these price-change events occur. Clearly, the number of events n depends on the value of

δ

.

If we define

x_{i}^{δ} : = t_{i}^{δ} - t_{i - 1}^{δ}

as the price duration between two consecutive price-change events, then the conditional variance per time over the price duration is as follows.

\begin{matrix} σ^{2} (t_{i}^{δ}) & = E [\frac{1}{x_{i + 1}^{δ}} {(\frac{δ}{p (t_{i}^{δ})})}^{2}| F_{t_{i}}] \\ = E [\frac{1}{x_{i + 1}^{δ}}| F_{t_{i}}] {(\frac{δ}{p (t_{i}^{δ})})}^{2} . \end{matrix}

The above calculation requires either specifying a stochastic process for

\frac{1}{x_{i + 1}^{δ}}

or computing the distribution

\frac{1}{x_{i + 1}^{δ}}

using a transformation of the conditional distribution of

x_{i + 1}^{δ}

.

Here, we introduce more information on the point process theory, which leads to the formulation of instantaneous volatility using the price-change intensity function. Let

{t_{i}}_{i = 1, 2, \dots, n}

be a sequence of event arrival times

0 \leq t_{i} \leq t_{i + 1}

, then a orderly point process is associated with a counting process,

N (t)

, where

N (t) = Σ_{i \geq 1} 1_{t_{i} \leq t}

is the number of events up to and including time t. A point process can be characterized by a intensity function

λ (t; F_{t})

, which is described as follows.

λ (t; F_{t}) = lim_{Δ ↓ 0} \frac{1}{Δ} P r [N (t + Δ) > N (t) | F_{t}] .

It represents the probability for a new arrival of the event in an infinitesimal time interval. In many applications, this is equivalent to the hazard function, particularly in traditional duration or survival analysis, where cross-sectional duration data are analyzed [5,24], while the intensity function is mostly defined in continuous time and conditions on a possibly continuously varying information set

F_{t}

.

In particular, for the price-change event with the price threshold

δ

, the price variation in a small time interval

Δ

can only be

δ

or

- δ

. Hence, the instantaneous variance of returns at time t can be derived in terms of the following expression.

\begin{matrix} σ^{2} (t) & = lim_{Δ ↓ 0} E [\frac{1}{Δ} {(\frac{p (t + Δ) - p (t)}{p (t)})}^{2}| F_{t}] \\ = lim_{Δ ↓ 0} \frac{1}{Δ} P r [| p (t + Δ) - p (t) | \geq δ| F_{t}] {(\frac{δ}{p (t)})}^{2} \\ = lim_{Δ ↓ 0} \frac{1}{Δ} P r [N^{δ} (t + Δ) > N^{δ} (t) | F_{t}] {(\frac{δ}{p (t)})}^{2} \\ = λ^{δ} (t; F_{t}) {(\frac{δ}{p (t)})}^{2} . \end{matrix}

(1)

Therefore, the measurement of instantaneous volatility lies in estimation of the intensity function associated with the process of

δ

—price changes, i.e.,

λ^{δ} (t; F_{t})

. A similar result is obtained in [4,5,17].

Another important property about the integrated intensity function builds the basis for the construction of the likelihood function of an intensity-based model and leads to the mixture-of-exponential representation that is essential for our change-point intensity (CPI) model. According to the random time change theorem by [25] which transforms a wide class of point processes to a unit-rate Poisson process, both Barndorff-Nielsen and Shiryaev [26] and Hautsch [4] have shown that, if the event arrival time of a point process is

t_{1}, t_{2,} \dots, t_{n}

, then the integrated intensity function is as follows:

Λ (t_{i - 1}, t_{i}) \int_{t_{i - 1}}^{t_{i}} λ (s) d s \sim i . i . d E x p (1),

(2)

where

E x p (1)

is the exponential distribution with the rate parameter as 1.

If we further assume that the intensity function is constant between two consecutive events, i.e.,

λ (t) = λ_{i}

between

t_{i - 1}

and

t_{i}

, then the above property becomes

y_{i} λ_{i} \sim i . i . d E x p (1)

, where

y_{i} = t_{i} - t_{i - 1}

is the event duration between

t_{i - 1}

and

t_{i}

. Rearranging the equation, we will have the mixture-of-exponential representation.

y_{i} = \frac{ϵ_{i}}{λ_{i}}, ϵ_{i} \sim i . i . d E x p (1) .

(3)

Chen et al. [9] also arrives at the same result by interpreting a point process as a dynamic, uncountable set of independent Bernoulli trials.

2.2. The Benchmark ACI Model

The benchmark autoregressive conditional intensity (ACI) model was proposed by [18]. It presents dynamic parameterizations of the intensity function in continuous time, which allows updating the intensity process whenever required. The intensity is characterized in the the following form:

λ (t, F (t)) = ψ (t) λ_{0} (t) s (t),

(4)

which is driven by three components: one component

ϕ (t)

capturing the dynamic structure, a baseline intensity component

λ_{0} (t)

, and a seasonal periodicity component

s (t)

.

The core part is to model dynamic component

ψ (t)

, which is given by the following:

ψ (t) = exp ({\tilde{ψ}}_{N (t) + 1} + z_{N (t)}^{T} γ)

(5)

where

z_{N (t)}

is the vector of covariates that is collected at the time of the preceding event (say

t_{i - 1}

),

γ

is the vector of coefficients for these covariates, and

{\tilde{ψ}}_{N (t) + 1}

is a piecewise-constant dynamic component between

i - 1

and i events. This piecewise-constant component follows a form of ARMA(

1, 1

):

{\tilde{ψ}}_{i} = c + α {\tilde{ε}}_{i - 1} + β {\tilde{ψ}}_{i - 1},

(6)

where

β

is the persistence parameter, and

α

is the coefficient associated with the innovation term

{\tilde{ε}}_{i - 1}

. The innovation term is specified as follows.

{\tilde{ε}}_{i} : = 1 - ε_{i} = 1 - Λ (t_{i - 1}, t_{i}) .

Based on the theory of point process, the probability of events occurring at time

t_{1,} t_{2,} \dots, t_{n}

is

\prod_{i = 1}^{n} λ (t_{i}) \cdot exp [- \int_{t_{i - 1}}^{t_{i}} λ (t) d t]

. Hence, the log-likelihood of the observation of events is as follows.

ln L = \sum_{i = 1}^{n} [ln λ (t_{i}) - \int_{t_{i - 1}}^{t_{i}} λ (t) d t] .

(7)

Specifically, for this quote volatility measurement, set the baseline intensity

λ_{0} (t) = exp (ω) x {(t)}^{a - 1}

, where

x (t)

is the backward recurrence time at t and is defined as

x (t) = t - t_{N (t)}

.

t_{N (t)}

is the nearest backward event time, and

x (t_{i}) = t_{i} - t_{i - 1}

. For the seasonal factor

s (t)

, we can use 1 h intervals and set it as piecewise linear within one interval:

s (t) = s ({\hat{t}}_{k - 1}) + b_{k} (t - {\hat{t}}_{k - 1}), {\hat{t}}_{k - 1} \leq t \leq {\hat{t}}_{k}

where

{\hat{t}}_{k}, k = 1, \dots, 6

are the interval cutting time and we set

s ({\hat{t}}_{0}) = 1

.

Therefore, the parameters in this simple ACI model without covariates are as follows:

c, α, β, ω, a, b_{1}, \dots, b_{6}

and we can use the maximum likelihood (MLE) for estimation.

2.3. The Change-Point Model for Quote Volatility

For the quote volatility measurement in an HF environment, we propose a new change-point intensity (CPI) model, following [22,23]. Similarly to other intensity-based duration models, we think that the price duration follows an exponential distribution that has the price-change intensity as its rate parameter

λ

. In our case, the shift of intensity is a change point, which may respond to market environment fluctuation, liquidity shocks, and a myriad of perceived changes of information on the stock. Allowing intensity to change over time can account for relatively long periods punctuated by extremely short periods observed in the time series.

According to the mixture of exponential representation in the Equation (3), the duration is as follows:

y_{t} = \frac{ε_{t}}{λ_{t}},

(8)

where

ε_{t}

follows an i.i.d.

E x p (1)

. The conditional intensity follows a Markov change-point process with the renewing distribution

G (\cdot)

.

λ_{t + 1} = \{\begin{matrix} λ_{t} & w . p . 1 - p, \\ λ_{t + 1} \sim G (\cdot) & w . p . p . \end{matrix}

(9)

It belongs to the class of change point process because the underlying intensity

λ_{t}

undergoes occasional changes. It is also a Markov process because the value of future intensity only depends on the current state, i.e., either keep the same value as the current intensity with probability

1 - p

or randomly draw from a fixed distribution

G (\cdot)

with probability p.

Given

G (\cdot)

defined in a continuous and infinite space, our model can generate much greater flexibility by using a very small numbers of parameters. Specifically, in our model, we assume that the renewing distribution is a Gamma distribution, and the p.d.f is as follows:

\begin{matrix} G (λ_{t}) = Gamma (λ_{t}; α, β) & = \frac{β^{α} λ_{t}^{α - 1} e^{- λ_{t} β}}{Γ (α)} \\ = Z (α, β) \cdot λ_{t}^{α - 1} e^{- λ_{t} β} \end{matrix}

(10)

where

α

and

β

are the shape and rate parameter of Gamma distribution, respectively, and

Z (α, β) = \frac{β^{α}}{Γ (α)}

is a defined function of

α

and

β

.

This model can also be view as a conditional mean duration model ( similarly to the traditional ACD model) in terms of the following:

y_{t} = M_{t} ε_{t},

(11)

where

M_{t} = \frac{1}{λ_{t}}

is the conditional mean level for the inter-trade duration, and

M_{t}

also follows Markov change-point process:

M_{t + 1} = \{\begin{matrix} M_{t} & w . p . 1 - p \\ M_{t + 1} \sim F (\cdot) & w . p . p \end{matrix}

(12)

where

F (\cdot)

is the renewing distribution of

M_{t}

.

The economic intuition of our CPI model is as follows. In the period without a change point (with probability

1 - p

), it is plausible to think that the market is behaving consistently and the price-change intensity remains unchanged. At this time, quote volatility is constant. However, when the market environment changes or there is some liquidity/informational shock (with probability p) on the stock, the evolution of quote price enters a new state and the price-change intensity changes, and correspondingly, the quote volatility becomes a new value.

2.4. Model Comparison

In this part, we show some comparative structures of our CPI model, using the ACI model as an anchoring benchmark.

The price-change intensity in our model is set to be constant within the interval of price updates while it may have a sudden jump to a new level for subsequent price-change events. We think that this setting is reasonable for ultra-high-frequency data as the time interval of quote-price updates is short, for which we can ignore the dynamics in between, but more importantly, it can help us to circumvent the complex estimation of the time structure of the underlying intensity. Nevertheless, when there is some exogenous shock (information or liquidity shock), the quote-price change intensity can become a new one, and correspondingly, quote volatility changes.
Although, in general, the quote-price change duration is short, it may also have a large dispersion as the shortest duration can proceed to the extent of microseconds while the longest duration could be a couple of seconds. This will become a potential problem for the ACI model as the evolution of price-change intensity in the ACI model is relatively smooth. (In the ACI model, in addition to the baseline component $λ_{0} (t)$ and the seasonal component $s (t)$ , the dynamic component $ϕ (t)$ follows an ARMA structure.) However, in our change-point intensity model, the new level of intensity is drawn from a continuous distribution $G (λ)$ . Hence, it allows drastic changes of intensity, depending on the value of distribution parameters.
Our model is also feasible to be extended to a framework of incorporating other influencing factors in determining the change of quote volatility. For example, we can allow intensity renewal probability p to be time varying and to be driven by some other factors:

$log \frac{p_{t}}{1 - p_{t}} = x_{t - 1}^{T} β$

where $x_{}$ is a $k \times 1$ vector of influencing factors, and $β$ is a $k \times 1$ vector of the corresponding factor loadings (or the regression coefficients). By having this structure, we can further analyze whether some factors can result in a high probability of changing the quote volatility. Nonetheless, this part of extension is beyond the scope of this paper and deserves a further development.

3. Model Estimation and Simulation

3.1. Model Estimation

For the change-point model introduced in Section 2.3, the complete log-likelihood function is as follows:

l_{c} ({y_{1 n}, λ_{1 n}}) = log P (λ_{1}) + \sum_{t = 1}^{n} log f (y_{t} | λ_{t}) + \sum_{t = 2}^{n} log P (λ_{t} | λ_{t - 1}),

(13)

where

y_{1 n}

is the sequence of observed event durations

{y_{t}}_{t = 1, \dots, n}

, and

λ_{1 n}

is the sequence of underlying intensities

{λ_{t}}_{t = 1, \dots, n}

.

P (λ_{1})

is probability that the initial intensity is

λ_{1}

,

f (y_{t} | λ_{t})

is the probability density of

y_{t}

given the current intensity is

λ_{t}

, and

P (λ_{t} | λ_{t - 1})

is the conditional probability of

λ_{t}

given the previous intensity is

λ_{t - 1}

. In our model, the above equation is equivalent to the following:

\begin{matrix} l_{c} ({y_{1 n}, λ_{1 n}}) & = & log G (λ_{1}) + \sum_{t = 1}^{n} log f (y_{t} | λ_{t}) \\ + \sum_{t = 2}^{n} [log G (λ_{t}) \cdot 1_{(I_{t} = 1)} + log p \cdot 1_{(I_{t} = 1)} + log (1 - p) \cdot 1_{(I_{t} = 0)}], \end{matrix}

(14)

because

log P (λ_{1}) = log G (λ_{1})

as we think the initial state is independently drawn from the Gamma distribution, and conditional probability

P (λ_{t} | λ_{t - 1})

can be derived based on two possible situations, i.e., with probability p it renews from

G (\cdot)

, and with probability

1 - p

, it keeps unchanged.

I_{t} = 1

means there is a change-point at

t

-th trade, i.e.,

λ_{t} \neq λ_{t - 1}

, and

1_{(I_{t} = 1)}

is a indexing function that returns 1 when

I_{t} = 1

and otherwise returns 0. Similarly,

1_{(I_{t} = 0)}

equals 1 if there is no change-point at

t

-th trade, and 0 if there is a change-point at

t

-th trade.

The parameters to be estimated in the change-point model are

α

,

β

, and p. The sequence of price-change durations

y_{1 n}

is known to us, however, we cannot observe the actual values of hidden variables

λ_{1 n}

. Thus, directly maximizing the complete log-likelihood is infeasible, and we need to use the Expectation Maximization (EM) method for model estimation.

3.1.1. Expected Likelihood

In the E-step, the expected log-likelihood, conditional on

D = {y_{1 n}, parameters of

last iteration}

, is as follows:

\begin{matrix} E (l_{c} ({y_{1 n}, λ_{1 n}}) | D) & = & E (log G (λ_{1}) | D) + \sum_{t = 1}^{n} E (log f (y_{t} | λ_{t}) | D) \\ + \sum_{t = 2}^{n} E (log G (λ_{t}) \cdot 1_{(I_{t} = 1)} | D) \\ + \sum_{t = 2}^{n} [log p \cdot P (I_{t} = 1 | D) + log (1 - p) \cdot P (I_{t} = 1 | D)], \end{matrix}

(15)

where the expectation is taken over the posterior distribution of hidden variables

λ_{i}

.

As

G (λ)

is a Gamma distribution, it is a conjugate prior for the exponential distribution. Hence, the posterior distribution of

λ_{i}

is also a Gamma distribution. We have shown in the Appendix A that the posterior distributions of

λ_{t}

is as follows:

f (λ_{t} | D) = \sum_{1 \leq i \leq t \leq j \leq n} Π_{i t j} \cdot g_{i j} (λ_{t}),

(16)

where

g_{i j} (λ) \sim Gamma (α + (j - i + 1), β + \sum_{i}^{j} y_{t})

and

Π_{i t j}

is the posterior probability that the last change point occurs at i and the next change point occurs at

j + 1

, which means the current state of

λ_{t}

starts from i and ends at j. The steps for calculating

Π_{i t j}

will also be shown in the Appendix A. Moreover, the following is the case:

P (I_{t + 1} = 1 | D) = \sum_{1 \leq i \leq t} Π_{i t t} P (I_{t + 1} = 0 | D) = 1 - P (I_{t + 1} = 1 | D)

(17)

where

t \in [1, N - 1]

, and we also set

P (I_{1} = 1 | D) \equiv 1

.

Therefore, in Equation (15), we have the following.

\begin{matrix} E (log G (λ_{1}) | D) & = & \int_{λ_{1}} log G (λ_{1}) \cdot f (λ_{1} | D) d λ_{1} \\ = & \int_{λ_{1}} (log Z (α, β) + (α - 1) log λ_{1} - β λ_{1}) \cdot (\sum_{1 \leq i \leq 1 \leq j \leq n} Π_{i 1 j} \cdot g_{i j} (λ_{1})) d λ_{1} \end{matrix}

(18)

\begin{matrix} E (log f (y_{t} | λ_{t}) | D) & = & \int_{λ_{t}} log f (y_{t} | λ_{t}) \cdot f (λ_{t} | D) d λ_{t} \\ = & \int_{λ_{t}} (log λ_{t} - λ_{t} y_{t}) \cdot \sum_{1 \leq i \leq t \leq j \leq n} Π_{i t j} \cdot g_{i j} (λ_{t}) \cdot d λ_{t} \end{matrix}

(19)

\begin{matrix} E (log G (λ_{t}) \cdot 1_{(I_{t} = 1)} | D) & = & \int log G (λ_{t}) \cdot f (λ_{t}, I_{t} = 1 | D) d λ_{t} \end{matrix}

Since

f (λ_{t} | D) = \sum_{1 \leq i \leq t \leq j \leq n} Π_{i t j} \cdot g_{i j} (λ_{t})

, thus

f (λ_{t}, I_{t} = 1 | D) = \sum_{t \leq j \leq n} Π_{t t j} \cdot g_{t j} (λ_{t})

. Hence, we have the following.

E (log G (λ_{t}) \cdot 1_{(I_{t} = 1)} | D) = \sum_{t \leq j \leq n} Π_{t t j} \int log G (λ_{t}) \cdot g_{t j} (λ_{t}) d λ_{t}

(20)

3.1.2. Maximization and Parameters’ Update

Once we have the expected log-likelihood in (15) and write it as a function form with arguments

(α, β, p)

:

l_{E C} (α, β, p) \equiv E (l_{c} ({y_{1 n}, m_{1 n}}) | D),

(21)

we can perform our maximization step in EM and update our estimations of model parameters.

As only the last two items in (15) contains parameter p, therefore, by first order maximization, we have the following.

\begin{matrix} \frac{\partial l_{E C} (α, β, p)}{\partial \hat{p}} & = & \frac{1}{\hat{p}} \sum_{t = 2}^{n} P (I_{t} = 1 | D) - \frac{1}{1 - \hat{p}} \sum_{t = 2}^{n} P (I_{t} = 0 | D) \\ = & 0 \end{matrix}

Thus, due to the following:

\hat{p} = \frac{\sum_{t = 2}^{n} P (I_{t} = 1 | D)}{n - 1}

(22)

we obtain the new value of

\hat{p}

.

The first and third items in

l_{E C} (α, β, p)

contain parameters

(α, β)

:

\begin{matrix} E (log G (λ_{1}) | D) + \sum_{t = 2}^{n} E (log G (λ_{t}) \cdot 1_{(I_{t} = 1)} | D) \\ = & \int_{λ_{1}} (log G (λ_{1})) \cdot (\sum_{1 \leq i \leq 1 \leq j \leq n} Π_{i 1 j} \cdot g_{i j} (λ_{1})) d λ_{1} + \sum_{t = 2}^{n} \int_{λ_{t}} (log G (λ_{t})) \cdot (\sum_{t \leq j \leq n} Π_{t t j} \cdot g_{t j} (λ_{t})) d λ_{t} \\ = & \sum_{t = 1}^{n} \int_{λ_{t}} log G (λ_{t}) \cdot (\sum_{t \leq j \leq n} Π_{t t j} \cdot g_{t j} (λ_{t})) d λ_{t} \\ = & \sum_{t = 1}^{n} \int_{λ_{t}} (log Z (α, β) + (α - 1) log λ_{t} - β λ_{t}) \cdot (\sum_{t \leq j \leq n} Π_{t t j} \cdot g_{t j} (λ_{t})) d λ_{t} \\ = & \sum_{t = 1}^{n} [\sum_{t \leq j \leq n} (Π_{t t j} \int_{λ_{t}} (log Z (α, β) + (α - 1) log λ_{t} - β λ_{t}) g_{t j} (λ_{t}) d λ_{t})] \\ = & log Z (α, β) \cdot A + (α - 1) \cdot B - β \cdot C \\ = & [α log β - log Γ (α)] \cdot A + (α - 1) \cdot B - β \cdot C, \end{matrix}

(23)

where

A = \sum_{t = 1}^{n} (\sum_{t \leq j \leq n} Π_{t t j}) = \sum_{t = 1}^{n} P (I_{t} = 1 | D)

B = \sum_{t = 1}^{n} [\sum_{t \leq j \leq n} Π_{t t j} \cdot \int_{λ_{t}} log λ_{t} \cdot g_{t j} (λ_{t}) d λ_{t}]

C = \sum_{t = 1}^{n} [\sum_{t \leq j \leq n} Π_{t t j} \cdot \int_{λ_{t}} λ_{t} \cdot g_{t j} (λ_{t}) d λ_{t}]

Moreover, in this case, we can have a nice result in B.

\begin{matrix} \sum_{t \leq j \leq n} Π_{t t j} \cdot \int_{λ_{t}} log λ_{t} \cdot g_{t j} (λ_{t}) d λ_{t} & = & \sum_{t \leq j \leq n} Π_{t t j} \cdot E_{g_{t j}} [log λ_{t}] . \end{matrix}

And because

g_{t j} (λ) \sim Gamma (α + (j - t + 1), β + \sum_{t}^{j} y_{s})

,

E_{g_{t j}} [log λ_{t}] = ψ (α_{o l d} + j - t + 1) - log (β_{o l d} + \sum_{t}^{j} y_{s})

, where

ψ (\cdot)

is the Digamma function (first order derivative of log-gamma function):

\begin{matrix} ψ (α) & = & \frac{d log Γ (α)}{d α} \\ = & \frac{Γ^{'} (α)}{Γ (α)} \\ = & - γ - \sum_{k = 0}^{\infty} (\frac{1}{α + k} - \frac{1}{k + 1}) \end{matrix}

and constant

γ ≃ 0.5772156649

.

Similarly, in C, we have the following.

\sum_{t \leq j \leq n} Π_{t t j} \cdot \int_{λ_{t}} λ_{t} \cdot g_{t j} (λ_{t}) d λ_{t} = \sum_{t \leq j \leq n} Π_{t t j} \cdot E_{g_{t j}} [λ_{t}] .

Because

g_{t j} (λ) \sim Gamma (α + (j - t + 1), β + \sum_{t}^{j} y_{s})

, we have

E_{g_{t j}} [λ_{t}] = \frac{α_{o l d} + j - t + 1}{(β_{o l d} + \sum_{t}^{j} y_{s})}

.

Then, substitute (23) into (21), and we can obtain the following,

\begin{matrix} \frac{\partial l_{E C} (α, β, p)}{\partial \hat{β}} & = & \frac{A \cdot \hat{α}}{\hat{β}} - C = 0 \\ ⟹ \hat{α} & = & \frac{C}{A} \hat{β}, \end{matrix}

(24)

and

\begin{matrix} \frac{\partial l_{E C} (α, β, p)}{\partial \hat{α}} & = & 0 \\ ⟹ A \cdot log \hat{β} - A \cdot ψ (\hat{α}) + B & = & 0 . \end{matrix}

(25)

Thus, combining the last two Equations (24) and (25), we can solve the new value of

\hat{α}

and

\hat{β}

. In addition to the result of

\hat{p}

in (22), we can perform next EM iteration until the convergence of estimators.

3.1.3. Inference of $λ_{t}$

We cannot obtain the accurate value of

λ_{t}

; however, we can use the posterior mean as its estimator, i.e.,

{\hat{λ}}_{t}

. Therefore, the following is the case.

\begin{matrix} {\hat{λ}}_{t} & = & \int λ_{t} \cdot f (λ_{t} | D) d λ_{t} \\ = & \int λ_{t} \cdot \sum_{1 \leq i \leq t \leq j \leq n} Π_{i t j} g_{i j} (λ_{t}) d λ_{t} \\ = & \sum_{1 \leq i \leq t \leq j \leq n} Π_{i t j} \cdot \int λ_{t} \cdot g_{i j} (λ_{t}) d λ_{t} \\ = & \sum_{1 \leq i \leq t \leq j \leq n} Π_{i t j} \cdot E_{g_{i j}} [λ_{t}] \end{matrix}

Since

g_{i j} (m) \sim Gamma (\hat{α} + (j - i + 1), \hat{β} + \sum_{i}^{j} y_{s})

,

E_{g_{i j}} [m_{t}] = \frac{\hat{α} + j - i + 1}{(\hat{β} + \sum_{i}^{j} y_{s})}

. Thus, the following is the case.

{\hat{λ}}_{t} = \sum_{1 \leq i \leq t \leq j \leq n} Π_{i t j} \cdot \frac{\hat{α} + j - i + 1}{(\hat{β} + \sum_{i}^{j} y_{s})} t = 1, \dots, n

(26)

3.2. Simulation

In order to validate the change-point model and the corresponding estimation algorithm, we first simulated data that provided us with the true values of model parameters for comparison. We have simulated 7000 points of price-change events, and the parameters are

α = 5.0

(shape parameter in Gamma),

β = 2.0

(rate parameter in Gamma), and

p = 0.018

(the change point probability). Given this data generating process of the price-change intensity, we plot the simulated durations of quote-price changes in Figure 1. We also plot the inverse of simulated intensity

λ_{t}

, which is mean duration

m_{t}

in Figure 1. From the graph, we can observe that there is a large variation of event duration, i.e., the longest duration is more than 12 s, while the short duration is about

1 \times 10^{- 3}

s. This means that sometimes it takes a couple of seconds for a change in quote price, while sometimes it only needs a millisecond for a price change. Clearly, the volatility of this simulated quote price is not constant.

Figure 1. The simulated quote-price change durations and price-change intensities (first 1000 points).

We use the estimation method shown in Section 3.1 to estimate model parameters. We have performed our estimation by using the first 1000 points, 4000 points, and the entire sample, respectively. The results are presented in Table 1, and we observe that the estimations are close to real values, especially for the large sample. Moreover, we have plotted simulated intensities and estimated intensities, together with simulated inter-trade durations in Figure 2 for

N_{s a m p l e} = 7000

, and we find that the estimated intensities of price changes are also close to the ones in the simulated path.

Table 1. Estimation results for the simulated data, which has

α = 5.0

,

β = 2.0

, and

p = 0.018

.

Figure 2. Comparison between actual intensity in the simulation and estimated intensity from model estimation.

4. Data Environment

Our data are downloaded from LOBSTER (https://lobsterdata.com/ accessed on 2 February 2018), which provides high-quality LOB data of all Nasdaq stocks from June 2007. The LOB data reconstructed by LOBSTER are based on Nasdaq’s Historical TotalView-ITCH data, i.e., the historic record of what Nasdaq calls. LOBSTER simultaneously generates two files for each active trading day of a selected ticker. One is a ‘message’ file, which contains indicators for the type of event causing an update of LOB in the requested price range. The other is an ‘orderbook’ file, which records the ask and bid quotes of LOB at the time when the ‘message’ file is updated.

Table 2 shows a sample of LOB ‘messages’ and ‘orderbook’ files of AMZN on 2 January 2013. We show three events of the LOB. In panel A of Table 2, which is the ‘messages’ file, type 3 event and type 1 event represent a deletion and submission of a limit order, respectively. The direction ‘−1’ means the order event from the ask side, and ‘1’ denotes the bid-side order. In the meantime, the ‘orderbook’ file in panel B of Table 2 records the ‘shape’ of LOB after these three events. From this, we can observe that after submission of a new order at the bid side with a higher price than the existing best bid, the new best bid changes from 2550700 to 2550800, i.e., $255.07 to $255.08.

Table 2. The message file and order book file of LOBSTER data.

Figure 3 plots the evolution of quote price at the best bid in the first 50 s of AMZN stock on 2 January 2013. From this, we can observe that there is non-constant variation of bid price, with some intervals being much more volatile than others. A liquidity demander with 1 s of latency will encounter a high uncertainty and cost when she posts a (sell-side) market order at

t = 20

or

t = 30

, compared with the entering time at

t = 40

. Therefore, we need a method to effectively quantify the volatility of the quote price at any time and also the cost of demand for traders with different trading latencies.

Figure 3. The evolution of best bid price in the first 50 s of AMZN stock on 2 January 2013.

5. Model Fitness and In-Sample Analysis

5.1. Model Fit and Instantaneous Volatility

We use AMZN LOB data on 2 January 2013 to illustrate model fitness and to obtain the measurement of quote volatility. We choose the threshold of price change as 3 cents, which is three times the minimum unit of quote price. In Figure 4, we plot the series of quote-price duration (at the best bid level) of AMZN stock on 2 January 2013, given the threshold of price change as 3 cents. There is a large variation for this level of price duration. The shortest duration is

10^{- 5}

s, while the longest duration is about 80 s.

Figure 4. The quote price duration of the best bid price for the AMZN stock on 2 January 2013.

We first use our change-point intensity model (CPI) for estimation. The results are

\hat{α} = 0.25

,

\hat{β} = 0.006

, and

\hat{p} = 0.34

. Moreover, we can infer underlying intensities

{λ_{t}}_{1, \dots, N}

according to Equation (26). The instantaneous volatility can be derived according to Equation (1). Specifically, for time

t \in [t_{i}, t_{i + 1})

, we have the following:

σ^{2} (t) = λ (t_{i}) {(\frac{δ}{p (t_{i})})}^{2} t \in [t_{i}, t_{i + 1})

because, in the change-point model, intensity is assumed to be constant between two events. In Figure 5, we plot the fitted instantaneous quote volatility of the best bid price for the first 50 s of AMZN on the trading day of 2 January 2013. From the graph, we can observe that instantaneous quote volatility jumps to a high level when the quote price changes dramatically in a short time. On the other hand, volatility stayed at a low level if the quote price maintains fixed or changes steadily.

Figure 5. The instantaneous quote volatility of the best bid price for AMZN stock on 2 January 2013 calculated by the change-point model.

Moreover, we supplement the quote volatility estimation results by the ACI model in Figure 6, from which we can observe that ACI volatility exhibits strong spikes at the point when the quote price suddenly changes over a threshold. Furthermore, we provide the comparison between the models’ fitted residuals in Figure 7 and Figure 8. According to the property of the integrated intensity function shown in Equation (2) and the mixture of exponential expression in Equation (3), we should have the model’s fitted residuals follow an Exponential(1) distribution. From the results, we can clearly observe that the CPI model has a better result in terms of fitness.

Figure 6. The instantaneous quote volatility of the best bid price for AMZN stock on 2 January 2013 calculated by the ACI model.

Figure 7. The distribution of duration residuals of CPI.

Figure 8. The distribution of duration residuals of ACI.

5.2. Integrated Variance and Cost of Demand

Denote

p_{t}

as the logarithm price of the best bid quote or the best ask quote. Under the assumption of no arbitrage, we suppose it follows a continuous semi-martingale process:

p_{t} = p_{0} + \int_{0}^{t} μ (τ) d τ + \int_{0}^{t} σ (τ) d W_{τ}

where W denotes a standard Wiener process,

μ (τ)

is a finite càdlàg drift process, and

σ (τ)

is an adapted càdlàg volatility process associated with the instantaneous conditional mean and volatility of the corresponding return.

The integrated variance over a interval

[0, t]

is as follows:

I V (0, t) : = \int_{0}^{t} σ^{2} (τ) d W_{τ},

which also equals to the quadratic variation of a process corresponding to the sum of its squared increments measured on infinitesimal intervals. Hence, it is a natural quantity reflecting the riskiness of an asset over a given time span. Thus, we can use the derived the instantaneous volatility to calculate the the integrated variance.

For liquidity demanders with different trading latency

Δ_{i}

, the integrated variances are specifically as follows.

I V (t, t + Δ_{i}) : = \int_{t}^{t + Δ_{i}} σ^{2} (τ) d W_{τ} .

Therefore, the trading cost not only depends on latency

Δ_{i}

but also on the magnitude of the (instantaneous) volatility over the interval. In Table 3, we calculate the integrated variance for three types of trades, with trading latency of 0.01 s, 1 s, and 5 s, respectively. Clearly, a low-latency trader who has speed advantages in sending orders will encounter a lower risk to obtain her liquidity. For example, the mean value of the volatility of the transaction prices for a trader whose trading latency is 5 s is 0.035, while the value for a trader whose trading latency is 0.01 s is just

7.6 \times 10^{- 5}

.

Table 3. The integrated variances for different traders that calculated for AMZN stock on 2 January 2013.

Moreover, in the below figures (Figure 9, Figure 10 and Figure 11), we plot the fitted standard deviation (the square root of the integrated variance) of the bid price for the AMZN stock on 2 January 2013 in the first 50 s for three types of traders, for which their trading latencies are 0.01 s, 1 s, and 5 s, respectively. Compared with the ACI model, the price standard deviation estimated from our CPI model is relatively smooth, especially for the low-latency environment, which helps us to be more effective in evaluating the dynamics of quote volatility for HF traders. As the the evaluating time window increases, say the price standard deviation for 5 s, both models obtain similar results.

Figure 9. The fitted 0.01 s price standard deviation of the best bid price for AMZN stock on 2 January 2013.

Figure 10. The fitted 1 s price standard deviation of the best bid price for AMZN stock on 2 January 2013.

Figure 11. The fitted 5 s price standard deviation of the best bid price for AMZN stock on 2 January 2013.

We can quantify the cost of demand for a specific type of traders at different time points when initiating her orders. We use the price standard deviation as the cost of demanding liquidity. For example, for a trader whose trading latency is 1 s, she will suffer price uncertainty of USD 0.089 when she enters the market at

t = 10

s. While when this trader enters the market at

t = 20

s, the price uncertainty of her transaction price is about USD 0.289.

6. Out-of-Sample Performance and Model Prediction Power

At last, we want to examine the out-of-sample performance of our CPI model. However, it is hard to predict the volatility directly and to observe the model’s performance because actual volatility is unobserved. Therefore, we only predict the duration length for the next price change as the actual price-change duration is known to us. When the duration for the change of quote price is long, quote volatility is low. On the other hand, when the duration for the change of quote price is short, quote volatility should be high.

We use one-step-ahead forecasting based on the model’s in-sample estimation results. Specifically, for the quote prices (at the best bid) of the AMZN stock on 2 January 2013, we use the first 4000 data points (which are the 4000 events of price changes) for parameter estimation and perform a one-step-ahead prediction for the remaining observations of price durations. The expected price-change duration for the quote price can be derived as follows.

\begin{matrix} E_{t} (y_{t + 1}) & = & \frac{1}{E_{t} (λ_{t + 1})} \\ = & \frac{1}{\hat{p} \cdot \hat{λ_{t}} + (1 - \hat{p}) \cdot \frac{\hat{α}}{\hat{β}}} . \end{matrix}

(27)

This is because, in the CPI model, the intensity for the next price change either retains its past value

λ_{t}

(with probability

\hat{p}

) or renews from the Gamma distribution (with probability 1 −

\hat{p}

), and the mean value of the Gamma distribution is

\frac{\hat{α}}{\hat{β}}

.

We test the model’s out-of-sample performance by using the Mincer–Zarnowitz ordinary least squares (OLS) regressions.

y_{t + 1} = β_{0} + β_{1} E_{t} (y_{t + 1}) + μ_{t},

(28)

where

β_{0}

is the regression intercept,

β_{1}

is the regression coefficient for

E_{t} (y_{t + 1})

, and

μ_{t}

is the regression error term.

Moreover, we compare the model’s performance with the ACI model, and the results are shown in Table 4. From the results, we can observe that coefficients

β_{1}

are significantly positive in both models, suggesting that the predictions from both the CPI and ACI models can explain the variation of the real value of price-change durations. Nevertheless, our CPI model is significantly better in terms of prediction power because the R-squared in the fitting of CPI model is much higher.

Table 4. Mincer–Zarnowitz OLS results for CPI and ACI models.

7. Conclusions

This paper has proposed a new method to measure the volatility of quote prices in the limit order market, which is important to quantify the cost of demanding liquidity in the HF trading environment. We use the point process to describe price-change events that occur at the best quote level (at the best bid or the best ask), and volatility is measured based on the inference of price-change intensity according to realized price-change durations. In particular, we resort to the change-point model proposed by Lai et al. [23] to describe the dynamics of price-change intensity and name it as the change-point intensity (CPI) model. In the model, the underlying price-change intensity follows a Markov process, i.e., either maintains its past value or renews from a Gamma distribution. Thus, we can use the data of price-change durations to infer the underlying price-change intensity and further calculate quote volatility based on the method proposed by Engle and Russell [5].

We apply the CPI model to study the quote volatility of the AMZN stock on 2 January 2013. Specifically, we choose the threshold of price change as 3 cents to define the price-change event and construct the series of price-change durations. The instantaneous quote volatility at any time of the trading day can be derived from the estimated price-change intensity by our CPI model. Furthermore, we have calculated the cost of demand for traders with different trading latency based on the integrated variance. In addition, we compare both the in-sample fitness and out-of-sample prediction power with the benchmark ACI model by Russell [18], and the results suggest that our model performs better.

Our work has made progress in modeling HF quote volatility. Nonetheless, it leaves much room for future development. The current CPI model is a univariate structure that studies the dynamics of quote volatility itself. We can further extend it by incorporating other factors in determining the changes of quote volatility. This can be performed by setting intensity renewal probability p to be time-varying and to be driven by some other factors.

Author Contributions

Conceptualization, Z.L. and H.X.; methodology, H.X.; software, Z.L.; validation, Z.L. and H.X.; formal analysis, Z.L.; investigation, Z.L.; resources, H.X.; data curation, Z.L.; supervision, H.X.; writing, original draft, Z.L.; writing, review and editing, Z.L. and H.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data were obtained from LOBSTER (https://lobsterdata.com/ accessed on 2 February 2018).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. The Posterior Distribution of $λ_{t}$ in EM

Appendix A.1. Foward–Backward Filter

Here, we first show the general filter to update the posterior distribution of

λ_{t}

given

{y_{1}, \dots, y_{n}}

.

Appendix A.1.1. Forward Filter

Denote

R_{t} = max {k | I_{k} = 1, for k \leq t}

, then given

(y_{1 t}, R_{t} = s),

we have the density of the posterior distribution of

λ_{t}

.

g_{s t} (λ_{t}) ≜ f (λ_{t} | y_{t}, R_{t} = s) \propto \prod_{i = s}^{t} f (y_{i} | λ_{t}) G (λ_{t})

(A1)

Proposition A1.

The posterior distribution of

λ_{t}

given

y_{1 t}

can be expressed as follows:

f (λ_{t} | y_{1 t}) = \sum_{i = 1}^{t} p_{i t} \cdot g_{i t} (λ_{t})

(A2)

where

p_{i t} = P (R_{t} = i | y_{1 t}) .

The weight coefficients of the above mixture can be calculated recursively by the following:

p_{i t} = \frac{p_{i t}^{*}}{\sum_{s = 1}^{t} p_{s t}^{*}} for i = 1, \dots, t

and with the following.

p_{i t}^{*} = \{\begin{matrix} p \cdot f_{t t} / f_{00} & i = t \\ (1 - p) p_{i, t - 1} \cdot f_{i t} / f_{i, t - 1} & i < t \end{matrix}

(A3)

Proof.

We note that the following is the case.

\begin{matrix} f (λ_{t} | y_{1 t}) & \propto & f (λ_{t}, y_{t} | y_{1, t - 1}) \\ = & P (I_{t} = 1) \cdot f (λ_{t}, y_{t} | y_{1, t - 1}, I_{t} = 1) + P (I_{t} = 0) \cdot f (λ_{t}, y_{t} | y_{1, t - 1}, I_{t} = 0) \\ = & p \cdot f (λ_{t}, y_{t}) + (1 - p) \cdot f (λ_{t}, y_{t} | y_{1, t - 1}, I_{t} = 0) \\ = & p f (y_{t}) \cdot f (λ_{t} | y_{t}) + (1 - p) \sum_{i = 1}^{t - 1} f (λ_{t}, y_{t}, R_{t} = i | y_{1, t - 1}, I_{t} = 0) \\ = & p f (y_{t}) \cdot f (λ_{t} | y_{t}) + (1 - p) \sum_{i = 1}^{t - 1} \{P (R_{t} = i | y_{i, t - 1}, I_{t} = 0) \cdot f (λ_{t}, y_{t} | y_{1, t - 1}, I_{t} = 0, R_{t} = i)\} \\ = & p f (y_{t}) \cdot f (λ_{t} | y_{t}) + (1 - p) \sum_{i = 1}^{t - 1} \{p_{i, t - 1} f (y_{t} | y_{1, t - 1}, I_{t} = 0, R_{t} = i) f (λ_{t} | y_{1, t}, I_{t} = 0, R_{t} = i)\} \end{matrix}

From definition (A1), we could observe that

f (λ_{t} | y_{t}) = g_{t t} (λ_{t})

, and

f (λ_{t} | y_{1 t}, I_{t} = 0,

R_{t} = i) = g_{i t} (λ_{t})

. Substituting it into last equation, we have the following.

f (λ_{t} | y_{1 t}) p f_{t t} \cdot g_{t t} (λ_{t}) + (1 - p) \sum_{i = 1}^{t - 1} \{p_{i, t - 1} f (y_{t} | y_{1, t - 1}, I_{t} = 0, R_{t} = i) g_{i t} (λ_{t})\}

(A4)

Moreover, by integrating

λ

, we have the following:

f (y_{t}) = \int f (y_{t} | λ) G (λ) d λ

\begin{matrix} f (y_{t} | y_{1, t - 1}, I_{t}, R_{t} = i) & = & \frac{f (y_{1 t} | I_{t} = 0, R_{t} = i)}{f (y_{1, t - 1} | I_{t} = 0, R_{t} = i)} \\ = & \frac{\int \prod_{s = i}^{t} f (y_{s} | λ) \cdot G (λ) d λ}{\int \prod_{s = i}^{t - 1} f (y_{s} | λ) \cdot G (λ) d λ} \\ = & \frac{f_{i t}}{f_{i, t - 1}} \end{matrix}

where

f_{i t} = \int \prod_{s = i}^{t} f (y_{s} | λ) \cdot G (λ) d λ

, and

f_{00} = 1

is the normalizing term. Substituting these back into

f (λ_{t} | y_{1 t})

in expression (A4), we can prove Proposition A1. □

Appendix A.1.2. Backward Filter

Then, we consider the case given the information after t. We define

{\tilde{R}}_{t + 1} = min {k | I_{k + 1}

= 1, k \geq t + 1}

, i.e., the nearest change point in backward direction is k (

λ_{k} \neq λ_{k + 1} \Leftrightarrow I_{k + 1} = 1

). Moreover, the change point probability

q_{t + 1, j} = P ({\tilde{R}}_{t + 1} = j | y_{t + 1, n})

is considered. Similarly, we have the below proposition.

Proposition A2.

The posterior distribution of

λ_{t}

given

y_{t + 1, n}

can be expressed as follows:

f (λ_{t} | y_{t + 1, n}) = p \cdot G (λ_{t}) + (1 - p) \sum_{j = t + 1}^{n} q_{t + 1, j} \cdot g_{t + 1, j} (λ_{t})

(A5)

where the following is the case:

q_{t + 1, j} = \frac{q_{t + 1, j}^{*}}{\sum_{s = t + 1}^{n} q_{t + 1, s}^{*}} for j = t + 1, \dots, n

and

q_{t + 1, j}^{*}

can be calculated recursively as follows.

q_{t + 1, j}^{*} = \{\begin{matrix} p \cdot f_{t + 1, t + 1} / f_{00} & j = t + 1 \\ (1 - p) q_{t + 2, j} \cdot f_{t + 1, j} / f_{t + 2, j} & j > t + 1 \end{matrix}

(A6)

Proof.

We first show that the following is the case.

f (λ_{t + 1} | y_{t + 1, n}) = \sum_{j = t + 1}^{n} q_{t + 1, j} \cdot g_{t + 1, j} (λ_{t})

(A7)

The steps are similar as those in the forward filter.

\begin{array}{l} f (λ_{t + 1} | y_{t + 1, n}) & \propto & f (λ_{t + 1}, y_{t + 1} | y_{t + 2, n}) \\ = & \sum_{j = t + 1}^{n} f (λ_{t + 1}, y_{t + 1}, {\tilde{R}}_{t + 1} = j | y_{t + 2, n}) \\ = & P ({\tilde{R}}_{t + 1} = t + 1 | y_{t + 2, n}) \cdot f (λ_{t + 1}, y_{t + 1} | y_{t + 2, n}, {\tilde{R}}_{t + 1} = t + 1) + \sum_{j = t + 2}^{n} f (λ_{t + 1}, y_{t + 1}, {\tilde{R}}_{t + 1} = j | y_{t + 2, n}) \\ = & p f_{t + 1, t + 1} \cdot g_{t + 1, t + 1} (m) + (1 - p) \sum_{j = t + 2}^{n} q_{t + 2, j} \cdot \frac{f (y_{t + 1}, n | {\tilde{R}}_{t + 1} = j)}{f (y_{t + 2}, n | {\tilde{R}}_{t + 1} = j)} \cdot g_{t + 1, j} (m) \\ = & \sum_{j = t + 1}^{n} q_{t + 1, j} \cdot g_{t + 1, j} (λ_{t}) . \end{array}

The last step is based on the definition and calculation of

q_{t + 1, j}

.

Then, since we have the following:

f (λ_{t} | y_{t + 1, n}) = p \cdot G (λ_{t}) + (1 - p) \cdot f (λ_{t + 1} | y_{t + 1, n}),

we obtain the result in Proposition A2. □

Appendix A.1.3. Combination (Forward–Backward Algorithm)

Proposition A3.

The posterior distribution of

λ_{t}

given

y_{1, n}

can be expressed as follows:

f (λ_{t} | y_{1 n}) = \sum_{1 \leq i \leq t \leq j \leq n} Π_{i t j} \cdot g_{i j} (λ_{t}),

(A8)

where

Π_{i t j} = Π_{i t j}^{*} / \sum_{1 \leq s \leq t \leq k \leq n} Π_{s t k}^{*}

, and the following is the case.

Π_{i t j}^{*} = \{\begin{matrix} p \cdot p_{i t} & j = t, 1 \leq i \leq t \\ (1 - p) p_{i t} \cdot q_{t + 1, j} \cdot \frac{f_{i j}}{f_{i t} f_{t + 1, j}} & j > t, 1 \leq i \leq t \end{matrix} .

(A9)

Moreover, we have the following.

Π_{i t j} = P (I_{i} = 1, I_{i + 1} = \dots I_{j} = 0, I_{j + 1} = 1 | y_{1 n})

Proof.

Now, we use the Bayes theorem to combine forward and backward filters.

\begin{matrix} f (λ_{t} | y_{1 n}) & \propto & G (λ_{t}) \cdot f (y_{1 n} | λ_{t}) \\ \propto & G (λ_{t}) \cdot f (y_{1 t} | λ_{t}) \cdot f (y_{t + 1, n} | λ_{t}) \\ \propto & f (λ_{t} | y_{1 t}) \cdot f (λ_{t} | y_{t + 1, n}) / G (λ_{t}) \\ = & [\sum_{i = 1}^{t} p_{i t} g_{i t} (λ_{t})] \cdot [\frac{p G (λ_{t}) + (1 - p) \sum_{j = t + 1}^{n} q_{t + 1, j} \cdot g_{t + 1, j} (λ_{t})}{G (λ_{t})}] \\ = & \sum_{i = 1}^{t} [p \cdot p_{i t} \cdot g_{i t} (λ_{t})] + (1 - p) \sum_{i = 1}^{t} \sum_{j = t + 1}^{n} [p_{i t} q_{t + 1, j} \cdot \frac{g_{i t} (λ_{t}) g_{t + 1, j} (λ_{t})}{G (λ_{t})}] \end{matrix}

Then, it is easy to show that the following is the case.

\begin{matrix} \frac{g_{i t} (λ_{t}) g_{t + 1, j} (λ_{t})}{G (λ_{t})} & = & \frac{f (λ_{t} | y_{i t}) \cdot f (λ_{t} | y_{t + 1, j})}{G (λ_{t})} = \frac{f_{i j}}{f_{i t} f_{t + 1, j}} g_{i j} (λ_{t}) \end{matrix}

Therefore, according to the definition of

Π_{i t j}

, we derive the result in Proposition A3. □

Appendix A.2. The Calculation Steps of Posterior Distributions

In the expression of posterior distribution (16), we first calculate

g_{i j} (λ)

. According to the previous step of calculating the foward–backward filter, given

(y_{1 n}, R_{t} = i, R_{t + 1} = j),

we have the density of the posterior distribution of

λ

.

Substitute the Gamma distribution form into last equation, we have the following.

\begin{matrix} \prod_{t = i}^{j} f (y_{t} | λ) G (λ) & = & \frac{β^{α} λ^{α - 1} e^{- λ β}}{Γ (α)} \cdot λ e^{- λ y_{i}} \cdot λ e^{- λ y_{i + 1}} \dots λ e^{- λ y_{j}} \\ = & \frac{β^{α} λ^{α + (j - i)} e^{- m (β + y_{i} + y_{i + 1} + \dots + y_{j})}}{Γ (α)} . \end{matrix}

Therefore, it is easy to observe that

g_{i j} (λ) \sim Gamma (α + (j - i + 1), β + \sum_{i}^{j} y_{t})

.

Then, the next thing is to calculate

Π_{i t j}

. By the foward–backward filter part, we have

Π_{i t j} = \frac{Π_{i t j}^{*}}{\sum_{1 \leq s \leq t \leq k \leq n} Π_{s t k}^{*}}

, and the following is the case:

Π_{i t j}^{*} = \{\begin{matrix} p \cdot p_{i t} & j = t, 1 \leq i \leq t, \\ (1 - p) p_{i t} \cdot q_{t + 1, j} \cdot \frac{f_{i j}}{f_{i t} f_{t + 1, j}} & j > t, 1 \leq i \leq t . \end{matrix}

(A10)

where

f_{i j}

is defined as follows:

f_{i j} = \int \prod_{t = i}^{j} f (y_{t} | λ) \cdot G (λ) d λ

As

f (\cdot)

and

G (\cdot)

are conjugate, and we already calculated that the following is the case.

\prod_{t = i}^{j} f (y_{t} | λ) \cdot G (λ) = \frac{β^{α} λ^{α + (j - i + 1) - 1} e^{- λ (β + \sum_{i}^{j} y_{s})}}{Γ (α)}

Thus, it is easy to observe the following.

f_{i j} = \frac{Γ (α + j - i + 1) \cdot β^{α}}{Γ (α) \cdot {(β + \sum_{t = i}^{j} y_{t})}^{(α + j - i + 1)}}

(A11)

Moreover,

p_{i t} = \frac{p_{i t}^{*}}{\sum_{s = 1}^{t} p_{s t}^{*}}

and

q_{t + 1, j} = \frac{q_{t + 1, j}^{*}}{\sum_{s = t + 1}^{n} q_{t + 1, s}^{*}}

can be deducted recursively.

p_{i t}^{*} = \{\begin{matrix} p \cdot f_{t t} / f_{00} & i = t \\ (1 - p) p_{i, t - 1} \cdot f_{i t} / f_{i, t - 1} & i < t \end{matrix}

(A12)

q_{t + 1, j}^{*} = \{\begin{matrix} p \cdot f_{t + 1, t + 1} / f_{00} & j = t + 1 \\ (1 - p) q_{t + 2, j} \cdot f_{t + 1, j} / f_{t + 2, j} & j > t + 1 \end{matrix}

(A13)

In the forward filter

p_{i t}

, the recursive calculation can be executed in the following manner:

First when $t = 1$ , $p_{11} = 1$ ;
When $t = 2$ , $p_{12}^{*}$ can be deducted by $p_{11}$ , and $p_{22}^{*}$ can be directly calculated by itself. Thus, by normalization, we have $p_{12}$ and $p_{22}$ ;
When $t = 3$ , $p_{13}^{*}$ can be deducted by $p_{12}$ , $p_{23}^{*}$ can be deducted by $p_{22}$ , and $p_{33}^{*}$ can be directly calculated by itself;
$\dots \dots t = n$ , we can obtain the value of $p_{1 n}$ until $p_{n n}$ .

In the backward filter

q_{t + 1, j}

, the recursive calculation of can be executed in the following manner:

When $t = n - 1$ , $q_{t + 1, j} = q_{n n} = 1$ ;
When $t = n - 2$ , $q_{n - 1, n - 1}^{*}$ can be directly calculated by itself, and $q_{n - 1, n}^{*}$ can be deducted by $q_{n n}$ . Thus, by normalization, we have $q_{n - 1, n - 1}$ and $q_{n - 1, n}$ ;
When $t = n - 3$ , $q_{n - 2, n - 2}^{*}$ can be directly calculated by itself, $q_{n - 2, n - 1}^{*}$ can be deducted by $q_{n - 1, n - 1}$ , and $q_{n - 2, n}^{*}$ can be deducted by $q_{n - 1, n}$ .
$\dots \dots t = 1$ , we can obtain the value of $q_{22}$ until $q_{2 n}$ .

Therefore, once we have

f_{i j}

,

p_{i t}

, and

q_{t + 1, j}

, we can obtain the value of

Π_{i t j}

and further the expression of

f (λ_{t} | D)

in (16).

References

Hasbrouck, J. High-frequency quoting: Short-term volatility in bids and offers. J. Financ. Quant. Anal. 2018, 53, 613–641. [Google Scholar] [CrossRef]
Hasbrouck, J.; Saar, G. Low-latency trading. J. Financ. Mark. 2013, 16, 646–679. [Google Scholar] [CrossRef]
Bauwens, L.; Hautsch, N. Modelling financial high frequency data using point processes. Handb. Financ. Time Ser. 2009, 1, 953–979. [Google Scholar]
Hautsch, N. Econometrics of Financial High-Frequency Data; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
Engle, R.F.; Russell, J.R. Autoregressive conditional duration: A new model for irregularly spaced transaction data. Econometrica 1998, 1, 1127–1162. [Google Scholar] [CrossRef]
Hautsch, N. Modelling Irregularly Spaced Financial Data: Theory and Practice of Dynamic Duration Models; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
Žikeš, F.; Baruník, J.; Shenai, N. Modeling and forecasting persistent financial durations. Econom. Rev. 2017, 36, 1081–1110. [Google Scholar] [CrossRef]
Yang, J.; Li, Z.; Chen, X.; Xing, H. Modeling inter-trade durations in the limit order market. New Adv. Stat. Data Sci. 2017, 1, 259–276. [Google Scholar]
Chen, F.; Diebold, F.X.; Schorfheide, F. A Markov-switching multifractal inter-trade duration model, with application to US equities. J. Econom. 2013, 177, 320–342. [Google Scholar] [CrossRef][Green Version]
Abergel, F.; Jedidi, A. Long-time behavior of a Hawkes process–based limit order book. SIAM J. Financ. Math. 2015, 6, 1026–1043. [Google Scholar] [CrossRef]
Swishchuk, A.; Huffman, A. General compound Hawkes processes in limit order books. Risks 2020, 8, 28. [Google Scholar] [CrossRef]
Morariu-Patrichi, M.; Pakkanen, M.S. State-dependent Hawkes processes and their application to limit order book modelling. Quant. Financ. 2021, 1, 1–21. [Google Scholar] [CrossRef]
Li, Z.; Xing, H.; Chen, X. A multifactor regime-switching model for inter-trade durations in the limit order market. arXiv 2019, arXiv:1912.00764. [Google Scholar] [CrossRef]
Cho, D.C.; Frees, E.W. Estimating the volatility of discrete stock prices. J. Financ. 1988, 43, 451–466. [Google Scholar] [CrossRef]
Gerhard, F.; Hautsch, N. Volatility estimation on the basis of price intensities. J. Empir. Financ. 2002, 9, 57–89. [Google Scholar] [CrossRef]
Tse, Y.-K.; Yang, T.T. Estimation of high-frequency volatility: An autoregressive conditional duration approach. J. Bus. Econ. Stat. 2012, 30, 533–545. [Google Scholar] [CrossRef]
Hong, S.Y.; Nolte, I.; Taylor, S.; Zhao, V. Volatility estimation and forecasts based on price durations. J. Financ. Econom. 2021, nbab032. [Google Scholar] [CrossRef]
Russell, J.R. Econometric Modeling of Multivariate Irregularly-Spaced High-Frequency Data. Manuscript, GSB, University of Chicago. 1999. Available online: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.202.486&rep=rep1&type=pdf (accessed on 1 December 2021).
Bauwens, L.; Hautsch, N. Stochastic conditional intensity processes. J. Financ. Econom. 2006, 4, 450–493. [Google Scholar] [CrossRef]
Hall, A.D.; Hautsch, N. Modelling the buy and sell intensity in a limit order book market. J. Financ. Mark. 2007, 10, 249–286. [Google Scholar] [CrossRef]
Bowsher, C.G. Modelling security market events in continuous time: Intensity based, multivariate point process models. J. Econom. 2007, 141, 876–912. [Google Scholar] [CrossRef]
Box, G.E.; Tiao, G.C. Intervention analysis with applications to economic and environmental problems. J. Am. Stat. Assoc. 1975, 70, 70–79. [Google Scholar] [CrossRef]
Lai, T.L.; Liu, H.; Xing, H. Autoregressive models with piecewise constant volatility and regression parameters. Stat. Sin. 2005, 1, 279–301. [Google Scholar]
Lancaster, T. The Econometric Analysis of Transition Data; Cambridge University Press: Cambridge, UK, 1990. [Google Scholar]
Daley, D.J.; Vere-Jones, D. An Introduction to the Theory of Point Processes: Volume I: Elementary Theory and Methods; Springer: New York, NY, USA, 2003. [Google Scholar]
Barndorff-Nielsen, O.E.; Shiryaev, A.N. Change of Time and Change of Measure; World Scientific Publishing Company: Singapore, 2015. [Google Scholar]

Figure 1. The simulated quote-price change durations and price-change intensities (first 1000 points).

Figure 2. Comparison between actual intensity in the simulation and estimated intensity from model estimation.

Figure 3. The evolution of best bid price in the first 50 s of AMZN stock on 2 January 2013.

Figure 4. The quote price duration of the best bid price for the AMZN stock on 2 January 2013.

Figure 5. The instantaneous quote volatility of the best bid price for AMZN stock on 2 January 2013 calculated by the change-point model.

Figure 6. The instantaneous quote volatility of the best bid price for AMZN stock on 2 January 2013 calculated by the ACI model.

Figure 7. The distribution of duration residuals of CPI.

Figure 8. The distribution of duration residuals of ACI.

Figure 9. The fitted 0.01 s price standard deviation of the best bid price for AMZN stock on 2 January 2013.

Figure 10. The fitted 1 s price standard deviation of the best bid price for AMZN stock on 2 January 2013.

Figure 11. The fitted 5 s price standard deviation of the best bid price for AMZN stock on 2 January 2013.

Table 1. Estimation results for the simulated data, which has

α = 5.0

,

β = 2.0

, and

p = 0.018

.

Table 1. Estimation results for the simulated data, which has

α = 5.0

,

β = 2.0

, and

p = 0.018

.

	$\hat{α}$	$\hat{β}$	$\hat{p}$
1000 points	3.47	1.51	0.030
4000 points	3.91	1.67	0.021
7000 points	4.10	1.77	0.019

Table 2. The message file and order book file of LOBSTER data.

Panel A: message file
Time (s)		Event Type	Order ID		Size	Price	Direction
⋮		⋮	⋮		⋮	⋮	⋮
35,101.685250		3	24,832,000		100	2,555,000	−1
35,101.685251		1	24,836,387		100	2,554,700	−1
35,101.685879		1	24,836,403		100	2,550,800	1
⋮		⋮	⋮		⋮	⋮	⋮
Panel B: order book file
Ask Price 1	Ask Size 1	Bid Price 1	Bid Size 1	Ask Price 2	Ask Size 2	Bid Price 2	Bid Size 2
⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮
25,550,00	300	2,550,700	100	2,555,100	100	2,550,500	100
2,554,700	100	2,550,700	100	2,555,000	300	2,550,500	100
2,554,700	100	2,550,800	100	2,555,000	300	2,550,700	100
⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮

Table 3. The integrated variances for different traders that calculated for AMZN stock on 2 January 2013.

Latency	Mean	S.D.	Min.	25%	Median	75%	Max.
0.01 S.	$7.6 \times 10^{- 5}$	$1.1 \times 10^{- 4}$	$3.0 \times 10^{- 6}$	$2.0 \times 10^{- 5}$	$2.6 \times 10^{- 5}$	$8.1 \times 10^{- 5}$	$8.0 \times 10^{- 4}$
1 S.	$7.5 \times 10^{- 3}$	$1.0 \times 10^{- 2}$	$3.0 \times 10^{- 6}$	$2.2 \times 10^{- 3}$	$2.6 \times 10^{- 3}$	$9.7 \times 10^{- 3}$	$5.1 \times 10^{- 2}$
5 S.	$3.5 \times 10^{- 2}$	$3.8 \times 10^{- 2}$	$3.0 \times 10^{- 6}$	$1.1 \times 10^{- 2}$	$1.5 \times 10^{- 2}$	$4.9 \times 10^{- 2}$	$1.3 \times 10^{- 1}$

Table 4. Mincer–Zarnowitz OLS results for CPI and ACI models.

	CPI Model	ACI Model
$β_{0}$	−0.241	−5.568
	(−2.735)	(−10.953)
$β_{1}$	4.236	12.614
	(78.800)	(18.310)
$R^{2}$	0.676	0.101
N	2974	2974

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

High-Frequency Quote Volatility Measurement Using a Change-Point Intensity Model

Abstract

1. Introduction

2. Volatility Measurement Using Price Duration

2.1. Instantaneous Volatility Measurement Use Price Duration

2.2. The Benchmark ACI Model

2.3. The Change-Point Model for Quote Volatility

2.4. Model Comparison

3. Model Estimation and Simulation

3.1. Model Estimation

3.1.1. Expected Likelihood

3.1.2. Maximization and Parameters’ Update

3.1.3. Inference of $λ_{t}$

3.2. Simulation

4. Data Environment

5. Model Fitness and In-Sample Analysis

5.1. Model Fit and Instantaneous Volatility

5.2. Integrated Variance and Cost of Demand

6. Out-of-Sample Performance and Model Prediction Power

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. The Posterior Distribution of $λ_{t}$ in EM

Appendix A.1. Foward–Backward Filter

Appendix A.1.1. Forward Filter

Appendix A.1.2. Backward Filter

Appendix A.1.3. Combination (Forward–Backward Algorithm)

Appendix A.2. The Calculation Steps of Posterior Distributions

References

Article Metrics

Citations

Article Access Statistics

High-Frequency Quote Volatility Measurement Using a Change-Point Intensity Model

Abstract

1. Introduction

2. Volatility Measurement Using Price Duration

2.1. Instantaneous Volatility Measurement Use Price Duration

2.2. The Benchmark ACI Model

2.3. The Change-Point Model for Quote Volatility

2.4. Model Comparison

3. Model Estimation and Simulation

3.1. Model Estimation

3.1.1. Expected Likelihood

3.1.2. Maximization and Parameters’ Update

3.1.3. Inference of λ t

3.2. Simulation

4. Data Environment

5. Model Fitness and In-Sample Analysis

5.1. Model Fit and Instantaneous Volatility

5.2. Integrated Variance and Cost of Demand

6. Out-of-Sample Performance and Model Prediction Power

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. The Posterior Distribution of λ t in EM

Appendix A.1. Foward–Backward Filter

Appendix A.1.1. Forward Filter

Appendix A.1.2. Backward Filter

Appendix A.1.3. Combination (Forward–Backward Algorithm)

Appendix A.2. The Calculation Steps of Posterior Distributions

References

Article Metrics

Citations

Article Access Statistics

3.1.3. Inference of $λ_{t}$

Appendix A. The Posterior Distribution of $λ_{t}$ in EM