Next Article in Journal
Formula I(1) and I(2): Race Tracks for Likelihood Maximization Algorithms of I(1) and I(2) Cointegrated VAR Models
Next Article in Special Issue
Time-Varying Window Length for Correlation Forecasts
Previous Article in Journal
Inequality and Poverty When Effort Matters
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Business Time Sampling Scheme with Applications to Testing Semi-Martingale Hypothesis and Estimating Integrated Volatility

1
Business School, University of International Business and Economics, 10 Huixin Dongjie, Beijing 100029, China
2
School of Economics, Singapore Management University, Singapore 178903, Singapore
*
Author to whom correspondence should be addressed.
Econometrics 2017, 5(4), 51; https://doi.org/10.3390/econometrics5040051
Submission received: 3 August 2017 / Revised: 28 September 2017 / Accepted: 17 October 2017 / Published: 13 November 2017
(This article belongs to the Special Issue Volatility Modeling)

Abstract

:
We propose a new method to implement the Business Time Sampling (BTS) scheme for high-frequency financial data. We compute a time-transformation (TT) function using the intraday integrated volatility estimated by a jump-robust method. The BTS transactions are obtained using the inverse of the TT function. Using our sampled BTS transactions, we test the semi-martingale hypothesis of the stock log-price process and estimate the daily realized volatility. Our method improves the normality approximation of the standardized business-time return distribution. Our Monte Carlo results show that the integrated volatility estimates using our proposed sampling strategy provide smaller root mean-squared error.

1. Introduction

In high-frequency financial data analysis, researchers usually do not use all available data but would select a subgrid of transactions. To choose the subgrid, two issues have to be considered: selecting the sampling scheme and choosing the target average sampling frequency. Three sampling schemes are commonly used in the literature: Calendar Time Sampling (CTS), Tick Time Sampling (TTS) and Business Time Sampling (BTS). Under the CTS scheme, transactions are selected by regularly spaced calendar time, such as every 5 s/min. The TTS scheme selects transactions with regularly spaced number of ticks, e.g., every 5 or 10 ticks. The BTS transactions are often selected to ensure approximately equal volatility for the returns over each interval. Thus, the CTS and TTS schemes are implemented based on explicit criteria (i.e., regular calendar-time length or number of ticks, respectively). In contrast, the BTS scheme depends on the unobserved volatility. As a result, the CTS and TTS schemes have been used widely in the literature, while the BTS scheme is used less frequently.
The BTS scheme possesses some desirable properties for high-frequency financial data analysis. It dates back to Dacorogna et al. (1993), and see also Zhou (1998), Peters and De Vilder (2006), and Mykland (2012). In particular, the BTS scheme yields independently and identically distributed (iid) normal returns for a semimartingale price process even when there is leverage or feedback effect.1 In contrast, due to the leverage effect or varying volatility, the calendar-time returns may not be iid normal even if the price process is a continuous local martingale. The assumption of iid Gaussian returns is required for the Gaussian-likelihood approach Nowman (1997) and several widely used integrated volatility estimates, including the multipower variation (MPV) estimate of Barndorff-Nielsen et al. (2006), the quantile realized volatility (QRV) method of Christensen et al. (2010) and the nearest neighbor truncation method of Andersen et al. (2012, 2014). However, in the implementation, researchers often sample transactions under the CTS or TTS scheme. Andersen et al. (2007, 2010) find that the normalized daily and weekly returns sampled in business time accord well with the standard normal distribution. To sample the time points, they use a sequential method to include intraday returns until the cumulative squared returns exceeds the average daily or weekly realized volatility.
One drawback of the method in Andersen et al. (2007, 2010) is that its performance in obtaining returns with approximately equal volatility over each interval deteriorates as the sampling frequency increases. In addition, researchers need to choose a threshold to obtain sampled returns at the target frequency. In this paper, we propose a new method to implement the BTS scheme, which has better performance as the sampling frequency increases and needs no threshold. We estimate the intraday integrated volatility using the jump-robust Tripower Realized Volatility (TRV) method of Barndorff-Nielsen et al. (2006) and calculate a time-transformation (TT) function over the investigated time period. The value of the TT function corresponds to the cumulative increments of the estimated intraday integrated volatility over time and the sampled BTS transactions are obtained using the inverse of the TT function.
We test the semi-martingale hypothesis on the BTS returns of 40 stocks selected from the New York Stock Exchange (NYSE). Our proposed BTS method improves the normality approximation of the empirical standardized return distribution. We further explore the use of the BTS scheme in estimating integrated volatility. First, we consider the Realized Volatility (RV) estimates for daily integrated volatility when the returns are sampled using the BTS, CTS and TTS schemes. Our Monte Carlo simulation shows that the TRV estimates (with and without subsampling) using the BTS returns provide the smallest root mean-squared error (RMSE). Second, we modify the ACD-ICV method of Tse and Yang (2012), making use of the BTS scheme. Our modified ACD-ICV estimator performs better than the Realized Kernel (RK) estimates Barndorff-Nielsen et al. (2008) and the method of Tse and Yang (2012), which uses price-event sampling.
The rest of this paper is as follows. Section 2 outlines our proposed implementation of the BTS scheme. Section 3 reports some empirical results on testing the semi-martingale hypothesis. In Section 4, we outline the estimation methods of daily volatility examined in this paper. We report the results of our Monte Carlo study in Section 5 and draw conclusions in Section 6. The Appendix A provides further details of the jump detection procedure and computation of the BT time-transformation function. Some additional results can be found in the accompanying online supplementary material.

2. Intraday Periodicity and the BTS Scheme

The stylized fact of intraday periodicity is a well known phenomenon in high-frequency financial data analysis. Trading activities are usually high at the beginning and close of the trading day and low around lunch time. Modifications are needed to get rid of this pattern before data are fitted into high-frequency models. To adjust for the intraday periodicity of transaction activity, Tse and Dong (2014) use a time-transformation function computed by pooling all transactions over all trading days in the sample. The TT function transforms all observed transactions from the calendar time to a transformed time for which transactions are evenly observed. However, the time-transformation function proposed by Tse and Dong (2014) will not be appropriate for BT sampling if volatility and transaction activity exhibit different intraday periodicity patterns.
Figure 1 presents the intraday periodicity in volatility and trading activity of the stock JP Morgan (JPM) from January 2010 to April 2013. Figure 1A plots the means of the 1 min intraday realized volatilities over all trading days in the sample period, expressed in annualized standard deviation in percent, while Figure 1B plots the total number of transactions at each second from 9:30 a.m. to 4:00 p.m. over all trading days in the sample. We observe that the realized volatility at the beginning of the trading day is approximately two to three times larger than that near the end of the trading day. In contrast, the number of transactions at the end of the trading day is two to three times larger than that in the morning. This finding is quite regular across other stocks in our sample, which shows that intraday periodicity patterns in volatility and trading activity are quite different.2 Thus, TTS returns in the morning will have larger volatility than those in the afternoon. In contrast, by construction, BTS returns will have nearly constant volatility over the whole trading day.
To implement the BTS scheme, Peters and De Vilder (2006) select transactions based on increments of estimated quadratic volatility. To alleviate the microstructure noise effect, transactions are selected sparsely, such as at 2 min frequency, to calculate the volatility. One drawback of this method is that it assumes away the price jumps, although this assumption is often rejected in the literature (see, e.g., Barndorff-Nielsen and Shephard (2004, 2006), Huang and Tauchen (2005), Lee and Mykland (2008) and Boudt et al. (2011)).3 Another drawback of this method (also in Andersen et al. 2010) is that researchers need to select a threshold value to obtain transactions at the target sampling frequency. The procedure of choosing the threshold has to be iterated, especially when the target frequency is high.
In this paper, we adopt the time-transformation approach to implement the BTS scheme. Let T denote the calendar-time length in seconds aggregated over all trading days in the sample. The time-transformation function Q ( t ) at calendar time t (in sec), for 0 t T , is computed as the empirical proportion of the intraday integrated volatility up to time t.4 The diurnally transformed time corresponding to calendar time t is denoted by t ˜ , with t ˜ = T × Q ( t ) . Thus, returns over equal intervals of diurnally transformed time will have approximately equal integrated volatility. To obtain BTS transactions at a given frequency, we can select equally spaced transactions at the diurnally transformed time and choose the corresponding BTS transactions using the inverse function Q 1 ( t ) . In contrast to the methods in the literature, we do not need to choose a threshold value when implementing the BTS scheme.
We outline our method to obtain the BTS transactions as follows. Consider a sequence of estimated intraday integrated volatility V k for K consecutive time intervals over the period ( 0 , T ] , with the end point in each time interval represented by t k , for k = 1 , , K .5 Denote the collection of points t k by H V ( t 0 = 0 ) and define N t 0 = 0 and N t k = i = 1 k V i for k = 1 , , K . The time-transformation function is calculated as Q ( t k ) = N t k / N t K , for t k H V . Q ( t ) at any calendar-time point t can then be computed using a cubic interpolation that preserves monotonicity in t.6 To sample a sequence of calendar-time points with BTS duration h, we take equally spaced diurnally transformed BT points t ˜ j , j = 0 , , L , with t ˜ j t ˜ j 1 = h and L = [ T / h ] . Then, t j = Q 1 ( t ˜ j / T ) are the required calendar-time points for the BTS scheme.
Figure 2A,B present the time-transformation functions for stock JPM based on intraday volatility and trading activity, respectively. These functions are computed by merging the data over the complete sample period, resulting in representative one-day time-transformation functions.7 Note that the two transformation functions exhibit different intraday patterns, with the compression of diurnally transformed time during market open more prominent for volatility than for trading activity.

3. Testing the Semi-Martingale Hypothesis Using BTS Returns

As discussed above, BTS returns are iid normal for a semi-martingale price process even when there is leverage and/or feedback effect, whereas the CTS returns may not be iid normal even if the price process is a continuous local martingale. We now examine empirically the behavior of the BTS returns following the study of Andersen et al. (2010).

3.1. The Semi-Martingale Hypothesis

Let r k = Y t k Y t k 1 be the jump-adjusted returns over the time interval ( t k 1 , t k ) .8 If the log-price follows a jump-diffusion process9 with no leverage and volatility feedback, r k standardized by the integrated volatility will follow a standard normal distribution, i.e.,
r k t k 1 t k σ 2 ( τ ) d τ N ( 0 , 1 ) , k = 1 , 2 , ,
where σ ( · ) is the instantaneous volatility function. The above result, however, will not hold if σ ( · ) exhibits correlation over time (feedback effect) or with the log-price innovation (leverage effect). On the other hand, if the jump-adjusted returns are sampled over business time so that, over each business-time interval ( t ˜ k 1 , t ˜ k ) , we have
t ˜ k = inf s > t ˜ k 1 t ˜ k 1 s σ 2 ( τ ) d τ > σ ¯ 2
for a given volatility threshold σ ¯ 2 ,10 Then, the jump-adjusted returns r ˜ k over the business-time intervals ( r ˜ k 1 , r ˜ k ) satisfy
r ˜ k σ ¯ N ( 0 , 1 ) , k = 1 , 2 , .
To sample a sequence of BTS returns, Andersen et al. (2010) include intraday returns until the cumulative squared 5-min returns exceed a threshold, defined as the average daily or weekly realized volatility in calendar time (ABFN method hereafter). They report improved accuracy of the normal approximation under this sampling scheme.

3.2. Empirical Results of the Tests

To examine empirically the performance of our proposed BTS method versus the ABFN method, we use data of the top 40 market-capitalization stocks from the NYSE in 2010. We extract the tick-by-tick transaction data of these stocks from the TAQ database from January 2010 to April 2013. To clean the raw data, we follow the steps described in Tse and Dong (2014). Using the sequential jump-detection procedure of Andersen et al. (2010), we investigate the proportion of detected jumps (number of the detected jumps over the total number of sampled returns) when different sampling intervals and sampling schemes are used.11
We test the normality assumption of the drift-corrected and jump-adjusted BTS returns (returns with jumps removed). For each trading day, all BTS returns with jumps are deleted and the jump-adjusted 30-min, daily or weekly BTS returns are then computed by summing the remaining consecutive jump-adjusted BTS returns. We apply the Lilliefors (1967) test for normality to the jump-adjusted CTS returns, ABFN returns, and BTS returns. The results are reported in Table 1. It can be seen that the BTS method and the ABFN method substantially improve the normality approximation of the standardized return distribution over the Calendar-Time method. While the results for the weekly data are similar for the BTS method and the ABFN method, the BTS sampling scheme at higher frequencies restores normality for several stocks.
We further compare the performance of various methods by computing the moments of 30 min returns with jump-adjustment. In addition to using the Lilliefors (1967) test for normality, we also examine the skewness and kurtosis of the sampled returns of the 40 stocks and compute the average of the absolute difference of the calculated skewness and kurtosis versus 0 and 3, respectively. The results are reported in Table 2. We observe that, for high-frequency returns, the BTS method performs the best in restoring normality, while the TTS method performs the worst. This observation confirms our finding in Section 2 that we cannot use the TTS scheme to approximate the BTS scheme to select intraday returns. For illustration, Figure 3 presents the QQ (Quantile-Quantile) plots of the 30 min jump-adjusted ABFN returns and BTS returns for the JPM stock data. Our proposed BTS method performs better than the ABFN method in restoring normality for returns sampled at high frequency.12

4. Estimation of Integrated Volatility

We now examine the use of the BTS scheme for estimating integrated volatility. We consider two methods of estimating daily volatility: Realized Volatility (RV) method and Autoregressive Conditional Duration-Integrated Conditional Volatility (ACD-ICV) method. The literature on RV estimation has grown tremendously since its inception. In this paper, we select the Tripower Realized Volatility (TRV) estimate of Barndorff-Nielsen et al. (2006) for its robustness to price jumps.13 We compare the performance of the TRV estimates when returns are sampled by BTS, CTS and TTS schemes, with and without subsampling. With the same estimator used, the performance of the estimates is only differentiated by the sampling method. We also consider the use of the ACD-ICV approach, with some modifications based on the BTS methodology.

4.1. Integrated Volatility Estimation Using BT Returns

For a given subgrid H , the TRV estimate is computed as
V T = ξ 2 3 3 i = 2 | H | 1 | r i , | 2 3 | r i | 2 3 | r i , + | 2 3 ,
where r i = ( Y i , + Y i ) for Y i , + , Y i H and ξ k = 2 k 2 Γ ( ( k + 1 ) / 2 ) / Γ ( 1 / 2 ) for k > 0 , with Γ ( · ) denoting the gamma function and Y i , + denoting the elements following Y i in H . We define | H | as number of points in grid H minus 1. For the CTS scheme, the previous-tick method is adopted when there is no transaction at the selected time point. For the TTS scheme, the number of subsampling grids S is selected to ensure that each subgrid has transactions at the target sampling frequency. To implement the subsampling method under the BTS scheme, we select BTS transactions with the sampling frequency being twice the average transaction duration. Subsampling is then implemented to obtain subgrids at the target sampling frequency.

4.2. Integrated Volatility Estimation Using the Modified ACD-ICV Method

Tse and Yang (2012) propose the ACD-ICV method to estimate daily and intraday volatility by modeling the price durations parametrically. They point out that, for short intraday intervals, such as an hour or 15 min, the RV methods use only local data for the period of interest. Thus, the infill sample size may not be large enough to justify the applicability of the asymptotics of the RV estimates. In contrast, the ACD-ICV method estimates the conditional volatility using data beyond the period of interest and produces better estimates of volatility over short intraday intervals. Their simulation results show that the ACD-ICV method performs better than other methods (such as the Realized Kernel method of Barndorff-Nielsen et al. (2008) in estimating the daily, 1 h and 15 min integrated volatility.
The ACD-ICV method samples observations from the observed transaction data based on a pre-specified price threshold δ . That is, transactions are selected whenever the absolute price change exceeds the threshold δ , which are the price events. Suppose H P E = { t 0 , t 1 , t 2 , , t N } is the selected price events and the ith price duration is x i = t i t i 1 , i = 1 , , N . Let Φ i denote the information set upon the price event at time t i . Denote ψ i + 1 = E ( x i + 1 | Φ i ) as the conditional expectation of the price duration and assume that the standardized durations ϵ i = x i / ψ i , i = 1 , , N are iid positive random variables with a mean of unity. Given the information Φ i at time t i , the conditional instantaneous return variance per unit time at time t > t i , denoted by σ 2 ( t | Φ i ) , is
σ 2 ( t | Φ i ) = δ 2 ψ i + 1 λ t t i ψ i + 1 ,
where λ ( · ) is the hazard function of ϵ i . Assuming ϵ i to be iid standard exponential distributed, the integrated conditional variance (ICV) over time period [ t n 1 , t n 2 + 1 ] is calculated as
ICV = δ 2 i = n 1 n 2 t i + 1 t i ψ i + 1 .
The conditional expectation of the price durations ψ i can be estimated by various methods, such as the ACD method of Engle and Russell (1998) or the Augmented ACD method of Fernandes and Grammig (2006).
The original ACD-ICV method models the price durations obtained by the threshold δ by assuming that these durations have equal volatility, and this assumption may not be true. In this paper, we modify the ACD-ICV method in two ways. First, instead of modeling the durations of the price events, we model the BTS durations that are obtained based on volatility change (instead of the absolute price change). The BTS returns are sampled as in Section 2. Second, we replace δ 2 in Equation (6) by an estimate of the mean volatility over each sampled BTS return. Suppose there are K BTS returns over m trading days, and the estimated integrated volatility over these m trading days is equal to V m . Then, each BTS return has an approximately constant integrated volatility of V D = V m / K . Instead of using δ 2 as an approximation of the integrated volatility of each price event as in Tse and Yang (2012), we use V D to replace δ 2 in Equation (6) to obtain a new ACD-ICV estimate.
Thus, for the ACD-ICV approach, we consider three variations of estimates. We first select transactions using the price-event sampling method (where transactions are selected based on absolute price change) and estimate the daily integrated volatility using the ACD-ICV method as in Tse and Yang (2012). We denote this method by ME1. We then replace δ 2 in equation (6) by V D , which is the integrated volatility estimated using TRV with subsampling at 3-min sampling frequency. We call this method ME2.14 Finally, we sample data using the BTS scheme (not by price events) and repeat the computation as in ME2, which is called ME3. We compare the daily volatility estimates using the ACD-ICV methods against the RK method.15

5. Monte Carlo Study

We conduct a Monte Carlo (MC) study to examine the performances of different integrated volatility estimates. Our MC set-up draws upon other models in the literature.

5.1. Simulation Models

We consider five simulation models, which are summarized in Table 3. Models MD1 and MD2 are the Heston models (Heston (1993) and Aït-Sahalia and Mancini (2008) with some modifications) with high and low volatility, respectively. MD3 is the two-factor afine stochastic volatility model with an intraday U-shape pattern Hasbrouck (1999) and Andersen et al. (2012). MD4 is a deterministic volatility set-up Tse and Yang (2012). Finally, MD5 is MD1 with price jumps.
For all set-ups above, we set the initial price to 60 and the initial value of σ to 30%. We introduce sparsity of trade to the data by simulating exponentially distributed calendar-time transaction durations.16 We first simulate transactions second by second and then generate exponentially distributed transaction durations with mean equal to 5 s, 10 s and 20 s, respectively. For simplicity, we only investigate iid market microstructure noise with constant noise-signal ratio (NSR). Based on the findings in Dong and Tse (2017), we consider cases with NSR = 0.005%, 0.01% and 0.02%. We introduce a 0.01 price rounding error in all simulations. The intraday duration periodicity is adjusted by the time-transformation method in Tse and Dong (2014) before we fit all price durations and BTS durations to the ACD model. Each model is simulated over 60 trading days, with the simulation repeated 1000 times.

5.2. Simulation Results

We first report our results on the TRV estimates. For each model set-up, sampling frequencies of 1, 2, 3, 5 and 10 min are considered. We compute TRV based on the CTS, TTS and BTS returns, with and without subsampling. To save space, we present only the results for the BTS scheme with subsampling in Table 4. Table 5 summarizes the average difference in RMSE of the CTS and TTS schemes versus the BTS scheme over all models and parameter set-ups, with subsampling, in both absolute and relative terms. All results can be found in the supplementary material (Tables S4–S9). The BTS returns perform the best in reporting generally smaller root mean-squared error (RMSE), especially for methods with no subsampling. When the subsampling method is used, the RMSE decreases substantially. Generally, the BTS scheme still performs the best and its advantage is especially obvious for MD3 and MD4 when there is intraday volatility periodicity in the simulated price process.17 Under all sampling schemes, the TRV estimates suffer from the market microstructure noise problems when the NSR is large, resulting in high mean error (ME) at high sampling frequency. When sparsity and NSR are both low, high sampling frequency at 1 min interval produces the lowest RMSE. Overall, the BTS returns outperform the CTS and TTS returns in estimating the integrated volatility.
We now turn to the results of the ACD-ICV method. Table 6 and Table 7 report the ME and RMSE of the RK and the ACD-ICV estimates for MD1 and MD5, respectively. Results for other models can be found in the online supplementary material (Tables S10–S12). Table 8 summarizes the average RMSE of Realized Kernel estimate versus the ACD-ICV estimates over all models and parameter set-ups. The RK method performs quite well for the unbiasedness property, reporting small ME for all models except MD5 (model with price jump). ME1 reports quite big absolute ME and RMSE values among all the sampling frequencies considered and it often performs worse than the RK method except for MD5.18 In contrast, the modified ACD-ICV methods, ME2 and ME3, report quite small ME and RMSE. ME3 consistently reports smaller RMSE than ME2 except for few cases in MD3 and MD5 at the 3 min and 5 min sampling frequencies. Moreover, ME2 varies more across different sampling frequencies. This demonstrates the superiority of the BTS durations over the price durations when they are fitted to the ACD model to estimate the integrated volatility. The better performance of ME3 over ME2 is mainly due to the fact that the BTS scheme performs better in yielding returns with constant volatility compared against the ACD-ICV method using the price events.
Our modified ACD-ICV method ME3 consistently produces lower RMSE over all models and parameter set-ups than the RK estimates. Its performance is robust over a wide range of sampling frequencies of up to 15 min. It also outperforms the TRV estimates with subsampling, except possibly for MD5, in which case their performances are comparable.19

6. Conclusions

We propose an easy-to-use time-transformation method to implement the BTS scheme at a prespecified average sampling frequency. Using 40 stocks from the NYSE, we perform normality test to the jump-adjusted daily and weekly BTS returns. Our results show that stock prices can be considered discrete observations from a continuous-time jump-diffusion process. The BTS scheme performs better than other sampling methods in yielding iid Gaussian returns, and it also performs better than the CTS and TTS schemes in estimating daily realized volatility using the TRV method. We also show the superiority of the BTS durations over the price durations in estimating the daily integrated volatility using the ACD-ICV method. Our modified ACD-ICV estimate, ME3, which models the high-frequency BTS durations using the ACD model, performs the best in reporting smaller RMSE values.
Finally, we note that there are other possible applications of the BTS scheme. For example, we can estimate the conditional instantaneous volatility or intraday integrated volatility (such as over 30 min intervals) by modeling the high-frequency BTS durations using the ACD model. Moreover, we can estimate the integrated quarticity ( I Q = 0 1 σ 4 ( τ ) d τ ) by using a method similar to that implemented in this paper. That is, we first obtain transactions with approximately equal quarticity increments. The conditional instantaneous quarticity or integrated quarticity can be estimated further by modeling clustering of the corresponding durations using the ACD model. Please refer to Jacod and Rosenbaum (2013) and Andersen et al. (2014), among others, for theories and empirical applications of the asset price quarticity.

Supplementary Materials

The following are available online at www.mdpi.com/2225-1146/5/4/51/s1, Figure S1: 1-min Realized Volatility and number of transactions, 2010–2013. Table S1: Proportion of detected jumps. Table S2: Rejection proportion of the normality hypothesis for no-jump returns under different sampling schemes. Table S3: Rejection proportion of the normality hypothesis for returns under different sampling schemes. Table S4: ME and RMSE of daily volatility estimates using the TRV method without subsampling under the CTS scheme. Table S5: ME and RMSE of daily volatility estimates using the TRV method without subsampling under the TTS scheme. Table S6: ME and RMSE of daily volatility estimates using the TRV method without subsampling under the BTS scheme. Table S7: ME and RMSE of daily volatility estimates using the TRV method with subsampling under the CTS scheme. Table S8: ME and RMSE of daily volatility estimates using the TRV method with subsampling under the TTS scheme. Table S9: ME and RMSE of daily volatility using the TRV method with subsampling under the BTS scheme. Table S10: ME and RMSE of daily volatility estimates of the RK and ACD-ICV methods for Model MD2. Table S11: ME and RMSE of daily volatility estimates of the RK and ACD-ICV methods for Model MD3. Table S12: ME and RMSE of daily volatility estimates of the RK and ACD-ICV methods for Model MD4.

Acknowledgments

Yingjie Dong acknowledges support provided by the Fundamental Research Funds for the Central Universities in University of International Business and Economics (UIBE) (CXTD8-01).

Author Contributions

Yingjie Dong and Yiu-Kuen Tse contributed equally to this paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Jump Detection Procedure

We remove price jumps using the sequential jump-adjustment procedure of Andersen et al. (2010) with some minor modifications. The test statistic Z t for day t is computed as
Z t = | H t | ln V R t ln V B t ( ( μ 1 4 + 2 μ 1 2 5 ) Q T t B V t 2 ) 1 2 ,
where μ 1 = 2 / π , V R t is the realized volatility, V B t is the realized bipower volatility and Q T t is the realized tripower quarticity, and H t is the grid on day t. The realized tripower quarticity Q T t is computed as
Q T t = 1 | H t | μ 4 3 3 j = 2 | H t | 1 | r i , | 4 3 | r i | 4 3 | r i , + | 4 3 ,
where r i = Y i , + Y i for Y i , + , Y i H t and μ 4 3 = 2 2 3 Γ ( 7 / 6 ) / Γ ( 1 / 2 ) . Under the condition of no jumps, Z t is approximately standard normal. At the significance level α , a jump is considered to be significant if Z t > z 1 α , where z α is the α -quantile of the standard normal distribution. When Z t > z 1 α , we delete the return that has the maximum absolute value among all returns and repeat the jump test again for the remaining returns. This recursive procedure stops when Z t z 1 α .

Appendix A.2. Computation of the BT Time-Transformation Function

To compute the BT transformation function Q ( t k ) = N t k / N t K , where N t k = i = 1 k V i for k = 1 , , K , we need to calculate the intraday integrated volatility V i . To alleviate the effects of market microstructure noise and price jumps we adopt the Tripower Realized Volatility (TRV) method of Barndorff-Nielsen et al. (2006). Suppose there are S subsampling grids and the sth subgrid is denoted by H s . Under subgrid H s , for trading day d, denote the returns by { r d , 1 ( s ) , r d , 2 ( s ) , r d , 3 ( s ) , , r d , n s ( s ) } . We define
V T d , i ( s ) = ξ 2 3 3 | r d , i 1 ( s ) | 2 3 | r d , i ( s ) | 2 3 | r d , i + 1 ( s ) | 2 3 ,
for i = 2 , , n s 1 , where ξ k = 2 k 2 Γ ( ( k + 1 ) / 2 ) / Γ ( 1 / 2 ) for k > 0 and Γ ( · ) is the gamma function. Moreover, V T d , 1 ( s ) = V T d , 2 ( s ) and V T d , n s ( s ) = V T d , n s 1 ( s ) . Since the estimate V T d , i ( s ) may have large fluctuations, we group every g V T d , i ( s ) together to obtain the estimated intraday integrated volatility at 10-min average sampling frequency. Thus, the pth intraday integrated volatility on day d under subgrid H s is V d , p ( s ) = i = p ( g 1 ) + 1 p g V T d , i ( s ) . The corresponding end point of V d , p ( s ) is the end point of r d , p g ( s ) and is denoted by t d , p ( s ) . Finally, the estimated pth intraday integrated volatility V d , p and its corresponding end point t d , p can be obtained by taking the average over all subsamples. That is, V d , p = 1 S s = 1 S V d , p ( s ) and t d , p = 1 S s = 1 S t d , p ( s ) . V d , p and t d , p pooled over all m trading days form V K for k = 1 , , K and H V , respectively, for the computation of the time-transformation function Q ( t ) . The number of subsamples S is selected to obtain subgrids of approximately 1 min sampling frequency.

References

  1. Aït-Sahalia, Yacine, and Loriano Mancini. 2008. Out of sample forecasts of quadratic variation. Journal of Econometrics 147: 17–33. [Google Scholar] [CrossRef] [Green Version]
  2. Andersen, Torben G., Tim Bollerslev, and Dobrislav Dobrev. 2007. No-arbitrage semi-martingale restrictions for continuous-time volatility models subject to leverage effects, jumps and iid noise: Theory and testable distributional implications. Journal of Econometrics 138: 125–80. [Google Scholar] [CrossRef]
  3. Andersen, Torben G., Tim Bollerslev, Per Frederiksen, and Ørregaard Nielsen. 2010. Continuous-time models, realized volatilities, and testable distributional implications for daily stock returns. Journal of Applied Econometrics 25: 233–61. [Google Scholar] [CrossRef]
  4. Andersen, Torben G., Dobrislav Dobrev, and Ernst Schaumburg. 2012. Jump-robust volatility estimation using nearest neighbor truncation. Journal of Econometrics 169: 75–93. [Google Scholar] [CrossRef]
  5. Andersen, Torben G., Dobrislav Dobrev, and Ernst Schaumburg. 2014. A robust neighborhood truncation approach to estimation of integrated quarticity. Econometric Theory 30: 3–59. [Google Scholar] [CrossRef]
  6. Bajgrowicz, Pierre, Olivier Scaillet, and Adrien Treccani. 2016. Jumps in high-frequency data: Spurious detections, dynamics, and news. Management Science 62: 2198–217. [Google Scholar] [CrossRef]
  7. Barndorff-Nielsen, Ole E., and Neil Shephard. 2004. Power and bipower variation with stochastic volatility and jumps. Journal of Financial Econometrics 2: 1–37. [Google Scholar] [CrossRef]
  8. Barndorff-Nielsen, Ole E., and Neil Shephard. 2006. Econometrics of testing for jumps in financial economics using bipower variation. Journal of financial Econometrics 4: 1–30. [Google Scholar] [CrossRef]
  9. Barndorff-Nielsen, Ole E., Neil Shephard, and Matthias Winkel. 2006. Limit theorems for multipower variation in the presence of jumps. Stochastic Processes and Their Applications 116: 796–806. [Google Scholar] [CrossRef]
  10. Barndorff-Nielsen, Ole E., Peter Reinhard Hansen, Asger Lunde, and Neil Shephard. 2008. Designing realized kernels to measure the ex post variation of equity prices in the presence of noise. Econometrica 76: 1481–536. [Google Scholar]
  11. Boudt, Kris, Christophe Croux, and Sébastien Laurent. 2011. Robust estimation of intraweek periodicity in volatility and jump detection. Journal of Empirical Finance 18: 353–67. [Google Scholar] [CrossRef]
  12. Christensen, Kim, Roel Oomen, and Mark Podolskij. 2010. Realised quantile-based estimation of the integrated variance. Journal of Econometrics 159: 74–98. [Google Scholar] [CrossRef]
  13. Dacorogna, Michael M., Ulrich A. Müller, Robert J. Nagler, Richard B. Olsen, and Olivier V. Pictet. 1993. A geographical model for the daily and weekly seasonal volatility in the foreign exchange market. Journal of International Money and Finance 12: 413–38. [Google Scholar] [CrossRef]
  14. Dambis, Karl E. 1965. On the decomposition of continuous submartingales. Theory of Probability & Its Applications 10: 401–10. [Google Scholar]
  15. Dong, Yingjie, and Yiu-Kuen Tse. 2017. On estimating market microstructure noise variance. Economics Letters 150C: 59–62. [Google Scholar] [CrossRef]
  16. Dubins, Lester E., and Gideon Schwarz. 1965. On continuous martingales. Proceedings of the National Academy of Sciences of the United States of America 53: 913. [Google Scholar] [CrossRef] [PubMed]
  17. Engle, Robert F., and Jeffrey R. Russell. 1998. Autoregressive conditional duration: A new model for irregularly spaced transaction data. Econometrica 66: 1127–62. [Google Scholar] [CrossRef]
  18. Fernandes, Marcelo, and Joachim Grammig. 2006. A family of autoregressive conditional duration models. Journal of Econometrics 130: 1–23. [Google Scholar] [CrossRef]
  19. Hasbrouck, Joel. 1999. The dynamics of discrete bid and ask quotes. The Journal of Finance 54: 2109–42. [Google Scholar] [CrossRef]
  20. Heston, Steven L. 1993. A closed-form solution for options with stochastic volatility with applications to bond and currency options. Review of Financial Studies 6: 327–43. [Google Scholar] [CrossRef]
  21. Huang, Xin, and George Tauchen. 2005. The relative contribution of jumps to total price variance. Journal of Financial Econometrics 3: 456–99. [Google Scholar] [CrossRef]
  22. Jacod, Jean, and Mathieu Rosenbaum. 2013. Quarticity and other functionals of volatility: Efficient estimation. The Annals of Statistics 41: 1462–84. [Google Scholar] [CrossRef]
  23. Lee, Suzanne S., and Jan Hannig. 2010. Detecting jumps from Lévy jump diffusion processes. Journal of Financial Economics 96: 271–90. [Google Scholar] [CrossRef]
  24. Lee, Suzanne S., and Per A. Mykland. 2008. Jumps in financial markets: A new nonparametric test and jump dynamics. The Review of Financial Studies 21: 2535–63. [Google Scholar] [CrossRef]
  25. Lilliefors, Hubert W. 1967. On the Kolmogorov-Smirnov test for normality with mean and variance unknown. Journal of the American Statistical Association 62: 399–402. [Google Scholar] [CrossRef]
  26. Mykland, Per A. 2012. A Gaussian calculus for inference from high frequency data. Annals of Finance 8: 235–58. [Google Scholar] [CrossRef]
  27. Nowman, Khalid B. 1997. Gaussian Estimation of Single-Factor Continuous Time Models of The Term Structure of Interest Rates. The Journal of Finance 52: 1695–706. [Google Scholar] [CrossRef]
  28. Oomen, Roel C. A. 2006. Properties of realized variance under alternative sampling schemes. Journal of Business & Economic Statistics 24: 219–37. [Google Scholar]
  29. Peters, Remco T., and Robin G. De Vilder. 2006. Testing the continuous semimartingale hypothesis for the S&P 500. Journal of Business & Economic Statistics 24: 444–54. [Google Scholar]
  30. Tse, Yiu-Kuen, and Thomas Tao Yang. 2012. Estimation of high-frequency volatility: An autoregressive conditional duration approach. Journal of Business & Economic Statistics 30: 533–45. [Google Scholar]
  31. Tse, Yiu-Kuen, and Yingjie Dong. 2014. Intraday Periodicity Adjustments of Transaction Duration and Their Effects on High-Frequency Volatility Estimation. Journal of Empirical Finance 28: 352–61. [Google Scholar] [CrossRef]
  32. Zhou, Bin. 1998. F-consistency, De-volatilization and normalization of high-frequency financial data. In Nonlinear Modelling of High Frequency Financial Time Series. Edited by Christian Dunis and Bin Zhou. New York: Wiley, pp. 109–23. [Google Scholar]
1.
Dambis (1965) and Dubins and Schwarz (1965) show that a process compiled from a continuous local martingale with equal quadratic-variation increments is a Brownian motion. Leverage effect refers to the asymmetry between equity returns and volatility. That is, large negative returns tend to be associated with higher future volatility than positive returns of the same magnitude. Feedback effect refers to the case when the volatility function is correlated over time.
2.
Results for other stocks can be found in the online supplementary material (Figure S1).
3.
One drawback of the jump detection methods is the presence of the spurious detections due to multiple testing issues. See Bajgrowicz et al. (2016) for a discussion.
4.
As there are 6.5 h of trades in a trading day for the NYSE, for m trading days we have T = 23400 m s. Q ( t ) at calendar time t (in s) is an increasing function of t, with t = 0 , 1 , · · · , T , Q ( 0 ) = 0 and Q ( T ) = 1 .
5.
Here, t k are calendar-time points which need not to be regularly spaced. We outline the detailed steps in calculating V k and t k , for k = 1 , , K , in the Appendix A. V k can be any suitable estimates of intraday integrated volatility. In this paper we use the TRV method of Barndorff-Nielsen et al. (2006) to calculate V k for its robustness to jumps.
6.
We use the Matlab (2015a, Mathworks, Natick, MA, United States) command pchip in this paper. Given Q ( t ) and the calendar-time point t, the diurnally transformed time t ˜ is t ˜ = T × Q ( t ) . Conversely, given Q ( t ) and a diurnally transformed time t ˜ , the corresponding calendar time is t = Q 1 ( t ˜ / T ) .
7.
In the empirical applications in this paper, the time-transformation function for BTS is extended over the whole sample period, which takes account of varying volatility over different trading days.
8.
The jump-adjustment procedure can be found in the Appendix A.
9.
The Brownian semimartingale process can be defined as d Y t = μ t d t + σ t d W t , where μ t d t is the drift term, the instantaneous volatility process σ t is càdlàg, and W t denotes a standard Brownian motion independent of the drift. In this paper, we further add jumps to the Brownian semimartingale and assume the price process to be a generic jump-diffusion process. That is, d Y t = μ t d t + σ t d W t + κ t d q t , where d q t = 1 when there is a jump at time t, and d q t = 0 otherwise, and κ t denotes the jump size if a jump occurs at time t. We assume the jump component to be a finite activity jump process. Note that when there are infinite number of jumps in the data, our BTS method will work if we select BTS transactions based on estimated integrated volatility that are robust to the presence of Lévy-type jumps. See Lee and Hannig (2010) for the evidence of the presence of the Lévy-type jumps and see Barndorff-Nielsen et al. (2006) for an analysis of the multipower variation estimates when there are infinite number of jumps.
10.
Note that, given t ˜ k 1 and σ ¯ 2 , t ˜ k is the minimum business time so that the integrated volatility over the interval ( t ˜ k 1 , t ˜ k ) reaches σ ¯ 2 .
11.
As our focus here is the testing of the semi-martingale hypothesis, the results of the jump detection are not presented. Details of the selected stocks and results of the jump tests can be found in the supplementary material (Tables S1–S3) for which sampling frequencies of 1 min, 5 min and 10 min are used. When the sampling frequency is equal to 1 min, more than 12 stocks report jump proportions with values exceeding 10% under all sampling schemes. This suggests that sampling frequency that is too high (such as 1 min) may render misleading results when they are used for jump detection using the method of Andersen et al. (2010). See Oomen (2006) for an analysis of the performance of the realized variance estimator among alternative sampling schemes.
12.
QQ plots of the jump-adjusted ABFN returns and BTS returns at daily and weekly frequencies are very similar. We also calculate the ACF values of the sampled 30 min returns up to lag 150. All returns sampled using various methods exhibit no periodicity. The correlation between the ABFN returns and BTS returns increases as the sampling frequency decreases, and the value is around 13.5% at weekly frequency.
13.
We do not use the bipower realized volatility method here since the TRV method is more robust to the presence of jumps, especially when the sampling frequency is high.
14.
Note that, for ME1 and ME2, the returns are sampled by price events and the ACD models are fitted to diurnally transformed durations using the time-transformation function based on the number of trades.
15.
The RK method is selected for comparison due to its superior performance among the RV estimators (see Barndorff-Nielsen et al. (2008)). To calculate the bandwidth of the RK method, we use the subsampling realized volatility estimator and 3 min TTS returns. For the ACD-ICV methods, all results in this paper are based on conditional duration models fitted using the power ACD (PACD) model (see Fernandes and Grammig 2006).
16.
Sparsity occurs as empirically transactions are not observed sec by sec. Inactive stocks typically have more sparse transactions.
17.
When there is intraday volatility periodicity, the BTS returns resemble more closely to normal distribution than the CTS and TTS returns.
18.
This is in contrast to the findings in Tse and Yang (2012), which shows the superiority of the ACD-ICV method over the RK method via simulation using second-by-second transactions (sparsity of 1 s). The poor performance of ME1 is mainly due to the transaction sparsity, since using σ 2 as the proxy for integrated volatility over one price event becomes unreliable when transactions are sparse. Supporting evidence is provided in our simulation study that the RMSE of ME1 increases when observed transactions are more sparse.
19.
MD5 is a model with price jumps, and the TRV method is constructed to be robust to price jumps.
Figure 1. Intraday volatility and trading activity of JPMorgan Chase (JPM), January 2010 to April 2013.
Figure 1. Intraday volatility and trading activity of JPMorgan Chase (JPM), January 2010 to April 2013.
Econometrics 05 00051 g001
Figure 2. Time-transformation functions for JPM.
Figure 2. Time-transformation functions for JPM.
Econometrics 05 00051 g002
Figure 3. QQ (Quantile-Quantile) plots of sampled 30 min returns for JPMorgan Chase (JPM), January 2010–April 2013.
Figure 3. QQ (Quantile-Quantile) plots of sampled 30 min returns for JPMorgan Chase (JPM), January 2010–April 2013.
Econometrics 05 00051 g003
Table 1. Test of normality hypothesis for no-jump returns of 40 New York Stock Exchange (NYSE) stocks.
Table 1. Test of normality hypothesis for no-jump returns of 40 New York Stock Exchange (NYSE) stocks.
FrequencyCTSABFNBTS
5%1%5%1%5%1%
Weekly1593140
Daily404012564
30 min404040403026
Notes: The figures are the numbers of stocks (out of 40) for which the normality hypothesis at the 5% and 1% levels of significance are rejected based on the Lilliefors test implemented for 30 min, daily and weekly jump-adjusted returns, 2010–2013. The returns are constructed over 5 min sampling frequency using the Calendar Time Sampling (CTS) method, the method of Andersen et al. (2010) (ABFN) and the method proposed in this paper (BTS). The total number of trading days of each stock ranges from 830 to 853.
Table 2. Mean of the absolute difference for no-jump 30 min returns of 40 New York Stock Exchange (NYSE) stocks.
Table 2. Mean of the absolute difference for no-jump 30 min returns of 40 New York Stock Exchange (NYSE) stocks.
Measures of shapeCTSTTSABFNBTS
Skewness (diff.)0.12720.20910.11050.0384
Kurtosis (diff.)5.374510.78932.37490.2792
Notes: The figures are the average of the absolute difference of the calculated skewness and kurtosis versus 0 and 3, respectively. To calculate the skewness and kurtosis, we use 30 min jump-adjusted returns (2010–2013) of 40 stocks. The 30 min returns are constructed over 5 min sampling frequency using the Calendar Time Sampling (CTS) method, the Tick Time Sampling (TTS) method, the method of Andersen et al. (2010) (ABFN) and the method proposed in this paper (BTS). The total number of trading days of each stock ranges from 830 to 853.
Table 3. Summary of simulation models.
Table 3. Summary of simulation models.
ModelCodeDescription of ModelDescription of Model Parameters
Heston Model (high volatility)MD1 d log X ( t ) = μ σ 2 ( t ) 2 d t + σ ( t ) d W 1 ( t ) ,
d σ 2 ( t ) = κ ( α σ 2 ( t ) ) d t + γ σ ( t ) d W 2 ( t ) .
μ = 0.05 , κ = 5 , α = 0.25 , γ = 0.5 and Corr ( d W 1 ( t ) , d W 2 ( t ) ) = 0.5 .
Heston Model (low volatility)MD2Same as MD1. α = 0.04 , all remaining parameters same as MD1.
Two-factor affine stochastic volatility model with U-shape intraday volatility patternMD3 d log X ( t ) = σ u ( t ) σ s v ( t ) d W 1 ( t ) , σ s v 2 ( t ) = σ 1 2 ( t ) + σ 2 2 ( t ) , d σ 1 2 ( t ) = κ 1 [ θ 1 σ 1 2 ( t ) ] d t + η 1 σ 1 ( t ) d W 21 ( t ) , d σ 2 2 ( t ) = κ 2 [ θ 2 σ 2 2 ( t ) ] d t + η 2 σ 2 ( t ) d W 22 ( t ) ,
σ u ( t ) = C + A e a t + B e b ( 1 t ) , t [ 0 , 1 ] .
κ 1 = 0.6 , κ 2 = 0.1 , θ 1 = 0.09 , θ 2 = 0.04 , η 1 = 0.2 , η 2 = 0.1 ,
ρ 1 = Corr ( d W 1 ( t ) , d W 21 ( t ) ) = 0.9 ,
and ρ 2 = Corr ( d W 1 ( t ) , d W 22 ( t ) ) = 0.4 . A = 0.75 , B = 0.25 ,
C = 0.88929198 , a = 10 and b = 10 .
Deterministic volatility model with U-shape intraday volatility patternMD4 d log X ( u ) = σ ( u ) d W ( u ) , σ ( u ) = σ ( t , τ ) = σ 1 ( t ) σ 2 ( τ ) , where t is the day of trade and τ is the intraday time. σ 1 ( t ) is the volatility of day t, σ 2 ( τ ) is the intraday variations at time τ of each day. σ 1 ( t ) = 20 % for t = 1 with σ 1 ( t ) increasing linearly in t over 20 days to reach 30 % . It then remains level for the next 20 days and decreases linearly in t to 20 % over 20 days. σ 2 ( τ ) is computed as in Tse and Yang (2012) using the IBM tick-by-tick transaction data in 2012.
MD1 with price jumpsMD5 d log X ( t ) = μ σ 2 ( t ) 2 d t + σ ( t ) d W 1 ( t ) + J ( t ) d P ( t ) ,
d σ 2 ( t ) = κ ( α σ 2 ( t ) ) d t + γ σ ( t ) d W 2 ( t ) .
P ( t ) is a Poisson process with on average one price jump every two days.
J ( t ) is the size of the jumps with J ( t ) N ( 0.02 , 0.004 ) .
Table 4. Mean error (ME) and root mean-squared error (RMSE) of estimates of daily volatility using the Tripower Realized Volatility (TRV) method under the Business Time Sampling (BTS) scheme with subsampling.
Table 4. Mean error (ME) and root mean-squared error (RMSE) of estimates of daily volatility using the Tripower Realized Volatility (TRV) method under the Business Time Sampling (BTS) scheme with subsampling.
SparsityNSRModelMERMSE
1-min2-min3-min5-min10-min1-min2-min3-min5-min10-min
5-s0.005%MD1−0.42860.04700.00310.0004−0.17771.38691.72282.16102.81654.0975
MD2−0.4321−0.0500−0.0513−0.0338−0.13090.99641.17061.46651.92422.7797
MD3−0.44350.02000.0123−0.0019−0.19161.24251.56571.94312.55113.6823
MD4−0.2403−0.09130.01260.0055−0.20691.05741.37231.68492.22783.2164
MD5−0.06870.62820.77321.09401.64631.41632.01962.55873.44855.2161
0.01%MD10.33820.44860.26250.1532−0.11421.35201.76462.15622.80774.0850
MD20.07860.21720.11920.0675−0.08840.90491.18311.46081.91522.7695
MD30.32910.40020.25550.1439−0.13011.21261.60161.95212.54603.6734
MD40.59040.31280.27340.1532−0.14741.18091.40431.70592.23733.2230
MD50.72801.03311.03651.24921.71281.59892.16712.63913.49365.2306
0.02%MD11.84631.25230.77890.44990.01262.27622.10532.25062.81484.0590
MD21.08020.74790.46030.2651−0.00591.45031.38641.51091.91452.7529
MD31.78621.16440.73970.4265−0.01672.14401.91662.05422.55583.6603
MD42.26161.10360.78890.4437−0.02912.49661.76431.86092.27983.2357
MD52.27131.85031.56491.55591.84242.70802.65182.87533.59825.2587
10-s0.005%MD1−0.9375−0.0435−0.0307−0.0333−0.18801.63351.83572.16002.84574.1021
MD2−0.8123−0.1034−0.0778−0.0560−0.14171.23041.25001.47351.93942.7833
MD3−0.8436−0.0958−0.0423−0.0340−0.20881.48541.65841.97422.57513.7013
MD4−0.5797−0.2436−0.0830−0.0584−0.24321.27151.44041.71212.25473.2308
MD5−0.59860.54290.72901.06371.63341.56592.09912.55733.47115.2171
0.01%MD10.07130.33970.23560.1224−0.12511.35011.85612.16112.83554.0856
MD2−0.13450.15200.10100.0460−0.09920.94671.25441.46611.93232.7757
MD30.10280.28150.20970.1090−0.15291.23901.68001.97742.56863.6909
MD40.31440.19050.19290.0949−0.18301.18011.43211.72932.26133.2356
MD50.41900.93620.99801.22441.69681.52752.23112.63633.51265.2337
0.02%MD12.04791.11370.77700.4235−0.00512.50052.12852.26612.83584.0650
MD21.18150.66750.45860.2426−0.01751.58851.41981.52771.93012.7608
MD31.96221.02220.71580.3933−0.03402.34651.94482.07662.58553.6804
MD42.10821.03870.72070.3933−0.06442.42051.77281.88232.29973.2520
MD52.41111.72591.54371.53521.82772.87182.66692.87723.62255.2623
20-s0.005%MD1−1.9043−0.4650−0.1715−0.1586−0.30522.51541.90812.36322.88844.1176
MD2−1.4513−0.3972−0.1718−0.1551−0.21891.84651.32321.61421.97282.8010
MD3−1.7349−0.4945−0.1976−0.1617−0.32452.28801.74212.09962.65173.7326
MD4−1.3673−0.5413−0.2730−0.2261−0.35861.93721.64631.81012.30443.2759
MD5−1.56840.05390.61490.94671.52632.34522.01852.70003.49255.2178
0.01%MD1−0.77270.05950.0795−0.0012−0.24741.83881.86582.36192.86914.1079
MD2−0.6930−0.0470−0.0025−0.0515−0.17981.34361.27471.61111.96492.7898
MD3−0.67150.01100.0681−0.0090−0.26781.66841.68572.11312.63933.7245
MD4−0.3601−0.05770.0292−0.0623−0.30561.43541.54731.78542.31403.2870
MD5−0.43580.58720.88541.10721.58681.82452.11692.78763.53235.2285
0.02%MD11.40621.08340.57900.3089−0.12292.25432.19422.43462.87224.0765
MD20.74670.63890.33020.1612−0.09531.46461.46741.64701.95762.7740
MD31.35980.99230.56290.2802−0.15732.11061.98122.21022.64453.7064
MD41.61000.91720.65080.2535−0.18112.18131.80691.91532.35993.2984
MD51.74501.62991.40631.42371.71762.56232.64743.01913.63725.2592
Notes: ME and RMSE are the mean error and root mean-squared error, respectively, of the volatility estimates in annualized standard deviation in percentage. The average true daily integrated volatility is around 40% for MD1, 27% for MD2, 36% for MD3, 28% for MD4 and 40% for MD5. MD1 and MD2 are the Heston model at different volatility level. MD3 is the two-factor stochastic volatility model with intraday volatility periodicity. MD4 is the deterministic volatility model with intraday volatility periodicity and MD5 is the Heston model (MD1) with price jumps. The first column indicates the average duration of the observed simulated transactions. NSR is the noise-signal ratio. Results for average transaction frequency of 1 min, 2 min, 3 min, 5 min and 10 min are reported. The sampling frequency of the BTS scheme equals twice the average transaction duration.
Table 5. Comparison of the Tripower Realized Volatility (TRV) estimates under different sampling schemes with subsampling.
Table 5. Comparison of the Tripower Realized Volatility (TRV) estimates under different sampling schemes with subsampling.
Average RMSE Difference of the TRV Estimates1-min2-min3-min5-min10-min
Avg. (RMSE(CTS)-RMSE(BTS))0.30430.16660.18910.22150.1926
Avg. (RMSE(CTS)-RMSE(BTS))/RMSE(BTS) (%)17.82629.85159.64938.86265.7456
Avg. (RMSE(TTS)-RMSE(BTS))−0.02370.12610.17210.22400.2072
Avg. (RMSE(TTS)-RMSE(BTS))/RMSE(BTS) (%)−0.10117.99779.07109.10056.2101
Notes: RMSE is the root mean-squared error of the daily TRV estimates in annualized standard deviation in percent. The first row is the average of the RMSE of the TRV estimates (over all simulation models) based on the CTS scheme over that based on the BTS scheme. The second row is the average relative difference in percentage. The third and fourth rows are similarly presented for the TTS scheme compared to the BTS scheme.
Table 6. Mean error (ME) and root mean-squared error (RMSE) of daily volatility estimates of the realized kernel (RK) and Autoregressive Conditional Duration-Integrated Conditional Variance (ACD-ICV) methods for Model MD1.
Table 6. Mean error (ME) and root mean-squared error (RMSE) of daily volatility estimates of the realized kernel (RK) and Autoregressive Conditional Duration-Integrated Conditional Variance (ACD-ICV) methods for Model MD1.
ME RMSE
Sparsity NSR RK ACD-ICV Avg. Sampling Frequency (ACD-ICV) RK ACD-ICV Avg. Sampling Frequency (ACD-ICV)
1-min 3-min 5-min 10-min 15-min 1-min 3-min 5-min 10-min 15-min
5-s0.005%−0.1087ME1−5.8450−3.4421−2.6778−1.8270−1.35462.2978ME16.21423.88813.31492.97382.9941
ME2−0.2043−0.2366−0.2205−0.09280.0734ME21.75271.75001.96732.38102.6985
ME3 −0.11200.05400.24460.43620.7306 ME31.27721.40011.72611.43801.5706
0.01%−0.0876ME1−3.4503−2.0139−1.5554−1.0491−0.73572.2998ME13.88112.62412.45512.57352.7766
ME20.04370.00890.03010.16330.3410ME21.62661.69061.93262.38962.7213
ME30.14890.30760.49720.69500.9897 ME31.23671.37101.70781.49481.6735
0.02%−0.0461ME11.44070.69240.47110.34380.38242.3040ME12.09201.84682.01002.42902.7559
ME20.51810.50450.53530.68040.8449ME21.57781.78682.03402.51182.8656
ME30.66770.82531.00791.20711.5051 ME31.33201.49391.82911.73991.9751
10-s0.005%−0.1416ME1−10.9555−6.0669−4.6302−3.1722−2.44502.6341ME111.39356.43475.09263.98203.6161
ME2−0.3400−0.3899−0.3720−0.2555−0.0909ME22.28641.87652.03632.42422.6968
ME3 −0.2235 −0.19760.02120.27640.5716 ME31.47741.43021.58701.47641.5774
0.01%−0.1178ME1−8.5745−4.8751−3.7310−2.5558−1.94042.6363ME18.98995.25804.25613.49113.2988
ME2−0.0858−0.1337−0.11650.00150.1719ME22.02811.78251.99302.40332.7120
ME30.04090.07130.28290.53870.8356 ME31.41791.38361.56211.51291.6586
0.02%−0.0710ME1−4.8863−2.6923−2.0864−1.4154−1.03272.6399ME15.32223.21402.85232.75462.8704
ME20.42410.37670.39760.53040.7014ME21.83411.77172.01312.46892.8195
ME30.56650.59170.80601.06341.3599 ME31.45871.44541.65471.72921.9392
20-s0.005%−0.1987ME1−17.7796−9.5646−7.3266−5.0140−3.94443.0600ME118.30659.96127.73825.61304.7999
ME2−0.6770−0.7315−0.7231−0.6051−0.4518ME23.14082.23012.24932.51402.7698
ME3 −0.4760 −0.5490 −0.3795 −0.07620.2182 ME32.27891.60711.74311.59321.6135
0.01%−0.1723ME1−16.2877−8.6764−6.6429−4.5336−3.54343.0621ME116.83629.06037.05965.16794.4519
ME2−0.4067−0.4638−0.4586−0.3355−0.1811ME22.95392.08302.14642.44812.7132
ME3 −0.2070 −0.2778 −0.10000.19260.4882 ME32.20621.51191.70341.58091.6494
0.02%−0.1200ME1−12.8662−7.0624−5.3456−3.6371−2.82333.0670ME113.36767.44665.78924.37063.9009
ME20.11910.06290.07280.20210.3562ME22.62261.93042.04612.43032.7561
ME30.32870.26140.44930.72871.0260 ME32.16321.47691.75261.70061.8539
Notes: ME and RMSE are the mean error and root mean-squared error, respectively, of the volatility estimates in annualized standard deviation in percentage. MD1 is the Heston model and the average true daily integrated volatility is around 40%. ME1 is the ACD-ICV method of Tse and Yang (2012). ME2 is ME1 with δ 2 replaced by V D , the integrated volatility estimated using TRV with subsampling at 3 min sampling frequency. ME3 is ME2 with sampled durations computed from BTS returns. All ACD models are fitted to diurnally transformed durations using the time-transformation function based on the number of trades as in Tse and Dong (2014).
Table 7. Mean error (ME) and root mean-squared error (RMSE) of daily volatility estimates of the realized kernel (RK) and Autoregressive Conditional Duration-Integrated Conditional Variance (ACD-ICV) methods for Model MD5.
Table 7. Mean error (ME) and root mean-squared error (RMSE) of daily volatility estimates of the realized kernel (RK) and Autoregressive Conditional Duration-Integrated Conditional Variance (ACD-ICV) methods for Model MD5.
MERMSE
SparsityNSRRKACD-ICVAvg. Sampling Frequency (ACD-ICV)RKACD-ICVAvg. Sampling Frequency (ACD-ICV)
1-min3-min5-min10-min15-min 1-min3-min5-min10-min15-min
5-s0.005%5.4227ME1−5.8297−3.3882−2.5730−1.6045−1.01889.9602ME16.20263.84093.23602.86562.8789
ME20.80580.76850.78920.91641.0902ME21.90981.91992.14302.60912.9672
ME30.88461.04641.25561.45771.7653 ME31.66361.86712.16712.08482.2923
0.01%5.4418ME1−3.4340−1.9407−1.4617−0.8313−0.37699.9686ME13.86032.57212.42082.52772.7279
ME21.05121.01871.04531.17581.3516ME21.92152.00702.25722.72663.0797
ME31.14711.30821.51601.71532.0254 ME31.78631.99212.28392.24062.4745
0.02%5.4794ME11.47280.77130.58490.58180.72579.9860ME12.09901.87752.06122.50422.8495
ME21.53101.51811.54671.68911.8715ME22.15392.33152.56073.01353.3723
ME31.67211.82222.02632.23362.5459 ME32.12132.31452.58162.62222.8867
10-s0.005%5.3907ME1−10.9449−6.0276−4.5318−2.9491−2.104210.0436ME111.37956.40155.01263.81363.4271
ME20.66330.61060.62420.74700.9182ME22.31041.94612.13772.57422.9195
ME30.77050.79480.99851.29141.5981 ME31.68281.84352.02092.02292.2163
0.01%5.4121ME1−8.6446−4.8146−3.6363−2.3242−1.584210.0534ME19.06075.20474.17443.33473.1353
ME20.91530.86600.88911.01251.1800ME22.19411.99212.20582.65593.0205
ME31.03581.06041.26891.55481.8631 ME31.78761.94312.13332.17522.3929
0.02%5.4548ME1−4.8774−2.6232−1.9874−1.1786−0.656210.0727ME15.30533.15922.78332.65252.7837
ME21.42831.38481.40851.54071.7137ME22.25912.24082.45542.91283.2801
ME31.56571.59531.79832.07972.3929 ME32.09472.23932.43942.54642.8021
20-s0.005%5.3322ME1−17.8068−9.5419−7.2562−4.8115−3.599210.1532ME118.33949.93987.67405.44354.5299
ME20.33970.28210.28820.40190.5596ME23.07552.11172.16662.52712.8374
ME30.53400.46390.61570.94731.2592 ME32.26641.74451.95291.91892.0788
0.01%5.3562ME1−16.4046−8.6377−6.5612−4.3376−3.210110.1639ME116.95029.02846.98575.00494.2219
ME20.60350.54730.56000.66820.8417ME22.99032.10082.18182.56432.9096
ME30.80430.73850.88281.21681.5292 ME32.31521.82162.01522.04502.2400
0.02%5.4036ME1−12.8619−7.0131−5.2494−3.4318−2.489710.1854ME113.36357.40265.69994.21293.6884
ME21.13781.07781.09051.21021.3818ME22.83242.21762.32622.76203.1219
ME31.34351.27461.42971.75652.0717 ME32.50682.06512.29052.38642.6243
Notes: ME and RMSE are the mean error and root mean-squared error, respectively, of the volatility estimates in annualized standard deviation in percentage. MD5 is the Heston model with price jumps and the average true daily integrated volatility is around 40%. ME1 is the ACD-ICV method of Tse and Yang (2012). ME2 is ME1 with δ 2 replaced by V D , the integrated volatility estimated using TRV with subsampling at 3-min sampling frequency. ME3 is ME2 with sampled durations computed from BTS returns. All ACD models are fitted to diurnally transformed durations using the time-transformation function based on the number of trades as in Tse and Dong (2014).
Table 8. Comparison of the root mean-squared error (RMSE) of daily volatility estimates of the realized kernel (RK) and Autoregressive Conditional Duration-Integrated Conditional Variance (ACD-ICV) methods.
Table 8. Comparison of the root mean-squared error (RMSE) of daily volatility estimates of the realized kernel (RK) and Autoregressive Conditional Duration-Integrated Conditional Variance (ACD-ICV) methods.
RKACD-ICVAvg. Sampling Frequency (ACD-ICV)
1-min3-min5-min10-min15-min
3.9198ME18.19554.76033.86403.20273.0690
ME21.93211.64301.75422.15592.4848
ME31.44831.39581.51431.47551.6280
Notes: RMSE are the root mean-squared error of the volatility estimates (over all simulation models) in annualized standard deviation in percentage. ME1 is the ACD-ICV method of Tse and Yang (2012). ME2 is ME1 with δ 2 replaced by V D , the integrated volatility estimated using TRV with subsampling at 3-min sampling frequency. ME3 is ME2 with sampled durations computed from BTS returns. All ACD models are fitted to diurnally transformed durations using the time-transformation function based on the number of trades as in Tse and Dong (2014).

Share and Cite

MDPI and ACS Style

Dong, Y.; Tse, Y.-K. Business Time Sampling Scheme with Applications to Testing Semi-Martingale Hypothesis and Estimating Integrated Volatility. Econometrics 2017, 5, 51. https://doi.org/10.3390/econometrics5040051

AMA Style

Dong Y, Tse Y-K. Business Time Sampling Scheme with Applications to Testing Semi-Martingale Hypothesis and Estimating Integrated Volatility. Econometrics. 2017; 5(4):51. https://doi.org/10.3390/econometrics5040051

Chicago/Turabian Style

Dong, Yingjie, and Yiu-Kuen Tse. 2017. "Business Time Sampling Scheme with Applications to Testing Semi-Martingale Hypothesis and Estimating Integrated Volatility" Econometrics 5, no. 4: 51. https://doi.org/10.3390/econometrics5040051

APA Style

Dong, Y., & Tse, Y. -K. (2017). Business Time Sampling Scheme with Applications to Testing Semi-Martingale Hypothesis and Estimating Integrated Volatility. Econometrics, 5(4), 51. https://doi.org/10.3390/econometrics5040051

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop