Next Article in Journal
Simulation of the Grondona System of Conditional Currency Convertibility Based on Primary Commodities, Considered as a Means to Resist Currency Crises
Next Article in Special Issue
Equalizing Seasonal Time Series Using Artificial Neural Networks in Predicting the Euro–Yuan Exchange Rate
Previous Article in Journal
Managerial Self-Attribution Bias and Banks’ Future Performance: Evidence from Emerging Economies
Previous Article in Special Issue
The Role of Economic Uncertainty in UK Stock Returns
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Quasi-Maximum Likelihood Estimation for Long Memory Stock Transaction Data—Under Conditional Heteroskedasticity Framework

by
A. M. M. Shahiduzzaman Quoreshi
1,*,
Reaz Uddin
1 and
Naushad Mamode Khan
2
1
Department of Industrial Economics, Blekinge Institute of Technology, SE-371 79 Karlskrona, Sweden
2
Department of Economics and Statistics, Faculty of Social Sciences and Humanities, University of Mauritius, Reduit 80837, Mauritius
*
Author to whom correspondence should be addressed.
J. Risk Financial Manag. 2019, 12(2), 74; https://doi.org/10.3390/jrfm12020074
Submission received: 15 March 2019 / Revised: 17 April 2019 / Accepted: 25 April 2019 / Published: 27 April 2019
(This article belongs to the Special Issue Analysis of Global Financial Markets)

Abstract

:
This paper introduces Quasi-Maximum Likelihood Estimation for Long Memory Stock Transaction Data of unknown underlying distribution. The moments with conditional heteroscedasticity have been discussed. In a Monte Carlo experiment, it was found that the QML estimator performs as well as CLS and FGLS in terms of eliminating serial correlations, but the estimator can be sensitive to start value. Hence, two-stage QML has been suggested. In empirical estimation on two stock transaction data for Ericsson and AstraZeneca, the 2SQML turns out relatively more efficient than CLS and FGLS. The empirical results suggest that both of the series have long memory properties that imply that the impact of macroeconomic news or rumors in one point of time has a persistence impact on future transactions.
JEL Classifications:
C13; C22; C25; C51; G12; G14

1. Introduction

Classical economic theory of price determination is a function of demand and supply. For example, in the Walrasian auctioneer approach, demands and supplies of a good are aggregated to find a market-clearing price. However, availability of high frequency data enables studying market mechanisms or market microstructures. These studies depart from demand–supply function of price determination under classical economics and takes into account other factors considered to be influential in price determination. For instance, Working (1953) in matching demand and supply curves in equilibrium expands focus on the underlying trading mechanism. Demsetz (1968) investigates effect of transactions costs in price determination in securities market and incorporates the influence of time factor of demand and supply in analyzing formation of market prices. Studies of market microstructure concern, among others, the impact of transactions, bid–ask spreads (the difference between bid and ask prices), volume of and time between transactions (duration) on price formation. The studies also investigate trading behavior of actors in response to news, rumors, etc. In the securities market, a transaction (or a trade) is completed by a buyer and a seller agreeing to exchange a specific volume of stocks at certain price. These transactions and time elapsed between each transaction are correlated. Fewer trades take place with increasing time lapse between successive transactions in a given time interval. Therefore, trading intensity and duration can be considered to be inversely related. During the past decades research on market microstructure to understand pricing processes has been centered on the trading intensity and durations. Diamond and Verrecchia (1987) illustrate implications of bad news in low trading intensity. Easley and O’hara (1992) differ on level of implication demonstrating that a low trading intensity implies no news. Besides, Engle (2000) finds that durations are associated with price volatilities. Moreover, the stock transactions data are counts for a fixed interval of time and Quoreshi (2014) explains that the time series of the data may also have long memory property.
The long memory phenomenon in time series has been first considered by Hurst (1951, 1956). In these studies, he explains the long term storage requirements of the Nile River. He shows that the cumulated water flows in a year depends not only on the water flows in recent years, but also on water flows in years much prior to the present year. Mandelbrot and van Ness (1968) explain and advance Hurst’s studies by employing fractional Brownian motion. In analogy with Mandelbrot and van Ness (1968); Granger (1980); Granger and Joyeux (1980) and Hosking (1981) developed Autoregressive Fractionally Integrated Moving Average (ARFIMA) models to account for the long memory in time series data. Granger and Ding (1996) pointed out that a number of other processes can also have the long memory property. An empirical study regarding the usefulness of ARFIMA models is conducted by Bhardwaj and Swanson (2005), who find strong evidence in favor of ARFIMA in absolute, squared, and log-squared stock index returns.
A time series of count data describes a non-negative sequence of count observations, which is integer-valued and observed at equidistant intervals of time. The literature on different techniques to model, estimate and exploit such data is ever growing. Jacobs and Lewis (1978a, 1978b, 1983) introduced time dependence and developed discrete ARMA (DARMA) models. An important difference between the continuous variable ARMA model and its corresponding Integer-valued ARMA (INARMA) version is that the latter contains parameters that are interpreted as probabilities and then take on values in narrower intervals than do the parameters of the ARMA model (e.g., McKenzie 1986; Al-Osh and Alzaid 1987; Al-Osh and Alzaid 1988). Brännäs and Quoreshi (2010) advance integer-valued moving average model to model the number of transactions in intraday data of stocks. Quoreshi (2006, 2008, 2014, 2017) advances further the INMA model into bivariate, multivariate and long memory (INARFIMA) framework. These papers consider Conditional Least Square (CLS), Feasible Generalized Least Square (FGLS), and Generalized Methods of Moments (GMM). A large number of studies have considered the modeling of bivariate or multivariate count data assuming an underlying Poisson distribution (e.g., Gourieroux et al. 1984). Heinen and Rengifo (2003) introduced multivariate time series count data models based on the Poisson and the double Poisson distribution. Other extensions to traditional count data regression models are considered by, e.g., Brännäs and Brännäs (2004) and Rydberg and Shephard (1999). None of these papers consider maximum likelihood estimation since full density function of underlying distribution is unknown. Sunecher et al. (2018) recently introduce a first-order bivariate integer-valued moving average process (BINMA(1)) where they propose a generalized quasi-likelihood (GQL) method of estimation. Ristic et al. (2018) introduces a new bivariate integer-valued moving average of the first order (BINMA(1)) process with independent Negative Binomial (NB) innovations under nonstationary moment conditions. They also employed a generalized quasi-likelihood (GQL) method of estimation.
In this paper, we propose a quasi-maximum likelihood (QML) estimator for nonstationary integer-valued long memory model for unknown underlying distribution and compare this estimator with CLS and FGLS that have performed better than GMM in the previous studies. We employ the QML estimator on stock transactions data for Ericsson and AstraZeneca. Both stock series demonstrate long memory. Empirically it is also found that the QML estimator is more efficient compared to the other estimators.
The paper is organized as follows. The INARFIMA ( 0 , d , 0 )   model is presented in Section 2. The conditional and unconditional moment properties of the INARFIMA ( 0 , d , 0 ) models are obtained. The estimation procedures, CLS and FGLS for unknown parameters are discussed and the QML estimator is proposed in Section 3. The results from a Monte Carlo experiment are presented in Section 4. A detailed description of the empirical data is given in Section 5. The empirical results for the stock series are presented in Section 6, and the concluding comments are included in the final section.

2. The Model

Quoreshi (2014) proposes the INARFIMA ( p , d , q ) long memory properties of stock transaction data. The INMA ( ) representation of INARFIMA model, which is INARFIMA ( 0 , d , 0 ) , can be written as
y t = u t + d 1 · u t 1 + d 2 · u t 2 + d 3 · u t 3
or
y t = ( 1 + L ) d u t
where L is a lag operator and the notation L = ( L ) i for i > 0 . Here we assume that here is no cross-lag dependence among u t . Note that yt has long memory in a sense that the variables have slow decaying autocorrelation functions and the parameters d j = Γ ( j + d i ) / [ Γ ( j + 1 ) Γ ( d i ) ] , j = 0 ,   1 , 2 ,   where   d 0 = 1 . Note that d j are considered thinning probabilities and hence d j [ 0 , 1 ] . The macroeconomic news or rumors are assumed to be captured by { u t } and filtered by { d j } through the system. The binomial thinning operator is used to account for the integer-valued property of count data. This operator can be written as
α · u = j u z j
with { z j } j = 1 u an iid sequence of 0–1 random variables, such that Pr ( z j = 1 ) = α = 1 − Pr( z j = 0 ) . Conditionally on the integer-valued u , α · u is binomially distributed with E ( α · u | u ) = α u and V ( α · u | u ) = α ( 1 α ) u . Unconditionally it holds that E ( α · u ) = α λ and V ( λ · u ) = α 2 σ 2 + α ( 1 α ) λ , where E ( u ) = λ and V ( u ) = σ 2 . Obviously, α · u { 0 , 1 , , u } . Many authors accentuate exact distributional results for y t , while Brännäs and Quoreshi (2010) stress only the first- and second-order moment conditions. In analogy with Brännäs and Quoreshi (2010), we construct the Quasi-Maximum Likelihood Estimation based on the first and second-order moment condition.
Assuming independence between and within the thinning operations and { u t } an iid sequence with mean λ and variance σ 2 = v λ where υ > 0, the unconditional first and second-order moments can be given as
E ( y t ) = λ ( 1 + i = 1 d i )
V ( y t ) = λ [ ( v + i = 1 d i ) + ( v 1 ) ( 1 + i d i 2 ) ]
γ k = σ 2 ( d k + i = 1 d i d k + 1 ) ,                                                             k 1  
where γk denotes the autocovariance function at lag k. It is obvious from (3)–(5) that the mean, variance, and autocovariance take only positive values since λ , σ 2 , and d i are all positive and that i = 1 d i < is required for { y t } to be a stationary process. Note also that the variance may be larger than the mean (overdispersion), smaller than the mean (underdispersion), or equal to the mean (equidispersion) depending on whether v   > 1, v   ∈ (0, 1) or v = 1, respectively. When lag length q is finite, summing to infinite is replaced to summing to q. The conditional mean, variance, and covariance for the INARFIMA ( 0 , d , 0 ) are in an analogous way:
E ( y t | Y t 1 ) = E t 1 = λ ( 1 + i = 1 d i u t i )
V ( y t | Y t 1 ) = V t 1 = λ v + i = 1 d i ( 1 d i ) u t i    
where Y t 1 is the information set available at time t − 1. The conditional mean and variance vary with u t i . Since the conditional variance varies with u t i , there is a conditional heteroscedasticity or nonstationary property of moving average type that Brännäs and Hall (2001) called MACH(q). The effect of u t i on the mean is greater than on the variance. Note also, that like the unconditional variance, the conditional variance could be overdispersed, underdispersed, or equidispersed depending on whether v > 1 + i d λ ,   v < ( 1 + i d i λ )   or   v i = 1 + i d i λ ,   respectively .

3. Estimation

If we do not assume a full density function, we may estimate the Quasi Maximum Likelihood (QML) Estimator as discussed by Weiss (1986) instead of Maximum Likelihood (ML) Estimator. Conditional Least Square (CLS), Feasible Generalized Least Square (FGLS) are Generalized Methods of Moments discussed in INMA model (Brännäs and Quoreshi 2010). In the previous studies, FGLS is the best estimator among the three in terms of eliminating serial correlation (Brännäs and Quoreshi 2010). The CLS is second, which is almost as good as FGLS. Here, we only construct the QML and FGLS for INARFIMA ( 0 , d , 0 ) model and compare the results with CLS.
The Conditional least square (CLS) estimator for INARFIMA ( 0 , d , 0 ) representation model has the following residual.
                                        e t = y t E t 1 = y t λ ( 1 + i = 1 d i u t i )
The criterion function S C L S = i = m + 1 T e t 2 is minimized with respect to unknown parameters, i.e., ψ = ( λ   and   δ ) where δ is vector of parameters with elements d i . The E t 1 is the conditional expected value of y t defined in Equation (6). Using a finite maximum lag m in (8) instead of infinite lags may cause biasing effects. Due to omitted variables, i.e., u t m 1 , , u t , we may expect a positive bias on the parameters λ   and   δ (Brännäs and Quoreshi 2010). These moment conditions correspond to the normal equations of the CLS estimator that focuses on the unknown parameters of the conditional mean function. Alternatively and equivalently, the properties E( e t ) = 0 and E( e t e t j ) = 0, j ≥ 1 could be used. The FGLS estimator minimizes
                                          S F G L S = t = m + 1 T e t V ^ 1
with V ^ 1 as given. The variance of error from CLS estimates may be used for approximation of V ^ 1 in equation (9). Alternatively, V ^ 1 can be estimated as specified in (7) by employing estimates from CLS. The covariance matrix estimators for CLS and FGLS are
C o v ( ψ ^ C L S ) = ( t = m + 1 T e t ψ e t ψ ) 1
C o v ( ψ ^ F G L S ) = ( t = m + 1 T V ^ 1 e t ψ e t ψ ) 1
The QML estimator for INARFIMA ( 0 , d , 0 ) representation model has the same residual as in Equation (9), and we propose the following criterion function to maximize
f ( y 1 ,   y 2 , y T | Y t 1 , λ and   δ ) = t = 1 T f ( y t | Y t 1 , ψ i )   = ( 1 2 π V t 1 ) T / 2 e x p ( i = m + 1 T e t 2 2 V t 1 )
where   ψ i = ( λ and   δ ) and V t 1 is as in Equation ( 7 ) . This specification may be motivated by the central limit theorem since y t are counts but not the E t 1 . According to the central limit theorem, the standardized expected value of a sample is normally distributed with mean zero and variance one if the sample size is large enough. A relevant empirical study of distribution properties on high-frequency intraday transaction prices are conducted by Andersen et al. (2001). Taking the logarithm of Equation (10), we may simply use the criterion function and minimize the function as
      L n f ( y 1 ,   y 2 , y T | Y t 1 , λ , δ and V ^ t 1 ) = T 2 ln ( V ^ t 1 ) ln ( 2 π ) ( t = m + 1 T e t 2 2 V ^ t 1 )
where V ^ t 1 is an estimate for V t 1 that is to be estimated. Since T, 2, and π   are constants, we can equivalently minimize the following criterion function
L n f ( y 1 ,   y 2 , y T | Y t 1 , λ , δ and V ^ t 1 ) = ln ( V ^ t 1 ) ( t = m + 1 T e t 2 V ^ t 1 ) .
Note that the V ^ t 1 is to be estimated at the same time as the other parameters. If the estimation is sensitive to the start value of V ^ t 1 , we can obviously estimate CLS at the first stage and calculate the V ^ t 1 which can be used as the start value for QML. We call this estimation procedure Two-Stage Quasi Maximum Likelihood (2SQML) Estimation. The covariance matrix estimators for QML and 2SQML are
C o v ( ψ ^ Q M L ) = ( t = m + 1 T V ^ 1 e t ψ e t ψ ) 1
C o v ( ψ ^ 2 S Q M L ) = ( t = m + 1 T V ^ C L S 1 e t ψ e t ψ ) 1 .
Note that we call an estimator relatively efficient if we have a smaller standard deviation (error) for the parameters of interest compared to the other estimator. The covariance matrices mentioned above calculate the standard error for the estimates in respective estimator. The criteria that are used to choose best model fitting are, e.g., Adjusted- R 2 , mean square error (MSE), Akaike Information Criteria (AIC), and Schwarz Bayesian Information Criterion (SBIC). The Adjusted- R 2 explains the degree of variation in the dependent variable explained by the independent variables, while MSE explains to what extend the regression models are unable to explain the variation of independent variable. Hence, this implies that the higher Adjusted- R 2 the smaller MSE. The S C L S is the MSE for Equation (8), while the corresponding Adjusted- R 2 is (1 − S C L S /var( y t )), where var( y t ) is the variance of the independent variable. The ACI and SBIC are similar to MSE. Instead of MSE, the AIC and SBIC use a function that incorporate the value from the likelihood function used in estimation, as in Equation (11). In the first step, these criteria may be used for model selection.
In time series, Ljung–Box statistics and Box–Pierce test are widely used to check the serial correlations between the residuals. These criteria can be used to evaluate estimators given a model. The null hypothesis is that the residuals e t are independently distributed while the alternative hypothesis is that the residuals are not independently distributed; there are serial correlations between the residuals. The Ljung–Box statistics for residuals can be written
Q L B = n ( n + 2 ) k = 1 h ρ ^ k 2 n k
where   n is the number of observations and k is the number of lags used for estimation (Ljung and Box 1978). The ρ ^ k is the autocorrelations of residuals at lag k . The null hypothesis is rejected if
Q L B > χ 1 α , h 2
where χ 1 α , h 2 is chi-squared distribution with significance level α and degree of freedom h . Similarly, the Box–Pierce test can be written as
Q B P = n k = 1 h ρ ^ k 2
where n , k , are ρ ^ k as described above (Box and Pierce 1970). The difference between these two statistics emerges from the term ( n + 2 ) and ( n k ) used in Ljung–Box statistics and Q L B > Q B P and hence Q L B is more restrictive to reject the null hypothesis.

4. Monte Carlo Experiment

Quoreshi (2014) studies, in a brief Monte Carlo experiment, the bias, MSE, Ljung–Box statistics (LB), AIC, and SBIC properties of the CLS estimators for finite-lag specifications; data is generated according to INARFIMA ( 0 , d , 0 ) . Here, we study employing QML, 2SQML, and FGLS estimators and compare the results with CLS. Initially, we found that QML is highly sensitive to start values and produces large biased estimates. Hence, we only focus the result on 2SQML, FGLS, and CLS. This study generates data from Poisson distribution to study the bias properties under 2SQML and FGLS estimation procedure. Smith et al. (1996) and Quoreshi (2014) studied the bias and misspecification in ARFIMA and INARFIMA models, respectively. Drost et al. (2009) investigate finite sample behavior of semiparametric integer-valued AR(p) models, while Brännäs and Quoreshi (2010) study finite lag misspecification when the data is generated according to an infinite-lag INMA model. In this Monte Carlo experiment we study the bias, MSE, Ljung–Box statistics, AIC, and SBIC properties of the FGLS and 2SQML estimators for finite-lag specifications when data is generated according to INARFIMA (0, d, 0).The data generating process is as in (1), with d j = Γ ( j + d ) / [ Γ ( j + 1 ) Γ ( d ) ] , j = 0 ,   1 , 2 ,   where   d 0 = 1 and where u t is drawn according to Equation (2). The values for d, d = 0.1 ,   0.25   and   0.4 are used and lag length m = 70   is   chosen . The u t sequence is generated as Poisson with parameter λ = 5 so that λ = σ 2 in the conditional variance equation in (7). Six time series with length T = 2000   and   T = 10 , 000 are generated. The first 500 observations are discarded to avoid the start-up effect. The results for the Monte Carlo experiment are given in Table 1, Table 2 and Table 3. In Table 1, we see that the estimates for   d decreases as lag length (M) increases from 10 to 70 and approaches to the theoretical value d   =   0.1 . This implies that the biasness of the estimates decreases from 0.071 to 0.004 for T = 2000 as the lag length approaches the theoretical lag length 70 (see d ^ corresponding to M10–M70 in Table 1). The parameter estimated with lag length 90 (M90) is d <   0.1 , which implies negative biasness. Like Brännäs and Quoreshi (2010), we conclude that we may expect a positive biasing effect on the parameters due to omitted variables. The results for MSE, AIC, and SBIC for all the three estimators are same up to three decimals (Table 1). Depending on the size of d, the standard AIC and SBIC may need to be corrected. The result indicates that 2SQML, FGLS and CLS estimators perform equally well in terms of eliminating serial correlation (see QLB100 and QLB100 in Table 1, Table 2 and Table 3). However, standard error (see s.e. for M70 in Table 1, Table 2 and Table 3) for d varies slightly between the estimator. Taking this into account, we may conclude that CLS performs best while 2SQML estimator performs better than FGLS. It is to be noted that both QML and FGLS are sensitive to start values. In this case 2SQML should be used.

5. Data and Descriptive

In this paper, we use the same data set used by Quoreshi (2014). The reason for this is that we introduce Quasi-Maximum Likelihood Method, which has not been considered in the study and the studies on count data earlier due to unknown underlying distribution. We intend to employ this method on the same data set and replicate the previous study to compare with results emerged from employing this method. Quoreshi (2014) has downloaded the tick-by-tick data for Ericsson B and AstraZeneca from the Ecovision system and the data are later filtered to generate transaction data which are counts. The stocks are frequently traded and have the highest turnovers at the Stockholmsbörsen. The two stock series are collected for the period 5 November–12 December 2002. Due to a technical problem in downloading data there are no data for 12 November in the time series and the first captured minutes of 5 December is 1037. Since we are interested in capturing the number of ordinary transactions, we have deleted all trading before 0935 (trading opens at 0930) and after 1714 (order book closes at 1720). The transactions in the first few minutes are subject to a different trading mechanism while there is practically no trading after 1714. The data are aggregated into one minute intervals of time. For high frequency data, researchers usually use one, two, five, or ten minute intervals of time and the choice is rather arbitrary. There are altogether 11,960 observations for both the Ericsson B and AstraZeneca series. The series together with their autocorrelation and partial-autocorrelation functions and histograms are exhibited in Figure 1. There are frequent zero frequencies in both series, especially in the AstraZeneca series, and hence the application of count data modeling is called for. The counts in both series fluctuate around their means which is an indication of mean reverting processes. The autocorrelation functions for both series suggest fractional integration which implies long memory. The histograms exhibit the distribution of counts and the possible empirical densities of the counts. Even with relatively large sample size, the distributions appear far from being normally distributed. On the other hand, the distributions appear to be similar to Poisson distribution; both variances are greater than their respective means (Figure 1). Hence, there is no scope for employing known distributions.

6. Empirical Results

CLS, FGLS, and 2SQML are employed for estimation with lag length of 70 following the suggestions of Brännäs and Quoreshi (2010). It is to be noted that the AIC and SBIC criteria are not applicable in the context of long memory (Brännäs and Quoreshi 2010; Quoreshi 2014), which is also supported by the Monte Carlo Experiment. The results of the empirical studies are presented in Table 4. Empirically, we find evidence for long memory property ( d ^   < 0.5) for both Ericsson B and AstraZeneca series (see Table 4). The series for both AstraZeneca and Ericsson has mean reversion property and is covariance stationary. The findings suggest that the impact of macroeconomic news or rumors in one point of time has a persistence impact on future transactions. It may be recalled that news disseminated through formal channels which may have impact on overall stock markets and specifically on a particular stock is termed macroeconomic news, such as news on interest rates or unemployment statistics for a country which may influence all stocks. Rumors are information that spread through unofficial channels yet is related to macroeconomic news or a particular stock. For AstraZeneca, we find that CLS perform best while 2SQML performs better than FGLS in terms of eliminating serial correlation. Considering the standard error of the parameters ( λ ^ ,   d ^ ), we find that 2SQML performs best among the three estimators, while CLS performs better than FGLS. For Ericsson B, we find that 2SQML performs best in terms of both eliminating serial correlations and minimum standard error for the parameters. All other corresponding estimates turn out equal up to 3 decimals estimated by the three estimators.

7. Concluding Remarks

This paper introduces Quasi-Maximum Likelihood Estimation of Integer-Valued Long Memory Model for unknown underlying distribution and the estimation procedures for QML have been discussed. The paper compares the 2SQML estimator with FGLS and CLS. In a Monte Carlo experiment, it is found that the 2SQML, FGLS, and CLS estimators perform equally well in terms of eliminating serial correlation. The empirical study suggests that CLS performs best for AstraZeneca, while 2SQML performs best for Ericsson B in terms of eliminating serial correlations. However, the 2SQML estimator performs better than both the CLS and FGLS in terms of minimum standard error for estimates of the parameters for both Ericsson B and AstraZeneca, although CLS perform best followed by 2SQML in the simulation study. Note that the data in the simulation study is equidispersed since the data is generated from the Poisson distribution, while the data for the empirical study is overdispersed. The results of the study collectively may indicate that 2SQML estimator is relatively more efficient compared to CLS and FGLS for overdispersed data. The empirical results suggest that both series have long memory properties, which implies that the impact of macroeconomic news or rumors in one point of time has a persistence impact on future transactions.

Author Contributions

R.U. Contributed to literature review, estimation, simulation and data Analysis. N.M.K. contributed to estimation, simulation and commenting on overall paper. A.M.M.S.Q. designed, supervised and wrote the paper.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Al-Osh, M. A., and Aus A. Alzaid. 1987. First Order Integer-Valued Autoregressive INAR(1)) Process. Journal of Time Series Analysis 8: 261–75. [Google Scholar] [CrossRef]
  2. Al-Osh, M. A., and Aus A. Alzaid. 1988. Integer-Valued Moving Average (INMA) Process. Statistical Papers 29: 281–300. [Google Scholar] [CrossRef]
  3. Andersen, Torben G., Tim Bollerslev, Francis X. Diebold, and Heiko Ebens. 2001. The distribution of realizedstockreturn volatility. Journal of Financial Economics 61: 43–76. [Google Scholar] [CrossRef]
  4. Bhardwaj, Geetesh, and Norman R. Swanson. 2005. An Empirical Investigation of the Usefulness of ARFIMA Models for Predicting Macroeconomic and Financial Time Series. Journal of Econometrics 131: 539–78. [Google Scholar] [CrossRef]
  5. Box, George E. P., and David A. Pierce. 1970. Distribution of Residual Autocorrelations in Autoregressive-Integrated Moving Average Time Series Models. Journal of the American Statistical Association 65: 1509–26. [Google Scholar] [CrossRef]
  6. Brännäs, Kurt, and Eva Brännäs. 2004. Conditional Variance in Count Data Regression. Communication in Statistics; Theory and Methods 33: 2745–58. [Google Scholar] [CrossRef]
  7. Brännäs, Kurt, and Andreia Hall. 2001. Estimation in Integer-Valued Moving Average Models. Applied Stochastic Models in Business and Industry 17: 277–91. [Google Scholar] [CrossRef]
  8. Brännäs, Kurt, and A. M. M. Shahiduzzaman Quoreshi. 2010. Integer-Valued Moving Average Modelling of the Number of Transactions in Stocks. Applied Financial Economics 20: 129–440. [Google Scholar] [CrossRef]
  9. Demsetz, Harold. 1968. The Cost of Transacting. Quarterly Journal of Economics 82: 33–53. [Google Scholar] [CrossRef]
  10. Diamond, Douglas W., and Robert E. Verrecchia. 1987. Constraints on Short-Selling and Asset Price Adjustments to Private Information. Journal of Financial Economics 18: 277–311. [Google Scholar] [CrossRef]
  11. Drost, Feike C., Ramon Van den Akker, and Bas J. M. Werker. 2009. Efficient estimation of auto-regression parameters and innovation distributions for semiparametric integer-valued AR(p) models. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 71: 467–85. [Google Scholar] [CrossRef]
  12. Easley, David, and Maureen O’hara. 1992. Time and the Process of Security Price Adjustment. Journal of Finance 47: 905–27. [Google Scholar] [CrossRef]
  13. Engle, Robert F. 2000. The Econometrics of Ultra-High Frequency Data. Econometrica 68: 1–22. [Google Scholar] [CrossRef]
  14. Gourieroux, Christian, Alain Monfort, and Alain Trognon. 1984. Pseudo Maximum Likelihood Methods: Application to Poisson Models. Econometrica 52: 701–20. [Google Scholar] [CrossRef]
  15. Granger, Clive W. J. 1980. Long Memory Relationships and the Aggregation of Dynamic Models. Journal of Econometrics 14: 227–38. [Google Scholar] [CrossRef]
  16. Granger, Clive W. J., and Zhuanxin Ding. 1996. Varieties of Long Memory Models. Journal of Econometrics 73: 61–77. [Google Scholar] [CrossRef]
  17. Granger, Clive W. J., and Roselyne Joyeux. 1980. An Introduction to Long-Memory Time Series Models and Fractional Differencing. Journal of Time Series Analysis 1: 15–29. [Google Scholar] [CrossRef]
  18. Heinen, Andréas, and Erick Rengifo. 2003. Multivariate Modelling of Time Series Count Data: An Autoregressive Conditional Poisson Model. CORE Discussion Paper 2003/25. Louvain-la-Neuve: Université Catholique de Louvain. [Google Scholar]
  19. Hosking, J. R. M. 1981. Fractional Differencing. Biometrika 68: 165–76. [Google Scholar] [CrossRef]
  20. Hurst, Harold Edwin. 1951. Long-Term Storage Capacity of Reservoirs. Transactions of the American Society of Civil Engineers 116: 770–808. [Google Scholar]
  21. Hurst, Harold Edwin. 1956. Methods of Using Long-Term Storage in Reservoirs. Proceedings of the Institute of Civil Engineers 1: 519–43. [Google Scholar] [CrossRef]
  22. Jacobs, Patricia A., and Peter A. W. Lewis. 1978a. Discrete Time Series Generalized by Mixtures I: Correlational and Runs Properties. Journal of the Royal Statistical Society B 40: 94–105. [Google Scholar]
  23. Jacobs, Patricia A., and Peter A. W. Lewis. 1978b. Discrete Time Series Generalized by Mixtures II: Asymptotic Properties. Journal of the Royal Statistical Society B 40: 222–28. [Google Scholar]
  24. Jacobs, Patricia A., and Peter A. W. Lewis. 1983. Stationary Discrete Autoregressive Moving Average Time Series Generated by Mixtures. Journal of Time Series Analysis 4: 19–36. [Google Scholar] [CrossRef]
  25. Ljung, Greta M., and George E. P. Box. 1978. On a Measure of a Lack of Fit in Time Series Models. Biometrika 65: 297–303. [Google Scholar] [CrossRef]
  26. Mandelbrot, Benoit B., and John W. van Ness. 1968. Fractional Brownian Motions, Fractional Noises and Applications. SIAM Review 10: 422–37. [Google Scholar] [CrossRef]
  27. McKenzie, Ed. 1986. Autoregressive Moving-Average Processes with Negative Binomial and Geometric Marginal Distributions. Advances in Applied Probability 18: 679–705. [Google Scholar] [CrossRef]
  28. Quoreshi, A. M. M. Shahiduzzaman. 2006. Bivariate Time Series Modelling of Financial Count Data. Communications in Statistics: Theory and Methods 35: 1343–58. [Google Scholar] [CrossRef]
  29. Quoreshi, A. M. M. Shahiduzzaman. 2008. A vector integer-valued moving average model for high frequency financial count data. Economics Letters 101: 258–61. [Google Scholar] [CrossRef]
  30. Quoreshi, A. M. M. Shahiduzzaman. 2014. A Long Memory, Count Data, Time Series Model for Financial Application. Quantitative Finance 14: 2225–35. [Google Scholar] [CrossRef]
  31. Quoreshi, A. M. M. Shahiduzzaman. 2017. A bivariate integer-valued long-memory model for high-frequency financial count data. Communications in Statistics: Theory and Methods 46. [Google Scholar] [CrossRef]
  32. Ristic, Miroslav M., Yuvraj Sunecher, Naushad Ali Mamode Khan, and Vandna Jowaheer. 2018. A GQL-based Inference in non-stationary BINMA(1) Time Series. TEST, 1–13. [Google Scholar] [CrossRef]
  33. Rydberg, Tina Hviid, and Neil Shephard. 1999. BIN Models for Trade-by-Trade Data. Modelling the Number of Trades in a Fixed Interval of Time. Working Paper Series W23. Oxford: Nuffield College. [Google Scholar]
  34. Smith, Jeremy, Nick Taylor, and Sanjay Yadav. 1996. Comparing the bias and Misspecification in ARFIMA models. Journal of Time Series Analysis 18: 507–27. [Google Scholar] [CrossRef]
  35. Sunecher, Yuvraj, Naushad Mamode Khan, and Vandna Jowaheer. 2018. BINMA(1) Model with COM-Poisson Innovations: Estimation and Application. Communication in Statistics: Simulation and Computation, 1–22. [Google Scholar] [CrossRef]
  36. Weiss, Andrew A. 1986. Asymptotic theory for ARCH models: Estimation and testing. Econometric Theory 2: 107–31. [Google Scholar] [CrossRef]
  37. Working, Holbrook. 1953. Futures Trading and Hedging. American Economic Review 43: 314–43. [Google Scholar]
Figure 1. The time series of Ericsson B (mean 11.73 and variance 84.86) and AstraZeneca (mean 1.33 and variance 3.75) and their autocorrelation and partial-autocorrelation functions.
Figure 1. The time series of Ericsson B (mean 11.73 and variance 84.86) and AstraZeneca (mean 1.33 and variance 3.75) and their autocorrelation and partial-autocorrelation functions.
Jrfm 12 00074 g001
Table 1. The properties of the Conditional Least Square (CLS), Feasible Generalized Least Square (FGLS), and Two-Stage Quasi Maximum Likelihood (2SQML) estimators for finite-lag specifications, when data is generated according to Integer-valued Autoregressive Fractionally Integrated Moving Average (INARFIMA) (0,d,0) models with d   =   0.1 and lag (M) = 70.
Table 1. The properties of the Conditional Least Square (CLS), Feasible Generalized Least Square (FGLS), and Two-Stage Quasi Maximum Likelihood (2SQML) estimators for finite-lag specifications, when data is generated according to Integer-valued Autoregressive Fractionally Integrated Moving Average (INARFIMA) (0,d,0) models with d   =   0.1 and lag (M) = 70.
LagParameters T = 2000 and d = 0.1T = 10,000 and d = 0.1
CLSFGLSMLCLSFGLSML
M10 d ^ 0.1710.1710.1710.1670.1670.167
(s.e.)0.0010.0030.0030.0000.0010.001
BIAS0.0710.0710.0710.0670.0670.067
MSE8.2868.2868.2868.1178.1178.117
QLB100135.447135.447135.447207.628207.628207.628
QLB200210.680210.680210.680302.137302.137302.137
AIC4239.7854239.7854239.78520,943.29920,943.29920,943.299
SBIC4312.3954312.3954312.39521,033.61321,033.61321,033.613
M30 d ^ 0.1260.1260.1260.1230.1230.123
(s.e.)0.0010.0020.0020.0000.0010.001
BIAS0.0260.0260.0260.0230.0230.023
MSE8.2268.2268.2268.0348.0348.034
QLB100127.753127.753127.753129.387129.387129.387
QLB200200.223200.223200.223220.119220.119220.119
AIC4246.4084246.4084246.40820,867.82120,867.82120,867.821
SBIC4451.0364451.0364451.03621,122.34121,122.34121,122.341
M50 d ^ 0.1120.1120.1120.1090.1090.109
(s.e.)0.0010.0020.0020.0000.0010.001
BIAS0.0120.0120.0120.0090.0090.009
MSE8.1888.1888.1888.0188.0188.018
QLB100124.244124.244124.244110.770110.770110.770
QLB200197.235197.235197.235199.517199.517199.517
AIC4257.1684257.1684257.16820,868.72620,868.72620,868.726
SBIC4593.8144593.8144593.81421,287.45321,287.45321,287.453
M70 d ^ 0.1040.1040.1040.1010.1010.101
(s.e.)0.0010.0020.0020.0000.0010.001
BIAS0.0040.0040.0040.0010.0010.001
MSE8.1908.1908.1908.0168.0168.016
QLB100122.988122.988122.988104.384104.384104.384
QLB200195.961195.961195.961193.215193.215193.215
AIC4277.8884277.8884277.88820,886.42520,886.42520,886.425
SBIC4746.5524746.5524746.55221,469.35921,469.35921,469.359
M90 d ^ 0.0990.0990.0990.0960.0960.096
(s.e.)0.0010.0020.0020.0000.0010.001
BIAS−0.001−0.001−0.001−0.004−0.004−0.004
MSE8.1688.1688.1688.0218.0218.021
QLB100121.982121.982121.98299.53999.53999.539
QLB200192.315192.315192.315188.162188.162188.162
AIC4292.4174292.4174292.41720,911.71420,911.71420,911.714
SBIC4893.0994893.0994893.09921,658.85521,658.85521,658.855
Table 2. The properties of the CLS, FGLS, and 2SQML estimators for finite-lag specifications, when data is generated according to INARFIMA ( 0 , d , 0 ) models with d = 0.25 and lag (m) = 70.
Table 2. The properties of the CLS, FGLS, and 2SQML estimators for finite-lag specifications, when data is generated according to INARFIMA ( 0 , d , 0 ) models with d = 0.25 and lag (m) = 70.
LagParameters T = 2000 and d = 0.25T = 10,000 and d = 0.25
CLSFGLSMLCLSFGLSML
M10 d ^ 0.3970.3970.3970.4040.4040.404
(s.e.)0.0010.0040.0050.0000.0020.002
BIAS0.1470.1470.1470.1540.1540.154
MSE17.42817.42817.42817.46917.46917.469
QLB100252.360252.360252.360861.417861.418861.418
QLB200330.591330.592330.591951.603951.604951.603
AIC5705.8745705.8755705.87428,504.20828,504.21028,504.209
SBIC5778.4845778.4855778.48428,594.52228,594.52328,594.523
M30 d ^ 0.2920.2920.2920.2980.2980.298
(s.e.)0.0010.0030.0030.0000.0010.001
BIAS0.0420.0420.0420.0480.0480.048
MSE16.21916.21916.21916.20216.20216.202
QLB100173.555173.555173.555434.016434.016434.016
QLB200246.801246.801246.801511.413511.413511.413
AIC5599.7295599.7295599.72927,861.79527,861.79527,861.795
SBIC5804.3575804.3575804.35728,116.31528,116.31628,116.315
M50 d ^ 0.2600.2600.2600.2640.2640.264
(s.e.)0.0010.0030.0030.0000.0010.001
BIAS0.0100.0100.0100.0140.0140.014
MSE15.95115.95115.95115.95615.95615.956
QLB100152.446152.446152.446343.686343.686343.686
QLB200222.466222.466222.466418.824418.824418.824
AIC5588.6945588.6945588.69427,738.27127,738.27127,738.271
SBIC5925.3405925.3405925.34028,156.99828,156.99828,156.998
M70 d ^ 0.2420.2420.2420.2450.2450.245
(s.e.)0.0010.0030.0030.0000.0010.001
BIAS−0.008−0.008−0.008−0.005−0.005−0.005
MSE15.84515.84515.84515.84815.84815.848
QLB100136.373136.373136.373294.513294.513294.513
QLB200212.107212.107212.107370.136370.136370.136
AIC5596.1485596.1485596.14827,694.28827,694.28827,694.288
SBIC6064.8126064.8126064.81228,277.22228,277.22228,277.223
M90 d ^ 0.2300.2300.2300.2330.2330.233
(s.e.)0.0010.0030.0030.0000.0010.001
BIAS−0.020−0.020−0.020−0.017−0.017−0.017
MSE15.90915.90915.90915.80915.80915.809
QLB100132.433132.433132.433268.457268.457268.457
QLB200210.826210.826210.826343.640343.640343.640
AIC5624.6785624.6785624.67827,691.88427,691.88427,691.884
SBIC6225.3606225.3606225.36028,439.02528,439.02528,439.025
Table 3. The properties of the CLS, FGLS, and 2SQML estimators for finite-lag specifications, when data is generated according to INARFIMA ( 0 , d , 0 ) models with d = 0.4 and lag (m) = 70.
Table 3. The properties of the CLS, FGLS, and 2SQML estimators for finite-lag specifications, when data is generated according to INARFIMA ( 0 , d , 0 ) models with d = 0.4 and lag (m) = 70.
LagParametersT = 2000 and d = 0.4T = 10,000 and d = 0.4
CLSFGLSMLCLSFGLSML
M10 d ^ 0.5980.5980.5980.6050.6050.605
(s.e.)0.0010.0040.0050.0000.0020.002
BIAS0.1980.1980.1980.2050.2050.205
MSE41.79841.79841.79840.19740.19740.197
QLB100549.235549.235549.2351949.8791949.8781949.877
QLB200665.178665.177665.1772136.3782136.3772136.375
AIC7349.1477349.1477349.14636,337.15236,337.15136,337.149
SBIC7421.7577421.7577421.75636,427.46636,427.46536,427.463
M30 d ^ 0.4610.4610.4610.4630.46333.105
(s.e.)0.0010.0040.0040.0000.0020.331
BIAS0.0610.0610.0610.0630.0630.063
MSE34.03034.03034.03033.10533.10533.105
QLB100340.270340.270340.270964.972964.973964.972
QLB200436.152436.152436.1521108.8711108.8721108.871
AIC7064.1237064.1247064.12334,917.59734,917.59834,917.597
SBIC7268.7517268.7527268.75135,172.11735,172.11935,172.117
M50 d ^ 0.4100.4100.4100.4120.4120.412
(s.e.)0.0010.0040.0040.0000.0020.002
BIAS0.0100.0100.0100.0120.0120.012
MSE32.75832.75832.75831.69231.69231.692
QLB100301.374301.375301.374742.921742.922742.921
QLB200393.018393.018393.018878.651878.652878.651
AIC7017.8577017.8587017.85734,553.00834,553.01034,553.008
SBIC7354.5037354.5047354.50334,971.73634,971.73734,971.736
M70 d ^ 0.3820.3820.3820.3840.3840.384
(s.e.)0.0010.0040.0030.0000.0020.002
BIAS−0.018−0.018−0.018−0.016−0.016−0.016
MSE31.90731.90731.90731.15231.15231.152
QLB100275.290275.291275.290657.154657.154657.154
QLB200365.202365.202365.202789.378789.379789.378
AIC6988.4646988.4646988.46434,418.47534,418.47634,418.475
SBIC7457.1287457.1287457.12835,001.40935,001.41035,001.409
M90 d ^ 0.3640.3640.3640.3640.3640.364
(s.e.)0.0010.0030.0040.0000.0020.001
BIAS−0.036−0.036−0.036−0.036−0.036−0.036
MSE31.43031.43031.43030.82630.82630.826
QLB100255.701255.701255.701602.208602.208602.208
QLB200346.677346.677346.677733.237733.238733.237
AIC6980.7096980.7096980.70934,340.86234,340.86234,340.862
SBIC7581.3917581.3917581.39135,088.00235,088.00335,088.002
Table 4. CLS, FGLS, and 2SQML estimates for Ericsson and AstraZeneca.
Table 4. CLS, FGLS, and 2SQML estimates for Ericsson and AstraZeneca.
EricssonAstraZeneca
CLSFGLSMLCLSFGLSML
d ^ 0.3240.3240.3240.2040.2040.204
(s.e)0.0080.0080.0080.0120.0120.011
λ ^ 2.6252.6252.6250.5070.5070.507
(s.e)0.0910.0920.0880.0270.0270.026
Var--54.722--3.303
(s.e.)--1.809--0.151
AIC48,010.16048,010.16048,010.16014,433.56514,433.56514,433.565
SBIC48,534.80248,534.80248,534.80214,958.20714,958.20714,958.207
QLB100335.111335.111335.111235.586235.586235.587
QLB200422.191422.191422.191352.309352.309352.309
MSE54.72254.72254.7223.3033.3033.303

Share and Cite

MDPI and ACS Style

Quoreshi, A.M.M.S.; Uddin, R.; Mamode Khan, N. Quasi-Maximum Likelihood Estimation for Long Memory Stock Transaction Data—Under Conditional Heteroskedasticity Framework. J. Risk Financial Manag. 2019, 12, 74. https://doi.org/10.3390/jrfm12020074

AMA Style

Quoreshi AMMS, Uddin R, Mamode Khan N. Quasi-Maximum Likelihood Estimation for Long Memory Stock Transaction Data—Under Conditional Heteroskedasticity Framework. Journal of Risk and Financial Management. 2019; 12(2):74. https://doi.org/10.3390/jrfm12020074

Chicago/Turabian Style

Quoreshi, A. M. M. Shahiduzzaman, Reaz Uddin, and Naushad Mamode Khan. 2019. "Quasi-Maximum Likelihood Estimation for Long Memory Stock Transaction Data—Under Conditional Heteroskedasticity Framework" Journal of Risk and Financial Management 12, no. 2: 74. https://doi.org/10.3390/jrfm12020074

Article Metrics

Back to TopTop