Abstract
This study considers the goodness of fit test for a class of conditionally heteroscedastic location-scale time series models. For this task, we develop an entropy-type goodness of fit test based on residuals. To examine the asymptotic behavior of the test, we first investigate the asymptotic property of the residual empirical process and then derive the limiting null distribution of the entropy test.
1. Introduction
In this study, we consider the goodness of fit (GOF) test on the innovations of location-scale time series models with heteroscedasticity. These models accommodate a broad class of financial time series models (see Noh and Lee [1] and Kim and Lee [2]). Correct information on the innovation distribution is considerably important in analyzing time series. For example, in the parameter estimation, one conventionally uses the Gaussian quasi-maximum likelihood estimator (QMLE), which undermines the accuracy of estimation when their innovation distributions are deviated far from the normal distribution. To overcome this difficulty, a different likelihood function has been considered as an alternative—see Lee and Lee [3] who use a family of normal mixtures, and Lee and Kim [4] who use asymmetric skew t distribution (ASTD) and asymmetric exponential power distribution (AEPD) families. The family of normal and Student’s t distributions has been widely used in the literature—see Hansen [5], who uses a skew Student’s t distribution in generalized autoregressive conditionally heteroscedastic (GARCH)-type models (Bollerslev [6]), and also the papers cited in Kim and Lee [2,7].
The GOF test has a long history, and has been playing a central role in matching given data sets with the best-fitted distribution families (see D’Agostino and Stephens [8] for a review). Among the GOF tests, the empirical process-based GOF test has long been popular because the classical Kolmogorov–Smirnov and Cramér–von Mises tests can be generated from the empirical process. Recently, Lee, Vonta and Karagrigoriou [9] proposed an entropy-based GOF test and demonstrated that it outperforms the classical tests in various situations. Lee, Lee and Park [10] and Lee and Oh [11] later applied the entropy test to GARCH-type models, and all confirmed its validity empirically. Further, Lee and Kim [4] used the entropy test for iid random variables following ASTD and AEPD families to demonstrate that ASTD accommodates AEPD to a greater degree than the other way around. Although the asymptotic theorems for the entropy test are established for GARCH models (Lee, Lee and Park [10]), those are not yet attempted in general location-scale time series models. Motivated by this, we are led to investigate the asymptotic behavior of the residual empirical process from the location-scale model and then verify the limiting null distribution of the entropy test—see Durbin [12], Lee and Wei [13], and Lee and Taniguchi [14] for relevant references.
2. Entropy Test for Location-Scale Models
2.1. Main Result
Let us consider the conditional location-scale model:
where and are measurable functions, with compact subsets and , and , where denotes the true model parameter belonging to ; is a sequence of iid random variables with zero mean and unit variance. In what follows, we assume that is strictly stationary and ergodic and that is independent of past observations for . In this section, we consider the entropy-based GOF test proposed by Lee, Vonta and Karagrigoriou [9] for the location-scale models in (1). To this end, we set up the hypotheses:
where denotes the innovation distribution of the model and can be any family of distributions.
To carry out the test, inspired by Rosenblatt [15], we check whether the transformed random variables follow a uniform distribution on , say, , where and are the true parameters. Since the parameters are unknown, by replacing those with their estimates, we check the departure from based on with , where and with : see Francq and Zakoian [16], who take this approach of using initial values for GARCH models.
The entropy-based GOF test is constructed based on the Boltzmann–Shannon entropy defined by
for any density function f. It is noteworthy that the actually measures the distance between a distribution with density f and the uniform distribution. Lee, Vonta and Karagrigoriou [9] construct a GOF test using an approximation form of the integral in (3). For any distribution F, we introduce
where the ’s are weights with and , m is the number of disjoint intervals for partitioning the data range, and are preassigned partition points. Note that the argument in (4) is a good approximation of that in (3) when are all equal to 1—see Section 2.1 of Lee, Vonta and Karagrigoriou [9], and also their Remark 1 concerning the role of weights w.
Further, we define the residual empirical process:
with , where is any consistent estimator of under the null; for example, the maximum likelihood estimator (MLE). We then define the entropy test by .
To derive the null limiting distribution of the entropy test, we impose the regularity conditions as follows:
- (C1)
- (i) For some random variable V and constant , for all ;(ii) For some random variable V and constant , for all .
- (C2)
- (i) For all , and are differentiable in and on some neighborhoods of and of ;(ii) There exists a random variable V and constant such that for all , and .
- (C3)
- (i) For all , and are twice differentiable in and , where and are the ones in (C2)(i);(ii) and ;(iii) and
- (C4)
- (i) is continuous and has a positive density ;(ii) and are uniformly continuous on ;(iii) For some , , , and .
- (C5)
- is twice continuously differentiable with respect to and there exists such that and is uniformly continuous on
- (C6)
- Under the null, and .
Remark 1.
The above conditions can be found in Kim and Lee [2]. They show that a class of GARCH and TGARCH models with ASTD and AEPD innovations satisfy the regularity conditions and the MLE is asymptotically normal.
Below is the main result of this section: see the proof in Section 2.2.
Theorem 1.
Under (C1)∼(C6), we have
where with and
Moreover, we are led to the following result, the detailed proof of which is omitted for brevity because it is essentially the same as that of Lee, Vonta and Karagrigoriou [9] and Lee, Lee and Park [10].
Theorem 2.
Suppose that the assumptions in Theorem 1 hold. Then, under , if as , we have that for all large m, as ,
where is any finite subset of the class of all weights and is the Brownian bridge on .
Here, the symbol as indicates that the limiting distribution of is approximately the same as the distribution of A as n tends to ∞. More precisely, we can write , where as and for all .
Remark 2.
As seen in the proof of Theorem 2 of Lee, Lee and Park [10], one can easily check that owing to Theorem 1, under the null,
wherein the term: becomes negligible as n tends to infinity when m is large. This yields Theorem 2.
2.2. Proof of Theorem 1
We reexpress as follows:
where
Since owing to Lemma 1 below, we handle the two terms and . Let
and let be a sequence of positive integer numbers with and as . We express , where
for some between x and . By Taylor’s theorem, we can express
with
for some between and . Then, owing to (C1)(i) and (C4)(iii),
and due to the ergodic theorem, Lemma 4 of Amemiya [17], (C3)(iii), (C4)(iii), and (C6), we get , so that
Similarly, it can be easily seen that
Next, we analyze . Owing to the ergodic theorem, Lemma 4 of Amemiya [17], , , , and , we have
for some , which is no more than
where is an intermediate point between and .
Meanwhile, since and , we can find (large) , such that on the event , , with probability tending to 1,
Hence, owing to the ergodic theorem, Lemma 4 of Amemiya [17], , , , and , we can have that on and for ,
for some and intermediate vector between and , which is negligible. Because for , , it holds that , which together with (10) and (11) indicates
Since owing to and , we establish the theorem.
Lemma 1.
Under the assumptions in Theorem 1, we have
Proof of Lemma 1.
Due to , for any , there exists such that , where is a compact neighborhood of with for all . For a positive real number , we partition into a finite number, say, of subsets with diameter less than . Set
Let be an integer such that , where and is the largest integer that does not exceed x. We divide the interval into parts by the points with .
Then, for any points in , we have
Putting , for , we can express
with
and and are the same as and , with and replaced by and , , respectively.
To show , we verify that , . Below, we only provide the proof for the cases of , since the cases of can be handled similarly.
We first deal with . By the mean value theorem, we can see that is no more than
with
Note that the term in (16) is due to Lemma 1, and
where is a real number between and . Using an argument similar to that in (12), we can see that , which can be made arbitrarily small by taking sufficiently small . Hence, we get .
Next, because , we can write
Hence, it remains to show that
Put
and . Note that forms an array of martingale differences. Then, we get
and further, applying Rosenthal’s inequality (Hall and Heyde [18], p. 23),
Lemma 2.
Under the assumptions in Theorem 1, for every , we have
where .
Proof of Lemma 2.
The lemma can be proven by using (C2)∼(C4) and the second-order Taylor’s expansion theorem centered at x and y. We omit the details for brevity. ☐
3. Discussion
In implementation, following the idea of Lee, Vonta and Karagrigoriou [9] and Lee, Lee and Park [10], we generate independent and identically distributed (i.i.d.) r.v.s , , from , where J is a large integer (e.g., 1000), and then use and to apply the test:
The choice of m could be an important issue because the test performance might be sensitive to m. Here, we use because this has produced reasonably good results in our previous studies. The critical values could be obtained through Monte Carlo simulations as follows:
- (i)
- From the data , estimate and by suitable estimators and ; for example, MLE (Kim and Lee [2]).
- (ii)
- Generate from and using the equation: . Then, obtain , denoted by , with the preassigned m in (21) based on these random variables.
- (iii)
- Repeat the above procedure B times and calculate the percentile of the obtained B number of values.
- (iv)
- Reject if the value of obtained from the original observations is larger than the obtained percentile in (iii).
The good performance of the entropy test for GARCH-type models can be seen in our previous works: Lee, Lee and Park [10], Lee and Oh [11], and Lee and Kim [4]. However, more refined empirical studies are required to see the performance of the above procedure in various location-scale models. Meanwhile, verifying the weak consistency of the can be an important issue. The proof would be similar to that in Lee and Kim [4], which, however, needs much more careful analysis. All these issues are worth further investigation and are left as our future project.
4. Conclusions
In this study, we considered the entropy-based test for location-scale time series models and showed that it converges weakly to a functional of a Brownian bridge. As mentioned earlier, the bootstrap test in this setting deserves special attention owing to its importance in implementation. Furthermore, a modification of the entropy test based on integrated distributions is worth further investigation. We leave these issues to our future project.
Acknowledgments
We would like to thank the referees for their careful reading and valuable comments that improve the quality of the paper. This research is supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT and future Planning (No. 2015R1A2A2A010003894).
Author Contributions
Sangyeol Lee and Minjo Kim conceived the topic, designed analysis tools and wrote the paper. Both authors have read and approved the final manuscript.
Conflicts of Interest
The authors declare no conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| GOF | goodness of fit test |
| GARCH | generalized autoregressive conditionally heteroscedastic |
| QMLE | quasi-maximum likelihood estimator |
| ASTD | asymmetric skew t distribution |
| AEPD | asymmetric exponential power distribution |
References
- Noh, J.; Lee, S. Quantile regression for location-scale time series models with conditional hetroscedasticity. Scand. J. Stat. 2015, 43, 700–720. [Google Scholar] [CrossRef]
- Kim, M.; Lee, S. Inferences for conditionally heteroscedastic location-scale time series models with ASTD and AEPD innovations. Unpublished work. 2016. [Google Scholar]
- Lee, T.; Lee, S. Normal mixture quasi maximum likelihood estimator for GARCH models. Scand. J. Stat. 2009, 36, 157–170. [Google Scholar] [CrossRef]
- Lee, S.; Kim, M. On entropy based goodness of fit test for asymmetric student-t and exponential power distributions. J. Stat. Comput. Simul. 2017, 87, 187–197. [Google Scholar] [CrossRef]
- Hansen, B.E. Autoregressiveconditional density estimation. Int. Econ. Rev. (Philadelphia) 1994, 35, 705–730. [Google Scholar] [CrossRef]
- Bollerslev, T. Generalized autoregressive conditional heteroskedasticity. J. Econ. 1986, 31, 307–327. [Google Scholar] [CrossRef]
- Kim, M.; Lee, S. Comparison study of semiparametric methods to estimate conditional VaR and ES. Korean J. Appl. Stat. 2016, 29, 171–180. (In Korean) [Google Scholar] [CrossRef]
- D’Agostino, R.B.; Stephens, M.A. Goodness-of-Fit Techniques; Marcel Dekker, Inc.: New York, NY, USA, 1986. [Google Scholar]
- Lee, S.; Vonta, I.; Karagrigoriou, A. A maximum entropy type test of fit. Comput. Stat. Data Anal. 2011, 55, 2635–2643. [Google Scholar] [CrossRef]
- Lee, J.; Lee, S.; Park, S. Maximum entropy test for GARCH models. Stat. Method 2015, 22, 8–16. [Google Scholar] [CrossRef]
- Lee, S.; Oh, H. Entropy test and residual empirical process for autoregressive conditional duration models. Comput. Stat. Data Anal. 2015, 86, 1–12. [Google Scholar] [CrossRef]
- Durbin, J. Weak convergence of the sample distribution function when parameters are estimated. Ann. Stat. 1973, 1, 279–290. [Google Scholar] [CrossRef]
- Lee, S.; Wei, C.Z. On residual empirical processes of stochastic regression models with applications to time series. Ann. Stat. 1999, 27, 237–261. [Google Scholar]
- Lee, S.; Taniguchi, M. Asymptotic theory for ARCH-SM models: LAN and residual empirical process. Stat. Sin. 2005, 15, 215–234. [Google Scholar]
- Rosenblatt, M. Remarks on a multivaraite transformation. Ann. Math. Stat. 1952, 23, 470–472. [Google Scholar] [CrossRef]
- Francq, C.; Zakoïan, J.M. Maximum likelihood estimation of pure GARCH and ARMA-GARCH processes. Bernoulli 2004, 10, 605–637. [Google Scholar] [CrossRef]
- Amemiya, T. Regression analysis when the dependent variable is truncated normal. Econometrica 1973, 41, 997–1016. [Google Scholar] [CrossRef]
- Hall, P.; Heyde., C.C. Martingale Limit Theory and Its Application; Academic Press: New York, NY, USA, 1980. [Google Scholar]
© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).