Next Article in Journal
Climate Disaster Risks—Empirics and a Multi-Phase Dynamic Model
Previous Article in Journal
The Discovery of Long-Run Causal Order: A Preliminary Investigation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Maximum Likelihood Estimation for the Fractional Vasicek Model

1
Faculty of Economics, Gakushuin University, Tokyo 171-8588, Japan
2
School of Management, Zhejiang University, Hangzhou 310058, China
3
School of Economics and Lee Kong Chian Schoo of Business, Singapore Management University, Singapore 178903, Singapore
*
Author to whom correspondence should be addressed.
We would like to thank three referees for helpful comments. Jun Yu’s homepage http://www.mysmu.edu/faculty/yujun/.
Econometrics 2020, 8(3), 32; https://doi.org/10.3390/econometrics8030032
Submission received: 9 November 2019 / Revised: 7 August 2020 / Accepted: 7 August 2020 / Published: 12 August 2020

Abstract

:
This paper estimates the drift parameters in the fractional Vasicek model from a continuous record of observations via maximum likelihood (ML). The asymptotic theory for the ML estimates (MLE) is established in the stationary case, the explosive case, and the boundary case for the entire range of the Hurst parameter, providing a complete treatment of asymptotic analysis. It is shown that changing the sign of the persistence parameter changes the asymptotic theory for the MLE, including the rate of convergence and the limiting distribution. It is also found that the asymptotic theory depends on the value of the Hurst parameter.

1. Introduction

Since Vasicek (1977) introduced a model to describe the evolution of short-term interest rates, the so-called Vasicek model has enjoyed a wide range of applications. Jamshidian (1989) used it to price bond options. Scott (1987) used it to model the evolution of the instantaneous volatility of stock price and to price European call options.
Many extensions have been made to generalize the specification of Vasicek. For example, motivated by the phenomenon of long-range dependence found in the data of hydrology, geophysics, climatology, telecommunication, economics, and finance, the Brownian motion in the Vasicek model has been replaced by a fractional Brownian motion (fBm), leading to the following fractional Vasicek model (fVm):
d X t = κ μ X t d t + σ d B t H ,
where σ is a positive constant, μ , κ R , B t H is an fBm with H ( 0 , 1 ) being the Hurst parameter, and X 0 = o p ( T ) with o p defined at the end of this section. An fBm B t H is a zero mean Gaussian process, defined on a complete probability space ( Ω , F , P ) , with the following covariance function:
E ( B t H B s H ) = 1 2 | t | 2 H + | s | 2 H | t s | 2 H .
The process B t H is self-similar in the sense that a R + , B a t H = d a H B t H . It becomes the standard Brownian motion W t when H = 1 / 2 and can be represented as a stochastic integral with respect to the standard Brownian motion. It is negatively correlated when 0 < H < 1 / 2 . When 1 / 2 < H < 1 , it has long-range dependence in the sense that n = 1 E B 1 H ( B n + 1 H B n H ) = . In this case, the positive (negative) increments are likely to be followed by positive (negative) increments. The parameter H, which is also called the self-similarity parameter, measures the intensity of the long-range dependence.
Parameter κ is often referred to as the persistence parameter. When κ > 0 , X t is stationary and ergodic. In this case, μ is the unconditional mean of X t and κ is the mean-reversion parameter. When κ < 0 , X t is explosive and hence non-ergodic. When κ = 0 , X t is the boundary case, and the drift term κ μ X t d t disappears. Therefore, μ is superfluous in this case. The ergodic fVm was used to model the evolution of instantaneous volatility in Comte and Renault (1998), the evolution of quadratic variation in Aït-Sahalia and Mancini (2008), and the evolution of realized variance in Gatheral et al. (2018). The explosive fVm was used to model the NASDAQ index in Lui et al. (2020). The explosive OU was used to model the log real estate price in Chen et al. (2017).
An alternative to and perhaps slightly more general specification than Model (1) is:
d X t = α κ X t d t + σ d B t H .
In Model (3), even when κ = 0 , the drift term does not vanish, and it is α d t . This alternative specification for the drift term was used in Chan et al. (1992) and Yu and Phillips (2001). When α in (3) is known (without loss of generality, it is assumed to be zero), (3) becomes the fractional Ornstein–Uhlenbeck (fOU) process. A unique path-wise solution to the stochastic differential equation in (3) is:
X t = e κ t X 0 + α κ 1 e κ t + σ 0 t e κ ( t s ) d B s H ,
where the stochastic integral, 0 t e κ ( t s ) d B s H , is the path-wise Riemann–Stieltjes integral, and the solution is unique (Proposition A.1 in Cheridito et al. 2003).
Assuming that a continuous record of observations is available for X t with t [ 0 , T ] , a number of studies have introduced methods to estimate κ and α (or μ ) and developed asymptotic distributions for the proposed estimators under the scheme of T .1 When H > 1 / 2 and κ > 0 , borrowing the idea of Hu and Nualart (2010) and Hu et al. (2019), Xiao and Yu (2019a) considered the ergodic-type estimates of κ and μ . Xiao and Yu (2019b) extended the results of Xiao and Yu (2019a) from the case where H ( 1 / 2 , 1 ) to where H ( 0 , 1 / 2 ) . Assuming κ > 0 and H > 1 / 2 , Nourdin and Tran (2019) extended the results of Xiao and Yu (2019a, 2019b) to a model where the fBm is replaced with a Hermite process. Using the Malliavin calculus, Es-Sebaiy and Viens (2019) studied the estimation problem for the drift parameter for some stochastic differential equations driven by fBm. Lohvinenko and Ralchenko (2017) considered the maximum likelihood (ML) estimates of κ and α when κ > 0 and H ( 1 / 2 , 1 ) . Moreover, Lohvinenko and Ralchenko (2019) considered the maximum likelihood (ML) estimates of κ and α when κ < 0 and H ( 1 / 2 , 1 ) .
Our paper also focuses on the MLE of κ and α . We aim to develop the asymptotic distributions for the MLE of κ and α under the following scenarios: (1) κ > 0 and H ( 0 , 1 / 2 ] ; (2) κ = 0 and H ( 0 , 1 ) ; (3) κ < 0 and H ( 0 , 1 ) . Therefore, together with Lohvinenko and Ralchenko (2017), a complete coverage of asymptotic theory for all possible cases is provided to the MLE of κ and α .
Other estimation methods and alternative sampling schemes are possible. A recent study by Ng and Wirjanto (2019) investigated the bias property of the least squares estimator (LS) of κ based on discrete-sampled data. It is shown that the bias depends on the Hurst parameter and the true value of κ . While the assumption of a continuous-time record is practically too strong, it allows us to obtain the ML estimates in closed-form. Moreover, the results obtained in our paper will serve as the benchmark for those based on discrete-sampled data.
Several recent applications of Model (1) can be found in economics and finance. Gatheral et al. (2018) assumed κ = 0 and found the evidence of H < 1 / 2 in the log realized volatility (RV) of a DAX contract, the Bund futures contract, the S&P 500 index, and the NASDAQ index. Bennedsen et al. (2017) documented the evidence of H < 1 / 2 in the log RV of a large number of U.S. equities. Wang et al. (2019) found the evidence of κ > 0 and H < 1 / 2 in the log RV, the realized kernel and bipower variation of the S&P500, NASDAQ, and DJIA, and in the log RV of three exchange rates. Fukasawa et al. (2019) found the log RV of several stock indices indicating that H is smaller than 0 . 1 . Lui et al. (2020) reported the evidence of κ < 0 and H < 1 / 2 in the S&P500 in the 1990s. Unfortunately, none of these empirical studies used the ML method to estimate κ .
The rest of the paper is organized as follows. Section 2 introduces the MLE of κ and α . Section 3 is devoted to the asymptotic theory for the stationary case (i.e., κ > 0 ), but H ( 0 , 1 / 2 ] . Section 4 studies the asymptotic properties of the MLE in the boundary case (i.e., κ = 0 ) and for the entire range for the Hurst parameter H ( 0 , 1 ) . In Section 5, we establish the asymptotic behaviors of the MLE for the non-ergodic case (i.e., κ < 0 ) and for the entire range for the Hurst parameter H ( 0 , 1 ) . Section 6 contains some concluding remarks and gives directions for further research. All the proofs are collected in Appendix A.
We use the following notations throughout the paper: O, o, O p , o p , p , a . s , d , and ∼ denote the same order, the smaller order, the same order in probability, the smaller order in probability, convergence in probability, convergence almost surely, convergence in the distribution, and asymptotic equivalence, respectively, as T . Throughout this paper, the constant C only depends on H, whose values can differ at different places.

2. ML Estimation

Following Kleptsyna et al. (2000), by applying the Girsanov theorem for the fBm developed in Norros et al. (1999), one can get the expression for the continuous-record log-likelihood function for Model (3) as follows:
( κ , α ) = 0 T Q H ( t ) d M t H + 1 2 0 T Q H ( t ) 2 d ω t H ,
where:
Q H t = 1 σ d d ω t H 0 t k H t , s α κ X s d s ,
k H t , s = 1 k H s t s 1 2 H , k H = 2 H Γ 3 2 H Γ H + 1 2 ,
ω t H = 1 λ H t 2 2 H ,
λ H = 2 H Γ 3 2 H Γ H + 1 2 Γ 3 2 H ,
M t H = 0 t k H ( t , s ) d B s H .
Taking the derivatives of the log-likelihood function with respect to κ and α and setting them to zero, Lohvinenko and Ralchenko (2017) obtained the following expressions for the MLE of α and κ :
α ˜ T = S T 0 T P H 2 t d ω t H 0 T P H t d S t 0 T P H t d ω t H ω T H 0 T P H 2 t d ω t H 0 T P H t d ω t H 2 σ ,
κ ˜ T = S T 0 T P H t d ω t H ω T H 0 T P H t d S t ω T H 0 T P H 2 t d ω t H 0 T P H t d ω t H 2 ,
where:
S t = 1 σ 0 t k H t , s d X s ,
P H t = 1 σ d d ω t H 0 t k H t , s X s d s ,
Combining (3) and (6) with (13), we deduce that:
P H t = 1 σ α κ + 1 σ X 0 α κ V H t + P ˜ H t ,
where:
V H t = d d ω t H 0 t k H t , s e κ s d s ,
P ˜ H t = d d ω t H 0 t k H t , s U s d s ,
U t = 0 t e κ t s d B s H .
Using the idea of Kleptsyna and Le Breton (2002), Lohvinenko and Ralchenko (2017) obtained the following results:
Q H t = α σ κ P H t ,
S t = 0 t Q H s d ω s H + M t H = α σ ω t H κ 0 t P H s d ω s H + M t H ,
d S t = α σ d ω t H κ P H t d ω t H + d M t H .
The process M t H , the so-called fundamental martingale, is a Gaussian martingale with the variance function being ω t H . Moreover, the natural filtration of the martingale M H coincides with the natural filtration of the fBm. Based on (19) and (20), the MLE of α and κ can be represented as:
α ˜ T = α + M T H 0 T P H 2 t d ω t H 0 T P H t d M t H 0 T P H t d ω t H ω T H 0 T P H 2 t d ω t H 0 T P H t d ω t H 2 σ ,
κ ˜ T = κ + M T H 0 T P H t d ω t H ω T H 0 T P H t d M t H ω T H 0 T P H 2 t d ω t H 0 T P H t d ω t H 2 ,
When a continuous record of observations of X t is available, Lohvinenko and Ralchenko (2017) studied the consistency and the asymptotic normality of the MLE defined by (10) and (11) when H > 1 / 2 and κ > 0 . The goal of the present paper is to establish the asymptotic theory for the MLE of α and κ for all the other cases, including H < 1 / 2 and κ > 0 , H ( 0 , 1 ) and κ = 0 , and H ( 0 , 1 ) and κ < 0 .

3. Asymptotic Theory When κ > 0

In this section, inspired by Lohvinenko and Ralchenko (2017), we extend the asymptotic properties of α ˜ T and κ ˜ T from the case of H ( 1 / 2 , 1 ) to the case of H ( 0 , 1 / 2 ] . For the sake of comparison, we first introduce the main result of Lohvinenko and Ralchenko (2017). When H > 1 / 2 , Lohvinenko and Ralchenko (2017) obtained the asymptotic normality for the MLE of α and κ , i.e.,
T 1 H α ˜ T α d N 0 , λ H σ 2 ,
T κ ˜ T κ d N 0 , 2 κ .
Remark 1.
We can use the ergodic property to estimate α and κ (denoted by α ^ H N and κ ^ H N , respectively). Following the idea of Equation (1.3) in Nourdin and Tran (2019) or Equations (2.9) and (2.10) in Xiao and Yu (2019a), we can easily obtain the following results by the ergodic theorem:
1 T 0 T X t d t a . s . α κ , 1 T 0 T X t 2 d t a . s . σ 2 κ 2 H H Γ 2 H + 1 T 0 T X t d t 2 .
Solving these two equations for α and κ, we obtain the ergodic-type estimators of κ and μ as:
α ^ H N = 1 T 0 T X t d t 1 T 0 T X t 2 d t 1 T 0 T X t d t 2 σ 2 H Γ 2 H 1 / 2 H ,
κ ^ H N = 1 T 0 T X t 2 d t 1 T 0 T X t d t 2 σ 2 H Γ 2 H 1 / 2 H .
with H > 1 / 2 .
Using Theorem 1.3 in Nourdin and Tran (2019) or Theorem 3.3 in Xiao and Yu (2019a), we can easily obtain:
T 1 H α ^ H N α d N 0 , σ 2 ,
for 1 / 2 < H < 1 and:
T κ ^ H N κ d N 0 , κ ϕ H ,
for 1 / 2 < H < 3 / 4 and ϕ H = 4 H 1 4 H 2 1 + Γ 3 4 H Γ 4 H 1 Γ 2 2 H Γ 2 H .
Similarly, using Theorem 1.3 in Nourdin and Tran (2019) or Theorem 3.3 in Xiao and Yu (2019a), we can obtain:
T log ( T ) κ ^ H N κ d N 0 , 16 κ 9 π ,
for H = 3 / 4 .
Using Theorem 1.3 in Nourdin and Tran (2019) or Theorem 3.3 in Xiao and Yu (2019a), again, for H ( 3 / 4 , 1 ) , we have:
T 2 2 H κ ^ H N κ L κ 2 H 1 H Γ ( 2 H + 1 ) R ( H ) ,
where R ( H ) is the Rosenblatt distribution with E R H = 2 H 2 2 H 1 / 4 H 3 .
Comparing (23) with (27), we can see that the convergence rate of α ˜ T is the same as that of α ^ H N . However, α ˜ T is more efficient than α ^ H N because of λ H < 1 for H ( 1 / 2 , 1 ) . Similarly, comparing (24) with (28), we can see that the convergence rate of κ ˜ T is the same as that of κ ^ H N when H ( 1 / 2 , 3 / 4 ) . In this case, the asymptotic variance of κ ^ H N depends on H while the asymptotic variance of κ ˜ T is always 2 κ . Since ϕ H > 2 , κ ˜ T is more efficient than κ ^ H N . When H = 3 / 4 , the convergence rate of κ ^ H N is slower than that of κ ˜ T . Let us also mention that κ ^ H N is asymptotically more efficient than the LS estimator of κ when H ( 1 / 2 , 1 ) ; see Xiao and Yu (2019a) for details about the LS estimator of κ. When H = 1 / 2 , κ ^ H N and the LS estimator of κ are asymptotically equivalent. For more comparison of the LS estimator and MLE, see Tanaka (2020).
The objective of this section is to obtain the consistency and the asymptotic normality of α ˜ T and κ ˜ T when H ( 0 , 1 / 2 ] . Since the asymptotic laws of α ˜ T are different when H ( 0 , 1 / 2 ) from those when H = 1 / 2 , we need to treat them separately.

3.1. Asymptotic Theory When H ( 0 , 1 / 2 )

Before presenting the asymptotic properties of α ˜ T and κ ˜ T for H ( 0 , 1 / 2 ) , we first state a useful technical lemma.
Lemma 1.
For κ > 0 and H 0 , 1 in Model (3), as T , we have:
V H T = O T H 3 2 ,
0 T V H t d ω t H = O T 1 2 H ,
0 T P ˜ H t d M t H = O p T ,
0 T P ˜ H 2 t d ω t H = O p T ,
0 T V H 2 t d ω t H = O 1 ,
0 T P ˜ H t d ω t H = O p T 1 H ,
0 T V H t P ˜ H t d ω t H = O p T
0 T V H t d M t H = O p 1 .
We can now describe the asymptotic laws of α ˜ T and κ ˜ T as T .
Theorem 1.
For κ > 0 and H 0 , 1 / 2 in Model (3), as T , we have:
T α ˜ T α d N 0 , 2 α 2 κ ,
T κ ˜ T κ d N 0 , 2 κ .
Remark 2.
Comparing the asymptotic theory with that obtained in Lohvinenko and Ralchenko (2017), the asymptotic normality continues to hold for both estimators. Moreover, comparing (39) with (24), we can see that the asymptotic theory for κ ˜ T is the same regardless of H 0 , 1 / 2 or H 1 / 2 , 1 . Comparing (38) with (23), we can see that the asymptotic variance of α ˜ T depends on H. The asymptotic variance is λ H σ 2 with the consistency order T 1 H if H 1 / 2 , 1 , whereas it does not depend on H with the consistency order T as T becomes large if H 0 , 1 / 2 .
Remark 3.
The asymptotic theory for the MLE of κ in the fOU when H 0 , 1 / 2 has been developed in the literature; see, for example, Theorem 2 in Brouste and Kleptsyna (2010). It is the same as in (39). Therefore, having to estimate an additional parameter α, there is no efficiency loss in estimating κ asymptotically.
Remark 4.
Following Hu et al. (2019), Xiao and Yu (2019b) considered the ergodic-type estimate of κ defined in (25) when 0 < H < 1 / 2 and showed that:
T κ ^ H N κ d N 0 , κ δ H N 2 ,
where δ H N 2 = 1 4 H 2 4 H 1 + 2 Γ 2 4 H Γ 4 H Γ 2 H Γ 1 2 H . Figure 1 compares the asymptotic variances of κ ˜ T and κ ^ H N by plotting δ H N 2 against twofor H ( 0 , 1 / 2 ) . It can be seen that κ ˜ T is more efficient than κ ^ H N . The smaller H is, the larger the relative asymptotic efficiency in κ ˜ T . The two estimators have the same asymptotic variance when H = 1 / 2 .
Remark 5.
Using the Taylor expansion and Theorem 3.2 of Xiao and Yu (2019b), we can easily obtain that for 0 < H < 1 / 2 :
T α ^ H N α d N 0 , α 2 κ δ H N 2 .
Comparing (38) with (41), we can see that α ˜ T and α ^ H N share the same convergence rate. However, α ˜ T is more efficient than α ^ H N because δ H N 2 < 2 for H ( 0 , 1 / 2 ) .

3.2. Asymptotic Theory When H = 1 / 2

When H = 1 / 2 , B t 1 / 2 = W t , which is a standard Brownian motion, and the fVm becomes the standard Vasicek model. In this case, it can be shown that fundamental martingale M t H becomes a standard Brownian motion. Consequently, the MLE reduces to the LS estimates and can be rewritten as:
α ˜ T = X T 0 T X t 2 d t 0 T X t d t 0 T X t d X t T 0 T X t 2 d t 0 T X t d t 2 ,
κ ˜ T = X T 0 T X t d t T 0 T X t d X t T 0 T X t 2 d t 0 T X t d t 2 ,
where the stochastic integration 0 T X t d X t is interpreted as an Itô integral. The asymptotic theory in (42) and (43) has been studied in the literature; see, for example, Kubilius et al. (2018) and Kutoyants (2004). From page 64 of Kutoyants (2004), we can easily show that α ˜ T and κ ˜ T are consistent and asymptotically normally distributed. Hence, we only provide the asymptotic laws of α ˜ T and κ ˜ T for H = 1 / 2 without proof.
Theorem 2.
For κ > 0 and H = 1 / 2 in Model (3), as T , we have:
T α ˜ T α d N 0 , σ 2 + 2 α 2 κ ,
T κ ˜ T κ d N 0 , 2 κ .
Remark 6.
When α 0 , we can summarize the three sets of asymptotic theory for the MLE of α as follows:
If H ( 0 , 1 / 2 ) , T α ˜ T α d N 0 , 2 α 2 κ , If H = 1 / 2 , T α ˜ T α d N 0 , σ 2 + 2 α 2 κ , If H ( 1 / 2 , 1 ) , T 1 H α ˜ T α d N 0 , λ H σ 2 ,
where the last asymptotic theory was obtained in Theorem 3.4 of Lohvinenko and Ralchenko (2017). While the three sets of asymptotic theory for κ ˜ T are identical, the three sets of asymptotic theory for α ˜ T are different. When H changes from a value in ( 0 , 1 / 2 ) to 1 / 2 , while the rate of convergence stays the same (i.e., T ), the asymptotic variance changes from 2 α 2 κ to σ 2 + 2 α 2 κ . When H changes from a value in ( 0 , 1 / 2 ] to ( 1 / 2 , 1 ) , both the rate of convergence and the asymptotic variance change.
Remark 7.
When α is known and assumed to be zero and H = 1 / 2 , the asymptotic theory for the MLE of κ was obtained in Brown and Hewitt (1975) and in Feigin (1976). The two sets of asymptotic theory are the same, suggesting that there is no efficiency loss in estimating κ when α is estimated or not.

4. Asymptotic Theory When κ = 0

In this section, we consider the asymptotic laws of α ˜ T and κ ˜ T for the entire range for the Hurst parameter, i.e., H ( 0 , 1 ) . Note that when κ = 0 , we have:
X t = X 0 + α t + σ B t H .
For the model d U t = κ U t d t + d B t H , it is well known that the MLE of κ can be expressed as:
κ ^ T κ = 0 T P ^ H t d M t H 0 T P ^ H 2 t d ω t H ,
where P ^ H t = d d ω t H 0 t k H t , s B s H d s .
Before considering the asymptotic properties of α ˜ T and κ ˜ T , we first introduce a lemma, which will be used to derive the asymptotic theory.
Lemma 2.
For κ = 0 and H 0 , 1 in Model (1), as T , we have:
0 T P ^ H t d M t H = O p T ,
0 T P ^ H 2 t d ω t H = O p T 2 ,
0 T t d ω t H = 1 λ H 2 2 H 3 2 H T 3 2 H ,
0 T t 2 d ω t H = 1 λ H 1 H 2 H T 4 2 H ,
0 T P ^ H t d ω t H = O p T 2 H ,
0 T t P ^ H t d ω t H = O p T 3 H ,
V H T = λ H k H B 3 2 H , 3 2 H ,
d d ω t H 0 t k H t , s s d s = a H t ,
where B ( · , · ) is the Beta function, λ H is defined by (8), and a H = 3 2 H 4 1 H .
We can now describe the asymptotic behavior of α ˜ T and κ ˜ T as T .
Theorem 3.
For κ = 0 , α 0 , and H 0 , 1 in Model (1), as T , we have:
T 1 H α ˜ T α d N 0 , σ 2 ρ H ,
T 2 H κ ˜ T κ d N 0 , σ 2 α 2 ϕ H ,
where ρ H = λ H 3 2 H 2 , ϕ H = 32 H ( 1 H ) ( 2 H ) Γ ( 3 2 H ) Γ ( H + 1 2 ) Γ ( 3 2 H ) and λ H is defined by (8).
Remark 8.
In the case of H = 1 / 2 and α 0 , we can see that X t = X 0 + α t + σ W t . A straightforward algebraic calculation shows ω t H = t , P H t = 1 σ X t , M T H = W t , P ^ H ( t ) = W t and that:
1 T 3 0 T X t 2 d t = α 2 3 + o p ( 1 ) ,
1 T 2 0 T X t d t = α 2 + o p ( 1 ) ,
1 T T 0 T X t d W t = α T T 0 T t d W t + o p ( 1 ) .
Then, by the scaling properties of the Brownian motion, (21) and (58)–(60), we deduce that:
T α ˜ T α = T W T 0 T X t 2 d t 0 T X t d W t 0 T X t d t T 0 T X t 2 d t 0 T X t d t 2 σ = T 1 T 1 T W T 1 T 3 0 T X t 2 d t 1 T T 0 T X t d W t 1 T 2 0 T X t d t 1 T 3 0 T X t 2 d t 1 T 2 0 T X t d t 2 σ = α 2 3 W T T α 2 2 1 T T 0 T t d W t + o p ( 1 ) α 2 3 α 2 2 + o p ( 1 ) σ = 12 σ W T 3 T 1 2 T T 0 T t d W t + o p ( 1 ) d N 0 , 4 σ 2 ,
which is identical to (56) with H = 1 / 2 . Moreover, using (22) and (58)–(60), we can write:
T T κ ˜ T κ = T T W T 0 T X t d t T 0 T X t d W t σ T 0 T X t 2 d t 0 T X t d t 2 = W T T 1 T 2 0 T X t d t 1 T T 0 T X t d W t σ 1 T 3 0 T X t 2 d t 1 T 2 0 T X t d t 2 d N 0 , 12 σ 2 α 2 ,
which is identical to (57) with H = 1 / 2 being assumed.
Remark 9.
In the case of H = 1 / 2 and α = 0 , with α and κ being estimated, by the scaling properties of the Brownian motion, we have:
T α ˜ T α d W 1 0 1 W t d t 2 0 1 W t d t 0 1 W t d W t 0 1 W t 2 d t 0 1 W t d t 2 σ , T κ ˜ T κ d W 1 0 1 W t d t 0 1 W t d W t 0 1 W t 2 d t 0 1 W t d t 2 .
Thus, the limiting distributions of α ˜ T and κ ˜ T are not normal. In particular, the asymptotic distribution of κ ˜ T is a Dickey–Fuller–Phillips-type distribution with the rate of convergence being T. Hence, when κ = 0 is unknown, the value of α plays an important role in the study of asymptotic laws for the MLE.

5. Asymptotic Theory When κ < 0

When κ < 0 , the model given by (3) is non-ergodic or explosive. Since the proofs of the asymptotic theory of α ˜ T and κ ˜ T when H = 1 / 2 are different from those when H ( 0 , 1 / 2 ) ( 1 / 2 , 1 ) , we first consider the case of H = 1 / 2 . For the sake of notational simplicity, we introduce the process ξ t = σ 0 t e κ s d W s N 0 , σ 2 e 2 κ t 1 2 κ for t 0 . Obviously, ξ N 0 , σ 2 2 κ . Moreover, using (17) and the definition of ξ t , we can easily obtain:
σ e κ T 0 T U t d t = e κ T 0 T e κ t ξ t d t p ξ κ ,
σ e 2 κ T 0 T e κ t U t d t p ξ 2 κ .
Since E 0 T e κ T t d W t 2 = e 2 κ T 0 T e 2 κ t d t = e 2 κ T 1 2 κ , we can obtain:
0 T e κ T t d W t = O p ( 1 ) .
Moreover, let η t = σ 0 t e κ t s d W s be a zero mean Gaussian process. Since E η t 2 = σ 2 0 t e 2 κ t s d s = σ 2 2 κ + σ 2 2 κ e 2 κ t , we have η N 0 , σ 2 2 κ . Consequently, we can obtain:
σ 0 T e κ T t ξ t d W t p ξ η ,
where ξ and η are two independent N ( 0 , σ 2 / ( 2 κ ) ) random variables.

5.1. Asymptotic Theory When H = 1 / 2

Now, we can state the key results of the asymptotic theory for α ˜ T and κ ˜ T when H = 1 / 2 .
Theorem 4.
For κ < 0 , H = 1 / 2 in Model (1), as T , we have:
T α ˜ T α d N 0 , σ 2 ,
e κ T 2 κ κ ˜ T κ d η X 0 α κ + ξ ,
where ξ and η are two independent N ( 0 , σ 2 / ( 2 κ ) ) random variables.
Remark 10.
In (66), if we set X 0 = α κ , the limiting distribution of e κ T 2 κ κ ˜ T κ becomes a standard Cauchy variate. This limiting distribution is the same as that in the Vasicek model driven by a standard Brownian motion (see, e.g., Feigin 1976). The asymptotic theory in (66) is similar to that in the explosive discrete-time and continuous-time models when discretely-sampled data are available (see, e.g., White 1958; Anderson 1959; Phillips and Magdalinos 2007; Wang and Yu 2015, 2016).

5.2. Asymptotic Theory When H ( 0 , 1 / 2 ) ( 1 / 2 , 1 )

We now turn to the case when H ( 0 , 1 / 2 ) ( 1 / 2 , 1 ) assuming κ < 0 . The limiting theory is the most difficult to derive in our paper and, hence, is the main technical contribution to the literature. First, we can have the following lemma.
Lemma 3.
For κ < 0 and H ( 0 , 1 ) in Model (1), we have:
V H t = O t H 1 2 e κ t ,
0 T P ˜ H 2 t d ω t H = O p e 2 κ T ,
0 T P ˜ H t d M t H = O p e κ T ,
0 T V H t d ω t H = O T 1 2 H e κ T ,
0 T P ˜ H t d ω t H = O p e κ T T 1 2 H ,
0 T V H 2 t d ω t H = O e 2 κ T ,
0 T V H t P ˜ H t d ω t H = O p e 2 κ T ,
0 T V H t d M t H = O p e κ T .
Now, we can state the asymptotic theory for α ˜ T and κ ˜ T for κ < 0 and H ( 0 , 1 / 2 ) ( 1 / 2 , 1 ) .
Theorem 5.
When κ < 0 , H ( 0 , 1 / 2 ) ( 1 / 2 , 1 ) , and X 0 = α κ in Model (1), as T , we have:
T 1 H α ˜ T α d N 0 , λ H σ 2 ,
e κ T 2 κ κ ˜ T κ d X sin π H Y ,
where X and Y are two independent N ( 0 , 1 ) random variables.
Remark 11.
For the entire range of H ( 0 , 1 ) , the asymptotic distribution of α ˜ T is normal with the rate of convergence of T 1 H and variance σ 2 . This asymptotic distribution is the same as that of the LS estimate (see Theorem 3.5 in Xiao and Yu (2019a) and Section 3 in Xiao and Yu (2019b)).
Remark 12.
According to (76), the asymptotic law of e κ T 2 κ κ ˜ T κ is the standard Cauchy times sin π H . For H ( 0 , 1 / 2 ) ( 1 / 2 , 1 ) , sin π H ( 0 , 1 ) , suggesting that as H draws further away from 1 / 2 , κ is estimated with higher accuracy. Moreover, with X 0 = α κ , from Theorem 3.5 in Xiao and Yu (2019a, 2019b), we can see that the LS estimator of κ, which is denoted by κ ^ L S , has the asymptotic law e κ T 2 κ κ ^ L S κ d C , where C is the standard Cauchy distribution. Since the second moment of the Cauchy distribution is infinite, we cannot use the variances to measure the asymptotic relative efficiency. From Theorem 2 in Tanaka (2020) and based on the asymptotic concentration probability, we can see that the MLE is always more efficient asymptotically than the LS estimator for H ( 0 , 0 . 5 ) ( 0 . 5 , 1 ) . For H = 1 / 2 , the MLE is asymptotically the same as the LS estimator.
Remark 13.
When X 0 α κ , using Lemma 3, we can obtain:
e κ T 2 κ κ ˜ T κ = 2 κ e κ T 0 T 1 σ X 0 α κ V H t + P ˜ H t d M t H + o p ( 1 ) 4 κ 2 e 2 κ T 0 T 1 σ X 0 α κ V H t + P ˜ H t 2 d ω t H + o p ( 1 ) .
In this case, to obtain the asymptotic distribution of e κ T κ ˜ T κ / ( 2 κ ) , one needs to calculate the Laplace transform of 0 T 1 σ X 0 α κ V H t + P ˜ H t 2 d ω t H . On the other hand, for 1 / 2 < H < 1 , using the moment generating function of S T , 0 T P H 2 t d ω t H , 0 T P H t d S t , and 0 T P H t d ω t H , Lohvinenko and Ralchenko (2019) provided the joint asymptotic normality of MLE of the vector parameter ( α , κ ) :
T 1 H α ˜ T α e κ T κ ˜ T κ d ϑ 1 ϑ 2 / ϑ 3 ,
where ϑ 1 d N 0 , λ H σ 2 , ϑ 2 d N 0 , 1 , and ϑ 3 d N x 0 α κ λ H ρ H κ H 1 π 4 4 H , 1 4 κ 2 sin π H , which generalizes (76) for particular X 0 = α / κ .

6. Concluding Remarks and Future Directions

The fVm has found more and more applications in practice. In this paper, we consider the MLE of parameters in the drift term when a continuous record of observations is available. The ML estimation is made possible due to the presence of the fundamental martingale and the generalized Girsanov theorem. The asymptotic theory is based on the assumption that T .
It is shown that the MLE of α is asymptotically normal regardless of the sign of κ . However, the asymptotic law of the MLE of κ critically depends on the sign of κ . More precisely, when κ > 0 and H ( 0 , 1 ) , we have shown that the asymptotic distribution of the MLE of κ is normal with the rate of convergence being T . The asymptotic variance is 2 κ , which is independent of H. When κ = 0 and α 0 , the asymptotic distribution of the MLE of κ is normal with the rate of convergence being T 2 H . The asymptotic variance depends on H. When κ = 0 and α = 0 , the asymptotic distribution of the MLE of κ is a Dickey–Fuller–Phillips distribution with the rate of convergence being T. When κ < 0 , it is shown that the limiting distribution is a Cauchy-type with the rate of convergence being e κ T . If one further assumes that X 0 = α / κ , the limiting distribution becomes a standard Cauchy variate multiplied by sin ( π H ) . Table 1 summarizes the asymptotic laws of α ˜ T and κ ˜ T for different ranges of H and κ , where λ H = 2 H Γ 3 2 H Γ H + 1 2 Γ 3 2 H , ρ H = λ H 3 2 H 2 , ϕ H = 32 H ( 1 H ) ( 2 H ) Γ ( 3 2 H ) Γ ( H + 1 2 ) Γ ( 3 2 H ) , ξ , and η are two independent N ( 0 , σ 2 / ( 2 κ ) ) random variables and X and Y are two independent N ( 0 , 1 ) random variables. Moreover, we assume X 0 = α / κ for κ < 0 and H ( 0 , 1 / 2 ) ( 1 / 2 , 1 ) .
This study also suggests several important directions for future research. First, it is worth investigating generalizing the results in this paper to nonlinear stochastic differential equations driven by the fBm. The ergodic theorem, fractional calculus, and Malliavin calculus will be employed for obtaining the asymptotic properties of both the MLE and the LS estimators.
Second, in this paper, H and σ are assumed to be known. In practice, both H and σ are almost always unknown. Although many approaches have been proposed to estimate the Hurst coefficient and the volatility parameter from discrete time observations, how to estimate H and σ in fVm with a continuous record of observations is an open question. It is interesting to realize that we can use the generalized quadratic variation to estimate both the Hurst parameter and the volatility parameter in fVm. For T > 0 and any ϵ ξ ,
H = lim ϵ 0 , ξ 0 1 2 log ϵ ξ log 0 T X t + ξ X t 2 d t 0 T X t + ϵ X t 2 d t , σ 2 = lim ϵ 0 0 T X t + ϵ X t 2 d t ϵ 2 H T .
It would be interesting to study the asymptotic properties of these estimators mentioned above, which will be reported in later work.
Third, this paper assumes that a continuous record of an increasing time span is available for the development of asymptotic theory. In practice, data are typically observed at discrete time points with ( 0 , h , 2 h , , N h ( : = T ) ) where h is the sampling interval and T is the time span. When high frequency data over a long spanning time period are available, one may consider using a double asymptotic scheme by assuming h 0 and T . The discretized model corresponding to (3) is given by:
y t h = μ + exp ( κ h ) y t 1 h μ + u t , ( 1 L ) d u t = ε t , t = 1 , , N ,
where L is the lag operator, d = H 1 / 2 . As shown in Wang and Yu (2016), under the double asymptotic scheme, exp ( κ h ) = exp κ / k N = 1 κ / k N + O ( k N 2 ) 1 where k N : = 1 / h as h 0 and k N / N = 1 / T 0 as T . This implies an autoregressive (AR) model with the AR root moderately deviating from unity and with a fractionally integrated error term with d ( 1 / 2 , 0 ) . This model is closely related to a model considered in Magdalinos (2012) where it is assumed that d ( 0 , 1 / 2 ) . Developing double asymptotic theory based on discretely-sampled data will allow one to extend the results of Magdalinos (2012) to the case where d ( 1 / 2 , 1 / 2 ) . The development of the MLE and the asymptotic theory is beyond the scope of this paper and will be reported in later work.

Author Contributions

Three authors make the equal contribution to the paper. All authors have read and agreed to the published version of the manuscript.

Funding

Xiao gratefully acknowledges the financial support of the National Natural Science Foundation of China (No. 71871202); Yu wishes to thank the Singapore Ministry of Education for Academic Research Fund under grant number MOE2013-T3-1-009 and acknowledges the financial support from the Lee Foundation.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Proof of Lemma 1

We first consider (30). Let us introduce the modified Bessel functions of the first kind (see, e.g., Abramowitz and Stegun 1972), which are defined as:
I ν ( z ) = 1 2 z ν k = 0 1 4 z 2 k k ! Γ ( ν + k + 1 ) , where ν R .
From page 377 in Abramowitz and Stegun (1972), we can see that the asymptotic behavior of I ν ( z ) is:
I ν ( z ) = e z 2 π z [ 1 + O ( 1 / z ) ] , when z , | arg z | < π 2 .
Using (6), (15) and (A2), we get:
V H T = d d ω T H 0 T k H T , s e κ s d s = d d ω T H π κ H 1 Γ 3 2 H k H T 1 H e κ T 2 I 1 H κ T 2 = κ H 3 2 Γ 2 2 H Γ 1 2 H T H 3 2 + O T H 5 2 ,
which is (30).
Then, as T , using arguments similar to those in Lemma 4.2 of Lohvinenko and Ralchenko (2017), we can obtain:
0 T V H t d ω t H = 0 T k H T , s e κ s d s = O T 1 2 H ,
which yields (31).
By the proof of Theorem 3 in Tanaka (2013), we can easily obtain (32) and (33). The result of (34) follows directly from 0 1 V H 2 t d ω t H < and 1 T V H 2 t d ω t H < (see the proof of Lemma 4.7 in Lohvinenko and Ralchenko 2017).
Next, we consider (35). Let J ˜ T = 0 T P ˜ H ( t ) d ω t H . Then, using Lemma 2 in Lohvinenko and Ralchenko (2019) and replacing J T with J ˜ T , we have:
E e θ J ˜ T = m ˜ 1 α 1 , β 1 α 1 σ , κ + β 1 exp α 1 2 2 σ 2 ω T H ,
where α 1 = σ θ / κ , β 1 = κ , m ˜ 1 α 1 , β 1 ξ 1 , ξ 2 is the moment generating function defined by Lemma 1 in Lohvinenko and Ralchenko (2019). Using Lemma 1 in Lohvinenko and Ralchenko (2019) again, we can obtain:
m ˜ 1 α 1 , β 1 α 1 σ , κ + β 1 = D α 1 , β 1 κ + β 1 1 / 2 exp [ α 1 2 σ 2 c 3 T 2 2 H e β 1 T 8 D α 1 , β 1 κ + β 1 × I 1 H β 1 T 2 I H 1 β 1 T 2 κ + β 1 2 T ] ,
where c 3 = 4 ( 1 H ) Γ ( H ) Γ ( 1 H ) / λ H and D α 1 , β 1 ξ 2 is defined by Equation (10) in Lohvinenko and Ralchenko (2019).
Consequently, using the fact D α 1 , β 1 κ + β 1 = e 2 κ T , α 1 = σ θ / κ , β 1 = κ , (A4) and (A5), we have:
E e θ J ˜ T = e κ T exp θ 2 κ 2 c 3 T 2 2 H e β 1 T 8 e 2 κ T e κ T π κ T × 1 + O T 1 κ + β 1 2 T + θ 2 ω T H 2 κ 2 = exp θ 2 2 κ 2 ω T H + θ 2 O T 1 2 H ,
which yields:
E J ˜ T = d d θ E e θ J ˜ T θ = 0 = 0 ,
E J ˜ T 2 = d 2 d θ 2 E e θ J ˜ T θ = 0 = ω T H κ 2 + O T 1 2 H .
Using (7), (A5), (A7) and (A8), we can obtain:
J ˜ T = O T 1 H , J ˜ T T 1 H d N 0 , 1 κ 2 λ H ,
which implies (35).
Now, we consider (36). Using the Cauchy–Schwarz inequality, (33) and (34), we obtain:
0 T V H t P ˜ H t d ω t H 0 T V H 2 t d ω t H 0 T P ˜ H 2 t d ω t H = O ( 1 ) O p ( T ) ,
which implies (36).
Finally, we are left with (37). Using (34), we can obtain:
E 0 T V H t d M t H 2 = 0 T V H 2 t d ω t H = O ( 1 ) ,
which implies (37) directly.

Appendix A.2. Proof of Theorem 1

To simplify the notations, let X ˜ 0 : = X 0 α κ . Using (14), we have:
0 T P H 2 t d ω t H = 0 T α σ κ + 1 σ X ˜ 0 V H t + P ˜ H t 2 d ω t H = α 2 σ 2 κ 2 ω T H + 1 σ 2 X ˜ 0 2 0 T V H 2 t d ω t H + 0 T P ˜ H 2 t d ω t H + 2 α σ 2 κ X ˜ 0 0 T V H t d ω t H + 2 α σ κ 0 T P ˜ H t d ω t H + 2 σ X ˜ 0 0 T V H t P ˜ H t d ω t H .
Using (14) again, we obtain:
0 T P H t d ω t H 2 = 0 T α σ κ + 1 σ X ˜ 0 V H t + P ˜ H t d ω t H 2 = α 2 σ 2 κ 2 ω T H 2 + 1 σ 2 X ˜ 0 2 0 T V H t d ω t H 2 + 0 T P ˜ H t d ω t H 2 + 2 α σ κ ω T H 0 T P ˜ H t d ω t H + 2 α σ 2 κ ω T H X ˜ 0 0 T V H t d ω t H + 2 σ X ˜ 0 0 T V H t d ω t H 0 T P ˜ H t d ω t H .
Using (A9), (A10) and Lemma 1, we deduce that:
ω T H 0 T P H 2 t d ω t H 0 T P H t d ω t H 2 = ω T H σ 2 X ˜ 0 2 0 T V H 2 t d ω t H + ω T H 0 T P ˜ H 2 t d ω t H + ω T H 2 α σ 2 κ X ˜ 0 0 T V H t d ω t H + ω T H 2 α σ κ 0 T P ˜ H t d ω t H + ω T H 2 σ X ˜ 0 0 T V H t P ˜ H t d ω t H 1 σ 2 X ˜ 0 2 0 T V H t d ω t H 2 0 T P ˜ H t d ω t H 2 2 α σ 2 κ ω T H X ˜ 0 0 T V H t d ω t H 2 α σ κ ω T H 0 T P ˜ H t d ω t H 2 σ X ˜ 0 0 T V H t d ω t H 0 T P ˜ H t d ω t H = ω T H 0 T P ˜ H 2 t d ω t H + o p ( T 3 2 H ) .
Moreover, using (14), we get:
0 T P H ( t ) d M t H 0 T P H ( t ) d ω t H = α σ κ M T H + 1 σ X ˜ 0 0 T V H t d M t H + 0 T P ˜ H t d M t H × α σ κ ω t H + 1 σ X ˜ 0 0 T V H t d ω t H + 0 T P ˜ H t d ω t H = α 2 σ 2 κ 2 M T H ω T H + α σ κ M T H 1 σ X ˜ 0 0 T V H t d ω t H + α σ κ M T H 0 T P ˜ H t d ω t H + 1 σ X ˜ 0 0 T V H t d M t H α σ κ ω T H + 1 σ 2 X ˜ 0 2 0 T V H t d M t H 0 T V H t d ω t H + 1 σ X ˜ 0 0 T V H t d M t H 0 T P ˜ H t d ω t H + α σ κ ω T H 0 T P ˜ H t d M t H + 1 σ X ˜ 0 0 T P ˜ H t d M t H 0 T V H t d ω t H + 0 T P ˜ H t d M t H 0 T P ˜ H t d ω t H .
By combining (A9), (A12) and Lemma 1, we have:
M T H 0 T P H 2 t d ω t H 0 T P H ( t ) d M t H 0 T P H ( t ) d ω t H = α 2 σ 2 κ 2 M T H ω T H + M T H σ 2 X ˜ 0 2 0 T V H 2 t d ω t H + M T H 0 T P ˜ H 2 t d ω t H + M T H 2 α σ 2 κ X ˜ 0 0 T V H t d ω t H + M T H 2 α σ κ 0 T P ˜ H t d ω t H + M T H 2 σ X ˜ 0 0 T V H t P ˜ H t d ω t H α 2 σ 2 κ 2 M T H ω T H α σ κ M T H 1 σ X ˜ 0 0 T V H t d ω t H α σ κ M T H 0 T P ˜ H t d ω t H 1 σ X ˜ 0 0 T V H t d M t H α σ κ ω T H 1 σ 2 X ˜ 0 2 0 T V H t d M t H 0 T V H t d ω t H 1 σ X ˜ 0 0 T V H t d M t H 0 T P ˜ H t d ω t H α σ κ ω T H 0 T P ˜ H t d M t H 1 σ X ˜ 0 0 T P ˜ H t d M t H 0 T V H t d ω t H 0 T P ˜ H t d M t H 0 T P ˜ H t d ω t H = α σ κ ω T H 0 T P ˜ H t d M t H + o p T 5 2 2 H .
According to Corollary 5.2 of Jost (2006), for H < 1 / 2 , we have the relationship between B t H and B t 1 H ,
B t H = 2 H Γ 2 H Γ 3 2 H 1 2 0 t t s 2 H 1 d B s 1 H .
Using (A14), following Equation (34) in Brouste and Kleptsyna (2010), we can transform Model (1) to the following model,
d X ^ t = κ μ X ^ t d t + σ d B t 1 H ,
where X ^ t = 2 H Γ ( 2 H ) Γ ( 3 2 H ) 1 2 0 t ( t s ) 1 2 H d X s . When H < 1 / 2 and 1 H > 1 / 2 , hence, the results in (4.4) and (4.5) of Lohvinenko and Ralchenko (2017) are valid for all H ( 0 , 1 ) .
Now, combining (21), (A13), (A11) and (4.4), (4.5) in Lohvinenko and Ralchenko (2017) with Slutsky’s theorem, we obtain:
T α ˜ T α = T M T H 0 T P H 2 t d ω t H 0 T P H t d M t H 0 T P H t d ω t H ω T H 0 T P H 2 t d ω t H 0 T P H t d ω t H 2 σ = α σ κ ω T H 1 T 0 T P ˜ H t d M t H + o p T 2 2 H ω T H 1 T 0 T P ˜ H 2 t d ω t H + o p ( T 2 2 H ) σ d N 0 , 2 α 2 κ .
Now, we consider (39). Using (14) and (A10), we have:
M T H 0 T P H t d ω t H ω T H 0 T P H t d M t H = M T H α σ κ ω T H + M T H 1 σ X ˜ 0 0 T V H t d ω t H + M T H 0 T P ˜ H t d ω t H ω T H α σ κ M T H + ω T H 1 σ X ˜ 0 0 T V H t d M t H + ω T H 0 T P ˜ H t d M t H = ω T H 0 T P ˜ H t d M t H + o p T 5 2 2 H .
Finally, combining (22), (A15), (A11) and (4.4), (4.5) in Lohvinenko and Ralchenko (2017) with Slutsky’s theorem, we have:
T κ ˜ T κ = ω T H 1 T 0 T P ˜ H t d M t H + o p ( T 2 2 H ) ω T H 1 T 0 T P ˜ H 2 t d ω t H + o p ( T 2 2 H ) d N 0 , 2 κ .

Appendix A.3. Proof of Lemma 2

From the proof of Theorem 2 in Tanaka (2013), we can easily obtain (48) and (49). A simple calculation shows that:
0 T t d ω t H = 0 T t 1 λ H 2 2 H t 1 2 H d t = 1 λ H 2 2 H 3 2 H T 3 2 H .
Similarly, a standard calculation yields:
0 T t 2 d ω t H = 0 T t 2 1 λ H 2 2 H t 1 2 H d t = 1 λ H 1 H 2 H T 4 2 H .
Combining (49) with the Cauchy–Schwarz inequality, we have:
0 T P ^ H t d ω t H ω T H 0 T P ^ H 2 t d ω t H = O p T 2 H .
Using (49), (51) and the Cauchy–Schwarz inequality, we obtain:
0 T t P ^ H t d ω t H 0 T t 2 d ω t H 0 T P ^ H 2 t d ω t H = O p T 3 H .
Now, we consider (54). Form the definition of V H T , we conclude that:
V H T = d d ω T H 0 T k H T , s e κ s d s = d d ω T H 0 T k H T , s d s = d d ω T H 1 k H 0 T s T s 1 2 H d s = d d ω T H 1 k H 0 1 T 1 2 H u 1 u 1 2 H T d u = d d T 1 k H T 2 2 H B 3 2 H , 3 2 H / d ω T H d T = 1 .
Finally, we deal with (55). A standard calculation yields:
d d ω t H 0 t k H t , s s d s = d d ω t H 1 k H 0 t s t s 1 2 H s d s = d d ω t H 1 k H 0 t s 3 2 H t s 1 2 H d s = d d ω t H 1 k H 0 1 v t 3 2 H t v t 1 2 H t d v = d d ω t H 1 k H t 3 2 H 0 1 v 3 2 H 1 v 1 2 H d v = d d ω t H 1 k H t 3 2 H B 5 2 H , 3 2 H = a H t ,
where a H = ( 3 2 H ) / 4 4 H , and the proof of this lemma is complete.

Appendix A.4. Proof of Theorem 3

Using (13), (46) and (55), we have:
P H t = 1 σ d d ω t H 0 t k H t , s X s d s = 1 σ d d ω t H 0 t k H t , s X 0 + α s + σ B s H d s = X 0 σ + α σ d d ω t H 0 t k H t , s s d s + P ^ H t = X 0 σ + α σ a H t + P ^ H t ,
where P ^ H t = d d ω t H 0 t k H t , s B s H d s . Using (49)–(53) and (A16), we have:
0 T P H 2 t d ω t H = 0 T X 0 σ + α σ a H t + P ^ H t 2 d ω t H = X 0 2 σ 2 ω T H + α 2 σ 2 a H 2 0 T t 2 d ω t H + 0 T P ^ H 2 t d ω t H + 2 X 0 σ 2 α a H 0 T t d ω t H + 2 X 0 σ 0 T P ^ H t d ω t H + 2 α σ a H 0 T t P ^ H t d ω t H = α 2 σ 2 a H 2 0 T t 2 d ω t H + o p ( T 4 2 H ) .
Similarly, combining (50) with (52) leads to:
0 T P H t d ω t H = 0 T X 0 σ + α σ a H t + P ^ H t d ω t H = X 0 σ ω T H + α σ a H 0 T t d ω t H + 0 T P ^ H t d ω t H = α σ a H 0 T t d ω t H + o p ( T 3 2 H ) .
Moreover, using (48) and (A16), we have:
0 T P H t d M t H = 0 T X 0 σ + α σ a H t + P ^ H t d M t H = X 0 σ M T H + α σ a H 0 T t d M t H + 0 T P ^ H t d M t H = X 0 σ M T H + α σ a H 0 T t d M t H + O p T .
According to (A17) and (A18), we get:
ω T H 0 T P H 2 t d ω t H 0 T P H t d ω t H 2 = 1 λ H T 2 2 H α 2 σ 2 a H 2 0 T t 2 d ω t H α 2 σ 2 a H 2 1 λ H 2 2 2 H 2 3 2 H 2 T 6 4 H + o p ( T 6 4 H ) = T 6 4 H λ H 2 α 2 σ 2 a H 2 1 H 2 H 2 2 H 2 3 2 H 2 + o p ( T 6 4 H ) = T 6 4 H σ 2 λ H 2 α 2 a H 2 1 H 2 H 3 2 H 2 + o p T 6 4 H .
Similarly, applying (A17) and (A19), we have:
M T H 0 T P H 2 t d ω t H 0 T P H t d ω t H 0 T P H t d M t H = M T H α 2 σ 2 a H 2 0 T t 2 d ω t H α σ a H 0 T t d ω t H α σ a H 0 T t d M t H + o p ( T 5 3 H ) = α 2 σ 2 a H 2 M T H T 4 2 H λ H 1 H 2 H T 3 2 H λ H 2 2 H 3 2 H 0 T t d M t H + o p ( T 5 3 H ) = α 2 a H 2 T 5 3 H λ H σ 2 1 H 2 H M T H T 1 H 2 2 H 3 2 H 1 T 2 H 0 T t d M t H + o p ( T 5 3 H ) .
Using (A21), we can see that:
1 T 5 3 H M T H 0 T P H 2 t d ω t H 0 T P H t d ω t H 0 T P H t d M t H = α 2 a H 2 λ H σ 2 1 H 2 H M T H T 1 H 2 2 H 3 2 H 1 T 2 H 0 T t d M t H + o p ( 1 ) d N 0 , 1 H 2 2 H 2 1 λ H 1 H 2 H 2 2 H 2 3 2 H 2 1 λ H .
Consequently, combining (21) and (A20)–(A22) with Slutsky’s theorem, we have:
T 1 H α ˜ T α = 1 T 5 3 H M T H 0 T P H 2 t d ω t H 0 T P H t d M t H 0 T P H t d ω t H 1 T 6 4 H ω T H 0 T P H 2 t d ω t H 0 T P H t d ω t H 2 σ d N 0 , σ 2 ρ H .
By (7), (A18), (A19) and the fact that M T H = O p ( T 1 H ) , we obtain:
M T H 0 T P H t d ω t H ω T H 0 T P H t d M t H = M T H α σ a H 0 T t d ω t H ω T H α σ a H 0 T t d M t H + o p T 4 3 H = α σ a H M T H 1 λ H 2 2 H 3 2 H T 3 2 H 1 λ H T 2 2 H 0 T t d M t H + o p T 4 3 H = α σ λ H a H T 4 3 H M T H T 1 H 2 2 H 3 2 H 1 T 2 H 0 T t d M t H + o p T 4 3 H .
Using (22), (A20), (A23) and Slutsky’s theorem, we can see that:
T 2 H κ ˜ T κ = 1 T 4 3 H M T H 0 T P H t d ω t H ω T H 0 T P H t d M t H 1 T 6 4 H ω T H 0 T P H 2 t d ω t H 0 T P H t d ω t H 2 d N 0 , σ 2 α 2 ϕ H .

Appendix A.5. Proof of Theorem 4

Using (17), (61) and (62), we can obtain:
0 T X t 2 d t = 0 T α κ + e κ t X ˜ 0 + σ U t 2 d t = α 2 κ 2 T + X ˜ 0 2 0 T e 2 κ t d t + σ 2 0 T U t 2 d t + 2 α κ X ˜ 0 0 T e κ t d t + 2 α σ κ 0 T U t d t + 2 X ˜ 0 σ 0 T e κ t U t d t = X ˜ 0 2 0 T e 2 κ t d t + 0 T e 2 κ t ξ t 2 d t + 2 X ˜ 0 0 T e 2 κ t ξ t d t + o p e 2 κ T = 0 T e 2 κ t X ˜ 0 + ξ t 2 d t + o p ( e 2 κ T ) .
Similarly, using (17), (61) and (62) again, we can easily have:
0 T X t d t = 0 T α κ + e κ t X ˜ 0 + σ U t d t = α κ T + X ˜ 0 1 κ 1 e κ T + σ 0 T U t d t = O p e κ T .
A straightforward calculation shows:
0 T X t d W t = 0 T α κ + e κ t X ˜ 0 + σ U t d W t = α κ W T + X ˜ 0 0 T e κ t d W t + σ 0 T U t d W t = O p e κ T .
From the definition of ξ t , we can rewrite X t as X t = α κ + e κ t X ˜ 0 + e κ t ξ t . As a consequence, using (63), (64) and (A24), we can see that:
e 2 κ T 0 T X t 2 d t = 0 T e 2 κ t X ˜ 0 + ξ t 2 d t e 2 κ T + o p 1 ,
σ e κ T 0 T X t d W t = X ˜ 0 σ 0 T e κ T t d W t + σ 0 T e κ T t ξ t d W t + o p 1 .
Now, applying (21) and (A25)–(A27) and Slutsky’s theorem, we deduce:
T α ˜ T α = W T T e 2 κ T 0 T X t 2 d t e 2 κ T T 0 T X t d W t 0 T X t d t e 2 κ T 0 T X t 2 d t 1 T 0 T X t d t 2 σ = W T T e 2 κ T 0 T X t 2 d t + o p 1 e 2 κ T 0 T X t 2 d t + o p 1 σ = σ W T T + o p ( 1 ) ,
which implies (65).
Finally, using (22), (A25), (A27), (A28) and Slutsky’s theorem, we have:
e κ T κ ˜ T κ = e κ T W T T 0 T X t d t 0 T X t d W t 0 T X t 2 d t 1 T 0 T X t d t 2 σ = σ e κ T 0 T X t d W t + o p 1 e 2 κ T 0 T X t 2 d t + o p 1 d X 0 α κ + ξ η 1 2 κ X 0 α κ + ξ 2 ,
which yields (66), and the proof is done.

Appendix A.6. Proof of Lemma 3

We first consider (67). Applying (14) and (15), we can obtain:
V H t = d d ω t H 0 t k H t , s e κ s d s = d d t 0 t k H t , s e κ s d s d ω t H d t = λ H π κ H 1 Γ 3 / 2 H k H 2 2 H t 1 2 H [ 1 H t H e κ t 2 I 1 H κ t 2 κ 2 t 1 H e κ t 2 I 1 H κ t 2 + t 1 H e κ t 2 1 2 κ 2 I 2 H κ t 2 + I H κ t 2 ] = λ H π κ H 1 Γ 3 2 H k H 2 2 H [ 1 H t 1 + H e κ t 2 I 1 H κ t 2 κ 2 t H e κ t 2 I 1 H κ t 2 κ 4 t H e κ t 2 I 2 H κ t 2 κ 4 t H e κ t 2 I H κ t 2 ] = λ H π κ H 1 Γ 3 2 H k H 2 2 H [ 1 H t 1 + H e κ t π κ t 1 + O t 1 κ 2 t H e κ t π κ t 1 + O t 1 κ 4 t H e κ t π κ t 1 + O t 1 κ 4 t H e κ t π κ t 1 + O t 1 ] = λ H κ H 1 Γ 3 2 1 k H 2 2 H e κ t [ 1 H κ 1 2 t 3 2 + H 1 + O t 1 + 1 2 κ 1 2 t H 1 2 1 + O t 1 + 1 4 t H 1 2 κ 1 2 1 + O t 1 + 1 4 κ 1 2 t H 1 2 1 + O t 1 ] = O t H 1 2 e κ t .
where I ν ( z ) is the modified Bessel function of the first kind defined in (A1) and we used the asymptotic behavior of (A2).
Let us observe that (68) can be obtained easily from Theorem 2 in Tanaka (2015), and the details are omitted here. For (69), using (68), we have:
E 0 T P ˜ H t d M t H 2 = 0 T P ˜ H 2 t d ω t H = O ( e 2 κ T ) ,
which implies (69) directly.
Let 1 F 1 ( · , · , · ) be the confluent hypergeometric function of the first kind. From (A29) and the well-known result of the confluent hypergeometric function (see, for example, Equation 3.383 (1) in Gradshteyn and Ryzhik 2007; Bateman 1953, p. 278), we have:
0 T V H t d ω t H T 1 2 H e κ T C 0 T t T 1 2 H e κ T t d t = C T 0 1 u 1 2 H e κ T 1 u d u = C T 0 1 1 v 1 2 H e κ T v d v = C T 1 F 1 1 , 5 2 H , κ T = O ( 1 ) ,
which yields (70).
We now deal with (71). Let ζ t = σ 0 t e κ s d B s H . Then, using Lemma 2.2 of El Machkouri et al. (2016), as T , we have:
ζ T p ζ N 0 , E ζ 2 .
Using (6), (17), (A30) and the property of the confluent hypergeometric function (see, for example, Equation 3.383 (1) in Gradshteyn and Ryzhik 2007; Bateman 1953, p. 278), we have:
0 T P ˜ H t d ω t H = 0 T k H T , t U t d t = 1 k H 0 T t T t 1 2 H e κ t σ ζ t d t = C T 2 2 H 0 1 u 1 u 1 2 H e κ T u ζ T u d u = O p ( 1 ) T 2 2 H 0 1 u 1 u 1 2 H e κ T u d u = O p ( 1 ) T 2 2 H 1 F 1 3 2 H , 3 2 H , κ T = O p ( 1 ) T 2 2 H O p ( T H 3 2 e κ T ) = O p ( T 1 2 H e κ T ) ,
which implies (71).
We now turn to the term (72). Using (A29), we can easily obtain:
0 T V H 2 t d ω t H C 0 T t 2 H 1 e 2 κ t t 1 2 H d t = O ( e 2 κ T ) ,
which yields (72).
Using the Cauchy–Schwarz inequality, (68) and (72), we obtain:
0 T V H t P ˜ H t d ω t H 2 0 T V H 2 t d ω t H 0 T P ˜ H 2 t d ω t H = O p e 4 κ T ,
which implies (73).
Similarly, using (72), we have:
E 0 T V H t d M t H 2 = 0 T V H 2 t d ω t H = O e 2 κ T ,
which yields (74), and we complete the proof.

Appendix A.7. Proof of Theorem 5

Using (14), (68) and (70)–(73), we can obtain:
0 T P H 2 t d ω t H = 0 T α σ κ + 1 σ X ˜ 0 V H t + P ˜ H t 2 d ω t H = α 2 σ 2 κ 2 ω T H + 1 σ 2 X ˜ 0 2 0 T V H 2 t d ω t H + 0 T P ˜ H 2 t d ω t H + 2 α σ 2 κ X ˜ 0 0 T V H t d ω t H + 2 α σ κ 0 T P ˜ H t d ω t H + 2 σ X ˜ 0 0 T V H t P ˜ H t d ω t H = 1 σ 2 X ˜ 0 2 0 T V H 2 t d ω t H + 0 T P ˜ H 2 t d ω t H + 2 σ X ˜ 0 0 T V H t P ˜ H t d ω T H + o p e 2 κ T = 0 T 1 σ X ˜ 0 V H t + P ˜ H t 2 d ω t H + o p ( e 2 κ T ) .
According to (14), (68), (70) and (71), we obtain:
1 ω T H 0 T P H t d ω t H 2 = 1 ω T H 0 T α σ κ + 1 σ X ˜ 0 V H t + P ˜ H t d ω t H 2 = 1 ω T H α σ κ ω T H + 1 σ X ˜ 0 0 T V H t d ω t H + 0 T P ˜ H t d ω t H 2 = 1 ω T H [ α 2 σ 2 κ 2 ω T H 2 + 1 σ 2 X ˜ 0 2 0 T V H t d ω t H 2 + 0 T P ˜ H t d ω t H 2 + 2 α σ 2 κ ω T H 0 T V H t d ω t H + 2 α σ κ ω T H 0 T P ˜ H t d ω t H + 2 σ X ˜ 0 0 T V H t d ω t H 0 T P ˜ H t d ω t H ] = o p e 2 κ T .
From (14), (69) and (74), we can see that:
0 T P H t d M t H = 0 T α σ κ + 1 σ X ˜ 0 V H t + P ˜ H t d M t H = α σ κ M T H + 1 σ X ˜ 0 0 T V H t d M t H + 0 T P ˜ H t d M t H = X ˜ 0 σ 0 T V H t d M t H + 0 T P ˜ H t d M t H + o p ( e κ T ) .
From (14) and the definition of ω t H , we can obtain:
0 T P H t d ω t H = 0 T α σ κ + 1 σ X ˜ 0 V H t + P ˜ H t d ω t H = α σ κ ω T H + 1 σ X ˜ 0 0 T V H t d ω t H + 0 T P ˜ H t d ω t H = X ˜ 0 σ 0 T V H t d ω t H + 0 T P ˜ H t d ω t H + O ( T 2 2 H ) .
Using (A31), we have:
M T H ω T H 0 T P H 2 t d ω t H 1 ω T H 0 T P H t d M t H 0 T P H t d ω t H = M T H ω T H 0 T 1 σ X 0 α κ V H t + P ˜ H t 2 d ω t H 1 ω T H 1 σ X 0 α κ 0 T V H t d M t H + 0 T P ˜ H t d M t H 0 T P ˜ H t d ω t H + o p e 2 κ T T 1 H .
Now, combining the above result, (21), (A31), (A32) and Lemma 3 with Slutsky’s theorem, we have:
T 1 H α ˜ T α = T 1 H M T H ω T H 0 T P H 2 t d ω t H T 1 H ω T H 0 T P H t d M t H 0 T P H t d ω t H 0 T P H 2 t d ω t H 1 ω T H 0 T P H t d ω t H 2 σ d N 0 , σ 2 .
Moreover, from Equation (31) in Tanaka (2015), replacing H with 1 H leads to the same moment generating function. This suggests that the distribution of κ 0 T P ˜ H t d M t H / 0 T P ˜ H 2 t d ω t H is symmetric around H = 1 / 2 . Hence, Equation (33) in Tanaka (2015) holds true for all H ( 0 , 1 ) .
Now, let X and Y be two independent N ( 0 , 1 ) random variables. Then, using (14), (22), (A31)–(A34), Lemma 3, Slutsky’s theorem and Equation (33) in Tanaka (2015), we can see that:
e κ T 2 κ κ ˜ T κ = e κ T 2 κ M T H ω T H 0 T P H t d ω t H 0 T P H t d M t H 0 T P H 2 t d ω t H 1 ω T H 0 T P H t d ω t H 2 = 2 κ e κ T 0 T P H t d M t H + o p ( 1 ) 4 κ 2 e 2 κ T 0 T P H 2 t d ω t H + o p ( 1 ) = 2 κ e κ T 0 T X ˜ 0 σ V H t + P ˜ H t d M t H + o p ( 1 ) 4 κ 2 e 2 κ T 0 T X ˜ 0 σ V H t + P ˜ H t 2 d ω t H + o p ( 1 ) = 2 κ e κ T 0 T P ˜ H t d M t H + o p ( 1 ) 4 κ 2 e 2 κ T 0 T P ˜ H 2 t d ω t H + o p ( 1 ) d X sin π H Y ,
with X ˜ 0 = 0 .

References

  1. Abramowitz, Milton, and Irene A. Stegun. 1972. Handbook of Mathematical Functions. New York: Dover. [Google Scholar]
  2. Aït-Sahalia, Yacine, and Loriano Mancini. 2008. Out of sample forecasts of quadratic variation. Journal of Econometrics 147: 17–33. [Google Scholar] [CrossRef] [Green Version]
  3. Anderson, Theodore W. 1959. On asymptotic distributions of estimates of parameters of stochastic difference equations. Annals of Mathematical Statistics 30: 676–87. [Google Scholar] [CrossRef]
  4. Bateman, Harry. 1953. Higher Transcendental Functions. New York: McGraw-Hill Book Company, vol. I. [Google Scholar]
  5. Bennedsen, Mikkel, Asger Lunde, and Mikko S. Pakkanen. 2017. Decoupling the Short- and Long-Term Behavior of Stochastic Volatility. Working Paper. Aarhus, Denmark: CREATES, Aarhus University. [Google Scholar]
  6. Brouste, Alexandre, and Marina Kleptsyna. 2010. Asymptotic properties of MLE for partially observed fractional diffusion system. Statistical Inference for Stochastic Processes 13: 1–13. [Google Scholar] [CrossRef]
  7. Brown, Bruce M., and Jonathan I. Hewitt. 1975. Asymptotic likelihood theory for diffusion processes. Journal of Applied Probability 12: 228–38. [Google Scholar] [CrossRef]
  8. Chan, Kalok C., G. Andrew Karolyi, Francis A. Longstaff, and Anthony B. Sanders. 1992. An empirical comparison of alternative models of the short—Term interest rate. Journal of Finance 47: 1209–27. [Google Scholar] [CrossRef]
  9. Chen, Ye, Peter CB Phillips, and Jun Yu. 2017. Inference in continuous systems with mildly explosive regressors. Journal of Econometrics 201: 400–16. [Google Scholar] [CrossRef] [Green Version]
  10. Cheridito, Patrick, Hideyuki Kawaguchi, and Makoto Maejima. 2003. Fractional Ornstein–Uhlenbeck processes. Electronic Journal of Probability 8: 1–14. [Google Scholar] [CrossRef]
  11. Comte, Fabienne, and Eric Renault. 1998. Long memory continuous-time stochastic volatility models. Mathematical Finance 8: 291–323. [Google Scholar] [CrossRef]
  12. El Machkouri, Mohamed, Khalifa Es-Sebaiy, and Youssef Ouknine. 2016. Least squares estimator for non-ergodic Ornstein–Uhlenbeck processes driven by Gaussian processes. Journal of the Korean Statistical Society 45: 329–41. [Google Scholar] [CrossRef] [Green Version]
  13. Es-Sebaiy, Khalifa, and Frederi G. Viens. 2019. Optimal rates for parameter estimation of stationary Gaussian processes. Stochastic Processes and Their Applications 129: 3018–54. [Google Scholar] [CrossRef] [Green Version]
  14. Feigin, Paul David. 1976. Maximum likelihood estimation for continuous-time stochastic processes. Advances in Applied Probability 8: 712–36. [Google Scholar] [CrossRef]
  15. Fukasawa, Masaaki, Tetsuya Takabatake, and Rebecca Westphal. 2019. Is volatility rough? arXiv arXiv:1905.04852. [Google Scholar]
  16. Gatheral, Jim, Thibault Jaisson, and Mathieu Rosenbaum. 2018. Volatility is rough. Quantitative Finance 18: 933–49. [Google Scholar] [CrossRef]
  17. Gradshteyn, Izrail Solomonovich, and Iosif Moiseevich Ryzhik, eds. 2007. Table of Integrals, Series, and Products. Amsterdam: Elsevier. [Google Scholar]
  18. Hu, Yaozhong, and David Nualart. 2010. Parameter estimation for fractional Ornstein–Uhlenbeck processes. Statistics and Probability Letters 80: 1030–38. [Google Scholar] [CrossRef] [Green Version]
  19. Hu, Yaozhong, David Nualart, and Hongjuan Zhou. 2019. Parameter estimation for fractional Ornstein–Uhlenbeck processes of general Hurst parameter. Statistical Inference for Stochastic Processes 22: 111–42. [Google Scholar] [CrossRef] [Green Version]
  20. Jamshidian, Farshid. 1989. An exact bond option formula. Journal of Finance 44: 205–9. [Google Scholar] [CrossRef]
  21. Jost, Celine. 2006. Transformation formulas for fractional Brownian motion. Stochastic Processes and Their Applications 116: 1341–57. [Google Scholar]
  22. Kleptsyna, Marina L., and Alain Le Breton. 2002. Statistical analysis of the fractional Ornstein–Uhlenbeck type process. Statistical Inference for Stochastic Processes 5: 229–48. [Google Scholar] [CrossRef]
  23. Kleptsyna, Marina L., A. Le Breton, and Marie-Christine Roubaud. 2000. Parameter estimation and optimal filtering for fractional type stochastic systems. Statistical Inference for Stochastic Processes 3: 173–82. [Google Scholar] [CrossRef]
  24. Kubilius, Kestutis, Y. Mishura, and Kostiantyn Ralchenko. 2018. Parameter Estimation in Fractional Diffusion Models. Berlin/Heidelberg: Springer. [Google Scholar]
  25. Kutoyants, Yury A. 2004. Statistical Inference for Ergodic Diffusion Process. Springer Series in Statistics; London: Springer. [Google Scholar]
  26. Lohvinenko, Stanislav, and Kostiantyn Ralchenko. 2017. Maximum likelihood estimation in the fractional Vasicek model. Lithuanian Journal of Statistics 56: 77–87. [Google Scholar] [CrossRef] [Green Version]
  27. Lohvinenko, Stanislav, and Kostiantyn Ralchenko. 2019. Maximum likelihood estimation in the non-ergodic fractional Vasicek model. Modern Stochastics: Theory and Applications 6: 377–95. [Google Scholar] [CrossRef]
  28. Lui, Yiu Lim, Weilin Xiao, and Jun Yu. 2020. Mild-Explosive Autoregression with Anti-Persistent Errors. Oxford Bulletin of Economics and Statistics. forthcoming. [Google Scholar]
  29. Magdalinos, Tassos. 2012. Mildly explosive autoregression under weak and strong dependence. Journal of Econometrics 169: 179–87. [Google Scholar] [CrossRef] [Green Version]
  30. Ng, Wai Man, and Tony S. Wirjanto. 2019. Bias of the Mean-Reversion Parameter in the Fractional Ornstein–Uhlenbeck Process. Working Paper. Waterloo: University of Waterloo. [Google Scholar]
  31. Norros, Ilkka, Esko Valkeila, and Jorma Virtamo. 1999. An elementary approach to a Girsanov formula and other analytical results on fractional Brownian motions. Bernoulli 5: 571–87. [Google Scholar] [CrossRef]
  32. Nourdin, Ivan, and T. T. Diu Tran. 2019. Statistical inference for Vasicek-type model driven by Hermite processes. Stochastic Processes and Their Applications 129: 3774–91. [Google Scholar] [CrossRef] [Green Version]
  33. Phillips, Peter C. B., and Tassos Magdalinos. 2007. Limit theory for moderate deviations from a unit root. Journal of Econometrics 136: 115–30. [Google Scholar] [CrossRef] [Green Version]
  34. Scott, Louis O. 1987. Option pricing when the variance changes randomly: Theory, estimation, and an application. Journal of Financial and Quantitative Analysis 22: 419–38. [Google Scholar] [CrossRef]
  35. Tanaka, Katsuto. 2013. Distributions of the maximum likelihood and minimum contrast estimators associated with the fractional Ornstein–Uhlenbeck process. Statistical Inference for Stochastic Processes 16: 173–92. [Google Scholar] [CrossRef] [Green Version]
  36. Tanaka, Katsuto. 2015. Maximum likelihood estimation for the non-ergodic fractional Ornstein–Uhlenbeck process. Statistical Inference for Stochastic Processes 18: 315–32. [Google Scholar] [CrossRef]
  37. Tanaka, Katsuto. 2020. Comparison of the LS-based estimators and the MLE for the fractional Ornstein–Uhlenbeck process. Statistical Inference for Stochastic Processes 23: 415–434. [Google Scholar] [CrossRef]
  38. Vasicek, Oldrich. 1977. An equilibrium characterization of the term structure. Journal of Financial Economics 5: 177–88. [Google Scholar] [CrossRef]
  39. White, John S. 1958. The limiting distribution of the serial correlation coefficient in the explosive case. Annals of Mathematical Statistics 29: 1188–97. [Google Scholar] [CrossRef]
  40. Wang, Xiaohu, and Jun Yu. 2015. Limit theory for an explosive autoregressive process. Economics Letters 126: 176–80. [Google Scholar] [CrossRef]
  41. Wang, Xiaohu, Weilin Xiao, and Jun Yu. 2019. Estimation and Inference of Fractional Continuous-Time Model with Discrete-Sampled Data. Singapore Management University Economics and Statistics Working Paper Series; Paper No. 17-2019. Singapore: SMU School of Economics. [Google Scholar]
  42. Wang, Xiaohu, and Jun Yu. 2016. Double asymptotics for explosive continuous time models. Journal of Econometrics 193: 35–53. [Google Scholar] [CrossRef]
  43. Xiao, Weilin, and Jun Yu. 2019a. Asymptotic theory for estimating the drift parameters in the fractional Vasicek model. Econometric Theory 35: 198–231. [Google Scholar] [CrossRef] [Green Version]
  44. Xiao, Weilin, and Jun Yu. 2019b. Asymptotic theory for rough fractional Vasicek models. Economics Letters 177: 26–29. [Google Scholar] [CrossRef]
  45. Yu, Jun, and Peter C. B. Phillips. 2001. A Gaussian approach for continuous time models of the short–term interest rate. Econometrics Journal 4: 210–24. [Google Scholar] [CrossRef]
1.
When a continuous record of observations is available, H and σ can be recovered without estimation errors.
Figure 1. Plots of δ H N 2 against two as functions of H.
Figure 1. Plots of δ H N 2 against two as functions of H.
Econometrics 08 00032 g001
Table 1. Summary of the asymptotic laws of α ˜ T and κ ˜ T for different ranges of H and κ .
Table 1. Summary of the asymptotic laws of α ˜ T and κ ˜ T for different ranges of H and κ .
0 < H < 1 / 2 H = 1 / 2 1 / 2 < H < 1
κ > 0 T α ˜ T α d N 0 , 2 α 2 κ T α ˜ T α d N 0 , σ 2 + 2 α 2 κ T 1 H α ˜ T α d N 0 , λ H σ 2
T κ ˜ T κ d N 0 , 2 κ T κ ˜ T κ d N 0 , 2 κ T κ ˜ T κ d N 0 , 2 κ
κ = 0 T 1 H α ˜ T α d N 0 , σ 2 ρ H T α ˜ T α d N 0 , 4 σ 2 T 1 H α ˜ T α d N 0 , σ 2 ρ H
T 2 H κ ˜ T κ d N 0 , σ 2 α 2 ϕ H T 3 / 2 κ ˜ T κ d N 0 , 12 σ 2 α 2 T 2 H κ ˜ T κ d N 0 , σ 2 α 2 ϕ H
κ < 0 T 1 H α ˜ T α d N 0 , λ H σ 2 T α ˜ T α d N 0 , σ 2 T 1 H α ˜ T α d N 0 , λ H σ 2
e κ T 2 κ κ ˜ T κ d X sin π H Y e κ T 2 κ κ ˜ T κ d η X 0 α κ + ξ e κ T 2 κ κ ˜ T κ d X sin π H Y

Share and Cite

MDPI and ACS Style

Tanaka, K.; Xiao, W.; Yu, J. Maximum Likelihood Estimation for the Fractional Vasicek Model. Econometrics 2020, 8, 32. https://doi.org/10.3390/econometrics8030032

AMA Style

Tanaka K, Xiao W, Yu J. Maximum Likelihood Estimation for the Fractional Vasicek Model. Econometrics. 2020; 8(3):32. https://doi.org/10.3390/econometrics8030032

Chicago/Turabian Style

Tanaka, Katsuto, Weilin Xiao, and Jun Yu. 2020. "Maximum Likelihood Estimation for the Fractional Vasicek Model" Econometrics 8, no. 3: 32. https://doi.org/10.3390/econometrics8030032

APA Style

Tanaka, K., Xiao, W., & Yu, J. (2020). Maximum Likelihood Estimation for the Fractional Vasicek Model. Econometrics, 8(3), 32. https://doi.org/10.3390/econometrics8030032

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop