Berry–Esseen Bounds of the Quasi Maximum Likelihood Estimators for the Discretely Observed Diffusions

Bishwal, Jaya P. N.

doi:10.3390/appliedmath2010003

Open AccessArticle

Berry–Esseen Bounds of the Quasi Maximum Likelihood Estimators for the Discretely Observed Diffusions

by

Jaya P. N. Bishwal

Department of Mathematics and Statistics, University of North Carolina at Charlotte, 376 Fretwell Bldg., 9201 University City Blvd., Charlotte, NC 28223-0001, USA

AppliedMath 2022, 2(1), 39-53; https://doi.org/10.3390/appliedmath2010003

Submission received: 2 December 2021 / Revised: 22 December 2021 / Accepted: 7 January 2022 / Published: 8 January 2022

Download Versions Notes

Abstract

For stationary ergodic diffusions satisfying nonlinear homogeneous Itô stochastic differential equations, this paper obtains the Berry–Esseen bounds on the rates of convergence to normality of the distributions of the quasi maximum likelihood estimators based on stochastic Taylor approximation, under some regularity conditions, when the diffusion is observed at equally spaced dense time points over a long time interval, the high-frequency regime. It shows that the higher-order stochastic Taylor approximation-based estimators perform better than the basic Euler approximation in the sense of having smaller asymptotic variance.

Keywords:

Itô stochastic differential equation; diffusion process; Malliavin calculus; discrete observations; almost slowly increasing experimental design; quasi maximum likelihood estimators; conditional least squares estimator; Monte Carlo methods

MSC:

60F05; 60F10; 60H10; 60J60; 62F12; 62F15; 62M05

1. Introduction and Preliminaries

Parameter estimation in diffusion processes based on discrete observations is the recent trend of investigation in financial econometrics and mathematical biology since the data available in finance and biology are high-frequency discrete, though the model is continuous. For a treatise on this subject, see Bishwal (2008, 2021) [1,2].

Consider the Itô stochastic differential equation

\begin{matrix} d X_{t} & = & f (θ, X_{t}) d t + d W_{t}, t \geq 0 \\ X_{0} & = & X^{0} \end{matrix}

(1)

where

{W_{t}, t \geq 0}

is a one-dimensional standard Wiener process,

θ \in Θ

,

Θ

is a compact subset of ℝ, f is a known real valued function defined on

Θ \times ℝ

, the unknown parameter

θ

is to be estimated on the basis of observation of the process

{X_{t}, t \geq 0}

. Let

θ_{0}

be the true value of the parameter that is in the interior of

Θ

. We assume that the process

{X_{t}, t \geq 0}

is observed at

0 = t_{0} < t_{1} < \dots < t_{n} = T

with

Δ t_{i} : = t_{i} - t_{i - 1} = \frac{T}{n} = h

and

T = d n^{1 / 2}

for some fixed real number

d > 0

. We estimate

θ

from the observations

{X_{t_{0}}, X_{t_{1}}, \dots, X_{t_{n}}}

.

The conditional least squares estimator (CLSE) of

θ

is defined as

\begin{matrix} θ_{n, T} : = arg min_{θ \in Θ} Q_{n, T} (θ) \\ where Q_{n, T} (θ) = \sum_{i = 1}^{n} \frac{{[X_{t_{i}} - X_{t_{i - 1}} - f (θ, X_{t_{i - 1}}) h]}^{2}}{Δ t_{i}} . \end{matrix}

(2)

This estimator was first studied by Dorogovcev (1976) [3], who obtained its weak consistency under some regularity conditions as

T \to \infty

and

\frac{T}{n} \to 0

. Kasonga (1988) [4] obtained the strong consistency of the CLSE under some regularity conditions as

n \to \infty

assuming that

T = d n^{1 / 2}

for some fixed real number

d > 0

. Prakasa Rao (1983) [5] obtained asymptotic normality of the CLSE as

T \to \infty

and

\frac{T}{n^{1 / 2}} \to 0

.

Florens-Zmirou (1989) [6] studied the minimum contrast estimator, based on a Euler–Maruyama-type first-order approximate discrete time scheme of the SDE (1), which is given by

Z_{t_{i}} - Z_{t_{i - 1}} = f (θ, Z_{t_{i - 1}}) (t_{i} - t_{i - 1}) + W_{t_{i}} - W_{t_{i - 1}}, i \geq 1, Z_{0} = X^{0} .

(3)

The log-likelihood function of

{Z_{t_{i}}, 0 \leq i \leq n}

is given by

L_{n, T} = C \sum_{i = 1}^{n} \frac{{[Z_{t_{i}} - Z_{t_{i - 1}} - f (θ, Z_{t_{i - 1}}) h]}^{2}}{Δ t_{i}} .

(4)

where C is a constant independent of

θ

. A contrast for the estimation of

θ

is derived from the above log-likelihood by substituting

{Z_{t_{i}}, 0 \leq i \leq n}

with

{X_{t_{i}}, 0 \leq i \leq n}

. The resulting contrast is

H_{n, T} = C \sum_{i = 1}^{n} \frac{{[X_{t_{i}} - X_{t_{i - 1}} - f (θ, X_{t_{i - 1}}) h]}^{2}}{Δ t_{i}}

(5)

and the resulting minimum contrast estimator, called the Euler–Maruyama estimator, is given by

{\overset{ˇ}{θ}}_{n, T} : = arg min_{θ \in Θ} H_{n, T} (θ)

Florens-Zmirou (1989) [6] showed the

L_{2}

-consistency of the estimator as

T \to \infty

and

\frac{T}{n} \to 0

and asymptotic normality as

T \to \infty

and

\frac{T}{n^{2 / 3}} \to 0

.

Notice that the contrast

H_{n, T}

would be the log-likelihood of

(X_{t_{i}}, 0 \leq i \leq n)

if the transition probability was

N (f (θ, x) h, h))

. This led Kessler (1997) [7] to consider Gaussian approximation of the transition density. The most natural one is achieved through choosing its mean and variance to be the mean and variance of the transition density. Thus, the transition density is approximated by

N (E (X_{t_{i}} | X_{t_{i - 1}}), h))

, which produces the contrast

K_{n, T} = C \sum_{i = 1}^{n} \frac{{[X_{t_{i}} - E (X_{t_{i}} | X_{t_{i - 1}})]}^{2}}{Δ t_{i}} .

(6)

Since the transition density is unknown, in general, there is no closed-form expression for

E (X_{t_{i}} | X_{t_{i - 1}})

. Using the stochastic Taylor formula obtained in Florens-Zmirou (1989) [6], he obtained a closed-form expression of

E (X_{t_{i}} | X_{t_{i - 1}}) .

The contrast

H_{n, T}

is an example of such an approximation when

E (X_{t_{i}} | X_{t_{i - 1}}) \approx X_{t_{i - 1}} + h f (θ, X_{t_{i - 1}})

.

The resulting minimum contrast estimator, which is also the quasi-maximum likelihood estimator (QMLE), is given by

θ_{n, T} : = arg min_{θ \in Θ} K_{n, T} (θ)

Kessler (1997) [7] showed the

L_{2}

-consistency of the estimator as

T \to \infty

and

\frac{T}{n} \to 0

and asymptotic normality as

T \to \infty

and

\frac{T}{n^{(p - 1) / p}} \to 0

for an arbitrary integer p.

Denote

μ (θ, X_{t_{i - 1}}) : = E (X_{t_{i}} | X_{t_{i - 1}}), μ (θ, x) : = E (X_{t_{i}} | X_{t_{i - 1}} = x)

(7)

which is the mean function of the transition probability distribution. Hence, the contrast is given by

K_{n, T} = C \sum_{i = 1}^{n} \frac{{[X_{t_{i}} - μ (θ, X_{t_{i - 1}}))]}^{2}}{Δ t_{i}} .

(8)

If continuous observation of

{X_{t}}

on the interval

[0, T]

were available, then the likelihood function of

θ

would be

L_{T} (θ) = exp \{\int_{0}^{T} f (θ, X_{t}) d X_{t} - \frac{1}{2} \int_{0}^{T} f^{2} (θ, X_{t}) d t\}

(9)

(see Liptser and Shiryayev (1977) [8]). Since we have discrete data, we have to approximate the likelihood to obtain the MLE. Taking Itô-type approximation of the stochastic integral and rectangle rule approximation of the ordinary integral in (9), we obtain the approximate likelihood function

{\hat{L}}_{n, T} (θ) : = exp \{\sum_{i = 1}^{n} f (θ, X_{t_{i - 1}}) (X_{t_{i}} - X_{t_{i - 1}}) - \frac{h}{2} \sum_{i = 1}^{n} f^{2} (θ, X_{t_{i - 1}})\} .

(10)

The Itô approximate maximum likelihood estimate (IAMLE) based on

{\hat{L}}_{n, T}

is defined as

{\hat{θ}}_{n, T} : = arg max_{θ \in Θ} {\hat{L}}_{n, T} (θ) .

Weak consistency and asymptotic normality of this estimator were obtained by Yoshida (1992) [9] as

T \to \infty

and

\frac{T}{n} \to 0

.

Note that the CLSE, the Euler–Maruyama estimator and the IAMLE are the same estimator (see Shoji (1997) [10]). For the Ornstein–Uhlenbeck process, Bishwal and Bose (2001) [11] studied the rates of weak convergence of approximate maximum likelihood estimators, which are of conditional least squares type. For the Ornstein–Uhlenbeck process, Bishwal (2010) [12] studied the uniform rate of weak convergence for the minimum contrast estimator, which has a close connection to the Stratonovich–Milstein scheme. Bishwal (2009) [13] studied Berry–Esseen inequalities for conditional least squares estimator in discretely observed nonlinear diffusions. Bishwal (2009) [14] studied the Stratonovich-based approximate M-estimator of discretely sampled nonlinear diffusions. Bishwal (2011) [15] studied Milstein approximation of the posterior density of diffusions. Bishwal (2010) [16] studied conditional least squares estimation in nonlinear diffusion processes based on Poisson sampling. Bishwal (2011) [17] obtained some new estimators of integrated volatility using the stochastic Taylor-type schemes, which could be useful for option pricing in stochastic volatility models; see also Bishwal (2021) [2].

Prime denotes the derivative with respect to

θ

, dot denotes the derivative with respect to x and ⋁ denotes the max symbol throughout the paper. In order to obtain a better estimator in terms of lowering variance in Monte Carlo simulation, which may have a faster rate of convergence, first, we use the algorithm proposed in Bishwal (2008) [1]. Note that the Itô integral and the Fisk–Stratonovich (FS, henceforth; Fisk, while introducing the concept of quasimartingale, had the trapezoidal approximation and Stratonovich had the midpoint approximation, converging to the same mean square limit) integral are connected by

\int_{0}^{T} f (θ, X_{t}) d X_{t} = \int_{0}^{T} f (θ, X_{t}) o d X_{t} - \frac{1}{2} \int_{0}^{T} \dot{f} (θ, X_{t}) d t,

(11)

where o is the Itô’s circle for the FS integral. We transform the Itô integral (the limit of the rectangular approximation to preserve the martingale property) in (9) to the FS integral and apply FS-type trapezoidal approximation of the stochastic integral and rectangular rule-type approximation of the Lebesgue integrals and obtain the approximate likelihood

\begin{matrix} {\tilde{L}}_{n, T} (θ) : = exp {\frac{1}{2} \sum_{i = 1}^{n} [f (θ, X_{t_{i - 1}}) + f (θ, X_{t_{i}})] (X_{t_{i}} - X_{t_{i - 1}}) \\ - \frac{h}{2} \sum_{i = 1}^{n} \dot{f} (θ, X_{t_{i - 1}}) - \frac{h}{2} \sum_{i = 1}^{n} f^{2} (θ, X_{t_{i - 1}})} \end{matrix}

(12)

The Fisk–Stratonovich approximate maximum likelihood estimator (FSAMLE) based on

{\tilde{L}}_{n, T}

is defined as

{\tilde{θ}}_{n, T} : = arg max_{θ \in Θ} {\tilde{L}}_{n, T} (θ) .

Weak consistency as

T \to \infty

and

\frac{T}{n} \to 0

and asymptotic normality as

T \to \infty

and

\frac{T}{n^{2 / 3}} \to 0

of the FSAMLE were shown in Bishwal (2008) [1]. Berry–Esseen bounds for the IAMLE and the FSAMLE for the Ornstein–Uhlenbeck processes were obtained in Bishwal and Bose (2001) [11].

We shall use the following notations:

Δ X_{i} = X_{t_{i}} - X_{t_{i - 1}}

,

Δ W_{i} = W_{t_{i}} - W_{t_{i - 1}}

, C is a generic constant independent of

h, n

and other variables (it may depend on

θ

). Throughout the paper,

\dot{f}

denotes the derivative with respect to x and

f^{'}

denotes the derivative with respect to

θ

of the function

f (θ, x)

. Suppose that

θ_{0}

denotes the true value of the parameter and

θ_{0} \in Θ

. We assume the following conditions:

Assumption 1.

(A1)

| f (θ, x) | \leq a (θ) (1 + | x |)

,

| f (θ, x) - f (θ, y) | \leq a (θ) | x - y |

.

(A2)

| f (θ, x) - f (ϕ, y) | \leq b (x) | θ - ϕ |

for all

θ, ϕ \in Θ, x, y \in ℝ

where

{sup}_{θ \in Θ} | a (θ) | = a < \infty, E | b (X^{0}) |^{r} < \infty

for any integer r.

(A3) The diffusion process X is stationary and ergodic with invariant measure ν, i.e., for any g with

E [g (\cdot)] < \infty,

\frac{1}{n} \sum_{i = 1}^{n} g (X_{t_{i}}) \to E_{ν} [g (X_{0})] a . s . a s T \to \infty a n d h \to 0 .

(A4)

{sup}_{t \geq 0} E | X_{t} |^{q} < \infty

for all

q \geq 0

.

(A5)

E | f (θ, X^{0}) - f (θ_{0}, X^{0}) |^{2} = 0 i f f θ = θ_{0} .

(A6) f is continuously differentiable function in x up to order p for all θ.

(A7)

f (\cdot, x)

and all its derivatives are three times continuously differentiable with respect to θ for all

x \in ℝ

. Moreover, these derivatives up to third order with respect to θ are of polynomial growth in x uniformly in θ.

The Fisher information is given by

0 < I (θ) : = \int_{- \infty}^{\infty} {(f^{'} (θ, x))}^{2} d ν (x) < \infty

and for any

δ > 0

, or any compact

\bar{Θ} \subset Θ

,

inf_{θ_{0} \in \bar{Θ}} sup_{| θ - θ_{0} | > δ} E_{θ_{0}} {| f^{'} (θ, X_{0}) - f^{'} (θ_{0}, X_{0}) |}^{2} > 0 .

(A8) The Malliavin covariance of the process is nondegenerate.

The Malliavin covariance matrix of a smooth random variable S is defined as

γ_{T} = \int_{0}^{T} D_{t} S [D_{t} S]

^{*} d t

, where

D_{t}

is the Malliavin derivative. The Malliavin covariance is nondegenerate if

d e t (γ_{T})

is almost surely positive and, for any

m \geq 1

, one has

∥ 1 / d e t (γ_{T}) ∥_{L^{m}} < \infty .

This, associated with the functional

ω \to X (t, ω)

, is given by

0 < σ^{2} (t) = Y_{t}^{2} \int_{0}^{t} f^{2} (θ, X_{s}) Z_{s}^{2} d s < \infty

where

Y_{t}

and

Z_{t}

, respectively, satisfy

d Y_{t} = \dot{f} (θ, X_{t}) Y_{t} d t + Y_{t} d W_{t}, Y_{0} = 1, d Z_{t} = - \dot{f} (θ, X_{t}) Z_{t} d t - Z_{t} d W_{t}, Z_{0} = 1 .

In the case of independent observations, in order to prove the validity of asymptotic expansion, one usually needs a certain regularity condition for the underlying distribution, such as the Cramér condition; see Bhattacharya and Ranga Rao (1976) [18]. This type of condition then ensures the regularity of the distribution and hence the smoothness assumption of the functional under the expectation whose martingale expansion is desired can be removed. This type of condition for dependent observations leads to the regularity of the distribution of a functional with nondegenerate Malliavin covariance, which is known in Malliavin calculus; see Ikeda and Watanabe (1989) [19] and Nualart (1995) [20]. Malliavin covariance is connected to the Hörmander condition, which is a sufficient condition for a second-order differential operator to be hypoelliptic; see Bally (1991) [21]. For operators with analytic coefficients, this condition turns out to be also necessary, but this is not true for general smooth coefficients.

More precisely, let X be a differentiable ℝ-valued Wiener functional defined on a Wiener space. Assume that there exists a functional

ψ

such that

sup_{u \in ℝ} {| u |}^{j} E [e^{i u X} X^{k} ψ] < \infty, j, k \in Z^{+} .

Thus, it is a regularity condition of the characteristic function, which is a consequence of the nondegeneracy of the Malliavin covariance in the case of Wiener functionals. The functional

ψ

, which is a random variable satisfying

0 \leq ψ \leq 1

, is a truncation functional extracting from the Wiener space, the portion on which the distribution is regular. If X is almost regular, one may take

ψ

nearly equal to one. Uniform degeneracy of the Malliavin covariance of the functional

T^{- 1 / 2} \int_{0}^{T} f (θ_{0}, X_{t}) d W_{t}

can be shown under (A8); see Yoshida (1997) [22].

Bishwal (2009) [13] obtained the rate of convergence to normality of the Itô AMLE and the Fisk–Stratonovich AMLE of the order

O (T^{- 1 / 2} ⋁ \frac{T^{2}}{n})

and

O (T^{- 1 / 2} ⋁ \frac{T^{3}}{n^{2}})

, respectively, under the regularity conditions given above with

q > 16

for (A4). We obtain the rate of convergence to normality, i.e., Berry–Esseen bound of the order

O (T^{- 1 / 2} ⋁ \frac{T^{p + 1}}{n^{p}})

for the QMLE

θ_{n, T}

for arbitrary integer p.

We need the following lemma from Michel and Pfanzagl (1971) [23] to prove our main results.

Lemma 1.

Let

ξ, ζ

and η be any three random variables on a probability space

(Ω, F, P)

with

P (η > 0) = 1

. Then, for any

ϵ > 0

, we have

(a) sup_{x \in ℝ} | P {ξ + ζ \leq x} - Φ (x) | \leq sup_{x \in ℝ} | P {ξ \leq x} - Φ (x) | + P (| ζ | > ϵ) + ϵ,

(b) sup_{x \in ℝ} | P {\frac{ξ}{η} \leq x} - Φ (x) | \leq sup_{x \in ℝ} | P {ξ \leq x} - Φ (x) | + P {| η - 1 | > ϵ} + ϵ .

2. Main Results

We start with some preliminary lemmas. Let L denote the generator of the diffusion process,

g \in C^{2} (ℝ)

L g (x) : = f (θ, x) \dot{g} (x) + \frac{1}{2} \ddot{g} (x) .

The k-th iterate of L is denoted as

L^{k}

. Its domain is

C^{2 k} (ℝ)

. We set

L^{0} = I

.

Stochastic Taylor formula (Kloeden and Platen (1992) [24]): For a

p + 1

times continuously differentiable function

g : ℝ \to ℝ

, we have for

t \in [0, T]

and

p = 1, 2, 3, \dots

g (X_{t}) = g (X_{0}) + \sum_{k = 1}^{p} \frac{t^{k}}{k!} L^{k} g (X_{0}) + \int_{0}^{t} \dots \int_{0}^{s_{2}} L^{p + 1} g (X_{s_{1}}) d s_{1} \dots d s_{p + 1} .

Lemma 2.

With

f (x) = x

, the stochastic Taylor expansion of

μ (θ, x)

is given by

μ (θ, X_{t_{i - 1}}) : = E (X_{t_{i}} | X_{t_{i - 1}}) = \sum_{k = 0}^{p} \frac{h^{k}}{k!} L^{k} f (X_{t_{i - 1}}) + R (θ, h^{p + 1}, X_{t_{i - 1}})

where R denotes a function for which there exists a constant C such that

R (θ, h^{p + 1}, X_{t_{i - 1}}) \leq h^{p + 1} C (1 + | X_{t_{i - 1}} {|)}^{C} .

Proof.

Applying the stochastic Taylor formula of Florens-Zmirou (1989, Lemma 1) [6], one obtains the result. See also Kloeden and Platen (1992) [24].

Consider the following special cases:

Euler Scheme: For

p = 1

,

μ (θ, x) = L^{0} f (x) + h L^{1} f (x) + R (θ, h^{2}, x) .

Milstein Scheme: For

p = 2

,

μ (θ, x) = L^{0} f (x) + h L^{1} f (x) + \frac{h^{2}}{2!} L^{2} f (x) + R (θ, h^{3}, x) .

Simpson Scheme: For

p = 4

,

μ (θ, x) = L^{0} f (x) + h L^{1} f (x) + \frac{h^{2}}{2!} L^{2} f (x) + \frac{h^{3}}{3!} L^{3} f (x) + \frac{h^{4}}{4!} L^{4} f (x) + R (θ, h^{5}, x) .

Boole Scheme: For

p = 6

,

μ (θ, x) = L^{0} f (x) + h L^{1} f (x) + \frac{h^{2}}{2!} L^{2} f (x) + \frac{h^{3}}{3!} L^{3} f (x) + \frac{h^{4}}{4!} L^{4} f (x) + \frac{h^{5}}{5!} L^{5} f (x) +

\frac{h^{6}}{6!} L^{6} f (x) + R (θ, h^{7}, x) .

□

Remark 1.

For

p = 1

,

μ (θ, X_{t_{i - 1}}) \approx X_{t_{i - 1}} + h f (θ, X_{t_{i - 1}}) .

This produces the CLSE. This estimator has been very well studied in the literature (see Shoji (1997) [10]).

Remark 2.

Note that the Milstein scheme is equivalent to Stratonovich approximation of the stochastic integral after converting the Itô integral to the Stratonovich integral.

Lemma 3.

For all

p \geq 2

, we have

E sup_{θ \in Θ} {|\sum_{i = 1}^{n} [μ (θ, X_{t_{i - 1}}) - X_{t_{i - 1}}] Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t|}^{2} \leq C (\frac{T^{p + 1}}{n^{p - 1}}) .

Proof.

First, we show that, for

p = 2

,

E sup_{θ \in Θ} {|\sum_{i = 1}^{n} f (θ, X_{t_{i - 1}}) Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t|}^{2} \leq C \frac{T^{4}}{n^{2}} .

We emphasize that the Itô formula is a stochastic Taylor formula of order 2. By the Itô formula, we have

\begin{matrix} f (θ_{0}, X_{t}) - f (θ_{0}, X_{t_{i - 1}}) \\ = & \int_{t_{i - 1}}^{t} \dot{f} (θ_{0}, X_{u}) d X_{u} + \frac{1}{2} \int_{t_{i - 1}}^{t} \ddot{f} (θ_{0}, X_{u}) d u \\ = & \int_{t_{i - 1}}^{t} \dot{f} (θ_{0}, X_{u}) d W_{u} + \int_{t_{i - 1}}^{t} [\dot{f} (θ_{0}, X_{u}) f (θ_{0}, X_{u}) + \frac{1}{2} \ddot{f} (θ_{0}, X_{u})] d u \\ = : & \int_{t_{i - 1}}^{t} \dot{f} (θ_{0}, X_{u}) d W_{u} + \int_{t_{i - 1}}^{t} F (θ_{0}, X_{u}) d u \end{matrix}

where

F (θ_{0}, X_{u}) = : \dot{f} (θ_{0}, X_{u}) f (θ_{0}, X_{u}) + \frac{1}{2} \ddot{f} (θ_{0}, X_{u}) .

We employ Taylor expansion in the local neighborhood of

θ_{0}

. Let

θ = θ_{0} + T^{- 1 / 2} u, u \in ℝ

. Then, we have

\begin{matrix} E sup_{θ \in Θ} {|\sum_{i = 1}^{n} f (θ, X_{t_{i - 1}}) Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t|}^{2} \\ = & E sup_{θ \in Θ} {|\sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} [f (θ, X_{t}) - f (θ, X_{t_{i - 1}})] d t|}^{2} \\ = & E sup_{u \in ℝ} {|\sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} [f (θ_{0} + T^{- 1 / 2} u, X_{t}) - f (θ_{0} + T^{- 1 / 2} u, X_{t_{i - 1}})] d t|}^{2} \\ = & E sup_{u \in ℝ} |\sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} [f (θ_{0}, X_{t}) - f (θ_{0}, X_{t_{i - 1}})] d t \\ {+ T^{- 1 / 2} u \sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} [f^{'} (\bar{θ}, X_{t}) - f^{'} (\bar{θ}, X_{t_{i - 1}})] d t|}^{2} \\ = & 2 E {|\sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} [f (θ_{0}, X_{t}) - f (θ_{0}, X_{t_{i - 1}})] d t|}^{2} \\ + 2 E sup_{u \in ℝ} {|T^{- 1 / 2} u \sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} [f^{'} (\bar{θ}, X_{t}) - f^{'} (\bar{θ}, X_{t_{i - 1}})] d t|}^{2} \\ = : & 2 E {|\sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} [f (θ_{0}, X_{t}) - f (θ_{0}, X_{t_{i - 1}})] d t|}^{2} + 2 G_{1} \\ = & E {|\sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} [\int_{t_{i - 1}}^{t} \dot{f} (θ_{0}, X_{u}) d W_{u} + \int_{t_{i - 1}}^{t} f^{'} (θ_{0}, X_{t_{i - 1}}) F (θ_{0}, X_{u}) d u] d t|}^{2} + 2 G_{1} \\ \leq & 2 E {|\sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} \int_{t_{i - 1}}^{t} \dot{f} (θ_{0}, X_{u}) d W_{u} d t|}^{2} \\ + 2 E {|\sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} \int_{t_{i - 1}}^{t} f^{'} (θ_{0}, X_{t_{i - 1}}) F (θ_{0}, X_{u}) d u d t|}^{2} + 2 G_{1} \\ = : & 2 (J_{1} + J_{2}) + 2 G_{1} \end{matrix}

where

J_{1} = : E {|\sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} \int_{t_{i - 1}}^{t} \dot{f} (θ_{0}, X_{u}) d W_{u} d t|}^{2},

J_{2} = : E {|\sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} \int_{t_{i - 1}}^{t} f^{'} (θ_{0}, X_{t_{i - 1}}) F (θ_{0}, X_{u}) d u d t|}^{2},

G_{1} = : E sup_{u \in ℝ} {|T^{- 1 / 2} u \sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} [f^{'} (\bar{θ}, X_{t}) - f^{'} (\bar{θ}, X_{t_{i - 1}})] d t|}^{2}

and

| \bar{θ} - θ_{0} | \leq | θ - θ_{0} |

. Further

\begin{matrix} E sup_{θ \in Θ} {|\sum_{i = 1}^{n} [μ (θ, X_{t_{i - 1}}) - X_{t_{i - 1}}] Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t|}^{2} \\ \leq & 2 E sup_{θ \in Θ} {|\sum_{i = 1}^{n} f (θ, X_{t_{i - 1}}) Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t|}^{2} \\ + 2 E sup_{θ \in Θ} {|\sum_{i = 1}^{n} [μ (θ, X_{t_{i - 1}}) - X_{t_{i - 1}}] Δ t_{i} - \sum_{i = 1}^{n} f (θ, X_{t_{i - 1}}) Δ t_{i}|}^{2} . \end{matrix}

By Lemma 2, we have

\begin{matrix} μ (θ, X_{t_{i - 1}}) - X_{t_{i - 1}} - h f (θ, X_{t_{i - 1}}) = \sum_{k = 2}^{p} \frac{h^{k}}{k!} L^{k} f (θ, X_{t_{i - 1}}) + R (θ, h^{p + 1}, X_{t_{i - 1}}) . \end{matrix}

Further

\begin{matrix} \sum_{i = 1}^{n} [μ (θ, X_{t_{i - 1}}) - X_{t_{i - 1}}] Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t \\ = & \sum_{i = 1}^{n} [μ (θ, X_{t_{i - 1}}) - X_{t_{i - 1}}] Δ t_{i} - \sum_{i = 1}^{n} f (θ, X_{t_{i - 1}}) Δ t_{i} + \sum_{i = 1}^{n} f (θ, X_{t_{i - 1}}) Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t \\ = & \sum_{i = 1}^{n} [\sum_{k = 2}^{p} \frac{h^{k}}{k!} L^{k} f (θ, X_{t_{i - 1}}) + R (θ, h^{p + 1}, X_{t_{i - 1}})] + \sum_{i = 1}^{n} f (θ, X_{t_{i - 1}}) Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t . \end{matrix}

Hence

\begin{matrix} E sup_{θ \in Θ} {|\sum_{i = 1}^{n} [μ (θ, X_{t_{i - 1}}) - X_{t_{i - 1}}] Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t|}^{2} \\ \leq & 4 (J_{1} + J_{2}) + 2 E sup_{θ \in Θ} {|\sum_{i = 1}^{n} [\sum_{k = 2}^{p} \frac{h^{k}}{k!} L^{k} f (θ, X_{t_{i - 1}}) + R (θ, h^{p + 1}, X_{t_{i - 1}})]|}^{2} . \end{matrix}

Observe that, with

B_{i, t} : = \int_{t_{i - 1}}^{t} f^{'} (θ_{0}, X_{t_{i - 1}}) \dot{f} (θ_{0}, X_{u}) d W_{u}, 1 \leq i \leq n,

we have

\begin{matrix} J_{1} & = & \sum_{i = 1}^{n} E {(\int_{t_{i - 1}}^{t_{i}} B_{i, t} d t)}^{2} + \sum_{j \neq i = 1}^{n} E (\int_{t_{i - 1}}^{t_{i}} B_{i, t} d t) (\int_{t_{j - 1}}^{t_{j}} B_{j, t} d t) \\ \leq & \sum_{i = 1}^{n} (t_{i} - t_{i - 1}) \int_{t_{i - 1}}^{t_{i}} E (B_{i, t}^{2}) d t \\ (the last term being zero due to the orthogonality of the integrals) \\ \leq & \sum_{i = 1}^{n} (t_{i} - t_{i - 1}) \int_{t_{i - 1}}^{t_{i}} \{\int_{t_{i - 1}}^{t} E {[f^{'} (θ_{0}, X_{t_{i - 1}}) \dot{f} (θ_{0}, X_{u})]}^{2} d u\} d t \\ \leq & C \frac{T}{n} \sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} (t - t_{i - 1}) d t (by (A 4) and (A 3)) \\ \leq & C \frac{T}{n} \sum_{i = 1}^{n} {(t_{i} - t_{i - 1})}^{2} \\ = & C \frac{T^{3}}{n^{2}} . \end{matrix}

On the other hand, with

A_{i, t} : = \int_{t_{i - 1}}^{t} f^{'} (θ_{0}, X_{t_{i - 1}}) F (θ_{0}, X_{u}) d u, 1 \leq i \leq n,

we have

\begin{matrix} J_{2} & = & E {|\sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} \int_{t_{i - 1}}^{t} f^{'} (θ_{0}, X_{t_{i - 1}}) F (θ_{0}, X_{u}) d u d t|}^{2} \\ = & E {|\sum_{i = 1}^{n} \int_{t_{i - 1}}^{t_{i}} A_{i, t} d t|}^{2} \\ = & \sum_{i = 1}^{n} E {(\int_{t_{i - 1}}^{t_{i}} A_{i, t} d t)}^{2} + \sum_{j \neq i = 1}^{n} E (\int_{t_{i - 1}}^{t_{i}} A_{i, t} d t) (\int_{t_{j - 1}}^{t_{j}} A_{j, t} d t) \\ \leq & \sum_{i = 1}^{n} (t_{i} - t_{i - 1}) E (\int_{t_{i - 1}}^{t_{i}} A_{i, t}^{2} d t) + \sum_{j \neq i = 1}^{n} {\{E {(\int_{t_{i - 1}}^{t_{i}} A_{i, t} d t)}^{2} E {(\int_{t_{j - 1}}^{t_{j}} A_{j, t} d t)}^{2}\}}^{1 / 2} \\ \leq & \sum_{i = 1}^{n} (t_{i} - t_{i - 1}) \int_{t_{i - 1}}^{t_{i}} E (A_{i, t}^{2}) d t \\ + \sum_{j \neq i = 1}^{n} {\{(t_{i} - t_{i - 1}) \int_{t_{i - 1}}^{t_{i}} E (A_{i, t}^{2}) d t (t_{j} - t_{j - 1}) \int_{t_{j - 1}}^{t_{j}} E (A_{j, t}^{2}) d t\}}^{1 / 2} . \end{matrix}

However,

E (A_{i, t}^{2}) \leq C {(t - t_{i - 1})}^{2}

using (A4) and (A3). On substitution, the last term is dominated by

\begin{matrix} C \sum_{i = 1}^{n} {(t_{i} - t_{i - 1})}^{4} + C \sum_{j \neq i = 1}^{n} {(t_{i} - t_{i - 1})}^{2} {(t_{j} - t_{j - 1})}^{2} \\ = & C \frac{T^{4}}{n^{3}} + C \frac{n (n - 1) T^{4}}{2 n^{4}} \leq C \frac{T^{4}}{n^{2}} . \end{matrix}

Thus

J_{1} + J_{2} \leq C \frac{T^{4}}{n^{2}} .

By the same method, we have

G_{1} \leq C \frac{T^{3}}{n^{2}} .

Hence

\begin{matrix} E sup_{θ \in Θ} {|\sum_{i = 1}^{n} f (θ, X_{t_{i - 1}}) Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t|}^{2} \\ \leq 2 (J_{1} + J_{2}) + 2 G_{1} \\ \leq C \frac{T^{4}}{n^{2}} . \end{matrix}

Thus, the proof for

p = 2

is complete. Next, we consider the general case

p \geq 3

. Denote

J_{3} : = E sup_{θ \in Θ} {|\sum_{i = 1}^{n} [\sum_{k = 2}^{p} \frac{h^{k}}{k!} L^{k} f (θ, X_{t_{i - 1}}) + R (θ, h^{p + 1}, X_{t_{i - 1}})]|}^{2} .

We have

\begin{matrix} E sup_{θ \in Θ} {|\sum_{i = 1}^{n} [μ (θ, X_{t_{i - 1}}) - X_{t_{i - 1}}] Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t|}^{2} \\ \leq & 2 E sup_{θ \in Θ} {|\sum_{i = 1}^{n} [μ (θ, X_{t_{i - 1}}) - X_{t_{i - 1}}] Δ t_{i} - \sum_{i = 1}^{n} f (θ, X_{t_{i - 1}}) Δ t_{i}|}^{2} \\ + 2 E sup_{θ \in Θ} {|\sum_{i = 1}^{n} f (θ, X_{t_{i - 1}}) Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t|}^{2} \\ \leq & 2 J_{3} + 4 (J_{1} + J_{2}) + 2 G_{1} . \end{matrix}

Observe that, by Lemma 2, we have

J_{3} \leq C \frac{T^{p + 1}}{n^{p - 1}} .

Thus, by combining the bounds for

J_{1}, J_{2}

,

J_{3}

and

G_{1}

, we have

E sup_{θ \in Θ} {|\sum_{i = 1}^{n} [μ (θ, X_{t_{i - 1}}) - X_{t_{i - 1}}] Δ t_{i} - \int_{0}^{T} f (θ, X_{t}) d t|}^{2} \leq C (\frac{T^{p + 1}}{n^{p - 1}}) .

□

The following lemma is from Bishwal (2008) [1].

Lemma 4.

Let

I_{T} : = \frac{1}{T I (θ_{0})} \int_{0}^{T} f^{2} (θ_{0}, X_{t}) d t .

Then, under the conditions (A1)–(A8),

sup_{θ \in Θ} E {[I_{T} (θ) - 1]}^{2} \leq C T^{- 1} .

The following lemma follows from Theorem 7 in Yoshida (1997) [22].

Lemma 5.

Let

M_{T} : = \frac{1}{\sqrt{T I (θ_{0})}} \int_{0}^{T} f (θ_{0}, X_{t}) d W_{t} .

Then, under the conditions (A1)–(A8),

sup_{x \in ℝ} |P_{θ_{0}} \{M_{T} \leq x\} - Φ (x)| \leq C T^{- 1 / 2} .

Our main result is the following theorem.

Theorem 1.

Under the conditions (A1)-(A8), for any

p \geq 1

, we have

sup_{x \in ℝ} |P_{θ} \{\sqrt{T I (θ)} (θ_{n, T} - θ) \leq x\} - Φ (x)| = O (T^{- 1 / 2} ⋁ \frac{T^{p + 1}}{n^{p}}) .

Proof.

We start with

p = 1

and

p = 2 .

Let

{\hat{l}}_{n, T} (θ) : = log {\hat{L}}_{n, T} (θ), a n d {\tilde{l}}_{n, T} (θ) : = log {\tilde{L}}_{n, T} (θ) .

By Taylor expansion, we have

{\hat{l}}_{n, T}^{'} ({\hat{θ}}_{n, T}) = {\hat{l}}_{n, T}^{'} (θ_{0}) + ({\hat{θ}}_{n, T} - θ_{0}) {\hat{l}}_{n, T}^{″} ({\bar{θ}}_{n, T})

where

|{\bar{θ}}_{n, T} - θ| \leq |{\hat{θ}}_{n, T} - θ_{0}|

. Since

{\hat{l}}_{n, T}^{'} ({\hat{θ}}_{n, T}) = 0

, hence we have

\begin{matrix} \sqrt{T I (θ_{0})} ({\hat{θ}}_{n, T} - θ_{0}) = - \frac{\frac{1}{\sqrt{T I (θ_{0})}} {\hat{l}}_{n, T}^{'} (θ_{0})}{\frac{1}{T I (θ_{0})} {\hat{l}}_{n, T}^{″} ({\bar{θ}}_{n, T})} = - \frac{\frac{1}{\sqrt{T I (θ_{0})}} \sum_{i = 1}^{n} f^{'} (θ_{0}, X_{t_{i - 1}}) Δ W_{i}}{\frac{1}{T I (θ_{0})} \sum_{i = 1}^{n} f^{″} ({\bar{θ}}_{n, T}, X_{t_{i - 1}}) Δ t_{i}} = : \frac{M_{n, T}}{V_{n, T}} \end{matrix}

Note that

V_{n, T} = \frac{1}{T I (θ_{0})} \sum_{i = 1}^{n} f^{″} ({\bar{θ}}_{n, T}, X_{t_{i - 1}}) Δ t_{i} = \frac{1}{T I (θ_{0})} \sum_{i = 1}^{n} f^{'} {({\bar{θ}}_{n, T}, X_{t_{i - 1}})}^{2} Δ t_{i} .

However,

E {(I_{T} - 1)}^{2} \leq C T^{- 1}

from Lemma 4 (see also Pardoux and Veretennikov (2001) [25] and Yoshida (2011) [26]). It can be shown that

E {(V_{n, T} - I_{T})}^{2} \leq C \frac{T}{n}

(see Altmeyer and Chorowski (2018) [27]). Hence

E {(V_{n, T} - 1)}^{2} = E {[(V_{n, T} - I_{T}) + (I_{T} - 1)]}^{2} \leq C (T^{- 1} ⋁ \frac{T}{n}) .

Further, by Lemma 1 (b), we have

\begin{matrix} sup_{x \in ℝ} |P_{θ} \{\sqrt{T I (θ)} ({\hat{θ}}_{n, T} - θ) \leq x\} - Φ (x)| \\ = & sup_{x \in ℝ} |P_{θ} \{\frac{M_{n, T}}{V_{n, T}} \leq x\} - Φ (x)| \\ = & sup_{x \in ℝ} |P_{θ} \{M_{n, T} \leq x\} - Φ (x)| + P_{θ} \{|V_{n, T} - 1| \geq ϵ\} + ϵ \\ \leq & C (T^{- 1 / 2} ⋁ \frac{T^{2}}{n}) + ϵ^{- 2} C (T^{- 1} ⋁ \frac{T}{n}) + ϵ . \end{matrix}

since, by Lemmas 1 (a) and 5, we have

\begin{matrix} sup_{x \in ℝ} |P_{θ} \{M_{n, T} \leq x\} - Φ (x)| \\ \leq & sup_{x \in ℝ} |P_{θ} \{M_{T} \leq x\} - Φ (x)| + P_{θ} \{|M_{n, T} - M_{T}| \geq ϵ\} + ϵ \\ \leq & C T^{- 1 / 2} + ϵ^{- 2} E {|M_{n, T} - M_{T}|}^{2} + ϵ \\ \leq & C (T^{- 1 / 2} ⋁ \frac{T^{2}}{n}) + ϵ^{- 2} C \frac{T}{n} + ϵ . \end{matrix}

Choosing

ϵ = T^{- 1 / 2}

, we have the result.

On the other hand, by Taylor expansion, we have

{\tilde{l}}_{n, T}^{'} ({\tilde{θ}}_{n, T}) = {\tilde{l}}_{n, T}^{'} (θ_{0}) + ({\tilde{θ}}_{n, T} - θ_{0}) {\tilde{l}}_{n, T}^{″} ({\bar{\bar{θ}}}_{n, T})

where

|{\bar{\bar{θ}}}_{n, T} - θ| \leq |{\tilde{θ}}_{n, T} - θ_{0}| .

Since

{\tilde{l}}_{n, T}^{'} ({\tilde{θ}}_{n, T}) = 0

, hence we have

\begin{matrix} \sqrt{T I (θ_{0})} ({\tilde{θ}}_{n, T} - θ_{0}) \\ = & - \frac{\frac{1}{\sqrt{T I (θ_{0})}} {\tilde{l}}_{n, T}^{'} (θ_{0})}{\frac{1}{T I (θ_{0})} {\tilde{l}}_{n, T}^{″} ({\bar{\bar{θ}}}_{n, T})} \\ = & - \{\frac{1}{\sqrt{T I (θ_{0})}} \{\frac{1}{2} \sum_{i = 1}^{n} [f^{'} (θ_{0}, X_{t_{i - 1}}) + f^{'} (θ_{0}, X_{t_{i}})] Δ W_{i} \\ + \frac{1}{2} \sum_{i = 1}^{n} [f^{'} (θ_{0}, X_{t_{i - 1}}) + f^{'} (θ_{0}, X_{t_{i}})] \int_{t_{i - 1}}^{t_{i}} f^{'} (θ_{0}, X_{t}) d t \\ - \frac{h}{2} \sum_{i = 1}^{n} {\dot{f}}^{'} (θ_{0}, X_{t_{i - 1}}) - h \sum_{i = 1}^{n} f (θ_{0}, X_{t_{i - 1}}) f^{'} (θ_{0}, X_{t_{i - 1}})\}\} \\ \times \{\frac{1}{T I (θ_{0})} \{\frac{1}{2} \sum_{i = 1}^{n} [f^{″} ({\bar{\bar{θ}}}_{n, T}, X_{t_{i - 1}}) + f^{″} ({\bar{\bar{θ}}}_{n, T}, X_{t_{i - 1}})] Δ W_{i} \\ - \frac{1}{2} \sum_{i = 1}^{n} [f^{'} ({\bar{\bar{θ}}}_{n, T}, X_{t_{i - 1}}) + f^{'} ({\bar{\bar{θ}}}_{n, T}, X_{t_{i}})] \int_{t_{i - 1}}^{t_{i}} f^{'} ({\bar{\bar{θ}}}_{n, T}, X_{t}) d t \end{matrix}

\begin{matrix} - \frac{h}{2} \sum_{i = 1}^{n} {\dot{f}}^{″} ({\bar{\bar{θ}}}_{n, T}, X_{t_{i - 1}}) - h \sum_{i = 1}^{n} f ({\bar{\bar{θ}}}_{n, T}, X_{t_{i - 1}}) f^{″} ({\bar{\bar{θ}}}_{n, T}, X_{t_{i - 1}}) \\ {- h \sum_{i = 1}^{n} f^{' 2} ({\bar{\bar{θ}}}_{n, T}, X_{t_{i - 1}})\}\}}^{- 1} \\ = : & {R_{n, T}} {S_{n, T}}^{- 1} . \end{matrix}

Let

lim S_{n, T} = S_{T}

in

L_{2}

as

T \to \infty

and

\frac{T}{n} \to 0

. Similar to Lemma 4, it can be shown that

E {(S_{T} - 1)}^{2} \leq C T^{- 1}

(see also Pardoux and Veretennikov (2001) [25] and Yoshida (2011) [26]). It can be shown that

E {(S_{n, T} - S_{T})}^{2} \leq C \frac{T}{n}

(see Altmeyer and Chorowski (2018) [27]). Hence

E {(S_{n, T} - 1)}^{2} = E {[(S_{n, T} - S_{T}) + (S_{T} - 1)]}^{2} \leq C (T^{- 1} ⋁ \frac{T}{n}) .

Thus, by Lemma 1 (b), we have

\begin{matrix} sup_{x \in ℝ} |P_{θ} \{\sqrt{T I (θ)} ({\tilde{θ}}_{n, T} - θ) \leq x\} - Φ (x)| \\ = & sup_{x \in ℝ} |P_{θ} \{\frac{R_{n, T}}{S_{n, T}} \leq x\} - Φ (x)| \\ = & sup_{x \in ℝ} |P_{θ} \{R_{n, T} \leq x\} - Φ (x)| + P_{θ} \{|S_{n, T} - 1| \geq ϵ\} + ϵ \\ \leq & C (T^{- 1 / 2} ⋁ \frac{T^{3}}{n^{2}}) + ϵ^{- 2} C (T^{- 1} ⋁ \frac{T}{n}) + ϵ . \end{matrix}

since, by Lemmas 1 (a) and 5, we have

\begin{matrix} sup_{x \in ℝ} |P_{θ} \{R_{n, T} \leq x\} - Φ (x)| \\ \leq & sup_{x \in ℝ} |P_{θ} \{M_{T} \leq x\} - Φ (x)| + P_{θ} \{|R_{n, T} - M_{T}| \geq ϵ} + ϵ \\ \leq & C T^{- 1 / 2} + ϵ^{- 2} E {|R_{n, T} - M_{T}|}^{2} + ϵ \\ \leq & C T^{- 1 / 2} + ϵ^{- 2} C \frac{T^{3}}{n^{2}} + ϵ . \end{matrix}

Choosing

ϵ = T^{- 1 / 2}

, we have the result.

Now, we study the general case for arbitrary p. By Taylor expansion, we have

K_{n, T}^{'} (θ_{n, T}) = K_{n, T}^{'} (θ_{0}) + (θ_{n, T} - θ_{0}) K_{n, T}^{″} ({\bar{\bar{\bar{θ}}}}_{n, T})

where

|{\bar{\bar{\bar{θ}}}}_{n, T} - θ| \leq |θ_{n, T} - θ_{0}|

. Since

K_{n, T}^{'} (θ_{n, T}) = 0

, hence we have

\begin{matrix} \sqrt{T I (θ_{0})} (θ_{n, T} - θ_{0}) = - \frac{\frac{1}{\sqrt{T I (θ_{0})}} K_{n, T}^{'} (θ_{0})}{\frac{1}{T I (θ_{0})} K_{n, T}^{″} ({\bar{\bar{\bar{θ}}}}_{n, T})} = - \frac{\frac{1}{\sqrt{T I (θ_{0})}} \sum_{i = 1}^{n} m^{'} (θ_{0}, X_{t_{i - 1}}) Δ W_{i}}{\frac{1}{T I (θ_{0})} \sum_{i = 1}^{n} m^{″} ({\bar{\bar{\bar{θ}}}}_{n, T}, X_{t_{i - 1}}) Δ t_{i}} = : \frac{N_{n, T}}{U_{n, T}} \end{matrix}

Note that

U_{n, T} = \frac{1}{T I (θ_{0})} \sum_{i = 1}^{n} m^{″} ({\bar{\bar{\bar{θ}}}}_{n, T}, X_{t_{i - 1}}) Δ t_{i} = \frac{1}{T I (θ_{0})} \sum_{i = 1}^{n} m^{'} {({\bar{\bar{\bar{θ}}}}_{n, T}, X_{t_{i - 1}})}^{2} Δ t_{i} .

Let

lim U_{n, T} = U_{T}

in

L_{2}

as

T \to \infty

and

\frac{T}{n} \to 0

. Similar to Lemma 4, it can be shown that

E {(U_{T} - 1)}^{2} \leq C T^{- 1}

(see also Pardoux and Veretennikov (2001) [25] and Yoshida (2011) [26]). It can be shown that

E [{(U_{n, T} - U_{T})}^{2} \leq C \frac{T}{n}

(see Altmeyer and Chorowski (2018) [27]). Hence

E {(U_{n, T} - 1)}^{2} = E {[(U_{n, T} - U_{T}) + (U_{T} - 1)]}^{2} \leq C (T^{- 1} ⋁ \frac{T}{n}) .

Further, by Lemma 1 (b), we have

\begin{matrix} sup_{x \in ℝ} |P_{θ} \{\sqrt{T I (θ)} (θ_{n, T} - θ) \leq x\} - Φ (x)| \\ = & sup_{x \in ℝ} |P_{θ} \{\frac{N_{n, T}}{U_{n, T}} \leq x\} - Φ (x)| \\ = & sup_{x \in ℝ} |P_{θ} \{N_{n, T} \leq x\} - Φ (x)| + P_{θ} \{|U_{n, T} - 1| \geq ϵ\} + ϵ \\ \leq & C (T^{- 1 / 2} ⋁ \frac{T^{p + 1}}{n^{p}}) + ϵ^{- 2} C (T^{- 1} ⋁ \frac{T}{n}) + ϵ . \end{matrix}

since, by Lemmas 1 (a) and 5, we have

\begin{matrix} sup_{x \in ℝ} |P_{θ} \{N_{n, T} \leq x\} - Φ (x)| \\ \leq & sup_{x \in ℝ} |P_{θ} \{M_{T} \leq x\} - Φ (x)| + P_{θ} \{|N_{n, T} - M_{T}| \geq ϵ\} + ϵ \\ \leq & C T^{- 1 / 2} + ϵ^{- 2} E {|N_{n, T} - M_{T}|}^{2} + ϵ \\ \leq & C T^{- 1 / 2} + ϵ^{- 2} C \frac{T^{p + 1}}{n^{p}} + ϵ . \end{matrix}

Choosing

ϵ = T^{- 1 / 2}

, we have the result. □

Remark 3.

With

p = 1

, for the Euler scheme, which produces the conditional least squares estimator, one obtains the rate

O (T^{- 1 / 2} ⋁ \frac{T^{2}}{n}) .

With

p = 2

, for the Milstein scheme, one obtains the rate

O (T^{- 1 / 2} ⋁ \frac{T^{3}}{n^{2}}) .

With

p = 4

, for the Simpson scheme, one obtains the rate

O (T^{- 1 / 2} ⋁ \frac{T^{5}}{n^{4}}) .

With

p = 6

, for the Boole scheme, one obtains the rate

O (T^{- 1 / 2} ⋁ \frac{T^{7}}{n^{6}}) .

Thus, the higher the p, the sharper the bound. Thus, the Itô/Euler scheme gives the first-order QMLE, the Milstein/Stratonovich scheme produces the second-order QMLE, the Simpson scheme produces the fourth-order QMLE and the Boole scheme produces the sixth-order QMLE. See Bishwal (2011) [28] for a connection of this area to the stochastic moment problem and hedging of generalized Black–Scholes options.

3. Example

Consider the stochastic differential equation

\begin{matrix} d X_{t} & = & θ \frac{X_{t}}{\sqrt{1 + X_{t}^{2}}} d t + d W_{t}, t \geq 0 \\ X_{0} & = & x_{0} \end{matrix}

The solution to the above SDE is called the hyperbolic diffusion process because it has a hyperbolic stationary distribution when

θ < 0

. The process has nonlinear drift and the process is stationary and ergodic, which distinguishes this from a linear drift case, such as the Ornstein–Uhlenbeck process and the Cox–Ingersoll–Ross process, which have linear drift. This model verifies assumption (A3). In fact, the stationary density is proportional to

exp (θ \sqrt{1 + X_{t}^{2}})

. It is not possible to calculate the conditional expectation for the hyperbolic diffusion process and hence one needs a higher-order Taylor expansion approach.

Remark 4

(Concluding Remark). It would be interesting to extend the results of the paper to diffusions with jumps using the strong stochastic Taylor expansion with jumps results in Chapter 6 of Kloeden and Bruti-Liberati (2010) [29].

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares no conflict of interest.

References

Bishwal, J.P.N. Parameter Estimation in Stochastic Differential Equations; Lecture Notes in Mathematics; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
Bishwal, J.P.N. Parameter Estimation in Stochastic Volatility Models; Springer Nature Switzerland AG: Cham, Switzerland, 2021; (forthcoming). [Google Scholar]
Dorogovcev, A.J. The consistency of an estimate of a parameter of a stochastic differential equation. Theory Prob. Math. Stat. 1976, 10, 73–82. [Google Scholar]
Kasonga, R.A. The consistency of a nonlinear least squares estimator from diffusion processes. Stoch. Proc. Appl. 1988, 30, 263–275. [Google Scholar] [CrossRef]
Prakasa Rao, B.L.S. Asymptotic theory for non-linear least squares estimator for diffusion processes. Math. Oper. Stat. Ser. Stat. 1983, 14, 195–209. [Google Scholar] [CrossRef]
Florens-Zmirou, D. Approximate discrete time schemes for stiatistics of diffusion processes. Statistics 1989, 20, 547–557. [Google Scholar] [CrossRef]
Kessler, M. Estimation of an ergodic diffusion from discrete observations. Scand. J. Stat. 1997, 24, 211–229. [Google Scholar] [CrossRef]
Liptser, R.S.; Shiryayev, A.N. Statistics of Random Processes I; Springer: New York, NY, USA, 1977. [Google Scholar]
Yoshida, N. Estimation for diffusion processes from discrete observations. J. Multivar. Anal. 1992, 41, 220–242. [Google Scholar] [CrossRef]
Shoji, I. A note on asymptotic properties of estimator derived from the Euler method for diffusion processes at discrete times. Stat. Probab. Lett. 1997, 36, 153–159. [Google Scholar] [CrossRef]
Bishwal, J.P.N.; Bose, A. Rates of convergence of approximate maximum likelihood estimators in the Ornstein-Uhlenbeck process. Comput. Math. Appl. 2001, 42, 23–38. [Google Scholar] [CrossRef]
Bishwal, J.P.N. Uniform rate of weak convergence for the minimum contrast estimator in the Ornstein-Uhlenbeck process. Methodol. Comput. Appl. Probab. 2010, 12, 323–334. [Google Scholar] [CrossRef]
Bishwal, J.P.N. Berry-Esseen inequalities for discretely observed diffusions. Monte Carlo Methods Appl. 2009, 15, 229–239. [Google Scholar] [CrossRef]
Bishwal, J.P.N. M-Estimation for discretely sampled diffusions. Theory Stoch. Process. 2009, 15, 62–83. [Google Scholar]
Bishwal, J.P.N. Milstein approximation of posterior density of diffusions. Int. J. Pure Appl. Math. 2011, 68, 403–414. [Google Scholar]
Bishwal, J.P.N. Conditional least squares estimation in diffusion processes based on Poisson sampling. J. Appl. Probab. Stat. 2010, 5, 169–180. [Google Scholar]
Bishwal, J.P.N. Some new estimators of integrated volatility. Am. Open J. Stat. 2011, 1, 74–80. [Google Scholar] [CrossRef][Green Version]
Bhattacharya, R.N.; Ranga Rao, R. Normal Approximation and Asymptotic Expansion; Wiley: New York, NY, USA, 1976. [Google Scholar]
Ikeda, N.; Watanabe, S. Stochastic Differential Equations and Diffusion Processes, 2nd ed.; North-Holland: Amsterdam, The Netherlands; Kodansha Ltd.: Tokyo, Japan, 1989. [Google Scholar]
Nualart, D. Malliavin Calculus and Related Topics; Springer: Berlin, Germany, 1995. [Google Scholar]
Bally, V. On the connection between the Malliavin covariance matrix and Hörmander condition. J. Funct. Anal. 1991, 96, 219–255. [Google Scholar] [CrossRef]
Yoshida, M. Malliavin calculus and asymptotic expansion for martingales. Probab. Theory Relat. Fields 1997, 109, 301–342. [Google Scholar] [CrossRef]
Michel, R.; Pfanzagl, J. The accuracy of the normal approximation for minimum contrast estimate. Zeit. Wahr. Verw. Gebiete 1971, 18, 73–84. [Google Scholar] [CrossRef]
Kloeden, P.E.; Platen, E. Numerical Solution of Stochastic Differential Equations; Springer: Berlin, Germany, 1992. [Google Scholar]
Pardoux, E.; Veretennikov, A.Y. On the Poisson equation and diffusion equation I. Ann. Probab. 2001, 29, 1061–1085. [Google Scholar] [CrossRef]
Yoshida, N. Polynomial type large deviation inequalities and quasi-likelihood analysis for stochastic differential equations. Ann. Inst. Stat. Math. 2011, 63, 431–479. [Google Scholar] [CrossRef]
Altmeyer, R.; Chorowski, J. Estimation error for occupation functionals of stationary Markov processes. Stoch. Proc. Appl. 2018, 128, 1830–1848. [Google Scholar] [CrossRef]
Bishwal, J.P.N. Stochastic moment problem and hedging of generalized Black-Scholes options. Appl. Numer. Math. 2011, 61, 1271–1280. [Google Scholar] [CrossRef]
Kloeden, P.E.; Bruti-Liberati, N. Numerical Solution of Stochastic Differential Equations with Jumps in Finance; Springer: Berlin, Germany, 2010. [Google Scholar]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bishwal, J.P.N. Berry–Esseen Bounds of the Quasi Maximum Likelihood Estimators for the Discretely Observed Diffusions. AppliedMath 2022, 2, 39-53. https://doi.org/10.3390/appliedmath2010003

AMA Style

Bishwal JPN. Berry–Esseen Bounds of the Quasi Maximum Likelihood Estimators for the Discretely Observed Diffusions. AppliedMath. 2022; 2(1):39-53. https://doi.org/10.3390/appliedmath2010003

Chicago/Turabian Style

Bishwal, Jaya P. N. 2022. "Berry–Esseen Bounds of the Quasi Maximum Likelihood Estimators for the Discretely Observed Diffusions" AppliedMath 2, no. 1: 39-53. https://doi.org/10.3390/appliedmath2010003

APA Style

Bishwal, J. P. N. (2022). Berry–Esseen Bounds of the Quasi Maximum Likelihood Estimators for the Discretely Observed Diffusions. AppliedMath, 2(1), 39-53. https://doi.org/10.3390/appliedmath2010003

Article Menu

Berry–Esseen Bounds of the Quasi Maximum Likelihood Estimators for the Discretely Observed Diffusions

Abstract

1. Introduction and Preliminaries

2. Main Results

3. Example

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI