1. Introduction
Over the past twenty years, there has been increasing interest in the nonlinear time series literature; the monograph by Tong [1], for example, gives a good account of nonlinear time series models. Compared with linear models, studying the properties of estimators in nonlinear time series models is technically more complex and difficult. In this paper, we investigate the properties of estimators in nonlinear autoregressive processes.
Throughout this paper, we always assume that $\{\varepsilon_t\}$ is a sequence of independent and identically distributed random variables with mean zero and finite variance $\sigma^2$, and that $\{X_t\}$ is a sequence of strictly stationary real random variables satisfying the nonlinear autoregressive process of order $p$,
$$X_t = f_\theta(X_{t-1}, \ldots, X_{t-p}) + \varepsilon_t \qquad (1)$$
for some parameter $\theta$, where $f_\theta$ is a family of known measurable functions from $\mathbb{R}^p$ to $\mathbb{R}$. Obviously, the errors $\{\varepsilon_t\}$ are independent of the past observations $(X_{t-1}, \ldots, X_{t-p})$.
In recent years, many authors have studied the properties of estimators for the error sequence. One line of research concerns the error density estimator. For example, Liebscher [2] proved the law of the logarithm and the law of the iterated logarithm of the M-estimator in nonlinear autoregressive processes of order p with independent errors. Cheng and Sun [3] studied the goodness-of-fit test of the errors in nonlinear autoregressive processes of order p with independent and identically distributed errors. Fu and Yang [4] obtained the asymptotic normality of error kernel density estimators in pth-order nonlinear autoregressive processes with independent and identically distributed errors. Cheng [5] obtained the asymptotic distribution of the maximum of a suitably normalized deviation of the density estimator from the expectation of the kernel error density. Li [6] established the asymptotic normality of the norms of error density estimators in pth-order nonlinear autoregressive processes with independent and identically distributed errors. Kim et al. [7] considered the goodness-of-fit test of the errors in nonlinear autoregressive processes of order p with stationary mixing errors. Cheng [8] established the uniform strong consistency of the classical Glivenko–Cantelli theorem for the residual-based empirical error in pth-order nonlinear autoregressive processes with independent and identically distributed errors. Liu and Zhang [9] established the law of the iterated logarithm for error density estimators in pth-order nonlinear autoregressive processes with independent and identically distributed errors.
The other line of research concerns the error variance estimator. Cheng [10] obtained the consistency and asymptotic normality of the variance estimator in pth-order nonlinear autoregressive processes with independent and identically distributed errors. To the best of our knowledge, there are few results on error variance estimators beyond Cheng [10], and none on the almost sure central limit theorem for the error variance estimator; therefore, we study the almost sure central limit theorem for the error variance estimator in this paper.
The almost sure central limit theorem (ASCLT, for short) was first introduced independently by Brosamler [11] and Schatte [12]. Since then, many interesting results have been discovered in this field. The classical ASCLT for a sequence $\{\xi_n\}$ of i.i.d. random variables with mean zero and variance $\sigma^2$ states that, for all $x$,
$$\lim_{n \to \infty} \frac{1}{D_n} \sum_{k=1}^{n} d_k \, I\left\{\frac{S_k}{\sigma\sqrt{k}} \leq x\right\} = \Phi(x) \quad \text{a.s.} \qquad (2)$$
with the logarithmic averages $d_k = 1/k$ and $D_n = \sum_{k=1}^{n} d_k \sim \log n$, where $S_k = \xi_1 + \cdots + \xi_k$. However, logarithmic averaging is not the only weighting providing a.s. convergence for partial sums of i.i.d. random variables: Peligrad and Révész [13] and Berkes and Csáki [14] showed that (2) also holds for more general weight sequences. Extending these results, Hörmann [15], Tong et al. [16], Miao [17], Li [18], Zhang [19,20], Wu and Jiang [21], and Li and Zhang [22,23,24] showed that the a.s. limit (2) holds for any weight sequence $\{d_k\}$ satisfying a mild growth condition similar to Kolmogorov's condition in the law of the iterated logarithm.
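The logarithmic averaging behind the ASCLT can be illustrated numerically. The sketch below (sample size, seed, and the averaging over several independent paths are arbitrary choices for illustration, not part of the paper's arguments) computes the logarithmically weighted empirical distribution of the normalized partial sums of i.i.d. standard normal variables at x = 0, where the a.s. limit is Φ(0) = 0.5:

```python
import numpy as np

# Illustration of logarithmic averaging: for each path, compute
# (1/log n) * sum_{k<=n} (1/k) * 1{ S_k / sqrt(k) <= x },
# which converges a.s. to Phi(x). Convergence is slow (rate ~ 1/log n),
# so we also average over several independent paths for a stabler value.
rng = np.random.default_rng(0)
n_steps, n_paths, x = 20000, 50, 0.0   # Phi(0) = 0.5

log_avgs = []
for _ in range(n_paths):
    eps = rng.normal(size=n_steps)            # i.i.d., mean 0, variance 1
    S = np.cumsum(eps)                        # partial sums S_k
    k = np.arange(1, n_steps + 1)
    ind = (S / np.sqrt(k) <= x).astype(float) # indicator 1{S_k/sqrt(k) <= x}
    log_avgs.append(float(np.sum(ind / k) / np.log(n_steps)))

mean_log_avg = float(np.mean(log_avgs))
print(f"mean logarithmic average at x=0: {mean_log_avg:.3f} (limit 0.5)")
```

The per-path values fluctuate noticeably even at this sample size, which reflects the very slow, logarithmic rate of the ASCLT.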
The paper is organized as follows. In Section 1, the significance and background of the research are introduced. Some assumptions and the main results are stated in Section 2. Several useful lemmas are listed in Section 3. The proofs are given in Section 4. Examples are stated in Section 5. In the sequel, $C$ denotes a generic positive constant that may differ at each appearance, $I(A)$ denotes the indicator function of a set $A$, and $\Phi$ denotes the distribution function of the standard normal random variable.
2. Main Results
The goal of this paper is to study the properties of the estimator of the error variance $\sigma^2$ by means of the observations $X_1, \ldots, X_n$ in model (1). The main difficulty is that the errors $\varepsilon_t$ are not observed, the structure of the parameter estimation is complex, and the residuals must be constructed; we use Taylor's expansion, among many other techniques, to deal with this, which is the main contribution of this paper. We proceed in the following steps. Firstly, we compute an estimator $\hat{\theta}_n$ of the unknown parameter $\theta$. Secondly, based on the estimator $\hat{\theta}_n$ and model (1), we calculate the residuals
$$\hat{\varepsilon}_t = X_t - f_{\hat{\theta}_n}(X_{t-1}, \ldots, X_{t-p}).$$
Finally, using these residuals, we estimate the error variance $\sigma^2$ by the mean of the squared residuals,
$$\hat{\sigma}_n^2 = \frac{1}{n} \sum_{t=1}^{n} \hat{\varepsilon}_t^2.$$
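The three-step procedure above can be sketched numerically. The model $f_\theta(x) = \theta x/(1+x^2)$, the crude grid-search least squares estimator, and all parameter values in the following Python sketch are illustrative assumptions, not the paper's actual construction:

```python
import numpy as np

# Hypothetical first-order nonlinear AR model: X_t = theta*X_{t-1}/(1+X_{t-1}^2) + eps_t.
# Step 1: estimate theta; Step 2: form residuals; Step 3: average squared residuals.
rng = np.random.default_rng(1)
theta_true, sigma_true, n = 0.8, 0.5, 5000

def f(theta, x):
    return theta * x / (1.0 + x ** 2)

# simulate the stationary process
X = np.zeros(n)
eps = rng.normal(0.0, sigma_true, size=n)
for t in range(1, n):
    X[t] = f(theta_true, X[t - 1]) + eps[t]

# Step 1: least squares estimator of theta (grid search, for clarity only)
grid = np.linspace(-1.5, 1.5, 3001)
sse = [np.sum((X[1:] - f(th, X[:-1])) ** 2) for th in grid]
theta_hat = float(grid[int(np.argmin(sse))])

# Step 2: residuals based on theta_hat and the model
resid = X[1:] - f(theta_hat, X[:-1])

# Step 3: error variance estimator = mean of squared residuals
sigma2_hat = float(np.mean(resid ** 2))
print(theta_hat, sigma2_hat)
```

With these choices the true error variance is $\sigma^2 = 0.25$, and the residual-based estimate is close to it at this sample size.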
Before giving the main results, we need the following basic assumptions for model (1), which will be used throughout the paper. For $t \geq 1$, write $Y_{t-1} = (X_{t-1}, \ldots, X_{t-p})$, so that model (1) reads $X_t = f_\theta(Y_{t-1}) + \varepsilon_t$. By the fact that the errors $\{\varepsilon_t\}$ are independent of the past observations, we conclude that $\varepsilon_t$ is independent of $Y_{t-1}$.
Assumption 1. Let be an open neighborhood of θ. For any , , , assume that , where and for each .
Assumption 2. Let be a strongly consistent estimator of θ satisfying the following law of the iterated logarithm (5), where C is a positive constant.
Remark 1. By Corollary 2.2 of Klimko and Nelson [25], we know that the least squares estimator for a stochastic process satisfies (5) under some suitable conditions. For first-order autoregressive processes, Wang et al. [26] proved that the least squares estimator of the unknown parameters satisfies (5). For smooth threshold autoregressive processes, Chan and Tong [27] showed that the conditional least squares estimators of the unknown parameters satisfy (5). For general nonlinear autoregressive processes of order p, Liebscher [2] established M-estimators of the unknown parameters that satisfy (5), and Yao [28] obtained (5) for least squares estimators of nonlinear autoregressive processes.
Now, we state the main result, the almost sure central limit theorem for the error variance estimator.
Theorem 1. Suppose that is a sequence of positive numbers satisfying the following conditions:
- (C1)
for some , where .
- (C2)
, , for any .
For model (1), under Assumptions 1 and 2, if , for all , we have (6).

Corollary 1. Let with and and for some constant . Denote . Then, under the assumptions of Theorem 1, (6) also holds.
Remark 2. If the conditions (C1) and (C2) of Theorem 1 are satisfied for some sequence , then they are also satisfied for any other sequence , provided that is differentiable and is uniformly continuous on for some . Typical examples are , , with some suitable .
Remark 3. It is easy to show that , where is slowly varying at infinity and , satisfies the conditions and . Typical examples include ; ; .
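For the classical weights $d_k = 1/k$ mentioned above, the normalizers $D_n = \sum_{k \leq n} d_k$ are the harmonic numbers, which grow like $\log n$ plus Euler's constant. A quick numerical check (an illustration only):

```python
import numpy as np

# Harmonic numbers H_n = sum_{k<=n} 1/k satisfy H_n ~ log n + gamma,
# where gamma ~ 0.5772 is the Euler-Mascheroni constant; this is the
# classical logarithmic-averaging normalization D_n ~ log n.
n = 10 ** 6
k = np.arange(1, n + 1)
D_n = float(np.sum(1.0 / k))
print(D_n, float(np.log(n)))
```

The difference $D_n - \log n$ is already within $10^{-6}$ of Euler's constant at $n = 10^6$.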
3. Preliminary Lemmas
Some useful lemmas needed to prove the main result are given in this section.
Lemma 1 (Hall and Heyde [
29], Theorem 2.11, P.23)
. Let and , denote the differences of the sequences . If is a martingale and , then there exists a constant C depending only on p such that .

Lemma 2. For , , , for any , one can obtain .

Proof. The proof of Lemma 2 follows directly from Assumption 1 and the Hölder inequality. □
Lemma 3. Assume that is a sequence of random variables satisfying the ASCLT with the weights defined as in Theorem 1, that is, . Let be a sequence of random variables converging almost surely to zero. Then, also satisfies the ASCLT, that is, .

Proof. For fixed
and
, recall that
satisfies the ASCLT, then we have
and
Then, we can conclude that
and
Noting that is a sequence of random variables converging almost surely to zero and using the arbitrariness of , the desired conclusion follows from the above discussion. □
Lemma 4 (Zhang [
30], Lemma 2.10, P.391)
. Let be a sequence of uniformly bounded random variables and , be defined as in Theorem 1. If there exist constants and and a sequence of positive numbers such that and , then .

Lemma 5. Let be a sequence of independent and identically distributed random variables with mean zero and finite variance . Let , be defined as in Theorem 1. Then, for all ,
Proof. Denote
. Suppose that
f is a bounded Lipschitz function. By the classical central limit theorem, we have
By the conclusions in
Section 2 of Peligrad and Shao [
31] and Theorem 7.1 of Billingsley [
32], we know that (
7) is equivalent to
Hence, to prove (
7), it suffices to show that
For convenience, let
. Notice that
are independent, both
f and
are bounded, then we conclude that for
,
then by Lemma 4 with
and
, (
8) holds, and therefore, the proof of (
7) is completed. □
Lemma 6. Under the assumptions of Theorem 1, for any , we have .

Proof. Let
,
. By Lemma 2 and the Markov inequality, for any
, it is easy to see that
Then by the Borel–Cantelli lemma, we obtain
Similarly, by Lemma 2 and the Markov inequality, for any
, one can obtain
By the Borel–Cantelli lemma, it follows that
Then combining (
9) with (
10), for
, one can obtain
Thus, the proof of Lemma 6 is completed. □
Lemma 7. Under the assumptions of Theorem 1, for any , we have .

Proof. By Lemma 2 and the Markov inequality, for any
, it is easy to see that
By the Borel–Cantelli lemma, one can obtain
The proof of Lemma 7 is completed. □
Lemma 8. Under the assumptions of Theorem 1, for any , we have .

Proof. Let
be the
-algebra generated by the random variables
. By the fact that
and
are independent, it is easy to compute that the process
is a martingale. By Lemmas 1 and 2, for some
, we know
By the Markov inequality and (
11), for any
, it is easy to obtain
By the Borel–Cantelli lemma, we can obtain
The proof of Lemma 8 is completed. □
Lemma 9. Under the assumptions of Theorem 1, for any , we have .

Proof. Let
,
. By Lemma 2, the Markov inequality,
inequality, and the Cauchy–Schwarz inequality, for any
, it is easy to see that
Then by the Borel–Cantelli lemma, we obtain
Similarly, by Lemma 2 and the Markov inequality and
inequality, for any
, one can obtain
By the Borel–Cantelli lemma, it follows that
Then combining (
12) with (
13), for
, one can obtain
The proof of Lemma 9 is completed. □
Lemma 10. Under the assumptions of Theorem 1, for any , we have .

Proof. By Lemma 2, the Markov inequality,
inequality, and the Cauchy–Schwarz inequality, for any
, it is easy to see that
where
is defined in Assumption 1.
By the Borel–Cantelli lemma, we obtain
The proof of Lemma 10 is completed. □
5. Examples
Some examples are given in this section to verify the almost sure central limit theorem for the error variance estimator in some special nonlinear autoregressive models. The first example is a degenerate model, namely, the AR(1) process.
Example 1. An AR(1) model is a family of random variables such that for every , where is a collection of i.i.d. random variables with zero mean and finite variance . We also assume that for some and any . It is obvious that is a stationary model under the condition .
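For this degenerate case the least squares estimator has a closed form, so the error variance estimator is easy to simulate. The coefficient value, noise level, and sample size below are hypothetical choices for illustration:

```python
import numpy as np

# AR(1) simulation: X_t = a*X_{t-1} + eps_t with |a| < 1 (stationary case).
# Closed-form least squares estimator of a, then residual-based variance estimate.
rng = np.random.default_rng(2)
a_true, sigma_true, n = 0.5, 1.0, 10000

X = np.zeros(n)
eps = rng.normal(0.0, sigma_true, size=n)
for t in range(1, n):
    X[t] = a_true * X[t - 1] + eps[t]

a_hat = float(np.sum(X[1:] * X[:-1]) / np.sum(X[:-1] ** 2))  # least squares
resid = X[1:] - a_hat * X[:-1]                               # residuals
sigma2_hat = float(np.mean(resid ** 2))                      # variance estimator
print(a_hat, sigma2_hat)
```

At this sample size both the coefficient estimate and the error variance estimate are close to their true values $a = 0.5$ and $\sigma^2 = 1$.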
It is easy to check that Assumption 1 holds naturally. For Assumption 2, by Theorem 1 of Wang et al. [26] and , (5) holds for the least squares estimator . Therefore, we have the following statement for the AR(1) process due to Theorem 1.
Theorem 2. Suppose is a sequence of positive numbers satisfying conditions (C1) and (C2). For the above AR(1) model, if for some and any , then for any , one can obtain .

The next example concerns self-exciting threshold autoregressive (SETAR) processes.
Example 2. Let be a sequence of stationary and geometrically ergodic random variables satisfying the following continuous SETAR() process:
where is a collection of i.i.d. random variables with zero mean and finite variance , are the different regions with for , and are the thresholds. Let be the true parameters of the process and , , .
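A two-regime special case of such a threshold process can be simulated as follows. The regime slopes and the threshold below are hypothetical choices; both slopes are strictly inside the unit interval in absolute value, which is consistent with the geometric ergodicity assumed above:

```python
import numpy as np

# Illustrative two-regime SETAR process:
# X_t = a1*X_{t-1} + eps_t  if X_{t-1} <= r,  else  a2*X_{t-1} + eps_t.
rng = np.random.default_rng(3)
a1, a2, r, n = 0.6, -0.4, 0.0, 5000

X = np.zeros(n)
eps = rng.normal(0.0, 1.0, size=n)
for t in range(1, n):
    slope = a1 if X[t - 1] <= r else a2
    X[t] = slope * X[t - 1] + eps[t]

# with |a1| < 1 and |a2| < 1 the chain is stable, so the sample
# variance settles at a finite value
var_X = float(np.var(X))
print(var_X)
```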
Condition. Suppose that has density h and that the density f of is continuous and has a support including the interval , where , . There is some such that for all and .
By Corollary 3.1 of Liebscher [2], under the Condition and , , , Assumption 2 holds. Therefore, we have the following result for SETAR processes due to Theorem 1.
Theorem 3. Suppose is a sequence of positive numbers satisfying conditions (C1) and (C2). For the above SETAR() process, under Assumption 1, the Condition, and , , , for any , one can obtain .

Next, we consider threshold-exponential AR processes.
Example 3. Let be non-overlapping and non-empty intervals of such that . A combined threshold-exponential AR process is defined by
with , where is a collection of i.i.d. random variables with zero mean. Let be the true parameters and the parameters with .
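A single-regime exponential AR(1) step is the simplest special case of such a combined model and can be simulated directly. The coefficients a, b, c below are hypothetical; since |a| + |b| < 1, the autoregressive coefficient stays strictly inside the unit interval and the process is stable:

```python
import numpy as np

# Illustrative exponential AR(1): X_t = (a + b*exp(-c*X_{t-1}^2)) * X_{t-1} + eps_t.
# The effective AR coefficient varies smoothly between a (large |X|) and a+b (small |X|).
rng = np.random.default_rng(5)
a, b, c, n = 0.3, 0.4, 1.0, 5000

X = np.zeros(n)
eps = rng.normal(0.0, 1.0, size=n)
for t in range(1, n):
    X[t] = (a + b * np.exp(-c * X[t - 1] ** 2)) * X[t - 1] + eps[t]

var_X = float(np.var(X))   # finite for this stable specification
print(var_X)
```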
For Assumption 2, if , , , and for some , then by Theorem 4 of Yao [28], (5) holds for the least squares estimator . Therefore, we have the following statement for threshold-exponential AR processes due to Theorem 1.
Theorem 4. Suppose is a sequence of positive numbers satisfying conditions (C1) and (C2). For the above threshold-exponential AR process, if , , , and for some and , then under Assumption 1, for any , one can obtain .

Next, we consider the multilayer perceptron process.
Example 4. Multilayer perceptron processes have become popular in nonlinear modeling due to their universal approximation ability. An example is the model described below, which has p input units fed by the variables at time i, a hidden layer with K units, and one output unit which provides the variable , where is a collection of i.i.d. random variables with zero mean. Let be the true parameters and the parameters with .
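A one-hidden-layer autoregressive perceptron of this form can be simulated as follows. The weights, biases, network sizes (p = 2 inputs, K = 2 hidden units), and the logistic sigmoid are all illustrative assumptions:

```python
import numpy as np

# Autoregressive perceptron sketch:
# X_t = sum_k c_k * sigmoid( w_k . (X_{t-1},...,X_{t-p}) + b_k ) + eps_t.
rng = np.random.default_rng(4)
p, K, n = 2, 2, 3000
W = np.array([[0.5, -0.3], [0.2, 0.4]])   # hidden-layer weights, shape (K, p)
b = np.array([0.1, -0.1])                  # hidden-layer biases
c = np.array([0.4, -0.6])                  # output weights

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

X = np.zeros(n)
eps = rng.normal(0.0, 0.5, size=n)
for t in range(p, n):
    lagged = X[t - p:t][::-1]              # (X_{t-1}, X_{t-2})
    X[t] = float(c @ sigmoid(W @ lagged + b)) + eps[t]

# the network output is bounded, so the process inherits stability
var_X = float(np.var(X))
print(var_X)
```

Because the sigmoid output layer is bounded, the regression function is bounded and the simulated process has a finite, stable variance.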
For Assumption 2 and the sigmoid map , if for all different from , there exists such that , for some , and the matrix is regular, where , then by Theorem 5 of Yao [28], (5) holds for the least squares estimator . Therefore, we have the following statement for multilayer perceptrons due to Theorem 1.
Theorem 5. Suppose is a sequence of positive numbers satisfying conditions (C1) and (C2). For the univariate multilayer perceptron process with , if for all θ different from , there exists such that , for some , and the matrix is regular, then under Assumption 1, for any , one can obtain .