Self-Weighted LSE and Residual-Based QMLE of ARMA-GARCH Models

This paper studies the self-weighted least squares estimator (SWLSE) of the ARMA model with GARCH noises. It is shown that the SWLSE is consistent and asymptotically normal when the GARCH noise does not have a finite fourth moment. Using the residuals from the estimated ARMA model, it is shown that the residual-based quasi-maximum likelihood estimator (QMLE) for the GARCH model is consistent and asymptotically normal, but if the innovations are asymmetric, it is not as efficient as that when the GARCH process is observed. Using the SWLSE and residual-based QMLE as the initial estimators, the local QMLE for ARMA-GARCH model is asymptotically normal via an one-step iteration. The importance of the proposed estimators is illustrated by simulated data and five real examples in financial markets.

Keywords:

ARMA models; GARCH models; QMLE; Self-weighted LSE

1. Introduction

Time series models have been extensively applied in various areas and many methodologies were proposed in the literature; for example, () proposed a hybrid methodology that combines both ARIMA and ANN models to improve forecasting accuracy. Since (), the ARCH-type models have been widely used in economics and finance. In particular, the GARCH model proposed by () has been a benchmark in the risk management. () showed that the GARCH-based option-pricing models are able to price the SPX one-month variance swap rate, that is, the CBOE Volatility Index (VIX) accurately. () used the GARCH(

1, 1

) model to analyze stock market turmoil during COVID-19 outbreak in an emerging and developed Economy.

However, recent research showed that the usual statistical inference procedure does not work if the fourth moment of the GARCH process does not exist. To make it clear, let us consider the AR(1)-GARCH(

1, 1

) model

y_{t} = ϕ_{1} y_{t - 1} + ε_{t},

(1)

ε_{t} = η_{t} \sqrt{h_{t}} and h_{t} = α_{0} + α_{1} ε_{t - 1}^{2} + β_{1} h_{t - 1},

(2)

where

α_{0} > 0

,

α_{1} \geq 0

,

β_{1} \geq 0

, and

η_{t}

is a sequence of independent and identically distributed (i.i.d.) innovations with zero mean and unit variance. For model (1), the least squares estimator (LSE) of

ϕ_{1}

is

{\hat{ϕ}}_{L S n} \equiv {(\frac{1}{n} \sum_{t = 1}^{n} y_{t - 1}^{2})}^{- 1} (\frac{1}{\sqrt{n}} \sum_{t = 1}^{n} y_{t - 1} y_{t}),

where n is the sample size. () and () showed that

{\hat{ϕ}}_{L S n}

is

\sqrt{n}

-consistent and asymptotically normal if

E ε_{t}^{4} < \infty

. However,

E ε_{t}^{4} = \infty

when the tail index

α

of

ε_{t}

is in

(0, 4]

. In this case, () and () showed that

ε_{t}

has a heavy-tailed feature and its sample autocorrelation function is neither

\sqrt{n}

-consistent nor asymptotically normal. () showed that

{\hat{ϕ}}_{L S n}

is

n^{1 - 2 / α}

-consistent and converges to a stable random variable when

α \in (2, 4)

. Furthermore, for the AR model with

ε_{t}

being G-GARCH(

1, 1

) noise in (), () showed that

\frac{\sqrt{n}}{log n} ({\hat{ϕ}}_{L S n} - ϕ_{1}) ⟶_{L} Normal, if α = 4 (i.e. E ε_{t}^{4} = \infty),

(3)

n^{1 - \frac{2}{α}} ({\hat{ϕ}}_{L S n} - ϕ_{1}) ⟶_{L} Stable, if α \in (2, 4) (i.e. E ε_{t}^{2} < \infty and E ε_{t}^{4} = \infty),

(4)

log n ({\hat{ϕ}}_{L S n} - ϕ_{1}) ⟶_{L} Stable, if α = 2 (i.e. E ε_{t}^{2} = \infty),

(5)

{\hat{ϕ}}_{L S n} - ϕ_{1} ⟶_{L} Stable, if α \in (0, 2) (i.e. E ε_{t}^{2} = \infty),

(6)

when

n \to \infty

, where

⟶_{L}

denotes the convergence in distribution. From (3)–(6), we find that the LSE not only has a slower rate of convergence but also is not asymptotically normal when

α \in (0, 4)

. Thus, based on the LSE, the classical theory and methodology (e.g., t-test, Wald test, and Ljung-Box test, among others) do not work in this case. Using a simulation method, we give the regime of parameter vector

(α_{1}, β_{1})

with

E ε_{t}^{2 ι} < \infty

in Figure 1 when

η_{t} \sim N (0, 1)

. It can be seen that the regime of

(α_{1}, β_{1})

is very small for

E ε_{t}^{4} < \infty

(i.e.,

α > 4

). In practice, the estimated value of

(α_{1}, β_{1})

does not lie in this regime, usually. Thus, it is very important to study the statistical inference when

α \in (0, 4]

. () studied the self-weighted least absolute deviation estimator (SLADE) of the ARMA-GARCH model and showed that it is consistent and asymptotically normal when

α \in (0, 4]

.

Figure 1. Parameter regime of

(α_{1}, β_{1})

with

E ε_{t}^{2 ι} < \infty

.

This paper studies the self-weighted LSE (SWLSE) of the ARMA model with GARCH noises. It is shown that the SWLSE is consistent and asymptotically normal when the GARCH noise does not have a finite fourth moment (i.e.,

α \in (2, 4]

). Using the residuals from the estimated ARMA model, it is shown that the residual-based quasi-maximum likelihood estimator (QMLE) for the GARCH model is consistent and asymptotically normal, but if the innovations are asymmetric, it is not as efficient as that when the GARCH process is observed. Using the SWLSE and residual-based QMLE as the initial estimators, the local QMLE for ARMA-GARCH model is asymptotically normal via an one-step iteration.

This paper is arranged as follows. Section 2 presents the model and assumptions. Section 3 presents our main results. Section 4 presents simulation results and Section 5 gives real examples. All the proofs are deferred into the Appendix A.

2. Model and Assumptions

Assume that

{y_{t} : t = 0, \pm 1, \pm 2, \dots}

are generated by the ARMA-GARCH model

y_{t} = μ + \sum_{i = 1}^{p} ϕ_{i} y_{t - i} + \sum_{i = 1}^{q} ψ_{i} ε_{t - i} + ε_{t},

(7)

ε_{t} = η_{t} \sqrt{h_{t}} and h_{t} = α_{0} + \sum_{i = 1}^{r} α_{i} ε_{t - i}^{2} + \sum_{i = 1}^{s} β_{i} h_{t - i},

(8)

where

α_{i} \geq 0

and

β_{j} \geq 0

,

i = 0, \dots, r

,

j = 1, \dots, s

, and

η_{t}

is defined as in (2). Denote

γ = {(μ, ϕ_{1}, \dots, ϕ_{p}, ψ_{1}, \dots, ψ_{q})}^{'}

,

δ = {(α_{0}, α_{1}, \dots, α_{r}, β_{1}, \dots, β_{s})}^{'}

, and

λ = {(γ^{'}, δ^{'})}^{'}

. Let

γ_{0}

,

δ_{0}

, and

θ_{0}

be the true values of

γ

,

δ

, and

θ

, respectively. The parameter subspaces

Θ_{γ} \subset R^{p + q + 1}

and

Θ_{δ} \subset R_{0}^{r + s + 1}

are compact, where

R = (- \infty, \infty)

and

R_{0} = [0, \infty)

. Denote

Θ = Θ_{γ} \times Θ_{δ}

,

m = p + q + r + s + 2

,

α (z) = \sum_{i = 1}^{r} α_{i} z^{i}

,

β (z) = 1 - \sum_{i = 1}^{s} β_{i} z^{i}

,

ϕ (z) = 1 - \sum_{i = 1}^{p} ϕ_{i} z^{i}

, and

ψ (z) = 1 + \sum_{i = 1}^{q} ψ_{i} z^{i}

. We introduce the following conditions:

Assumption 1.

θ_{0}

is an interior point in Θ and for each

θ \in Θ

,

ϕ (z) \neq 0

and

ψ (z) \neq 0

when

| z | \leq 1

, and

ϕ (z)

and

ψ (z)

have no common root with

ϕ_{p} \neq 0

or

ψ_{q} \neq 0

.

Assumption 2.

α (z)

and

β (z)

have no common root,

α_{r} + β_{s} \neq 0

, and

\sum_{i = 1}^{r} α_{i} + \sum_{j = 1}^{s} β_{j} < 1

for each

θ \in Θ

.

Assumption 1 is the stationarity and invertibility condition of ARMA models, under which it follows that

ψ^{- 1} (z) = \sum_{i = 0}^{\infty} a_{ψ} (i) z^{i} and ϕ (z) ψ^{- 1} (z) = \sum_{i = 0}^{\infty} a_{γ} (i) z^{i},

(9)

where

\sup_{Θ_{γ}} | a_{ψ} (i) | = O (ρ^{i})

and

\sup_{Θ_{γ}} | a_{γ} (i) | = O (ρ^{i})

with

ρ \in (0, 1)

. Assumption 2 ensures that

{ε_{t}}

is strictly stationary and ergodic with

E ε_{t}^{2} < \infty

, see () and (). It is also the identifiability condition for model (2) and, by Lemma 2.1 in (), the condition

\sum_{i = 1}^{s} β_{i} < 1

is equivalent to

\begin{matrix} 0 \leq ρ (G) < 1, where G = (\begin{matrix} β_{1} & \dots & β_{s} \\ I_{s - 1} & O \end{matrix}), \end{matrix}

(10)

I_{k}

is the

k \times k

identity matrix, and

ρ (B)

is the spectral radius of matrix B. Under this condition, we have

\begin{matrix} β^{- 1} (z) = \sum_{i = 0}^{\infty} a_{β} (i) z^{i} and α (z) β^{- 1} (z) = \sum_{i = 1}^{\infty} a_{δ} (i) z^{i}, \end{matrix}

(11)

where

\sup_{Θ_{δ}} | a_{β} (i) | = O (ρ^{i})

and

\sup_{Θ_{δ}} | a_{δ} (i) | = O (ρ^{i})

with

ρ = ρ (G)

.

Given the observations

{y_{n},

\dots, y_{1}}

and initial value

Y_{0} \equiv {y_{0}, y_{- 1}, \dots}

, we can write the parametric model as

ε_{t} (γ) = y_{t} - μ - \sum_{i = 1}^{p} ϕ_{i} y_{t - i} - \sum_{i = 1}^{q} ψ_{i} ε_{t - i} (γ),

(12)

η_{t} (λ) = ε_{t} (γ) / \sqrt{h_{t} (λ)} and h_{t} (λ) = α_{0} + \sum_{i = 1}^{r} α_{i} ε_{t - i}^{2} (γ) + \sum_{i = 1}^{s} β_{i} h_{t - i} (λ) .

(13)

It is easy to see that

η_{t} (λ_{0}) = η_{t}

,

ε_{t} (γ_{0}) = ε_{t}

, and

h_{t} (λ_{0}) = h_{t}

. In practice, we do not observe those

y_{i}

in

Y_{0}

and hence they have to be replaced by some constants. This does not affect our asymptotic results, see (). For simplicity, we do not study this case in details.

3. Main Results

The self-weighted estimation approach was proposed by () and it has been used to solve the problem on statistical inference of the heavy-tailed ARMA-GARCH model in () and (). Using a similar idea, we define the SWLSE as

{\tilde{γ}}_{n} = arg min_{γ \in Θ_{γ}} \sum_{t = 1}^{n} \frac{ε_{t}^{2} (γ)}{w_{t}},

where

w_{t} = 1 + \sum_{k = 1}^{\infty} k^{- 1 / 2 - 1} | y_{t - k} |

. We can state the following result:

Theorem 1.

Suppose that Assumptions 1 and 2 hold. Then, as

n \to \infty

,

(i) {\tilde{γ}}_{n} ⟶_{p} γ_{0},

(i i) \sqrt{n} ({\tilde{γ}}_{n} - γ_{0}) ⟶_{L} N (0, A^{- 1} B A^{- 1}),

where

⟶_{p}

denotes the convergence in probability,

A = E (w_{t}^{- 1} M_{t})

,

B = E (w_{t}^{- 2} h_{t} M_{t})

, and

M_{t} = [\partial ε_{t} (γ_{0}) / \partial γ] {[\partial ε_{t} (γ_{0}) / \partial γ]}^{'}

.

The preceding result holds for any kind of ARCH-type errors only if

E h_{t} < \infty

, see the proof in the Appendix A. To easily understand it, we refer to model (1) and (2) again. In this case, the information function is

E (y_{t - 1}^{2} / w_{t}) \leq E | y_{t - 1} | < \infty

. The score function is

n^{- 1 / 2} \sum_{t = 1}^{n} y_{t - 1} ε_{t} / w_{t}

and

E {(y_{t - 1} ε_{t} / w_{t})}^{2} \leq O (1) E h_{t} < \infty

, which is the condition we need for the GARCH errors. This result holds when

E ε_{t}^{4} < \infty

, but it is not as efficient as the LSE in this case. When

E ε_{t}^{4} = \infty

and

E ε_{t}^{2} < \infty

, the process

y_{t}

has a heavy tailed feature and the SWLSE has a faster rate of convergence than that of LSE. The weight function

w_{t}

can be replaced by others, see ().

Next, we use the residual

{\tilde{ε}}_{t} \equiv ε_{t} ({\tilde{γ}}_{n})

from ARMA parts as the artificial observation of

ε_{t}

. The log-quasi-likelihood function based on

{\tilde{ε}}_{t}

can be written as

\begin{matrix} {\tilde{L}}_{δ n} (δ) = \frac{1}{n} \sum_{t = 1}^{n} {\tilde{l}}_{t} (δ) and {\tilde{l}}_{t} (δ) = - \frac{1}{2} log {\tilde{h}}_{t} (δ) - \frac{{\tilde{ε}}_{t}^{2}}{2 {\tilde{h}}_{t} (δ)}, \end{matrix}

(14)

where

{\tilde{h}}_{t} (δ) = h_{t} {(λ) |}_{γ = {\tilde{γ}}_{n}}

. We define the residual-based QMLE of

δ_{0}

as

{\tilde{δ}}_{n} = arg max_{δ \in Θ_{δ}} {\tilde{L}}_{δ n} (δ) .

Denote

H_{δ t} (λ) = h_{t}^{- 2} (λ) [\partial h_{t} (λ) / \partial δ] [\partial h_{t} (λ) / \partial δ^{'}]

and

H_{δ t} (λ_{0})

by

H_{δ t}

. We now give the asymptotic properties of

{\tilde{δ}}_{n}

as follows.

Theorem 2.

Suppose that Assumptions 1 and 2 hold. Then, as

n \to \infty

,

\begin{matrix} (i) & {\tilde{δ}}_{n} ⟶_{p} δ_{0}, if E {| η_{t} |}^{2 + \tilde{ι}} < \infty for some \tilde{ι} > 0, \\ (i i) & \sqrt{n} ({\tilde{δ}}_{n} - δ_{0}) ⟶_{L} N (0, {(E H_{δ t})}^{- 1} Ω_{δ} {(E H_{δ t})}^{- 1}), if E η_{t}^{4} < \infty, \end{matrix}

where

Ω_{δ} = κ E H_{δ t} + E D_{t} (A^{- 1} B A^{- 1}) E D_{t}^{'} + κ_{3} {\tilde{Ω}}_{δ}

,

{\tilde{Ω}}_{δ} = E D_{t} A^{- 1} E (w_{t}^{- 1} {\tilde{D}}_{t}^{'})

+ E (w_{t}^{- 1} {\tilde{D}}_{t}) A^{- 1} E D_{t}^{'}

,

κ = E η_{t}^{4} - 1

,

κ_{3} = E η_{t}^{3}

,

D_{t} = E {h_{t}^{- 2} [\partial h_{t} (λ_{0}) / \partial δ] [\partial h_{t} (λ_{0}) / \partial γ^{'}]}

, and

{\tilde{D}}_{t} = E {h_{t}^{- 1 / 2} [\partial h_{t} (λ_{0}) / \partial δ] [\partial ε_{t} (γ_{0}) / \partial γ^{'}]}

.

When

η_{t}

is symmetric and

μ = 0

, we have

E η_{t}^{3} = 0

,

E D_{t} = E {\tilde{D}}_{t} = 0

, and hence

Ω_{δ} = κ E H_{δ t}

. When the conditional mean is zero (i.e.,

y_{t} = ε_{t}

), model (7) and (8) reduces to the GARCH model. In this case, the log-quasi-likelihood function based on

ε_{t}

can be written as

\begin{matrix} L_{δ n} (δ) = \frac{1}{n} \sum_{t = 1}^{n} l_{t} (δ) and l_{t} (δ) = - \frac{1}{2} log h_{t} (δ) - \frac{ε_{t}^{2}}{2 h_{t} (δ)} . \end{matrix}

(15)

Then, the global QMLE of

δ_{0}

is defined as

{\bar{δ}}_{n} = arg \max_{δ \in Θ_{δ}} L_{δ n} (δ)

. () and () showed that

{\bar{δ}}_{n}

is consistent and as

n \to \infty

,

\begin{matrix} \sqrt{n} ({\bar{δ}}_{n} - δ_{0}) ⟶_{L} N (0, κ {(E H_{δ t})}^{- 1}), if E η_{t}^{4} < \infty . \end{matrix}

(16)

From Theorem 2, we see that the efficiency of the estimated

δ_{0}

is affected by the estimated parameters in ARMA parts unless

η_{t}

has a symmetric density and

μ

is known to be zero without estimation. This gives a reminder to practitioners that we need to be careful when ones use the residuals to estimate the GARCH model.

Given

{y_{n}, \dots, y_{1}}

and the initial value

Y_{0}

, we can write down the log-quasi-likelihood function of model (7) and (8) as follows:

\begin{matrix} L_{n} (λ) = \frac{1}{n} \sum_{t = 1}^{n} l_{t} (λ) and l_{t} (λ) = - \frac{1}{2} log h_{t} (λ) - \frac{ε_{t}^{2} (γ)}{2 h_{t} (λ)} . \end{matrix}

(17)

Then, the global QMLE of

λ_{0}

is defined as the maximizer of

L_{n} (λ)

in

Θ

. () proved the consistency of this QMLE. But the asymptotic normality of this QMLE requires

E ε_{t}^{4} < \infty

, see also ().

Based on

{\tilde{λ}}_{n} \equiv {({\tilde{γ}}_{n}^{'}, {\tilde{δ}}_{n}^{'})}^{'}

, we obtain the local QMLE through an one-step iteration

\begin{matrix} {\hat{λ}}_{n} = {\tilde{λ}}_{n} - {[\sum_{t = 1}^{n} \frac{\partial^{2} l_{t} ({\tilde{λ}}_{n})}{\partial λ \partial λ^{'}}]}^{- 1} \sum_{t = 1}^{n} \frac{\partial l_{t} ({\tilde{λ}}_{n})}{\partial λ} . \end{matrix}

(18)

As in (), we can show that as

n \to \infty

,

\begin{matrix} \sqrt{n} ({\hat{λ}}_{n} - λ_{0}) ⟶_{L} N (0, Σ^{- 1} Ω Σ^{- 1}), \end{matrix}

where

Σ = E [U_{t} (λ_{0}) U_{t}^{'} (λ_{0})]

,

Ω = E [U_{t} (λ_{0}) J U_{t}^{'} (λ_{0})]

,

J = (\begin{matrix} 1 & κ_{3} \\ κ_{3} & κ \end{matrix})

, and

U_{t} (λ) = [h_{t}^{- 1 / 2} \partial ε_{t} (γ) / \partial λ, h_{t}^{- 1} \partial h_{t} (λ) / \partial λ]

. When

η_{t} \sim N (0, 1)

, the local QMLE is efficient. So, Theorems 1 and 2 provide an approach to obtain an efficient estimator for the full ARMA-GARCH models under the finite second moment condition of

ε_{t}

. When

η_{t}

is not normal, the efficient and adaptive estimators can be obtained by using the results in this section and following the similar lines as in (), (), (), and ().

4. Simulation Study

In this section, we assess the finite sample performance of

{\tilde{λ}}_{n} = {({\tilde{γ}}_{n}^{'}, {\tilde{δ}}_{n}^{'})}^{'}

and

{\hat{λ}}_{n} = {({\hat{γ}}_{n}^{'}, {\hat{δ}}_{n}^{'})}^{'}

, where

{\tilde{γ}}_{n}

is the SWLSE,

{\tilde{δ}}_{n}

is the residual-based QMLE, and

{\hat{λ}}_{n}

is the local QMLE. We generate 1000 replications of sample size

n = 1000

and 2000 from the following model

y_{t} = ϕ_{10} y_{t - 1} + ψ_{10} ε_{t - 1} + ε_{t},

(19)

ε_{t} = η_{t} \sqrt{h_{t}} and h_{t} = α_{00} + α_{10} ε_{t - 1}^{2} + β_{10} h_{t - 1},

(20)

where

γ_{0}^{'} = (ϕ_{10}, ψ_{10}) = (0.4, 0.5)

,

δ_{0}^{'} = (α_{00}, α_{10}, β_{10}) = (0.1, 0.1, 0.8)

, and

η_{t}

is chosen to be the standard normal N(

0, 1

) distribution, re-scaled Laplace

L (0, 1)

distribution, or re-scaled student’s

t (5)

distribution with

E η_{t}^{2} = 1

. Table 1 reports the sample bias (Bias), the sample standard deviations (SD), and the average estimated asymptotic standard deviation (AD) of

{\tilde{λ}}_{n}

and

{\hat{λ}}_{n}

. From this table, we find that (i) each considered estimator has a small bias, and its value of SD is close to that of AD, demonstrating the validity of its asymptotic normality; (ii)

{\hat{γ}}_{n}

could be slightly more efficient than

{\tilde{γ}}_{n}

, whereas

{\hat{δ}}_{n}

is as efficient as

{\tilde{δ}}_{n}

; (iii) all estimators for

η_{t} \sim N (0, 1)

are more efficient than the corresponding ones for

η_{t} \sim L (0, 1)

or

t (5)

. All these findings are consistent with our theory in Section 3. We should mention that the QMLE of

δ_{0}

is not reliable when the sample size n is less than 800 according to our simulation experiments and hence the results are not reported here.

Table 1. The results of

{\tilde{λ}}_{n}

and

{\hat{λ}}_{n}

.

As a comparison, we compute the classical LSE

{\hat{γ}}_{L S n} = {({\hat{ϕ}}_{L S n}, {\hat{ψ}}_{L S n})}^{'}

for

γ_{0}

in model (19) and (20), where

{\hat{γ}}_{L S n}

is computed in a similar way as

{\tilde{γ}}_{n}

with

w_{t} \equiv 1

. Table 2 reports the corresponding results of

{\hat{γ}}_{L S n}

. Compared with

{\tilde{γ}}_{n}

in Table 1, we find that

{\hat{γ}}_{L S n}

is less efficient than

{\tilde{γ}}_{n}

for all examined cases. This finding suggests that it seems better to fit the ARMA model by the SWLSE rather than the LSE method when the data exhibit the conditionally heteroscedastic effect.

Table 2. The results of

{\hat{γ}}_{L S n}

.

5. Real Examples

This section first studies the log returns (

\times 100

) of DJIA, NASDAQ, NASDAQ 100, and S&P 500 from 11 March 2015 to 10 March 2021, with a total of 1764 observations (see Figure 2). Denote each log return series by

{y_{t}}_{t = 1}^{1764}

. Before fitting an AR(1)-GARCH(

1, 1

) to

{y_{t}}_{t = 1}^{1764}

, we first estimate

α_{y}

, the tail index of

| y_{t} |

, and get the following results:

\begin{matrix} (DJIA) & {\hat{α}}_{y} = 2.3029, (NASDAQ) {\hat{α}}_{y} = 3.2592, \\ (0.9285) (0.6830) \\ (NASDAQ 100) & {\hat{α}}_{y} = 3.6956, (S & P 500) {\hat{α}}_{y} = 2.5329, \\ (0.6077) (0.8567) \end{matrix}

where

{\hat{α}}_{y}

is the proposed estimator of

α_{y}

in (), and the value in parentheses is the AD of

{\hat{α}}_{y}

. From the above results, we can conclude that each

| y_{t} |

has a finite second moment, but does not have a finite fourth moment. Hence, it is reasonable to fit four return series by using the procedure in Section 3, that is, we first obtain the SWLSE

{\tilde{γ}}_{n}

and the residual-based QMLE

{\tilde{δ}}_{n}

, and then obtain the local QMLE

{\hat{λ}}_{n}

. The resulting fitted models are as follows:

\begin{matrix} (DJIA) & \{\begin{matrix} y_{t} = 0.0859 - 0.0461 y_{t - 1} + ε_{t}, \\ (0.0173) (0.0292) \\ h_{t} = 0.0416 + 0.2108 ε_{t - 1}^{2} + 0.7532 h_{t - 1}, \\ (0.0109) (0.0378) (0.0377) \end{matrix} \\ (NASDAQ) & \{\begin{matrix} y_{t} = 0.1009 - 0.0663 y_{t - 1} + ε_{t}, \\ (0.0216) (0.0275) \\ h_{t} = 0.0643 + 0.1747 ε_{t - 1}^{2} + 0.7826 h_{t - 1}, \\ (0.0178) (0.0335) (0.0367) \end{matrix} \\ (NASDAQ 100) & \{\begin{matrix} y_{t} = 0.1125 - 0.0654 y_{t - 1} + ε_{t}, \\ (0.0225) (0.0276) \\ h_{t} = 0.0668 + 0.1751 ε_{t - 1}^{2} + 0.7855 h_{t - 1}, \\ (0.0180) (0.0325) (0.0351) \end{matrix} \\ (S & P 500) & \{\begin{matrix} y_{t} = 0.0910 - 0.0838 y_{t - 1} + ε_{t}, \\ (0.0171) (0.0289) \\ h_{t} = 0.0432 + 0.2206 ε_{t - 1}^{2} + 0.7453 h_{t - 1}, \\ (0.0117) (0.0422) (0.0414) \end{matrix} \end{matrix}

where all estimated parameters are the local QMLE

{\hat{λ}}_{n}

, and the values in parentheses are the ADs of

{\hat{λ}}_{n}

. From these fitted models, we can find that all estimated parameters are significantly different from zero at the level of 5%. In particular, the significant parameters in the fitted AR models imply that the U.S. stock market is not efficient during the examined period.

Figure 2. Log returns (

\times 100

) of DJIA, NASDAQ, NASDAQ 100, and S&P 500 from 11 March 2015 to 10 March 2021.

Next, this section considers the log returns (

\times 100

) of PHLX Oil Service Index OSX from 11 March 2015 to 10 March 2021, with a total of 1510 observations (see Figure 3). As before, we denote this log return series by

{y_{t}}_{t = 1}^{1510}

, and obtain its estimate

{\hat{α}}_{y} = 2.7960

with

A D = 0.7078

. This implies that

| y_{t} |

has a finite second moment, but does not have a finite fourth moment. Hence, we apply the local QMLE method to get the following fitted model for

y_{t}

:

\begin{matrix} (OSX) & \{\begin{matrix} y_{t} = - 0.0377 + 0.0239 y_{t - 1} + ε_{t}, \\ (0.0589) (0.0307) \\ h_{t} = 0.1329 + 0.1076 ε_{t - 1}^{2} + 0.8792 h_{t - 1} . \\ (0.0713) (0.0285) (0.0304) \end{matrix} \end{matrix}

Figure 3. Log returns (

\times 100

) of OSX from 11 March 2015 to 10 March 2021.

Unlike the fitted results for the four U.S. stock indexes above, the fitted AR coefficient for the OSX index is not significantly different from zero at the level of 5%, indicating that the oil market is efficient during the examined period.

6. Concluding Remarks

This paper studied the SWLSE of the ARMA model with GARCH noises and the residual-based QMLE for the GARCH model. The consistency and asymptotic normality of SWLSE were established under a little moment condition. The importance of the proposed estimators was illustrated by simulated data and four major stock indexes and one major oil index in U.S. The ARMA-GARCH model is very important in the risk management, see (). In practice, ones need to build the ARMA-GARCH model from the historical data. The major contribution of our paper is to present a way to build an efficient and reliable model for this purpose. Several potential future research topics are listed as follows: first, we may extend our procedure for the hybrid methodology that combines both ARIMA and ANN models with GARCH errors as in (); second, we could use our procedure to analyze the energy data and build an ARMA-GARCH model for the green energy, renewable energy, and bio-energy data as discussing in (); third, we may explore a linear programming or a genetic algorithm to find the QMLE of ARMA-GARCH model as presented in ().

Author Contributions

Conceptualization and methodology, S.L.; Data analysis, K.Z. All authors have read and agreed to the published version of the manuscript.

Funding

Ling’s research was funded by Hong Kong Research Grants Commission Grants (nos. 16500117, 16303118, 16301620, and 16300621), Australian Research Council, and the NSFC. Zhu’s research was funded by Hong Kong Research Grants Commission Grants (nos. 17304421, 17306818, and 17305619) and the NSFC (nos. 11690014 and 11731015).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Publicly available data sets were analyzed in this study and can be found at https://www.wsj.com/, accessed on 5 January 2022.

Acknowledgments

The authors thank the referees for careful reading and useful comments that helped to improve the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proofs

The following lemma gives two basic properties for model (7) and (8).

Lemma A1.

Suppose

{ε_{t}}

is generated by model (8) satisfying Assumption 2. Then (i)

{ε_{t}}

is strictly stationary and ergodic with

E ε_{t}^{2} < \infty

, and has the following causal representation:

\begin{matrix} ε_{t} = η_{t} \sqrt{h_{t}} and h_{t} = α_{0} [1 + \sum_{j = 1}^{\infty} u^{'} \prod_{i = 0}^{j - 1} P_{t - i} ξ_{t - j}] a . s .; \end{matrix}

and (ii) there exists some

ι \in (0, 1)

such that

E | ε_{t} |^{2 + ι} < \infty

if

E | η_{t} |^{2 + \tilde{ι}} < \infty

for some

\tilde{ι} > 0

, where

ξ_{t} = (η_{t}^{2}, 0, \dots, 0, 1,

{\dots, 0)}_{(r + s) \times 1}^{^{'}}

with the first component

η_{t}^{2}

and the

(r + 1)

th component 1, and

u = {(0, \dots, 0, 1, \dots, 0)}_{(r + s) \times 1}^{^{'}}

with the

(r + 1)

th component 1, and

\begin{matrix} P_{t} = (\begin{array}{c} α_{1} η_{t}^{2} & \dots & α_{r} η_{t}^{2} & β_{1} η_{t}^{2} & \dots & β_{s} η_{t}^{2} \\ I_{r - 1} & O & O \\ α_{1} & \dots & α_{r} & β_{1} & \dots & β_{s} \\ O & I_{s - 1} & O \end{array}) . \end{matrix}

Proof.

The result in (i) is from Theorem 2.1 of (). For (ii), we first show that there exists an integer

i_{0}

such that, for some

\tilde{ι} \in (0, 1)

,

\begin{matrix} E ∥ \prod_{k = 1}^{i_{0}} P_{t - k} ∥^{1 + {\tilde{ι}}_{1}} < 1, \end{matrix}

(A1)

where

∥ B ∥ = \sqrt{t r (B B^{'})}

for a vector or matrix B. Let

P = {[Π, O]}_{(r + s) \times (r + s)}^{'}

with

Π = {(α_{1}, \dots, α_{r}, β_{1}, \dots, β_{s})}^{'}

, C be defined as

P_{t}

with all the elements of its first row replaced by 0, and

P (x) = (E | η_{t} {|^{2 (1 + x)})}^{1 / (1 + x)} P + C .

Since

E | η_{t} |^{2 + \tilde{ι}} < \infty

, the spectral radius

ρ (P (x))

is continuous in terms of x in

[0, \tilde{ι})

. By Lemma 3.2 in () and Assumption 2, we know that

ρ (P (0)) = ρ (E P_{t}) < 1

, and there exists a constant

{\tilde{ι}}_{1} \in (0, \tilde{ι})

such that

\begin{matrix} ρ (P ({\tilde{ι}}_{1})) < ρ (E P_{t}) + [1 - ρ (E P_{t})] < 1 . \end{matrix}

(A2)

By Corollary A.2 in () and (A2),

\begin{matrix} ∥ P^{i} ({\tilde{ι}}_{1}) ∥ \leq c {[ρ (P ({\tilde{ι}}_{1}))]}^{i / 2} ⟶ 0, \end{matrix}

(A3)

as

i \to \infty

, where c is a constant. Let

c_{j} = (0, \dots, 0, 1, 0, \dots,

{0)}_{(r + s) \times 1}^{'}

with the jth element being 1. Since all the elements of

P_{t}

are nonnegative, it follows that

\begin{matrix} ∥ \prod_{k = 1}^{i} P_{t} ∥ \leq \sum_{j_{1}, j_{2} = 1}^{r + s} c_{j_{1}}^{'} \prod_{k = 1}^{i} P_{t} c_{j_{2}} . \end{matrix}

(A4)

By Minkowskii’s inequality and (A3) and (A4), we have that

\begin{matrix} E ∥ \prod_{k = 1}^{i} P_{t - k} ∥^{1 + {\tilde{ι}}_{1}} & \leq {(\sum_{j_{1}, j_{2} = 1}^{r + s} {E {[c_{j_{1}}^{'} \prod_{k = 1}^{i} P_{t - k} c_{j_{2}}]}^{1 + {\tilde{ι}}_{1}}}^{1 / (1 + {\tilde{ι}}_{1})})}^{1 + {\tilde{ι}}_{1}} \\ = {[\sum_{j_{1}, j_{2} = 1}^{r + s} {E {[c_{j_{1}}^{'} \prod_{k = 1}^{i} (η_{t - k}^{2} P + C) c_{j_{2}}]}^{1 + {\tilde{ι}}_{1}}}^{1 / (1 + {\tilde{ι}}_{1})}]}^{1 + {\tilde{ι}}_{1}} \\ \leq {[\sum_{j_{1}, j_{2} = 1}^{r + s} (c_{j_{1}}^{'} \prod_{k = 1}^{i} [(E | η_{t} |^{2 (1 + {\tilde{ι}}_{1})})^{1 / (1 + {\tilde{ι}}_{1})} P + C] c_{j_{2}})]}^{1 + {\tilde{ι}}_{1}} \\ = {[\sum_{j_{1}, j_{2} = 1}^{r + s} c_{j_{1}}^{'} P^{i} ({\tilde{ι}}_{1}) c_{j_{2}}]}^{1 + {\tilde{ι}}_{1}} ⟶ 0, \end{matrix}

as

i \to \infty

. Thus, there is

i_{0}

large enough such that (A1) holds. Using (A1) and the representation in (i), we can show that (ii) holds. This completes the proof. □

Lemma A2.

[Lemma A.1 in ()] If Assumptions 1 and 2 hold, then there exist constants C and

ρ \in (0, 1)

such that the following holds uniformly in Θ:

\begin{matrix} (i) & ε_{t - 1} (γ), ∥ \frac{\partial ε_{t} (γ)}{\partial γ} ∥, a n d ∥ \frac{\partial^{2} ε_{t} (γ)}{\partial γ \partial γ^{'}} ∥ are bounded a . s . by ξ_{γ t - 1}, \\ (i i) & h_{t} (λ) is bounded a . s . by ξ_{γ t - 1}^{2}, \end{matrix}

where

ξ_{γ t - 1} = C (1 + \sum_{j = 1}^{\infty} ρ^{j} | y_{t - j} |)

with constants

ρ \in (0, 1)

and C.

Proof of Theorem 1.

(i) Let

L_{s n} (γ) = \sum_{t = 1}^{n} [ε_{t}^{2} (γ) / w_{t}] / n

. First, the space

Θ_{γ}

is compact and

γ_{0}

is an interior point in

Θ_{γ}

. Second,

L_{s n} (γ)

is continuous in

γ \in Θ_{γ}

and is a measurable function of

{y_{s}

,

s = t, t - 1, \dots}

for all

γ \in Θ_{γ}

. Third, by Lemma A2(i),

\begin{matrix} E \sup_{γ \in Θ_{γ}} [ε_{t}^{2} (γ) / w_{t}] \leq C E (1 + \sum_{i = 0}^{\infty} ρ^{i} | y_{t - i} {|)}^{2} < \infty, \end{matrix}

where C is a constant. Moreover, by the ergodic theorem,

L_{s n} (γ) ⟶_{p} E [ε_{t}^{2} (γ) / w_{t}]

for each

γ

. Furthermore, by Theorem 3.1 in (),

L_{s n} (γ) ⟶_{p} E [ε_{t}^{2} (γ) / w_{t}]

uniformly in

Θ_{γ}

. Fourth,

\begin{matrix} ε_{t} (γ) = ε_{t} - [M_{t} (γ) - M_{t} (γ_{0})], \end{matrix}

where

M_{t} (γ) = \sum_{i = 1}^{p} ϕ_{i} y_{t - i} + \sum_{i = 1}^{q} ϕ_{i} ε_{t - i} (γ)

. Thus,

\begin{matrix} E [\frac{ε_{t}^{2} (γ)}{w_{t}}] & = E [\frac{ε_{t}^{2} (γ_{0})}{w_{t}}] + E \{\frac{{[M_{t} (γ) - M_{t} (γ_{0})]}^{2}}{w_{t}}\} \geq E [\frac{ε_{t}^{2} (γ_{0})}{w_{t}}], \end{matrix}

where the equality holds if and only if

M_{t} (γ) = M_{t} (γ_{0})

, that is,

ε_{t} (γ) = ε_{t} (γ_{0})

, which holds if and only if

γ = γ_{0}

under Assumption 1, that is,

E [ε_{t}^{2} (γ) / w_{t}]

reaches its unique minimum at

γ = γ_{0}

. Thus, we have established all the conditions for consistency in Theorem 4.1.1 in () and hence (i) holds.

(ii) First,

{\tilde{γ}}_{n}

is a consistent estimator of

γ_{0}

. Second,

\frac{\partial^{2} L_{s n} (γ)}{\partial γ \partial γ^{'}} = \frac{2}{n} \sum_{t = 1}^{n} \frac{1}{w_{t}} \frac{\partial ε_{t} (γ)}{\partial γ} \frac{\partial ε_{t} (γ)}{\partial γ^{'}} + \frac{2}{n} \sum_{t = 1}^{n} \frac{ε_{t} (γ)}{w_{t}} \frac{\partial^{2} ε_{t} (γ)}{\partial γ \partial γ^{'}}

exists and is continuous in

Θ_{γ}

. Third, let

A_{t} (γ) \equiv \frac{1}{w_{t}} \frac{\partial ε_{t} (γ)}{\partial γ} \frac{\partial ε_{t} (γ)}{\partial γ^{'}} + \frac{ε_{t} (γ)}{w_{t}} \frac{\partial^{2} ε_{t} (γ)}{\partial γ \partial γ^{'}} .

By Lemma A2, we can show that

E \sup_{γ \in Θ_{γ}} ∥ A_{t} (γ) ∥ < \infty

. By the ergodic theorem and Theorem 3.1 in (), we can show that

\partial^{2} L_{s n} (γ) / \partial γ \partial γ^{'}

converges to

2 E A_{t} (γ)

uniformly in

Θ_{γ}

in probability. Since

E A_{t} (γ)

is continuous in terms of

γ

, we can show that

\partial^{2} L_{s n} (γ_{n}) / \partial γ \partial γ^{'}

converges to

2 A

in probability for any sequence

γ_{n}

, such that

γ_{n} \to γ_{0}

in probability. Fourth,

\frac{\partial L_{s n} (γ_{0})}{\partial γ^{'}} = \frac{2}{n} \sum_{t = 1}^{n} \frac{ε_{t} (γ_{0})}{w_{t}} \frac{\partial ε_{t} (γ_{0})}{\partial γ} .

By Lemma A2, it follows that

\begin{matrix} B = E [\frac{ε_{t}^{2} (γ_{0})}{w_{t}^{2}} \frac{\partial ε_{t} (γ_{0})}{\partial γ} \frac{\partial ε_{t} (γ_{0})}{\partial γ^{'}}] = E [\frac{h_{t} (λ_{0})}{w_{t}^{2}} \frac{\partial ε_{t} (γ_{0})}{\partial γ} \frac{\partial ε_{t} (γ_{0})}{\partial γ^{'}}] \leq C^{2} E h_{t} < \infty . \end{matrix}

Similar to the proof of Lemma 4.2 in (), we can show that A and B are positive definite. By the central limit theorem, we have that

\partial L_{s n} (γ_{0}) / \partial γ ⟶_{L} N (0, 4 B)

. Thus, we have established all the conditions in Theorem 4.1.3 in (), and hence

\sqrt{n} ({\tilde{γ}}_{n} - γ_{0}) ⟶_{L} N (0, A^{- 1} B A^{- 1})

. This completes the proof. □

The following Lemma A3(i)–(ii) is Lemma A.2 in () and Lemma A3(iii) is Lemma A.3(i) in ().

Lemma A3.

If Assumptions 1 and 2 hold, then it follows that

\begin{matrix} (i) & \sup_{Θ} ∥ \frac{1}{h_{t} (λ)} \frac{\partial h_{t} (λ)}{\partial δ} ∥ \leq ξ_{δ t - 1}, \\ (i i) & \sup_{Θ} ∥ \frac{1}{h_{t} (λ)} \frac{\partial^{2} h_{t} (λ)}{\partial δ \partial δ^{'}} ∥ \leq ξ_{δ t - 1}, \\ (i i i) & \sup_{Θ} ∥ \frac{1}{\sqrt{h_{t} (λ)}} \frac{\partial h_{t} (λ)}{\partial γ} ∥ \leq ξ_{γ t - 1}, \end{matrix}

where

ξ_{δ t - 1} = C (1 + \sum_{j = 1}^{\infty} ρ^{j} | y_{t - j} |^{ι_{1}})

with constants

ρ \in (0, 1)

and C for any

ι_{1} > 0

.

To prove Theorem 2, we need to introduce another three lemmas. For their proofs, we need the condition that

E | ε_{t} |^{2 + {\tilde{ι}}_{1}} < \infty

for some

{\tilde{ι}}_{1} > 0

. Here and in the sequel,

l_{t} (δ) = l_{t} {(λ) |}_{γ = γ_{0}}

and

h_{t} (δ) = h_{t} {(λ) |}_{γ = γ_{0}}

.

Lemma A4.

If Assumptions 1 and 2 hold with

E | η_{t} |^{2 + \tilde{ι}} < \infty

for some

\tilde{ι} > 0

, then it follows that

\begin{matrix} \sup_{δ \in Θ_{δ}} | \frac{1}{n} \sum_{t = 1}^{n} [{\tilde{l}}_{t} (δ) - l_{t} (δ)] | = o_{p} (1) . \end{matrix}

Proof.

Since

ξ_{γ t}

in Lemma A2 is strictly stationary with

E ξ_{γ t}^{2} < \infty

, we have that

\max_{1 \leq t \leq n} ξ_{γ t}

/ \sqrt{n} = o_{p} (1)

. By Taylor’s expansion, Lemma A2(i), and Theorem 1(ii), it follows that

\begin{matrix} {\tilde{ε}}_{t} = ε_{t} + ({\tilde{γ}}_{n} - γ_{0}) \frac{\partial ε_{t} (γ^{*})}{\partial γ} = ε_{t} + o_{p} (1), \end{matrix}

(A5)

where

o_{p} (1)

holds uniformly in t, and

γ^{*}

lies between

γ_{0}

and

{\tilde{γ}}_{n}

. By (A5), we can readily show that

\begin{matrix} \sup_{δ \in Θ_{δ}} | \frac{1}{n} \sum_{t = 1}^{n} \frac{{\tilde{ε}}_{t}^{2} - ε_{t}^{2}}{{\tilde{h}}_{t} (δ)} | = o_{p} (1), \end{matrix}

(A6)

since

{\tilde{h}}_{t} (δ) \geq {\underset{̲}{α}}_{0}

uniformly in

δ \in Θ_{δ}

. Note that

\begin{matrix} {\tilde{h}}_{t} (δ) = h_{t} (δ) + ({\tilde{γ}}_{n} - γ_{0}) \frac{\partial h_{t} (λ^{*})}{\partial γ}, \end{matrix}

(A7)

where

λ^{*} = {(γ^{*^{'}}, δ^{'})}^{'}

and

γ^{*}

lies between

γ_{0}

and

{\tilde{γ}}_{n}

. By Lemma A1(ii), we can show that

E (ε_{t}^{2} ξ_{γ t - 1}^{{\tilde{ι}}_{1}}) < \infty

as

{\tilde{ι}}_{1}

is small enough. By Lemma A3(iii) and the ergodic theorem, it follows that

\begin{matrix} \sup_{λ \in Θ} \frac{1}{n} \sum_{t = 1}^{n} ε_{t}^{2} ∥ \frac{\partial h_{t} (λ)}{\partial γ} ∥^{{\tilde{ι}}_{1}} \leq \frac{1}{n} \sum_{t = 1}^{n} ε_{t}^{2} ξ_{γ t - 1}^{{\tilde{ι}}_{1}} = O_{p} (1), \end{matrix}

as

{\tilde{ι}}_{1}

is small enough. Thus,

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} ε_{t}^{2} | \frac{1}{{\tilde{h}}_{t} (δ)} - \frac{1}{h_{t} (δ)} | & \leq \frac{2}{{\underset{̲}{α}}_{0}^{1 - {\tilde{ι}}_{1}} n} \sum_{t = 1}^{n} ε_{t}^{2} | \frac{1}{{\tilde{h}}_{t} (δ)} - \frac{1}{h_{t} (δ)} |^{{\tilde{ι}}_{1}} \\ \leq \frac{2}{{\underset{̲}{α}}_{0}^{1 + {\tilde{ι}}_{1}} n} \sum_{t = 1}^{n} ε_{t}^{2} | {\tilde{h}}_{t} (δ) - h_{t} (δ) |^{{\tilde{ι}}_{1}} \\ \leq \frac{2 ∥ {\tilde{γ}}_{n} - γ_{0} ∥^{{\tilde{ι}}_{1}}}{{\underset{̲}{α}}_{0}^{1 + {\tilde{ι}}_{1}} n} \sum_{t = 1}^{n} ε_{t}^{2} ∥ \frac{\partial h_{t} (λ^{*})}{\partial γ} ∥^{{\tilde{ι}}_{1}} = o_{p} (1), \end{matrix}

(A8)

where

o_{p} (1)

holds uniformly in

δ \in Θ_{δ}

. By (A6) and (A8), it follows that

\begin{matrix} \sup_{δ \in Θ_{δ}} | \frac{1}{n} \sum_{t = 1}^{n} [\frac{{\tilde{ε}}_{t}^{2}}{{\tilde{h}}_{t} (δ)} - \frac{ε_{t}^{2}}{h_{t} (δ)}] | = o_{p} (1) . \end{matrix}

(A9)

Moreover, we can show that

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} [log {\tilde{h}}_{t} (δ) - log h_{t} (δ)] I {{\tilde{h}}_{t} (δ) \geq h_{t} (δ)} \\ = \frac{1}{n} \sum_{t = 1}^{n} [log [1 + ({\tilde{γ}}_{n} - γ_{0}) \frac{1}{h_{t} (δ)} \frac{\partial h_{t} (λ^{*})}{\partial γ}]] I {{\tilde{h}}_{t} (δ) \geq h_{t} (δ)} \\ \leq \frac{1}{n {\tilde{ι}}_{1}} \sum_{t = 1}^{n} log {[1 + {\underset{̲}{α}}_{0}^{- 1} ∥ {\tilde{γ}}_{n} - γ_{0} ∥ ∥ \frac{\partial h_{t} (λ^{*})}{\partial γ} ∥]}^{{\tilde{ι}}_{1}}, \end{matrix}

where

λ^{*} = {(γ^{*^{'}}, δ^{'})}^{'}

and

γ^{*}

lies between

γ_{0}

and

{\tilde{γ}}_{n}

. Note that there exists an

{\tilde{ι}}_{1}

such that

E \sup_{λ \in Θ} {∥ \partial h_{t} (λ) / \partial γ ∥}^{{\tilde{ι}}_{1}} < \infty

. For any

ε > 0

, first taking

η

small enough such that

log [1 + η^{{\tilde{ι}}_{1}} {\underset{̲}{α}}_{0}^{- {\tilde{ι}}_{1}} E \sup_{λ \in Θ} ∥ \partial h_{t} (λ) / \partial γ ∥^{{\tilde{ι}}_{1}}] < ε^{2} {\tilde{ι}}_{1}

and then taking n large enough such that

P (∥ {\tilde{γ}}_{n} - γ_{0} ∥ \geq η) \leq ε

, it follows that

\begin{matrix} P (\frac{1}{n {\tilde{ι}}_{1}} \sum_{t = 1}^{n} log {[1 + \frac{1}{{\underset{̲}{α}}_{0}} ∥ {\tilde{γ}}_{n} - γ_{0} ∥ \sup_{λ \in Θ} ∥ \frac{\partial h_{t} (λ)}{\partial γ} ∥]}^{{\tilde{ι}}_{1}} \geq ε) \\ \leq P (\frac{1}{n {\tilde{ι}}_{1}} \sum_{t = 1}^{n} log {[1 + \frac{1}{{\underset{̲}{α}}_{0}} ∥ {\tilde{γ}}_{n} - γ_{0} ∥ \sup_{λ \in Θ} ∥ \frac{\partial h_{t} (λ)}{\partial γ} ∥]}^{{\tilde{ι}}_{1}} \geq ε, ∥ {\tilde{γ}}_{n} - γ_{0} ∥ \leq η) \\ + ε \\ \leq \frac{1}{n {\tilde{ι}}_{1} ε} \sum_{t = 1}^{n} E log {[1 + \frac{1}{{\underset{̲}{α}}_{0}} η \sup_{λ \in Θ} ∥ \frac{\partial h_{t} (λ)}{\partial γ} ∥]}^{{\tilde{ι}}_{1}} + ε \\ = \frac{1}{{\tilde{ι}}_{1} ε} E log {[1 + \frac{1}{{\underset{̲}{α}}_{0}} η \sup_{λ \in Θ} ∥ \frac{\partial h_{t} (λ)}{\partial γ} ∥]}^{{\tilde{ι}}_{1}} + ε \\ \leq \frac{1}{{\tilde{ι}}_{1} ε} log [1 + \frac{1}{{\underset{̲}{α}}_{0}^{{\tilde{ι}}_{1}}} η^{{\tilde{ι}}_{1}} \sup_{λ \in Θ} ∥ \frac{\partial h_{t} (λ)}{\partial γ} ∥^{{\tilde{ι}}_{1}}] + ε \leq 2 ε, \end{matrix}

where the last second inequality holds by Jensen’s inequality. Thus, as n is large enough,

\begin{matrix} P (\sup_{δ \in Θ_{δ}} \frac{1}{n} \sum_{t = 1}^{n} [log {\tilde{h}}_{t} (δ) - log h_{t} (δ)] I {{\tilde{h}}_{t} (δ) \geq h_{t} (δ)} \geq ε) \leq 2 ε . \end{matrix}

Similarly, we can show that

\begin{matrix} P (\sup_{δ \in Θ_{δ}} \frac{1}{n} \sum_{t = 1}^{n} [log {\tilde{h}}_{t} (δ) - log h_{t} (δ)] I {{\tilde{h}}_{t} (δ) \leq h_{t} (δ)} \geq ε) \leq 2 ε . \end{matrix}

Furthermore, by (A9), the conclusion holds. This completes the proof. □

Lemma A5.

If the assumptions of Lemma A3 hold, then it follows that

\begin{matrix} (i) & \sup_{δ \in Θ_{δ}} ∥ \frac{1}{n} \sum_{t = 1}^{n} [\frac{\partial^{2} {\tilde{l}}_{t} (δ)}{\partial δ \partial δ^{^{'}}} - \frac{\partial^{2} l_{t} (δ)}{\partial δ \partial δ^{^{'}}}] ∥ = o_{p} (1), \\ (i i) & E \sup_{δ \in Θ_{δ}} ∥ \frac{\partial^{2} l_{t} (δ)}{\partial δ \partial δ^{^{'}}} ∥ < \infty . \end{matrix}

Proof.

Denote

{\tilde{V}}_{t} (δ) = {\tilde{h}}_{t}^{- 1} (δ) [\partial {\tilde{h}}_{t} (δ) / \partial δ]

and similarly for

V_{t} (δ)

. Then

\begin{matrix} \frac{\partial^{2} {\tilde{l}}_{t} (δ)}{\partial δ \partial δ^{'}} = - \frac{1}{2} {\tilde{V}}_{t} (δ) {\tilde{V}}_{t}^{'} (δ) \frac{{\tilde{ε}}_{t}^{2}}{{\tilde{h}}_{t} (δ)} + [\frac{{\tilde{ε}}_{t}^{2}}{{\tilde{h}}_{t} (δ)} - 1] \frac{\partial {\tilde{V}}_{t} (δ)}{\partial γ} . \end{matrix}

(A10)

Similarly, we can have the formula of

\partial^{2} l_{t} (δ) / \partial δ \partial δ^{'}

. By (A5), we have

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} {\tilde{V}}_{t} (δ) {\tilde{V}}_{t}^{'} (δ) \frac{{\tilde{ε}}_{t}^{2}}{{\tilde{h}}_{t} (δ)} = \frac{1}{n} \sum_{t = 1}^{n} {\tilde{V}}_{t} (δ) {\tilde{V}}_{t}^{'} (δ) \frac{ε_{t}^{2}}{{\tilde{h}}_{t} (δ)} + \frac{o_{p} (1)}{n} \sum_{t = 1}^{n} {\tilde{V}}_{t} (δ) {\tilde{V}}_{t}^{'} (δ) \frac{ε_{t}}{{\tilde{h}}_{t} (δ)} \\ + \frac{o_{p} (1)}{n} \sum_{t = 1}^{n} {\tilde{V}}_{t} (δ) {\tilde{V}}_{t}^{'} (δ) \frac{1}{{\tilde{h}}_{t} (δ)} . \end{matrix}

(A11)

By Lemma A3(i),

\sup_{δ \in Θ_{δ}} ∥ {\tilde{V}}_{t} (δ) ∥ \leq \sup_{Θ} ∥ h_{t}^{- 1} (λ) [\partial h_{t} (λ) / \partial δ] ∥ \leq ξ_{δ t - 1}

. Furthermore, by Lemma A1, we can take

ι_{1}

in

ξ_{δ t - 1}

small enough such that the leading factors in the last terms are bounded uniformly in

δ \in Θ_{δ}

. Thus, the last two terms are

o_{p} (1)

, and hence it follows that

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} {\tilde{V}}_{t} (δ) {\tilde{V}}_{t}^{'} (δ) \frac{{\tilde{ε}}_{t}^{2}}{{\tilde{h}}_{t} (δ)} = \frac{1}{n} \sum_{t = 1}^{n} {\tilde{V}}_{t} (δ) {\tilde{V}}_{t}^{'} (δ) \frac{ε_{t}^{2}}{{\tilde{h}}_{t} (δ)} + o_{p} (1), \end{matrix}

(A12)

where

o_{p} (1)

holds uniformly in

δ \in Θ_{δ}

. Moreover, by Lemma A3(i), we have

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} {\tilde{V}}_{t} (δ) ∥ {\tilde{V}}_{t} (δ) - V_{t} (δ) ∥ \frac{ε_{t}^{2}}{{\tilde{h}}_{t} (δ)} \\ \leq \frac{1}{n} \sum_{t = 1}^{n} {\tilde{V}}_{t} (δ) ∥ {\tilde{V}}_{t} (δ) - V_{t} (δ) ∥^{ι} {[∥ {\tilde{V}}_{t} (δ) ∥ + ∥ V_{t} (δ) ∥]}^{1 - ι} \frac{ε_{t}^{2}}{{\tilde{h}}_{t} (δ)} \\ \leq \frac{2}{n} \sum_{t = 1}^{n} ξ_{δ t - 1}^{2 - ι} ∥ {\tilde{V}}_{t} (δ) - V_{t} (δ) ∥^{ι} \frac{ε_{t}^{2}}{{\tilde{h}}_{t} (δ)} . \end{matrix}

(A13)

By Lemma A1 and taking

ι

and

ι_{1}

in

ξ_{δ t - 1}

small enough, we have

E \max_{1 \leq n < \infty} \sup_{δ \in Θ_{δ}} [ξ_{δ t - 1}^{2 - ι} ∥ {\tilde{V}}_{t} (δ) - V_{t} (δ) ∥^{ι} \frac{ε_{t}^{2}}{{\tilde{h}}_{t} (δ)}] \leq C E (ξ_{δ t - 1}^{2} ε_{t}^{2}) < \infty,

where C is a constant. By the dominated convergence theorem, we can show that

\lim_{n \to \infty} E \sup_{δ \in Θ_{δ}} [ξ_{δ t - 1}^{2 - ι} ∥ {\tilde{V}}_{t} (δ) - V_{t} (δ) ∥^{ι} \frac{ε_{t}^{2}}{{\tilde{h}}_{t} (δ)}] = 0 .

Thus, we can show that (A13) is

o_{p} (1)

uniformly in

δ \in Θ_{δ}

. Furthermore, by (A12),

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} {\tilde{V}}_{t} (δ) {\tilde{V}}_{t} (δ) \frac{ε_{t}^{2}}{{\tilde{h}}_{t} (δ)} = \frac{1}{n} \sum_{t = 1}^{n} {\tilde{V}}_{t} (δ) V_{t} (δ) \frac{ε_{t}^{2}}{{\tilde{h}}_{t} (δ)} + o_{p} (1) . \end{matrix}

(A14)

Similarly, we can show that

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} {\tilde{V}}_{t} (δ) V_{t} (δ) \frac{ε_{t}^{2}}{{\tilde{h}}_{t} (δ)} = \frac{1}{n} \sum_{t = 1}^{n} V_{t} (δ) V_{t} (δ) \frac{ε_{t}^{2}}{{\tilde{h}}_{t} (δ)} + o_{p} (1) . \end{matrix}

(A15)

Similar to (A8), we can show that

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} V_{t} (δ) V_{t} (δ) \frac{ε_{t}^{2}}{{\tilde{h}}_{t} (δ)} = \frac{1}{n} \sum_{t = 1}^{n} V_{t} (δ) V_{t} (δ) \frac{ε_{t}^{2}}{h_{t} (δ)} + o_{p} (1) . \end{matrix}

(A16)

The

o_{p} (1)

in (A14)–(A16) hold uniformly in

δ \in Θ_{δ}

. By (A12) and (A14)–(A16), we have that

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} {\tilde{V}}_{t} (δ) {\tilde{V}}_{t} (δ) \frac{{\tilde{ε}}_{t}^{2}}{{\tilde{h}}_{t} (δ)} = \frac{1}{n} \sum_{t = 1}^{n} V_{t} (δ) V_{t} (δ) \frac{ε_{t}^{2}}{h_{t} (δ)} + o_{p} (1) . \end{matrix}

We can show that a similar equation holds for other terms in (A10). Thus, (i) holds. By Lemmas A2 and A3, it is straightforward to show that (ii) holds. This completes the proof. □

Lemma A6.

[Lemma A.7 in ()] If the conditions in Theorem 1 holds and

\sqrt{n} ∥ λ - λ_{0} ∥ \leq M

, then it follows that

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} \frac{\partial^{2} l_{t} (λ)}{\partial λ \partial λ^{^{'}}} = \frac{1}{n} \sum_{t = 1}^{n} \frac{\partial^{2} l_{t} (λ_{0})}{\partial λ \partial λ^{^{'}}} + o_{p} (1), \end{matrix}

for any fixed constant M.

Proof of Theorem 2.

Let

{\tilde{L}}_{n} (δ) = \sum_{t = 1}^{n} {\tilde{l}}_{t} (δ) / n

. First, the space

Θ_{δ}

is compact and

δ_{0}

is an interior point in

Θ_{δ}

. Second,

{\tilde{L}}_{n} (δ)

is continuous in

δ \in Θ_{δ}

and is a measurable function of

{y_{s}

,

s = t, t - 1, \dots}

for all

δ \in Θ_{δ}

. Third, by Lemma A2(ii), there exist constants C and

ρ \in (0, 1)

such that

\begin{matrix} 1 \leq \frac{h_{t} (δ)}{{\underset{̲}{α}}_{0}} \leq C (1 + \sum_{i = 1}^{\infty} ρ^{i} | ε_{t - i} {|)}^{2}, \end{matrix}

uniformly in

δ \in Θ_{δ}

. By Jensen’s inequality,

E \sup_{δ \in Θ_{δ}} | log h (δ) | \leq E \sup_{δ \in Θ_{δ}} log

[h (δ) / {\underset{̲}{α}}_{0}] + | log {\underset{̲}{α}}_{0} | < \infty

. Thus, we can show that

E \sup_{δ \in Θ_{δ}} | l_{t} (δ) | < \infty

. By the ergodic theorem,

\sum_{t = 1}^{n} l_{t} (δ) / n ⟶_{p} E l_{t} (δ)

for each

δ

. Furthermore, by Theorem 3.1 in (),

\sum_{t = 1}^{n} l_{t} (δ) / n ⟶_{p} E l_{t} (δ)

uniformly in

Θ_{δ}

. By Lemma A4,

{\tilde{L}}_{n} (δ) ⟶_{p} E l_{t} (δ)

uniformly in

Θ_{δ}

. Fourth, similar to the proof of Lemma A.10 of (), we can show that

E l_{t} (δ)

reaches its unique maximum at

δ = δ_{0}

. Thus, we have established all the conditions for consistency in Theorem 4.1.1 in () and hence (i) holds.

For (ii), we first have a consistent estimator

{\tilde{δ}}_{n}

of

δ_{0}

. Second,

\partial^{2} {\tilde{L}}_{n} (δ) / \partial δ \partial δ^{'}

exists and is continuous in

Θ_{δ}

. Third, by Lemma A5(ii),

E \sup_{δ \in Θ_{δ}} ∥ \partial^{2} l_{t} (δ) / \partial δ \partial δ^{'} ∥ < \infty

. By the ergodic theorem and Theorem 3.1 in (), we can show that

\sum_{t = 1}^{n} [\partial^{2} l_{t} (δ) / \partial δ \partial δ^{'}] / n ⟶_{p} E [\partial^{2} l_{t} (δ) / \partial δ \partial δ^{'}]

uniformly in

Θ_{δ}

. Since

E [\partial^{2} l_{t} (δ) / \partial δ \partial δ^{'}]

is continuous in terms of

δ

, we can show that

\sum_{t = 1}^{n} [\partial^{2} l_{t} (δ_{n}) / \partial δ \partial δ^{'}] / n

⟶_{p} - E H_{δ t} / 4

for any sequence

δ_{n}

, such that

δ_{n} ⟶_{p} δ_{0}

. Furthermore, by Lemma A5(i),

\partial^{2} {\tilde{L}}_{n} (δ_{n}) / \partial δ \partial δ^{'} ⟶_{p} - E H_{δ t} / 4

for any sequence

δ_{n}

, such that

δ_{n} ⟶_{p} δ_{0}

. Fourth, by Taylor’s expansion, it follows that

\begin{matrix} \frac{1}{\sqrt{n}} \sum_{t = 1}^{n} \frac{\partial {\tilde{l}}_{t} (δ_{0})}{\partial δ} = \frac{1}{\sqrt{n}} \sum_{t = 1}^{n} \frac{\partial l_{t} (δ_{0})}{\partial δ} + \frac{1}{\sqrt{n}} \sum_{t = 1}^{n} [\frac{\partial^{2} l_{t} (λ^{*})}{\partial δ \partial γ^{'}}] ({\tilde{γ}}_{n} - γ_{0}), \end{matrix}

where

λ^{*} = {(γ^{*^{'}}, δ_{0}^{'})}^{'}

and

γ^{*}

lies between

γ_{0}

and

γ

. By Lemma A6, we have

\frac{1}{n} \sum_{t = 1}^{n} [\frac{\partial^{2} l_{t} (λ^{*})}{\partial δ \partial γ^{'}}] = E [\frac{\partial^{2} l_{t} (λ_{0})}{\partial δ \partial γ^{'}}] + o_{p} (1) = - \frac{1}{2} E D_{t} + o_{p} (1) .

Furthermore, by Theorem 1, we can show that

\begin{matrix} \frac{1}{\sqrt{n}} \sum_{t = 1}^{n} \frac{\partial {\tilde{l}}_{t} (δ_{0})}{\partial δ} = \frac{1}{\sqrt{n}} \sum_{t = 1}^{n} \frac{\partial l_{t} (δ_{0})}{\partial δ} + \frac{E D_{t} A^{- 1}}{2 \sqrt{n}} \sum_{t = 1}^{n} \frac{ε_{t} (γ_{0})}{w_{t}} \frac{\partial ε_{t} (γ_{0})}{\partial γ} + o_{p} (1) . \end{matrix}

By Lemma A4, we can see that

E ∥ H_{δ t} ∥ < \infty

and

E ∥ \partial l_{t} (δ_{0}) {/ \partial δ ∥}^{2} < \infty

. Thus,

Ω_{δ}

is finite. Similar to the proof of Lemma 4.2 in (), we can show that

E H_{δ t}

and

Ω_{δ}

are positive definite. By the central limit theorem, we have that

n^{- 1 / 2} \partial {\tilde{L}}_{n} (δ_{0}) / \partial δ ⟶_{L} N (0, Ω_{δ} / 4)

. Thus, we have established all the conditions in Theorem 4.1.3 in (), and hence

\sqrt{n} ({\hat{δ}}_{n} - δ_{0}) ⟶_{L} N (0, E^{- 1} H_{δ t} Ω_{δ} E^{- 1} H_{δ t})

. This completes the proof. □

References

An, Jaehyung, Alexey Mikhaylov, and Sang-Uk Jung. 2021. A linear programming approach for robust network revenue management in the airline industry. Journal of Air Transport Management 91: 101979. [Google Scholar] [CrossRef]
An, Jaehyung, and Alexey Mikhaylov. 2020. Russian energy projects in South Africa. Journal of Energy in Southern Africa 31: 58–64. [Google Scholar] [CrossRef]
Amemiya, Takeshi. 1985. Advanced Econometrics. Cambridge: Harvard University Press. [Google Scholar]
Basrak, Bojan, Richard A. Davis, and Thomas Mikosch. 2002. Regular variation of GARCH processes. Stochastic Processes and Their Applications 99: 95–115. [Google Scholar] [CrossRef] [Green Version]
Berkes, István, Lajos Horváth, and Piotr Kokoszka. 2003. GARCH processes: Structure and estimation. Bernoulli 9: 201–7. [Google Scholar] [CrossRef]
Bollerslev, Tim. 1986. Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics 31: 307–27. [Google Scholar] [CrossRef] [Green Version]
Davis, Richard A., and Thomas Mikosch. 1998. The sample autocorrelations of heavy-tailed processs with applications to ARCH. Annals of Statistics 26: 2049–80. [Google Scholar] [CrossRef]
Drost, Feike C., and Chris A. J. Klaassen. 1997. Efficient estimation in semiparametric GARCH models. Journal of Econometrics 81: 193–221. [Google Scholar] [CrossRef] [Green Version]
Drost, Feike C., Chris A. J. Klaassen, and Bas J. M. Werker. 1997. Adaptive estimation in time series models. Annals of Statistics 25: 786–817. [Google Scholar] [CrossRef]
Engle, Robert F. 1982. Autoregressive conditional heteroskedasticity with estimates of variance of U.K. inflation. Econometrica 50: 987–1008. [Google Scholar] [CrossRef]
Francq, Christian, and Jean-Michel Zakoïan. 2004. Maximum likelihood estimation of pure GARCH and ARMA-GARCH processes. Bernoulli 10: 605–637. [Google Scholar] [CrossRef]
Hall, Peter, and Qiwei Yao. 2003. Inference in ARCH and GARCH models. Eonometrica 71: 285–317. [Google Scholar] [CrossRef] [Green Version]
He, Changli, and Timo Teräsvirta. 1999. Properties of moments of a family of GARCH processes. Journal of Econometrics 92: 173–92. [Google Scholar] [CrossRef]
He, Yi, Yanxi Hou, Liang Peng, and Jiliang Sheng. 2019. Statistical inference for a relative risk measure. Journal of Business & Economic Statistics 37: 301–11. [Google Scholar]
Hill, Jonathan B. 2010. On tail index estimation for dependent, heterogeneous data. Econometric Theory 26: 1398–436. [Google Scholar] [CrossRef] [Green Version]
Johansen, Søren. 1995. Likelihood-based Inference in Cointegrated Vector Autoregressive Models. Oxford: OUP Oxford. [Google Scholar]
Lange, Theis. 2011. Tail behavior and OLS estimation in AR-GARCH models. Statistica Sinica 21: 1191–200. [Google Scholar] [CrossRef] [Green Version]
Ling, Shiqing. 1999. On the stationarity and the existence of moments of conditional heteroskedastic ARMA models. Statistica Sinica 9: 1119–30. [Google Scholar]
Ling, Shiqing. 2003. Adaptive estimators and tests of stationary and non-stationary short and long memory ARIMA-GARCH models. Journal of the American Statistical Association 98: 955–67. [Google Scholar] [CrossRef]
Ling, Shiqing. 2005. Self-weighted LAD estimation for infinite variance autoregressive models. Journal of the Royal Statistical Society: Series B 67: 381–93. [Google Scholar] [CrossRef]
Ling, Shiqing. 2007. Self-weighted and local quasi-maximum likelihood estimator for ARMA-GARCH/IGARCH models. Journal of Econometrics 140: 849–73. [Google Scholar] [CrossRef]
Ling, Shiqing, and Michael McAleer. 2002. Necessary and sufficient moment conditions for the GARCH(r, s) and asymmetric power GARCH(r, s) models. Econometric Theory 18: 722–29. [Google Scholar] [CrossRef] [Green Version]
Ling, Shiqing, and Wai Keung Li. 1997. Fractional autoregressive integrated moving-average time series with conditional heteroskedasticity. Journal of the American Statistical Association 92: 1184–94. [Google Scholar] [CrossRef]
Ling, Shiqing, and Michael McAleer. 2003a. Asymptotic theory for a new vector ARMA-GARCH model. Econometric Theory 19: 280–310. [Google Scholar] [CrossRef] [Green Version]
Ling, Shiqing, and Michael McAleer. 2003b. On adaptive estimation in nonstationary ARMA models with GARCH errors. Annals of Statistics 31: 642–74. [Google Scholar] [CrossRef]
Pantula, Sastry G. 1989. Estimation of autoregressive models with ARCH errors. Sankhyā: The Indian Journal of Statistics, Series B 50: 119–38. [Google Scholar]
Setiawan, Budi, Marwa Ben Abdallah, Maria Fekete-Farkas, Robert Jeyakumar Nathan, and Zoltan Zeman. 2021. GARCH (1, 1) models and analysis of stock market turmoil during COVID-19 outbreak in an emerging and developed economy. Journal of Risk and Financial Management 14: 576. [Google Scholar] [CrossRef]
Weiss, Andrew A. 1986. Asymptotic theory for ARCH models: Estimation and testing. Econometrics Theory 2: 107–31. [Google Scholar] [CrossRef] [Green Version]
Zhang, G. Peter. 2003. Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing 50: 159–75. [Google Scholar] [CrossRef]
Zhang, Rongmao, and Shiqing Ling. 2015. Asymptotic inference for AR models with heavy-tailed G-GARCH noises. Econometric Theory 31: 880–90. [Google Scholar] [CrossRef]
Zhang, Wenjun, and Jin E. Zhang. 2020. GARCH option pricing models and the variance risk premium. Journal of Risk and Financial Management 13: 51. [Google Scholar] [CrossRef] [Green Version]
Zhu, Ke, and Shiqing Ling. 2011. Global self-weighted and local quasi-maximum exponential likelihood estimators for ARMA-GARCH/ IGARCH models. Annals of Statistics 39: 2131–63. [Google Scholar] [CrossRef]
Zhu, Ke, and Shiqing Ling. 2015. LADE-based inference for ARMA models with unspecified and heavy-tailed heteroscedastic noises. Journal of the American Statistical Association 110: 784–94. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Parameter regime of

(α_{1}, β_{1})

with

E ε_{t}^{2 ι} < \infty

.

Figure 2. Log returns (

\times 100

) of DJIA, NASDAQ, NASDAQ 100, and S&P 500 from 11 March 2015 to 10 March 2021.

Figure 3. Log returns (

\times 100

) of OSX from 11 March 2015 to 10 March 2021.

Table 1. The results of

{\tilde{λ}}_{n}

and

{\hat{λ}}_{n}

.

Table 1. The results of

{\tilde{λ}}_{n}

and

{\hat{λ}}_{n}

.

			${\tilde{λ}}_{n}$
$η_{t}$	n		${\tilde{ϕ}}_{1 n}$	${\tilde{ψ}}_{1 n}$	${\tilde{α}}_{0 n}$	${\tilde{α}}_{1 n}$	${\tilde{β}}_{1 n}$
$N (0, 1)$	1000	Bias	−0.0012	0.0032	0.0189	0.0012	−0.0235
		SD	0.0443	0.0423	0.0650	0.0278	0.0839
		AD	0.0424	0.0402	0.0524	0.0290	0.0726
	2000	Bias	−0.0017	0.0015	0.0083	−0.0000	−0.0103
		SD	0.0300	0.0293	0.0342	0.0204	0.0471
		AD	0.0300	0.0285	0.0332	0.0201	0.0469
			${\hat{λ}}_{n}$
			${\hat{ϕ}}_{1 n}$	${\hat{ψ}}_{1 n}$	${\hat{α}}_{0 n}$	${\hat{α}}_{1 n}$	${\hat{β}}_{1 n}$
	1000	Bias	−0.0010	0.0022	0.0172	0.0015	−0.0210
		SD	0.0425	0.0406	0.0657	0.0282	0.0845
		AD	0.0405	0.0380	0.0526	0.0291	0.0729
	2000	Bias	−0.0016	0.0012	0.0073	−0.0000	−0.0087
		SD	0.0283	0.0274	0.0340	0.0205	0.0470
		AD	0.0286	0.0270	0.0332	0.0202	0.0469
			${\tilde{λ}}_{n}$
			${\tilde{ϕ}}_{1 n}$	${\tilde{ψ}}_{1 n}$	${\tilde{α}}_{0 n}$	${\tilde{α}}_{1 n}$	${\tilde{β}}_{1 n}$
$L (0, 1)$	1000	Bias	−0.0032	0.0035	0.0241	0.0020	−0.0304
		SD	0.0454	0.0414	0.0806	0.0381	0.1079
		AD	0.0456	0.0433	0.0639	0.0385	0.0909
	2000	Bias	−0.0001	0.0014	0.0116	0.0016	−0.0148
		SD	0.0328	0.0307	0.0426	0.0268	0.0599
		AD	0.0323	0.0307	0.0397	0.0269	0.0577
			${\hat{λ}}_{n}$
			${\hat{ϕ}}_{1 n}$	${\hat{ψ}}_{1 n}$	${\hat{α}}_{0 n}$	${\hat{α}}_{1 n}$	${\hat{β}}_{1 n}$
	1000	Bias	−0.0027	0.0028	0.0237	0.0028	−0.0296
		SD	0.0444	0.0402	0.0918	0.0390	0.1183
		AD	0.0443	0.0416	0.0641	0.0387	0.0913
	2000	Bias	−0.0008	0.0013	0.0109	0.0019	−0.0138
		SD	0.0316	0.0296	0.0424	0.0270	0.0598
		AD	0.0313	0.0295	0.0397	0.0270	0.0578
			${\tilde{λ}}_{n}$
			${\tilde{ϕ}}_{1 n}$	${\tilde{ψ}}_{1 n}$	${\tilde{α}}_{0 n}$	${\tilde{α}}_{1 n}$	${\tilde{β}}_{1 n}$
$t (5)$	1000	Bias	−0.0012	0.0016	0.0300	0.0046	−0.0395
		SD	0.0460	0.0445	0.0867	0.0432	0.1137
		AD	0.0454	0.0431	0.0734	0.0443	0.1038
	2000	Bias	0.0014	0.0005	0.0126	0.0025	−0.0164
		SD	0.0312	0.0305	0.0463	0.0325	0.0657
		AD	0.0323	0.0308	0.0459	0.0316	0.0666
			${\hat{λ}}_{n}$
			${\hat{ϕ}}_{1 n}$	${\hat{ψ}}_{1 n}$	${\hat{α}}_{0 n}$	${\hat{α}}_{1 n}$	${\hat{β}}_{1 n}$
	1000	Bias	−0.0022	0.0018	0.0291	0.0054	−0.0381
		SD	0.0472	0.0448	0.0897	0.0444	0.1166
		AD	0.0443	0.0417	0.0737	0.0445	0.1042
	2000	Bias	0.0006	0.0007	0.0119	0.0030	−0.0155
		SD	0.0317	0.0296	0.0462	0.0330	0.0656
		AD	0.0315	0.0297	0.0459	0.0317	0.0667

Table 2. The results of

{\hat{γ}}_{L S n}

.

Table 2. The results of

{\hat{γ}}_{L S n}

.

		$η_{t} \sim N (0, 1)$		$η_{t} \sim L (0, 1)$		$η_{t} \sim t (5)$
n		${\hat{ϕ}}_{L S n}$	${\hat{ψ}}_{L S n}$	${\hat{ϕ}}_{L S n}$	${\hat{ψ}}_{L S n}$	${\hat{ϕ}}_{L S n}$	${\hat{ψ}}_{L S n}$
1000	Bias	0.0001	0.0012	−0.0034	0.0024	−0.0033	0.0015
	SD	0.0441	0.0412	0.0482	0.0473	0.0518	0.0487
	AD	0.0437	0.0411	0.0507	0.0474	0.0525	0.0490
2000	Bias	−0.0018	0.0015	−0.0009	0.0011	−0.0008	0.0010
	SD	0.0307	0.0299	0.0350	0.0325	0.0382	0.0349
	AD	0.0311	0.0293	0.0367	0.0344	0.0382	0.0358

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Self-Weighted LSE and Residual-Based QMLE of ARMA-GARCH Models^†

Abstract

1. Introduction

2. Model and Assumptions

3. Main Results

4. Simulation Study

5. Real Examples

6. Concluding Remarks

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Proofs

References

Article Metrics

Citations

Article Access Statistics

Self-Weighted LSE and Residual-Based QMLE of ARMA-GARCH Models †

Abstract

1. Introduction

2. Model and Assumptions

3. Main Results

4. Simulation Study

5. Real Examples

6. Concluding Remarks

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Proofs

References

Article Metrics

Citations

Article Access Statistics

Self-Weighted LSE and Residual-Based QMLE of ARMA-GARCH Models^†