A Unified Test for the AR Error Structure of an Autoregressive Model

Wei, Xinyi; Liu, Xiaohui; Fan, Yawen; Tan, Li; Liu, Qing

doi:10.3390/axioms11120690

Open AccessArticle

A Unified Test for the AR Error Structure of an Autoregressive Model

by

Xinyi Wei

^1,2,

Xiaohui Liu

^1,2,

Yawen Fan

^1,2,*,

Li Tan

^1,2 and

Qing Liu

^1,2

¹

School of Statistics, Jiangxi University of Finance and Economics, Nanchang 330013, China

²

Key Laboratory of Data Science in Finance and Economics, Jiangxi University of Finance and Economics, Nanchang 330013, China

^*

Author to whom correspondence should be addressed.

Axioms 2022, 11(12), 690; https://doi.org/10.3390/axioms11120690

Submission received: 25 October 2022 / Revised: 20 November 2022 / Accepted: 28 November 2022 / Published: 1 December 2022

(This article belongs to the Special Issue Computational Statistics & Data Analysis)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

A direct application of autoregressive (AR) models with independent and identically distributed (iid) errors is sometimes inadequate to fit the time series data well. A natural alternative is further to assume the model errors following an AR process, whose structure however has essential impacts on the statistical inferences related to the autoregressive models. In this paper, we construct a new unified test for checking the AR error structure based on the empirical likelihood method. The proposed test is desirable because its limit distribution is always chi-squared regardless of whether the autoregressive model is stationary or non-stationary, with or without an intercept term. Some simulations are also provided to illustrate the finite sample performance of this test. Finally, we apply the proposed test to a financial real data set.

Keywords:

autoregressive model; AR errors; empirical likelihood; unified test

MSC:

62M10; 91B84; 01-08; 65C60

1. Introduction

When auxiliary variables are not available, autoregressive models are widely used to model this kind of time series data. Typically, the response is often assumed to depend linearly on its previous values. Among all autoregressive models, the autoregressive model of the first order, i.e., AR(1), is the simplest, which takes the following form:

\begin{matrix} X_{t} = μ + ϕ X_{t - 1} + ε_{t}, t = 1, 2, \dots, n, \end{matrix}

(1)

where

μ

and

ϕ

are unknown parameters with

μ

being the intercept item and

ϕ

the autoregression coefficient, and

{ε_{t}}

denotes the sequence of random errors or innovations having means of zero.

In many previous studies, a considerable amount of work has been provided on statistical inferences [1,2,3,4,5,6] and related applications [7,8,9] for AR models. In terms of practical applications, AR models are commonly used to describe the behavior of inflation or logarithmic exchange rate, where people are interested in whether there is a unit root or persistence of related variables. However, a precondition for an accurate unit root test or persistence test is that the model is properly fitted so that the parameters can be reasonably estimated. To guarantee this, it is important to perform predefined tests, e.g., the unit root test and serial correlation test, on the rationality of using the AR model before conducting a relevant economic analysis.

Among them, the unit root test is the most commonly mentioned. Note that the limit distributions of the estimators of

μ

and

ϕ

depend on whether the process

{X_{t}}

is stationary or non-stationary, i.e., Case (i)

| ϕ | < 1

(stationary), Case (ii)

μ = 0

and

ϕ = 1 + \frac{c}{n}

for some nonzero constant c (nearly integrated if

c \neq 0

, and unit root if

c = 0

), and Case (iii)

μ \neq 0

and

ϕ = 1 + \frac{c}{n}

for some nonzero constant c (nearly integrated if

c \neq 0

). It is well known that when the AR process has a unit root, its many statistical procedures have quite complex limit distributions, differing from that for the stationary case. Hence, various testing methods have been developed in the past decades to address the issue of unit root, including the augmented Dickey–Fuller (ADF) test [10], the Phillips–Perron (PP) test [11], the DF–GLS test [3], and the KPSS test [12], etc.

It is worth mentioning that if the true underlying innovations are correlated, the finite sample performance of the tests above may be greatly affected. To improve the efficiency of the estimation, a natural idea is to take into account the special structure of the errors if available. Note that it is common to assume that the errors further follow an AR process when they are correlated, while the performance of some testing procedures can be greatly improved once the AR structure has been addressed sufficiently, as shown in [13,14].

In detail, Ref. [13] considered the following autoregressive model with AR errors:

\begin{matrix} \{\begin{matrix} X_{t} = μ + ϕ X_{t - 1} + ε_{t}, \\ e_{t} = ε_{t} + \sum_{i = 1}^{p} ψ_{i} ε_{t - i} . \end{matrix} \end{matrix}

(2)

where

ψ = {(ψ_{1}, ψ_{2}, \dots, ψ_{p})}^{⊤}

denotes the vector of unknown parameters involved in the AR errors, and

e_{t}

denotes the random error involved in

ε_{t}

. Compared to Model (1), Ref. [13] here further assumed that

ε_{t}

follows an AR process. Note that (2) implies

ε_{t} = \sum_{i = 1}^{p} (- ψ_{i}) ε_{t - i} + e_{t}

. A unified unit root test was developed by considering the special structure in

{e_{t}}

. Their test has been shown to have desirable properties, as the related statistic converges in distribution to a standard chi-squared distributed variable. However, their test depends on preconditions such that the AR structure of

{ε_{t}}

has been well specified, and p is properly predefined. The violation of these conditions may result in power loss in this method, as shown in our simulations.

To this end, we are interested in producing statistics to test whether

ψ

is equal to some given constant vector

ψ_{0}

under Cases (i)–(iii), which has not been considered in the literature to the best of our knowledge. Note that although many tests have been developed for testing the possible serial correlation in

{ε_{t}}

, including the Lagrangian multiplier (LM) test [15], Box–Pierce (BP) test [16], and Ljung–Box (LB) test [17], etc., they cannot be used directly to test the hypothesis above. In view of this, we propose an empirical likelihood-based statistic for testing this issue by taking into account the AR structure. Note that the setting in Case (ii) causes issues in the derivation of the asymptotic distribution, as well as the related applications. A new data-splitting idea is also employed in order to unify Cases (i)–(iii). It turns out that the proposed statistic converges in distribution to a standard chi-squared distributed variable regardless of

{X_{t}}

being stationary or non-stationary, due to the special block structure of the asymptotic covariance matrix. The simulations show that our method has a good size, as well as nontrivial power performance in finite sample cases.

As a nonparametric method, empirical likelihood (EL) was firstly proposed by [18]. Because of its many excellent properties, i.e., no need to assume the parameter distribution in advance, it has been widely used in the literature when parametric methods do not work well to produce satisfactory results. Many authors have devoted themselves to extending this method. To name but a few, Ref. [19] obtained confidence regions for vector-valued statistical functions, which is a multivariate generalization of the work of [18]. Refs. [20,21] extended the empirical likelihood method to the setting of regression models and general estimation equations, respectively. Recently, Ref. [22] discussed the possibility of constructing unified tests by using empirical likelihood based on a weighted technique for time series models. Ref. [13] further extended this weighted technique to AR models with AR errors. Further, Ref. [23] applied the empirical likelihood method to test the heteroscedasticity for errors of single-index model. Ref. [6] developed a unified empirical likelihood inference method to test the predictability regardless of the properties of the predicting variable. Ref. [24] considered the unified test problem in a predictive regression model. To move the effect of the possible existence of an intercept, the idea of data-splitting has also been developed in [24]. The literature above inspired the current research.

We organize the rest of this paper as follows. Section 2 develops the unified test for the AR structure of the AR models. Section 3 reports the finite-sample simulation results. Section 4 applies the proposed test to the exchange rates between the U.S. dollar and eight countries. Section 5 concludes this paper. The detailed proof of the main theorem is specified in Appendix A.

2. Methodologies and Asymptotic Results

Supposing the random observations

{X_{t}}_{t = 1}^{n}

are generated from the model (2) with possible AR errors. Formulate

ψ = (ψ_{1}^{⊤}, ψ_{2}^{⊤})^{⊤}

and let

ψ_{0} = {(ψ_{1, 0}^{⊤}, ψ_{2, 0}^{⊤})}^{⊤}

be its true value.

Note that when

θ = θ_{0}

,

{e_{t} (θ)}

is a sequence of iid variables, it is more efficient to construct a statistical procedure on

\sum_{t = p + 1}^{n} e_{t}^{2} (θ)

than on

\sum_{t = p + 1}^{n} {(X_{t} - μ - ϕ X_{t - 1})}^{2}

, as discussed in [13], where

\begin{matrix} e_{t} (θ) & = & (X_{t} - μ - ϕ X_{t - 1} + \sum_{i = 1}^{p} ψ_{i} (X_{t - i} - μ - ϕ X_{t - i - 1})), and \\ ε_{t} (θ) & = & X_{t} - μ - ϕ X_{t - 1}, \end{matrix}

for a given

θ = {(μ, ϕ, ψ^{⊤})}^{⊤}

. However, their method depends on an assumption that the structure of the AR errors has been correctly specified, which needs to be pretested in practice. This motivates us to consider the following hypothesis:

H_{0} : ψ_{2} = ψ_{2, 0} v e r s u s H_{1} : ψ_{2} \neq ψ_{2, 0} .

Remarkably, when

ψ_{2, 0} = 0

,

{ε_{t}}

is a sequence of iid errors.

Note that when

θ

takes the true underlying value

θ_{0}

, we have

\begin{matrix} E (Z_{t}^{*} (θ) | F_{t - 1}) = 0, for t = p + 1, \dots, n, \end{matrix}

where

Z_{t}^{*} (θ) = {(Z_{t, 1}^{*} (θ), \dots, Z_{t, 2 + p}^{*} (θ))}^{⊤}

,

F_{t}

denotes the sigma field generated by

{e_{s} : t \leq t}

, and

\begin{matrix} \{\begin{matrix} Z_{t, 1}^{*} (θ) = e_{t} (θ) \\ Z_{t, 2}^{*} (θ) = e_{t} (θ) (X_{t - 1} + \sum_{i = 1}^{p} ψ_{i} X_{t + m - i - 1}) \\ Z_{t, 2 + i}^{*} (θ) = e_{t} (θ) (X_{t - i} - μ - ϕ X_{t - i - 1}), i = 1, \dots, p, \end{matrix} \end{matrix}

which can be obtained by taking the partial differential to the sum of least squares, i.e.,

\begin{matrix} \sum_{t = p + 1}^{n} {(X_{t} - μ - ϕ X_{t - 1} + \sum_{j = 1}^{p} ψ_{j} (X_{t - j} - μ - ϕ X_{t - j - 1}))}^{2}, \end{matrix}

with respect to

ψ

. Then, similar to [21], one can use the profile empirical likelihood method to construct a test for hypothesis

H_{0}

based on

{Z_{t}^{*} (θ)}

.

However, following [22], it is easy to verify that the resulting test does not converge in distribution to a standard chi-squared variable because the quantity

\frac{1}{\sqrt{n}} \sum_{t = p + 1}^{n} Z_{t}^{*} (θ_{0}) Z_{t}^{*} {(θ_{0})}^{⊤}

does not converge in probability for Case (ii), i.e.,

μ = 0

and

ϕ = 1 + \frac{c}{n}

for some nonzero constant c (nearly integrated if

c \neq 0

, and unit root if

c = 0

). As an improvement, one may use the weighted technique developed in [22] to construct a weighted empirical likelihood-based test. Unfortunately, the resulting testing statistic still faces a similar problem in the optimization step during the process of profiling the redundant parameters; see a similar discussion in [25].

To overcome this problem, we propose the construction of the following empirical likelihood function for

θ

:

\begin{matrix} L (θ) = sup \{\prod_{t = p + 1}^{m} m δ_{t} : δ_{1} \geq 0, \dots, δ_{m} \geq 0, \sum_{t = p + 1}^{m} δ_{t} = 1, \sum_{t = p + 1}^{m} δ_{t} Z_{t} (θ) = 0\}, \end{matrix}

based on the data-splitting idea, where

Z_{t} (θ) = {(Z_{t, 1} (θ), \dots, Z_{t, 2 + p} (θ))}^{⊤}

with

\begin{matrix} \{\begin{matrix} Z_{t, 1} (θ) = e_{t} (θ) \\ Z_{t, 2} (θ) = e_{t + m} (θ) (\frac{X_{t + m - 1}}{\sqrt{1 + X_{t + m - p - 1}^{2}}} + \sum_{i = 1}^{p} ψ_{i} \frac{X_{t + m - i - 1}}{\sqrt{1 + X_{t + m - p - i - 1}}}) \\ Z_{t, 2 + i} (θ) = e_{t} (θ) (X_{t - i} - μ - ϕ X_{t - i - 1}), i = 1, \dots, p, \end{matrix} \end{matrix}

(3)

where

m = [n / 2]

with

[\cdot]

is the floor function. That is, we use the second half of the data to handle

ϕ

, and the first half of the data to handle the rest of the parameters. Here,

\sqrt{1 + X_{t + m - p - i - 1}}

is mainly used for technical consideration, which can relieve the correlation among

{Z_{t} (θ)}

, and consequently improve the finite sample performance of the EL test.

Since our aim is to test

H_{0}

related to

ψ

, we are only interested in the parameter

ψ

. To this end, we treat the other parameters as redundant parameters, as in [21], and obtain the profile empirical likelihood ratio as

ℓ^{p} (ψ) : = {min}_{μ, ϕ} ℓ (μ, ϕ, ψ) .

To derive the asymptotic result for

ℓ^{p} (ψ)

, we need the following regular conditions:

(C1) Suppose ${X_{t}}$ follows one of the following cases:
-
(i) (Stationary) $| ϕ | < 1$ , independent of n;
-
(ii) (Non-stationary without an intercept) $ϕ = 1 - \frac{c}{n}$ for some constant c independent of n with $μ = 0$ ;
-
(iii) (Non-stationary with an intercept) $ϕ = 1 - \frac{c}{n}$ for some constant c independent of n with $μ \neq 0$ ;
(C2) $ψ (z) = 1 - \sum_{j = 1}^{p} ψ_{j} z^{j} \neq 0$ when $| z | < 1$ , and $ψ (z)$ has no common root with $ψ_{p} \neq 0$ .
(C3) ${e_{t}}$ are iid random errors, and satisfy $E (| e_{t} |^{2 + δ}) < \infty$ for some constant $δ > 0$ .

These conditions are quite common, and can be found in studies such as [13]. Here, (C2) is assumed to guarantee the stationarity of

{ε_{t}}

.

Under these conditions, we have the following result.

Theorem 1.

Suppose Conditions (C1)–(C3) hold. Then, under the null hypothesis

H_{0}

,

\begin{matrix} ℓ^{p} (ψ_{0}) \overset{d}{⟶} χ_{p}^{2}, \end{matrix}

as

n \to \infty

, where

χ_{p}^{2}

denotes a chi-squared random variable with p degrees of freedom, and ‘

\overset{d}{⟶}

’ denotes the convergence in distribution.

Remark 1.

Using a similar proof to that of Theorem 1, we can show that

\begin{matrix} {\tilde{ℓ}}^{p} (ψ_{0, 2}) \overset{d}{⟶} χ_{r}^{2}, as n \to \infty, \end{matrix}

where

{\tilde{ℓ}}^{p} (ψ_{0, 2}) = {min}_{μ, ϕ, ψ_{1}} ℓ (μ, ϕ, {(ψ_{1}^{⊤}, ψ_{0, 2}^{⊤})}^{⊤})

with r being the dimension of

ψ_{0, 2}

, which is the true value.

Theorem 1 is desirable because it shows that the proposed test has a standard chi-squared distribution asymptotically, regardless of which one of the Cases (i)–(iii) is followed by

{X_{t}}

. Based on Theorem 1, we may reject the null hypothesis

H_{0}

once

ℓ^{p} (ψ_{0}) > χ_{r}^{2} (1 - a)

at the significance level

a \in (0, 1)

, where

χ_{r}^{2} (1 - a)

denotes the

(1 - a)

-th quantile of

χ_{r}^{2}

.

3. Simulation Results

In this section, we conduct some simulations to investigate the finite sample performance of the proposed test in terms of both size and power. The simulations consist of three parts. In the first part, we investigate the finite sample performance of the proposed profile empirical likelihood, and compare it with a combination of the LB test and the Akaike information criterion (AIC), i.e., using firstly the LB test to detect whether there exists a serial correlation in the residuals, and then by employing the AIC to determine the order of the AR structure in residuals. In the second part, we investigate the possibility of using the proposed method to test whether or not

ψ

is equal to some given

ψ_{0}

, which may be useful when verifying the extent of the stationarity of the AR errors. Note that the combination of the LB and AIC cannot be used to fulfill this type of task. In the last part, we study the impact of misdetermining the AR structure of the errors on the finite sample performance of the unit root test developed in [13]. The LB test is computed with the R function Box.test.R, while for the computing of the profile empirical likelihood, we first use R package emplik to obtain the log-empirical likelihood ratio, and then optimize this log ratio by using the nlm.R function. All of these R functions are well-documented, and are currently available from the CRAN of the R-project.

In the first part, the random observations

{X_{t}}

are generated from the model (2) with

μ \in {0, 0.01}

, which indicates that the model has no intercept and an intercept item, respectively. We take

ϕ

from

{0.5, 1, 1 - \frac{1}{n}}

, where 0.5 indicates that

X_{t}

is a stationary process, and 1 indicates that it is a unit root process, while

1 - \frac{1}{n}

indicates a near unit root process.

{e_{t}}

is a sequence of iid random variables with means of zero and variances of one.

{ε_{t}}

follows the three different scenarios listed below.

S1: The null hypothesis $H_{0}^{(1)} : ψ = {(0, 0, 0)}^{⊤}$ , i.e., $ε_{t}$ has no serial correlation. The local alternative hypothesis: $ψ = ψ_{0} = {(d / \sqrt{n}, 0, 0)}^{⊤}$ for some $d > 0$ .
S2: The null hypothesis $H_{0}^{(2)} : ψ = {(0.1, 0, 0)}^{⊤}$ , i.e., $ε_{t}$ has first-order serial correlation. The local alternative hypothesis: $ψ = ψ_{0} = {(0.1, d / \sqrt{n}, 0)}^{⊤}$ for some $d > 0$ .
S3: The null hypothesis $H_{0}^{(3)} : ψ = {(0.1, 0.1, 0)}^{⊤}$ , i.e., $ε_{t}$ has second-order serial correlation. The local alternative hypothesis: $ψ = ψ_{0} = {(0.1, 0.1, d / \sqrt{n})}^{⊤}$ for some $d > 0$ .

In all Scenarios S1–S3, d is taken from

{1, 3, 5, 7}

. All computations are carried out 10,000 times with n ranging from 300 to 1200.

Table 1 reports the size performance of the proposed method with different settings at the significance levels

τ = 0.05

. We also report the ratios of determining the order of the AR error incorrectly by using the AIC of Scenarios S1–S3 under the condition of

H_{0}

for comparison. The EL method has a good performance in all Scenarios S1–S3. The results show that the size values of the EL method gradually converge to the significance level as the sample size n increases, no matter whether

X_{t}

is a stationary process, a near unit root process, or a unit root process, and regardless of whether

μ

is 0 or not. Conversely, for the AIC method, when

X_{t}

follows a stationary process, the ratios of determining the order of the AR error incorrectly are only closer to 5% in S1. Note that it performs poorly for the rest of the settings, meaning that it is affected greatly by the stationarity of

{X_{t}}

.

Figure 1 shows the power performance of the EL method. We can see that in S1 and S2, when

X_{t}

follows a stationary process, the convergence rate is the slowest. When

X_{t}

follows a near unit root process or a unit root process, as the value of d increases, the power converges quickly to 1. In S3, when

X_{t}

follows a near unit root process or a unit root process, as the value of d increases to 7, the power values have a slightly descending tendency. This implies that although the stationarity of

{X_{t}}

does not impact on the order of the local alternative hypothesis, it does affect the power function of the EL method.

In the second part, we consider testing whether or not

ψ

is equal to some given

ψ_{0}

. We simulate two settings, i.e.,

(I): The null hypothesis ${\tilde{H}}_{0}^{(1)} : ψ = (ψ_{1}, ψ_{2}) = {(0.1, 0.3)}^{⊤}$ against the local alternative hypothesis: $ψ = ψ_{0} = {(0.1 + \frac{d}{\sqrt{n}}, 0.3 + \frac{d}{\sqrt{n}})}^{⊤}$ , for some $d > 0$ .
(II): The null hypothesis ${\tilde{H}}_{0}^{(2)} : ψ_{2} = 0.3$ , against the local alternative hypothesis: $ψ_{2} = 0.3 + \frac{d}{\sqrt{n}}$ , for some $d > 0$ .

The other parameters are the same as those in the first part. The size (

d = 0

) and power (

d \in {3, 5, 10}

) performances are shown in Figure 2. As expected, similar observations can be found in Figure 3 as in the first part of simulations.

The simulation results in the first and second parts show that the proposed EL method has a good performance in specifying the AR error structure and testing whether or not

ψ

is equal to some given

ψ_{0}

, thereby confirming the theoretical result obtained in Theorem 1. It is worth noting that when taking the AR error structure into account, accurate identification is crucial, because it will affect the unit root test of the AR model. Therefore, in the third part, we conduct the following simulation to show the benefit of conducting a predefined test.

Step 1: We generate an AR model with an AR(1) error structure, and the parameters are

μ = 0, ϕ = 1 - \frac{1}{n}, ψ = \frac{10}{\sqrt{n}}

. Then, we use the EL and AIC methods to determine the order of the AR error. We consider sample sizes of 600 and 1200, repeat the tests 10,000 times, and record the order determination counts under the two methods. The results are shown in Figure 3. The abscissa represents the order of the AR error, and the ordinate is the number of each order. It can be seen from Figure 3 that under all sample sizes, the two methods show that the residuals have a serial correlation. For the EL test, in 10,000 experiments, 9398 of them are correctly ordered, and the error rate is only 6.02%. When the sample size increases to 1200, the error rate decreases to 5.84%. For the AIC, when the sample size is 600, the error rate is 34.21%, and when the sample size is 1200, the error rate is 22.75%. It is obvious that compared with the AIC method, the EL test has advantages in identifying the order of the correlated errors, which is consistent with the above simulation results.

Step 2: We use the method proposed in [13] to test the unit root of an AR (1) model when the AR error order is correctly and incorrectly determined. Table 2 records the probability of identifying a unit root when the real data are a near unit root. The results show that when the true underlying structure of the AR error is incorrectly specified, the power of the test proposed in [13] suffers from a loss compared to the case when the true underlying structure of the AR error is correctly specified. This shows the necessity of correctly testing the AR error structure before conducting the unit root test if one wants to obtain a more reliable unit root test result.

To summarize, the EL method proposed in this paper has obvious advantages in identifying the AR error structure, and these two methods are crucial in the subsequent real data analysis.

4. A Financial Real Data Application

In this section, we provide a real financial data example. The purpose of this section is to explore the error structure of different exchange rate markets. We collected the exchange rates of eight countries, including developed and developing countries, against the U.S. dollar. Currencies from developed countries include the Canadian dollar (CAD), Norwegian Kroner (NKR), Singapore dollar (SGD), Swedish Kronor (SKR) and Japanese yen (JPY). Currencies from developing countries include Chinese yuan (CNY), Thai baht (THB) and Sri Lanka Rupees (SRE). All data are downloaded from FRED database (fred.stlouisfed.org). The sample period is the daily data from 2 January 2017 to 31 December 2020 (

n = 1044

). Their time series graphs are provide in Figure 4.

We report the least squares estimation of the unknown parameters

μ

and

ϕ

, and the testing results of the EL, LB, and AIC methods, where the LB and AIC tests were conducted on residuals obtained from the least squares method. All results are listed in Table 3, in which the second and third columns are the estimated intercept and autoregressive coefficients, respectively; the fourth column is the order determination result of the EL test; the fifth column is the p-values of the EL test; and the last two columns are the p-values of LB test and the order determination result of AIC, respectively. The AIC test shows that most sequences have a serial correlation, except for CNY and JPY, while the EL method indicates that only one country’s data has an AR error of up to an order of 2. Note that the AIC tends to determine the correlated errors with a higher order than for Cases (ii)–(iii), while for most cases, the estimated

\hat{ϕ}

is very close to 1, i.e., a near unit root. It seems that the testing results for this dataset coincide roughly with the observations in the simulations.

5. Conclusions

The AR model is widely used in time series data modeling. However, the direct application of AR models with iid errors is inadequate sometimes. A common practice is to further assume that AR models have errors of AR structure. Note that the relevant structure of the error affects the statistical inference of the AR model. Therefore, it is important to test the error structure of the model in both theoretical and practical analyses, which is not considered in the literature to the best of our knowledge. Motivated by this, this paper proposed a consistency empirical likelihood test method based on the idea of data splitting. The limit distribution of the EL statistic was proved to be chi-squared asymptotically regardless of the process

{X_{t}}

being stationary or non-stationary, and with or without an intercept term. The proof is challenging and different from that of the traditional profile empirical likelihood in [21], as the quantity

\frac{1}{\sqrt{n}} \sum_{t = p + 1}^{n} Z_{t} (θ_{0})

does not converge in distribution to a normally distributed vector. Fortunately, the limit distribution of the profile empirical likelihood-based test is still chi-squared because of the special block structure of the asymptotic covariance matrix. The simulation results illustrated that the proposed method could not only have a good performance in specifying the AR error structure, but also could sufficiently test whether the coefficients of the error item are equal to some given values, which can not be achieved by some existing serial correlation tests in the literature. The technique in the proof of Theorem 1 is challenging as in Case (ii) the theoretical proof involves handling the convergence in space, and the special block structure of the asymptotic covariance matrix. Hopefully, it is of potential usage in practice as it is difficult to detect whether the process

{X_{t}}

is a unit root or near unit root process. Note that it is not necessary to make clear which case of (i)–(iii) the

{X_{t}}

follows when using our proposed test in practice.

As noted by an an anonymous reviewer, an issue of interest is whether the current result can be extended to the case when

ε_{t}

follows a autoregressive moving average model. Note that the discussion in this paper involves nonstationary Cases (ii) and (iii); it seems challenging to derive the related theoretical results. We will further consider this in the future.

Author Contributions

Conceptualization, X.L.; Methodology, X.L., X.W. and Y.F.; Software, X.W.; Validtion, X.L., Q.L. and L.T.; Formal analysis, X.W. and Y.F.; Investigation, Q.L. and L.T.; Resources, X.L.; Data curation, X.W.; Writing—original draft preparation, X.W., X.L., Y.F., L.T. and Q.L.; Writing—review and editing, X.L. and X.W.; Visualization, Q.L. and L.T.; Supervision, Q.L. and L.T.; Project administration, X.L.; Funding acquisition, X.L. All authors have read and agreed to the published version of the published version of the manuscript.

Funding

Xiaohui Liu’s research was funded by NSF of China grant number 11971208, National Social Science Foundation of China grant number 21&ZD152 and the Outstanding Youth Fund Project of the Science and Technology Department of Jiangxi Province grant number 20224ACB211003. Li Tan’s research was funded by Natural Science Foundation of China grant number 12061034 and China Postdoctoral Science Foundation grant number 2022M711424. Qing Liu’s research was funded by NSF of Jiangxi Province grant number 20192BAB201005, China Post doctoral Science Foundation grant number 2020M671961, and Post doctoral Science Foundation of Jiangxi Province grant number 2019KY47 and The APC was funded by Xiaohui Liu.

Data Availability Statement

The real data used in the artical can be found at fred.stlouisfed.org.

Acknowledgments

We thank the three anonymous reviewers, the Associate Editor, and the Guest Editor of this Special Issue for their insightful comments, which have led to many improvements in this paper. Xiaohui Liu’s research is supported by NSF of China (Grant No. 11971208), National Social Science Foundation of China (21&ZD152) and the Outstanding Youth Fund Project of the Science and Technology Department of Jiangxi Province (No. 20224ACB211003). Li Tan’s research is supported by Natural Science Foundation of China (No. 12061034) and China Postdoctoral Science Foundation (No. 2022M711424). Qing Liu’s research is supported by NSF of Jiangxi Province (No. 20192BAB201005), China Post doctoral Science Foundation (No. 2020M671961), and Post doctoral Science Foundation of Jiangxi Province (No. 2019KY47).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of the Main Result

In this appendix, we provide the detailed proofs for the main results. Before proceeding further, we need to first provide some necessary lemmas. For convenience, denote

μ_{0}

,

ϕ_{0}

, and

ψ_{0} : = {(ψ_{1, 0}, ψ_{2, 0}, \dots, ψ_{p, 0})}^{⊤}

as the true values of

μ

,

ϕ

, and

ψ : = {(ψ_{1}, ψ_{2}, \dots, ψ_{p})}^{⊤}

, respectively. Write

θ_{0} : = {(μ_{0}, ϕ_{0}, ψ_{0}^{⊤})}^{⊤}

, and let

F_{t}

be the sigma field generated by

{e_{s} : 1 \leq s \leq t, m + 1 \leq s \leq m + t}

.

For convenience, write

S_{t} : = {(S_{t, 1}, {\tilde{S}}_{t}^{⊤})}^{⊤} = {(S_{t, 1}, S_{t, 2}, \dots, S_{t, p + 1})}^{⊤}

, where

\begin{matrix} S_{t, 1} = \frac{1}{\sqrt{m}} \sum_{i = 1}^{t} e_{i}, S_{t, 2} = \frac{1}{\sqrt{m}} \sum_{i = 1}^{t} e_{i} ε_{i - 1}, \dots, S_{t, p + 1} = \frac{1}{\sqrt{m}} \sum_{i = 1}^{t} e_{i} ε_{i - p}, \end{matrix}

for

t = 1, 2, \dots, n

. By following [2], it is easy to check for any

s \in (0, 2]

that

\begin{matrix} S_{[n s]} \Rightarrow W (s) : = {(W_{e} (s), \tilde{W} {(s)}^{⊤})}^{⊤} = {(W_{e} (s), {\tilde{W}}_{1} (s), \dots, {\tilde{W}}_{p} (s))}^{⊤}, \end{matrix}

(A1)

under Case (ii) as

n \to \infty

, where ‘⇒’ denotes the convergence in space

D (0, 2]

which is the space of real-valued functions of the interval

(0, 2]

that are right continuous and have finite left limits,

[\cdot]

denotes the floor function, and

W (s)

is a vector of Gaussian processes with covariance matrix diag

{σ_{e}^{2}, Σ_{22}}

with

σ_{e}^{2} = E (e_{1}^{2})

and

\begin{matrix} Σ_{22} & = & (\begin{matrix} σ_{e}^{2} E (ε_{1}^{2}) & \dots & σ_{e}^{2} E (ε_{1} ε_{p}) \\ ⋮ & ⋱ & ⋮ \\ σ_{e}^{2} E (ε_{p} ε_{1}) & \dots & σ_{e}^{2} E (ε_{p}^{2}) \end{matrix}) . \end{matrix}

Lemma A1.

Under the same conditions of Theorem 1, as

n \to \infty

, we obtain

For Case (i),

$\begin{matrix} \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{0}) \overset{d}{⟶} N (0, Σ), \end{matrix}$

(A2)

where ‘ $\overset{p}{⟶}$ ’ denotes the convergence in probability, and $Σ = diag {Σ_{11}, Σ_{22}}$ and

$\begin{matrix} Σ_{11} & = & (\begin{matrix} σ_{e}^{2} & 0 \\ 0 & σ_{e}^{2} \cdot lim_{t \to \infty} E {(\frac{X_{t + m - 1}}{\sqrt{1 + X_{t + m - p - 1}^{2}}} + \sum_{i = 1}^{p} ψ_{i} \frac{X_{t + m - i - 1}}{\sqrt{1 + X_{t + m - p - i - 1}}})}^{2} \end{matrix}) . \end{matrix}$
For Case (ii),

$\begin{matrix} \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{0}) \\ = & {(W_{e} (1), ω_{2 m}^{*} W_{e} (2) - ω_{m}^{*} W_{e} (1), {\tilde{W}}_{1} (1), \dots, {\tilde{W}}_{p} (1))}^{⊤} + o_{p} (1), \end{matrix}$

(A3)

where, for $k = m, 2 m$ ,

$\begin{matrix} ω_{k}^{*} = (\frac{X_{k - p - 1}}{\sqrt{1 + X_{k - p - 1}^{2}}} + \sum_{i = 1}^{p} ψ_{i} \frac{X_{k - p - i - 1}}{\sqrt{1 + X_{k - p - i - 1}}}) . \end{matrix}$
For Case (iii),

$\begin{matrix} \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{0}) \overset{d}{⟶} N (0, \tilde{Σ}), \end{matrix}$

(A4)

where $\tilde{Σ} = diag {σ_{e}^{2}, σ_{e}^{2}, Σ_{22}}$ .
for Case (i), or $Σ_{11} = diag {σ_{e}^{2}, σ_{e}^{2}}$ for Cases (ii) and (iii). Note that $σ_{e}^{2} = E (e_{t}^{2})$ .

Proof of Lemma A1.

Put

e_{t} = e_{t} (θ_{0})

and

ε_{t} = (X_{t - i} - μ_{0} - ϕ_{0} X_{t - i - 1})

for

i = 1, 2, \dots, p

when

θ = θ_{0}

. Then, it is easy to check that

{Z_{t} (θ_{0})}

is a martingale difference sequence (MDS) with respect to the filter

{F_{t}}

.

Next, for Case (i), under the conditions of Theorem 1, both

{X_{t}}

and

{ε_{t}}

are strictly stationary. Hence, for any

a : = {(a_{1}, a_{2}, \dots, a_{p + 2})}^{⊤} \in R^{p + 2}

, we obtain

\begin{matrix} \frac{1}{m} \sum_{t = p + 1}^{m} E ({(a^{⊤} Z_{t} (θ_{0}))}^{2} | F_{t - 1}) \\ = & a^{⊤} (\frac{1}{m} \sum_{t = p + 1}^{m} E (Z_{t} (θ_{0}) Z_{t}^{⊤} (θ_{0}) | F_{t - 1})) a \\ \overset{p}{⟶} & a^{⊤} Σ a, \end{matrix}

(A5)

as

n \to \infty

by using the law of large number for MDS [26]. Similarly, for any arbitrarily small

ϵ > 0

, we obtain

\begin{matrix} \frac{1}{m} \sum_{t = p + 1}^{m} E (| a^{⊤} Z_{t} (θ_{0}) |^{2} I (| a^{⊤} Z_{t} (θ_{0}) | \geq ϵ \sqrt{m}) | F_{t - 1}) \\ \leq & \frac{1}{ϵ^{δ_{1}} n^{δ_{1} / 2}} \frac{1}{m} \sum_{t = p + 1}^{m} E (| a^{⊤} Z_{t} (θ_{0}) |^{2 + δ_{1}} | F_{t - 1}) \\ \leq & \frac{{(p + 2)}^{1 + δ_{1}} {∥ a ∥}^{2 + δ_{1}}}{ϵ^{δ_{1}} n^{δ_{1} / 2}} \frac{1}{m} \sum_{t = p + 1}^{m} E (\sum_{i = 1}^{p + 2} | Z_{t, i} (θ_{0}) |^{2 + δ_{1}} | F_{t - 1}) \\ \overset{p}{⟶} & 0, \end{matrix}

(A6)

by noting that

E (| e_{t} |^{2 + δ_{1}}) < \infty

, which implies

E (| ε_{t} |^{2 + δ_{1}}) < \infty

and in turn

E (| X_{t} |^{2 + δ_{1}}) < \infty

when

{ε_{t}}

and

{X_{t}}

are strictly stationary. (A5) and (A6) together show the normality for Case (i) by using the central limit theorem for MDS [26].

For Case (ii), note that

\begin{matrix} X_{t + m - i - 1} = \sum_{k = 1}^{p} ϕ^{k - 1} ε_{t + m - i - k + 1} + ϕ^{p} X_{t + m - i - p - 1}, i = 1, 2, \dots, p, \end{matrix}

and by [2,27], it holds that for any

s \in (0, 1]

\begin{matrix} \frac{X_{[n s]}}{\sqrt{n}} \Rightarrow J_{c} (s) : = \int_{0}^{s} e^{- c (s - r)} d W (r) in the space D ((0, 1]), \end{matrix}

(A7)

as

n \to \infty

. Using these, it is easy to check that, as

n \to \infty

,

\begin{matrix} \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} e_{t + m} (\frac{X_{t + m - 1}}{\sqrt{1 + X_{t + m - p - 1}^{2}}} + \sum_{i = 1}^{p} ψ_{i} \frac{X_{t + m - i - 1}}{\sqrt{1 + X_{t + m - p - i - 1}}}) \\ = & \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} e_{t + m} (\frac{X_{t + m - p - 1}}{\sqrt{1 + X_{t + m - p - 1}^{2}}} + \sum_{i = 1}^{p} ψ_{i} \frac{X_{t + m - p - i - 1}}{\sqrt{1 + X_{t + m - p - i - 1}}}) + o_{p} (1) . \end{matrix}

Note that

\begin{matrix} \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} e_{t + m} (\frac{X_{t + m - p - 1}}{\sqrt{1 + X_{t + m - p - 1}^{2}}} + \sum_{i = 1}^{p} ψ_{i} \frac{X_{t + m - p - i - 1}}{\sqrt{1 + X_{t + m - p - i - 1}}}) \\ : = & \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} e_{t + m} ω_{t + m}^{*} \\ = & \sum_{t = p + 1}^{m} (S_{t + m, 1} - S_{t + m - 1, 1}) ω_{t + m}^{*} \\ = & S_{2 m, 1} ω_{2 m}^{*} - S_{p + m, 1} ω_{p + m + 1}^{*} + \sum_{t = p + 1}^{m - 1} S_{t + m, 1} (ω_{t + m}^{*} - ω_{t + m + 1}^{*}) . \end{matrix}

Using (A7), we obtain

w_{2 m} \overset{p}{\to} sgn (J_{c} (2))

, where sgn

(\cdot)

denotes the sign function. As

X_{[n s]} = O_{p} (\sqrt{m})

and

n \to \infty

for Case (ii), it is easy to check that there exists some

d \in (0, \frac{1}{2})

, for

i = 0, 1, \dots, p

, such that

\begin{matrix} |\sum_{t = p + 1}^{m - 1} S_{t + m, 1} (\frac{X_{t + m - p - i - 1}}{\sqrt{1 + X_{t + m - p - i - 1}}} - \frac{X_{t + m - p - i}}{\sqrt{1 + X_{t + m - p - i}}})| \\ = & |\sum_{t = p + 1}^{m - 1} S_{t + m, 1} \frac{X_{t + m - p - i} - X_{t + m - p - i - 1}}{{(1 + ξ_{t, i, *}^{2})}^{\frac{3}{2}}}| \\ \leq & O_{p} (m^{- d}) \times \frac{1}{m} \sum_{t = p + 1}^{m - 1} {| S_{t + m, 1} | | X_{t + m - p - i} - X_{t + m - p - i - 1} |} = o_{p} (1), \end{matrix}

where

ξ_{t, i, *}

lies between

X_{t + m - p - i}

and

X_{t + m - p - i - 1}

. This shows

\begin{matrix} \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{0}) \\ = & {(S_{m, 1}, S_{2 m, 1} ω_{2 m}^{*} - S_{p + m, 1} ω_{p + m + 1}^{*}, S_{m, 2}, \dots, S_{m, p + 1})}^{⊤} + o_{p} (1), as n \to \infty . \end{matrix}

Then, the asymptotic result for Case (ii) follows immediately based on (A1).

Case (iii) can be proved similarly as Cases (i) and (ii). We omit the details. □

Lemma A2.

Under the same conditions of Theorem 1, as

n \to \infty

, we find that

(a) $\frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{*}) = \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{0}) + O_{p} (1)$ , uniformly for $(μ, ϕ) \in B$ ,
(b) $\frac{1}{m} \sum_{t = p + 1}^{m} Z_{t} (θ_{*}) Z_{t}^{⊤} (θ_{*}) = \frac{1}{m} \sum_{t = p + 1}^{m} Z_{t} (θ_{0}) Z_{t}^{⊤} (θ_{0}) + o_{p} (1) = Σ_{0} + o_{p} (1)$ , uniformly for $(μ, ϕ) \in B$ ,
(c) ${max}_{p + 1 \leq t \leq m} {sup}_{B} ∥ Z_{t} (θ_{*}) ∥ = o_{p} (\sqrt{m})$ ,
where $θ_{*} = {(μ, ϕ, ψ_{0}^{⊤})}^{⊤}$ ,

$\begin{matrix} B = \{\begin{matrix} {(μ, ϕ) : | μ - μ_{0} | + | ϕ - ϕ_{0} | < C / \sqrt{m}} & for Case (i), \\ {(μ, ϕ) : | μ - μ_{0} | + \sqrt{m} | ϕ - ϕ_{0} | < C / \sqrt{m}} & for Case (ii), \\ {(μ, ϕ) : | μ - μ_{0} | + m | ϕ - ϕ_{0} | < C / \sqrt{m}} & for Case (iii), \end{matrix} \end{matrix}$

for some positive constant C, $Σ_{0} = d i a g {Σ_{11}, Σ_{22}}$ for Case (i), and $d i a g {σ_{e}^{2}, σ_{e}^{2}, Σ_{22}}$ for Cases (ii) and (iii).

Proof of Lemma A2.

We only prove Parts (a) and (c), as the proof of (b) is trivial based on those of (a) and (c).

For Part (a), note that

\begin{matrix} e_{t} (θ_{*}) - e_{t} & = & X_{t} - μ - ϕ X_{t - 1} + \sum_{i = 1}^{p} ψ_{i, 0} (X_{t - i} - μ - ϕ X_{t - i - 1}) \\ - {X_{t} - μ_{0} - ϕ_{0} X_{t - 1} + \sum_{i = 1}^{p} ψ_{i, 0} (X_{t - i} - μ_{0} - ϕ_{0} X_{t - i - 1})} \\ = & - (μ - μ_{0}) (1 + \sum_{i = 1}^{p} ψ_{i, 0}) - (ϕ - ϕ_{0}) {X_{t - 1} + \sum_{i = 1}^{p} ψ_{i, 0} X_{t - i - 1}}, \end{matrix}

and

(X_{t - i} - μ - ϕ X_{t - i - 1}) - ε_{t - i} = (μ - μ_{0}) + (ϕ - ϕ_{0}) X_{t - i - 1}

for any

t = p + 1, \dots, 2 m

and

i = 0, 1, \dots, p

. Hence, we have

\begin{matrix} sup_{B} |\frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} (Z_{t, 1} (θ_{*}) - Z_{t, 1} (θ_{0}))| \\ \leq & sup_{B} |- \sqrt{m} (μ - μ_{0}) (1 + \sum_{i = 1}^{p} ψ_{i, 0})| \\ sup_{B} |- m (ϕ - ϕ_{0}) \frac{1}{m \sqrt{m}} \sum_{t = p + 1}^{m} {X_{t - 1} + \sum_{i = 1}^{p} ψ_{i, 0} X_{t - i - 1}}| \\ = & C \cdot |(1 + \sum_{i = 1}^{p} ψ_{i, 0})| \cdot \{1 + |\int_{0}^{1} J_{c} (s) d s|\} \\ = & O_{p} (1), as n \to \infty . \end{matrix}

The proofs of

{sup}_{B} |\frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} (Z_{t, k} (θ_{*}) - Z_{t, k} (θ_{0}))|

, for

k = 2, \dots, p + 2

, follow a similar fashion. This shows Part (a).

For Part (c), based on the decomposition of

e_{t} (θ_{*})

given in Part (a), we similarly have

\begin{matrix} max_{p + 1 \leq t \leq m} sup_{B} | Z_{t, 1} (θ_{*}) | & \leq & max_{p + 1 \leq t \leq m} | e_{t} | + | 1 + \sum_{i = 1}^{p} ψ_{i, 0} | \cdot sup_{B} | μ - μ_{0} | \\ + sup_{B} | ϕ - ϕ_{0} | \cdot max_{p + 1 \leq t \leq m} | X_{t - 1} + \sum_{i = 1}^{p} ψ_{i, 0} X_{t - i - 1} | \\ = & o_{p} (\sqrt{m}), \end{matrix}

by using the Markov inequality based on the conditions of Theorem 1 as

n \to \infty

.

{max}_{p + 1 \leq t \leq m} {sup}_{B} | Z_{t, k} (θ_{*}) | = o_{p} (\sqrt{m})

,

k = 2, \dots, p + 2

, can be proved similarly. We omit the details. This shows Part (c). □

Proof of Theorem 1.

In the following, we only prove Case (ii), as Cases (i) and (iii) follow a similar fashion.

Based on Lemmas A1 and A2, we can show by using similar techniques as in Theorem 1 of [28] that

\begin{matrix} ℓ (μ, ϕ, ψ_{0}) = {(\frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{*}))}^{⊤} Σ_{0}^{- 1} (\frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{*})) + o_{p} (1), \end{matrix}

(A8)

uniformly for

(μ, ϕ) \in B

. Note that

(μ_{0}, ϕ_{0}) \in B

. Trivially, with (A9), it follows

\begin{matrix} ℓ (μ_{0}, ϕ_{0}, ψ_{0}) = {(\frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{0}))}^{⊤} Σ_{0}^{- 1} (\frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{0})) + o_{p} (1), \end{matrix}

(A9)

as

n \to \infty

.

ε_{t} (μ, ϕ) = X_{t} - μ - ϕ X_{t - 1}

,

{\tilde{X}}_{t - 1} = X_{t - 1} + \sum_{i = 1}^{p} ψ_{i, 0} X_{t - i - 1}

and

γ_{0} = 1 + \sum_{i = 1}^{p} ψ_{i, 0}

. Note that, with (A7), we have

\begin{matrix} \frac{1}{m} \sum_{t = p + 1}^{m} {\frac{1}{\sqrt{m}} {\tilde{X}}_{t - 1}} \overset{d}{⟶} γ_{0} \int_{0}^{1} J_{c} (s) d s, \\ \frac{1}{m} \sum_{t = p + 1}^{m} {\frac{1}{\sqrt{m}} {\tilde{X}}_{t + m - 1} ω_{t + m}^{*}} \overset{d}{⟶} γ_{0} \int_{1}^{2} sgn (J_{c} (s)) d s, as n \to \infty . \end{matrix}

Next, since

ε_{t} = Φ^{- 1} (B) e_{t}

, where

Φ (B) = 1 + \sum_{i = 1}^{p} ψ_{i, 0} B^{i}

with B being the lag operator satisfying

B^{j} e_{t} = e_{t - j}

, which is a linear process of

{e_{t}}

, we may show that

\begin{matrix} \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} {\frac{1}{\sqrt{m}} X_{t - 1} e_{t}} = O_{p} (1), and \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} {\frac{1}{\sqrt{m}} X_{t - 1} ε_{t}} = O_{p} (1), as n \to \infty, \end{matrix}

based on the martingale decomposition as those maintained in [29,30]; see the proof of Theorem 3.1 of [31] for similar discussions. Then, we have

\begin{matrix} \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} {Z_{t} (θ_{*}) - Z_{t} (θ_{0})} \\ = & - \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} (\begin{matrix} (μ - μ_{0}) γ_{0} + (ϕ - ϕ_{0}) {\tilde{X}}_{t - 1} \\ {(μ - μ_{0}) γ_{0} + (ϕ - ϕ_{0}) {\tilde{X}}_{t + m - 1}} ω_{t + m}^{*} \\ {(μ - μ_{0}) γ_{0} + (ϕ - ϕ_{0}) {\tilde{X}}_{t - 1}} ε_{t - 1} (μ, ϕ) \\ \dots \\ {(μ - μ_{0}) γ_{0} + (ϕ - ϕ_{0}) {\tilde{X}}_{t - 1}} ε_{t - p} (μ, ϕ) \end{matrix}) \\ - \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} (\begin{matrix} 0 \\ 0 \\ e_{t} ((μ - μ_{0}) + (ϕ - ϕ_{0}) X_{t - 2}) \\ \dots \\ e_{t} ((μ - μ_{0}) + (ϕ - ϕ_{0}) X_{t - p - 1}) \end{matrix}) \\ = & - (\begin{matrix} γ_{0} & γ_{0} \int_{0}^{1} J_{c} (s) d s \\ γ_{0} & γ_{0} \int_{1}^{2} sgn (J_{c} (s)) d s \\ 0 & 0 \\ ⋱ & ⋱ \\ 0 & 0 \end{matrix}) (\begin{matrix} \sqrt{m} (μ - μ_{0}) \\ m (ϕ - ϕ_{0}) \end{matrix}) + o_{p} (1) \\ : = & Γ (\begin{matrix} \sqrt{m} (μ - μ_{0}) \\ m (ϕ - ϕ_{0}) \end{matrix}) + o_{p} (1), \end{matrix}

(A10)

uniformly for

(μ, ϕ) \in B

as

n \to \infty

.

Based on (A10), it is then easy to check that the minimizer, say

(\hat{μ}, \hat{ϕ})

, of

ℓ (μ, ϕ, ψ_{0}) - ℓ (μ_{0}, ϕ_{0}, ψ_{0})

must be in

B

, and satisfies

\begin{matrix} (\begin{matrix} \sqrt{m} (\hat{μ} - μ_{0}) \\ m (\hat{ϕ} - ϕ_{0}) \end{matrix}) = - {(Γ^{⊤} Σ_{0}^{- 1} Γ)}^{- 1} Γ^{⊤} Σ_{0}^{- 1} \cdot \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{0}) + o_{p} (1), as n \to \infty . \end{matrix}

Then, it follows

\begin{matrix} ℓ (\hat{μ}, \hat{ϕ}, ψ_{0}) \\ = & {(\frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{0}))}^{⊤} (Σ_{0}^{- 1} - Σ_{0}^{- 1} Γ {(Γ^{⊤} Σ_{0}^{- 1} Γ)}^{- 1} Γ^{⊤} Σ_{0}^{- 1}) \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{0}) + o_{p} (1) . \end{matrix}

Further note that

\begin{matrix} Γ {(Γ^{⊤} Σ_{0}^{- 1} Γ)}^{- 1} Γ^{⊤} = Γ {(Γ_{1}^{⊤} Σ_{11}^{- 1} Γ_{1})}^{- 1} Γ = (\begin{matrix} Σ_{11}^{- 1} \\ 0 \end{matrix}), \end{matrix}

where

Γ_{1} = (\begin{matrix} γ_{0} & γ_{0} \int_{0}^{1} J_{c} (s) d s \\ γ_{0} & γ_{0} \int_{1}^{2} sgn (J_{c} (s)) d s \end{matrix})

. Then, we have

\begin{matrix} Σ_{0}^{- 1} - Σ_{0}^{- 1} Γ {(Γ^{⊤} Σ_{0}^{- 1} Γ)}^{- 1} Γ^{⊤} Σ_{0}^{- 1} = Σ_{0}^{- 1} - Σ_{0}^{- 1} (\begin{matrix} Σ_{11}^{- 1} \\ 0 \end{matrix}) Σ_{0}^{- 1} = (\begin{matrix} 0 \\ Σ_{22}^{- 1} \end{matrix}) . \end{matrix}

Hence,

\begin{matrix} ℓ^{p} (ψ_{0}) & = & ℓ (\hat{μ}, \hat{ϕ}, ψ_{0}) \\ = & {(\frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{0}))}^{⊤} (\begin{matrix} 0 \\ Σ_{22}^{- 1} \end{matrix}) \frac{1}{\sqrt{m}} \sum_{t = p + 1}^{m} Z_{t} (θ_{0}) + o_{p} (1) \\ = & \tilde{W} {(1)}^{⊤} Σ_{22}^{- 1} \tilde{W} (1) + o_{p} (1) \\ \overset{d}{⟶} & χ_{p}^{2}, as n \to \infty . \end{matrix}

This completes the proof of this Theorem. □

References

Chan, N.H.; Wei, C.Z. Asymptotic inference for nearly nonstationary AR (1) processes. Ann. Stat. 1987, 15, 1050–1063. [Google Scholar] [CrossRef]
Phillips, P.C. Towards a unified asymptotic theory for autoregression. Biometrika 1987, 74, 535–547. [Google Scholar] [CrossRef]
Elliott, G.; Rothenberg, T.J.; Stock, J.H. Efficient Tests for an Autoregressive Unit Root; National Bureau of Economic Research: Cambridge, MA, USA, 1992. [Google Scholar]
Mikusheva, A. Uniform inference in autoregressive models. Econometrica 2007, 75, 1411–1452. [Google Scholar] [CrossRef]
Zhang, R.; Li, C.; Peng, L. Inference for the tail index of a GARCH(1,1) model and an AR(1) model with ARCH(1) errors. Econom. Rev. 2019, 38, 151–169. [Google Scholar] [CrossRef]
Liu, X.; Yang, B.; Cai, Z.; Peng, L. A unified test for predictability of asset returns regardless of properties of predicting variables. J. Econom. 2019, 208, 141–159. [Google Scholar] [CrossRef]
Murray, C.J.; Papell, D.H. The purchasing power parity persistence paradigm. J. Int. Econ. 2002, 56, 1–19. [Google Scholar] [CrossRef] [Green Version]
Rapach, D.E.; Wohar, M.E. The persistence in international real interest rates. Int. J. Financ. Econ. 2004, 9, 339–346. [Google Scholar] [CrossRef]
Datteo, A.; Luca, F.; Busca, G. Statistical pattern recognition approach for long-time monitoring of the G.Meazza stadium by means of AR models and PCA. Eng. Struct. 2017, 153, 317–333. [Google Scholar] [CrossRef]
Said, S.E.; Dickey, D.A. Testing for unit roots in autoregressive-moving average models of unknown order. Biometrika 1984, 71, 599–607. [Google Scholar] [CrossRef]
Phillips, P.C.; Perron, P. Testing for a unit root in time series regression. Biometrika 1988, 75, 335–346. [Google Scholar] [CrossRef]
Kwiatkowski, D.; Phillips, P.C.; Schmidt, P.; Shin, Y. Testing the null hypothesis of stationarity against the alternative of a unit root: How sure are we that economic time series have a unit root? J. Econom. 1992, 54, 159–178. [Google Scholar] [CrossRef]
Hill, J.; Li, D.; Peng, L. Uniform interval estimation for an AR (1) process with AR errors. Stat. Sin. 2016, 26, 119–136. [Google Scholar] [CrossRef]
Li, C.; Li, D.; Peng, L. Uniform test for predictive regression with AR errors. J. Bus. Econ. Stat. 2017, 35, 29–39. [Google Scholar] [CrossRef]
Silvey, S.D. The Lagrangian multiplier test. Ann. Math. Stat. 1959, 30, 389–407. [Google Scholar] [CrossRef]
Box, G.E.; Pierce, D.A. Distribution of residual autocorrelations in autoregressive-integrated moving average time series models. J. Am. Stat. Assoc. 1970, 65, 1509–1526. [Google Scholar] [CrossRef]
Ljung, G.M.; Box, G.E. On a measure of lack of fit in time series models. Biometrika 1978, 65, 297–303. [Google Scholar] [CrossRef]
Owen, A.B. Empirical likelihood ratio confidence intervals for a single functional. Biometrika 1988, 75, 237–249. [Google Scholar] [CrossRef]
Owen, A. Empirical likelihood ratio confidence regions. Ann. Stat. 1990, 18, 90–120. [Google Scholar] [CrossRef]
Owen, A. Empirical likelihood for linear models. Ann. Stat. 1991, 19, 1725–1747. [Google Scholar] [CrossRef]
Qin, J.; Lawless, J. Empirical likelihood and general estimating equations. Ann. Stat. 1994, 22, 300–325. [Google Scholar] [CrossRef]
Chan, N.H.; Li, D.; Peng, L. Toward a unified interval estimation of autoregressions. Econom. Theory 2012, 28, 705–717. [Google Scholar] [CrossRef]
Liu, F.; Wang, P.F.; Chun-Yan, L.I.; Kang, X.M.; Statistics, D.O. Empirical Likelihood Based Diagnostics for Heteroscedasticity in Single-index Model. J. Chongqing Technol. Bus. Univ. (Nat. Sci. Ed.) 2018, 81, 255–281. [Google Scholar]
Zhu, F.; Cai, Z.; Peng, L. Predictive regressions for macroeconomic data. Ann. Appl. Stat. 2014, 8, 577–594. [Google Scholar] [CrossRef] [Green Version]
Liu, X.; Liu, Y.; Lu, F. Empirical likelihood-based unified confidence region for a predictive regression model. Commun. Stat. Simul. Comput. 2019, 51, 2122–2139. [Google Scholar] [CrossRef]
Hall, P.; Heyde, C.C. Martingale Limit Theory and Its Application; Academic Press: Cambridge, MA, USA, 1981. [Google Scholar]
Phillips, P.C. Time series regression with a unit root. Econom. J. Econom. Soc. 1987, 55, 277–301. [Google Scholar] [CrossRef] [Green Version]
Ma, Y.; Zhou, M.; Peng, L.; Zhang, R. Test for zero median of errors in an arma–garch model. Econom. Theory 2022, 38, 536–561. [Google Scholar] [CrossRef]
Phillips, P.C.; Solo, V. Asymptotics for linear processes. Ann. Stat. 1992, 20, 971–1001. [Google Scholar] [CrossRef]
Phillips, P.C.; Magdalinos, T. Limit theory for moderate deviations from a unit root. J. Econom. 2007, 136, 115–130. [Google Scholar] [CrossRef] [Green Version]
Guo, G.; Sun, Y.; Wang, S. Testing for moderate explosiveness. Econom. J. 2019, 22, 73–95. [Google Scholar] [CrossRef]

Figure 1. The power performance of EL method at

τ = 0.05

.

Figure 1. The power performance of EL method at

τ = 0.05

.

Figure 2. Empirical reject probabilities at

τ = 0.05

.

Figure 2. Empirical reject probabilities at

τ = 0.05

.

Figure 3. The results of the test for AR error structure between EL and AIC.

Figure 4. Time series graphs of 8 countries.

Table 1. Empirical reject probabilities at

τ = 0.05

.

Table 1. Empirical reject probabilities at

τ = 0.05

.

			EL				AIC
Scenarios	$μ$	$ϕ$	300	600	900	1200	300	600	900	1200
S1	0	0.5	0.0601	0.0518	0.0541	0.0554	0.0715	0.0693	0.0762	0.0768
		1 $- \frac{1}{n}$	0.0721	0.0659	0.0609	0.0600	0.2285	0.2308	0.2337	0.2404
		1	0.0733	0.0657	0.0596	0.0597	0.2328	0.2337	0.2357	0.2396
	0.01	0.5	0.0699	0.0575	0.0509	0.0509	0.0760	0.0719	0.0732	0.0698
		1 $- \frac{1}{n}$	0.0710	0.0611	0.0608	0.0588	0.2301	0.2323	0.2407	0.2301
		1	0.0733	0.0578	0.0612	0.0581	0.2267	0.2330	0.2300	0.2406
S2	0	0.5	0.0587	0.0464	0.0469	0.0517	1.0000	1.0000	1.0000	1.0000
		1 $- \frac{1}{n}$	0.0652	0.0620	0.0629	0.0570	0.5531	0.3458	0.2635	0.2243
		1	0.0689	0.0587	0.0630	0.0563	0.5505	0.3450	0.2574	0.2246
	0.01	0.5	0.0591	0.0454	0.0481	0.0514	0.9999	1.0000	1.0000	1.0000
		1 $- \frac{1}{n}$	0.0691	0.0546	0.0586	0.0607	0.5481	0.3394	0.2665	0.2339
		1	0.0694	0.0589	0.0587	0.0578	0.5454	0.3463	0.2661	0.2254
S3	0	0.5	0.0557	0.0498	0.0474	0.0463	0.8969	0.8466	0.8131	0.7832
		1 $- \frac{1}{n}$	0.0778	0.0613	0.0621	0.0590	0.5963	0.3284	0.2238	0.1732
		1	0.0862	0.0635	0.0604	0.0601	0.5978	0.3318	0.2252	0.1789
	0.01	0.5	0.0550	0.0496	0.0483	0.0461	0.9005	0.8445	0.8195	0.7848
		1 $- \frac{1}{n}$	0.0859	0.0692	0.0601	0.0587	0.6041	0.3332	0.2173	0.1832
		1	0.0834	0.0646	0.0578	0.0585	0.5971	0.3318	0.2201	0.1812

Table 2. The power performance of unit root test at

τ = 0.05

.

Table 2. The power performance of unit root test at

τ = 0.05

.

	Right Order				Wrong Order
$ϕ$	300	600	900	1200	300	600	900	1200
1 $- \frac{1}{n}$	0.0474	0.0455	0.0422	0.0437	0.0438	0.0453	0.0419	0.0425
1 $- \frac{3}{n}$	0.1029	0.0941	0.0923	0.0862	0.0998	0.0918	0.0897	0.0851
1 $- \frac{5}{n}$	0.1795	0.1730	0.1718	0.1678	0.1747	0.1716	0.1666	0.1664
1 $- \frac{10}{n}$	0.4748	0.4596	0.4455	0.4367	0.4491	0.4496	0.4354	0.4307
1 $- \frac{15}{n}$	0.7620	0.7343	0.7159	0.6983	0.7276	0.7168	0.7028	0.6864

Table 3. Test results of 8 countries.

Coutry	$\hat{μ}$	$\hat{ϕ}$	EL	p-Values	LB-Test	AIC
Canada	0.0340	0.9992	$A R (0)$	0.2122	$7.7250 \times 10^{- 3}$ ***	$A R (3)$
China	0.0169	0.9975	$A R (0)$	0.8572	0.1391	$A R (0)$
Norway	0.0462	0.9947	$A R (0)$	0.4315	$1.9450 \times 10^{- 6}$ ***	$A R (2)$
Singapore	0.0146	0.9893	$A R (0)$	0.8393	$0.0238$ **	$A R (3)$
Thailand	0.1453	0.9953	$A R (0)$	0.1666	$4.0220 \times 10^{- 3}$ ***	$A R (1)$
Sweden	0.0274	0.9968	$A R (0)$	0.2165	$5.6510 \times 10^{- 3}$ ***	$A R (3)$
Japan	1.9372	0.9822	$A R (0)$	0.2392	0.8896	$A R (0)$
Sri Lanka	0.1738	0.9992	$A R (2)$	0.2622	$7.2230 \times 10^{- 6}$ ***	$A R (2)$

Significance levels: *

p \leq 0.1

, **

p \leq 0.05

, ***

p \leq 0.01

.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wei, X.; Liu, X.; Fan, Y.; Tan, L.; Liu, Q. A Unified Test for the AR Error Structure of an Autoregressive Model. Axioms 2022, 11, 690. https://doi.org/10.3390/axioms11120690

AMA Style

Wei X, Liu X, Fan Y, Tan L, Liu Q. A Unified Test for the AR Error Structure of an Autoregressive Model. Axioms. 2022; 11(12):690. https://doi.org/10.3390/axioms11120690

Chicago/Turabian Style

Wei, Xinyi, Xiaohui Liu, Yawen Fan, Li Tan, and Qing Liu. 2022. "A Unified Test for the AR Error Structure of an Autoregressive Model" Axioms 11, no. 12: 690. https://doi.org/10.3390/axioms11120690

APA Style

Wei, X., Liu, X., Fan, Y., Tan, L., & Liu, Q. (2022). A Unified Test for the AR Error Structure of an Autoregressive Model. Axioms, 11(12), 690. https://doi.org/10.3390/axioms11120690

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Unified Test for the AR Error Structure of an Autoregressive Model

Abstract

1. Introduction

2. Methodologies and Asymptotic Results

3. Simulation Results

4. A Financial Real Data Application

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Proof of the Main Result

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI