1. Introduction
This paper puts forward a general statistical model in the time domain based on the concept of fractional integration [
1] ore specifically, in the proposed framework, instead of imposing that the roots are strictly on the unit circle, as is the case with unit root models and purely fractional integration approaches, we also allow them to be within the unit circle. The reason for this is that a fractional root can be any real number, and therefore may or may not lie on the unit circle, which contains only complex numbers with a magnitude (or modulus) of 1 [
2,
3]. This approach enables us to specify the time series in terms of its infinite past, with a rate of dependence between the observations much smaller than that produced by the classic I(d) representations. In particular, we consider processes of the following form:
with |α| < 1 and d > 0. For simplicity, we assume that u
t is i.i.d. Clearly, if α = 1 in (1), x
t is I(d) and exhibits long memory since d > 0 [
4]. However, if |α| < 1, the process is no longer characterized by long memory, although x
t will still be a function of its infinite past as long as d is a real value. Thus, the main contribution of this work is the specification of a novel model that combines both autoregressions with fractional differentiation in a single framework; however, instead of considering a fractional process with autoregressive disturbances (e.g., as in [
5]), we incorporate the autoregression in the fractional polynomial. In doing so, we allow for a more flexible specification of the model, incorporating stationary and nonstationary structures with different degrees of dependence across time.
The remainder of the paper is organized as follows:
Section 2 describes the time series model;
Section 3 presents a procedure for simultaneously testing the two parameters, d and α, in (1);
Section 4 reports on various Monte Carlo experiments;
Section 5 discusses some empirical applications to show the usefulness of the proposed approach; and
Section 6 offers some concluding remarks. Additional technical details are provided in the
Appendix A and
Appendix B.
2. The Statistical Model
The starting point is the model given by (1). As already mentioned, if α = 1, the polynomial in (1) is exactly on the unit circle, and x
t is said to be integrated of order d, denoted by x
t~I(d). Note that given that d > 0, the polynomial (1 − L)
d can be expressed in terms of its Binomial expansion, such that, for all real values of d,
These processes (of the form (1 − L)
d x
t = u
t) were introduced by [
1,
2,
6], and they have been widely employed in the time series literature [
7,
8]. A useful survey of I(d) statistical models can be found in Baillie (1996) [
4], and reviews of the relevant theoretical and applied issues can be found in [
9] and, more recently, in [
10].
For the purpose of the present paper, we define long memory as a process whose spectral density function, f(λ), tends to infinity as λ approximates 0, i.e.,
In the case of α = 1, the spectral density function of x
t is:
where σ
2 = Var(u
t), implying that at λ = 0, the spectral density function is unbounded for d > 0. However, if |α| < 1, then
which is bounded for all α ≠ 1, and thus, in this case (|α| < 1), the process is stationary for all real values of d. If d is an integer value, x
t will be a function of a finite number of lag values of x
t; that is, x
t is an AR(d) process in this case. However, if d is a real number, one can still use the Binomial expansion, such that
and x
t in (1) will be a function of its infinite past.
Note that, using (5), (1) can be written as:
with
which is an AR(∞) process. Similarly, (1) can also be expressed as
and again using the standard Binomial expansion:
where Γ(x) stands for the Gamma function. It follows that
Thus, xt also admits a MA(∞) representation.
Figure 1 displays the region of (α, d)-values for the interval (0, 2). It can be seen that the process considered is stationary for all values of α < 1. Also, if α = 1 and d < 0.5, x
t is stationary, while it is not for the remaining values of these parameters. Note that if α > 1, the process is explosive. Concerning the stationary region, as d moves from 0 towards 1, and α from 0 to higher values, the dependence between the observations becomes higher. In fact, one of the advantages of this approach is that it allows for a higher degree of flexibility in the dynamic specification of the model, avoiding the rigidities caused by the exponential decay of the autoregressive part and the hyperbolically slower one produced by the fractional differencing part.
Figure 2 displays plots of simple realizations of processes of the form given by (1) for different values of α and d and T = 100. The plots on the left-hand-side of the figure correspond to the case of long memory (α = 1), while those on the right-hand-side correspond to short memory, with α = 0.80. In all cases we take d = 0.35, 0.75 and 1. In particular, the upper-left plot corresponds to the case with d = 1 and α = 0.35, namely to an I(0.35) process, which exhibits long memory despite being stationary; by contrast, in the two plots below (I(0.75) and I(1)), the processes are clearly nonstationary. Finally, stationarity is found in all cases when α = 0.80.
Figure 3 displays the first 50 sample autocorrelation values of the realizations in
Figure 2. It can be seen that in the long memory cases these decay at a hyperbolic rate, and in the short memory case the rate of decay slows down as d increases.
Figure 4 displays the periodograms, with higher peaks corresponding to lower frequencies in the long memory cases.
Figure 5 displays the impulse response functions obtained from the coefficients of the infinite MA representation in (7). As expected, mean reversion occurs in all cases except when α = d = 1 (the random walk model). These values correspond to the coefficients b
j in the MA(∞) representation of x
t in (7). This figure shows that when increasing α and d, the impulse responses have a longer time horizon before disappearing completely.
3. The Testing Procedure
In this section we develop a score test of the null hypothesis:
in the model given by (1) for any real values d
o > 0 and |α
o| < 1, under the assumption that u
t in (1) is a sequence of zero-mean uncorrelated random variables with unknown variance σ
2.
Specifically, the score statistic for H
o in (8) is constructed as follows. Consider a column vector of parameters η = (α, d, σ
2)’ and let L(η) be an objective function, such as the negative of the log-likelihood, which has a minimum under H
o at
Following [
11], a score statistic is
where the expectation is taken under H
o (8) prior to substitution of
In standard problems such as ours, the null limit distribution of the test will be unaffected if the inverted matrix is replaced by alternatives such as a sample average or the Hessian. However, testing the null of long memory, i.e., α = 1, leads to a non-standard null limit distribution.
With Gaussian u
t, the negative of the log-likelihood with η = (α, d, σ
2)’ is
where ρ(L; α, d) = (1 − αL)
d. In
Appendix A, it is shown that (9) takes the form
where T is the sample size, and
where
and
and
are obtained by expanding
and
Asymptotically, the above expressions can be approximated by
and their limits for different values of α are given in
Table 1. Routine calculations lead to the frequency domain version of the test, which is given by
where
where
is the periodogram of
,
and
, i,j = 1,2 can be approximated by
where
and
with λ
k = 2πk/T.
Theorem 1. Let us consider the model given by (1), where ut is a stationary random variable satisfyingandwhere 0 < σ2 < ∞, and Bt is the σ-field of events generated by us, s ≤ t. Also, the functions ψ in (16) and (17) must satisfy some technical restrictions to justify approximating integrals by sums. These restrictions correspond to those in Class H in [
3], Robinson (1994) (p. 1433). Then, under H
o defined as in (8),
Note that (18) and (19) impose a martingale difference assumption on the white noise u
t which is substantially weaker than the Gaussianity assumed for developing the test statistic, since it requires only a second moment condition, which is clearly a minimal requirement. Finite sample critical values based on Monte Carlo simulation results are reported in
Table 2.
Note that the model and the testing procedure described above can be extended to include deterministic terms, with x
t in (1) denoting the errors in the following multiple regression model (instead of being observable):
where y
t and the (kx1) vector z
t are observable and β is a (kx1) unknown vector. We assume that the elements of z
t are non-stochastic, such as polynomials in t, to include, for instance, the case of a linear time trend, if z
t = (1, t)’. Note that under H
o (6), the model becomes
where
Then, using the OLS estimate of β,
the residuals are
and the functional form of the test statistic takes the same form as before, replacing the values of
with those given by (21).
4. A Monte Carlo Simulation Study
This section examines the finite sample behavior of the test described in
Section 3 by means of Monte Carlo simulations. The computations were carried out using Fortran, and the codes of the programs are available from the author upon request.
Table 3,
Table 4 and
Table 5 report the rejection frequencies of the test statistic
in (8), testing Ho (6) for α
o and d
o values equal to 0.25, 0.50, 0.75 and 1.00, in a model given by (1) with (α, d) = (0.25, 0.50), (0.50, 0.50), (0.75, 0.50) and (1.00, 0.50). The nominal size is 5% in all cases, and T = 100 (
Table 3), 500 (
Table 4) and 1000 (
Table 5). Starting with T = 100, we see that the size is slightly above its nominal value (0.0524) and the rejection frequencies are very low in some cases; for example, if α = 0.25 and d = 0.50, the rejection probability at (α
o, d
o) = (0.50, 0.25) is smaller than 10%, implying that the procedure has some problems in identifying the true parameters. When increasing the sample size (in
Table 4 and
Table 5), the size of the test converges to its nominal value (0.0489 with T = 500 and 0.0495 with T = 1000), and the rejection frequencies are now higher in all cases.
Based on the efficiency property of the selected test, we also conduct in this section a simulation study based on local alternatives. For this purpose, we use the same DGP as in the previous cases, with T = 1000, but the alternatives are now all from −0.10 to 0.10 with 0.01 increments. The results are reported in
Table 6.
We observe that, even for small departures from the null, the rejection frequencies are higher than 0.100 in all cases, with the lowest values obtained with α = d = 0.50 and departures with α = d = 0.49 and 0.51. In all the other cases, they exceed 0.200, being higher as α approaches 1.
5. Empirical Applications
This section follows the proposed approach to examine long-range dependence in two sets of series, namely (i) water levels in the Nile River dataset, Northern Hemisphere temperatures and weight measurements of a 1 kg check standard weight performed at the National Institute of Standards and Technology (NIST, formerly, NBS) in the US; (ii) stock returns in five Latin American countries. These series have been chosen for illustrative purposes, all of them having been analyzed in previous studies using either autoregressive or fractional processes.
We first analyze the Nile River data collected by [
12] starting in 622 CE and ending in 1284 CE, specifically the yearly minimal water levels of the river, measured at the Roda Gauge near Cairo. This dataset can also be found in [
13] (Beran, 1994, pp. 237–239) (ref. [
12] collected data up until 1921, but 622–1284 is the longest period without gaps). Empirical applications using this series include, among others, [
14,
15,
16,
17].
We perform the procedure described in
Section 3, first testing H
o (6) in the model given by (1) for the following (α
o, d
o)-values: α
o = 0.01, 0.02, …, 1; d
o = 0.01, 0.02, …, 2. From all these combinations there emerges only a single case when H
o (6) cannot be rejected at the 5% significance level—this corresponds to α
o = 1 and d
o = 0.72, which is consistent with the long-range dependence observed in the series. Next, we also consider two specifications including, respectively, an intercept only and an intercept as well as a linear time trend. These two sets of results are very similar, and thus we only report in
Figure 6 those based on the model with an intercept only. It can be seen that the non-rejection values of d range between 0.35 and 0.50 and occur when α
o is in the interval [0.95, 1]. In the case of no regressors, d is higher than 0.5, which implies nonstationarity, while in the cases of an intercept and/or a linear time trend, d appears to be smaller than 0.5 and thus implies stationarity. We would argue that the specifications with deterministic terms are preferable, since the model without regressors assumes a zero mean for the series, whose values are instead all positive. Moreover, the former produce results which are consistent with those of [
13], who examined the same series using ARFIMA models and concluded that the fractional differencing parameter d was around 0.40 with a 95% confidence interval of (0.34, 0.46).
The second series examined is monthly temperatures in the Northern Hemisphere over the years 1854–1989, which were obtained from the database of the Climate Research Unit of the University of East Anglia, Norwich, England [
18]. More precisely, this series reports the temperature (degrees C) difference (anomaly) from the monthly average over the period 1950–1979, and its plot ([
13] p. 30) suggests an upward trend that may reflect global warming during the last 100 years. See also [
19,
20,
21,
22,
23,
24].
It can be seen that in the case of the model without regressors, the null is rejected for all values of α
o and d
o. When including a linear time trend there is a wide range of non-rejection values, which are displayed in
Figure 7. These are between 0.32 and 0.57 for d
o and between 0.8 and 1 for α
o. The lowest statistic corresponds to α
o = 0.97 and d
o = 0.40, which implies short memory but with an AR coefficient very close to 1. Ref. [
13] used a Whittle estimator for d in an ARFIMA(0, d, 0) model with a linear time trend and estimated the value of d to be 0.37 and the trend coefficient to be 0.00032.
The third series reports high-precision weight measurements of a 1 kg check standard weight performed at the National Institute of Standards and Technology (NIST, formerly, NBS), Gaithersburg, MD, USA (see [
25,
26,
27,
28,
29]). The measurements were taken between 24 June 1963 and 17 October 1975, using the same weight machine. The differences (in micrograms) from 1 kg were recorded. One difficulty with this dataset is that the dates at which measurements were taken are not equidistant. This makes the analysis of short-term correlation problematic. However, the overall long-range dependence structure is likely to be largely unaffected. In the case of this series, when including no regressors, all non-rejection values occur for α
o = 1 (i.e., long memory), with values of d
o ranging between 0.93 and 1.13. However, if an intercept is also included in the model, the region of non-rejection values is wider (see
Figure 8) with α
o equal to or slightly smaller than 1 and d
o ranging between 0.02 and 0.23. The lowest statistic occurs at α
o = 0.98 and d
o = 0.11, and setting α
o = 1 and d
o = 0.13. By applying standard semi-parametric long memory methods, ref. [
13] found in this case estimates of d ranging between 0.02 and 0.28, and when using other, more robust techniques (p. 140), an estimate of about 0.10.
Finally, we analyzed daily data on stock market returns in Venezuela, Argentina, Brazil, Mexico and Colombia, which were obtained from the International Monetary Fund database. The starting date is 4 January 1988 for Argentina and Mexico, 2 January 1990 for Venezuela, 2 January 1992 for Colombia, and 4 July 1994 for Brazil. The end date is 22 January 2002 in all cases. Stock market returns have been examined with long memory and fractional integration techniques in numerous papers, including [
30,
31,
32], among many others.
It can be seen in
Table 7 that for Mexico, α is exactly equal to 1, which implies long memory with an order of integration equal to 0.13. Short memory is found in all other cases. In particular, for Colombia, α = 0.87 and d = 0.33; for Argentina, α = 0.57 and d = 0.21; and for Venezuela and Brazil, d = 1.92 but α is then very close to 0 (0.09 for Venezuela and 0.06 for Colombia). We checked the validity of the above models by testing for autocorrelation on the residuals of the selected models. The results achieved using Box–Pierce-type statistics supported the null of no autocorrelation in all cases examined.
Table 8 reports the impulse response for each series; these are significant in all cases in the first period, up to the third period in Mexico, and up to the fifth in Colombia.