Next Article in Journal
Comparative Analysis of Numerical Methods for Solving 3D Continuation Problem for Wave Equation
Previous Article in Journal
Strong Gelfand Pairs of the Symplectic Group Sp4(q) Where q Is Even
Previous Article in Special Issue
A Long-Memory Model for Multiple Cycles with an Application to the US Stock Market
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Fractional Integration Model and Testing Procedure with Roots Within the Unit Circle

by
Guglielmo Maria Caporale
1,* and
Luis Alberiko Gil-Alana
2,3
1
Department of Economics, Finance and Accounting, Brunel University of London, London UB8 3PH, UK
2
Faculty of Economics, University of Navarra, 31009 Pamplona, Spain
3
Department of Economics, Universidad Francisco de Vitoria, 28223 Madrid, Spain
*
Author to whom correspondence should be addressed.
Mathematics 2025, 13(18), 2978; https://doi.org/10.3390/math13182978
Submission received: 30 June 2025 / Revised: 4 September 2025 / Accepted: 4 September 2025 / Published: 15 September 2025

Abstract

In this paper we propose a statistical model that combines both autoregressions and fractional differentiation in a unified treatment. However, instead of imposing that the roots are strictly on the unit circle, we also allow them to be within the unit circle. This permits a higher degree of flexibility in the specification of the model, with rates of dependence combining exponential with hyperbolic decays. Monte Carlo experiments and empirical applications to climatological and financial data show that the proposed approach performs well.

1. Introduction

This paper puts forward a general statistical model in the time domain based on the concept of fractional integration [1] ore specifically, in the proposed framework, instead of imposing that the roots are strictly on the unit circle, as is the case with unit root models and purely fractional integration approaches, we also allow them to be within the unit circle. The reason for this is that a fractional root can be any real number, and therefore may or may not lie on the unit circle, which contains only complex numbers with a magnitude (or modulus) of 1 [2,3]. This approach enables us to specify the time series in terms of its infinite past, with a rate of dependence between the observations much smaller than that produced by the classic I(d) representations. In particular, we consider processes of the following form:
( 1     α L ) d x t   =   u t ,    t   =   1 ,   2 ,  
with |α| < 1 and d > 0. For simplicity, we assume that ut is i.i.d. Clearly, if α = 1 in (1), xt is I(d) and exhibits long memory since d > 0 [4]. However, if |α| < 1, the process is no longer characterized by long memory, although xt will still be a function of its infinite past as long as d is a real value. Thus, the main contribution of this work is the specification of a novel model that combines both autoregressions with fractional differentiation in a single framework; however, instead of considering a fractional process with autoregressive disturbances (e.g., as in [5]), we incorporate the autoregression in the fractional polynomial. In doing so, we allow for a more flexible specification of the model, incorporating stationary and nonstationary structures with different degrees of dependence across time.
The remainder of the paper is organized as follows: Section 2 describes the time series model; Section 3 presents a procedure for simultaneously testing the two parameters, d and α, in (1); Section 4 reports on various Monte Carlo experiments; Section 5 discusses some empirical applications to show the usefulness of the proposed approach; and Section 6 offers some concluding remarks. Additional technical details are provided in the Appendix A and Appendix B.

2. The Statistical Model

The starting point is the model given by (1). As already mentioned, if α = 1, the polynomial in (1) is exactly on the unit circle, and xt is said to be integrated of order d, denoted by xt~I(d). Note that given that d > 0, the polynomial (1 − L)d can be expressed in terms of its Binomial expansion, such that, for all real values of d,
( 1     L ) d   =   j = 0 d j ( 1 ) j L j   =   1     d L   +   d ( d     1 ) 2 ! L 2     d ( d     1 ) ( d     2 ) 3 ! L 3   +   .
These processes (of the form (1 − L)d xt = ut) were introduced by [1,2,6], and they have been widely employed in the time series literature [7,8]. A useful survey of I(d) statistical models can be found in Baillie (1996) [4], and reviews of the relevant theoretical and applied issues can be found in [9] and, more recently, in [10].
For the purpose of the present paper, we define long memory as a process whose spectral density function, f(λ), tends to infinity as λ approximates 0, i.e.,
lim λ 0 + f ( λ )   =   .
In the case of α = 1, the spectral density function of xt is:
f ( λ )   =   σ 2 2 π 1     e i λ 2 d   =   σ 2 2 π 2     2 cos λ d   =   σ 2 2 π 2 sin 2 λ 2 d ,
where σ2 = Var(ut), implying that at λ = 0, the spectral density function is unbounded for d > 0. However, if |α| < 1, then
f ( λ )   =   σ 2 2 π 1     α e i λ 2 d   =   σ 2 2 π 1   +   α 2   2 α cos λ d ,
which is bounded for all α ≠ 1, and thus, in this case (|α| < 1), the process is stationary for all real values of d. If d is an integer value, xt will be a function of a finite number of lag values of xt; that is, xt is an AR(d) process in this case. However, if d is a real number, one can still use the Binomial expansion, such that
( 1 α L ) d = j = 0 d j ( 1 ) j α L j   =   1     d   α L   +   d ( d     1 ) 2 ! ( α L ) 2 d ( d     1 ) ( d 2 ) 3 ! ( α L ) 3   +   ,
and xt in (1) will be a function of its infinite past.
Note that, using (5), (1) can be written as:
x t   =   j = 1 a j x t j   +   ε t ,    t   =   1 ,   2 ,   ,
with
a j   =   ( 1 ) j + 1 d ( d     1 ) ( d     j   +   1 ) ! j ! α j ,    j     1 ,
which is an AR(∞) process. Similarly, (1) can also be expressed as
x t   =   ( 1     α L ) d ε t   =   j = 0 b j ε t j ,    t   =   1 ,   2 ,   ,
and again using the standard Binomial expansion:
( 1     α z ) d   =   1   +   j = 1 Γ ( j   +   d )   ( α z ) j Γ ( d ) Γ ( j   +   1 ) ,
where Γ(x) stands for the Gamma function. It follows that
b j   =   Γ ( j   +   d ) Γ ( d ) Γ ( j   +   1 ) α j , j     1 .
Thus, xt also admits a MA(∞) representation.
Figure 1 displays the region of (α, d)-values for the interval (0, 2). It can be seen that the process considered is stationary for all values of α < 1. Also, if α = 1 and d < 0.5, xt is stationary, while it is not for the remaining values of these parameters. Note that if α > 1, the process is explosive. Concerning the stationary region, as d moves from 0 towards 1, and α from 0 to higher values, the dependence between the observations becomes higher. In fact, one of the advantages of this approach is that it allows for a higher degree of flexibility in the dynamic specification of the model, avoiding the rigidities caused by the exponential decay of the autoregressive part and the hyperbolically slower one produced by the fractional differencing part.
Figure 2 displays plots of simple realizations of processes of the form given by (1) for different values of α and d and T = 100. The plots on the left-hand-side of the figure correspond to the case of long memory (α = 1), while those on the right-hand-side correspond to short memory, with α = 0.80. In all cases we take d = 0.35, 0.75 and 1. In particular, the upper-left plot corresponds to the case with d = 1 and α = 0.35, namely to an I(0.35) process, which exhibits long memory despite being stationary; by contrast, in the two plots below (I(0.75) and I(1)), the processes are clearly nonstationary. Finally, stationarity is found in all cases when α = 0.80.
Figure 3 displays the first 50 sample autocorrelation values of the realizations in Figure 2. It can be seen that in the long memory cases these decay at a hyperbolic rate, and in the short memory case the rate of decay slows down as d increases. Figure 4 displays the periodograms, with higher peaks corresponding to lower frequencies in the long memory cases.
Figure 5 displays the impulse response functions obtained from the coefficients of the infinite MA representation in (7). As expected, mean reversion occurs in all cases except when α = d = 1 (the random walk model). These values correspond to the coefficients bj in the MA(∞) representation of xt in (7). This figure shows that when increasing α and d, the impulse responses have a longer time horizon before disappearing completely.

3. The Testing Procedure

In this section we develop a score test of the null hypothesis:
H o :   ( α , d )   =   ( α o , d o ) ,
in the model given by (1) for any real values do > 0 and |αo| < 1, under the assumption that ut in (1) is a sequence of zero-mean uncorrelated random variables with unknown variance σ2.
Specifically, the score statistic for Ho in (8) is constructed as follows. Consider a column vector of parameters η = (α, d, σ2)’ and let L(η) be an objective function, such as the negative of the log-likelihood, which has a minimum under Ho at η ^   =   ( α o , d o , σ ^ 2 ) . Following [11], a score statistic is
L ( η ) η E o L ( η ) η L ( η ) η 1 L ( η ) η α = α o d = d o σ 2 = σ ^ 2 ,
where the expectation is taken under Ho (8) prior to substitution of σ ^ 2 . In standard problems such as ours, the null limit distribution of the test will be unaffected if the inverted matrix is replaced by alternatives such as a sample average or the Hessian. However, testing the null of long memory, i.e., α = 1, leads to a non-standard null limit distribution.
With Gaussian ut, the negative of the log-likelihood with η = (α, d, σ2)’ is
L ( η ) = T 2 log 2 π σ 2 + 1 2 σ 2 t = 1 T [ ρ ( L ; α , d ) x t ] 2 ,
where ρ(L; α, d) = (1 − αL)d. In Appendix A, it is shown that (9) takes the form
S ^   =   T σ ^ 4 a ^ 1 a ^ 2 a ^ 11 a ^ 12 a ^ 21 a ^ 22 1 a ^ 1 a ^ 2 ,
where T is the sample size, and
σ ^ 2   =   1 T t = 1 T u ^ t 2 ;    u ^ t   =   ( 1     α o L ) d o x t ;
a ^ 1   =   j = 1 T 1 ψ j ( α ) C u ^ ( j ) ;   a ^ 2   =   j = 1 T 1 ψ j ( d ) C u ^ ( j ) ;
where C u ^ ( j )   =   1 T     j t = 1 T j u ^ t u ^ t + j , and ψ j ( α ) and ψ j ( d ) are obtained by expanding
log ρ ( L ; α o , d o ) α   =   d o L ( 1     α o L )   =   j = 1 ψ j ( α ) L j ,
and
log ρ ( L ; α o , d o ) d   =   log ( 1     α o L )   =   j = 1 ψ j ( d ) L j ,
Then, clearly
ψ j ( α )   =   d o α o j 1 and   ψ j ( d )   =   1 j α o j
Finally,
a ^ 11   =   j = 1 T 1 1     j T ψ j ( α ) 2 ;   a ^ 12   =   j = 1 T 1 1     j T ψ j ( α ) ψ j ( d ) ;
a ^ 21   =   j = 1 T 1 1     j T ψ j ( d ) ψ j ( α ) ;   a ^ 22   =   j = 1 T 1 1     j T ψ j ( d ) 2 .
Asymptotically, the above expressions can be approximated by
a ˜ 11   =   j = 1 1 j α j 2   =   j = 1 1 j 2 α 2 j
a ˜ 12   =   a ˜ 21   =   j = 1 1 j α j d α j 1   =   d j = 1 1 j α 2 j 1
a ˜ 22   =   j = 1 d α j 1   =   d 2 j = 1 α 2 j 2 ,
and their limits for different values of α are given in Table 1. Routine calculations lead to the frequency domain version of the test, which is given by
S ˜   =   T σ ˜ 4 a ˜ 1 a ˜ 2 a ˜ 11   a ˜ 12 a ˜ 21   a ˜ 22 1 a ˜ 1 a ˜ 2 ,
where
a ˜ i   =   2 π T k * ψ i ( λ k ) I u ^ ( λ k ) ,
where I u ^ ( λ k ) is the periodogram of u ^ t ,
I u ^ ( λ k )   =   1 2 π T t = 1 T u ^ t e i t λ k 2 ,
and a ˜ i j , i,j = 1,2 can be approximated by
a ˜ i j   =   2 T k = 1 * ψ i ( λ k ) ψ j ( λ k ) ,
where
ψ 1 ( λ k )   =   log 1   +   α 2     2 α cos λ k
and
ψ 2 ( λ k )   =   d ( α cos λ k ) 1 + α 2     2 α cos λ k ,
with λk = 2πk/T.
Theorem 1.
Let us consider the model given by (1), where ut is a stationary random variable satisfying
E u t B t 1   =   0 ,
and
E u t 2 B t 1   =   σ 2   almost   surely ,
where 0 < σ2 < ∞, and Bt is the σ-field of events generated by us, s ≤ t.
Also, the functions ψ in (16) and (17) must satisfy some technical restrictions to justify approximating integrals by sums. These restrictions correspond to those in Class H in [3], Robinson (1994) (p. 1433). Then, under Ho defined as in (8),
S ^   d   χ 2 2
Proof. 
See Appendix B. □
Note that (18) and (19) impose a martingale difference assumption on the white noise ut which is substantially weaker than the Gaussianity assumed for developing the test statistic, since it requires only a second moment condition, which is clearly a minimal requirement. Finite sample critical values based on Monte Carlo simulation results are reported in Table 2.
Note that the model and the testing procedure described above can be extended to include deterministic terms, with xt in (1) denoting the errors in the following multiple regression model (instead of being observable):
y t   =   β z t   +   x t ,    t   =   1 , 2 ,
where yt and the (kx1) vector zt are observable and β is a (kx1) unknown vector. We assume that the elements of zt are non-stochastic, such as polynomials in t, to include, for instance, the case of a linear time trend, if zt = (1, t)’. Note that under Ho (6), the model becomes
( 1     α o L ) d o y t   =   β w t   +   u t ,   t   =   1 ,   2 ,  
where w t   =   ( 1     α o L ) d o z t . Then, using the OLS estimate of β,
β ^   =   t = 1 T w t w t 1 t = 1 T w t ( 1     L ) d o y t ,
the residuals are
u ^ t   =   ( 1     L ) d o y t     β ^ w t ,   t   =   1 ,   2 ,  
and the functional form of the test statistic takes the same form as before, replacing the values of u ^ t with those given by (21).

4. A Monte Carlo Simulation Study

This section examines the finite sample behavior of the test described in Section 3 by means of Monte Carlo simulations. The computations were carried out using Fortran, and the codes of the programs are available from the author upon request.
Table 3, Table 4 and Table 5 report the rejection frequencies of the test statistic S ^ in (8), testing Ho (6) for αo and do values equal to 0.25, 0.50, 0.75 and 1.00, in a model given by (1) with (α, d) = (0.25, 0.50), (0.50, 0.50), (0.75, 0.50) and (1.00, 0.50). The nominal size is 5% in all cases, and T = 100 (Table 3), 500 (Table 4) and 1000 (Table 5). Starting with T = 100, we see that the size is slightly above its nominal value (0.0524) and the rejection frequencies are very low in some cases; for example, if α = 0.25 and d = 0.50, the rejection probability at (αo, do) = (0.50, 0.25) is smaller than 10%, implying that the procedure has some problems in identifying the true parameters. When increasing the sample size (in Table 4 and Table 5), the size of the test converges to its nominal value (0.0489 with T = 500 and 0.0495 with T = 1000), and the rejection frequencies are now higher in all cases.
Based on the efficiency property of the selected test, we also conduct in this section a simulation study based on local alternatives. For this purpose, we use the same DGP as in the previous cases, with T = 1000, but the alternatives are now all from −0.10 to 0.10 with 0.01 increments. The results are reported in Table 6.
We observe that, even for small departures from the null, the rejection frequencies are higher than 0.100 in all cases, with the lowest values obtained with α = d = 0.50 and departures with α = d = 0.49 and 0.51. In all the other cases, they exceed 0.200, being higher as α approaches 1.

5. Empirical Applications

This section follows the proposed approach to examine long-range dependence in two sets of series, namely (i) water levels in the Nile River dataset, Northern Hemisphere temperatures and weight measurements of a 1 kg check standard weight performed at the National Institute of Standards and Technology (NIST, formerly, NBS) in the US; (ii) stock returns in five Latin American countries. These series have been chosen for illustrative purposes, all of them having been analyzed in previous studies using either autoregressive or fractional processes.
We first analyze the Nile River data collected by [12] starting in 622 CE and ending in 1284 CE, specifically the yearly minimal water levels of the river, measured at the Roda Gauge near Cairo. This dataset can also be found in [13] (Beran, 1994, pp. 237–239) (ref. [12] collected data up until 1921, but 622–1284 is the longest period without gaps). Empirical applications using this series include, among others, [14,15,16,17].
We perform the procedure described in Section 3, first testing Ho (6) in the model given by (1) for the following (αo, do)-values: αo = 0.01, 0.02, …, 1; do = 0.01, 0.02, …, 2. From all these combinations there emerges only a single case when Ho (6) cannot be rejected at the 5% significance level—this corresponds to αo = 1 and do = 0.72, which is consistent with the long-range dependence observed in the series. Next, we also consider two specifications including, respectively, an intercept only and an intercept as well as a linear time trend. These two sets of results are very similar, and thus we only report in Figure 6 those based on the model with an intercept only. It can be seen that the non-rejection values of d range between 0.35 and 0.50 and occur when αo is in the interval [0.95, 1]. In the case of no regressors, d is higher than 0.5, which implies nonstationarity, while in the cases of an intercept and/or a linear time trend, d appears to be smaller than 0.5 and thus implies stationarity. We would argue that the specifications with deterministic terms are preferable, since the model without regressors assumes a zero mean for the series, whose values are instead all positive. Moreover, the former produce results which are consistent with those of [13], who examined the same series using ARFIMA models and concluded that the fractional differencing parameter d was around 0.40 with a 95% confidence interval of (0.34, 0.46).
The second series examined is monthly temperatures in the Northern Hemisphere over the years 1854–1989, which were obtained from the database of the Climate Research Unit of the University of East Anglia, Norwich, England [18]. More precisely, this series reports the temperature (degrees C) difference (anomaly) from the monthly average over the period 1950–1979, and its plot ([13] p. 30) suggests an upward trend that may reflect global warming during the last 100 years. See also [19,20,21,22,23,24].
It can be seen that in the case of the model without regressors, the null is rejected for all values of αo and do. When including a linear time trend there is a wide range of non-rejection values, which are displayed in Figure 7. These are between 0.32 and 0.57 for do and between 0.8 and 1 for αo. The lowest statistic corresponds to αo = 0.97 and do = 0.40, which implies short memory but with an AR coefficient very close to 1. Ref. [13] used a Whittle estimator for d in an ARFIMA(0, d, 0) model with a linear time trend and estimated the value of d to be 0.37 and the trend coefficient to be 0.00032.
The third series reports high-precision weight measurements of a 1 kg check standard weight performed at the National Institute of Standards and Technology (NIST, formerly, NBS), Gaithersburg, MD, USA (see [25,26,27,28,29]). The measurements were taken between 24 June 1963 and 17 October 1975, using the same weight machine. The differences (in micrograms) from 1 kg were recorded. One difficulty with this dataset is that the dates at which measurements were taken are not equidistant. This makes the analysis of short-term correlation problematic. However, the overall long-range dependence structure is likely to be largely unaffected. In the case of this series, when including no regressors, all non-rejection values occur for αo = 1 (i.e., long memory), with values of do ranging between 0.93 and 1.13. However, if an intercept is also included in the model, the region of non-rejection values is wider (see Figure 8) with αo equal to or slightly smaller than 1 and do ranging between 0.02 and 0.23. The lowest statistic occurs at αo = 0.98 and do = 0.11, and setting αo = 1 and do = 0.13. By applying standard semi-parametric long memory methods, ref. [13] found in this case estimates of d ranging between 0.02 and 0.28, and when using other, more robust techniques (p. 140), an estimate of about 0.10.
Finally, we analyzed daily data on stock market returns in Venezuela, Argentina, Brazil, Mexico and Colombia, which were obtained from the International Monetary Fund database. The starting date is 4 January 1988 for Argentina and Mexico, 2 January 1990 for Venezuela, 2 January 1992 for Colombia, and 4 July 1994 for Brazil. The end date is 22 January 2002 in all cases. Stock market returns have been examined with long memory and fractional integration techniques in numerous papers, including [30,31,32], among many others.
It can be seen in Table 7 that for Mexico, α is exactly equal to 1, which implies long memory with an order of integration equal to 0.13. Short memory is found in all other cases. In particular, for Colombia, α = 0.87 and d = 0.33; for Argentina, α = 0.57 and d = 0.21; and for Venezuela and Brazil, d = 1.92 but α is then very close to 0 (0.09 for Venezuela and 0.06 for Colombia). We checked the validity of the above models by testing for autocorrelation on the residuals of the selected models. The results achieved using Box–Pierce-type statistics supported the null of no autocorrelation in all cases examined.
Table 8 reports the impulse response for each series; these are significant in all cases in the first period, up to the third period in Mexico, and up to the fifth in Colombia.

6. Conclusions

This paper proposes a general statistical model in the time domain (with a corresponding testing procedure) which is based on the concept of fractional integration and allows the roots of the process to be not only on but also within the unit circle. This is because a fractional root can be any real number, and therefore may or may not lie on the unit circle. As a result, we introduce a model with a high degree of flexibility, incorporating time dependencies that may decay at a lower rate than the exponential ones but faster than with the hyperbolic rates associated, respectively, with the autoregressive and fractional processes. The evidence, based on both Monte Carlo simulations and some empirical applications, shows that the proposed framework performs well in finite samples and accurately captures the stochastic behavior of various series, ranging from climatological to financial ones. Future work could extend the proposed model to allow for cyclical structures, where the singularity or pole in the spectrum occurs at one or more frequencies away from zero.

Author Contributions

Conceptualization, G.M.C. and L.A.G.-A.; Methodology, G.M.C. and L.A.G.-A.; Software, L.A.G.-A.; Formal analysis, L.A.G.-A.; Data curation, L.A.G.-A.; Writing—original draft, L.A.G.-A.; Writing—review & editing, G.M.C. All authors have read and agreed to the published version of the manuscript.

Funding

Luis A. Gil-Alana gratefully acknowledges financial support from the project from ‘Ministerio de Ciencia, Innovación y Universidades’ Agencia Estatal de Investigación’ (AEI) Spain and ‘Fondo Europeo de Desarrollo Regional’ (FEDER), Grant D2023-149516NB-I00 funded by MCIN/AEI/10.13039/501100011033. He also acknowledges support from an internal project of the Universidad Francisco de Vitoria.

Data Availability Statement

The data presented in this study are openly available in National Institute of Standards and Technology (NIST), www.nist.gov.

Acknowledgments

Comments from the Editor and four anonymous reviewers are gratefully acknowledged.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Derivation of the Score Statistic S ^

The negative of the log-likelihood under (1) and Gaussianity of ut is given by
L ( α , d , σ 2 )   =   T 2 log 2 π σ 2   +   1 2 σ 2 t = 1 T ρ ( L ; α , d ) x t 2 ,
for any admissible α, d and σ2. Then,
L ( α , d , σ 2 ) α   =   1 σ 2 t = 1 T ρ ( L ; α , d ) α x t ρ ( L ; α , d ) x t ,
so that
L ( α o , d o , σ 2 ) α   =   1 σ 2 t = 1 T log ρ ( L ; α o , d o ) α u t u t ,
where u t   =   ( 1     α o L ) d o x t . Using the expansion in (11), we obtain that
L ( α o , d o , σ 2 ) α   =   1 σ 2 t = 1 T j = 1 ψ j ( α ) u t j u t   =   1 ο 2 j = 1 T 1 ψ j ( α ) t = 1 T j u t u t + j T σ 2 j = 1 T 1 ψ j ( α ) C u ( j ) ,
where C u ( j )   =   1 T j .
Similarly,
L ( α o , d o , σ 2 ) d   =   1 σ 2 t = 1 T log ρ ( L ; α o , d o ) d u t u t ,
and using (12),
L ( α o , d o , σ 2 ) d   =   1 σ 2 t = 1 T j = 1 ψ j ( d ) u t j u t   =   1 ο 2 j = 1 T 1 ψ j ( d ) t = 1 T j u t u t + j   T σ 2 j = 1 T 1 ψ j ( d ) C u ( j ) .
Finally, L ( α o , d o , σ 2 ) σ 2   =   1 2 σ 4 t = 1 T u t 2     σ 2 , and this last expression vanishes for σ 2   =   σ ^ 2 . It also follows that the expectation in (9) is
T a ^ 0 0 1 2 σ 4 ,
where a ^ is a (2 × 2) matrix formed by the elements a ^ i j in (13), which can be asymptotically approximated (for α   <   1 ) by a ˜ in (14). Then, using (9), we deduce S ^ .

Appendix B. Proof of Theorem 1

Proof. 
Note that the test statistic is based on the Lagrange Multiplier (LM) principle and thus everything is evaluated under the null hypothesis Ho (7), using αo and do in the equations and with the residuals being I(0).
Calling a ~ t = a 1 ~ ; a 2 ~ , A ~ = a 11 ~   0 0   a 22 ~ , and following conditions (18) and (19) along with Class H of functions for ψ ( λ k ) t = ψ 1 λ k ; ψ 1 λ k as defined in Robinson (1994, p. 1433) [3], the proof is a straightforward extension of that of Theorem 1 in that paper using Lemmas 1, 2 and 3 and an application of a martingale difference Central Limit Theorem as in [33]. Specifically, the proof that σ 2 ^     σ 2 follows from the fact that u t 2     σ 2 are stationary martingale differences. □

References

  1. Granger, C.W.J. Long memory relationships and the aggregation of dynamic models. J. Econom. 1980, 14, 227–238. [Google Scholar] [CrossRef]
  2. Hosking, J.R.M. Fractional differencing. Biometrika 1981, 68, 168–176. [Google Scholar] [CrossRef]
  3. Robinson, P.M. Efficient tests of nonstationary hypotheses. J. Am. Stat. Assoc. 1994, 89, 1420–1437. [Google Scholar] [CrossRef]
  4. Baillie, R.T. Long memory processes and fractional integration in econometrics. J. Econom. 1996, 73, 5–59. [Google Scholar] [CrossRef]
  5. Sowell, F. Maximum likelihood estimation of stationary univariate fractionally integrated time series models. J. Econom. 1992, 53, 165–188. [Google Scholar] [CrossRef]
  6. Granger, C.W.J.; Joyeux, R. An introduction to long memory time series and fractionally differencing. J. Time Ser. Anal. 1980, 1, 15–29. [Google Scholar] [CrossRef]
  7. Diebold, F.X.; Rudebusch, G.D. Long memory and persistence in aggregate output. J. Monet. Econ. 1989, 24, 189–209. [Google Scholar] [CrossRef]
  8. Gil-Alana, L.A.; Robinson, P.M. Testing of unit roots and other nonstationary hypotheses in macroeconomic time series. J. Econom. 1997, 80, 241–268. [Google Scholar] [CrossRef]
  9. Gil-Alana, L.A.; Hualde, J. Fractional Integration and Cointegration: An Overview and an Empirical Application. In Palgrave Handbook of Econometrics; Mills, T.C., Patterson, K., Eds.; Palgrave Macmillan: London, UK, 2009. [Google Scholar] [CrossRef]
  10. Huade, J.; Nielsen, S. Fractional integration and cointegration. Oxf. Res. Encycl. Econ. Financ. 2023. [Google Scholar] [CrossRef]
  11. Rao, C.R. Linear Statistical Inference and Its Applications; John Wiley: Hoboken, NJ, USA, 1973. [Google Scholar]
  12. Toussoun, O. Memoire sur l’historie du Nil. In Memoires a L’Institut D’Eggipte; Institut Français d’Archéologie Orientale: Cairo, Egypt, 1925; Volume 18, pp. 366–404. [Google Scholar]
  13. Beran, J. Statistics for Long Memory Processes; Clapman and Hall: New York, NY, USA, 1994. [Google Scholar]
  14. Mudelsee, M. Long memory of rivers from spatial aggregation. Water Resour. Res. 2007, 43. [Google Scholar] [CrossRef]
  15. Rickert, J. Long Memory and the Nile: Herodotus, Hurst and H. 2014. Available online: https://blog.revolutionanalytics.com/2014/09/intro-to-long-memory-herodotus-hurst-and-h.html (accessed on 12 June 2013).
  16. Abdelaziz, S.; Ahmed, A.M.M.; Eltahan, A.M.; Elhamid, A.M.I.A. Long-Term Stochastic Modeling of Monthly Streamflow in River Nile. Sustainability 2023, 15, 2170. [Google Scholar] [CrossRef]
  17. Elgendy, R.; Coulibaly, P.; Hassini, S.; El-Dakhakhni, W.; Elsaie, Y.; Tolera, M.B.; Hatiye, S.D.; Ayana, M. Assessment of monthly to daily streamflow disaggregation methods: A case study of the Nile River Basin. J. Hydrol. Reg. Stud. 2024, 56, 101969. [Google Scholar] [CrossRef]
  18. Jones, P.D.; Briffa, K.R. Global surface air temperature variations during the twentieth century: Part I Spatial, temporal and seasonal details. Holocene 1992, 2, 165–179.b. [Google Scholar] [CrossRef]
  19. Mann, M.E.; Bradley, R.S.; Hughes, M.K. Northern hemisphere temperatures during the past millennium: Inferences, uncertainties, and limitations. Geophys. Res. Lett. 1999, 26, 759–762. [Google Scholar] [CrossRef]
  20. Ljungqvist, F.C.; Krusic, P.; Brattstrom, G.; Sundqvist, H.S. Northern Hemisphere temperature patterns in the last 12 centuries. Clim. Past 2012, 8, 227–249. [Google Scholar] [CrossRef]
  21. Ogurtsov, M.G. New paleoclimatic evidence of an extraordinary rise in temperature in the Northern Hemisphere in the last 3–4 decades. Geogr. Ann. Ser. A Phys. Geogr. 2022, 104, 288–297. [Google Scholar] [CrossRef]
  22. Sevgin, F.; Ozturk, A. Variation of temperature increase rate in the Northern Hemisphere according to latitude, longitude and altitude: The Turkey example. Sci. Rep. 2024, 14, 18207. [Google Scholar] [CrossRef] [PubMed]
  23. Gil-Alana, L.A.; Carmona-Gonzalez, N. Temperature Anomalies in the Northern and Southern Hemispheres: Evidence of Persistence and Trends. Adv. Meteorol. 2024, 2024, 8900065. [Google Scholar] [CrossRef]
  24. Caporale, G.M.; Gil-Alana, L.A.; Carmona-Gonzalez, N. Some new evidence using fractional integration about trends, breaks and persistence in polar amplification. Sci. Rep. 2025, 15, 8327. [Google Scholar] [CrossRef] [PubMed]
  25. Pollak, M.; Croarkin, C.; Hagwood, C. Surveillance Schemes with Applications to Mass Calibration; NIST Technology Report 5158; National Institute of Standards and Technology: Gaithersburg, MD, USA, 1993. [Google Scholar]
  26. Graf, H.P. Long Range Correlation and Estimation of the Self Similarity Parameter. Ph.D. Thesis, ETH Zurich, Zurich, Switzerland, 1983. [Google Scholar]
  27. Graf, H.P.; Hampel, F.R.; Tacier, J. The problem of unsuspected serial correlation. In Robust and Non-linear Time Series Analysis; France, J., Härdle, W., Martin, R.D., Eds.; Lecture Notes in Statistics; Springer: New York, NY, USA, 1984; Volume 26, pp. 127–145. [Google Scholar]
  28. Kubarych, Z.J.; Abbott, P.J. The Dissemination of Mass in the United States: Results and Implications of Recent BIPM Calibration of US National Prototype Kilograms. J. Res. Natl. Inst. Stand. Technol. 2014, 119, 1–5. [Google Scholar] [CrossRef]
  29. Kubarych, Z.J.; Yaniv, S.L. The Kilogram and Measurements of Mass and Force. J. Res. Natl. Inst. Stand. Technol. 2001, 106, 25–46. [Google Scholar] [CrossRef] [PubMed]
  30. Mishra, A.K.; Mishra, S. Revisiting the Long Memory in Global Stock Market Returns: An Empirical Analysis. Glob. Bus. Rev. 2020, 26, 24–38. [Google Scholar] [CrossRef]
  31. Caporale, G.M.; Gil-Alana, L.A.; You, K. Stock Market Linkages between the Asean Countries, China and the US: A Fractional Integration/cointegration Approach. Emerg. Mark. Financ. Trade 2022, 58, 1502–1514. [Google Scholar] [CrossRef]
  32. Bhattacharya, S.; Bhattacharya, M. Long memory in stock returns. A study of emerging markets. Iran. J. Manag. Stud. 2025, 5, 67–88. [Google Scholar]
  33. Brown, B.M. Martingale Central Limit Theorems. Ann. Math. Stat. 1971, 42, 59–66. [Google Scholar] [CrossRef]
Figure 1. Stationary/nonstationary (α,d) region in Equation (1).
Figure 1. Stationary/nonstationary (α,d) region in Equation (1).
Mathematics 13 02978 g001
Figure 2. Simple realizations of models of the form (1 − αL)dxt = εt. The plots are sample realizations of the model in Equation (1) for a range of values for α and d.
Figure 2. Simple realizations of models of the form (1 − αL)dxt = εt. The plots are sample realizations of the model in Equation (1) for a range of values for α and d.
Mathematics 13 02978 g002
Figure 3. Correlograms of the realizations in Figure 2. The large-sample standard error under the null hypothesis of no autocorrelation is 1/√T.
Figure 3. Correlograms of the realizations in Figure 2. The large-sample standard error under the null hypothesis of no autocorrelation is 1/√T.
Mathematics 13 02978 g003
Figure 4. Periodograms of the realizations in Figure 2. The periodograms were computed on the basis of the discrete Fourier frequencies λj = 2πj/T.
Figure 4. Periodograms of the realizations in Figure 2. The periodograms were computed on the basis of the discrete Fourier frequencies λj = 2πj/T.
Mathematics 13 02978 g004
Figure 5. Impulse response functions of the realizations in Figure 2. The values indicate the impulse responses based on the coefficients of the MA(∞) representation of the series given by Equation (7).
Figure 5. Impulse response functions of the realizations in Figure 2. The values indicate the impulse responses based on the coefficients of the MA(∞) representation of the series given by Equation (7).
Mathematics 13 02978 g005
Figure 6.o, do)-values for which Ho (7) cannot be rejected. Nile river data. The region in blue corresponds to the values of α and d for which the null hypothesis cannot be rejected at the 95% level.
Figure 6.o, do)-values for which Ho (7) cannot be rejected. Nile river data. The region in blue corresponds to the values of α and d for which the null hypothesis cannot be rejected at the 95% level.
Mathematics 13 02978 g006
Figure 7.o, do)-values for which Ho (7) cannot be rejected. Northern Hemisphere data. The region in blue corresponds to the values of α and d for which the null hypothesis cannot be rejected at the 95% level.
Figure 7.o, do)-values for which Ho (7) cannot be rejected. Northern Hemisphere data. The region in blue corresponds to the values of α and d for which the null hypothesis cannot be rejected at the 95% level.
Mathematics 13 02978 g007
Figure 8.o, do)-values for which Ho (7) cannot be rejected; 1 kg standard weight NIST. The region in blue corresponds to the values of α and d for which the null hypothesis cannot be rejected at the 95% level.
Figure 8.o, do)-values for which Ho (7) cannot be rejected; 1 kg standard weight NIST. The region in blue corresponds to the values of α and d for which the null hypothesis cannot be rejected at the 95% level.
Mathematics 13 02978 g008
Table 1. Limiting values of the expressions reported in (14).
Table 1. Limiting values of the expressions reported in (14).
α a ˜ 11 a ˜ 12 a ˜ 13
0.100.0100250.1005031.010101
0.200.0404070.2041091.041666
0.300.0921100.3143681.098901
0.400.1669000.4358831.190476
0.500.2676520.5753641.333333
0.600.3989580.7438111.562499
0.700.5684380.9619201.960785
0.800.7900261.2770642.777777
0.901.0951031.8452565.263157
1.001.645000
Table 2. Finite sample critical values of the test statistic for different sample sizes.
Table 2. Finite sample critical values of the test statistic for different sample sizes.
αT = 100T = 300T = 500T = 1000T = 5000
0.16.0285.9915.9085.9215.862
0.25.9906.0255.9255.9235.858
0.35.9926.0425.9015.8945.846
0.46.0866.0455.9215.9515.853
0.56.1156.0455.9185.9665.860
0.66.1856.0445.9995.9355.899
0.76.3226.0585.9885.9145.893
0.86.5996.1895.9905.9505.901
0.97.1266.2815.9625.8806.020
Table 3. Rejection frequencies of the test statistic in Section 3. T = 300 observations.
Table 3. Rejection frequencies of the test statistic in Section 3. T = 300 observations.
d = 0.25 and α = 0.50d = 0.50 and α = 0.50
DαRejection Prob.dαRejection Prob.
0.250.250.19030.250.250.8856
0.250.500.05240.250.500.5508
0.250.750.11810.250.750.1775
0.251.000.63940.251.000.2395
0.500.250.06300.500.250.5859
0.500.500.43490.500.500.0524
0.500.750.98430.500.750.5000
0.501.001.00000.501.000.9891
0.750.250.14070.750.250.2354
0.750.500.97390.750.500.4349
0.750.751.00000.750.750.9995
0.751.001.00000.751.001.0000
1.000.250.45761.000.250.0914
1.000.501.00001.000.500.9739
1.000.751.00001.000.751.0000
1.001.001.00001.001.001.0000
d = 0.75 and α = 0.50d = 1.00 and α = 0.50
dαRejection Prob.dαRejection Prob.
0.250.250.99970.250.251.0000
0.250.500.98520.250.501.0000
0.250.750.85880.250.750.9993
0.251.000.34560.251.000.8121
0.500.250.98830.500.251.0000
0.500.500.55080.500.500.9852
0.500.750.10450.500.750.5325
0.501.000.61430.501.000.1891
0.750.250.90810.750.250.9998
0.750.500.05240.750.500.5508
0.750.750.90530.750.750.2813
0.751.000.99990.751.000.9723
1.000.250.64611.000.250.9896
1.000.500.43491.000.500.0524
1.000.751.00001.000.750.9960
1.001.001.00001.001.001.0000
The size of the test statistic is in bold. The nominal size is 5% and 10,000 replications were used in each case.
Table 4. Rejection frequencies of the test statistic in Section 3. T = 500 observations.
Table 4. Rejection frequencies of the test statistic in Section 3. T = 500 observations.
d = 0.25 and α = 0.50d = 0.50 and α = 0.50
DαRejection Prob.dαRejection Prob.
0.250.250.27760.250.250.9850
0.250.500.04890.250.500.7701
0.250.750.22150.250.750.2573
0.251.000.89990.251.000.2562
0.500.250.06440.500.250.8042
0.500.500.69240.500.500.0489
0.500.751.00000.500.750.8240
0.501.001.00000.501.001.0000
0.750.250.21360.750.250.3514
0.750.500.99920.750.500.6924
0.750.751.00000.750.751.0000
0.751.001.00000.751.001.0000
1.000.250.69181.000.250.1082
1.000.501.00001.000.500.9992
1.000.751.00001.000.751.0000
1.001.001.00001.001.001.0000
d = 0.75 and α = 0.50d = 1.00 and α = 0.50
dαRejection Prob.dαRejection Prob.
0.250.251.00000.250.251.0000
0.250.500.99970.250.501.0000
0.250.750.97720.250.751.0000
0.251.000.37260.251.000.9413
0.500.250.99990.500.251.0000
0.500.500.77010.500.500.9997
0.500.750.17920.500.750.7660
0.501.000.90500.501.000.2183
0.750.250.98900.750.251.0000
0.750.500.04890.750.500.7701
0.750.750.99730.750.750.5990
0.751.001.00000.751.000.9999
1.000.250.85361.000.250.9999
1.000.500.69241.000.500.0489
1.000.751.00001.000.751.0000
1.001.001.00001.001.001.0000
The size of the test statistic is in bold. The nominal size is 5% and 10,000 replications were used in each case.
Table 5. Rejection frequencies of the test statistic in Section 3. T = 1000 observations.
Table 5. Rejection frequencies of the test statistic in Section 3. T = 1000 observations.
d = 0.25 and α = 0.50d = 0.50 and α = 0.50
DαRejection Prob.dαRejection Prob.
0.250.250.49710.250.251.0000
0.250.500.04950.250.500.9716
0.250.750.52210.250.750.4638
0.251.000.99830.251.000.3605
0.500.250.07680.500.250.9811
0.500.500.95710.500.500.0495
0.500.751.00000.500.750.9950
0.501.001.00000.501.001.0000
0.750.250.40800.750.250.6061
0.750.501.00000.750.500.9571
0.750.751.00000.750.751.0000
0.751.001.00000.751.001.0000
1.000.250.95111.000.250.1658
1.000.501.00001.000.501.0000
1.000.751.00001.000.751.0000
1.001.001.00001.001.001.0000
d = 0.75 and α = 0.50d = 1.00 and α = 0.50
dαRejection Prob.dαRejection Prob.
0.250.251.00000.250.251.0000
0.250.501.00000.250.501.0000
0.250.751.00000.250.751.0000
0.251.000.46350.251.000.9992
0.500.251.00000.500.251.0000
0.500.500.97160.500.501.0000
0.500.750.40310.500.750.9690
0.501.000.99920.501.000.3959
0.750.251.00000.750.251.0000
0.750.500.04950.750.500.9716
0.750.751.00000.750.750.9571
0.751.001.00000.751.001.0000
1.000.250.99001.000.251.0000
1.000.500.95711.000.500.0495
1.000.751.00001.000.751.0000
1.001.001.00001.001.001.0000
The size of the test statistic is in bold. The nominal size is 5% and 10,000 replications were used in each case.
Table 6. Rejection frequencies of the test statistic in Section 3 under local alternatives. T = 1000 observations.
Table 6. Rejection frequencies of the test statistic in Section 3 under local alternatives. T = 1000 observations.
d = 0.25 and α = 0.50 d = 0.50 and α = 0.50
dαRejection Prob.dARejection Prob.
0.210.460.61240.460.460.6003
0.220.470.49330.470.470.5992
0.230.480.30770.480.480.3413
0.240.490.21960.490.490.1011
0.250.500.04950.500.500.0495
0.260.510.19580.510.510.1104
0.270.520.12410.520.5202568
0.290.530.23770.530.530.3578
0.300.540.45750.540.540.4501
0.310.550.66740.550.550.6908
d = 0.75 and α = 0.50d = 1.00 and α = 0.50
dαRejection Prob.dαRejection Prob.
0.710.460.78930.960.460.7093
0.720.470.64420.970.470.6732
0.730.480.60090.980.480.6007
0.740.490.48970.990.490.5114
0.750.500.04951.000.500.0495
0.780.510.39511.010.510.4545
0.770.520.56091.020.520.5672
0.780.530.60871.030.530.5906
0.790.540.69811.040.540.4501
0.800.550.78431.050.550.6655
The size of the test statistic is in bold. The nominal size is 5% and 10,000 replications were used in each case.
Table 7. Estimates of α and d for each country.
Table 7. Estimates of α and d for each country.
CountryαdTest Statistic
Argentina0.570.210.0054
Mexico1.000.131.0599
Venezuela0.091.920.1002
Brazil0.061.920.0983
Colombia0.870.330.0234
Table 8. Impulse response functions of the selected models for each country.
Table 8. Impulse response functions of the selected models for each country.
MexicoColombiaArgentinaVenezuelaBrazil
10.130000000.287100020.119699990.172800000.11520000
20.073450000.166101710.041278540.022705920.01009152
30.052149500.112234940.017332860.002670220.00079118
40.040806980.081288960.007928480.000295595.8389 × 10−5
50.033706570.061244730.003805203.1498 × 10−54.1479 × 10−6
60.028819120.047332990.001883383.2695 × 10−62.8700 × 10−7
70.025237310.037238210.000952373.3290 × 10−71.9500 × 10−8
80.022492750.029683980.000489253.3400 × 10−81.3000 × 10−9
90.020318450.023902530.000254393.3000 × 10−91.0000 × 10−10
100.018550750.019401920.000133553.0000 × 10−100
110.017083550.015851547.0655 × 10−500
120.015845000.013020853.7622 × 10−500
130.014784600.010744312.0142 × 10−500
140.013865840.008900201.0833 × 10−500
150.013061620.007397315.8495 × 10−600
160.012351400.006166173.1696 × 10−600
170.011719300.005153141.7227 × 10−600
180.011152860.004316369.3880 × 10−700
190.010642180.003622815.1290 × 10−700
200.010179240.003046262.8080 × 10−700
The impulse responses were obtained on the basis of the coefficients of the MA(∞) representation of the series given by Equation (7). Significant values (at the 95% level) appear in bold.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Caporale, G.M.; Gil-Alana, L.A. A Fractional Integration Model and Testing Procedure with Roots Within the Unit Circle. Mathematics 2025, 13, 2978. https://doi.org/10.3390/math13182978

AMA Style

Caporale GM, Gil-Alana LA. A Fractional Integration Model and Testing Procedure with Roots Within the Unit Circle. Mathematics. 2025; 13(18):2978. https://doi.org/10.3390/math13182978

Chicago/Turabian Style

Caporale, Guglielmo Maria, and Luis Alberiko Gil-Alana. 2025. "A Fractional Integration Model and Testing Procedure with Roots Within the Unit Circle" Mathematics 13, no. 18: 2978. https://doi.org/10.3390/math13182978

APA Style

Caporale, G. M., & Gil-Alana, L. A. (2025). A Fractional Integration Model and Testing Procedure with Roots Within the Unit Circle. Mathematics, 13(18), 2978. https://doi.org/10.3390/math13182978

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop