Next Article in Journal
A Bayesian Hierarchical Model for 2-by-2 Tables with Structural Zeros
Previous Article in Journal
Mixed Poisson Processes with Dropout for Consumer Studies
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Preliminary Test Estimation for Parallel 2-Sampling in Autoregressive Model

by
Syed Ejaz Ahmed
1,†,
Arsalane Chouaib Guidoum
2,*,† and
Sara Bendjeddou
3,†
1
Department of Mathematics and Statistics, Brock University, 1812 Sir Isaac Brock Way, St. Catharines, ON L2S 3A1, Canada
2
Department of Mathematics and Computer Science, Faculty of Sciences and Technology, University of Tamanghasset, Tamanghasset 11000, Algeria
3
Laboratory MSTD, Faculty of Mathematics, University of Science and Technology Houari Boumediene, P.O. Box 32, El Alia 16111, Algeria
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Stats 2024, 7(4), 1141-1158; https://doi.org/10.3390/stats7040067
Submission received: 17 September 2024 / Revised: 7 October 2024 / Accepted: 11 October 2024 / Published: 14 October 2024
(This article belongs to the Section Computational Statistics)

Abstract

:
The purpose of this paper is to discuss the problem of estimation and testing the equality of two autoregressive parameters of two first-order autoregressive processes AR ( 1 ) , where for each process, the observations are made at different time points. The primary interest is to propose the testing procedures for the homogeneity of autocorrelation parameters ρ 1 and ρ 2 . Furthermore, we are interested in estimating ρ 1 under uncertain and weak prior information about the possible equality of ρ 1 and ρ 2 , though we may not have full confidence in the tenacity of this information. A large sample test for the homogeneity of the parameters is developed. Pooled “P” (or restricted estimator) and preliminary test “PT” estimators are proposed, and their properties are investigated and compared with the unrestricted estimator “UE” of ρ 1 .

1. Introduction

The estimation of model parameters is a fundamental aspect of statistical analysis, particularly when data originate from distinct sources or experiments. Suppose that two research stations are conducting the same experiment independently, measuring the same characteristics at different time points. We have a sample of n 1 measurements corresponding to the time points t 11 < t 12 < < t 1 n 1 for station 1 and a sample of n 2 measurements corresponding to the time points t 21 < t 22 < < t 2 n 2 for station 2. It is assumed that there is an autoregression of order-1 in the population of both research stations. Let ρ 1 and ρ 2 be population autoregressive coefficients of two stations, respectively. The first-order stationary autoregressive model for research station 1 is given by
X 1 t = ρ 1 X 1 ( t 1 ) + ϵ 1 t , t = 1 , , n 1 ,
where ϵ 1 t are independently and identically distributed with mean zero and finite variance σ 1 2 . Similarly, the first-order stationary autoregressive model for research station 2 is given by
X 2 t = ρ 2 X 2 ( t 1 ) + ϵ 2 t , t = 1 , , n 2 ,
where ϵ 2 t are independently and identically distributed with mean zero and finite variance σ 2 2 . Suppose that for j = 1 , 2 , we have samples for n j measurements x j t j 1 , x j t j 2 , , x j t j n j corresponding to the time points t j 1 < t j 2 < < t j n j for station j .
The ordinary least squares estimate (OLSE) or unrestricted estimator “UE” ρ ^ j of ρ j based on n j observations x j t j 1 , x j t j 2 , , x j t j n j is
ρ ^ j = i = 1 n j 1 x j t j i x j t j ( i 1 ) i = 1 n j 1 x j t j ( i 1 ) 2 .
In many practical scenarios, the choice between the pooled estimation and separate estimation of parameters can significantly impact the efficiency and reliability of the resulting estimators. The debate over pooling versus separate estimation has been extensively studied in the literature. Ahmed [1] investigated the conditions under which pooling discrete data yields more efficient estimators, laying the groundwork for subsequent research. Building on these foundational insights, Wang et al. [2] explored effective strategies for parameter estimation and forecasting in panel regressions, while Juodis et al. [3] examined the asymptotic properties of pooled factor-augmented regressions under more realistic conditions. Recent advancements, such as those by Aldeni et al. [4], who focused on pretest and shrinkage estimators for log-normal means, and Waqas et al. [5], who developed robust methodologies for optimizing shape parameters in Birnbaum–Saunders distributions, further contribute to the growing body of research in this area.
The concept of pretest estimators employs preliminary tests to decide whether to pool data or estimate parameters separately. This approach aims to balance the efficiency gains of pooled estimators (see [3]) with the robustness of separate estimation when there is uncertainty about parameter equality. Furthermore, Ahmed [1] explored penalty, shrinkage, and pretest strategies for variable selection and estimation, providing valuable methods for improving estimation efficiency. Recent works by Piladaeng et al. [6] introduced penalized, post-pretest, and post-shrinkage strategies in non-linear growth models and Stein-rule M-estimation in sparse partially linear models, respectively. In another contribution, Ahmed et al. [7] developed efficient estimators of reliability characteristics for a family of lifetime distributions under progressive censoring. These advancements expand the scope of pretest estimation methods and their practical applications. For a comprehensive review, Danilov [8] explored the effects of pretesting in econometrics, particularly in applications to finance, highlighting the comparative advantages of pretest estimators over unrestricted estimators.
In contrast to these approaches, our study focuses on estimating ρ 1 under the hypothesis that ρ 1 = ρ 2 , emphasizing scenarios where combining data from two separate sources is crucial. Suppose we have two research stations conducting the same experiment independently, providing separate datasets. One dataset is known to be from the model of interest, while the second, acquired at a different time or location, is suspected to belong to the same model but is not definitively confirmed. The challenge is to estimate ρ 1 when prior information about the potential equality of ρ 1 and ρ 2 is uncertain. In this context, we propose a large sample test statistic to assess the homogeneity of autocorrelation parameters and introduce both pooled and preliminary estimators tailored to parallel sampling contexts. Our objective is to evaluate whether integrating information from these two datasets improves parameter estimation, given the uncertainty surrounding parameter equality. By combining information from the two sources, we aim to enhance the efficiency and accuracy of the estimators. Let the null or preliminary hypothesis be
H 0 : ρ 1 = ρ 2 = ρ 0 ( unknown ) ,
The remainder of this article is structured as follows: In Section 2, we propose a large sample test statistic for assessing the homogeneity of autocorrelation parameters and introduce the pooled and preliminary estimators of ρ 1 . Section 3 investigates the properties of these estimators, providing a detailed analysis of their asymptotic bias and mean squared error. Section 4 compares the performance of the estimators using various tools such as dominance criteria. Section 5 presents a simulation study to evaluate both the asymptotic properties and the finite-sample performance of the estimators. Section 6 demonstrates two applications of the estimation method using real-world data from weather and economic studies. Finally, Section 7 summarizes the findings and discusses future research.

2. Improved Estimation

Let m j = n j 1 . Wei [9] showed that for the stochastic regression model,
m j ρ ^ j ρ j L N 0 , ν j ,
where L means convergence in distribution and ν j = σ j 2 γ j ( 0 ) , with γ j ( h ) being the autocovariance function of order h for the j th population. Note that for a stationary AR ( 1 ) process, the autocovariance at lag h = 0 is given by γ j ( 0 ) = σ j 2 1 ρ j 2 . In light of the above asymptotic distributional result, we propose a Pooled “P” or restricted estimator of ρ 1 under the null (preliminary) hypothesis (4) as
ρ ^ P = n 1 ν ^ 1 ρ ^ 1 + n 2 ν ^ 2 ρ ^ 2 n 1 ν ^ 1 + n 2 ν ^ 2 ,
where ν ^ j = σ ^ j 2 γ ^ j ( 0 ) , with σ ^ j 2 and γ ^ j ( 0 ) as consistent estimators of σ j 2 and γ j ( 0 ) , respectively, given by
γ j ^ ( 0 ) = i = 1 n j x j t j i 2 n j and σ ^ j 2 = i = 1 m j x j t j i ρ ^ j x j t j ( i 1 ) 2 m j ,
This formulation is the most general case. If ν 1 = ν 2 , we recover the scenario where the variances are equal, and thus the weights simplify. Additionally, if n 1 = n 2 , the pooled estimator simplifies even further to a simple average. Therefore, our approach captures the most general situation, allowing for variations in both sample sizes and variances. Typically, the pooled estimator ρ ^ P performs better than the unrestricted estimator ρ ^ 1 under the null hypothesis (4). However, as the scale distance between ρ 1 and ρ 2 grows, ρ ^ P becomes considerably biased and inefficient. The performance of ρ ^ 1 remains unchanged over such departures. In an effort to improve the precision of estimators, its is reasonable to develop an estimator which is a combination of ρ ^ 1 and ρ ^ P by incorporating a preliminary test on the null hypothesis (4), following an approach which was discussed in [1]. As a result, when H 0 is rather suspicious, it is desirable to have a compromise estimator using a preliminary test on H 0 in (4) and then choose ρ ^ P or ρ ^ 1 based upon the outcome of the test.
First, we propose a large sample test statistic Z 2 for H 0 :
Z 2 = ν ^ 1 n 1 + ν ^ 2 n 2 1 ρ ^ 1 ρ ^ 2 2 ,
The distribution of Z 2 under the (4) tends to central chi-square distribution with 1 degree of freedom χ ( 1 ) 2 if n 1 + n 2 . Consequently, under (4), the critical value of Z 2 may be approximated by χ 1 , α 2 which denotes the upper 100 α % critical value of χ ( 1 ) 2 . Finally, the preliminary test estimator “PT” is defined by
ρ ^ PT = ρ ^ P 1 Z 2 < χ 1 , α 2 + ρ ^ 1 1 Z 2 χ 1 , α 2 ,
where 1 A is an indicator function of the set A. We notice that ρ ^ PT is a convex combination of ρ ^ 1 and ρ ^ P via a test statistic for testing H 0 , and this estimator may not be perfect in the whole parameter space. For some account of the parametric theory on the subject, we refer the reader to [1,10], among others. Two bibliographies in this area of research are provided by Piladaeng et al. [6], Chien-Pai Han and Ravichandran [11]. The asymptotic theory of preliminary test estimation for discrete models is discussed by Ahmed [1], among others.
We use the mean squared error (MSE) criterion to appraise the performance of the estimators under the following squared loss function:
L ρ ^ , ρ 1 = ρ ^ ρ 1 2 ,
where ρ ^ is a suitable estimator of ρ 1 . Then, the MSE of ρ ^ is given by MSE ρ ^ , ρ 1 = E ρ ^ ρ 1 2 ; further, ρ ^ is termed an inadmissible estimator of ρ 1 if there exists an alternative estimator ρ ^ such that
MSE ρ ^ MSE ρ ^ for all   ρ 1 ,
with strict inequality for some ρ 1 . If, instead of (8) holding for every n, we use the asymptotic MSE (AMSE), then we require that
lim n MSE ρ ^ lim n MSE ρ ^ for all   ρ 1 ,
with strict inequality for some ρ 1 . Then, ρ ^ is termed an asymptotically inadmissible estimator of ρ 1 .

3. Main Results

For large values of (6), the preliminary test estimator in (7) leads to the asymptotic equivalence with the usual maximum likelihood estimator. Also, when (4) does not hold and n is very large, by virtue of the consistency of the test-statistic, (6) will be large in probability, and the (7) will be asymptotically equivalent in probability, to the (3). Hence, in the asymptotic set-up, a fixed alternative will not make sense. A meaningful investigation is possible for local alternatives where ρ 1 is close to ρ 2 . Therefore, it is reasonable to restrict ourselves to local alternative K n , which contains the null hypothesis as a special case. Under an asymptotic set-up, we shall examine the performance of the estimators. We may also remark here that for parametric preliminary test estimation, unless the true parameters are close to each other, there is no substantial gain from the preliminary test estimation over classical methods. This closeness is precisely characterized in terms of local alternatives. Moreover, the asymptotic set-up enables us to obtain results which are quite comparable to the existing parametric preliminary test estimation theory. For given sample size n = n 1 + n 2 , replacing ρ 1 by ρ 1 + δ n , we obtain
K n : ρ 2 = ρ 1 + δ n ,
where δ is a fixed real number. Under the local alternatives in (10) and as n , in such a way that n 1 / n π [ 0 , 1 ] and n 2 / n ( 1 π ) [ 0 , 1 ] , the following lemmas are the consequences of the results of Wei [9].
Lemma 1.
X 1 = n ρ ^ 1 ρ 0 d X N δ , ν 1 π .
Proof. 
The detailed proof is provided in Appendix A. □
Lemma 2.
Z 1 = n ρ ^ P ρ 0 d Z N 0 , ν 2 ν 1 π ν 2 + ( 1 π ) ν 1 .
Proof. 
See Appendix B for the detailed proof. □
Lemma 3.
Y 1 = n ρ ^ 1 ρ ^ P d Y N ( 1 π ) ν 1 ( 1 π ) ν 1 + π ν 2 δ , ( 1 π ) ν 1 ( 1 π ) ν 1 + π ν 2 ν 1 π .
Proof. 
For a detailed proof, see Appendix C. □
Lemma 4.
As n , the test statistics Z 2 given in (6) has asymptotically a non-central chi-square distribution with 1 degree of freedom and a non-centrality parameter Δ = π ( 1 π ) δ 2 ( 1 π ) ν 1 + π ν 2 .
Proof. 
See Appendix D. □
Now, we present the expressions for the asymptotic biases of the estimators as follows. By direct computation and using the results from [10], we arrive at the following bias expressions of the estimators:
B 1 = asymptotic   bias   of n ρ ^ 1 ρ 1 = 0 , B 2 = asymptotic   bias   of n ρ ^ P ρ 1 = ( 1 π ) ν 1 ( 1 π ) ν 1 + π ν 2 δ , B 3 = asymptotic   bias   of n ρ ^ PT ρ 1 = B 2 G 3 χ 1 , α 2 ; Δ ,
where G m . , Δ is the cumulative distribution function of a non-central chi-square distribution with m degrees of freedom and non-centrality parameter Δ . First, note that lim δ δ G 3 χ 1 , α 2 ; Δ = 0 . Thus, we conclude that ρ ^ PT is asymptotically unbiased (in the sense of δ ), but ρ ^ P is not so. However, since the bias is a component of the asymptotic mean squared error (AMSE) and the control of the AMSE would control both bias and variance, we shall only compare the AMSE from this point onwards. Expressions for the AMSE of the estimators are presented in the following theorem, which follow from the results in [10].
Theorem 1.
Under the local alternatives (10), the AMSE of the estimators is as follows:
AMSE 1 Δ = asymptotic   MSE   of n ρ ^ 1 ρ 1 = ν 1 π ,
AMSE 2 Δ = asymptotic   MSE   of n ρ ^ P ρ 1 = ν 1 π 1 + ( 1 π ) ν 1 ( 1 π ) ν 1 + π ν 2 ( Δ 1 ) , AMSE 3 Δ = asymptotic   MSE   of n ρ ^ PT ρ 1
= ν 1 π 1 + ( 1 π ) ν 1 ( 1 π ) ν 1 + π ν 2 Δ 2 G 3 χ 1 , α 2 ; Δ G 5 χ 1 , α 2 ; Δ G 3 χ 1 , α 2 ; Δ .
Proof. 
For a detailed proof, see Appendix E. □

4. Asymptotic Properties of the Estimators

We note that the AMSE of ρ ^ 1 is a constant line, while the AMSE of ρ ^ P is a straight line in terms of Δ , which intersects the AMSE of ρ ^ 1 at Δ = 1 . Using the AMSE criterion to appraise performance, if the restriction ρ 1 = ρ 2 is correct, then the AMSE of ρ ^ P is less than the AMSE of ρ ^ 1 . Furthermore, A M S E 2 A M S E 1 if 0 Δ 1 . Hence, for Δ [ 0 , 1 ] , ρ ^ P outperforms ρ ^ 1 . However, beyond this interval, the AMSE of ρ ^ P increases without bound. In an effort to identify some important characteristic of the PT estimators, first note that
G 5 χ 1 , α 2 ; Δ G 3 χ 1 , α 2 ; Δ G 3 χ 1 , α 2 ; 0 = 1 α ,
for α 0 , 1 and Δ 0 . The left-hand side of (14) converges to 0 as Δ approaches infinity. Now, we compare the AMSE of PT with that of the UE. We notice that ρ ^ PT dominates ρ ^ 1 whenever
Δ G 3 χ 1 , α 2 ; Δ 2 G 3 χ 1 , α 2 ; Δ G 5 χ 1 , α 2 ; Δ 1 .
It is obvious from (15) that the AMSE of PT is less than the AMSE of UE when Δ is equal to or near 0. Further, as α , the level of the statistical significance, approaches one, the AMSE of PT tends to the AMSE of UE. Also, when Δ increases and tends to infinity, the AMSE of ρ ^ PT approaches the AMSE of ρ ^ 1 . Further, for larger values of Δ , the value of the AMSE of ρ ^ PT increases, reaches its maximum after crossing the AMSE of ρ ^ 1 , and then monotonically decreases and approaches the AMSE of ρ ^ 1 . There are, therefore, points in the parameter space where ρ ^ PT has a larger AMSE than ρ ^ 1 and a sufficient condition for this result to occur is that the following inequality holds:
AMSE 3 > AMSE 1 according   as Δ > G 3 χ 1 , α 2 ; Δ 2 G 3 χ 1 , α 2 ; Δ G 5 χ 1 , α 2 ; Δ 1 .
Finally, under (4), the dominance picture of the estimators is as follows:
  • None of the three estimators are inadmissible with respect to the other two.
  • However, at Δ = 0 , the estimators may be ordered according to the magnitude of their AMSE as follows: ρ ^ P ρ ^ PT ρ ^ 1 . Here, ≻ denotes domination.

5. Simulation Study

In this section, we employ simulated data to explore the relative efficiency of the estimators under consideration. The comparisons between the ρ ^ P and ρ ^ PT estimators are conducted with respect to the ρ ^ 1 unrestricted estimator “UE”, based on the simulated relative efficiency (SRE). The SRE of an estimator ρ ^ in comparison to the ρ ^ 1 is defined as
SRE ρ ^ 1 ; ρ ^ = SR ρ ^ 1 SR ρ ^ ,
here, SR ( · ) represents the simulated risk of an estimator. The SR of an estimator ρ ^ is determined by the average risk over M replications, calculated by
SR ρ ^ = 1 M l = 1 M ρ ^ l ρ 0 2 .
It is evident that the SRE of the ρ ^ 1 will always equal one, as it serves as the baseline for comparison. An SRE value greater than one indicates that the estimator ρ ^ outperforms ρ ^ 1 , suggesting a higher efficiency relative to the ρ ^ 1 . Conversely, an SRE value less than one implies that ρ ^ is less efficient than ρ ^ 1 , indicating a certain degree of inferiority in its performance. This metric allows for a clear quantification of the relative merits of different estimators. We perform Monte Carlo simulations using various combinations of sample sizes n 1 and n 2 . Both small and large sample sizes are considered to assess their impact on the performance of the estimators, where n = n 1 + n 2 . Two autoregressive models, (1) and (2), are generated with varying sizes n 1 and n 2 , and across a range of ρ 0 = ( 0.3 , 0.5 , 0.7 ) . The parameters are defined as follows: ρ 1 = ρ 2 and ρ 2 = ρ 1 + δ n . The scale parameter δ is selected to take small values, starting from 0 up to its maximum, ensuring the stationarity of our autoregressive models:
0 δ < 1 ρ 0 n .
First, we compute the unrestricted estimators ρ ^ j for j = 1 , 2 using (3). Following this, the pooled or restricted estimator ρ ^ P is determined using (5). Subsequently, we calculate the test statistic Z 2 as described in (6), with the distribution of the test statistic being evaluated under the null hypothesis (4). Finally, the pretest estimators ρ ^ PT are derived using (7). This process allows us to systematically compare the performance of the different estimators under various conditions, ensuring a thorough assessment of their relative efficiencies. The SR for each estimator is computed by repeating the simulation M = 10 , 000 times to ensure stable results. The SREs of the proposed estimators are evaluated across various combinations of n 1 , n 2 , and ρ 0 , with α fixed at 0.05 and 0.10, and δ varying within the range [ 0 , 1 ρ 0 n 1 + n 2 ] . Table 1 and Table 2 present the SRE ρ ^ 1 ; ρ ^ values for the different estimators for both smaller and larger sample sizes, respectively, while the corresponding graphs are displayed in Figure 1. The figure illustrates the results when n 1 = 25 and n 2 = 50 at the top, and n 1 = n 2 = 500 at the bottom.
From the simulation outcomes, the following observations can be made:
  • Since the estimator ρ ^ P achieves the highest SRE at δ = 0 , it outperforms all other estimators. This effectiveness further improves as the parameter of the autoregressive model increases. However, for the ρ ^ PT estimator, its efficiency declines as both the level of significance and the values of ρ 0 increase.
  • When δ > 0 , none of the three estimators are superior to the others. This result is due to the SRE of ρ ^ PT decreasing continuously with increasing δ . However, beyond a certain point, as δ continues to rise, the efficiency gradually approaches one. Before reaching this point, ρ ^ PT experiences a sharp decline in efficiency, eventually being surpassed by ρ ^ P . After this point, the SRE of the ρ ^ P estimator becomes less significant compared to the ρ ^ PT estimator.
As we proceed in this study, our focus will shift towards a deeper investigation into the statistical properties and the performance of a test statistic Z 2 given by (6). We derive the asymptotic distribution of Z 2 under large sample conditions, which is a non-central chi-square distribution χ 1 , Δ 2 . Specifically, we prove the Lemma 4. To validate this result, we perform a series of simulations: we simulate 10 , 000 times the statistic Z 2 for different values of ρ 0 = ( 0.3 , 0.5 , 0.7 ) and a range of δ values (where δ satisfies the condition (16)). For each scenario, the degree of freedom df and non-centrality parameters Δ are estimated using the maximum likelihood method (MLE). Furthermore, we will compare the empirical findings with the theoretical expectations to ensure consistency across different scenarios. The simulation results confirm the theoretical findings derived in Lemma 4. The estimated values df ^ and Δ ^ coincide with the theoretically predicted values, demonstrating the accuracy of the asymptotic distribution for the test statistic Z 2 . Additionally, we compile Table 3, which summarizes the descriptive statistics and the MLE for the asymptotic distribution of Z 2 under different values of ρ 0 and δ , assuming n 1 = n 2 = 500 . The table presents the mean, variance, median, skewness, and kurtosis of the simulated data, as well as the estimated degrees of freedom and non-centrality parameters for the corresponding χ 2 distribution. This is further complemented by Figure 2, which provides a graphical representation of the fit, illustrating the consistency between the theoretical and simulated distributions.

6. Real Data Applications

To demonstrate the usefulness of the suggested estimators from Section 2, we present two real-data applications (in weather and economy) in this section to validate their practical relevance and effectiveness.

6.1. Mean Annual Precipitation

Various precipitation metrics, including the daily totals, monthly accumulations, annual precipitation, extreme values, rainfall durations, and rain rates, have been modeled using a time series autoregressive model. As our initial application, we analyze two time series representing mean annual precipitation from two stations located in Washington State, USA. The first station “HOQUIAM BOWERMAN AP” provides the mean annual precipitation records from 1970 to 2024, covering a period of 54 years ( n 1 = 55 observations), and is denoted as X 1 t . The second station “ABERDEEN” offers data spanning from 1950 to 2022, with a total of 72 years of precipitation records ( n 2 = 73 observations), and is denoted as X 2 t . These datasets are publicly available on the National Weather Service website at https://www.weather.gov/wrh/Climate?wfo=sew (accessed on 25 August 2024). A summary of the statistical characteristics of the two time series X 1 t and X 2 t , along with the estimation of ρ j and ν j for each series, is presented in Table 4, while their graphical representation is provided in Figure 3.
The scientists wish to estimate ρ 1 from station 1, and they suspect that the parameter may be the same for both populations, i.e., ρ 1 = ρ 2 . In such a case, it is advantageous to pool the two time series X 1 t and X 2 t to estimate ρ 1 . Hence, by (5),
ρ ^ P = 55 2.0338 × 0.9733 + 73 1.9005 × 0.9767 55 2.0338 + 73 1.9005 0.9753
The homogeneity of the two parameters, i.e., ρ 1 = ρ 2 may be examined by means of (6), or numerically,
Z 2 = 2.0338 55 + 1.9005 73 1 × 0.9733 0.9767 2 0.0001834
for α = 0.05 , the upper 95% point of the non-central chi-squared distribution (asymptotically) χ 1 , 0.05 2 3.8415 which is the minimum value at δ = 0 ( Δ = 0 ). The preliminary test estimator ρ ^ PT given by (7) is
ρ ^ PT = ρ ^ P × 1 Z 2 < 3.8415 TRUE + ρ ^ 1 × 1 Z 2 3.8415 FALSE
The critical value of such a test statistic is 0.0001834 with an upper tail probability value p-value = 0.9892. Thus, the result is not significant at the 5 % level, and the null hypothesis H 0 of homogeneity of the autocorrelation parameters ρ 1 = ρ 2 cannot be rejected. Hence, ρ ^ PT = ρ ^ P is selected as an estimate of ρ 1 . We conclude that a scientist who wishes to find a good alternative to the ρ ^ P and ρ ^ 1 should be able to specify the minimum relative efficiency.

6.2. Annual Inflation Rates (AIRs)

Annual inflation rates represent the average change in the prices of goods and services within an economy over a one-year period. Inflation is a key indicator of economic health, reflecting the variation in the purchasing power of a currency. It is commonly measured by the Consumer Price Index (CPI), which tracks price changes in a representative basket of goods and services consumed by households. As a macroeconomic indicator, the annual inflation rate is crucial for understanding economic trends and inflationary pressures. To analyze the dynamic behavior of inflation over time, econometric models such as autoregressive models are often employed. These models are particularly useful in capturing the persistence of inflation, as they assume that the current inflation rate is dependent on the inflation rate from the previous period, along with a random shock. These models help policymakers and economists forecast future inflation and assess the potential impacts on monetary policy, wages, prices, and the purchasing power of consumers.
In this second application, we apply our estimation strategies to the annual inflation rates of two countries: Poland ( PL ) and Algeria ( DZ ). The datasets for PL cover the period from 1971 to 2023, comprising 53 observations, while the datasets for DZ span from 1970 to 2023, with a total of 54 observations; we test the homogeneity of the autoregressive parameters by assessing whether the autocorrelation coefficients ρ 1 and ρ 2 for the two countries are statistically equal. The inflation data, which serve as the basis for this analysis, are publicly available through the World Bank’s World Development Indicators (WDI) database at https://databank.worldbank.org/ (accessed on 28 August 2024). For users of the R programming language [12], these data can also be directly accessed using the WDI package [13] available at https://CRAN.R-project.org/package=WDI (accessed on 28 August 2024). This datasets provide comprehensive time series data on various macroeconomic indicators, including the annual percentage change in the Consumer Price Index (CPI) for both countries. A summary of the statistical characteristics of the two countries, PL and DZ , along with the estimates of ρ j and ν j for each country, are provided in Table 5. A graphical representation of the inflation dynamics for both countries is shown in Figure 4, where the boxplot is presented on a logarithmic scale to enhance the visibility of the data, particularly in the presence of large data points.
First, we obtain the pooled (or restricted) estimator ρ ^ P = 0.8016 . The test statistic Z 2 = 7.3216 is computed using ρ ^ 1 and ρ ^ 2 (unrestricted estimator). Under the null hypothesis, this test statistic follows a non-central chi-square distribution with one degree of freedom (asymptotically). For α = 0.05 , the upper 95% critical value of the chi-square reference distribution is χ 1 , 0.05 2 3.8415 at δ = 0 ( Δ = 0 ) which yields p - value = 0.0069 . Thus, the result is significant at the 5% level, and the null hypothesis of homogeneity of the autocorrelation parameters is rejected. The pretest estimator ρ ^ PT is equal to the unrestricted estimator, i.e., ρ ^ PT = ρ ^ 1 . When the assumption of homogeneity of the autocorrelation parameters does not hold, the estimators based on the test statistic Z 2 converge to the unrestricted estimator. This indicates that by rejecting the hypothesis of homogeneity, we conclude that the annual inflation dynamics in PL differ from those in DZ . In other words, there are significant differences between the autocorrelation parameters of the inflation for these two countries. These findings are consistent with both the theoretical predictions and numerical results, especially for large values of δ .
To further analyze the impact of larger values of δ on the test of homogeneity of the autocorrelation parameters, we estimate non-centrality parameter Δ by the formula (see Lemma 4)
Δ ^ = π ( 1 π ) δ 2 ( 1 π ) ν ^ 1 + π ν ^ 2 ,
where π = n 1 n 1 + n 2 and 1 π = n 2 n 1 + n 2 . The results of the pretest estimator for different values of δ are presented in Table 6.

7. Conclusions and Future Research

In this article, we develop and analyze preliminary test estimation methods for the homogeneity of autocorrelation parameters in parallel two-sample autoregressive models. Our findings demonstrate that the proposed pooled and preliminary estimators offer significant improvements in efficiency compared to traditional estimators, particularly when the assumption of parameter homogeneity does not hold. The large-sample properties of these estimators, including asymptotic bias and mean squared error, have been thoroughly investigated, showing that the proposed methods perform well in both asymptotic and finite-sample scenarios. Building on these theoretical results, the simulation study further underscores the robustness of the preliminary test estimator across various sample sizes, validating the theoretical properties derived earlier. Notably, the results reveal that when the homogeneity assumption is strongly challenged, the pooled estimator can introduce substantial bias, whereas the preliminary test estimator effectively adjusts by incorporating the test of homogeneity. This highlights the importance of pretesting in situations where parameter equality is uncertain. Despite these promising results, the performance of these estimators can still vary depending on specific data characteristics, such as the degree of autocorrelation and sample size. For example, in real-world applications like the inflation data for Poland and Algeria, the proposed method offers valuable insights into the differences in economic dynamics between countries, demonstrating its utility beyond purely theoretical settings.
Looking ahead, several avenues for future research can be explored. One potential direction is to extend the current methodology to more complex time series models, such as higher-order autoregressive models or models with mixed effects. Additionally, investigating the impact of non-stationarity and structural breaks on the performance of the preliminary test estimators could yield further insights. Another promising area for future work is the application of these methods to high-dimensional settings, where the number of parameters exceeds the number of observations, which is increasingly common in modern data analysis. Furthermore, exploring alternative testing methods, such as permutation, bootstrap, or other non-parametric techniques, in the context of our framework could offer additional flexibility. Finally, incorporating machine learning techniques for model selection and parameter estimation, particularly in cases where the data exhibit non-linear patterns, could further enhance the efficiency of the proposed estimators. These advancements could broaden the applicability of preliminary test estimation methods to a wider range of practical problems in statistics and econometrics.

Author Contributions

Methodology, S.E.A. and S.B.; Software, A.C.G.; Validation, A.C.G.; Formal analysis, S.B.; Investigation, S.E.A. and S.B.; Data curation, A.C.G.; Writing—original draft, S.E.A.; Writing—review & editing, S.E.A. and S.B.; Visualization, A.C.G.; Supervision, S.E.A. and S.B.; Project administration, S.E.A.; Funding acquisition, S.E.A.. All authors have read and agreed to the published version of the manuscript.

Funding

Prof. Ahmed S.E research is funded by the Natural Sciences and Engineering Research Council (NSERC), Canada.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The R code can be found on GitHub at: https://github.com/acguidoum/R-code-stats-2024 (accessed on 25 August 2024), so that readers can replicate the results. The data used in this study are publicly available from multiple sources. The mean annual precipitation data for the “HOQUIAM BOWERMAN AP” and “ABERDEEN” stations were obtained from the National Weather Service and can be accessed at https://www.weather.gov/wrh/Climate?wfo=sew (accessed on 28 August 2024). The inflation data for Poland and Algeria were sourced from the World Bank’s World Development Indicators (WDI) database, which can be accessed through the World Bank’s Databank at https://databank.worldbank.org/ (accessed on 28 August 2024).

Acknowledgments

The authors are thankful to the editor and the anonymous referees for their thorough review and highly appreciate the comments and suggestions which significantly contributed to increasing the quality of our paper.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Proof of Lemma 1

From the asymptotic theory of estimation, it is known that A 1 = n 1 ( ρ ^ 1 ρ 1 ) L N ( 0 , ν 1 ) . Let X 1 = n ( ρ ^ 1 ρ 0 ) = n ( ρ ^ 1 ρ 1 + ρ 1 ρ 0 ) = n ( ρ ^ 1 ρ 1 ) + n ( ρ 1 ρ 0 ) = A 1 π + δ which is a linear function of A 1 , so E ( X 1 ) = δ and V ( X 1 ) = ν 1 π .

Appendix B. Proof of Lemma 2

As ρ ^ P = B 0 ρ ^ 1 , where B 0 ρ 0 = ρ 0 and ( 1 B 0 ) = ( 1 π ) ν 1 ( 1 π ) ν 1 + π ν 2 , we have
Z 1 = n ρ ^ P ρ 0 = n B 0 ρ ^ 1 B 0 ρ 0 = B 0 n ρ ^ 1 ρ 0 = B 0 X 1 ,
under the assumption π ν 1 δ 0 , we get E Z 1 = B 0 δ = 0 and V ( Z 1 ) = B 0 ν 1 π .

Appendix C. Proof of Lemma 3

Let
Y 1 = n ρ ^ 1 ρ ^ P = n ρ ^ 1 ρ 0 + ρ 0 ρ ^ P = n ρ ^ 1 ρ 0 n ρ ^ P ρ 0 = X 1 B 0 X 1 = ( 1 B 0 ) X 1
we obtain E Y 1 = 1 B 0 δ = ( 1 π ) ν 1 δ ( 1 π ) ν 1 + π ν 2 and V Y 1 = 1 B 0 ν 1 π = ( 1 π ) ν 1 2 π ( 1 π ) ν 1 + π ν 2 .

Appendix D. Proof of Lemma 4

It is known that
ρ ^ j N ρ j , ν j n j ,
which implies that
ρ ^ 1 ρ ^ 2 N ρ 1 ρ 2 , ν 1 n 1 + ν 2 n 2 .
Under the null hypothesis H 0 (4), we have
ρ ^ 1 ρ ^ 2 N 0 , ν 1 n 1 + ν 2 n 2 ,
Thus, the test statistic Z 2 is given by (6), asymptotically, and the chi-square distribution with one degree of freedom becomes non-central, where the non-centrality parameter Δ is given by
Δ = E ρ ^ 1 ρ ^ 2 2 V ρ ^ 1 ρ ^ 2 ,
where
  • E ρ ^ 1 ρ ^ 2 = ρ 1 ρ 2 = δ n
  • V ρ ^ 1 ρ ^ 2 = ν 1 n 1 + ν 2 n 2 = 1 n ν 1 π + ν 2 ( 1 π )
Hence, the non-centrality parameter Δ is
Δ = π ( 1 π ) ( 1 π ) ν 1 + π ν 2 δ 2 .

Appendix E. Proof of Theorem 1

Lemma A1
(see [10]). Let y be a k-dimensional random vector that follows multivariate normal distribution with mean vector μ y and covariance matrix C k . Then, for any measurable function ϕ, we have
E y ϕ y y = μ y E ϕ χ k + 2 2 Δ
E y y ϕ y y = C k E ϕ χ k + 2 2 Δ + μ y μ y E ϕ χ k + 4 2 Δ
where Δ = 1 2 μ y μ y .
For (11), we have
AMSE 1 Δ = lim n E n ρ ^ 1 ρ 1 2 = lim n E n ρ ^ 1 ρ 1 2 = lim n E n ρ ^ 1 ρ 0 δ n 2 = lim n E n ρ ^ 1 ρ 0 δ 2 = E X 1 δ 2 = V X 1 = ν 1 π .
and for (12), we can write
AMSE 2 Δ = lim n E n ρ ^ P ρ 1 2 = lim n E n ρ ^ P ρ 0 + ρ 0 ρ 1 2 = lim n E n ρ ^ P ρ 0 n ρ 1 ρ 0 2 = lim n E n ρ ^ P ρ ^ 1 + ρ ^ 1 ρ 0 δ 2 = lim n E n ρ ^ P ρ ^ 1 + n ρ ^ 1 ρ 0 δ 2 = E X 1 Y 1 δ 2 = E X 1 2 + E Y 1 2 2 E X 1 Y 1 + δ 2 2 δ E X 1 + 2 δ E Y 1 = V X 1 2 E X 1 δ Y 1 A + E Y 1 2 ,
where
Stats 07 00067 i001
and therefore,
AMSE 2 Δ = V X 1 2 V Y 1 + V Y 1 + E Y 1 2 E Y 1 2 = ν 1 π ( 1 π ) ν 1 ( 1 π ) ν 1 + π ν 2 ν 1 π + ( 1 π ) ν 1 ( 1 π ) ν 1 + π ν 2 δ 2 = ν 1 π 1 + ( 1 π ) ν 1 ( 1 π ) ν 1 + π ν 2 ( Δ 1 ) ,
with Δ = π ( 1 π ) δ 2 ( 1 π ) ν 1 + π ν 2 . Finally, for (13), we have
AMSE 3 Δ = lim n E n ρ ^ PT ρ 1 2 = lim n E n ρ ^ 1 ρ ^ 1 ρ ^ PT 1 Z 2 < χ 1 , α 2 ρ 0 δ n 2 = E X 1 δ Y 1 1 Z 2 < χ 1 , α 2 2 = V X 1 2 E X 1 δ Y 1 1 Z 2 < χ 1 , α 2 B + E Y 1 2 1 Z 2 < χ 1 , α 2 C ,
by using (A1) and (A2), we have
B = E E X 1 δ | Y 1 Y 1 1 Z 2 < χ 1 , α 2 = E E X 1 δ + C ov X 1 , Y 1 V Y 1 1 Y 1 E Y 1 Y 1 1 Z 2 < χ 1 , α 2 = E Y 1 E Y 1 Y 1 1 Z 2 < χ 1 , α 2 = E Y 1 2 1 Z 2 < χ 1 , α 2 E Y 1 E Y 1 1 Z 2 < χ 1 , α 2 = V Y 1 G 3 χ 1 , α 2 ; Δ + E Y 1 2 G 5 χ 1 , α 2 ; Δ E Y 1 2 G 3 χ 1 , α 2 ; Δ
and
C = V ( Y 1 ) E ϕ χ k 1 + 2 2 ( Δ ) + E Y 1 2 E ϕ χ k 1 + 4 2 ( Δ ) = V ( Y 1 ) P χ k + 1 2 ( Δ ) < χ k 1 , α 2 + E Y 1 2 P χ k + 3 2 ( Δ ) < χ k 1 , α 2 = V ( Y 1 ) G 3 χ 1 , α 2 ; Δ + E ( Y 1 2 ) G 5 χ 1 , α 2 ; Δ
substituting B and C in (A3), we obtain
AMSE 3 ( Δ ) = V X 1 V ( Y 1 ) G 3 χ 1 , α 2 ; Δ E Y 1 2 G 5 χ 1 , α 2 ; Δ + 2 E Y 1 2 G 3 χ 1 , α 2 ; Δ = V X 1 V ( Y 1 ) G 3 χ 1 , α 2 ; Δ + E Y 1 2 2 G 3 χ 1 , α 2 ; Δ G 5 χ 1 , α 2 ; Δ = ν 1 π 1 B 0 ν 1 π G 3 χ 1 , α 2 ; Δ + 1 B 0 2 δ 2 2 G 3 χ 1 , α 2 ; Δ G 5 χ 1 , α 2 ; Δ = ν 1 π 1 1 B 0 G 3 χ 1 , α 2 ; Δ + 1 B 0 Δ 2 G 3 χ 1 , α 2 ; Δ G 5 χ 1 , α 2 ; Δ = ν 1 π 1 + 1 B 0 Δ 2 G 3 χ 1 , α 2 ; Δ G 5 χ 1 , α 2 ; Δ G 3 χ 1 , α 2 ; Δ

References

  1. Ahmed, S.E. Penalty, Shrinkage and Pretest Strategies: Variable Selection and Estimation, 1st ed.; SpringerBriefs in Statistics; Springer: Cham, Switzerland, 2014. [Google Scholar] [CrossRef]
  2. Wang, W.; Zhang, X.; Paap, R. To pool or not to pool: What is a good strategy for parameter estimation and forecasting in panel regressions. J. Appl. Econ. 2019, 34, 579–596. [Google Scholar] [CrossRef]
  3. Juodis, A.; Karabiyik, H.; Westerlund, J. On the robustness of the pooled CCE estimator. J. Econ. 2021, 220, 325–348. [Google Scholar] [CrossRef]
  4. Aldeni, M.; Wagaman, J.; Amezziane, M.; Ahmed, S.E. Pretest and shrinkage estimators for log-normal means. Comput. Stat. 2023, 38, 1555–1578. [Google Scholar] [CrossRef]
  5. Waqas, M.; Muhammad, K.A.S.; Nighat, Z.; Ahmed, S.E. Optimizing robust shape parameter: Improved methodologies for Birnbaum–Saunders distribution. J. Stat. Theory Pract. 2024, 18, 1559–8616. [Google Scholar] [CrossRef]
  6. Piladaeng, J.; Ahmed, S.E.; Lisawadi, S. Penalised, post-pretest, and post-shrinkage strategies in nonlinear growth models. Aust. N. Z. J. Stat. 2022, 64, 381–405. [Google Scholar] [CrossRef]
  7. Ahmed, S.E.; Belaghi, R.A.; Hussein, A.; Safariyan, A. New and Efficient Estimators of Reliability Characteristics for a Family of Lifetime Distributions under Progressive Censoring. Mathematics 2024, 12, 1599. [Google Scholar] [CrossRef]
  8. Danilov, D.L. The Effects of Pretesting in Econometrics with Applications in Finance; Tilburg University Research Paper; Tilburg University: Tilburg, The Netherlands, 2003. [Google Scholar]
  9. Wei, C.Z. Asymptotic properties of least-squares estimates in stochastic regression models. Ann. Stat. 1985, 13, 1498–1508. Available online: https://www.jstor.org/stable/2241368 (accessed on 2 September 2024). [CrossRef]
  10. Judge, G.G.; Bock, M.E. The Statistical Implications of Pre-Test and Stein-Rule Estimators in Econometrics, 2nd ed.; Studies in Mathematical and Managerial Economics; North-Holland Publishing Co.: Amsterdam, The Netherlands, 1978. [Google Scholar] [CrossRef]
  11. Chien-Pai Han, C.R.; Ravichandran, J. Inference based on conditional speclfication. Commun. Stat.-Theory Methods 1988, 17, 1945–1964. [Google Scholar] [CrossRef]
  12. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2024; Available online: https://www.r-project.org/.
  13. Arel-Bundock, V.; Bacher, E. WDI: World Development Indicators and Other World Bank Data, R package version 2.7.8; R Foundation for Statistical Computing: Vienna, Austria, 2022. [Google Scholar] [CrossRef]
Figure 1. SREs of the ρ ^ P and ρ ^ PT estimators for ρ 0 = 0.3 , 0.5 , 0.7 , α = 0.05 , 0.1 . (top) n 1 = 25 and n 2 = 50 , (bottom) n 1 = n 2 = 500 .
Figure 1. SREs of the ρ ^ P and ρ ^ PT estimators for ρ 0 = 0.3 , 0.5 , 0.7 , α = 0.05 , 0.1 . (top) n 1 = 25 and n 2 = 50 , (bottom) n 1 = n 2 = 500 .
Stats 07 00067 g001
Figure 2. Fitted distribution of Z 2 for different values of ρ 0 and δ = 0 , 0.4 , 1 , 2 , with n 1 = n 2 = 500 .
Figure 2. Fitted distribution of Z 2 for different values of ρ 0 and δ = 0 , 0.4 , 1 , 2 , with n 1 = n 2 = 500 .
Stats 07 00067 g002aStats 07 00067 g002b
Figure 3. Mean annual precipitation from the “HOQUIAM BOWERMAN AP” station ( X 1 t ) and the “ABERDEEN” station ( X 2 t ) over the periods 1970−2024 and 1950−2022, respectively. (Top Left): Time series; (Top Right): Autocorrelation function estimation (AFC); (Bottom): Box-plots.
Figure 3. Mean annual precipitation from the “HOQUIAM BOWERMAN AP” station ( X 1 t ) and the “ABERDEEN” station ( X 2 t ) over the periods 1970−2024 and 1950−2022, respectively. (Top Left): Time series; (Top Right): Autocorrelation function estimation (AFC); (Bottom): Box-plots.
Stats 07 00067 g003
Figure 4. Annual inflation rates for Poland “ PL ” and Algeria “ DZ ” over the periods 1971−2023 and 1970−2023, respectively. (Top Left) Time series; (Top Right) Autocorrelation function estimation (AFC); (Bottom) Box-plots.
Figure 4. Annual inflation rates for Poland “ PL ” and Algeria “ DZ ” over the periods 1971−2023 and 1970−2023, respectively. (Top Left) Time series; (Top Right) Autocorrelation function estimation (AFC); (Bottom) Box-plots.
Stats 07 00067 g004
Table 1. SREs of estimators ρ ^ P and ρ ^ PT when sample sizes n 1 and n 2 are small, for varying ρ 0 values.
Table 1. SREs of estimators ρ ^ P and ρ ^ PT when sample sizes n 1 and n 2 are small, for varying ρ 0 values.
ρ 0 = 0.3 ρ 0 = 0.5 ρ 0 = 0.7
ρ ^ P ρ ^ PT ρ ^ P ρ ^ PT ρ ^ P ρ ^ PT
n 1 n 2 δ α = 0.05 α = 0.10 α = 0.05 α = 0.10 α = 0.05 α = 0.10
25500.02.90371.88621.58753.03961.89321.59203.27311.92071.6215
0.22.91571.85721.55503.09181.86401.58233.59971.91911.6015
0.42.85251.79561.51183.07911.80391.51323.70491.84871.4826
0.62.68661.63571.43033.01931.63131.40613.50481.68111.3935
0.82.45841.51451.34582.73411.50131.31893.14261.49011.2674
1.02.19601.37971.24212.37091.38601.20462.65271.31311.1361
2.01.07170.82780.83041.03110.76650.77590.86770.64570.6775
3.00.53170.55900.62830.45720.51050.5940---
4.00.28830.46900.58660.22460.48320.6411---
5.00.16520.52780.6918------
6.00.09810.78740.9199------
50500.01.99111.56341.39541.98971.56351.37762.14481.59731.4076
0.21.96991.52421.36342.01131.53271.37632.22391.59301.4014
0.41.94891.48121.32792.01271.50031.34302.28291.54321.3348
0.61.88661.41231.27952.01151.41791.27132.20851.45201.2773
0.81.78731.33881.22281.86381.33261.21792.07511.30971.1769
1.01.66171.25051.15051.71971.23531.13711.85081.18171.0887
2.00.96470.82690.84390.90610.77240.80050.73970.66760.7255
3.00.52460.61480.69270.42140.57110.6784---
4.00.28640.56620.68440.20290.62290.7656---
5.00.16330.65960.7979------
6.00.09540.85820.9342------
751000.02.26201.70541.49042.37901.72341.49112.49391.71671.4861
0.22.26741.71101.48012.40861.71811.45802.64101.70261.4545
0.42.20801.64061.44132.36161.63661.39852.58221.55791.3477
0.62.09381.55611.37722.23581.50951.32752.35411.46061.2700
0.81.93961.45061.29162.05431.37341.24742.15891.29901.1630
1.01.76841.29741.19701.83531.25081.15841.84781.14661.0636
2.00.96670.82560.83650.88410.75340.78280.67710.63950.7026
3.00.50910.59040.66560.42340.56240.65960.26740.55800.7032
4.00.29120.53530.65220.22130.57030.7136---
5.00.17730.60670.74850.12440.76140.8851---
6.00.11290.77520.88750.07210.96220.9875---
7.00.07360.94020.9771------
8.00.04910.99530.9988------
9.00.03291.00001.0000------
1001000.01.96361.52781.37712.02751.56331.39982.06261.58181.4085
0.21.97331.53541.37332.06691.54221.36132.18801.60361.3921
0.41.93431.47911.34042.03451.49301.32862.13281.48601.3044
0.61.85661.41631.28831.94841.42121.26142.06921.40491.2381
0.81.74801.33551.22811.80251.30991.19571.89461.26171.1463
1.01.64931.26811.17461.64951.21441.11891.67261.13641.0651
2.00.95960.83770.85200.88120.78140.80740.67110.66920.7365
3.00.53600.62630.69880.43920.60020.69300.26500.60740.7455
4.00.31310.57910.69530.23290.61710.74320.11350.87480.9590
5.00.19120.65290.78780.12890.79230.9002---
6.00.12160.79520.90610.07380.96190.9865---
7.00.07870.94050.97870.04270.99970.9997---
8.00.05190.99370.9963------
9.00.03430.99921.0000------
The stability condition (16) is not fulfilled and is noted by a ‘-’.
Table 2. SREs of estimators ρ ^ P and ρ ^ PT when sample sizes n 1 and n 2 are larger, for varying ρ 0 values.
Table 2. SREs of estimators ρ ^ P and ρ ^ PT when sample sizes n 1 and n 2 are larger, for varying ρ 0 values.
ρ 0 = 0.3 ρ 0 = 0.5 ρ 0 = 0.7
ρ ^ P ρ ^ PT ρ ^ P ρ ^ PT ρ ^ P ρ ^ PT
n 1 n 2 δ α = 0.05 α = 0.10 α = 0.05 α = 0.10 α = 0.05 α = 0.10
1502500.02.65171.80581.52502.64861.78191.52432.68711.78701.5302
0.22.67081.81051.54052.65141.76991.50052.79061.79461.4986
0.42.56151.74281.50062.56481.69421.45332.70221.64981.3995
0.62.38301.64261.42032.37051.57951.38092.39871.47001.2912
0.82.16281.52661.34842.12471.43581.28382.07121.30441.1764
1.01.92751.39851.25041.86251.29091.18371.74131.14591.0658
2.00.97560.83370.84330.86470.75980.78770.65930.63620.6966
4.00.30790.52150.63360.24220.53430.66580.14370.70440.8465
6.00.13280.70190.82690.09500.85330.9451---
8.00.06780.95980.98750.04390.99861.0000---
10.00.03781.00001.0000------
12.00.02211.00001.0000------
13.00.01701.00001.0000------
2502500.02.00091.57161.38412.00441.58531.37982.02321.56801.3856
0.21.98071.55191.38092.00571.55411.36352.03761.51861.3550
0.41.92461.50761.34581.93451.48021.34112.03761.44741.3079
0.61.83211.43091.28791.83751.40521.27571.90191.34401.2330
0.81.71281.33641.22531.70581.31411.21231.70721.23771.1417
1.01.57761.24301.15421.55381.21211.13591.47931.10701.0536
2.00.93060.83640.85790.84770.77920.81640.64780.67630.7353
4.00.32840.57320.68400.25740.60030.72200.14470.76760.8813
6.00.14420.75080.87490.10020.88880.95070.04260.99941.0000
8.00.07310.97000.99220.04470.99710.9994---
10.00.04010.99931.00000.02111.00001.0000---
12.00.02281.00001.0000------
13.00.01721.00001.0000------
15.00.00981.00001.0000------
2505000.02.93861.89861.57192.94331.91061.57773.02491.89691.5904
0.22.94241.90961.56082.92231.87181.54783.10221.83121.5068
0.42.79801.80191.51082.83001.78061.50732.90071.70021.4092
0.62.57471.67321.43982.57471.64511.40662.49831.51951.3278
0.82.30951.53821.34712.27221.48951.30942.12171.34491.1967
1.02.03531.39391.25031.96521.32951.20121.77381.16051.0762
3.00.53460.58990.65810.44950.54830.63380.30800.51360.6271
5.00.20600.53160.66640.16210.60720.74760.09690.87950.9537
7.00.10230.79490.90320.07630.93300.97730.04031.00001.0000
9.00.05850.97850.99050.04130.99810.9993---
11.00.03641.00001.00000.02411.00001.0000---
13.00.02391.00001.00000.01471.00001.0000---
15.00.01621.00001.0000------
17.00.01111.00001.0000------
19.00.00781.00001.0000------
5005000.01.97761.56081.38831.98251.54611.36981.97101.54691.3867
0.22.01351.56591.37281.97961.53531.36372.06331.55461.3681
0.41.95911.52021.35261.92471.48331.33471.99301.47641.3128
0.61.86731.44821.30871.80271.39461.28311.78591.36801.2400
0.81.74801.36941.25341.67321.31341.21081.61101.23491.1531
1.01.61211.27681.19241.52571.21071.13311.45781.12541.0557
3.00.55940.63990.70540.46750.60290.68370.31380.58360.6952
5.00.22870.61530.74820.17460.69500.81840.09640.91620.9726
7.00.11490.87130.94690.08090.95990.98670.03711.00001.0000
9.00.06530.99020.99630.04250.99931.00000.01561.00001.0000
11.00.04001.00001.00000.02381.00001.0000---
13.00.02561.00001.00000.01371.00001.0000---
15.00.01691.00001.00000.00801.00001.0000---
17.00.01131.00001.0000------
19.00.00751.00001.0000------
21.00.00501.00001.0000------
The stability condition (16) is not fulfilled and is noted by a ‘-’.
Table 3. Summary statistics and asymptotic distribution of Z 2 for different values of ρ 0 and δ , when n 1 = n 2 = 500 .
Table 3. Summary statistics and asymptotic distribution of Z 2 for different values of ρ 0 and δ , when n 1 = n 2 = 500 .
Descriptive StatisticsMLE of χ 2 df , Δ
ρ 0 δ Mean Variance Median Skewness Kurtosis df ^ Δ ^
0.30.00.98192.00710.43563.004613.90220.98810.0000
0.20.99691.97740.44252.66669.64350.98700.0100
0.41.02852.11240.46172.67509.73080.98240.0466
0.61.08222.33490.48292.67039.71250.99310.0891
0.81.15822.64560.52192.64079.47821.01320.1431
1.01.25673.04510.57322.58329.01950.99510.2600
1.51.60214.43740.78432.35187.32841.00260.5952
2.02.09156.40201.17292.07955.62590.99321.0964
0.50.00.99852.00750.44892.915813.07870.99520.0033
0.21.01292.06700.45992.928913.02070.99570.0172
0.41.05412.23280.48792.922512.78620.99340.0608
0.61.10222.33240.50752.631110.00751.01160.0916
0.81.19502.69680.55102.57959.56330.99930.1984
1.01.31563.17080.61482.49748.87240.99190.3279
1.51.74134.84540.90162.21136.74610.98020.7683
2.02.35817.57811.41402.04465.72821.01081.3410
0.70.00.98031.89860.44232.732910.56030.99190.0000
0.21.00172.00010.46242.946713.27981.00940.0001
0.41.06062.22850.49422.944513.32321.00490.0559
0.61.15622.63740.52892.64709.52450.98400.1723
0.81.29563.20420.60702.56818.86110.99700.2968
1.01.48433.86660.72362.46898.55491.02450.4606
1.52.15856.46441.24912.04935.98781.00781.1591
2.03.084810.03872.14031.69673.77721.00932.0936
Table 4. Summary of statistical characteristics and autocorrelation estimates for time series X 1 t and X 2 t .
Table 4. Summary of statistical characteristics and autocorrelation estimates for time series X 1 t and X 2 t .
Station j n j n j n 1 + n 2 Mean Variance σ ^ j 2   γ ^ j ( 0 )   ν ^ j   ρ ^ j
X 1 t 550.42970.18750.00100.00200.00102.03380.9733
X 2 t 730.57030.23050.00150.00270.00141.90050.9767
Table 5. Summary of statistical characteristics and autocorrelation estimates for annual inflation rates for Poland “ PL ” and Algeria “ DZ ”.
Table 5. Summary of statistical characteristics and autocorrelation estimates for annual inflation rates for Poland “ PL ” and Algeria “ DZ ”.
Country j n j n j n 1 + n 2 Mean Variance σ ^ j 2   γ ^ j ( 0 )   ν ^ j   ρ ^ j
PL 530.495328.22507107.405788.156973.300.83000.5188
DZ 540.504708.612554.217219.504653.21310.36650.9242
Table 6. Results of pretest estimator ρ ^ PT for different values of δ .
Table 6. Results of pretest estimator ρ ^ PT for different values of δ .
δ Δ ^ χ 2 1 , Δ ^ p-Value Z 2 < χ 2 1 , Δ ^ H 0 ρ ^ PT
0.00.00003.84150.0068
0.20.01673.90530.0073
0.40.06664.09470.0087
0.60.14994.40270.0112
0.80.26654.81590.0149FALSErejected ρ ^ PT = ρ ^ 1
1.00.41635.31600.0201
1.20.59955.88470.0270
1.40.81606.50690.0359
1.61.06587.17300.0472
1.81.34897.87720.0613TRUEaccepted ρ ^ PT = ρ ^ P
2.01.66538.61680.0785
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ahmed, S.E.; Guidoum, A.C.; Bendjeddou, S. Preliminary Test Estimation for Parallel 2-Sampling in Autoregressive Model. Stats 2024, 7, 1141-1158. https://doi.org/10.3390/stats7040067

AMA Style

Ahmed SE, Guidoum AC, Bendjeddou S. Preliminary Test Estimation for Parallel 2-Sampling in Autoregressive Model. Stats. 2024; 7(4):1141-1158. https://doi.org/10.3390/stats7040067

Chicago/Turabian Style

Ahmed, Syed Ejaz, Arsalane Chouaib Guidoum, and Sara Bendjeddou. 2024. "Preliminary Test Estimation for Parallel 2-Sampling in Autoregressive Model" Stats 7, no. 4: 1141-1158. https://doi.org/10.3390/stats7040067

APA Style

Ahmed, S. E., Guidoum, A. C., & Bendjeddou, S. (2024). Preliminary Test Estimation for Parallel 2-Sampling in Autoregressive Model. Stats, 7(4), 1141-1158. https://doi.org/10.3390/stats7040067

Article Metrics

Back to TopTop