Next Article in Journal
Bifurcation, Hidden Chaos, Entropy and Control in Hénon-Based Fractional Memristor Map with Commensurate and Incommensurate Orders
Previous Article in Journal
Enhanced Whale Optimization Algorithm for Improved Transient Electromagnetic Inversion in the Presence of Induced Polarization Effects
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Parametric Bootstrap Approach for a One-Way Error Component Regression Model with Measurement Errors

1
School of Statistics and Data Science, Nanjing Audit University, Nanjing 211815, China
2
School of Mathematics and Computer Science, Shanxi Normal University, Taiyuan 030032, China
3
School of Statistics, Beijing Normal University, Beijing 100875, China
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(19), 4165; https://doi.org/10.3390/math11194165
Submission received: 4 September 2023 / Revised: 2 October 2023 / Accepted: 3 October 2023 / Published: 4 October 2023

Abstract

:
In this paper, a one-way error component regression model with measurement errors is considered. The unknown parameter vector is estimated by using the bias-corrected method, and its corresponding asymptotic properties are also developed. For the hypothesis testing problem of the vector of the coefficient parameter in the model, a parametric bootstrap (PB) method is proposed. Under various sample sizes and parameter configurations, the effectiveness of our proposed PB test method is discussed by using some numerical simulations and a real data analysis.

1. Introduction

In this paper, we will consider the one-way error component regression model of Baltagi [1] and assume that the true covariates cannot be exactly or directly observed, that is, the model has measurement errors, which take the following form
Y i j = α 0 + x i j T β + μ i + ε i j , ω i j = x i j + δ i j , i = 1 , , n , j = 1 , , m ,
where Y i j denotes the j-th time response variable of the i-th subject; θ = ( α 0 , β T ) T = ( α 0 , β 1 , , β p ) T is the unknown vector of the coefficient parameter; x i j = ( x i j 1 , , x i j p ) T is the true fixed covariate; ω i j is the observed surrogate variable; and the individual effect μ i , random error ε i j and measurement error δ i j = ( δ i j 1 , , δ i j p ) T are independent. Assume that μ i , ε i j and δ i j are independent and identically distributed (i.i.d.) as N ( 0 , σ μ 2 ) , N ( 0 , σ ε 2 ) and N ( 0 p , Σ δ ) , respectively, 0 p is the p × 1 vector of zeros and σ μ 2 0 , σ ε 2 > 0 , Σ δ is known. The information of Σ δ may be obtained by replicating and observing ω i j ; the details can be found in Lin and Carroll [2], Carroll et al. [3], Li et al. [4], Dong et al. [5] and Guo et al. [6]. If the measurement errors do not exist, model (1) has been discussed by refs. [7,8], etc. When the individual effect and the time index of j = 1 , , m do not exist, model (1) reduces to the classical linear errors-in-variables (EV) model, which has been studied by refs. [9,10]. For the linear mixed effects EV model, Cui et al. [11] developed some moment estimation procedures, Xiao et al. [12] provided a GMM estimation approach and Yue et al. [13,14] proposed bias-corrected least square estimators for an unbalanced and balanced panel data model with measurement errors, respectively.
In practice, the hypothesis testing problem for the vector of the coefficient parameter
H 0 : θ = θ * H 1 : θ θ *
is interested, where θ * is an arbitrarily specified vector. For the nonhomogenous linear hypothesis testing problems ( H 0 : H θ = d ), Yue et al. [14] provided an adjusted testing method for model (1) by considering the discrepancies of the corrected residual sums of squares between the null and alternative hypotheses. When the measurement errors do not exist in model (1) (i.e., x i j is observed directly), Esmaeli-Ayan et al. [15] and Yue et al. [16] proposed a parametric bootstrap approach. For the hypothesis testing problem (2) in the linear EV model, Huwang et al. [17] developed a uniformly robust (RT) test method. However, when the sample size is small, the Type I error rates of the RT test in Huwang et al. [17] seem to be conservative. To solve this problem, many researchers suggested adopting the PB test to control the Type I error rates, such as Efron and Tibshirani [18], Krishnamoorthy et al. [19], Xu et al. [20], Xu [21], Ye et al. [22], Sun and Fisher [23], among others.
In this paper, we consider the hypothesis testing problem (2) of model (1) and develop a parametric bootstrap (PB) approach. The main contributions of this paper are three-fold. First, the biased-corrected estimator of θ is obtained for a one-way error component regression model with measurement errors in (1), and its asymptotic properties are also established. Second, a PB test is developed by using the obtained estimator of θ . Then, we provide the corresponding algorithm for computing the p-value of the proposed PB test. Third, our proposed PB test can control the Type I error rates well, which can be found in finite sample numerical simulations.
The organization of this paper is listed as follows. In Section 2, a parametric bootstrap (PB) approach for the hypothesis testing problem (2) of model (1) is developed, and the corresponding algorithm is provided. In Section 3, the effectiveness of the proposed PB test is assessed by some simulation studies, and the numerical results indicate that the proposed PB test performs better than the uniformly robust (RT) test. In Section 4, we apply the PB test to analyse Grunfeld data. In Section 5, we make some summaries and concluding remarks for this paper. In Appendix A, we give the corresponding proofs of Theorems and needed lemmas.
  Notations: For a matrix A , let A T denote the transpose for A and A B denote that A B is positive semi-definite. Let · be the Euclidean norm and “ L ” and “ P ” be the convergence in distribution and probability, respectively.

2. Methodology

2.1. Parameter Estimator and Its Theoretical Properties

For each term in model (1), we obtain its means for all the time j = 1 , , m of each subject i. Combining this with model (1), we have
Y ¯ i · = α 0 + x ¯ i · T β + μ i + ε ¯ i · , ω ¯ i · = x ¯ i · + δ ¯ i · , i = 1 , , n ,
where Y ¯ i · = 1 m j = 1 m Y i j , x ¯ i · = 1 m j = 1 m x i j , ε ¯ i · = 1 m j = 1 m ε i j , ω ¯ i · = 1 m j = 1 m ω i j and δ ¯ i · = 1 m j = 1 m δ i j . Here and after, the following notations are used, τ ¯ i · = ( 0 , δ ¯ i · T ) T , u ¯ i · = ( 1 , x ¯ i · T ) T and z ¯ i · = ( 1 , ω ¯ i · T ) T = u ¯ i · + τ ¯ i · , and let
M Y ¯ Y ¯ = 1 n i = 1 n Y ¯ i · 2 , M z ¯ z ¯ = 1 n i = 1 n z ¯ i · z ¯ i · T , M u ¯ u ¯ = 1 n i = 1 n u ¯ i · u ¯ i · T , M z ¯ Y ¯ = 1 n i = 1 n z ¯ i · T Y ¯ i · , M Y ¯ z ¯ = 1 n i = 1 n Y ¯ i · z ¯ i · ,
where M u ¯ u ¯ is assumed to be nonsingular, lim n M u ¯ u ¯ = M 0 and M 0 is a nonsingular matrix. Then, we have
E M Y ¯ Y ¯ M z ¯ Y ¯ M Y ¯ z ¯ M z ¯ z ¯ = θ T M u ¯ u ¯ θ + σ μ 2 + 1 m σ ε 2 θ T M u ¯ u ¯ M u ¯ u ¯ θ M u ¯ u ¯ + 1 m Σ τ ,
where Σ τ / m is a known covariance matrix of τ ¯ i · (since Σ δ is known). From ref. [9], we obtain a biased-corrected estimator of θ as follows,
θ ^ = M z ¯ z ¯ 1 m Σ τ 1 M Y ¯ z ¯ .
Denote W = M z ¯ z ¯ Σ τ / m , then
θ ^ = W 1 W θ + M Y ¯ z ¯ W θ = θ + W 1 M Y ¯ z ¯ W θ .
That is, W ( θ ^ θ ) = M Y ¯ z ¯ W θ .
Proposition 1.
Let υ ¯ i · = μ i + ε ¯ i · τ ¯ i · T θ = Y ¯ i · z ¯ i · T θ and M υ ¯ z ¯ = n 1 i = 1 n υ ¯ i · z ¯ i · = M Y ¯ z ¯ M z ¯ z ¯ θ . Together with (4), we have
W ( θ ^ θ ) = M υ ¯ z ¯ E M υ ¯ z ¯ = 1 n i = 1 n υ ¯ i · z ¯ i · E υ ¯ i · z ¯ i · ,
and the covariance matrix of   W ( θ ^ θ ) is
Cov W ( θ ^ θ ) = 1 n σ υ ¯ 2 M u ¯ u ¯ + 1 m Σ τ + 1 m 2 Σ τ θ θ T Σ τ S n ,
where E M υ ¯ z ¯ = Σ τ θ / m and S = σ υ ¯ 2 M u ¯ u ¯ + m 1 Σ τ + m 2 Σ τ θ θ T Σ τ is positive definite.
Theorem 1.
For model (3), when M u ¯ u ¯ is nonsingular, the moment generating function
M ( t ) = E exp n ( θ ^ θ ) T W T S 1 2 t e t T t / 2
uniformly over the parameter space Ω for any t R p + 1 , and ( θ T , ϕ T ) T Ω , ϕ is the nuisance parameter, which consists of x ¯ 1 · , , x ¯ n · in model (3), σ μ 2 and σ ε 2 . As n , we have
n S 1 2 W ( θ ^ θ ) L N 0 p + 1 , I p + 1
uniformly over the parameter space Ω.
Remark 1.
Theorem 1 shows the asymptotic normality of θ ^ θ . Based on this result and assuming the null hypothesis of (2) holds, we have
T 0 = n ( θ ^ θ * ) T W T S 1 W ( θ ^ θ * ) L χ p + 1 2
uniformly over the parameter space Ω * ; Ω * is a subspace of Ω, with θ being fixed at θ * .
Theorem 2.
Under model (3), when M u ¯ u ¯ is nonsingular and n , we have
T R = n ( θ ^ θ ) T W T S ˜ 1 W ( θ ^ θ ) L χ p + 1 2
uniformly over the parameter space Ω, where
S ˜ = σ ˜ υ ¯ 2 M z ¯ z ¯ + 1 m 2 Σ τ θ θ T Σ τ , σ ˜ υ ¯ 2 = 1 n i = 1 n Y ¯ i · z ¯ i · T θ 2 .
Thus, when T R | θ = θ * > χ p + 1 , α 2 holds, the null hypothesis H 0 in (2) should be rejected, where P ( χ p + 1 2 > χ p + 1 , α 2 ) = α .
Remark 2.
Theorem 2 gives a tool to assess whether the null hypothesis in (2) holds; the test method is viewed as the existing uniformly robust (RT) test method of Huwang et al. [17]. In this method, S ˜ is a “pseudo” estimator of S because the parameter θ therein is not estimated.

2.2. The Parametric Bootstrap Approach

There are unknown parameters θ in T 0 and T R . In such cases, the test statistic can be defined as
T = n ( θ ^ θ * ) T W T S ^ 1 W ( θ ^ θ * ) ,
where
S ^ = σ ^ υ ¯ 2 M z ¯ z ¯ + 1 m 2 Σ τ θ ^ θ ^ T Σ τ , σ ^ υ ¯ 2 = 1 n i = 1 n Y ¯ i · z ¯ i · T θ ^ 2 .
Unfortunately, the distribution of T in (10) is not known. To solve this problem, a PB test method can be developed similar to that in Yue et al. [16], Xu et al. [20] and Xu [21], where we can generate samples from estimated models by replacing the unknown parameter with its estimator.
Note that, when the null hypothesis H 0 : θ = θ * holds, we have Y ¯ i · N ( z ¯ i · T θ * , σ υ ¯ 2 ) . The PB pivot variable can be developed as follows, Y ¯ B i · N ( z ¯ i · T θ * , σ ^ υ ¯ 0 2 ) , and σ ^ υ ¯ 0 2 is the observed value of σ ^ υ ¯ 2 . Then,
θ ^ B = M z ¯ z ¯ 1 m Σ τ 1 M Y ¯ B z ¯ , M Y ¯ B z ¯ = 1 n i = 1 n Y ¯ B i · z ¯ i · ,
S ^ B = σ ^ υ ¯ B 2 M z ¯ z ¯ + 1 m 2 Σ τ θ ^ B θ ^ B T Σ τ , σ ^ υ ¯ B 2 = 1 n i = 1 n Y ¯ B i · z ¯ i · T θ ^ B 2 .
According to (10), the PB pivot variable can be constructed as
T B = n ( θ ^ B θ * ) T W T S ^ B 1 W ( θ ^ B θ * ) ,
where the PB test rejects H 0 in (2) whenever
P ( T B > t o b s ) < α ,
where t o b s is an observed T-value of (10), the significance level α is always given. The probability in (14) can be obtained using the following Algorithm.

3. Simulation Studies

In this section, the performance of our proposed PB test is studied using the Type I error rates and powers. The Type I error rates of the PB test are evaluated with following steps. Firstly, we evaluate the p-values by using Algorithm 1 based on generated samples ( Y ¯ 1 · , , Y ¯ n · ) and ( z ¯ 1 · , , z ¯ n · ) , the above step is repeated M times and the M p-values are kept; then, the Type I error rate can be obtained by computing the ratio of M p-values less than α . Here, M and r are taken to be 2500 and 5000, respectively. The RT test method is evaluated by (8) in Section 2, which is similar to that in ref. [17]. Each test method is carried out at a given significance level α = 0.05 .
Algorithm 1 Given samples ( Y ¯ 1 · , , Y ¯ n · ) and ( z ¯ 1 · , , z ¯ n · ) ,
     Step 1. Obtain estimations θ ^ and σ ^ υ ¯ 2 , then obtain the T-value in (10) and denote it t o b s ;
     Step 2. For i = 1 , , n , generate bootstrap samples Y ¯ B i · N ( z ¯ i · T θ * , σ ^ υ ¯ 2 ) , and provide θ ^ B and S ^ B based on (11) and (12), respectively;
     Step 3. Using (13) to obtain the value of PB pivot variable T B . Let R = 1 if T B > t o b s , R = 0 if T B < t o b s ;
     Step 4. Repeat r times for Steps 2–3, keep the r values of R and marked as R 1 , , R r ;
     Step 5. Using (14), the p-value can be evaluated as r 1 t = 1 r R t .
In the simulation, the data are generated from the model
Y i j = α 0 + x i j T β + μ i + ε i j , ω i j = x i j + δ i j , i = 1 , , n , j = 1 , , m ,
where the sample size n = 10 , 20 , 30 , 50 , and 100 ; m = 10 ; p = 2 ; α 0 = 1.0 ; β = ( 2.0 , 1.0 ) T ; x i j N ( 0 p , I p ) ; μ i N ( 0 , σ μ 2 ) ; ε i j N ( 0 , σ ε 2 ) ; δ i j N ( 0 p , σ δ 2 I p ) ; and σ ε 2 = 1.0 . Six different variances of individual effect, σ μ 2 = 0.25 , 0.5 , 1.0 , 2.0 , 4.0 , and 8.0 , are chosen to compare the estimated Type I error rates of our proposed PB and the existing RT tests. To study the effect of measurement errors on the performance of the PB test method, four different standard deviations of measurement error σ δ are adopted as 0.1, 0.2 , 0.3 and 0.5 in our simulation studies.
In Table 1, we calculate the Type I error rates of the proposed PB and existing RT tests for different sample sizes n, variances in individual effect σ μ 2 and standard deviations of measurement error σ δ .
Table 1 shows that:
(1)
The Type I error rate of our proposed PB test is very close to α = 0.05 , but the existing RT test is unstable. For the smaller sample size n = 10 , the RT test seems to be very conservative, but the PB test can control Type I error rates within the significance level very well.
(2)
The magnitude of variance in individual effect σ μ 2 has little effect on the performance of each test method (PB and RT).
(3)
When the sample sizes n and σ μ 2 are fixed, the Type I error rates of both tests are robust to the variation in the standard deviations of measurement error.
In Table 2, we provide the estimated powers for hypothesis testing problem H 0 : θ = θ * versus H 1 : θ θ * of the PB and RT tests under moderate and large sample sizes, where θ = ( α 0 , β T ) = ( α 0 , β 1 , β 2 ) T and θ * = ( 1.0 , 2.0 , 1.0 ) T . To evaluate the powers of each test method, nine different values of d = j = 1 3 | θ j θ j * | are chosen, which is similar to that in Huwang et al. [17]. The null hypothesis holds when d = 0 . In the simulations of Table 2, the variance in individual effect σ μ 2 = 4.0 , σ ε 2 = 1.0 and the standard deviation of measurement error σ δ is chosen as 0.1, 0.2, 0.3 and 0.5.
From Table 2, we can see that
(1)
When the sample is large ( n = 50 , 100 ), both the PB and RT tests can control the Type I error rates.
(2)
When the sample sizes n and σ δ are fixed, the power increases as the d-value increases, as expected. In most scenarios, the powers of our proposed PB test are slightly higher than those of the RT test.
(3)
For the same standard deviation of measurement error σ δ , the estimated power increases as the sample size increases.
Generally speaking, the proposed PB test is feasible.

4. A Real Data Analysis

In this section, our proposed PB method is applied to analyze Grunfeld data. The dataset was available in the R package “plm” and studied in Baltagi [1] and Grunfeld [24], and was a balanced panel dataset of 10 large US manufacturing firm observations from 1935 to 1954. For this dataset, we considered the following one-way error component regression model,
Y i j = x i j 1 β 1 + x i j 2 β 2 + μ i + ε i j , i = 1 , , n , j = 1 , , m ,
where n = 10 , m = 20 , Y i j is the j-year gross investment of firm i, x i j 1 denotes firm value, x i j 2 is the capital stock of plant and equipment, μ i represents the individual firm effect, ε i j is the random error and we assume the observations of firm and capital stock cannot be observed accurately. That is, x i j l has additive errors δ i j l ( ω i j l = x i j l + δ i j l ) for l = 1 , 2 , δ i j l N ( 0 , σ δ l 2 ) . In the absence of validation or replication data, we conducted a sensitivity analysis by taking σ δ l 2 = 0.25 × Var ( X l ) and X l = ( x 11 l , , x 1 m l , , x n m l ) T , which is similar to that in Lin and Carroll [2], Yue et al. [13] and Yang et al. [25]. The interested hypothesis testing problem is H 0 : θ = θ * vs. H 1 : θ θ * , where θ = ( β 1 , β 2 ) T . To compare the performance of the proposed PB and existing RT tests, we chose three different values of θ * , and evaluate the p-values of the PB and RT tests in Table 3. In addition, the approximate 95% confidence intervals for β 1 and β 2 were also calculated as ( 0.0891 , 0.1880 ) and ( 0.2501 , 0.2410 ) , respectively.
From Table 3, we can see that the PB test rejects the null hypothesis when θ * = ( 0.06 , 0.25 ) T and α = 0.05 , whereas the RT is in favor of the null hypothesis H 0 . According to the 95% confidence intervals for β 1 and β 2 , we should reject the null hypothesis H 0 . That is, the PB test can make a right choice, but the RT test cannot. When θ * = ( 0.09 , 0.24 ) T and θ * = ( 0.11 , 0.11 ) T , both the PB and RT tests are in favor of the null hypothesis, which is consistent with the results from the 95% confidence intervals for β 1 and β 2 . Therefore, the proposed PB test is more informative.

5. Conclusions

In this paper, we consider a one-way error component regression model with measurement errors. When we ignore the measurement errors and simply adopt the observed surrogate variables to obtain the estimation of an unknown parameter vector, an inefficient testing method may be developed. To solve this problem, we combine the biased-corrected estimator of the unknown parameter and propose a parametric bootstrap (PB) method for the hypothesis testing problem of the unknown vector of the coefficient parameter. Through some simulation studies and a real data analysis, we find the proposed PB test can control the Type I error rates within the significance level very well, but the existing uniformly robust (RT) test is conservative when the sample size is small. When the sample size is larger, both test methods have similar performance. In a word, the proposed PB test is feasible.

Author Contributions

Methodology, L.Y., J.S. and J.L. (Jinguan Lin); Software, J.L. (Jingxuan Luo); Formal analysis, J.S.; Investigation, J.L. (Jingxuan Luo); Data curation, L.Y.; Writing – original draft, L.Y.; Writing—review & editing, L.Y. and J.L. (Jinguan Lin); Visualization, J.S. and J.L. (Jinguan Lin). All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (Nos. 12001277, 12071267, 12371267, 11971235, 11831008) and the National Social Science Foundation of China (No. 21BTJ030).

Data Availability Statement

Not applicable.

Acknowledgments

The authors are grateful to the editor, the associate editor and four reviewers for the constructive comments and suggestions that led to significant improvement of the original manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Theoretical Proofs

This section contains the proofs of Proposition 1, Theorems 1 and 2 in the previous section, as well as some Lemmas that are needed to prove the main Theorems.
Proof of Proposition 1.
Based on (6), we have
Cov W ( θ ^ θ ) = 1 n 2 i = 1 n Cov υ ¯ i · z ¯ i · E ( υ ¯ i · z ¯ i · ) .
By simple computations, we have
Cov υ ¯ i · z ¯ i · E ( υ ¯ i · z ¯ i · ) = E υ ¯ i · z ¯ i · E ( υ ¯ i · z ¯ i · ) υ ¯ i · z ¯ i · E ( υ ¯ i · z ¯ i · ) T = E υ ¯ i · u ¯ i · + υ ¯ i · τ ¯ i · Σ υ ¯ τ ¯ υ ¯ i · u ¯ i · + υ ¯ i · τ ¯ i · Σ υ ¯ τ ¯ T = E υ ¯ i · 2 u ¯ i · u ¯ i · T + υ ¯ i · 2 τ ¯ i · τ ¯ i · T + υ ¯ i · 2 u ¯ i · τ ¯ i · T + υ ¯ i · 2 τ ¯ i · u ¯ i · T Σ υ ¯ τ ¯ Σ υ ¯ τ ¯ T = σ υ ¯ 2 u ¯ i · u ¯ i · T + E ( υ ¯ i · 2 τ ¯ i · τ ¯ i · T ) + E ( υ ¯ i · 2 u ¯ i · τ ¯ i · T ) + E ( υ ¯ i · 2 τ ¯ i · u ¯ i · T ) Σ υ ¯ τ ¯ Σ υ ¯ τ ¯ T ,
where σ υ ¯ 2 is the variance of υ ¯ i · , σ υ ¯ 2 = σ μ 2 + σ ε 2 / m + m 1 θ T Σ τ θ and Σ υ ¯ τ ¯ = E ( υ ¯ i · τ ¯ i · ) = m 1 Σ τ θ .
Note that
τ ¯ i · υ ¯ i · N 0 p + 1 0 , 1 m Σ τ Σ υ ¯ τ ¯ Σ υ ¯ τ ¯ T σ υ ¯ 2 ,
and by the formula of conditional expectation, we have
τ ¯ i · υ ¯ i · N ( a υ ¯ i · , B ) ,
where a = Σ υ ¯ τ ¯ / σ υ ¯ 2 = m 1 Σ τ θ / σ υ ¯ 2 and B = m 1 Σ τ Σ υ ¯ τ ¯ Σ υ ¯ τ ¯ T / σ υ ¯ 2 = m 1 Σ τ Σ τ θ θ T Σ τ / ( m 2 σ υ ¯ 2 ) . Then, from (A3), one can show that
E υ ¯ i · 2 u ¯ i · τ ¯ i · T = u ¯ i · E υ ¯ i · 2 τ ¯ i · T = u ¯ i · E E υ ¯ i · 2 τ ¯ i · T υ ¯ i · = u ¯ i · E υ ¯ i · 2 E τ ¯ i · T υ ¯ i · = u ¯ i · E υ ¯ i · 3 a T = 0 M ,
where 0 M is the ( p + 1 ) × ( p + 1 ) matrix of zeros. Similarly, we have E υ ¯ i · 2 τ ¯ i · u ¯ i · T = 0 M . And
E υ ¯ i · 2 τ ¯ i · τ ¯ i · T = E E υ ¯ i · 2 τ ¯ i · τ ¯ i · T υ ¯ i · = E υ ¯ i · 2 E τ ¯ i · τ ¯ i · T υ ¯ i · = E υ ¯ i · 2 Cov ( τ ¯ i · υ ¯ i · ) + E ( τ ¯ i · υ ¯ i · ) E ( τ ¯ i · υ ¯ i · ) T = E υ ¯ i · 2 1 m Σ τ 1 m 2 σ υ ¯ 2 Σ τ θ θ T Σ τ + υ ¯ i · 4 a a T = σ υ ¯ 2 1 m Σ τ + 2 m 2 Σ τ θ θ T Σ τ .
Combining the above facts, (A2) can be rewritten as
Cov υ ¯ i · z ¯ i · E ( υ ¯ i · z ¯ i · ) = σ υ ¯ 2 u ¯ i · u ¯ i · T + 1 m Σ τ + Σ υ ¯ τ ¯ Σ υ ¯ τ ¯ T .
Subsequently, the covariance matrix of W ( θ ^ θ ) is
Cov W ( θ ^ θ ) = 1 n 2 i = 1 n Cov υ ¯ i · z ¯ i · E ( υ ¯ i · z ¯ i · ) = 1 n σ υ ¯ 2 M u ¯ u ¯ + 1 m Σ τ + 1 m 2 Σ τ θ θ T Σ τ S n ,
where S = σ υ ¯ 2 M u ¯ u ¯ + m 1 Σ τ + m 2 Σ τ θ θ T Σ τ is positive definite. □
Lemma A1.
The absolute value of each element in the vector a T S 1 2 σ υ ¯ 2 is bounded above by p + 1 .
Proof. 
Denote Q = m 2 Σ τ θ θ T Σ τ σ υ ¯ 2 M u ¯ u ¯ + m 1 Σ τ + m 2 Σ τ θ θ T Σ τ 1 . By the definitions of a and S in (A3) and (7), we have
a T S 1 2 σ υ ¯ 2 2 = 1 m 2 θ T Σ τ S 1 Σ τ θ = 1 m 2 θ T Σ τ σ υ ¯ 2 M u ¯ u ¯ + 1 m Σ τ + 1 m 2 Σ τ θ θ T Σ τ 1 Σ τ θ = trace of Q = sum of the eigenvalues of Q .
Letting λ 1 be any eigenvalue of Q and e 1 be the corresponding eigenvector, e 1 * = σ υ ¯ 2 ( M u ¯ u ¯ + m 1 Σ τ )   + m 2 Σ τ θ θ T Σ τ 1 e 1 , then
1 m 2 Σ τ θ θ T Σ τ e 1 * = λ 1 σ υ ¯ 2 M u ¯ u ¯ + 1 m Σ τ + 1 m 2 Σ τ θ θ T Σ τ e 1 * .
Premultiplying (A5) by e 1 * T , we have
λ 1 = 1 m 2 e 1 * T Σ τ θ θ T Σ τ e 1 * e 1 * T σ υ ¯ 2 M u ¯ u ¯ + 1 m Σ τ + 1 m 2 Σ τ θ θ T Σ τ e 1 * .
Using the fact that M u ¯ u ¯ is positive definite, one can obtain 0 λ 1 1 , then a T S 1 2 σ υ ¯ 2 2 p + 1 . The Lemma A1 follows immediately. □
Lemma A2.
The absolute value of each element in the matrix S 1 2 BS 1 2 σ υ ¯ 2 is bounded above by p + 1 , where B is defined in (A3).
Proof. 
Letting λ 2 be any eigenvalue of S 1 2 BS 1 2 σ υ ¯ 2 and e 2 be the corresponding eigenvector, we have
σ υ ¯ 2 S 1 2 BS 1 2 e 2 = λ 2 e 2 .
Denote e 2 * = S 1 2 e 2 . Then,
B e 2 * = λ 2 S σ υ ¯ 2 e 2 * .
Premultiplying (A7) by e 2 * T , e 2 * T B e 2 * = λ 2 σ υ ¯ 2 e 2 * T S e 2 * holds, that is,
λ 2 = e 2 * T B e 2 * σ υ ¯ 2 e 2 * T S e 2 * .
Note that σ υ ¯ 2 S B = M u ¯ u ¯ + 2 ( m 2 σ υ ¯ 2 ) 1 Σ τ θ θ T Σ τ is positive definite; hence, σ υ ¯ 2 S B and 0 λ 2 1 hold.
Since S 1 2 BS 1 2 σ υ ¯ 2 is a symmetric matrix, there exists an orthogonal matrix P such that S 1 2 BS 1 2 σ υ ¯ 2 = PDP T , where D is the diagonal matrix and its elements are the eigenvalues of S 1 2 BS 1 2 σ υ ¯ 2 . Let γ i j be the ( i , j ) -th element of S 1 2 BS 1 2 σ υ ¯ 2 , l i = ( 0 , , 0 , 1 , 0 , , 0 ) T ( i = 1 , , p + 1 ) be the i-th unit vector in R p + 1 . Then, γ i j = l i T PDP T l j . According the above facts, we have
| γ i j |     DP T l i   P T l j   = ( l i T PD T DP T l i ) 1 2 ( l j T PP T l j ) 1 2 = square root of the ( i , i ) th element of PD 2 P T .
Because PD 2 P T is nonnegative definite, we have
the ( i , i ) th element of PD 2 P T trace ( PD 2 P T ) = trace ( D 2 ) = sum of the squares of the eigenvalues of S 1 2 BS 1 2 σ υ ¯ 2 ,
then | γ i j |   p + 1 . □
Proof of Theorem 1.
According to (A3) and the definitions of W and S in Section 2, we have
M ( t ) = E exp n M z ¯ z ¯ 1 m Σ τ ( θ ^ θ ) T S 1 2 t = E exp 1 n i = 1 n υ ¯ i · z ¯ i · T E ( υ ¯ i · z ¯ i · T ) S 1 2 t = i = 1 n E exp υ ¯ i · u ¯ i · T + υ ¯ i · τ ¯ i · T a T σ υ ¯ 2 S 1 2 t = i = 1 n E E exp ( υ ¯ i · u ¯ i · T + υ ¯ i · τ ¯ i · T a T σ υ ¯ 2 ) S 1 2 t | υ ¯ i · = i = 1 n E exp 1 n υ ¯ i · u ¯ i · T S 1 2 t + 1 n E ( υ ¯ i · τ ¯ i · T | υ ¯ i · ) S 1 2 t + 1 2 n t T Cov ( υ ¯ i · τ ¯ i · T S 1 2 | υ ¯ i · ) t 1 n a T σ υ ¯ 2 S 1 2 t = i = 1 n E exp ( u ¯ i · T S 1 2 t σ υ ¯ / n ) υ ¯ i · * + ( a T S 1 2 t σ υ ¯ 2 / n + t T S 1 2 BS 1 2 t σ υ ¯ 2 / 2 n ) υ ¯ i · * 2 a T σ υ ¯ 2 S 1 2 t / n = i = 1 n E exp ( u ¯ i · T S 1 2 t σ υ ¯ / n ) υ ¯ i · * + ξ υ ¯ i · * 2 a T σ υ ¯ 2 S 1 2 t / n ,
where υ ¯ i · * = υ ¯ i · / σ υ ¯ and ξ = a T S 1 2 t σ υ ¯ 2 / n + t T S 1 2 BS 1 2 t σ υ ¯ 2 / 2 n . We rewrite (A8) as
M ( t ) = i = 1 n E [ exp { ξ ( υ ¯ i · * + ( 2 n ξ ) 1 u ¯ i · T S 1 2 t σ υ ¯ ) 2 I 1 ( 4 n ξ ) 1 σ υ ¯ 2 t T S 1 2 u ¯ i · u ¯ i · T S 1 2 t I 2 n 1 / 2 a T σ υ ¯ 2 S 1 2 t } ] .
The expectation and variance in υ ¯ i · * + ( 2 n ξ ) 1 u ¯ i · T S 1 2 t σ υ ¯ for term I 1 are ( 2 n ξ ) 1 u ¯ i · T S 1 2 t σ υ ¯ and 1, respectively. Hence, { υ ¯ i · * + ( 2 n ξ ) 1 u ¯ i · T S 1 2 t σ υ ¯ } 2 follows a noncentral χ 2 distribution with one degree of freedom and noncentrality parameter ζ = t T S 1 2 u ¯ i · u ¯ i · T S 1 2 t σ υ ¯ 2 / ( 4 n ξ 2 ) . According to the exception of the moment generating function of noncentral χ 2 distribution, we have
E { exp ( I 1 ) } = ( 1 2 ξ ) 1 2 exp { ζ ξ ( 1 2 ξ ) 1 } = ( 1 2 ξ ) 1 2 exp σ υ ¯ 2 t T S 1 2 u ¯ i · u ¯ i · T S 1 2 t { 4 n ξ ( 1 2 ξ ) } 1 .
Plugging (A10) in (A9), we have
M ( t ) = i = 1 n ( 1 2 ξ ) 1 2 exp σ υ ¯ 2 t T S 1 2 u ¯ i · u ¯ i · T S 1 2 t ( 2 n ( 1 2 ξ ) ) 1 σ υ ¯ 2 a T S 1 2 t / n = ( 1 2 ξ ) n 2 exp σ υ ¯ 2 t T S 1 2 M u ¯ u ¯ S 1 2 t / ( 2 4 ξ ) n σ υ ¯ 2 a T S 1 2 t = exp n log ( 1 2 ξ ) / 2 + σ υ ¯ 2 t T S 1 2 M u ¯ u ¯ S 1 2 t / ( 2 4 ξ ) n σ υ ¯ 2 a T S 1 2 t .
By the Taylor expansion of n log ( 1 2 ξ ) / 2 in (A11), we have
M ( t ) = exp { 1 2 σ υ ¯ 2 t T S 1 2 BS 1 2 t + σ υ ¯ 4 t T S 1 2 a a T S 1 2 t + 1 n σ υ ¯ 4 a T S 1 2 t t T S 1 2 BS 1 2 t + 1 4 n ( t T S 1 2 BS 1 2 t σ υ ¯ 2 ) 2 + 4 3 n ξ * 3 + σ υ ¯ 2 t T S 1 2 M u ¯ u ¯ S 1 2 t / ( 2 4 ξ ) } = exp { 1 2 t T S 1 2 S * S 1 2 t + 1 n σ υ ¯ 4 a T S 1 2 t t T S 1 2 BS 1 2 t + 1 4 n ( t T S 1 2 BS 1 2 t σ υ ¯ 2 ) 2 + 4 3 n ξ * 3 ] ,
where S * = σ υ ¯ 2 B + 2 σ υ ¯ 2 a a T + M u ¯ u ¯ / ( 1 2 ξ ) = σ υ ¯ 2 M u ¯ u ¯ / ( 1 2 ξ ) + Σ τ / m + Σ τ θ θ T Σ τ / m 2 and 0 | ξ * | | ξ | .
To show that M ( t ) e t T t / 2 uniformly over the parameter space Ω for any t R p + 1 , it is sufficient to prove that
S 1 2 S * S 1 2 I p + 1
and
exp 1 n σ υ ¯ 4 a T S 1 2 t t T S 1 2 BS 1 2 t + 1 4 n ( t T S 1 2 BS 1 2 t σ υ ¯ 2 ) 2 + 4 3 n ξ * 3 1
uniformly over the parameter space Ω . Using Lemmas A1 and A2, we have ξ = O p 1 n . Then, (A12) and (A13) hold. Theorem 1 follows immediately. □
Proof of Theorem 2.
To prove (8), it is sufficient to show that
n ( θ ^ θ ) T W T S ˜ 1 S 1 W ( θ ^ θ ) = n ( θ ^ θ ) T W T S 1 2 S 1 2 S ˜ 1 S 1 S 1 2 S 1 2 W ( θ ^ θ ) P 0
uniformly over the parameter space Ω . According to Theorem 1, it is straightforward to prove (A14) if we can show that
S 1 2 S ˜ 1 S 1 S 1 2 = S 1 2 S ˜ 1 S 1 2 I p + 1 P 0 M ,
that is,
S 1 2 S ˜ S 1 2 I p + 1 = S 1 2 S ˜ S S 1 2 P 0 M ,
uniformly over the parameter space Ω . Because n σ ˜ υ ¯ 2 / σ υ ¯ 2 χ 2 ( n ) , we have σ ˜ υ ¯ 2 = σ υ ¯ 2 + σ υ ¯ 2 O p 1 n , where O p 1 n does not rely on any parameters. Consequently, by the definition of S and S ˜ in (A4) and (9), we can show that
S 1 2 ( S ˜ S ) S 1 2 = S 1 2 σ ˜ υ ¯ 2 M z ¯ z ¯ σ υ ¯ 2 M u ¯ u ¯ + 1 m Σ τ S 1 2 = S 1 2 σ υ ¯ 2 M z ¯ z ¯ + σ υ ¯ 2 O p 1 n M z ¯ z ¯ σ υ ¯ 2 M u ¯ u ¯ + 1 m Σ τ S 1 2 = O p 1 n σ υ ¯ 2 S 1 2 M z ¯ z ¯ S 1 2 + σ υ ¯ 2 S 1 2 M z ¯ z ¯ M u ¯ u ¯ + 1 m Σ τ S 1 2 .
Similar to the proofs of Lemma 3 and Lemma 4 in [17], we obtain the following facts: (1) the absolute value of every element in the matrix σ υ ¯ 2 S 1 2 M z ¯ z ¯ S 1 2 is less than p + 1 ( p + 3 l ) + O p 1 n , where l 1 is the number of zero elements of τ ¯ i · ; (2) the matrix σ υ ¯ 2 S 1 2 M z ¯ z ¯ M u ¯ u ¯ + 1 m Σ τ S 1 2 converges to a zero matrix in probability uniformly over the parameter space Ω . Then, the theorem follows immediately. □

References

  1. Baltagi, B.H. Econometric Analysis of Panel Data; John Wiley & Sons: New York, NY, USA, 2008. [Google Scholar]
  2. Lin, X.H.; Carroll, R.J. Nonparametric function estimation for clustered data when the predictor is measured without/with Error. J. Am. Stat. Assoc. 2000, 95, 520–534. [Google Scholar] [CrossRef]
  3. Carroll, R.J.; Ruppert, D.; Stefanski, L.A.; Crainiceanu, C.M. Measurement Error in Nonlinear Models: A Modern Perspective; Chapman & Hall/CRC: London, UK, 2006. [Google Scholar]
  4. Li, G.R.; Zhang, J.; Feng, S.Y. Modern Measurement Error Models; Science Press: Beijing, China, 2016. [Google Scholar]
  5. Dong, H.; Otsu, T.; Taylor, L. Estimation of varying coefficient models with measurement error. J. Econom. 2022, 230, 388–415. [Google Scholar] [CrossRef]
  6. Guo, L.R.; Song, W.X.; Shi, J.H. Estimating multivariate density and its derivatives for mixed measurement error data. J. Multivar. Anal. 2022, 191, 105005. [Google Scholar] [CrossRef]
  7. Montes-Rojas, G.; Sosa-Escudero, W. Robust tests for heteroskedasticity in the one-way error components model. J. Econom. 2011, 160, 300–310. [Google Scholar] [CrossRef]
  8. Zhao, H.B. Exact tests in panel data using generalized p-values. Commun. Stat.-Theory Methods 2008, 37, 18–36. [Google Scholar] [CrossRef]
  9. Fuller, W.A. Measurement Error Models; John Wiley & Sons: New York, NY, USA, 1987. [Google Scholar]
  10. Shi, J.H.; Zhang, Y.J.; Yu, P.; Song, W.X. SIMEX estimation in parametric modal regression with measurement error. Comput. Stat. Data Anal. 2021, 157, 107158. [Google Scholar] [CrossRef]
  11. Cui, H.J.; Ng, K.W.; Zhu, L.X. Estimation in mixed effects model with errors in variables. J. Multivar. Anal. 2004, 91, 53–73. [Google Scholar] [CrossRef]
  12. Xiao, Z.G.; Shao, J.; Palta, M. Instrumental variable and GMM estimation for panel data with measurement error. Stat. Sin. 2010, 20, 1725–1747. [Google Scholar]
  13. Yue, L.L.; Li, G.R.; Zhang, J.H. Statistical inference for the unbalanced two-way error component regression model with errors-in-variables. J. Korean Stat. Soc. 2017, 46, 593–607. [Google Scholar] [CrossRef]
  14. Yue, L.L.; Shi, J.H.; Li, G.R. Statistical inference for the two-way error component regression model with errors-in-variables. Sci. China Math. 2017, 47, 1077–1088. (In Chinese) [Google Scholar] [CrossRef]
  15. Esmaeli-Ayan, A.; Malekzadeh, A.; Hormozinejad, F. Inferences on the regression coefficients in panel data models: Parametric bootstrap approach. Math. Sci. 2020, 14, 67–73. [Google Scholar] [CrossRef]
  16. Yue, L.L.; Shi, J.H.; Song, W.X. A parametric bootstrap approach for two-way error component regression models. Commun. Stat.-Simul. Comput. 2017, 46, 3952–3961. [Google Scholar] [CrossRef]
  17. Huwang, L.C.; Huang, Y.H.S.; Wang, Y.H.T. Uniformly robust tests in errors-in-variables models. Ann. Inst. Stat. Math. 2009, 61, 789–810. [Google Scholar] [CrossRef]
  18. Efron, B.; Tibshirani, R.J. An Introduction to the Bootstrap; Chapman & Hall/CRC: London, UK, 1993. [Google Scholar]
  19. Krishnamoorthy, K.; Lu, F.; Mathew, T. A parametric bootstrap approach for ANOVA with unequal variances: Fixed and random models. Comput. Stat. Data Anal. 2007, 51, 5731–5742. [Google Scholar] [CrossRef]
  20. Xu, L.W.; Yang, F.Q.; Abula, A.; Qin, S. A parametric bootstrap approach for two-way ANOVA in presence of possible interactions with unequal variances. J. Multivar. Anal. 2013, 115, 172–180. [Google Scholar] [CrossRef]
  21. Xu, L.W. Parametric bootstrap inferences for the growth curve models with intraclass correlation structure. Commun. Stat.-Theory Methods 2017, 46, 3308–3320. [Google Scholar] [CrossRef]
  22. Ye, R.D.; Xu, L.J.; Luo, K.; Jiang, L. A parametric bootstrap approach for one-way classification model with skew-normal random effects. Appl. Math. J. Chin. Univ. 2019, 34, 423–435. [Google Scholar] [CrossRef]
  23. Sun, Z.Q.; Fisher, T.J. Testing for correlation between two time series using a parametric bootstrap. J. Appl. Stat. 2020, 48, 2042–2063. [Google Scholar] [CrossRef] [PubMed]
  24. Grunfeld, Y. The Determinants of Corporate Investment. Ph.D. Dissertation, University of Chicago, Chicago, IL, USA, 1958. [Google Scholar]
  25. Yang, Y.P.; Li, G.R.; Peng, H. Empirical likelihood of varying coefficient errors-in-variables models with longitudinal data. J. Multivar. Anal. 2014, 127, 1–18. [Google Scholar] [CrossRef]
Table 1. Evaluated Type I error rates of PB and RT test methods.
Table 1. Evaluated Type I error rates of PB and RT test methods.
n σ μ 2 σ δ = 0.1 σ δ = 0.2 σ δ = 0.3 σ δ = 0.5
PBRTPBRTPBRTPBRT
100.250.04640.01040.04880.01400.05040.01880.05200.0392
0.50.04600.00860.04760.01080.04640.01620.05240.0338
1.00.04720.00840.04880.01020.04600.01320.05240.0244
2.00.04680.00840.04760.00980.05240.01200.05040.0198
4.00.04680.00840.04720.00980.05160.01200.04800.0160
8.00.04720.00800.04680.00920.04960.01080.04960.0150
200.250.04640.03780.04760.03840.04840.03700.04880.0432
0.50.04600.04000.04920.03880.04880.03660.04800.0414
1.00.04680.03740.05080.03900.04600.03860.04960.0364
2.00.04720.03860.04960.03940.04840.03940.04920.0362
4.00.04760.03860.04960.03860.04880.03860.04600.0356
8.00.04760.03920.04880.03780.04960.03760.04720.0354
300.250.05040.03840.04960.03880.04680.03740.04880.0442
0.50.05080.03880.05080.03880.04760.03860.04880.0446
1.00.05040.03880.05120.03860.04880.03940.04800.0432
2.00.05000.03880.05000.03900.05000.03920.04600.0434
4.00.05000.03860.04960.03880.05040.03880.04640.0428
8.00.05000.03860.05040.03920.04920.03960.04680.0414
500.250.04960.04640.04760.04300.04920.04180.05000.0474
0.50.04880.04480.05000.04260.04680.04400.04920.0462
1.00.04920.04460.04760.04380.04760.04440.04840.0448
2.00.04920.04520.04880.04680.04920.04600.05000.0466
4.00.05000.04660.05000.04720.05120.04700.04880.0474
8.00.05040.04660.05120.04680.05080.04720.04880.0484
Table 2. Simulated powers of the test methods when σ μ 2 = 4.0 .
Table 2. Simulated powers of the test methods when σ μ 2 = 4.0 .
n σ δ Methods d = 0.0 d = 0.2 d = 0.6 d = 0.8 d = 1.0 d = 1.2 d = 1.4 d = 1.6 d = 1.8
30 0.1 PB0.05000.05480.06160.07080.07800.08920.10000.10920.1256
RT0.03860.04180.05480.06180.07420.08580.10180.12160.1454
0.2 PB0.04960.05320.06280.07120.08000.09080.09880.11360.1248
RT0.03880.04180.05340.06300.07500.08620.10200.12020.1394
0.3 PB0.05040.05200.06240.07320.08240.09240.10200.11280.1268
RT0.03880.04100.05600.06280.07380.08640.10180.11760.1378
0.5 PB0.04640.05320.06520.07360.08480.09680.11120.12440.1424
RT0.04280.04560.05560.06440.07200.08240.09480.11260.1280
50 0.1 PB0.05360.05400.06560.07600.09200.11240.13760.16560.1928
RT0.04660.04660.06000.07180.08860.11020.13300.16640.2036
0.2 PB0.05240.05480.06960.08280.10080.12120.14600.17000.2028
RT0.04720.04820.05980.07200.08840.10720.13080.16340.1970
0.3 PB0.05120.05680.07560.09320.10560.12840.15520.18200.2096
RT0.04700.04920.05940.07360.08680.10400.12680.15480.1864
0.5 PB0.05320.05760.08040.09760.11560.13720.16520.19360.2284
RT0.04740.04980.06080.07060.08000.09540.11880.14240.1622
100 0.1 PB0.05040.05120.07520.10200.13000.17400.21920.27680.3528
RT0.04660.04720.07240.10320.13380.17540.23040.29160.3630
0.2 PB0.05040.05120.08480.11000.14520.18120.23600.29520.3732
RT0.04600.04840.07280.10140.12960.17040.22320.28480.3498
0.3 PB0.04640.05720.09480.12480.15440.20360.25640.32080.3912
RT0.04440.04780.07380.09600.12540.16040.21120.27000.3346
0.5 PB0.04880.06680.11080.13760.18000.22880.28360.34960.4120
RT0.04500.04740.07060.08620.11040.14320.18420.23520.2960
Table 3. Estimated p-values for the PB and RT tests.
Table 3. Estimated p-values for the PB and RT tests.
Methods θ *
( 0.06 ,   0.25 ) T ( 0.09 ,   0.24 ) T   ( 0.11 ,   0.11 ) T
PB0.04940.25300.6050
RT0.07250.23110.5593
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yue, L.; Shi, J.; Luo, J.; Lin, J. A Parametric Bootstrap Approach for a One-Way Error Component Regression Model with Measurement Errors. Mathematics 2023, 11, 4165. https://doi.org/10.3390/math11194165

AMA Style

Yue L, Shi J, Luo J, Lin J. A Parametric Bootstrap Approach for a One-Way Error Component Regression Model with Measurement Errors. Mathematics. 2023; 11(19):4165. https://doi.org/10.3390/math11194165

Chicago/Turabian Style

Yue, Lili, Jianhong Shi, Jingxuan Luo, and Jinguan Lin. 2023. "A Parametric Bootstrap Approach for a One-Way Error Component Regression Model with Measurement Errors" Mathematics 11, no. 19: 4165. https://doi.org/10.3390/math11194165

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop