Abstract
In this paper, a one-way error component regression model with measurement errors is considered. The unknown parameter vector is estimated by using the bias-corrected method, and its corresponding asymptotic properties are also developed. For the hypothesis testing problem of the vector of the coefficient parameter in the model, a parametric bootstrap (PB) method is proposed. Under various sample sizes and parameter configurations, the effectiveness of our proposed PB test method is discussed by using some numerical simulations and a real data analysis.
Keywords:
parametric bootstrap; one-way error component regression model; measurement errors; hypothesis test MSC:
62F03; 62F40; 62J05
1. Introduction
In this paper, we will consider the one-way error component regression model of Baltagi [1] and assume that the true covariates cannot be exactly or directly observed, that is, the model has measurement errors, which take the following form
where denotes the j-th time response variable of the i-th subject; is the unknown vector of the coefficient parameter; is the true fixed covariate; is the observed surrogate variable; and the individual effect , random error and measurement error are independent. Assume that , and are independent and identically distributed (i.i.d.) as , and , respectively, is the vector of zeros and , , is known. The information of may be obtained by replicating and observing ; the details can be found in Lin and Carroll [2], Carroll et al. [3], Li et al. [4], Dong et al. [5] and Guo et al. [6]. If the measurement errors do not exist, model (1) has been discussed by refs. [7,8], etc. When the individual effect and the time index of do not exist, model (1) reduces to the classical linear errors-in-variables (EV) model, which has been studied by refs. [9,10]. For the linear mixed effects EV model, Cui et al. [11] developed some moment estimation procedures, Xiao et al. [12] provided a GMM estimation approach and Yue et al. [13,14] proposed bias-corrected least square estimators for an unbalanced and balanced panel data model with measurement errors, respectively.
In practice, the hypothesis testing problem for the vector of the coefficient parameter
is interested, where is an arbitrarily specified vector. For the nonhomogenous linear hypothesis testing problems (), Yue et al. [14] provided an adjusted testing method for model (1) by considering the discrepancies of the corrected residual sums of squares between the null and alternative hypotheses. When the measurement errors do not exist in model (1) (i.e., is observed directly), Esmaeli-Ayan et al. [15] and Yue et al. [16] proposed a parametric bootstrap approach. For the hypothesis testing problem (2) in the linear EV model, Huwang et al. [17] developed a uniformly robust (RT) test method. However, when the sample size is small, the Type I error rates of the RT test in Huwang et al. [17] seem to be conservative. To solve this problem, many researchers suggested adopting the PB test to control the Type I error rates, such as Efron and Tibshirani [18], Krishnamoorthy et al. [19], Xu et al. [20], Xu [21], Ye et al. [22], Sun and Fisher [23], among others.
In this paper, we consider the hypothesis testing problem (2) of model (1) and develop a parametric bootstrap (PB) approach. The main contributions of this paper are three-fold. First, the biased-corrected estimator of is obtained for a one-way error component regression model with measurement errors in (1), and its asymptotic properties are also established. Second, a PB test is developed by using the obtained estimator of . Then, we provide the corresponding algorithm for computing the p-value of the proposed PB test. Third, our proposed PB test can control the Type I error rates well, which can be found in finite sample numerical simulations.
The organization of this paper is listed as follows. In Section 2, a parametric bootstrap (PB) approach for the hypothesis testing problem (2) of model (1) is developed, and the corresponding algorithm is provided. In Section 3, the effectiveness of the proposed PB test is assessed by some simulation studies, and the numerical results indicate that the proposed PB test performs better than the uniformly robust (RT) test. In Section 4, we apply the PB test to analyse Grunfeld data. In Section 5, we make some summaries and concluding remarks for this paper. In Appendix A, we give the corresponding proofs of Theorems and needed lemmas.
Notations: For a matrix , let denote the transpose for and denote that is positive semi-definite. Let be the Euclidean norm and “” and “” be the convergence in distribution and probability, respectively.
2. Methodology
2.1. Parameter Estimator and Its Theoretical Properties
For each term in model (1), we obtain its means for all the time of each subject i. Combining this with model (1), we have
where , , , and . Here and after, the following notations are used, , and , and let
where is assumed to be nonsingular, and is a nonsingular matrix. Then, we have
where is a known covariance matrix of (since is known). From ref. [9], we obtain a biased-corrected estimator of as follows,
Denote , then
That is, .
Proposition 1.
Let and . Together with (4), we have
and the covariance matrix of is
where and is positive definite.
Theorem 1.
Remark 1.
Theorem 1 shows the asymptotic normality of . Based on this result and assuming the null hypothesis of (2) holds, we have
uniformly over the parameter space ; is a subspace of Ω, with being fixed at .
Theorem 2.
Remark 2.
Theorem 2 gives a tool to assess whether the null hypothesis in (2) holds; the test method is viewed as the existing uniformly robust (RT) test method of Huwang et al. [17]. In this method, is a “pseudo” estimator of because the parameter therein is not estimated.
2.2. The Parametric Bootstrap Approach
There are unknown parameters in and . In such cases, the test statistic can be defined as
where
Unfortunately, the distribution of T in (10) is not known. To solve this problem, a PB test method can be developed similar to that in Yue et al. [16], Xu et al. [20] and Xu [21], where we can generate samples from estimated models by replacing the unknown parameter with its estimator.
Note that, when the null hypothesis holds, we have . The PB pivot variable can be developed as follows, , and is the observed value of . Then,
According to (10), the PB pivot variable can be constructed as
where the PB test rejects in (2) whenever
where is an observed T-value of (10), the significance level is always given. The probability in (14) can be obtained using the following Algorithm.
3. Simulation Studies
In this section, the performance of our proposed PB test is studied using the Type I error rates and powers. The Type I error rates of the PB test are evaluated with following steps. Firstly, we evaluate the p-values by using Algorithm 1 based on generated samples and , the above step is repeated M times and the M p-values are kept; then, the Type I error rate can be obtained by computing the ratio of M p-values less than . Here, M and r are taken to be 2500 and 5000, respectively. The RT test method is evaluated by (8) in Section 2, which is similar to that in ref. [17]. Each test method is carried out at a given significance level .
| Algorithm 1 Given samples and , |
Step 1. Obtain estimations and , then obtain the T-value in (10) and denote it ; Step 3. Using (13) to obtain the value of PB pivot variable . Let if , if ; Step 4. Repeat r times for Steps 2–3, keep the r values of R and marked as ; Step 5. Using (14), the p-value can be evaluated as . |
In the simulation, the data are generated from the model
where the sample size and ; ; ; ; ; ; ; ; ; and . Six different variances of individual effect, and , are chosen to compare the estimated Type I error rates of our proposed PB and the existing RT tests. To study the effect of measurement errors on the performance of the PB test method, four different standard deviations of measurement error are adopted as 0.1, , and in our simulation studies.
In Table 1, we calculate the Type I error rates of the proposed PB and existing RT tests for different sample sizes n, variances in individual effect and standard deviations of measurement error .
Table 1.
Evaluated Type I error rates of PB and RT test methods.
Table 1 shows that:
- (1)
- The Type I error rate of our proposed PB test is very close to , but the existing RT test is unstable. For the smaller sample size , the RT test seems to be very conservative, but the PB test can control Type I error rates within the significance level very well.
- (2)
- The magnitude of variance in individual effect has little effect on the performance of each test method (PB and RT).
- (3)
- When the sample sizes n and are fixed, the Type I error rates of both tests are robust to the variation in the standard deviations of measurement error.
In Table 2, we provide the estimated powers for hypothesis testing problem versus of the PB and RT tests under moderate and large sample sizes, where and . To evaluate the powers of each test method, nine different values of are chosen, which is similar to that in Huwang et al. [17]. The null hypothesis holds when . In the simulations of Table 2, the variance in individual effect , and the standard deviation of measurement error is chosen as 0.1, 0.2, 0.3 and 0.5.
Table 2.
Simulated powers of the test methods when .
From Table 2, we can see that
- (1)
- When the sample is large (), both the PB and RT tests can control the Type I error rates.
- (2)
- When the sample sizes n and are fixed, the power increases as the d-value increases, as expected. In most scenarios, the powers of our proposed PB test are slightly higher than those of the RT test.
- (3)
- For the same standard deviation of measurement error , the estimated power increases as the sample size increases.
Generally speaking, the proposed PB test is feasible.
4. A Real Data Analysis
In this section, our proposed PB method is applied to analyze Grunfeld data. The dataset was available in the R package “plm” and studied in Baltagi [1] and Grunfeld [24], and was a balanced panel dataset of 10 large US manufacturing firm observations from 1935 to 1954. For this dataset, we considered the following one-way error component regression model,
where , , is the j-year gross investment of firm i, denotes firm value, is the capital stock of plant and equipment, represents the individual firm effect, is the random error and we assume the observations of firm and capital stock cannot be observed accurately. That is, has additive errors () for , . In the absence of validation or replication data, we conducted a sensitivity analysis by taking and , which is similar to that in Lin and Carroll [2], Yue et al. [13] and Yang et al. [25]. The interested hypothesis testing problem is vs. , where . To compare the performance of the proposed PB and existing RT tests, we chose three different values of , and evaluate the p-values of the PB and RT tests in Table 3. In addition, the approximate 95% confidence intervals for and were also calculated as and , respectively.
Table 3.
Estimated p-values for the PB and RT tests.
From Table 3, we can see that the PB test rejects the null hypothesis when and , whereas the RT is in favor of the null hypothesis . According to the 95% confidence intervals for and , we should reject the null hypothesis . That is, the PB test can make a right choice, but the RT test cannot. When and , both the PB and RT tests are in favor of the null hypothesis, which is consistent with the results from the 95% confidence intervals for and . Therefore, the proposed PB test is more informative.
5. Conclusions
In this paper, we consider a one-way error component regression model with measurement errors. When we ignore the measurement errors and simply adopt the observed surrogate variables to obtain the estimation of an unknown parameter vector, an inefficient testing method may be developed. To solve this problem, we combine the biased-corrected estimator of the unknown parameter and propose a parametric bootstrap (PB) method for the hypothesis testing problem of the unknown vector of the coefficient parameter. Through some simulation studies and a real data analysis, we find the proposed PB test can control the Type I error rates within the significance level very well, but the existing uniformly robust (RT) test is conservative when the sample size is small. When the sample size is larger, both test methods have similar performance. In a word, the proposed PB test is feasible.
Author Contributions
Methodology, L.Y., J.S. and J.L. (Jinguan Lin); Software, J.L. (Jingxuan Luo); Formal analysis, J.S.; Investigation, J.L. (Jingxuan Luo); Data curation, L.Y.; Writing – original draft, L.Y.; Writing—review & editing, L.Y. and J.L. (Jinguan Lin); Visualization, J.S. and J.L. (Jinguan Lin). All authors have read and agreed to the published version of the manuscript.
Funding
This research was supported by the National Natural Science Foundation of China (Nos. 12001277, 12071267, 12371267, 11971235, 11831008) and the National Social Science Foundation of China (No. 21BTJ030).
Data Availability Statement
Not applicable.
Acknowledgments
The authors are grateful to the editor, the associate editor and four reviewers for the constructive comments and suggestions that led to significant improvement of the original manuscript.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Theoretical Proofs
This section contains the proofs of Proposition 1, Theorems 1 and 2 in the previous section, as well as some Lemmas that are needed to prove the main Theorems.
Proof of Proposition 1.
Lemma A1.
The absolute value of each element in the vector is bounded above by .
Proof.
Lemma A2.
The absolute value of each element in the matrix is bounded above by , where is defined in (A3).
Proof.
Letting be any eigenvalue of and be the corresponding eigenvector, we have
Denote . Then,
Premultiplying (A7) by , holds, that is,
Note that is positive definite; hence, and hold.
Since is a symmetric matrix, there exists an orthogonal matrix such that , where is the diagonal matrix and its elements are the eigenvalues of . Let be the -th element of , be the i-th unit vector in . Then, . According the above facts, we have
Because is nonnegative definite, we have
then . □
Proof of Theorem 1.
According to (A3) and the definitions of and in Section 2, we have
where and . We rewrite (A8) as
The expectation and variance in for term are and 1, respectively. Hence, follows a noncentral distribution with one degree of freedom and noncentrality parameter . According to the exception of the moment generating function of noncentral distribution, we have
Plugging (A10) in (A9), we have
By the Taylor expansion of in (A11), we have
where and .
Proof of Theorem 2.
To prove (8), it is sufficient to show that
uniformly over the parameter space . According to Theorem 1, it is straightforward to prove (A14) if we can show that
that is,
uniformly over the parameter space . Because , we have , where does not rely on any parameters. Consequently, by the definition of and in (A4) and (9), we can show that
Similar to the proofs of Lemma 3 and Lemma 4 in [17], we obtain the following facts: (1) the absolute value of every element in the matrix is less than , where is the number of zero elements of ; (2) the matrix converges to a zero matrix in probability uniformly over the parameter space . Then, the theorem follows immediately. □
References
- Baltagi, B.H. Econometric Analysis of Panel Data; John Wiley & Sons: New York, NY, USA, 2008. [Google Scholar]
- Lin, X.H.; Carroll, R.J. Nonparametric function estimation for clustered data when the predictor is measured without/with Error. J. Am. Stat. Assoc. 2000, 95, 520–534. [Google Scholar] [CrossRef]
- Carroll, R.J.; Ruppert, D.; Stefanski, L.A.; Crainiceanu, C.M. Measurement Error in Nonlinear Models: A Modern Perspective; Chapman & Hall/CRC: London, UK, 2006. [Google Scholar]
- Li, G.R.; Zhang, J.; Feng, S.Y. Modern Measurement Error Models; Science Press: Beijing, China, 2016. [Google Scholar]
- Dong, H.; Otsu, T.; Taylor, L. Estimation of varying coefficient models with measurement error. J. Econom. 2022, 230, 388–415. [Google Scholar] [CrossRef]
- Guo, L.R.; Song, W.X.; Shi, J.H. Estimating multivariate density and its derivatives for mixed measurement error data. J. Multivar. Anal. 2022, 191, 105005. [Google Scholar] [CrossRef]
- Montes-Rojas, G.; Sosa-Escudero, W. Robust tests for heteroskedasticity in the one-way error components model. J. Econom. 2011, 160, 300–310. [Google Scholar] [CrossRef]
- Zhao, H.B. Exact tests in panel data using generalized p-values. Commun. Stat.-Theory Methods 2008, 37, 18–36. [Google Scholar] [CrossRef]
- Fuller, W.A. Measurement Error Models; John Wiley & Sons: New York, NY, USA, 1987. [Google Scholar]
- Shi, J.H.; Zhang, Y.J.; Yu, P.; Song, W.X. SIMEX estimation in parametric modal regression with measurement error. Comput. Stat. Data Anal. 2021, 157, 107158. [Google Scholar] [CrossRef]
- Cui, H.J.; Ng, K.W.; Zhu, L.X. Estimation in mixed effects model with errors in variables. J. Multivar. Anal. 2004, 91, 53–73. [Google Scholar] [CrossRef]
- Xiao, Z.G.; Shao, J.; Palta, M. Instrumental variable and GMM estimation for panel data with measurement error. Stat. Sin. 2010, 20, 1725–1747. [Google Scholar]
- Yue, L.L.; Li, G.R.; Zhang, J.H. Statistical inference for the unbalanced two-way error component regression model with errors-in-variables. J. Korean Stat. Soc. 2017, 46, 593–607. [Google Scholar] [CrossRef]
- Yue, L.L.; Shi, J.H.; Li, G.R. Statistical inference for the two-way error component regression model with errors-in-variables. Sci. China Math. 2017, 47, 1077–1088. (In Chinese) [Google Scholar] [CrossRef]
- Esmaeli-Ayan, A.; Malekzadeh, A.; Hormozinejad, F. Inferences on the regression coefficients in panel data models: Parametric bootstrap approach. Math. Sci. 2020, 14, 67–73. [Google Scholar] [CrossRef]
- Yue, L.L.; Shi, J.H.; Song, W.X. A parametric bootstrap approach for two-way error component regression models. Commun. Stat.-Simul. Comput. 2017, 46, 3952–3961. [Google Scholar] [CrossRef]
- Huwang, L.C.; Huang, Y.H.S.; Wang, Y.H.T. Uniformly robust tests in errors-in-variables models. Ann. Inst. Stat. Math. 2009, 61, 789–810. [Google Scholar] [CrossRef]
- Efron, B.; Tibshirani, R.J. An Introduction to the Bootstrap; Chapman & Hall/CRC: London, UK, 1993. [Google Scholar]
- Krishnamoorthy, K.; Lu, F.; Mathew, T. A parametric bootstrap approach for ANOVA with unequal variances: Fixed and random models. Comput. Stat. Data Anal. 2007, 51, 5731–5742. [Google Scholar] [CrossRef]
- Xu, L.W.; Yang, F.Q.; Abula, A.; Qin, S. A parametric bootstrap approach for two-way ANOVA in presence of possible interactions with unequal variances. J. Multivar. Anal. 2013, 115, 172–180. [Google Scholar] [CrossRef]
- Xu, L.W. Parametric bootstrap inferences for the growth curve models with intraclass correlation structure. Commun. Stat.-Theory Methods 2017, 46, 3308–3320. [Google Scholar] [CrossRef]
- Ye, R.D.; Xu, L.J.; Luo, K.; Jiang, L. A parametric bootstrap approach for one-way classification model with skew-normal random effects. Appl. Math. J. Chin. Univ. 2019, 34, 423–435. [Google Scholar] [CrossRef]
- Sun, Z.Q.; Fisher, T.J. Testing for correlation between two time series using a parametric bootstrap. J. Appl. Stat. 2020, 48, 2042–2063. [Google Scholar] [CrossRef] [PubMed]
- Grunfeld, Y. The Determinants of Corporate Investment. Ph.D. Dissertation, University of Chicago, Chicago, IL, USA, 1958. [Google Scholar]
- Yang, Y.P.; Li, G.R.; Peng, H. Empirical likelihood of varying coefficient errors-in-variables models with longitudinal data. J. Multivar. Anal. 2014, 127, 1–18. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).