Next Article in Journal
A Joint Specification Test for Response Probabilities in Unordered Multinomial Choice Models
Next Article in Special Issue
Counterfactual Distributions in Bivariate Models—A Conditional Quantile Approach
Previous Article in Journal / Special Issue
A New Family of Consistent and Asymptotically-Normal Estimators for the Extremal Index
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On Bootstrap Inference for Quantile Regression Panel Data: A Monte Carlo Study

by
Antonio F. Galvao
1 and
Gabriel Montes-Rojas
2,3,*
1
Department of Economics, University of Iowa, W284 PBB, 21 E. Market Street, Iowa City, IA 52242, USA
2
CONICET-Universidad de San Andrés, Vito Dumas 284, Victoria, B1644BID, Pcia. de Bs. As., Argentina
3
Department of Economics, City University London, Northampton Square, London EC1V 0HB, UK
*
Author to whom correspondence should be addressed.
Econometrics 2015, 3(3), 654-666; https://doi.org/10.3390/econometrics3030654
Submission received: 26 June 2015 / Revised: 10 August 2015 / Accepted: 19 August 2015 / Published: 10 September 2015
(This article belongs to the Special Issue Quantile Methods)

Abstract

:
This paper evaluates bootstrap inference methods for quantile regression panel data models. We propose to construct confidence intervals for the parameters of interest using percentile bootstrap with pairwise resampling. We study three different bootstrapping procedures. First, the bootstrap samples are constructed by resampling only from cross-sectional units with replacement. Second, the temporal resampling is performed from the time series. Finally, a more general resampling scheme, which considers sampling from both the cross-sectional and temporal dimensions, is introduced. The bootstrap algorithms are computationally attractive and easy to use in practice. We evaluate the performance of the bootstrap confidence interval by means of Monte Carlo simulations. The results show that the bootstrap methods have good finite sample performance for both location and location-scale models.
JEL classifications:
C13; C21; C23

1. Introduction

Quantile regression (QR) for panel data has attracted interest in both theoretical and applied literature. It allows researchers to explore a range of conditional quantile functions, thereby exposing a variety of forms of conditional heterogeneity, and to control for unobserved individual characteristics. Controlling for individual-specific heterogeneity via fixed-effects (FE), while exploring heterogeneous covariate effects within the quantile regression framework, offers a more flexible approach to the analysis of panel data than that afforded by the classical Gaussian fixed and random effects estimators. The authors in [1,2] develop the asymptotic properties of these estimators (for other recent developments, see, e.g., [3,4,5]). In particular, the asymptotic variance of QR estimators depends on the density of the innovation term, and it is not easy to compute in practice. Thus, we argue that inference procedures and confidence interval construction can be greatly simplified by using bootstrap methods, and this paper evaluates bootstrapping procedures for panel quantile regression estimators with FE.
Bootstrapping techniques have been used to construct confidence intervals for QR in the cross-sectional context extensively. Buchinsky [6] uses Monte Carlo simulation to study several estimation procedures of the asymptotic covariance matrix in quantile regression models, and the results favor the bootstrap design. Hahn [7] shows that the construction of confidence intervals based for the QR estimators can be greatly simplified by using bootstrapping. Moreover, the confidence intervals constructed by the bootstrap percentile method have asymptotically correct coverage probabilities. Horowitz [8] proposes bootstrap methods for median regression models. Feng, He, and Hu [9] proposes an adaptation wild bootstrap methods for QR. Wang and He [10] develops inference procedures based on rank-score tests with random effects. In the panel data FE context, [11,12] use bootstrapping for constructing confidence intervals in the QR panel data. However, they do not provide evidence that this procedure is valid, nor do they provide an explicit methodology to implement the bootstrapping for panel data QR. This paper fills this gap.
Inference in panel datasets using least squares methods has been mainly used in asymptotic approximations for the construction of test statistics and the variances of the estimators. The use of bootstrap as an alternative to such asymptotic approximations has been considered, but its properties have not received the same amount of attention as in the cross-section or time series literature. The consideration of the bootstrap for panel data has focused on resampling in the time dimension, extending the work on the bootstrap in time series. Resampling in the cross-sectional dimension has received attention, as well. However, the literature on the combination of the two resampling schemes in panel data is very limited. Kapetanios [13] discusses bootstrapping for panel data when resampling occurs in both cross-sectional and time series dimensions. A treatment of resampling methods when N is large, but T is assumed small and fixed can be found in [14].
In this paper, we suggest inference procedures to construct confidence intervals for the parameters of interest in QR FE panel data models based on the application of bootstrap percentiles. This alternative approach circumvents the problem of estimating the sparsity function and the asymptotic covariance matrix. We apply the pairwise bootstrap resampling technique, since bootstrapping pairs is, in general, less sensitive to certain regularity conditions than bootstrapping residuals. Following [13], we study three different possibilities of sampling schemes for panel data QR: cross-sectional, temporal and cross-sectional and temporal. In the first case, we construct the bootstrap samples by resampling only from cross-sectional units with replacement. In the second case, the temporal resampling is only from the time series dimension, without changing the cross-sectional dimension. In the final case, a more general resampling scheme is proposed, which considers sampling from both cross-sectional and temporal dimensions.
A Monte Carlo study is conducted to study and evaluate the finite sample properties of the proposed inference procedure. We use the FE QR estimator. We evaluate the bootstrapping procedure using two different underlying models: (i) random-effects (where the covariates are not related to the individual effects); and (ii) fixed-effects (where covariates are correlated with the individual effects) models. We compute empirical rejection rates of the confidence interval for all different bootstrapping procedures. The results show that for the cross-sectional bootstrapping, the confidence intervals’ empirical rejection rates are close to the nominal size, and more importantly, as expected, the results indicate that the empirical sizes approximate the nominal size as the sample size increases. Overall, the results show good performance for large N, with marginal improvements as T increases. In addition, similar results are obtained for the random- and fixed-effects models.
The rest of the paper is organized as follows. Section 2 briefly presents the fixed effects quantile regression model. Section 3 describes the bootstrap inference procedures. Section 4 describes the Monte Carlo experiment and results. Conclusions appear in Section 5.

2. The Model and Estimator

2.1. The Model

This paper considers a QR model with individual FE. Thus, the τ-th conditional quantile function of the response y i t of the t-th observation on the i-th individual can be represented as
Q y i t ( τ | x i t , η i ) = η i ( τ ) + x i t β ( τ ) ,
where y i t is the response variable, η i denotes the individual FE, x i t is a p-vector of exogenous covariates and Q y i t ( τ | x i t , η i ) is the conditional τ-quantile of y i t given ( x i t , η i ) . In Model (1), both the effects of the covariates ( x i t ) and the individual-specific effects are allowed to depend on the quantile, τ, of interest. In general, each η i and β can depend on τ, but we assume τ to be fixed throughout the paper and suppress such a dependence for notational simplicity. The η’s are intended to capture some individual specific source of variability, or “unobserved heterogeneity,” that was not adequately controlled for by other covariates.
It is important to note that in the QR framework there is no transformation that can eliminate the FE; thus, we are required to deal with the full problem. This intrinsic difficulty has been recognized by [11], among others, and is clarified by Koenker and Hallock [15] (p.19) : “Quantiles of convolutions of random variables are rather intractable objects, and preliminary differencing strategies familiar from Gaussian models have sometimes unanticipated effects.” Therefore, following [2], we consider the FE estimation of β, which is implemented by treating each individual effect as a parameter to be estimated.1

2.2. Fixed-Effects Quantile Regression Estimator

We consider the standard QR FE estimation, which is implemented by treating each individual effect as a parameter to be estimated. The estimator is defined by
( η ^ , β ^ ) = arg min η , β i = 1 N t = 1 T ρ τ ( y i t z i t η x i t β ) ,
where z i t identifies the individual FE, η = ( η 1 , . . . , η N ) is the N × 1 vector of individual specific effects or intercepts, β is a vector of slope parameters and ρ τ ( u ) : = u ( τ I ( u < 0 ) ) as in [16].
The asymptotic properties of the ( η ^ , β ^ ) estimators are derived in [2]. They show consistency and asymptotic normality of the estimator under a condition on the sample size growth. This assumption is relatively stringent and basically requires the time dimension to grow quickly relative to the cross-section, i.e., N 2 / T 0 . However, a detailed discussion on such assumptions is outside the scope of this paper.2

3. Inference

3.1. The Bootstrap

The main concern of this paper is the application of bootstrapping procedures to the problem of constructing confidence intervals for the slope coefficients of the QR FE panel data model in (1), based on the point estimation methods described in the previous section.
The bootstrap resampling methods are designed to be used when the innovations of a regression are not identically distributed. The bootstrapping procedures are based on the so-called ( y , x ) -pairs bootstrap, which was originally proposed by [17]. This is a fully-nonparametric procedure that is applicable to a wide variety of models. Unlike resampling residuals, the pairs’ bootstrap is not limited to i i d errors in regression models. In the pairs’ bootstrap, instead of resampling the dependent variable, or residuals, possibly centered or rescaled, it bootstraps pairs consisting of an observation of the dependent variable along with the vector of explanatory variables for that same observation.
However, this bootstrap implicitly assumes that the pairs ( y i t , x i t ) are independent. Although this is still a restrictive assumption, ruling out any form of dependence among observations, it does allow for arbitrary forms of heteroskedasticity of y i t conditional on x i t . The objects resampled are i i d drawings from the joint distribution of y i t and x i t . Each bootstrap sample consists of some of the original pairs once, some of them more than once and some of them not at all. This procedure does not condition on x and does not assume that the innovation term is i i d .
As in [13], we discuss several possibilities for bootstrap resampling schemes that can be applied in panel datasets, such as the cross-sectional resampling, the temporal resampling and the cross-sectional and temporal resampling.
Cross-sectional resampling: The first scheme is the cross-sectional resampling, which consists of resampling Y and X with replacement from the cross-section dimension with probability 1 / N , maintaining intact the temporal structure for each individual i. Thus, in this case, let Y * = ( y i 1 , . . . , y i s , . . . , y i N ) , where each element of the vector of indices ( i 1 , . . . , i N ) is obtained by drawing with replacement from ( 1 , . . . , N ) , and each element is y i s = ( y i s 1 , . . . , y i s T ) for s ( i 1 , . . . , i N ) . The same vector of indices is used to obtain X * .
Temporal resampling: The second scheme consist of resampling both Y and X with replacement from the temporal dimension for each individual with probability 1 / T , maintaining fixed the individual structure. The implementation constructs, for each i ( 1 , . . . , N ) , y i * = ( y i t 1 , . . . , y i t r , . . . , y i t T ) , where each element of the vector of indices ( t 1 , . . . , t T ) is obtained by drawing with replacement from ( 1 , . . . , T ) . Then, construct Y * = ( y 1 * , . . . , y i * , . . . y N * ) . The same vector of indices is used to obtain X * .
Cross-sectional and temporal resampling: The third scheme involves both cross-sectional and temporal resampling. Here, it first resamples from the cross-sectional dimension with probability 1 / N (maintaining intact the temporal structure for each individual), and second, it resamples for each constructed cross-sectional unit across the time dimension with probability 1 / T .

3.2. Practical Implementation

For a given τ-quantile of interest, the bootstrapped panel data QR FE estimator is given by
( η ^ * , β ^ * ) = arg min η , β i = 1 N t = 1 T ρ τ ( y i t * z i t * η x i t * β ) ,
where ( y i t * , x i t * ) is the pairwise bootstrap resampled data and z i t * identifies the FE.
Implementation of the bootstrap in practice is simple. The steps for implementing the tests are as follows. Take B as a large integer. Given one of the three bootstraps discussed above, for each b = 1 , , B :
(i)
Obtain the resampled data { ( y i t b , z i t b , x i t b ) , i = 1 , , n t = 1 , , T } .
(ii)
Estimate β ^ b .
The limiting distribution of β ^ can be approximated by repeating these steps a large number of times, and the confidence intervals can be obtained by computing the quantiles of the implied empirical distribution using β ^ * . More specifically, let G ^ be the cumulative distribution function of a bootstrapped estimator. Let α be the level of the confidence interval. Then, one can compute the 1 α percentile confidence interval for each element in the coefficient vector β by the α / 2 and 1 α / 2 empirical percentiles of G ^ as:
[ β ^ α / 2 * , β ^ 1 α / 2 * ] = [ G ^ 1 ( α / 2 ) , G ^ 1 ( 1 α / 2 ) ] .

4. Monte Carlo

In this section, we conduct a small Monte Carlo experiment to study and evaluate the finite sample properties of the proposed inference procedure. We evaluate the bootstrapping procedures for inference with the standard QR FE estimator. We use two different underlying models to generate the data: (i) random-effects (where the covariates are not related to the individual effects); and (ii) fixed-effects (where covariates are correlated with the individual effects) models. We compute empirical rejection rates of the confidence interval for the three different bootstrapping procedures described above.

4.1. Design

We follow the Monte Carlo framework of [1,2]. We evaluate the finite sample properties of the proposed inference procedure by computing the empirical rejection rates. We consider the following data-generating process:
y i t = η i + x i t β + ( γ 1 + γ 2 x i t ) u i t i = 1 , . . . , N , t = 1 , . . . , T .
Two simple versions of our basic model (3) are considered in the simulation experiments. In the first, the covariate x i t exerts a pure location shift effect, and we set γ 1 = 1 and γ 2 = 0 . In the second, it has both location and scale shift effects, and we set γ 1 = 1 and γ 2 = 0 . 1 . Note that in the location shift model, β ( τ ) = β for all τ, while in the second location and scale shift model, β ( τ ) = β + γ 2 F u 1 ( τ ) , where F u 1 is the inverse distribution function, or quantile, of u.
We consider different data-generating processes (DGPs). First, we consider three different distributions for η i and u i t . In the first variant, both η i and u i t are standard Gaussian; in the second, they are both χ 3 2 3 ; in the third, they are both Cauchy.
The scalar covariate x is generated in two variants, with differences in each case for the location and location-scale shift models.
  • Random-effects model: x i t = θ i + ϵ i t , where both θ i and ϵ i t are standard Gaussian in the location shift model and χ 3 2 in the location-scale shift model3. This model has a covariate that is not correlated with the unobserved FE, and thus, the random-effects model is correct.
  • Fixed-effects model: x i t = 0 . 3 η i + ϵ i t , where ϵ i t is standard Gaussian in the location shift model and χ 3 2 in the location-scale shift model. In this model, x is correlated with the unobserved FE.
We consider panel sizes of N { 10 , 25 , 50 } and T { 5 , 10 , 50 } . Moreover, we consider bootstrap replications of size 500 and, for each experiment, compute all three types of bootstrap discussed in Section 3: cross-sectional (CS), temporal (TE) and cross-sectional and temporal (CT). We set the number of Monte Carlo simulations to 500. The performance of the confidence interval bootstrap estimator is evaluated by computing the empirical rejection rate. Let [ β ^ α / 2 * ( τ ) , β ^ 1 α / 2 * ( τ ) ] be the constructed ( 1 α ) × 100 % bootstrap confidence interval for β ( τ ) in a particular sample. Then, we compute the proportion of cases that β ( τ ) < β ^ α / 2 * ( τ ) or β ( τ ) > β ^ 1 α / 2 * ( τ ) over the 500 Monte Carlo simulations, which is called the rejection rate. If the bootstrapping procedure correctly simulates the distribution of β ^ ( τ ) , this proportion should be close to α. Finally, we consider α = 0 . 1 .

4.2. Results

Results for the case where the covariate x i t is not correlated with η are presented in Table 1 and Table 2. The results for which the covariate x i t is correlated with η are presented in Table 3 and Table 4.
Simulations results for the random-effects model (i.e., x not correlated with η) appear in Table 1 and Table 2 for the location and location-scale models, respectively. The best performance is observed for the CS bootstrapping procedure, where the confidence intervals coverage approaches 0.1 for sample sizes with N = 25 , 50 , although the coverage is too large for N = 10 . TE produce slight under-coverage, that is confidence intervals that cover less than 0.1 for all cases of N and T. Note that CT has a particularly low coverage, close to 0.01, 1/10 of the target coverage. Normal, chi-squared and Cauchy DGPs show similar trends regarding the effect of both N and T, and the worst performance is for the Cauchy model. As suggested by an anonymous referee, the fact that TE and CT has a worse performance than CS could be due to the fact that conditional on the sample, the covariance terms present in the original variance are missing the TE or CT bootstrap, and thus, they do not mimic some features of the data.
The location model has better coverage properties than the location-scale one for τ = 0 . 25 and τ = 0 . 75 , while they have similar properties for the median. The Cauchy model in which the density is very sparse at the corresponding first and third quartiles has a larger coverage than normal and chi-squared ones.
Note that the performance of the coverage for the estimated confidence intervals is not linear in T. That is, for a given N, increasing T does not monotonically increase or decrease coverage. In general, for a given N, a larger T results in a coverage closer to the nominal size of 0 . 1 .
Simulations results for the fixed-effects model (i.e., x correlated with η) appear in Table 3 and Table 4 for the location and location-scale models, respectively. Similar results to those of the previous tables are observed. That is, the coverage is correct for the CS procedure, but requires N = 25 , 50 , while both the TE and CT produce are smaller than the nominal coverage. Note, however, that the location-scale in Table 4 shows a better performance than that of Table 2. This suggests that using information from the covariates helps in reducing the effect of the unobserved individual effects (i.e., η).
Overall, the results confirm that, as expected, the empirical sizes approximate the nominal size as the sample size increases. In addition, the results show evidence that the CS procedure produces the closest coverage to the nominal size of 0.1, and this is achieved for large N (i.e., 25 ) improving with larger T (in a non-linear relationship).
Table 1. Monte Carlo simulations, data-generating process (DGP): location model, random-effects model.
Table 1. Monte Carlo simulations, data-generating process (DGP): location model, random-effects model.
Cross-SectionalTemporalCross-Sectional and Temporal
NT τ = 0 . 25 τ = 0 . 50 τ = 0 . 75 τ = 0 . 25 τ = 0 . 50 τ = 0 . 75 τ = 0 . 25 τ = 0 . 50 τ = 0 . 75
η i N ( 0 , 1 ) , u i t N ( 0 , 1 )
1050.1540.1560.1680.0740.0620.0640.0200.0160.030
10100.1600.1920.1720.0580.0460.0560.0120.0160.030
10500.1600.1720.1580.0740.0560.0760.0080.0100.014
2550.1440.1360.1280.0780.0620.0720.0200.0260.034
25100.1520.1280.1300.0700.0400.0560.0240.0200.018
25500.1260.1360.1160.0400.0540.0520.0120.0200.022
5050.1220.1080.1120.0720.0640.0760.0020.0060.010
50100.0880.0980.1120.0740.0560.0560.0100.0120.010
50500.1080.1000.1120.0540.0520.0600.0080.0080.008
η i χ 3 2 , u i t χ 3 2
1050.1620.1680.1660.0700.0620.0900.0260.0300.022
10100.1460.1520.1660.0720.0640.0760.0320.0160.014
10500.1700.1620.1600.0680.0540.0600.0180.0160.026
2550.1100.1260.1420.0660.0780.0760.0240.0220.024
25100.1300.1280.1280.0500.0380.0700.0140.0040.012
25500.1320.1340.1420.0660.0800.0940.0260.0180.008
5050.1300.1200.1020.0940.0700.0680.0120.0080.012
50100.1300.1240.0940.0520.0460.0780.0140.0140.012
50500.1080.0820.0900.0680.0660.0720.0060.0180.010
η i t 1 , u i t t 1
1050.1820.1540.1480.0700.0600.0660.0220.0220.022
10100.1540.1620.1500.0500.0520.0560.0140.0100.018
10500.1740.1560.1900.0760.0620.0580.0140.0320.028
2550.1260.1240.1220.0720.0600.0680.0120.0080.006
25100.1480.1220.1000.0380.0240.0440.0120.0100.010
25500.1360.1280.1160.0580.0740.0720.0200.0160.020
5050.1240.1020.1240.0740.0600.0780.0260.0240.044
50100.1180.1360.1320.0600.0440.0680.0240.0160.018
50500.1140.1100.1020.0740.0860.0780.0300.0160.024
Notes: Rejection rates based on 500 replications. Monte Carlo experiments using 500 simulations. Theoretical size: 10%. See the text for details.
Table 2. Monte Carlo simulations, DGP: location-scale model, random-effects model.
Table 2. Monte Carlo simulations, DGP: location-scale model, random-effects model.
Cross-SectionalTemporalCross-Sectional and Temporal
NT τ = 0 . 25 τ = 0 . 50 τ = 0 . 75 τ = 0 . 25 τ = 0 . 50 τ = 0 . 75 τ = 0 . 25 τ = 0 . 50 τ = 0 . 75
η i N ( 0 , 1 ) , u i t N ( 0 , 1 )
1050.1900.1780.1800.1300.0880.0760.0160.0160.032
10100.1580.1740.1580.0900.0740.0880.0200.0300.042
10500.1500.1360.1540.0960.0740.0720.0300.0180.018
2550.1480.1320.1400.1220.0800.1300.0540.0280.042
25100.1220.1160.1340.0940.0580.0880.0300.0120.022
25500.1180.1140.1220.0960.0820.1000.0160.0280.022
5050.1380.1500.1520.1360.0640.1540.0480.0200.066
50100.1200.1280.0920.0980.0520.0760.0240.0160.034
50500.1020.1280.1220.0820.0760.0900.0260.0160.034
η i χ 3 2 , u i t χ 3 2
1050.1780.1900.1860.1080.0880.1240.0400.0280.046
10100.1360.1520.1520.0600.0540.0980.0140.0120.018
10500.1960.1540.1600.0760.0600.0820.0360.0440.034
2550.1460.1380.1480.0780.0900.1480.0220.0180.068
25100.1180.1360.1280.0800.0620.0900.0200.0160.038
25500.1380.1300.1400.0940.0640.0860.0240.0180.024
5050.1360.1220.1520.0960.1000.2400.0340.0260.100
50100.1060.1100.1260.0940.0800.1000.0180.0120.050
50500.1280.0860.1200.0820.0640.0800.0200.0160.016
η i t 1 , u i t t 1
1050.2100.1780.1960.1260.0640.0840.0440.0220.040
10100.2100.1660.1740.0920.0720.1120.0440.0200.046
10500.1720.1600.1660.0740.0760.0820.0200.0200.018
2550.1580.1180.1440.1000.0640.1320.0440.0200.056
25100.1280.1240.1500.0880.0460.0960.0500.0260.044
25500.1260.1300.1480.0720.0760.0680.0160.0200.030
5050.1720.1460.1280.1480.0400.1360.0480.0100.056
50100.1160.1120.1300.1420.0600.1140.0200.0020.022
50500.1540.1020.1480.0860.0840.1020.0160.0180.024
Notes: Rejection rates based on 500 replications. Monte Carlo experiments using 500 simulations. Theoretical size: 10%. See the text for details.
Table 3. Monte Carlo simulations, DGP: location model, fixed-effects model.
Table 3. Monte Carlo simulations, DGP: location model, fixed-effects model.
Cross-SectionalTemporalCross-Sectional and Temporal
NT τ = 0 . 25 τ = 0 . 50 τ = 0 . 75 τ = 0 . 25 τ = 0 . 50 τ = 0 . 75 τ = 0 . 25 τ = 0 . 50 τ = 0 . 75
η i N ( 0 , 1 ) , u i t N ( 0 , 1 )
1050.1740.1560.1480.0820.0600.0680.0340.0260.030
10100.1520.1720.1520.0580.0320.0380.0220.0120.022
10500.1400.1480.1880.0760.0720.0820.0240.0180.012
2550.1100.1080.1220.0800.0660.0820.0160.0100.016
25100.1360.1360.1380.0600.0480.0760.0140.0140.024
25500.1160.1260.1120.0600.0700.0720.0160.0100.006
5050.1260.1120.1080.0940.0520.0720.0260.0220.020
50100.1060.1140.1120.0560.0380.0400.0100.0200.020
50500.1360.1320.1020.0900.0840.0720.0200.0200.010
η i χ 3 2 , u i t χ 3 2
1050.1880.1720.1580.0880.0720.0700.0080.0100.012
10100.1820.1600.1600.0640.0680.0760.0160.0220.026
10500.1560.1600.1280.0740.0720.0780.0140.0160.016
2550.1240.1220.1120.0980.0660.0620.0320.0320.030
25100.1560.1360.1260.0560.0440.0660.0220.0080.012
25500.1400.1400.1240.0580.0720.0620.0160.0120.016
5050.1000.1080.1460.0780.0580.0720.0160.0140.016
50100.0960.1100.1100.0440.0260.0500.0120.0120.012
50500.1220.1100.0880.0640.0640.0840.0180.0200.016
η i t 1 , u i t t 1
1050.1760.1660.1540.0900.0780.0960.0280.0180.016
10100.1300.1440.1760.0560.0520.0560.0140.0060.014
10500.1900.1660.1460.0720.0600.0620.0280.0140.018
2550.1280.0900.1100.0720.0420.0700.0160.0240.030
25100.1200.1180.1080.0520.0620.0620.0160.0120.010
25500.1360.1460.1300.0800.0840.0640.0200.0200.018
5050.1300.1340.1100.0720.0580.0680.0240.0220.028
50100.1320.1200.1040.0520.0440.0560.0200.0120.010
50500.1200.1180.1220.0600.0520.0700.0120.0140.020
Notes: Rejection rates based on 500 replications. Monte Carlo experiments using 500 simulations. Theoretical size: 10%. See the text for details.
Table 4. Monte Carlo simulations, DGP: location-scale model, fixed-effects model.
Table 4. Monte Carlo simulations, DGP: location-scale model, fixed-effects model.
Cross-SectionalTemporalCross-Sectional and Temporal
NT τ = 0 . 25 τ = 0 . 50 τ = 0 . 75 τ = 0 . 25 τ = 0 . 50 τ = 0 . 75 τ = 0 . 25 τ = 0 . 50 τ = 0 . 75
η i N ( 0 , 1 ) , u i t N ( 0 , 1 )
1050.1660.1500.1840.1020.0740.1220.0440.0360.032
10100.2060.1700.1640.1000.0660.0960.0200.0120.038
10500.1720.1500.1880.0800.0800.1040.0300.0180.016
2550.1300.1160.1440.1420.0740.1280.0300.0120.038
25100.1260.1180.1260.0940.0520.0860.0380.0200.044
25500.1520.1180.1340.0780.0700.0680.0100.0100.010
5050.1300.1500.1320.1620.0760.1460.0460.0220.048
50100.1320.1120.1340.1060.0760.1400.0300.0100.018
50500.1120.1100.1220.0780.0620.0760.0200.0140.026
η i χ 3 2 , u i t χ 3 2
1050.1840.1420.1720.0840.0840.1300.0280.0400.072
10100.1580.1620.1860.0720.0540.0980.0140.0200.050
10500.1880.1560.1800.0860.0860.0960.0260.0320.032
2550.1580.1380.1460.1120.0780.1900.0240.0320.072
25100.1300.1500.1120.0860.0520.1000.0200.0140.040
25500.1100.1040.1060.0700.0760.0720.0340.0160.012
5050.1360.1300.1660.1020.0840.2540.0320.0280.086
50100.1200.1320.1360.0780.0720.1500.0160.0080.036
50500.1220.1220.1160.0660.0860.0940.0200.0140.014
η i t 1 , u i t t 1
1050.2000.1640.1700.1080.0480.1300.0340.0140.024
10100.1500.1380.1840.0980.0580.1060.0340.0180.038
10500.1580.1480.1380.1040.0860.1060.0260.0220.024
2550.1400.1400.1460.1460.0820.1360.0500.0200.050
25100.1320.1360.1420.1100.0380.1020.0360.0160.026
25500.1000.1060.1420.1040.0660.0920.0340.0080.036
5050.1560.1040.1460.1920.0680.1780.0720.0140.064
50100.1180.1160.1200.1380.0520.1240.0440.0120.036
50500.1220.1480.1400.0980.0760.1080.0360.0200.008
Notes: Rejection rates based on 500 replications. Monte Carlo experiments using 500 simulations. Theoretical size: 10%. See the text for details.

5. Conclusions

The asymptotic variance-covariance matrix of the techniques developed in [1,2] for QR FE estimators cannot be easily obtained. They require estimation of the density of the innovations term. In order to overcome this difficulty, this paper explores bootstrap inference techniques for QR panel data. We propose to construct confidence intervals for the parameters of interest using percentile bootstrapping with a pairwise resampling technique. We evaluate the performance of the bootstrap confidence interval by means of Monte Carlo simulations. We assess the finite sample performance of the intervals by computing their empirical rejection rates. The results look promising and confirm that bootstrapping techniques can be used for inference in these models. In particular, we recommend using cross-section bootstrap with replacement from the cross-section dimension with probability 1 / N , maintaining intact the temporal structure for each individual i.

Acknowledgments

The authors would like to express their appreciation to the Editor, two anonymous referees, Anil Bera, Roger Koenker, Xuming He, Steve Portnoy and the participants in the seminars at the University of Illinois at Urbana-Champaign for helpful comments and discussions. All of the remaining errors are ours.

Author Contributions

The authors contributed equally to this work.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. R. Koenker. “Quantile Regression for Longitudinal Data.” J. Multivar. Anal. 91 (2004): 74–89. [Google Scholar] [CrossRef]
  2. K. Kato, A.F. Galvao, and G. Montes-Rojas. “Asymptotics for Panel Quantile Regression Models with Individual Effects.” J. Econom. 170 (2012): 76–91. [Google Scholar] [CrossRef]
  3. A.F. Galvao. “Quantile regression for dynamic panel data with fixed effects.” J. Econom. 164 (2011): 142–157. [Google Scholar] [CrossRef]
  4. A.F. Galvao, C. Lamarche, and L. Lima. “Estimation of Censored Quantile Regression for Panel Data with Fixed Effects.” J. Am. Stat. Assoc. 108 (2013): 1075–1089. [Google Scholar] [CrossRef]
  5. A.F. Galvao, and L. Wang. “Efficient Minimum Distance Estimator for Quantile Regression Fixed Effects Panel Data.” J. Multivar. Anal. 133 (2015): 1–26. [Google Scholar] [CrossRef]
  6. M. Buchinsky. “Estimating the Asymptotic Covariance Matrix for Quantile Regression Models: A Monte Carlo Study.” J. Econom. 68 (1995): 303–338. [Google Scholar] [CrossRef]
  7. J. Hahn. “Bootstrapping Quantile Regression Estimators.” Econom. Theory 11 (1995): 105–121. [Google Scholar] [CrossRef]
  8. J.L. Horowitz. “Bootstrap Methods for Median Regression Models.” Econometrica 66 (1998): 1327–1351. [Google Scholar] [CrossRef]
  9. X. Feng, X. He, and J. Hu. “Wild bootstrap for quantile regression.” Biometrika 98 (2011): 995–999. [Google Scholar] [CrossRef] [PubMed]
  10. H. Wang, and X. He. “Detecting Differential Expressions in GeneChip Microarray Studies.” J. Am. Stat. Assoc. 102 (2007): 104–112. [Google Scholar] [CrossRef]
  11. J. Abrevaya, and C.M. Dahl. “The Effects of Birth Inputs on Birthweight: Evidence From Quantile Estimation on Panel Data.” J. Bus. Econ. Stat. 26 (2008): 379–397. [Google Scholar] [CrossRef]
  12. C. Lamarche. “Penalized Quantile Regression Estimation for a Model with Endogenous Individual Effects.” J. Econom. 157 (2010): 396–408. [Google Scholar] [CrossRef]
  13. G. Kapetanios. “A Bootstrap Procedure for Panel Datasets with Many Cross-Sectional Units.” Econom. J. 11 (2008): 377–395. [Google Scholar] [CrossRef]
  14. A.C. Cameron, and P.K. Trivedi. Microeconometrics: Methods and Applications. Cambridge, MA, USA: Cambridge University Press, 2005. [Google Scholar]
  15. R. Koenker, and K. Hallock. “Quantile Regression: An Introduction.” University of Illinois at Urbana-Champaign, Champaign-Urbana, IL, USA, Unpublished work. 2000. [Google Scholar]
  16. R. Koenker, and G.W. Bassett. “Regression Quantiles.” Econometrica 46 (1978): 33–49. [Google Scholar] [CrossRef]
  17. D.A. Freedman. “Bootstrapping Regression Models.” Ann. Stat. 9 (1981): 1218–1228. [Google Scholar] [CrossRef]
  • 1An alternative model was developed by [1], where the individual effects are the same across quantiles. In most applications, the time series dimension T is relatively small compared to the number of individuals N. Therefore, it might be difficult to estimate a τ-dependent distributional individual effect. The restriction of the individual effects, η, to be independent of the specific quantile, τ, is implemented by estimating the model for several quantiles simultaneously.
  • 2The work in [1] introduced a general approach to estimate quantile regression models for panel data with FE that may be subject to shrinkage by l 1 regularization methods. It is well know that the optimal estimator for the random effects Gaussian model involves shrinking the individual effects toward a common value. When there is an intercept in the model, this common value can be taken to be the conditional central tendency of the response at a point determined by the centering of the other covariates. In the quantile regression model, this would be some corresponding conditional quantile of the response. Particularly, when N is large relative to T, shrinkage may be advantageous in controlling the variability introduced by the large number of estimated individual-specific parameters.
  • 3This is to avoid non-linearities in the linear quantile functions, which arise if x can take both positive and negative values.

Share and Cite

MDPI and ACS Style

Galvao, A.F.; Montes-Rojas, G. On Bootstrap Inference for Quantile Regression Panel Data: A Monte Carlo Study. Econometrics 2015, 3, 654-666. https://doi.org/10.3390/econometrics3030654

AMA Style

Galvao AF, Montes-Rojas G. On Bootstrap Inference for Quantile Regression Panel Data: A Monte Carlo Study. Econometrics. 2015; 3(3):654-666. https://doi.org/10.3390/econometrics3030654

Chicago/Turabian Style

Galvao, Antonio F., and Gabriel Montes-Rojas. 2015. "On Bootstrap Inference for Quantile Regression Panel Data: A Monte Carlo Study" Econometrics 3, no. 3: 654-666. https://doi.org/10.3390/econometrics3030654

Article Metrics

Back to TopTop