Composite Quantile Regression for Varying Coefficient Models with Response Data Missing at Random

Luo, Shuanghua; Zhang, Cheng-yi; Wang, Meihua

doi:10.3390/sym11091065

Open AccessArticle

Composite Quantile Regression for Varying Coefficient Models with Response Data Missing at Random

by

Shuanghua Luo

¹,

Cheng-yi Zhang

^2,*

and

Meihua Wang

³

¹

School of Science, Xi’an Polytechnic University, Xi’an 710048, China

²

School of Economics and Finance, Xi’an Jiaotong University, Xi’an 710061, China

³

School of Economics and Management, Xidian University, Xi’an 710071, China

^*

Author to whom correspondence should be addressed.

Symmetry 2019, 11(9), 1065; https://doi.org/10.3390/sym11091065

Submission received: 24 July 2019 / Revised: 17 August 2019 / Accepted: 19 August 2019 / Published: 21 August 2019

Download

Browse Figures

Versions Notes

Abstract

:

Composite quantile regression (CQR) estimation and inference are studied for varying coefficient models with response data missing at random. Three estimators including the weighted local linear CQR (WLLCQR) estimator, the nonparametric WLLCQR (NWLLCQR) estimator, and the imputed WLLCQR (IWLLCQR) estimator are proposed for unknown coefficient functions. Under some mild conditions, the proposed estimators are asymptotic normal. Simulation studies demonstrate that the unknown coefficient estimators with IWLLCQR are superior to the other two with WLLCQR and NWLLCQR. Moreover, bootstrap test procedures based on the IWLLCQR fittings is developed to test whether the coefficient functions are actually varying. Finally, a type of investigated real-life data is analyzed to illustrated the applications of the proposed method.

Keywords:

varying coefficient model; composite quantile regression; missing at random; inverse probability weighting; imputed method

1. Introduction

The varying coefficient model, proposed originally by Hastie and Tibshirani [1], is flexible and powerful to examine the dynamic changes of regression coefficients over some factors such as time and age and has gained much popularity during the past few decades (see [2,3,4,5,6]).

A classical varying coefficient model has the following structure:

Y = X^{T} β (U) + ε,

(1)

where

Y \in R

is a response variable,

X = {(X_{1}, \dots, X_{p})}^{T} \in R^{p}

is a covariate vector,

β (\cdot) = {(β_{1} (\cdot), \dots, β_{p} (\cdot))}^{T}

\in R^{p}

is an unknown coefficient vector function with a smoothing variable U, and

ε

is a random error independent of

(X, U)

.

Recently, some estimates of

β (\cdot)

for Model (1) with the least squares regression have attracted many researchers’ attention. Above all, Hastie and Tibshirani [1] considered

L_{2}

penalized least squares estimation and attained some good results. Following, Fan et al. [4] and Fan et al. [7] applied the least squares regression to propose a two-step local polynomial estimation procedure and a profile estimator for Model (1), respectively, and designed some suitable statistical inference procedures. However, there arises a dilemma that these estimation procedures of the least squares could be very sensitive to outliers [8]. In order to overcome this problem, quantile regression proposed by Koenker [9] can be thought of as an alternative because as a mean model, traditional least squares regression only gives the effects of the covariates at the center of the distribution, while quantile regression can not only directly estimate the ones at different quantiles, but also characterize the entire conditional distribution of a dependent variable of the regression [8]. Thus, this regression has a much better robust property when processing outlier observations.

Due to its significant theoretical advances, some scholars had integrated the quantile regression into the varying coefficient model. Kim [10] attained the quantile regression model with the varying coefficient. For the processing of time series data, Cai and Xu [11] developed nonparametric quantile estimations with dynamic smooth coefficient models. Later, Cai and Xiao [12] applied dynamic models with partially-varying coefficients to investigate semiparametric quantile regression and obtained some useful results. Tang [13] derived a robust quantile regression estimation using the spatial semiparametric partially-linear regression model with a varying coefficient. Unfortunately, a relative small efficiency may result by a single quantile regression procedure compared with the least squares regression. In order to overcome this drawback, it is very necessary to get a desirable efficient and stable estimator. In recent years, an oracle procedure of composite quantile regression (CQR) was proposed by Zou and Yuan [14] to select the significant variables, and some important theoretical and applied results were derived. So far, the CQR method is widely used in many situations. For example, some efficient estimators based on the CQR method were proposed by Kai et al. [15] and Guo et al. [16] for semi-parametric partially-linear models with a varying coefficient. In addition, a data-driven weighted CQR (WCQR) estimation was studied by Sun et al. [17] and Yang et al. [18] for linear models with a varying coefficient, respectively.

Although the QR has significant theoretical properties such that its literature on the complete data has been rapidly growing, people have paid scant attention to the incomplete data, i.e., the data samples containing missing values since this class of data may lead easily to substantially-distorted results. In fact, as is commonplace, missing data often appear in real life. There are various reasons such as failure on the part of investigators when gathering correct information, the unwillingness of some sampled units when supplying the desired information, loss of information caused by uncontrollable factors, and so forth, resulting in the data missing. In the early 1970s, the advances in computer technology such that many laborious numerical calculations were possible to perform spurred the literature on statistical analysis of real data containing missing values in applied work; see [19,20,21,22,23,24,25]. Despite a long history on missing data analysis, little work on QR has taken missing data into account. Recently, an iterative imputation procedure was developed by Wei et al. [26] in a linear QR model with non-i.i.d. error terms for the covariates with missing values. A smoothed empirical likelihood analysis was discussed by Lv and Li [27] for partially-linear quantile regression with missing response. An inverse probability weighting QR approach was proposed by Sherwood et al. [28] in the last few years for analyzing healthcare cost data with missing covariates at random. The QR for competing risk data was studied by Sun et al. [29] when the failure type was missing. An efficient QR analysis was discussed by Chen [30] with missing observations. Some imputation methods were proposed by Shu [31] for quantile estimation under data missing at random.

In this paper, a coherent inference framework based on CQR estimation and inference is explored for varying coefficient models with response data missing at random. The main contribution of this paper can be summarized as follows:

A composite quantile regression estimation (CQRE) method is proposed for the analysis of varying coefficient models with response data missing at random. This method has the following two advantages: (1) the CQRE method can effectively overcome not only the drawback of a relative small efficiency that may result from a single quantile regression procedure compared with the least-squares regression, but also the interference of non-normal error; hence, it improves its estimation efficiency significantly; (2) since different quantiles are used in the imputation instead of actually observed responses or means and the robustness of quantile regression is inherited, the CQRE method is less sensitive to outliers; thus, the CQRE method is more effective and robust than the single quantile regression method and the classical least squares method.
Three estimators including the weighted local linear CQR (WLLCQR) estimator, the nonparametric WLLCQR (NWLLCQR) estimator, and the imputed WLLCQR (IWLLCQR) estimator are proposed for an unknown coefficient function in the varying coefficient model to establish the asymptotic normality of these estimators under some mild conditions.

The rest of this paper is organized as follows. The CQR varying coefficient model will be introduced with missing response data in Section 2 to construct a class of estimators for an unknown coefficient function. Then, some theoretical results on the asymptotic property of the proposed estimators are proposed in Section 3. In Section 4, a bootstrap-based test procedure is developed to perform a simulation study in Section 5 that demonstrates the finite-sample performance of the proposed method. Following, an application to a real dataset illustrates the effectiveness of our approach in Section 6. In addition, some discussions and conclusion remarks are presented in Section 7 and Section 8, respectively. Finally, the proof of the main results is given in Appendix A.

2. Estimation Based on the CQR Varying Coefficient Model With Missing Response

In this section, the CQR varying coefficient model will be introduced with missing response data to construct a class of estimators for an unknown coefficient function. In particular, as the main estimate methods in this paper, three estimators including the weighted local linear CQR (WLLCQR) estimator, the nonparametric WLLCQR (NWLLCQR) estimator, and the imputed WLLCQR (IWLLCQR) estimator are constructed and emphasized.

Let

{(X_{i}, U_{i}, Y_{i}, δ_{i}) : i = 1, 2, \dots, n}

be a random sample coming from Model (1), such that:

Y_{i} = X_{i}^{T} β (U_{i}) + ε_{i}, i = 1, 2, \dots, n,

(2)

where all the

X_{i} \in R^{p}

and

U_{i} \in R

are always observed, and

β (\cdot) = {(β_{1} (\cdot), \dots, β_{j} (\cdot), \dots, β_{p} (\cdot))}^{T} \in R^{p}

is the coefficient vector function. Further,

δ_{i} = 0

if

Y_{i}

is missing and

δ_{i} = 1

otherwise. We assume that throughout this paper,

Y_{i}

is missing at random (MAR) for some i. This assumption indicates that

δ_{i}

and

Y_{i}

are conditionally independent given

X_{i}

and

U_{i}

, that is,

P (δ_{i} = 1 | X_{i}, U_{i}, Y_{i}) = P (δ = 1 | X_{i}, U_{i}) = p (X_{i}, U_{i}) = π (Z_{i}),

(3)

where

Z_{i} = {(X_{i}, U_{i})}^{T}

. Moreover, we also assume that across different quantile regression models, there is the same coefficient vector function

β (\cdot)

. Thus, we can express the conditional τ-quantile function of Y as:

Q_{τ} (X, U) = X^{T} β (U) + c_{τ},

where

c_{τ}

is the

τ

-quantile of

ε

. If

β_{j} (\cdot)

is differentiable, Taylor’s expansion yields that:

β_{j} (U) \approx β_{j} (u) + β_{j}^{^{'}} (u) (U - u) = a_{j} + b_{j} (U - u),

for

j = 1, 2, \dots, p

, where u is a fixed value of a random variable and U lies in a neighborhood of u. For the case of no missing response data, minimizing the following criterion:

\sum_{i = 1}^{n} ρ_{τ} (Y_{i} - c_{τ} - X_{i}^{T} [a + b (U_{i} - u)]) K_{h} (U_{i} - u),

(4)

we can attain the local linear quantile regression (LLQR) estimator of

β (u)

, where

ρ_{τ} (u) = u (τ - I_{(u < 0)})

is called the quantile loss function of τ-quantile regression,

a = {(a_{1}, \dots, a_{p})}^{T}, b = {(b_{1}, \dots, b_{p})}^{T}

, and

K_{h} (\cdot) = K_{h} (\cdot / h) / h

is a Gaussian kernel function with bandwidth h. In order to improve the quantile regression estimation efficiently, the local linear composite quantile regression (LLCQR) estimation is adopted from Guo et al. [16] for the varying coefficient models. Let q be the number of quantiles and

τ_{k} = k / (1 + q)

for

k = 1, \dots, q .

The loss function of the LLCQR estimation is defined as:

\sum_{k = 1}^{q} \sum_{i = 1}^{n} ρ_{τ_{k}} (Y_{i} - c_{k} - X_{i}^{T} [a + b (U_{i} - u)]) K_{h} (U_{i} - u),

(5)

where

c_{k}

is the

τ_{k}

-quantile of

ε

.

In what follows, this technique of LLCQR will be extended to handle the case of response data missing at random.

2.1. WLLCQR Estimation

The inverse probability weighting (IPW) version of local linear CQR estimation will be considered to handle missing responses data at random, that is the CC (complete-case) analysis will be adjusted by using the inverse of the selection probability as the weight. However, the nonparametric smoothing estimation of

π (\cdot)

will encounter the curse of dimensionality when the dimension of Z is high enough. Motivated by Wang [24], we use the inverse marginal probability weighted approach.

Let

P (δ = 1 | X_{i} = x, U_{i} = u) = P (δ = 1 | U_{i} = u) = Δ (u)

, i.e., the propensity score just depends on U. When the inverse marginal probability function

Δ (u)

is known, the WLLCQR estimator

\hat{β} (u)

of

β (u)

is defined as:

(\hat{c}, \hat{a}, \hat{b}) = arg m i n_{c, a, b} \sum_{k = 1}^{q} \sum_{i = 1}^{n} \frac{δ_{i}}{Δ (U_{i})} ρ_{τ_{k}} (Y_{i} - c_{k} - X_{i}^{T} [a + b (U_{i} - u)]) K_{h} (U_{i} - u),

(6)

where

c = {(c_{1}, \dots, c_{q})}^{T}

and

Δ (u) = P (δ = 1 | U_{i} = u)

. Here,

\hat{β} (u) = \hat{a}

is called the WLLCQR estimator of

β (u)

with

Δ (u)

.

2.2. Nonparametric WLLCQR Estimation

However, the inverse marginal probability function in practical situations is usually unknown, and thus, it needs to be estimated. We often employ nonparametric smoothing estimation approaches to estimate the unknown selection probability

Δ (\cdot)

. The Nadaraya–Watson estimation [32] is one of these nonparametric smoothing estimation approaches. We can define the Nadaraya–Watson estimator of

Δ (u)

as:

\hat{Δ} (u) = \frac{\sum_{i = 1}^{n} δ_{i} L_{h_{0}} (U_{i} - u)}{\sum_{i = 0}^{n} L_{h_{0}} (U_{i} - u)},

(7)

where

L_{h_{0}} (\cdot) = L_{h_{0}} (\cdot / h_{0}) / h_{0}

is a density kernel function and

h_{0}

is a bandwidth. Therefore, the NWLLCQR estimation procedure with

\hat{Δ} (u)

is formally defined as:

({\hat{c}}_{N}, {\hat{a}}_{N}, {\hat{b}}_{N}) = arg m i n_{c, a, b} \sum_{k = 1}^{q} \sum_{i = 1}^{n} \frac{δ_{i}}{\hat{Δ} (U_{i})} ρ_{τ_{k}} (Y_{i} - c_{k} - X_{i}^{T} [a + b (U_{i} - u)]) K_{h} (U_{i} - u),

(8)

where

{\hat{β}}_{N} (u) = {\hat{a}}_{N}

is called the NWLLCQR estimator of

β (u)

with

\hat{Δ} (u)

.

2.3. Imputed WLLCQR Estimation

Although both the WLLCQR estimator and NWLLCQR estimator can well estimate the inverse marginal probability function, the information contained in the data is not explored fully. Now, we use quantile regression imputation to resolve the issue by imputing

Y_{i}

by

X^{T} {\hat{β}}_{C} (U)

if

Y_{i}

is missing, where

{\hat{β}}_{C} (U) = \hat{a}

and

\hat{a}

is defined in:

(\hat{c}, \hat{a}, \hat{b}) = arg m i n_{c, a, b} \sum_{k = 1}^{q} \sum_{i = 1}^{n} δ_{i} ρ_{τ_{k}} (Y_{i} - c_{k} - X_{i}^{T} [a + b (U_{i} - u)]) K_{h} (U_{i} - u) .

(9)

Therefore, the imputed WLLCQR estimation procedure can be defined as:

\begin{matrix} ({\hat{c}}_{I}, {\hat{a}}_{I}, {\hat{b}}_{I}) = arg m i n_{c, a, b} \sum_{k = 1}^{q} \sum_{i = 1}^{n} ρ_{τ_{k}} (Y_{i}^{*} - c_{k} - X_{i}^{T} [a + b (U_{i} - u)]) K_{h} (U_{i} - u), \end{matrix}

(10)

where

Y_{i}^{*} = \frac{δ_{i}}{\hat{Δ} (U_{i})} Y_{i} + (1 - \frac{δ_{i}}{\hat{Δ} (U_{i})}) X_{i}^{T} {\hat{β}}_{C} (U_{i})

and

{\hat{β}}_{I} (u) = {\hat{a}}_{I}

is called the IWLLCQR estimator of

β (u)

.

Remark 1.

Since local results in interpolation vary greatly and are unstable, as a smoothing method, the kernel function is used in Equations (4)–(10) such that the interpolation results of these equations are much smoother and stabler.

3. Asymptotic Properties

In this section, the asymptotic distribution will be considered for the estimators proposed in Section 2 to establish some theoretical results of these estimators.

Let

f (\cdot)

and

f_{U} (\cdot)

be the density functions of

ε

and U, respectively. For simplicity, the following notations:

μ_{j} = \int μ^{j} K (u) d u

and

ν_{j} = \int μ^{j} K^{2} (u) d u

for

j = 0, 1, 2,

η_{i} = \sum_{k = 1}^{q} [I (ε_{i} \leq c_{k}) - τ_{k}]

for

i = 1, 2

,⋯, n, and

D_{u} = E (X_{i} X_{i}^{T} | U = u) \sum_{k = 1}^{q} f (c_{k})

will be used in this section.

Now, the following results are established.

Theorem 1.

Suppose that Conditions

C 1

–

C 7

in the Appendix hold. If

Δ (u)

is known, then:

\sqrt{n h} (\hat{β} (u) - β (z) - \frac{1}{2} h^{2} μ_{2} β^{″} (u)) \overset{d}{⟶} N (0, \frac{v_{0}}{f_{U} (u)} D_{u}^{- 1} Ω_{u} D_{u}^{- 1}),

where

\overset{d}{⟶}

represents the convergence in the distribution,

Ω_{u} = E {\frac{η_{i}^{2}}{Δ (U_{i})} X_{i} X_{i}^{T} | U = u}

.

Theorem 2.

Suppose

Δ (u) > 0

is a smoothing function of u, based on Conditions

C 1

–

C 7

in the Appendix holding. Then:

\sqrt{n h} ({\hat{β}}_{E} (u) - β (z) - \frac{1}{2} h^{2} μ_{2} β^{″} (u)) \overset{d}{⟶} N (0, \frac{v_{0}}{f_{U} (u)} D_{u}^{- 1} Ω_{u}^{*} D_{u}^{- 1}),

where

Ω_{u}^{*} = E {\frac{η_{i}^{2}}{Δ (U_{i})} X_{i} X_{i}^{T} | U_{i}} - E {\frac{1 - Δ (U_{i})}{Δ (U_{i})} E {[X_{i}^{T} η_{i} | U_{i}]}^{\otimes^{2}} | U_{i}}

.

Theorem 3.

Assuming

Δ (u) > 0

is a smoothing function of u, based on the Conditions

C 1

–

C 7

in the Appendix hold. Then:

\sqrt{n h} ({\hat{β}}_{I} (u) - β (z) - \frac{1}{2} h^{2} μ_{2} β^{″} (u)) \overset{d}{⟶} N (0, \frac{v_{0}}{f_{U} (u)} D_{u}^{- 1} Ω_{u}^{* *} D_{u}^{- 1}),

where:

Ω_{u k}^{* *} = E [η_{i}^{2} π (U_{i}) X_{i} X_{i}^{T} | U_{i} = u] - E \{\frac{(1 - π (U_{i})) (2 π (U_{i}) + 1)}{π (U_{i})} E {[X_{i} η_{i} | U_{i}]}^{\otimes^{2}}\} .

4. A Bootstrap-Based Goodness-of-Fit Test

In investigating the varying coefficient model, how to test whether unknown coefficient functions are actually varying is of importance. In this section, the testing problem is considered for Model (1) under response missing, and then, a goodness-of-fit test is proposed based on the difference between the weighted residual sums of the quantile (WRSQ) and the LLCQR fittings under both the null and alternative hypotheses.

The following testing problem:

H_{0} : β (u) = β versus H_{1} : β (u) \neq β,

(11)

for simplicity, is considered, where

β

is a constant vector. The model (1) becomes a classical linear model with missing responses under the null hypothesis. The WRSQ under

H_{0}

is defined as:

\begin{matrix} W R S Q_{0} = \sum_{k = 1}^{q} \sum_{i = 1}^{n} ρ_{τ_{k}} (Y_{i}^{*} - {\hat{c}}_{k} - X_{i}^{T} {\hat{β}}_{I}), \end{matrix}

(12)

where

{\hat{c}}_{1}

, ⋯,

{\hat{c}}_{q}

and

{\hat{β}}_{I}

are given by the following IWCQR estimation procedure:

({\hat{c}}_{1}, \dots, {\hat{c}}_{q}, {\hat{β}}_{I}) = arg min \sum_{k = 1}^{q} \sum_{i = 1}^{n} ρ_{τ_{k}} (Y_{i}^{*} - c_{k} - X_{i}^{T} β) .

Similarly, the WRSQ under

H_{1}

can be defined as:

\begin{matrix} W R S Q_{1} = \sum_{k = 1}^{q} \sum_{i = 1}^{n} ρ_{τ_{k}} (Y_{i}^{*} - {\hat{c}}_{I k} - X_{i}^{T} {\hat{β}}_{I} (U_{i})), \end{matrix}

(13)

where

c_{\hat{I} 1}, \dots, c_{\hat{I} q}

and

{\hat{β}}_{I} (u)

are given in (10). Then, the following test statistic is given as:

\begin{matrix} T_{n} = \frac{W R S Q_{0} - W R S Q_{1}}{W R S Q_{1}} = \frac{W R S Q_{0}}{W R S Q_{1}} - 1 . \end{matrix}

(14)

For a large value of

T_{n}

, the null hypothesis (11) is rejected. In what follows, based on the bootstrap method, we evaluate the p values of the test along the lines of Wong et al. [33] and Guo et al. [16]:

Step 1.: Assume the number of complete data is m. We get the IWLLCQR estimator ${\hat{β}}_{I} (U_{i})$ .
Step 2.: The bootstrap residuals $ε_{i}^{*}$ are generated from series ${{\hat{ε}}_{i} - \bar{\hat{ε}}}_{i = 1}^{n}$ , where:

${\hat{ε}}_{i} = Y_{i}^{*} - X_{i}^{T} \hat{β} (U_{i}), \bar{\hat{ε}} = \frac{1}{m} \sum_{i = 1}^{m} {\hat{ε}}_{i},$
Step 3.: Step 2 is repeated for M times, and then, series sets $E_{j} = {Y_{i, j}, X_{i}, U_{i}, δ_{i}}_{i = 1}^{n}$ are obtained for $j = 1, \dots, M$ . The bootstrap test statistic is calculated for each bootstrap sample $E_{j}$ , denoted by $T_{n, j}^{*}$ .
Step 4.: The p value is approximately estimated by $\hat{p} = \frac{S}{M}$ , where S is the cardinality of the set $S = {j | T_{n, j}^{*} \geq T_{n}, j = 1, \dots, M}$ .

5. Simulation Study

A simulation study was carried out to investigate the finite-sample properties of our proposed method by a comparison among the WLLCQR estimation method, the NWLLCQR estimation method, the IWLLCQR estimation method, the INWLLCQR (imputed not weighted LLCQR) estimation method, and the WLLCQR estimation method without data missing, defined in (5).

In numerical studies, generally, the kernel function

K (x)

is taken to be

K (x) = 0.75 (1 - x^{2}) I_{(| x | \leq 1)}

. Here, this function is still adopted. It follows that the cross-validation method is used to select the optimal bandwidths

h_{o p t}

. In the subsequent examples, let the composite level

q = 8 .

Example 1.

Consider the following model:

Y_{i} = sin (π U_{i}) X_{i} + ε_{i},

where

X_{i} = Z_{i 1} + Z_{i 2} + Z_{i 3}

,

U_{i} = Z_{i 1} + Z_{i 2}

,

Z_{i 1}, Z_{i 2}, Z_{i 3}

are independent,

Z_{i 1}

and

Z_{i 2}

follow a uniform distribution on

[- 1, 1]

,

Z_{i 3} \sim N (0, 1)

, and three error distributions of

ε_{i}

are considered including

ε_{i} \sim N (0, 1),

ε_{i} \sim t (3)

, and

ε_{i} \sim 0.8 N (0, 1) + 0.2 N (0, 3^{2})

.

An analysis of the fitting of five different estimators including WLLCQR

_{Δ}

, NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR is done by using the following three selection probability functions:

$C a s e 1 : Δ_{1} (u) = 0.9 + 0.2 u i f | u | \leq 0.5, a n d 0.95 o t h e r w i s e .$
$C a s e 2 : Δ_{2} (u) = 0.7 + 0.2 u i f | u | \leq 0.5, a n d 0.75 o t h e r w i s e .$
$C a s e 3 : Δ_{3} (u) = 0.5 + 0.2 u i f | u | \leq 0.5, a n d 0.55 o t h e r w i s e .$

The average missing rates of Y corresponding to these three selection probability functions are approximately 0.15, 0.36, and 0.45, respectively. For each of the three cases, we generated 500 Monte Carlo random samples of size 200. The performance of the estimators is illustrated via the MSE. The simulation results are given in Table 1.

From Table 1, we can make the following observations:

Under the same selection probability function

Δ (u)

and the same sample size n, the MSE of IWLLCQR

_{\hat{Δ}}

is only slightly smaller than the ones of WLLCQR

_{Δ}

and NWLLCQR

_{\hat{Δ}}

, respectively; the MSE of INWLLCQR is also slightly smaller than the ones of WLLCQR

_{Δ}

and NWLLCQR

_{\hat{Δ}}

, because much more information on missing data is considered in IWLLCQR

_{\hat{Δ}}

and INWLLCQR, while the MSE of INWLLCQR is slightly greater than the one of IWLLCQR

_{\hat{Δ}}

. Further the MSE of IWLLCQR

_{\hat{Δ}}

is only slightly greater than the one of WLLCQR; this further confirms that the IWLLCQR

_{\hat{Δ}}

method is a safe alternative to WLLCQR

_{Δ}

and NWLLCQR

_{\hat{Δ}}

.

Now, the simulated curves are plotted with the case of

ε_{i} \sim N (0, 1)

under different levels of missing rates. Here, the results are presented only when

n = 200

, while the ones for the case of

n = 100

are not given since these results were similar. Figure 1, Figure 2 and Figure 3 summarize the finite sample performance of the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR methods for

β (u)

under different levels of missing rates. The red dashed curve, the blue dashed curve, the blue dotted curve, and the green dashed-dotted curve represent the results obtained by the NWLLCQR

_{\hat{Δ}}

method, the IWLLCQR

_{\hat{Δ}}

method, the INWLLCQR method, and the WLLCQR method, respectively. In addition, the red solid curve denotes the real curve of

β (u)

. From Figure 1, Figure 2 and Figure 3, we can see that:

(1) The simulation results based on the IWLLCQR

_{\hat{Δ}}

method were similar to those based on the NWLLCQR

_{\hat{Δ}}

method, the INWLLCQR

_{\hat{Δ}}

method, and the WLLCQR method under a lower level of missing rate. However, the IWLLCQR

_{\hat{Δ}}

method outperformed the INWLLCQR method, and the INWLLCQR method outperformed the NWLLCQR

_{\hat{Δ}}

method, under a higher level of missing rate.

(2) It can be easily found that the simulated curve obtained by the IWLLCQR

_{\hat{Δ}}

method was very close to the true curve. Thus, the imputed estimation was reasonable. However, the bias of the INWLLCQR method was slightly greater than those for the IWLLCQR

_{\hat{Δ}}

and the WLLCQR method.

Example 2.

To examine the performance of the proposed test method, we consider the following model:

Y = β (U) X + ε,

where

X \sim N (0, 1), ε \sim t (5)

, U follows a uniform distribution on

[0, 1]

, and

X, U

, and ε are independent.

In order to illustrate our methods by using the dataset, artificial missing data were created by deleting some of the response values in the dataset at random. Assume that

40 %

of the response values in this data are missed in this example. Consider the testing problem:

H_{0} : β (u) = 1 v . s H_{1} : β (u) = 1 + λ (u^{2} - 0.5) (0 \leq λ \leq 1) .

In what follows, the proposed test procedure is applied in a simulation with 500 replications. For each replication, 500 samples were generated, and the bootstrap sampling was repeated 300 times. Suppose the significance level

α = 0.05

. Figure 4 shows that the simulated powers increased quickly as

λ

increased. In particular, the simulated size of the test

T_{n}

was 0.043, which is close to the true significant level of

α = 0.05

, when the null hypothesis holds. This demonstrates that the bootstrap estimate of the null distribution was considerably effective, which shows that our test was very powerful.

6. A Real Data Example

In this section, we apply the methods proposed in this paper to the dataset on air pollution that the Norwegian Public Roads Administration collected. The dataset, which can be found in StatLib, consists of 500 observations. The varying coefficient model based on the CQR method was used by Guo and Tian [16] to fit the relation among the hourly values of the logarithm of the number of cars per hour

(X_{1})

, wind speed

(X_{2})

, the logarithm of the concentration of NO

_{2}

(Y), and the hour of the day

(T)

. We deleted about

35 %

of the completely observed Y randomly to illustrate our proposed methods. Now, we investigate the varying coefficient model with the response data missing:

Y = β_{1} (T) X_{1} + β_{2} (T) X_{2} + ε .

(15)

Since the coefficient functions of the model (15) are really time varying, we need to consider the following testing problem:

H_{0} : β (\cdot) = β v . s H_{1} : β (\cdot) \neq β

(16)

where

β = {(β_{1}, β_{2})}^{T}

is a constant vector. The model (15) is just a classical linear model if the null hypothesis in (16) is true. For the testing problem (16), we should reject the null hypothesis

H_{0}

at a significance level of 0.05 because the p value of test

T_{n}

was 0.00 based on 500 resampling bootstraps.

In addition, the estimated functions of

β_{1} (\cdot)

and

β_{2} (\cdot)

are given, and the computational results came from 500 simulation runs. The estimated coefficients and the standard deviations are summarized in Table 2 for WLLCQR

_{Δ}

, NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR.

From Table 2, we can find that IWLLCQR

_{\hat{Δ}}

and WLLCQR had much smaller standard deviations than WLLCQR

_{Δ}

, WLLCQR

_{\hat{Δ}}

, and INWLLCQR, respectively. In what follows, we give estimated functions of

β_{1} (\cdot)

and

β_{2} (\cdot)

along with the 95% bootstrap confidence bands. The results in Figure 5 and Figure 6 show that

β_{1} (u)

and

β_{2} (u)

were time varying. Furthermore, we can also see that the IWLLCQR

_{\hat{Δ}}

method had almost equal confidence intervals as the WLLCQR method. Hence, the IWLLCQR

_{\hat{Δ}}

method was reasonable.

7. Discussions

In the simulation study and the practical applications, we mainly found that the CQRE method was much more stable, efficient, and effective for varying coefficient models with response data missing at random than the single QRE method and least squares method when the sample size was large enough and the error faced a different distribution, which means the bias was at a relatively low level and the correct selection rate relatively higher.

In the face of high-dimensional data, these three method were not ideal, but the CQRE method was relatively better. This case arises widely in many research fields such as reliability life testing, genetic data research, medical tracking trials, population census, economics and finance, environment monitoring and biomedical research, etc. How to modify our method to improve the performance of the CQRE method for high-dimensional varying coefficient models with response data missing at random is an important topic that we will study further.

On the other hand, this paper with only response data missing at random studied the CQR estimation and inference for varying coefficient models. However, it did not consider the case of covariant data missing, nor even the more general case of both response and covariant data missing. These problems are more challenging topics that we will explore and study further in the coming year.

8. Concluding Remarks

In this paper, a CQRE method was proposed for varying coefficient models with response data missing at random to develop three estimators including the WLLCQR estimator, the NWLLCQR estimator, and the IWLLCQ estimator for unknown coefficient functions and establish some results on the asymptotic normality of these proposed estimators under some mild conditions. Following, a bootstrap-based test procedure was designed to perform a simulation study, which demonstrated that the unknown coefficient estimators with IWLLCQR were superior to the other two ones with WLLCQR and NWLLCQR. Meanwhile, based on the IWLLCQR fittings, a bootstrap test procedure was also designed to test whether the coefficient functions were actually varying. Finally, a type of investigated real-life dataset was analyzed to illustrate that the CQRE method was much more stable, efficient, and effective for varying coefficient models with response data missing at random than the single QRE method and least squares method.

Author Contributions

All the authors inferred the main conclusions and approved of the current version of this manuscript.

Acknowledgments

The authors would like to thank the anonymous referees for their valuable comments and suggestions, which actually stimulated this work. The work was supported by the National Natural Science Foundations of China (11601409, 71501155 and 11201362), the Natural Science Foundations of Shaanxi Province of China (2016JM1009), and the Natural Science Foundations of the Department of Shaanxi Province of China (2017JK0344).

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

QR	quantile regression
CQR	composite QR
WCQR	weighted CQR
CQRE	CQR estimation
LLQR	local linear QR
LLCQR	local linear CQR
WLLCQR	weighted LLCQR
NWLLCQR	nonparametric WLLCQR
IWLLCQR	imputed WLLCQR

Appendix A

The following conditions are needed for the results in Section 3.

(C1)

{(Y_{i}, X_{i}) : i = 1, 2, \dots, n}

are independent and identically distributed random vectors.

(C2) The density function

f (\cdot)

of

ε

has a continuous and uniformly-bounded derivative, namely

0 < {sup}_{s} f^{'} (s) < B_{0}

.

(C3) Matrix

E (X_{i}^{T} X_{i} | T = t)

is a positive definite matrix, and

E (X_{i} | T) = 0

.

(C4) Random variable U has a second-order differentiable density function

f_{U} (u) > 0

in some neighborhood of u.

(C5) The coefficient function

β (u)

is second-order differentiable in a neighborhood of a given u, and

β^{″} \neq 0

is continuous.

(C6) The kernel function

K (\cdot)

is a symmetric density function with a compact support, whose bandwidth

h \to 0, n h \to \infty

as

n \to \infty

.

(C7) The bandwidth

h_{0} \to 0, h_{0} / h \to 0

, and

n h_{0} \to \infty

as

n \to \infty

.

(C8) The selection probability function

Δ (u) > 0

has a bounded and continuous second derivative on the support of U.

The following lemma is useful for proving some theorems given in Section 3.

Lemma A1

(See Lemma 2 in [15]). Let

(Y_{1}, X_{1}), (Y_{2}, X_{2}), \dots, (Y_{n}, X_{n})

be independent and identically distributed (i.i.d) random vectors, where the

Y_{i}^{^{'}} s

are scalar random variables. Suppose that

E | Y_{i} |^{3} < \infty

and

{sup}_{x} \int {| y |}^{s} φ (x, y) d y < \infty,

where

f (\cdot, \cdot)

represents the density of

(X, Y)

. Let

K (\cdot)

be a bounded positive function with a bounded support, satisfying the Lipschitz condition. Then:

s u p_{x} | \frac{1}{n} \sum_{1}^{n} {K_{h} (X_{i} - x) Y_{i} - E [K_{h} (X_{i} - x) Y_{i}]} | = O_{p} (\frac{l n^{1 / 2} (1 / h)}{\sqrt{n h}}) .

(A1)

In what follows, the main theorems in Section 3 will be proven.

Proof of Theorem 1.

Let

K_{i} (u) = K_{h} {(U_{i} - u) / h}, s_{i} (u) = (U_{i} - u) / h

,

η_{i, k} (u) = I_{(ε_{i} \leq c_{k} - r_{i} (u))} - τ_{k}

with

r_{i} (u) = X_{i}^{T} (β (U_{i}) - β (U_{i}) - β^{'} (u) (U_{i} - u))

,

θ = \sqrt{n h} {{\hat{c}}_{1} - c_{1}, \dots, {\hat{c}}_{q} - c_{q}, {[\hat{a} - β (u)]}^{T}, {[\hat{b} - β^{^{'}} (u)]}^{T}}^{T}

,

X_{i, k} (u) = {e_{k}^{T}, X_{i}^{T}, X_{i}^{T} (U_{i} - u) / h}^{T}

with

e_{k}

a q-dimensional vector with one at the

k^{th}

position and zero, elsewhere. Since:

(\hat{c}, \hat{a}, \hat{b}) = arg m i n_{c, a, b} \sum_{k = 1}^{q} \sum_{i = 1}^{n} \frac{δ_{i}}{Δ (U_{i})} ρ_{τ_{k}} (Y_{i} - c_{k} - X_{i}^{T} [a + b (U_{i} - u)]) K_{h} (U_{i} - u) .

θ

is the minimizer of the criterion:

T_{n} (π (U), θ) = \sum_{k = 1}^{q} \sum_{i = 1}^{n} \frac{δ_{i} K_{i} (u)}{Δ (U_{i})} (ρ_{τ_{k}} (ε_{i} - c_{k} + r_{i} (u) - Δ_{i, k}) - ρ_{τ_{k}} (ε_{i} - c_{k} + r_{i} (u))),

where

Δ_{i, k} = X_{i, k}^{T} θ / \sqrt{n h}

. Applying the following identity (see Knight [34]):

ρ_{τ} (x - y) - ρ_{τ} (x) = y [I_{(x < 0)} - τ] + \int_{0}^{y} [I_{(x \leq s)} - I_{(x \leq 0)}] d s,

we can rewrite

T_{n} (Δ (U_{i}), θ)

as:

\begin{matrix} T_{n} (Δ (U_{i}), θ) & = \sum_{k = 1}^{q} \sum_{i = 1}^{n} \frac{δ_{i} K_{i} (u)}{Δ (U_{i})} {Δ_{i, k} [I_{(ε_{i} \leq c_{k} - r_{i} (u))} - τ_{k}] \\ + \int_{0}^{Δ_{i, k}} [I_{(ε_{i} - c_{k} + r_{i} (u) \leq z)} - I_{(ε_{i} - c_{k} + r_{i} (u) \leq 0)}]} d z \\ = {[W_{n} (u)]}^{T} θ + \sum_{k = 1}^{q} B_{n, k} (θ), \end{matrix}

where

W_{n} (u) = \frac{1}{\sqrt{n h}} \sum_{k = 1}^{q} \sum_{i = 1}^{n} \frac{δ_{i} K_{i} (u)}{Δ (U_{i})} η_{i, k} (u) X_{i, k} (u) and,

B_{n, k} (θ) = \sum_{i = 1}^{n} \frac{δ_{i} K_{i} (u)}{Δ (U_{i})} \int_{0}^{Δ_{i, k}} [I_{(ε_{i} - c_{k} + r_{i} (u) \leq z)} - I_{(ε_{i} - c_{k} + r_{i} (u) \leq 0)}]} d z .

Then, it follows from Lemma A1 that

B_{n, k} (θ) = E (B_{n, k} (θ)) + o_{p} (1)

. Denote:

S_{n} (u) = D i a g \{(\begin{matrix} S_{n, 11} (u) & S_{n, 12} (u) \\ S_{n, 21} (u) & S_{n, 22} (u) \end{matrix}), S_{n, 33} (u)\},

where:

S_{n, 11} (u) = \frac{1}{n h} D i a g [f (c_{1}) \sum_{i = 1}^{n} K_{i} (u), \dots, f (c_{q}) \sum_{i = 1}^{n} K_{i} (u)],

S_{n, 12} (u) = \frac{1}{n h} {[f (c_{1}) \sum_{i = 1}^{n} K_{i} (u) X_{i}, \dots, f (c_{q}) \sum_{i = 1}^{n} K_{i} (u) X_{i}]}^{T},

S_{n, 22} (u) = \frac{1}{n h} \sum_{k = 1}^{q} f (c_{k}) \sum_{i = 1}^{n} K_{i} (u) X_{i} X_{i}^{T},

S_{n, 33} (u) = \frac{1}{n h} \sum_{k = 1}^{q} f (c_{k}) \sum_{i = 1}^{n} K_{i} (u) X_{i} X_{i}^{T} s_{i}^{2} .

We observe that by the iterative expectation:

\begin{matrix} \sum_{k = 1}^{q} E [B_{n, k} (θ) | X, U] & = \sum_{k = 1}^{q} \sum_{i = 1}^{n} K_{i} (u) \int_{0}^{Δ_{i, k}} E [I_{(ε_{i} - c_{k} + r_{i} (u) \leq z)} - I_{(ε_{i} - c_{k} + r_{i} (u) \leq 0)}] d z \\ = \sum_{k = 1}^{q} \sum_{i = 1}^{n} K_{i} (u) \int_{0}^{Δ_{i, k}} [F (c_{k} - r_{i} (u) + z) - F (c_{k} - r_{i} (u))] d z \\ = \frac{1}{2} θ^{T} (\frac{1}{n h} \sum_{k = 1}^{q} \sum_{i = 1}^{n} K_{i} (u) f (c_{k}) (1 + o (1)) X_{i, k} (u) X_{i, k}^{T} (u)) θ + o_{p} (1) \\ = \frac{1}{2} θ^{T} S_{n} (u) θ + o_{p} (1) . \end{matrix}

As in Parzen [35], we have:

S_{n, 11} (u) \overset{P}{⟶} S_{11} (u) = f_{U} (u) D i a g {f (c_{1}, \dots, f (c_{q})},

S_{n, 12} (u) \overset{P}{⟶} S_{12} (u) = f_{U} (u) E (X_{i} | U = u) [f (c_{1}, \dots, f (c_{q}))],

S_{n, 22} (u) \overset{P}{⟶} S_{22} (u) = f_{U} (u) \sum_{k = 1}^{q} f (c_{k}) E (X_{i} X_{i}^{T} | U = u),

S_{n, 33} (u) \overset{P}{⟶} S_{33} (u) = f_{U} (u) μ_{2} \sum_{k = 1}^{q} f (c_{k}) E (X_{i} X_{i}^{T} | U = u) .

Based on the above results, we can prove that:

T_{n} (Δ (U_{i}), θ) = \frac{1}{2} θ^{T} S_{n} (u) θ + {[W_{n} (u)]}^{T} θ + o_{p} (1),

where:

S (u) = D i a g \{(\begin{matrix} S_{11} (u) & S_{12} (u) \\ S_{21} (u) & S_{22} (u) \end{matrix}), S_{33} (u)\} .

By Corollary 2 of Knight [34], we have

θ \overset{P}{⟶} - {[S (u)]}^{- 1} W_{n} (u)

. Assume Condition (C3) is satisfied. Then,

S (u) = D i a g {S_{11} (u), S_{22} (u), S_{33} (u)}

. Simple calculation of the block matrix yields:

\sqrt{n h} (\hat{β} (u) - β (u)) \overset{P}{⟶} - {[S_{22} (u)]}^{- 1} W_{n, 2} (u),

(A2)

where

W_{n, 2} (u) = \frac{1}{\sqrt{n h}} \sum_{i = 1}^{n} \frac{δ_{i} K_{i} (u)}{Δ (U_{i})} η_{i} (u) X_{i}

. Let

{\tilde{W}}_{n, 2} (u) = \frac{1}{\sqrt{n h}} \sum_{i = 1}^{n} \frac{δ_{i} K_{i} (u)}{Δ (U_{i})} η_{i} X_{i}

. It is easy to verify that

E ({\tilde{W}}_{n, 2} (u)) = 0

. As in Parzen [35], we obtain:

V a r ({\tilde{W}}_{n, 2} (u)) = E [\frac{1}{n h} \sum_{i = 1}^{n} \frac{K_{i}^{2} (u)}{Δ (U_{i})} η_{i}^{2} X_{i} X_{i}^{T}] = f_{U} (u) ν_{0} E [\frac{η_{i}^{2}}{Δ (U_{i})} X_{i} X_{i}^{T} | U = u] .

By the central limit theorem, we get

{\tilde{W}}_{n, 2} (u) \overset{L}{⟶} N (0, f_{U} (u) ν_{0} Ω_{u})

. Similar to Kai et al. (2011) [15], we can show that:

V a r ({\tilde{W}}_{n, 2} (u) - W_{n, 2} (u) | X, U) \leq \frac{q^{2}}{n h} \sum_{i = 1}^{n} \frac{δ_{i} K_{i} (u)}{Δ^{2} (U_{i})} X_{i} X_{i}^{T} \times max_{k} {F (c_{k} + | r_{i} (u) | - F (c_{k})} = o_{p} (1) .

By Slutsky’s theorem, we obtain:

W_{n, 2} (u) - E [W_{n, 2} (u)] \overset{L}{⟶} N (0, f_{U} (u) ν_{0} Ω_{u}) .

(A3)

Since:

\begin{matrix} \frac{1}{\sqrt{n h}} E [W_{2 n} (u) | X, U] & = \frac{1}{n h} \sum_{k = 1}^{q} \sum_{i = 1}^{n} K_{i} (u) [F (c_{k} - r_{i} (u) - F (c_{k})] X_{i} \\ = - \frac{1}{n h} \sum_{k = 1}^{q} \sum_{i = 1}^{n} K_{i} (u) f (c_{k}) [1 + o (1)] r_{i} (u) X_{i}, \end{matrix}

it is easy to obtain:

\frac{1}{\sqrt{n h}} E [W_{2 n} (u)] = - \frac{μ_{2} h^{2}}{2} f_{U} (u) S_{22} β^{″} (u) + o (h^{2}) .

(A4)

Together with (A2) and (A4), we have:

\sqrt{n h} (\hat{β} (u) - β (u) - \frac{1}{2} μ_{2} h^{2} β^{″} (u)) \overset{L}{⟶} N (0, \frac{ν_{0}}{f_{U} (u)} D_{u}^{- 1} Ω_{u} D_{u}^{- 1}) .

This completes the proof of Theorem 1. □

Proof of Theorem 2.

Let

θ^{*} = \sqrt{n h} {{\hat{c}}_{1}^{*} - c_{1}, \dots, {\hat{c}}_{q}^{*} - c_{q}, {[{\hat{a}}^{*} - β (u)]}^{T}, h {[{\hat{b}}^{*} - β^{^{'}} (u)]}^{T}}^{T}

. Similar to the proof of Theorem 1, we have:

\begin{matrix} T_{n}^{*} (\hat{Δ} (U), θ^{*}) & = \sum_{k = 1}^{q} \sum_{i = 1}^{n} \frac{δ_{i} K_{i} (u)}{\hat{Δ} (U_{i})} {Δ_{i, k}^{*} [I_{(ε_{i} \leq c_{k} - r_{i} (u))} - τ_{k}] \\ + \int_{0}^{Δ_{i, k}^{*}} [I_{(ε_{i} - c_{k} + r_{i} (u) \leq z)} - I_{(ε_{i} - c_{k} + r_{i} (u) \leq 0)}]} d z \\ = {[W_{n}^{*} (u)]}^{T} θ^{*} + \sum_{k = 1}^{q} B_{n, k}^{*} (θ^{*}), \end{matrix}

where:

W_{n}^{*} (u) = \frac{1}{\sqrt{n h}} \sum_{k = 1}^{q} \sum_{i = 1}^{n} \frac{δ_{i} K_{i} (u)}{\hat{Δ} (U_{i})} η_{i, k} (u) X_{i, k} (u),

B_{n, k}^{*} (θ^{*}) = \sum_{i = 1}^{n} \frac{δ_{i} K_{i} (u)}{\hat{Δ} (U_{i})} \int_{0}^{Δ_{i, k}^{*}} [I_{(ε_{i} - c_{k} + r_{i} (u) \leq z)} - I_{(ε_{i} - c_{k} + r_{i} (u) \leq 0)}]} d z .

Let:

\begin{matrix} H_{n, k}^{*} (θ^{*}) = \sum_{i = 1}^{n} \frac{δ_{i} (Δ (U_{i}) - \hat{Δ} (U_{i}))}{\hat{Δ} (U_{i}) Δ (U_{i})} \times & \int_{0}^{Δ_{i, k}^{*}} K_{i} (u) [I_{(ε_{i} - c_{k} + r_{i} (u) \leq z)} - I_{(ε_{i} - c_{k} + r_{i} (u) \leq 0)}]} d z . \end{matrix}

Then,

B_{n, k}^{*} (θ^{*}) = B_{n, k} (θ^{*}) + H_{n, k}^{*} (θ^{*})

. It is easy to verify that:

\sum_{i = 1}^{n} \frac{δ_{i}}{Δ (U_{i})} \int_{0}^{Δ_{i, k}^{*}} K_{i} (u) [I_{(ε_{i} - c_{k} + r_{i} (u) \leq z)} - I_{(ε_{i} - c_{k} + r_{i} (u) \leq 0)}]} d z = O_{p} (1) .

(A5)

Considering the fact that

{sup}_{u} | \hat{Δ} (u) - Δ (u) | = o (1)

, it follows from (A5) that

H_{n, k}^{*} (θ^{*}) = o_{p} (1)

, and then:

\sum_{k = 1}^{q} B_{n, k}^{*} (θ^{*}) = \sum_{k = 1}^{q} B_{n, k} (θ^{*}) + o_{p} (1) .

(A6)

Similar to the proof of Theorem 1, we can prove that:

\sqrt{n h} ({\hat{β}}_{N} (u) - β (u)) \overset{P}{⟶} - {[S_{22} (u)]}^{- 1} W_{n, 2}^{*} (u),

(A7)

where

W_{n, 2}^{*} (u) = \frac{1}{\sqrt{n h}} \sum_{i = 1}^{n} \frac{δ_{i} K_{i} (u)}{\hat{Δ} (U_{i})} η_{i} (u) X_{i}

. Let

{\tilde{W}}_{n, 2}^{*} (u) = \frac{1}{\sqrt{n h}} \sum_{i = 1}^{n} \frac{δ_{i} K_{i} (u)}{\hat{Δ} (U_{i})} η_{i} X_{i}

. By the proof of Theorem 3 in Wong [33], we can obtain:

\begin{matrix} {\tilde{W}}_{n, 2}^{*} (u) & = \frac{1}{\sqrt{n h}} \sum_{i = 1}^{n} \frac{δ_{i}}{\hat{Δ} (U_{i})} K_{i} (u) η_{i} X_{i} + \frac{1}{\sqrt{n h}} \sum_{i = 1}^{n} \frac{δ_{i} - Δ (U_{i})}{Δ (U_{i})} E [K_{i} (u) η_{i} X_{i} | Z_{i}] + o_{p} (h^{2}) \\ = {\tilde{W}}_{n, 21}^{*} (u) + {\tilde{W}}_{n, 22}^{*} (u) + o_{p} (h^{2}), \end{matrix}

where

E ({\tilde{W}}_{n, 21}^{*} (u)) = 0

and

E ({\tilde{W}}_{n, 22}^{*} (u)) = 0

. Furthermore,

V a r ({\tilde{W}}_{n, 21}^{*} (u)) = f_{U} (u) ν_{0} E [\frac{η_{i}^{2}}{Δ (U_{i})} X_{i} X_{i}^{T} | U_{i}],

V a r ({\tilde{W}}_{n, 22}^{*} (u)) = f_{U} (u) ν_{0} E [\frac{1 - Δ (U_{i})}{Δ (U_{i})} E {X_{i} η_{i} | U_{i}}^{\otimes 2}],

C o v ({\tilde{W}}_{n, 21}^{*} (u), {\tilde{W}}_{n, 22}^{*} (u)) = f_{U} (u) ν_{0} E [\frac{1 - Δ (U_{i})}{Δ (U_{i})} E {X_{i} η_{i} | U_{i}}^{\otimes 2}] .

Completing the calculation, we obtain:

V a r ({\tilde{W}}_{n, 2}^{*} (u)) = f_{U} (u) ν_{0} \{E (\frac{η_{i}^{2}}{Δ (U_{i})} X_{i} X_{i}^{T} | U_{i}) - E [\frac{1 - Δ (U_{i})}{Δ (U_{i})} E {X_{i} η_{i} | U_{i}}^{\otimes 2}]\} + o (1) .

Based on the above results, it follows that

{\tilde{W}}_{n, 2}^{*} (u) \overset{L}{⟶} N (0, f_{U} (u) ν_{0} Ω_{u}^{*})

. Similar to the proof of Theorem 1, we have:

V a r ({\tilde{W}}_{n, 2}^{*} (u) - W_{n, 2}^{*} (u) | X, U) = o_{p} (1)

. Thus:

W_{n, 2}^{*} (u) - E [W_{n, 2}^{*} (u)] \overset{L}{⟶} N (0, f_{U} (u) ν_{0} Ω_{u}^{*})

(A8)

By Lemma A1, we get:

\frac{1}{n h} \sum_{i = 1}^{n} δ_{i} K_{i} (u) η_{i} X_{i} \overset{P}{⟶} E [\frac{1}{n h} \sum_{i = 1}^{n} δ_{i} K_{i} (u) η_{i} X_{i}] = O (h^{2}) .

Since

\frac{1}{\hat{Δ} (U_{i})} - \frac{1}{Δ (U_{i})} = o_{p} (1)

, then

\begin{matrix} \frac{1}{\sqrt{n h}} W_{n, 2}^{*} (u) & = \frac{1}{\sqrt{n h}} \sum_{i = 1}^{n} \frac{δ_{i}}{\hat{Δ} (U_{i})} K_{i} (u) η_{i} X_{i} + \frac{1}{\sqrt{n h}} \sum_{i = 1}^{n} δ_{i} [\frac{1}{\hat{Δ} (U_{i})} - \frac{1}{Δ (U_{i})}] K_{i} (u) η_{i} X_{i} \\ = \frac{1}{\sqrt{n h}} \sum_{i = 1}^{n} W_{n, 2} (u) + o_{p} (h^{2}) . \end{matrix}

Thus, we can show that:

\frac{1}{\sqrt{n h}} E [W_{n, 2}^{*} (u)] = \frac{1}{\sqrt{n h}} E [W_{n, 2} (u)] + o (h^{2}) .

(A9)

Following (A7), (A9), and Theorem 1, we complete the proof of Theorem 2. □

Proof of Theorem 3.

Write

θ^{* *} = \sqrt{n h} {{\hat{c}}_{1}^{* *} - c_{1}, \dots, {\hat{c}}_{q}^{* *} - c_{q}, {{\hat{a}}^{* *} - β (u)}^{T}, h {{\hat{b}}^{* *} - β^{^{'}} (u)}^{T}}^{T}

.

Δ_{i, k}^{* *} = X_{i, k}^{T} θ^{* *} / \sqrt{n h}

,

η_{i, k}^{*} (u) = I_{(\frac{δ_{i}}{\hat{Δ} (U)} ε_{i} \leq - r_{i} (u))}

, then we have:

\begin{matrix} Y_{i}^{*} & = \frac{δ_{i}}{\hat{π} (U_{i})} Y_{i} + (1 - \frac{δ_{i}}{\hat{π} (U_{i})}) X_{i}^{T} {\hat{β}}_{C} (U_{i}) \\ = \frac{δ_{i}}{\hat{π} (U)} ε_{i} + X_{i}^{T} {\hat{β}}_{C} (U_{i}) + o_{p} (1), \end{matrix}

Similar to the proof of Theorem 2, we have:

\begin{matrix} T_{n}^{* *} (\hat{π} (U), θ^{* *}) & = \sum_{k = 1}^{q} \sum_{i = 1}^{n} δ_{i} K_{i} (u) {Δ_{i, k}^{* *} [I_{(\frac{1}{\hat{π} (U)} ε_{i} \leq c_{k} - r_{i} (u))} - τ_{k}] \\ + \int_{0}^{Δ_{i, k}^{* *}} [I_{(\frac{1}{\hat{π} (U)} ε_{i} - c_{k} + r_{i} (u) \leq z)} - I_{(\frac{1}{\hat{π} (U)} ε_{i} - c_{k} + r_{i} (u) \leq 0)}]} d z \\ + \sum_{k = 1}^{q} \sum_{i = 1}^{n} (1 - \frac{δ_{i}}{\hat{π} (U)}) K_{i} (u) {Δ_{i, k}^{* *} [I_{(0 \leq c_{k} - r_{i} (u))} - τ_{k}] \\ + \int_{0}^{Δ_{i, k}^{* *}} [I_{(r_{i} (u) - c_{k} \leq z)} - I_{(r_{i} (u) - c_{k} \leq 0)}]} d z \\ = {[W_{n}^{* *} (u)]}^{T} θ^{* *} + \sum_{k = 1}^{q} B_{n, k}^{* *} (θ^{* *}), \end{matrix}

where:

W_{n}^{* *} (u) = \frac{1}{\sqrt{n h}} \sum_{k = 1}^{q} \sum_{i = 1}^{n} δ_{i} K_{i} (u) η_{i, k}^{*} (u) X_{i} + \frac{1}{\sqrt{n h}} \sum_{k = 1}^{q} \sum_{i = 1}^{n} (1 - \frac{δ_{i}}{\hat{π} (U)}) K_{i} (u) ξ_{i, k}^{*} (u) X_{i},

\begin{matrix} B_{n, k}^{* *} (θ^{* *}) & = \sum_{i = 1}^{n} δ_{i} K_{i} (u) \int_{0}^{Δ_{i, k}^{* *}} [I_{(\frac{1}{\hat{π} (U)} ε_{i} - c_{k} + r_{i} (u) \leq z)} - I_{(\frac{1}{\hat{π} (U)} ε_{i} - c_{k} + r_{i} (u) \leq 0)}] d z \\ + \sum_{i = 1}^{n} (1 - \frac{δ_{i}}{\hat{π} (U)}) K_{i} (u) \int_{0}^{Δ_{i, k}^{* *}} [I_{(r_{i} (u) - c_{k} \leq z)} - I_{(r_{i} (u) - c_{k} \leq 0)}] d z . \end{matrix}

We can prove:

\sum_{i = 1}^{n} (1 - \frac{δ_{i}}{\hat{π} (U)}) K_{i} (u) \int_{0}^{Δ_{i, k}^{* *}} [I_{(r_{i} (u) - c_{k} \leq z)} - I_{(r_{i} (u) - c_{k} \leq 0)}] d z = o_{p} (1) .

Similar to the proof of Theorem 1, we can complete the proof. □

References

Hastie, T.J.; Tibshirani, R.J. Varying-coefficient models. J. R. Stat. Soc. Ser. 1993, 55, 757–796. [Google Scholar] [CrossRef]
Chiang, C.T.; Rice, J.A.; Wu, C.O. Smoothing spline estimation for varying coefficient models with repeatedly measured dependent variables. J. Am. Stat. Assoc. 2001, 96, 605–619. [Google Scholar] [CrossRef]
Eubank, R.L.; Huang, C.; Maldonado, Y.M.; Wang, N.; Wang, S.; Buchanan, R.J. Smoothing spline estimation in varying coefficient models. J. R. Stat. Soc. Ser. 2004, 66, 653–667. [Google Scholar] [CrossRef]
Fan, J.; Zhang, J.T. Statistical estimation in varying coefficient models. Ann. Stat. 1999, 27, 1491–1518. [Google Scholar]
Huang, J.; Wu, C.O.; Zhou, L. Varying coefficient models and basis function approximations for the analysis of repeated measurements. Biometrika 2002, 89, 111–128. [Google Scholar] [CrossRef]
Wu, C.O.; Yu, K.F.; Chiang, C.T. A two-step smoothing method for varying coefficient models with repeated measurements. Ann. Inst. Stat. Math. 2000, 52, 519–543. [Google Scholar] [CrossRef]
Fan, J.; Huang, T. Profile likelihood inferences on semiparametric varying-cofficient partially linear models. Bernoulli 2005, 11, 1031–1057. [Google Scholar] [CrossRef]
Whang, Y.J. Smoothed empirical likelihood methods for quantile regression models. Econom. Theory 2006, 22, 173–205. [Google Scholar] [CrossRef]
Koenker, R. Quantiles Regression; Cambridge University Press: Cambridge, UK, 2005. [Google Scholar]
Kim, M.O. Quantile regression with varying coefficients. Ann. Stat. 2007, 35, 92–108. [Google Scholar] [CrossRef]
Cai, Z.; Xu, X. Nonparametric quantile estimations for dynamic smooth coefficient models. J. Am. Stat. Assoc. 2008, 103, 1595–1608. [Google Scholar] [CrossRef]
Cai, Z.; Xiao, Z. Semiparametric quantile regression estimation in dynamic models with partially varying coefficients. J. Econom. 2012, 167, 413–425. [Google Scholar] [CrossRef]
Tang, Q.G. Robust estimation for spatial semiparametric varying coefficient partially linear regression. Stat. Pap. 2015, 56, 1137–1161. [Google Scholar]
Zou, H.; Yuan, M. Composite quantile regression and the oracle model selection theory. Ann. Stat. 2008, 36, 1108–1126. [Google Scholar] [CrossRef]
Kai, B.; Li, R.; Zou, H. New efficient estimation and variable selection methods for semiparametric varying coefficient partially linear models. Ann. Stat. 2011, 39, 305–332. [Google Scholar] [CrossRef] [PubMed]
Guo, J.; Tian, M.Z. New efficient and robust estimation in varying coefficient models with heteroscedasticity. Stat. Sin. 2012, 22, 1075–1101. [Google Scholar]
Sun, J.; Gai, Y.; Lin, L. Weighted local linear composite quantile estimation for the case of general error distributions. J. Stat. Plan. Inference 2013, 143, 1049–1063. [Google Scholar] [CrossRef]
Yang, H.; Lv, J.; Guo, C.H. Weighted composite quantile regression estimation and variable selection for varying coefficient models with heteroscedasticity. J. Korean Stat. Soc. 2015, 44, 77–94. [Google Scholar] [CrossRef]
Luo, S.; Zhang, C.-Y. Nonparametric M-type regression estimation under missing response data. Stat. Pap. 2016, 57, 641–664. [Google Scholar] [CrossRef]
Rubin, D.B. Inference and missing data. Biometrika 1976, 63, 581–592. [Google Scholar] [CrossRef]
Sterne, J.; White, I.; Carlin, J.; Spratt, M.; Royston, P.; Kenward, M.; Carpenter, J. Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ 2009, 338, b2393. [Google Scholar] [CrossRef] [PubMed]
Wang, Q.; Linton, O.; HÄrdle, W. Semiparametric regression analysis with missing response at random. J. Am. Stat. Assoc. 2004, 99, 334–345. [Google Scholar] [CrossRef]
Wang, Q.; Sun, Z. Estimation in partially linear models with missing responses at random. J. Multivar. Anal. 2007, 98, 1470–1493. [Google Scholar] [CrossRef] [Green Version]
Wang, Q.; Rao, N.K. Empirical Likelihood-based inference under imputation for missing response data. Ann. Stat. 2002, 30, 896–924. [Google Scholar] [CrossRef]
Xue, L.G. Empirical likelihood confidence intervals for response mean with data missing at random. Scand. J. Stat. 2009, 36, 671–685. [Google Scholar] [CrossRef]
Wei, Y.; Ma, Y.; Carroll, R. Multiple imputation in quantile regression. Biometrika 2012, 99, 423–438. [Google Scholar] [CrossRef] [PubMed]
Lv, X.; Li, R. Smoothed empirical likelihood analysis of partially linear quantile regression models with missing response variables. Adv. Stat. Anal. 2013, 97, 317–347. [Google Scholar] [CrossRef]
Sherwood, B.; Wang, L.; Zhou, X. Weighted quantile regression for analyzing health care cost data with missing covariates. Stat. Med. 2013, 32, 4967–4979. [Google Scholar] [CrossRef]
Sun, Y.; Wang, Q.; Gilbert, P. Quantile regression for competing risks data with missing cause of failure. Ann. Stat. 2012, 22, 703–728. [Google Scholar] [CrossRef] [Green Version]
Chen, X.; Wan, T.K.; Zhou, Y. Efficient quantile regression analysis with missing observations. J. Am. Stat. Assoc. 2015, 110, 723–741. [Google Scholar] [CrossRef]
Kim, S.Y. Imputation methods for quantile estimation under missing at random. Stat. Its Interface 2013, 6, 369–377. [Google Scholar] [Green Version]
Nageswara, S.; Rao, V. Nadaraya-Watson estimator for sensor fusion. Opt. Eng. 1997, 36, 642–647. [Google Scholar] [Green Version]
Wong, H.; Guo, S.J.; Chen, M.; Wai-Cheung, I.P. On locally weighted estimation and hypothesis testing on varying coefficient models with missing covariates. J. Stat. Plan. Inference 2009, 139, 2933–2951. [Google Scholar] [CrossRef]
Knight, K. Limiting distributions for L1 regression estimators under general conditions. Ann. Stat. 1998, 26, 755–770. [Google Scholar]
Parzen, E. On estimation of a probability density function and model. Ann. Math. Stat. 1962, 33, 1065–1076. [Google Scholar] [CrossRef]

Figure 1. The comparison between the true curve and the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR simulation curve when

n = 200

and the selection probability function is

Δ_{1} (u)

.

Figure 1. The comparison between the true curve and the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR simulation curve when

n = 200

and the selection probability function is

Δ_{1} (u)

.

Figure 2. The comparison between the true curve and the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR simulation curve when

n = 200

and the selection probability function is

Δ_{2} (u)

.

Figure 2. The comparison between the true curve and the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR simulation curve when

n = 200

and the selection probability function is

Δ_{2} (u)

.

Figure 3. The comparison between the true curve and the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR simulation curve when

n = 200

and the selection probability function is

Δ_{3} (u)

.

Figure 3. The comparison between the true curve and the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR simulation curve when

n = 200

and the selection probability function is

Δ_{3} (u)

.

Figure 4. The comparison between the true curve and the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR simulation curve when

n = 200

and the selection probability function is

Δ_{3} (u)

.

Figure 4. The comparison between the true curve and the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR simulation curve when

n = 200

and the selection probability function is

Δ_{3} (u)

.

Figure 5. The comparison between the true curve and the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR simulation curve when

n = 200

and the selection probability function is

Δ_{3} (u)

.

Figure 5. The comparison between the true curve and the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR simulation curve when

n = 200

and the selection probability function is

Δ_{3} (u)

.

Figure 6. The comparison between the true curve and the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR simulation curve when

n = 200

and the selection probability function is

Δ_{3} (u)

.

Figure 6. The comparison between the true curve and the NWLLCQR

_{\hat{Δ}}

, IWLLCQR

_{\hat{Δ}}

, INWLLCQR, and WLLCQR simulation curve when

n = 200

and the selection probability function is

Δ_{3} (u)

.

Table 1. The MSE for estimators in Example 1.

Model Error	$Δ (u)$	n	MSE
Model Error	$Δ (u)$	n	WLLCQR $_{Δ}$	NWLLCQR $_{\hat{Δ}}$	IWLLCQR $_{\hat{Δ}}$	INWLLCQR	WLLCQR
Error(1)	$Δ_{1} (u)$	100	0.1219	0.1213	0.1192	0.1201	0.1182
	$Δ_{1} (u)$	200	0.1017	0.1012	0.0997	0.1003	0.0973
	$Δ_{2} (u)$	100	0.1845	0.1792	0.1701	0.1715	0.1645
	$Δ_{2} (u)$	200	0.1701	0.1698	0.1641	0.1654	0.1583
	$Δ_{3} (u)$	100	0.2685	0.2518	0.2346	0.2371	0.2207
	$Δ_{3} (u)$	200	0.1976	0.1903	0.1826	0.1842	0.1612
Error(2)	$Δ_{1} (u)$	100	0.0789	0.0775	0.0719	0.0728	0.0696
	$Δ_{1} (u)$	200	0.0646	0.0612	0.0598	0.0609	0.0559
	$Δ_{2} (u)$	100	0.1124	0.1102	0.1019	0.1027	0.0921
	$Δ_{2} (u)$	200	0.0997	0.0904	0.0898	0.0905	0.0802
	$Δ_{3} (u)$	100	0.3954	0.3542	0.3257	0.3302	0.3024
	$Δ_{3} (u)$	200	0.3356	0.3298	0.3021	0.3075	0.2814
Error(3)	$Δ_{1} (u)$	100	0.0598	0.0568	0.0514	0.0529	0.0498
	$Δ_{1} (u)$	200	0.0528	0.0515	0.0498	0.0502	0.0439
	$Δ_{2} (u)$	100	0.0687	0.0665	0.0621	0.0632	0.0596
	$Δ_{2} (u)$	200	0.0579	0.0558	0.0523	0.0545	0.0495
	$Δ_{3} (u)$	100	0.1102	0.1017	0.0987	0.1004	0.0812
	$Δ_{3} (u)$	200	0.0957	0.0922	0.0892	0.0904	0.0759

Table 2. The coefficient estimates and sample standard deviations (in parentheses) for the air pollution data.

	WLLCQR $_{Δ}$	NWLLCQR $_{\hat{Δ}}$	IWLLCQR $_{\hat{Δ}}$	INWLLCQR	WLLCQR
$β_{1} (t)$	−0.312 (0.046)	−0.307 (0.045)	−0.315 (0.042)	−0.30 (0.044)	−0.316 (0.041)
$β_{2} (t)$	−0.379 (0.104)	−0.378 (0.102)	−0.375 (0.099)	−0.376 (0.101)	−0.374 (0.098)

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Luo, S.; Zhang, C.-y.; Wang, M. Composite Quantile Regression for Varying Coefficient Models with Response Data Missing at Random. Symmetry 2019, 11, 1065. https://doi.org/10.3390/sym11091065

AMA Style

Luo S, Zhang C-y, Wang M. Composite Quantile Regression for Varying Coefficient Models with Response Data Missing at Random. Symmetry. 2019; 11(9):1065. https://doi.org/10.3390/sym11091065

Chicago/Turabian Style

Luo, Shuanghua, Cheng-yi Zhang, and Meihua Wang. 2019. "Composite Quantile Regression for Varying Coefficient Models with Response Data Missing at Random" Symmetry 11, no. 9: 1065. https://doi.org/10.3390/sym11091065

APA Style

Luo, S., Zhang, C.-y., & Wang, M. (2019). Composite Quantile Regression for Varying Coefficient Models with Response Data Missing at Random. Symmetry, 11(9), 1065. https://doi.org/10.3390/sym11091065

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Composite Quantile Regression for Varying Coefficient Models with Response Data Missing at Random

Abstract

1. Introduction

2. Estimation Based on the CQR Varying Coefficient Model With Missing Response

2.1. WLLCQR Estimation

2.2. Nonparametric WLLCQR Estimation

2.3. Imputed WLLCQR Estimation

3. Asymptotic Properties

4. A Bootstrap-Based Goodness-of-Fit Test

5. Simulation Study

6. A Real Data Example

7. Discussions

8. Concluding Remarks

Author Contributions

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI