Abstract
Spatial data not only enables smart cities to visualize, analyze, and interpret data related to location and space, but also helps departments make more informed decisions. We apply a Bayesian quantile regression (BQR) of the partially linear varying coefficient spatial autoregressive (PLVCSAR) model for spatial data to improve the prediction of performance. It can be used to capture the response of covariates to linear and nonlinear effects at different quantile points. Through an approximation of the nonparametric functions with free-knot splines, we develop a Bayesian sampling approach that can be applied by the Markov chain Monte Carlo (MCMC) approach and design an efficient Metropolis–Hastings within the Gibbs sampling algorithm to explore the joint posterior distributions. Computational efficiency is achieved through a modified reversible-jump MCMC algorithm incorporating adaptive movement steps to accelerate chain convergence. The simulation results demonstrate that our estimator exhibits robustness to alternative spatial weight matrices and outperforms both quantile regression (QR) and instrumental variable quantile regression (IVQR) in a finite sample at different quantiles. The effectiveness of the proposed model and estimation method is demonstrated by the use of real data from the Boston median house price.
1. Introduction
The technological progress in the field of spatial data management enables cities to better control the processing of spatial information. Geospatial applications enable people to access real-time maps to visualize spatial data, and this updated information can be used for decision-making. Spatial data, as widely collected and inexpensive geographical information systems on the internet, are frequently encountered in such diverse fields as real estate, finance, economics, geography, epidemiology, and environmetrics. The spatial autoregressive (SAR) model of Cliff and Ord [1] has attracted a great deal of interest among the class of spatial models, and it has extensively been used to address spatial interaction effects among geographical units in cross-sectional or panel data. A number of early studies are summarized in Anselin [2], Case [3], Cressie [4], LeSage [5], and Kazar and Celik [6], among others. These studies are mainly focused on the parametric SAR models, which are highly susceptible to model misspecification. However, the relationship between the covariates and the response variables are usually nonlinear effects in reality. Indeed, it has been proven in practice that many economic variables have highly nonlinear effects on response variables [7]. Neglecting potential nonlinear relationships often leads to inconsistent parameter estimates in spatial models, and even misleading conclusions [8]. Although fully nonparametric models possess the flexibility to capture underlying complex nonlinear effects, they may suffer from the “curse of dimensionality” [9] with high dimensional covariates. Semiparametric SAR models are used as an alternative for processing spatial data to overcome the problems mentioned above. It not only offers greater flexibility than generalized linear models but also alleviates the “curse of dimensionality” inherent in nonparametric models.
Semiparametric models have garnered significant attention in econometrics and statistics due to their parametric interpretability combined with nonparametric flexibility. Among these, the partially linear varying coefficient (PLVC) model [10] stands out as one of the most widely used. It provides a good tradeoff between the interpretation of a partially linear model and the adaptability of the varying coefficient model. In spatial data analysis, the partially linear varying coefficient spatial (PLVCS) model has been widely applied, improving the capacity of spatial analysis by exploring regression relationships. To approximate the varying coefficient functions in the PLVCS model, many authors have developed various methods, including the local smoothing method [11], basis expansion technique [12], Bayesian approach [13], profile quasi-maximum likelihood estimation method [14], etc. While much of the existing literature on PLVC models focuses on methodological developments and applications, rigorous theoretical analyses often assume data observed on grid points within a rectangular domain. To balance the interpretability and flexibility inherent in PLVC models while explicitly accounting for spatial dependence, we consider the PLVCSAR model.
It is well known that the basic spline [15] approximation technique is a popular method for modeling nonlinearities in the semiparametric model [7,16]. Compared to the kernel estimate, the main advantage of the splines method is that its calculations are relatively simple and easily combined with classical spatial econometric estimates. However, it requires the user to choose the appropriate position and number of knots. If the number of knots is too large, the model may be seriously overfitted. Conversely, if the number of knots is too small, the splines may not fully reflect the nonlinearity of the modeling curve. Various approaches have been proposed to solve the problem mentioned above. A common approach could vary the number of knots to minimize some criteria, such as the form of information criteria including AIC and BIC. Moreover, Bayesian penalty splines [17] offer an alternative approach through Bayesian shrinkage implemented via careful prior specification. In contrast, Bayesian free-knot splines ([18,19,20], among others) offer a distinct advantage: by treating both the position and number of spline knots as random variables within the modeling framework, this approach achieves intrinsic spatial adaptivity. This approach enables the model to automatically determine optimal smoothing parameters and implement variable bandwidth selection, a critical capability for handling heterogeneous patterns in data [19]. This adaptive capability constitutes the primary motivation for employing free-knot splines in our modeling framework, particularly when dealing with spatial or temporal processes exhibiting non-stationary characteristics.
The recent literature has increasingly focused on semiparametric SAR models. While quasi-likelihood estimation [8,21] remains prevalent, maximum likelihood-based approaches face computational challenges due to the need for repeated determinant calculations of large-dimensional matrices. Furthermore, these methods typically require homoscedastic error assumptions. Alternative estimators employing instrumental variables (IV) [22], generalized method of moments (GMM) [23], and Bayesian approach [24] relax the homoscedasticity constraint but remain fundamentally limited to mean regression frameworks. Crucially, all existing approaches rely on the restrictive zero-mean error assumption, thereby failing to characterize potential heterogeneous covariate effects across response quantiles.
Quantile regression (QR), introduced by Koenker and Bassett [25], offers significant advantages over traditional mean regression. It not only delivers robust estimations but also comprehensively delineates the fundamental relationship between covariates and the response variable throughout the entire conditional distribution. Crucially, QR captures heterogeneous covariate effects at different quantile levels of the response variable, thereby addressing a fundamental limitation of conventional approaches. For instance, Dai et al. [26] considered the IV approach for the quantile regression of PLVCSAR models. As concerns the frequency method, the estimation strategy hinges on minimizing the sum of asymmetric absolute deviations. Particularly at the extreme quantile levels, not only is the estimation accuracy low, but it is also prone to instability in estimation. As a result, more and more scholars tend to use the Bayesian approach to estimate models. Pioneering work by Yu and Moyeed [27] established a Bayesian QR paradigm utilizing an asymmetric Laplace error distribution as a working likelihood for inferential procedures, and sampling unknown quantities from its posterior distribution via the MCMC approach is feasible even under the most complex model specifications.
To sum up, Bayesian quantile regression (BQR) has emerged as a versatile framework, spanning diverse applications including longitudinal analysis [28], risk measure [29], variable selection [30], empirical likelihood [31], etc. In the progress of Bayesian spatial quantile regression methodology, Lee and Neocleous [32] first proposed fixed effect model to process spatial counting data; King and Song [33] then extended the framework to include random spatial effects. Related developments also include the spatial process multiple (or individual) quantile regression method proposed by Lum and Gelfand [34] and the spatiotemporal trend joint (or simultaneous) quantile regression method established by Reich et al. [35]. The former models each quantile independently, while the latter estimates all quantiles simultaneously. In the spatial context, Castillo et al. [36] considered the multi-quantile regression of mixed effects autoregressive models, while Chen and Tokdar [37] and Castillo et al. [38] developed a joint quantile regression of spatial data. However, BQR, to the best of our knowledge, remains largely unexplored within semiparametric SAR models. Notably, Chen et al. [39] proposed a Bayesian P-splines approach for the quantile regression of PLVCSAR models.
Building upon these collective results, we present a Bayesian free-knot splines method for the quantile regression of the PLVCSAR model to be applied to spatial dependent response. The contribution of this paper extends the traditional PLVCSAR models by allowing for quantile-specific effects and unknown heteroscedasticity. It can allow for varying degrees of spatial dependence in the response distributions at different quantiles. We develop an enhanced Bayesian approach that estimates unknown parameters and uses free-knot splines to approximate the nonparametric function. To enhance algorithmic convergence, we modify the movement step of the reversible-jump MCMC algorithm so that each iteration can relocate the positions of all knots instead of only one knot position. We develop a Bayesian sampling-based method that can be executed via the MCMC approach, and we design a Gibbs sampler to explore the joint posterior distributions. The computational tractability of MCMC methods in deriving posterior distributions through careful prior specification establishes Bayesian inference as a particularly attractive framework, even for complex modeling scenarios.
The remainder of this article is organized as follows. Section 2 specifies the Bayesian quantile regression for the PLVCSAR model for spatial data, subsequently approximating the link function via B-splines to derive the likelihood function. To establish a Bayesian sampling-based analytical framework, we delineate the prior distributions and derive the full conditional posteriors of the latent variables and unknown parameters, and we describe the detailed sampling algorithms in Section 3. Section 4 reports Monte Carlo results for the finite sample performance of the BQR estimator, and for the comparisons with the QR and IVQR estimators at different quantile points. Section 5 provides an empirical illustration, as well as the study results. Finally, we conclude the paper in Section 6.
2. Methodology
2.1. Model
Consider the following quantile regression for the PLVCSAR model:
where is the dependent variable for spatial unit i; , , and are the observations of the relevant covariates; is a prespecified constant spatial weight; is a q-dimensional vector of unknown smooth functions; is the smoothing variable; represents an unknown spatial autoregressive coefficient that measures neighbor-based autocorrelation with stability constraint ; is a p-dimensional unknown parameters; and is the error term whose th quantile on equals zero for .
2.2. Likelihood
Quantile regression is usually achieved by solving the minimization problem based on the check loss function. With model (1), the specific problem is to estimate , , and by minimizing the following objective function:
where , , , , is the so called check function, and is the indication function. In a Bayesian setup, we assume the error terms are mutually independent and identically distributed (i.i.d.) random variables from an asymmetric Laplace distribution with density
where is the scale parameter. Then, the conditional distribution of y is in the form of
Therefore, maximizing the likelihood (3) is equivalent to minimizing the objective function (2), giving (2) a likelihood-based interpretation. The location-scale mixture representation by introducing the asymmetric Laplace distribution [40], (1) can be equivalently written as
where , , and follows an exponential distribution with mean and is independent of . For ease of notation, we omit in the following expressions.
For , we represent the unknown functions in model (1) via B-splines approximations [15]. Specifically, for , is defined as a polynomial spline of degree with interior knots satisfying ; i.e.,
where , denotes the B-spline basis vector determined by the knot vector ; is the corresponding spline coefficients vector, and
are boundary knots.
Consequently, model (4) with spline specification (5) takes the following representation:
where and . We view as latent variables for and denote . Then, the matrix formulation of (7) can be written as
where denotes an specified spatial weight matrix, , , and is an matrix whose ith row is . Further, , where each is an submatrix.
The likelihood function associated with model (8) is given by the following:
where , , denotes the identity matrix, , and .
3. Bayesian Estimation
We develop a Bayesian approach employing a Gibbs sampler to fit the proposed model. We begin with describing the prior distributions for all unknown parameters, then, we derive the full conditional posteriors of the latent variables and all unknown parameters, and we narrate a detailed MCMC sampling scheme. Furthermore, we improve the movement step of the reversible-jump MCMC algorithm to relocate all knot positions at each iteration rather than adjusting a single knot as in conventional implementations.
3.1. Priors
To complete the Bayesian model specification, we assign prior distributions for all unknown parameters, including the spatial autocorrelation coefficient , regression coefficient vectors and , and scale parameter , as well as the number k and location of knots for the splines.
For , we choose a Poisson prior with mean
for the number of knots and a conditional uniform prior on knot positions given
where , , and are defined in Equation (6); we assign a hierarchical prior for coefficients and , which consists of a conjugate normal prior
and an inverse-gamma prior
on the unknown hyperparameter , where and are pre-specified hyperparameters for . This hierarchical specification effectively induces heavy-tailed t-distribution priors for and . In addition, we assign a conjugate inverse-gamma prior
on the scale parameter , where and are pre-specified hyperparameters. Throughout this study, we specify to induce a Cauchy prior for . For hyperparameters (), we employ weakly informative inverse gamma priors and . Finally, the spatial autocorrelation coefficient is assigned a uniform prior for , where and denote the extreme eigenvalues of the standardized spatial weight matrix W.
The joint prior of all unknown parameters is specified as follows:
where denotes the vector of hyperparameters. Note that hyperparameters are explicitly included in the parameter vector for computational tractability.
3.2. The Full Conditional Posteriors of the Latent Variables
From the likelihood function (9) together with a standard exponential density, for , the full conditional posterior distribution of the latent variables is as follows. Given the observation data and the remaining unknown parameters, the conditional density of is proportional to
where and . Since (11) constitutes the kernel of a generalized inverse Gaussian distribution, it follows that
The probability density function of is given by
where denotes the modified Bessel function of the third kind. The availability of efficient sampling algorithms for the generalized inverse Gaussian distribution [41] ensures the computational tractability of our Gibbs sampler for Bayesian quantile regression estimation.
3.3. The Full Conditional Posterior Distributions of the Parameters
Given the joint posterior’s analytical complexity, direct sampling is impracticable. Consequently, we develop a Metropolis–Hastings within the Gibbs algorithm to sample from the posterior distributions. Therefore, we derive the full conditional posterior distributions for all parameters and delineate the corresponding Markov chain Monte Carlo sampling procedures.
It follows from the likelihood function (9) that given the remaining unknown quantities, the conditional posterior distribution of the spatial autocorrelation coefficient is given by
Direct sampling from (12) is analytically intractable as the density lacks a standard closed form. To address this, we implement a Metropolis–Hastings algorithm [42,43] with the following procedure: generate candidate from a Cauchy distribution truncated to , centered at the current with scale , where serves as the proposal tuning parameter; and accept with probability
given the factor
Combining the likelihood function (9) with a standard exponential density, we obtain the full conditional posterior for the scale parameter conditional on as
where . Since (13) is an inverse-gamma distribution, we have
where and . Thus, the integration of scale parameters imposes no computational impediments within our Gibbs sampling framework.
Directly observed from likelihood function (9) and priors (10), the conditional joint posterior of given takes the form:
From the conditional joint posterior (14), the full conditional posterior of is given by
where , , and . It is easy to generate from the multivariate normal density (15).
We have from the joint posterior (14) that the full conditional posterior of for is given by
where , , , and , which gives rise to a marginal posterior distribution
where , , , are , k, , with , , , excluded, respectively.
We can see from (16) that the method of composition [44] can be used to draw from the conditional normal posterior
As it is convenient to generate from multivariate normal density (18), we mainly focus on sampling from (17). Hence, we design a partially collapsed Gibbs sampler [45] and use a reversible-jump sampler [18] to update the number and location of knots. Following the standard reversible-jump algorithm [46], we implement three transition operators: birth, death, and movement steps. While preserving the birth and death steps, we modify the movement step via the hit-and-run algorithm [47] so that all the knots can be relocated instead of only one knot in each iteration. That is, we generate a random direction vector , define a feasible range , and sample a signed distance from a Cauchy distribution with location 0 and scale truncated on . Finally, we set , reorder the knots, and accept candidate knots with probability
where and denote candidate analogues of current-state and .
3.4. Sampling
Bayesian estimates are obtained via MCMC simulations from the full conditional posterior distributions of all parameters. The detailed pseudocode for our MCMC algorithm (Algorithm 1) is as follows:
| Algorithm 1 The pseudocode of the MCMC sampling scheme | |
| Input: Observed data . Initialization: Set with initial states and . MCMC iterations: For to T: Given current states and , sequentially update
| |
4. Monte Carlo Simulations
In this section, Monte Carlo simulations are used to assess the finite sample performance of the proposed model and estimation method. We assess the performance of the estimated varying-coefficient functions using the mean absolute deviation error (MADE) and global mean absolute deviation error (GMADE), defined as
evaluated at equidistant points spanning .
We conducted simulation studies using the date generated from the following model:
with covariates where
with for , the scalar smoothing variable being . The varying-coefficient functions with and , the regression coefficients are . The random error with being the standard normal CDF of , ensuring . By subtracting the th quantile, we obtain a random error for which the th quantile is equal to zero. For comparison, we specify two types of matrices to study the impact of the spatial weight matrix W on estimation performance, where W is generated under two scenarios: the Rook weight matrix [2] with and the Case weight matrix [3] with , respectively. For each configuration, we consider spatial dependence parameters at quantile levels , capturing weak to strong spatial autocorrelation.
According to the aforementioned design, we conducted 1000 replications of each group of simulation results. We assigned hyperparameters and apply a quadratic B-spline in our computation for . The initial states of the Markov chain are derived from the respective prior distributions of unknown parameters. By adjusting the values of the tuning parameters, and for are used such that the resultant acceptable rates of parameters can reach about 25%. We generated 6000 sampling values by running a Monte Carlo experiment, which removed the first 3000 values of each replication as a burn-in period. Based on the last 3000 sampling values, we computed the posterior mean (Mean), standard error (SE), and the 95% posterior credible intervals (95% CI) of the parameters across 1000 replications. Furthermore, we calculated the posterior standard deviation (SD) and compared it with the mean of the posterior SE. Following LeSage and Pace [48], we employed scalar summary measures for the marginal effects, derived as under the spatial specification in model (21). Total effects comprised direct effects (mean of diagonal entries) and indirect effects (mean of column-wise or row-wise sums of off-diagonal entries, excluding diagonal entries).
To assess MCMC convergence, we conducted five parallel Markov chains with distinct initial values through Monte Carlo experiments using an arbitrarily selected replication. The sampled traces of parts of unknown quantities, including model parameters and fitted varying-coefficient functions evaluated at grid points, are plotted in Figure 1 (only a replication with and at quantile is displayed). It is obvious that the five parallel MCMC chains are adequately mixed. We further calculated the “potential scale reduction factor” for all model parameters and fitted varying-coefficient functions at 10 equidistant grid points based on the five parallel sequences. Figure 2 (for ) demonstrates the evolution of values throughout iterations. It is easy to see that convergence is achieved within 2000 burn-in iterations, as all estimates stabilize below the recommended threshold of 1.2, following the suggestion of Gelman and Rubin [49].
Figure 1.
Trace plots of five parallel sequences corresponding to different starting values for parts of the unknown quantities.
Figure 2.
The “potential scale reduction factor” for simulation results.
Figure 3a shows the boxplots of the MADE and GMADE values with a at quantile under sample size . We calculated the medians , , and under a Rook weight matrix and the medians , , and under a Case weight matrix. The boxplots of the MADE and GMADE with a at quantile under sample size are displayed Figure 3b. We also computed the medians , , and under a Rook weight matrix and the medians , , and under a Case weight matrix. These results indicate that the MADE and GMADE values of the varying-coefficient functions decrease as the sample size increases, which suggests that the estimated performance of the unknown varying-coefficient functions improves as the sample size grows. It is evident that both the Case weight matrix and Rook weight matrix yield reasonable estimates in fitting varying-coefficient functions.
Figure 3.
Mean absolute deviation errors are presented in boxplots: (a) for and (b) for (the three left panels are based on a Rook weight matrix and the three right panels are based on a Case weight matrix with ).
The simulation results of parameter estimation are reported in Table 1 and Table 2. The simulation results demonstrate both minimal bias in parameter estimates (mean estimates closely align with true values) and robust uncertainty quantification (SE approximate empirical SD), confirming a high estimation accuracy. Moreover, the covariates affecting the response are different at different quantile points of the response distribution. Estimation precision improves with a larger sample sizes under identical spatial weight matrices. By comparing the estimates of spatial coefficient at the same quantile level with the same sample sizes, we observe that the estimation performances of becomes more and more accurate along with the increase in , and the Case weight matrices are slightly better than that with the Rook weight matrix. Furthermore, Table 1 and Table 2 indicate that all parameter estimators produce larger estimation errors for total effects under conditions of strong positive spatial dependence, regardless of comparable sample sizes. The results of repeating the above experiments with different starting values were similar, confirming the robustness of the proposed Gibbs sampler.
Table 1.
Simulation results for parameter estimation at .
Table 2.
Simulation results for parameter estimation at (const.).
Figure 4 depicts the estimated varying-coefficient functions at different quantile points, together with its 95% pointwise posterior credible intervals from a typical sample under when the sample size is and , respectively. The way to select a typical sample is to make its GMADE value equal to the median of the 1000 replicates. The results show that the estimation performance of the variable coefficient function improves with the increase in the sample size, while the effects exhibit quantile-specific heterogeneity across the response distribution.
Figure 4.
The estimated varying-coefficient functions (dotted(), starred (), and forked () lines) and their 95% pointwise posterior credible intervals (dot-dashed lines) for a typical sample. The solid lines represents the true varying-coefficient functions.
The simulations were implemented in C++ on an Intel(R) Core(TM) i7-8750H processor (2.20 GHz PC). The mean CPU times per replication reached 5 s () and 25 s (). The implementation code is available from the authors upon request.
In addition, in order to compare the performance between our Bayesian quantile regression estimator (BQRE) and the instrumental variable quantile regression estimator (IVQRE) [26], the following [26] spatial weight matrix was generated based on a mechanism that for , and then row normalization.
Example 1.
The samples were generated as follows:
where , , , and . The error term is specified as with Φ being the standard normal CDF of . Independent covariates are simulated such that , , and bivariate with and . For each configuration, 1000 simulation replications yield a bias and an RMSE (in parentheses) for parameter estimators, along with MADE [in brackets] for varying-coefficient function accuracy. Table 3 compares the QR, IVQR, and BQR methods under homoscedastic errors.
Table 3.
Simulation results for parameter estimation using model (22).
Example 2.
The samples were generated as follows:
where , , , and . The error term follows , where Φ denotes the standard normal CDF of . and , are bivariates, where and are generated independently.
Table 4 reports the comparison results of the QR, IVQR, and BQR estimation methods with a heteroscedastic error term. The results in Table 3 and Table 4 indicate that all estimators exhibit a decreasing bias, RMSE, and MADE with an increasing sample size. Simultaneously, the influence of explanatory variables on the response differs significantly across quantile levels of the conditional distribution. Moreover, the BQR estimator yields a marginally lower bias, RMSE, and MADE for both parameters and the varying-coefficient functions compared to the QR and IVQR estimators at the same quantile level under the same sample size. It is evident that BQR has superior relative performance, though QR and IVQR still produce statistically reasonable estimates. This comparative advantage persists despite QR/IVQR maintaining estimation feasibility, further reinforcing BQR’s methodological superiority in spatial quantile regression.
Table 4.
Simulation results for parameter estimation in model (23).
5. Application
To demonstrate our proposed method, we analyzed a real data set collected from the well-known Boston housing data. The data set was collected in the Boston Standard Meteropolitan Statistical Area of 1970. It contains about 506 different houses’ information from a variety of locations, and it is available from the spData package in R developed by Bivand et al. [50]. Since our model and method investigate not only the influencing effects of covariates but also the spatial effects of response variable at different quantile points, it is interesting to examine the socioeconomic drivers of housing price variation.
In this application, we mainly considered the influencing factors of the median value of owner-occupied homes (MEDV) from the aspects of pupil–teacher ratio by town school district (PTRATIO), full value property tax rate (TAX), the percentage of lower status proportion (LSTAT), per capita crime rate by town (CRIM), and nitric oxides concentration parts per 10 million (NOX) in Boston. In order to perform fitting at different lower status percentages in our model, we took LSTAT as the index variable. Meanwhile, log-transformed MEDV, PTRATIO, and TAX mitigate scale disparities induced by domain magnitude variations. This theoretical foundation motivated the specification of the PLVCSAR model:
where the response variable , , , , , . We adopted the Sun et al. [21] approach: constructing spatial weights via spherical Euclidean distances between housing coordinates (longitude, latitude). That is, the spatial weight
where , and is the Euclidean norm. Based on the above designs, we conducted 1000 replications of each experiment. We executed five independent runs of the proposed Gibbs sampler with different initial states and generated 10,000 sampled values after a burn-in of 20,000 iterations in each experiment. Part traces of parameters are plotted in Figure 5 (only a replication at the median quantile () is displayed), showing satisfactory mixing across parallel sequences. Using the five parallel sequences, we computed the “potential scale reduction factor” , which is plotted in Figure 6 (the case of quantile). These diagnostics confirm Markov chain convergence within the 20,000-iteration burn-in period.
Figure 5.
Trace plots for five parallel MCMC chains with different initializations for parts of the unknown quantities at quantile ().
Figure 6.
The “potential scale reduction factor” for Boston housing price data.
Table 5 lists the estimated parameters, their standard errors, and the 95% posterior credible intervals at different quantile points. The results reveal significant spatial autocorrelation () at , confirming statistically significant positive spatial spillover effects in housing markets. More interestingly, we see that the spatial effect slightly increases with the increase in quantiles. That is, the spatial effect changes across the quantile points. On the other hand, we observe that the regression coefficients of two explanatory variables PTRATIO and TAX are and at a quantile, respectively. It implies that PTRATIO and TAX have a positive and significant effect on the housing price. That is, the effect of PTRATIO increased with the increase in quantiles, while the effect of TAX decreased with the increase in quantiles. Therefore, we can observe the way that this quantile-dependent heterogeneity demonstrates differential mechanisms through which covariates influence housing markets along the price distribution.
Table 5.
Parameter estimation in model (24) for Boston housing data.
The estimated varying-coefficient functions at different quantile points, along with their corresponding 95% pointwise posterior credible intervals, are presented in Figure 7, exhibiting nonlinear characteristics. The results indicate that reaches a local maximum of at around at a quantile, and attains a local maximum of 0.8079 at around and a local minimum of at around at a quantile. This provides evidence that the analysis reveals significant nonlinear relationships: CRIM exhibits an inverted U-shaped effect, whereas NOX demonstrates a U-shaped pattern in housing prices. Furthermore, the varying-coefficient functions show distinct quantile-specific modulation across the price distribution, indicating differential response mechanisms at varying market valuation tiers.
Figure 7.
The estimated function (dotted (), starred (), and forked () lines) and its 95% pointwise posterior credible intervals (dot-dashed lines) at different quantile points in the model (24) for Boston housing price data.
6. Conclusions
Spatial data analysis, ubiquitous in empirical research, frequently employs spatial autoregressive (SAR) frameworks. To address specification risks in conventional SAR modeling, we propose a Bayesian quantile regression approach integrating partial linear varying coefficient (PLVC) structures with SAR components. This unified framework simultaneously captures linear and nonlinear covariate effects across response quantiles while enhancing predictive accuracy. Methodologically, we develop a fully Bayesian free-knot spline estimation procedure coupled with an optimized Metropolis–Hastings within the Gibbs sampler for posterior exploration. Computational efficiency is achieved through a modified reversible-jump MCMC algorithm incorporating adaptive movement steps to accelerate chain convergence. Monte Carlo simulations demonstrate the BQR estimator’s superior robustness to spatial weight matrix specifications compared to conventional QR and IVQR methods. While alternative estimators produce reasonable results, our approach exhibits statistically significant performance across quantile levels in finite-sample scenarios. Empirical validation through real-world data analysis further substantiates the methodology’s practical utility.
The proposed method can analyze spatial data exhibiting either homoscedastic or heteroscedastic errors without requiring a specification of the error distribution. A few more issues still merit further research. While the PLVCSAR model is used herein to evaluate covariate effects on the response, other approaches, including the partially linear single-index SAR and partially linear additive SAR models, could also be considered. We further characterize the asymptotic properties of spatial quantile processes under heterogeneous spatial dependency regimes. In addition, it is also interesting to study issues such as model selection and variable selection for high-dimensional predictors. Finally, the error distribution of model-based quantile regression is asymmetric Laplacian distribution, which has the advantage of a convenient calculation, but it has some limitations in accurately capturing uncertainty. Similar recent alternatives, such as the generalization method proposed by Yan et al. [51], may provide a more robust modeling approach and may become a promising direction for future research.
Author Contributions
Supervision, R.C. and Z.C.; software, R.C. and Z.C.; methodology, R.C. and Z.C.; writing—original draft preparation, R.C. and Z.C.; writing—review and editing, Z.C. All authors have read and agreed to the published version of the manuscript.
Funding
The project was supported by the National Social Science Foundation of China (Series number: 24BTJ067) and Open Fund of Xiamen Software Supply Chain Security Public Technology Service Platform (3502Z20231042).
Institutional Review Board Statement
Not applicable.
Data Availability Statement
The data presented in this study are openly available in Reference [50].
Acknowledgments
The authors are deeply grateful to the editors and anonymous referees for their careful reading and insightful comments, which helped to significantly improve this paper.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Cliff, A.D.; Ord, J.K. Spatial Autocorrelation; Pion Ltd.: London, UK, 1973. [Google Scholar]
- Anselin, L. Spatial Econometrics: Methods and Models; Kluwer Academic Publishers: Dordrecht, The Netherlands, 1988. [Google Scholar]
- Case, A.C. Spatial patterns in householed demand. Econometrica 1991, 59, 953–965. [Google Scholar] [CrossRef]
- Cressie, N. Statistics for Spatial Data; John Wiley and Sons: New York, NY, USA, 1993. [Google Scholar]
- LeSage, J.P. Bayesian estimation of spatial autoregressive models. Int. Reg. Sci. Rev. 1997, 20, 113–129. [Google Scholar] [CrossRef]
- Kazar, B.M.; Celik, M. Spatial Autoregressive Model; Springer Press: New York, NY, USA, 2012. [Google Scholar]
- Basile, R. Regional economic growth in Europe: A semiparametric spatial dependence approach. Pap. Reg. Sci. 2008, 87, 527–544. [Google Scholar] [CrossRef]
- Su, L.; Jin, S. Profile quasi-maximum likelihood estimation of partially linear spatial autoregressive models. J. Econom. 2010, 157, 18–33. [Google Scholar] [CrossRef]
- Bellman, R.E. Adaptive Control Processes; Princeton University Press: Princeton, NJ, USA, 1961. [Google Scholar]
- Hastie, T.J.; Tibshirani, R.J. Varying-coefficient models. J. R. Stat. Soc. Ser. B 1993, 55, 757–796. [Google Scholar] [CrossRef]
- Brunsdon, C.; Fotheringham, A.S.; Charlton, M.E. Geographically weighted regression: A method for exploring spatial nonstationarity. Geogr. Anal. 1996, 28, 281–298. [Google Scholar] [CrossRef]
- Mu, J.R.; Wang, G.N.; Wang, L. Estimation and inference in spatially varying coefficient models. Environmetrics 2018, 29, e2485. [Google Scholar] [CrossRef]
- Gelfand, A.E.; Kim, H.J.; Sirmans, C.F.; Banerjee, S. Spatial modeling with spatially varying coefficient processes. J. Am. Stat. Assoc. 2003, 98, 387–396. [Google Scholar] [CrossRef]
- Li, S.S.; Chen, J.B.; Chen, D.Q. PQMLE and Generalized F-Test of Random Effects Semiparametric Model with Serially and Spatially Correlated Nonseparable Error. Fractal Fract. 2024, 8, 386. [Google Scholar] [CrossRef]
- de Boor, C. A Practical Guide to Splines; Springer: New York, NY, USA, 1978. [Google Scholar]
- Chen, Z.Y.; Chen, J.B. Bayesian analysis of partially linear, single-index, spatial autoregressive models. Comput. Stat. 2021, 37, 327–353. [Google Scholar] [CrossRef]
- Eilers, P.H.C.; Marx, B.D. Flexible smoothing with B-splines and penalties. Stat. Sci. 1996, 11, 89–121. [Google Scholar] [CrossRef]
- Green, P. Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika 1995, 82, 711–732. [Google Scholar] [CrossRef]
- Holmes, C.C.; Mallick, B.K. Generalized nonlinear modeling with multivariate free-knot regression splines. J. Am. Stat. Assoc. 2003, 98, 352–368. [Google Scholar] [CrossRef]
- Wang, H.-B. Bayesian estimation and variable selection for single index models. Comput. Stat. Data Anal. 2009, 53, 2617–2627. [Google Scholar] [CrossRef]
- Sun, Y.; Yan, H.J.; Zhang, W.Y.; Lu, Z. A Semiparametric spatial dynamic model. Ann. Stat. 2014, 42, 700–727. [Google Scholar] [CrossRef]
- Xu, X.B.; Lee, L.F. A spatial autoregressive model with a nonlinear transformation of the dependent variable. J. Econom. 2015, 186, 1–18. [Google Scholar] [CrossRef]
- Hoshino, T. Semiparametric spatial autoregressive models with endogenous regressors: With an application to crime data. J. Bus. Econ. Stat. 2017, 36, 160–172. [Google Scholar] [CrossRef]
- Liu, T.; Xu, D.K.; Ke, S.Q. A Semiparametric Bayesian Approach to Heterogeneous Spatial Autoregressive Models. Entropy 2024, 26, 498. [Google Scholar] [CrossRef]
- Koenker, R.; Bassett, G. Regression quantiles. Econometrica 1978, 46, 33–50. [Google Scholar] [CrossRef]
- Dai, X.W.; Li, S.Y.; Tian, M.Z. Quantile regression for partially linear varying coefficient spatial autoregressive models. Commun. Stat.-Simul. Comput. 2016, 53, 4396–4411. [Google Scholar] [CrossRef]
- Yu, K.; Moyeed, R.A. Bayesian quantile regression. Stat. Probab. Lett. 2001, 54, 437–447. [Google Scholar] [CrossRef]
- Yuan, Y.; Yin, G.S. Bayesian quantile regression for longitudinal studies with nonignorable missing data. Biometrics 2010, 66, 105–114. [Google Scholar] [CrossRef] [PubMed]
- Bernardi, M.; Gayraud, G.; Petrella, L. Bayesian tail risk interdependence using quantile regression. Bayesian Anal. 2015, 10, 553–603. [Google Scholar] [CrossRef]
- Alhamzawi, R.; Yu, K.M. Variable selection in quantile regression via Gibbs sampling. J. Appl. Stat. 2012, 39, 799–813. [Google Scholar] [CrossRef]
- Yang, Y.W.; He, X.M. Bayesian empirical likelihood for quantile regression. Ann. Stat. 2012, 40, 1102–1131. [Google Scholar] [CrossRef]
- Lee, D.; Neocleous, T. Bayesian quantile regression for count data with application to environmental epidemiology. J. R. Stat. Soc. Ser. C 2010, 59, 905–920. [Google Scholar] [CrossRef]
- King, C.; Song, J.J. Bayesian spatial quantile regression for areal count data, with application on substitute care placements in Texas. J. Appl. Stat. 2019, 46, 580–597. [Google Scholar] [CrossRef]
- Lum, K.; Gelfand, A.E. Spatial quantile multiple regression using the asymmetric Laplace process. Bayesian Anal. 2012, 7, 235–258. [Google Scholar] [CrossRef]
- Reich, B.J.; Fuentes, M.; Dunson, D.B. Bayesian spatial quantile regression. J. Am. Stat. Assoc. 2012, 106, 6–20. [Google Scholar] [CrossRef]
- Castillo-Mateo, J.; Asín, J.; Cebrían, A.C.; Gelfand, A.E.; Abaurrea, J. Spatial quantile autoregression for season within year daily maximum temperature data. Ann. Appl. Stat. 2023, 17, 2305–2325. [Google Scholar] [CrossRef]
- Chen, X.; Tokdar, S.T. Joint quantile regression for spatial data. J. R. Stat. Soc. Ser. Stat. Methodol. 2021, 83, 826–852. [Google Scholar] [CrossRef]
- Castillo-Mateo, J.; Gelfand, A.E.; Asín, J.; Cebrían, A.C.; Abaurrea, J. Bayesian joint quantile autoregression. Test 2024, 33, 335–357. [Google Scholar] [CrossRef]
- Chen, Z.Y.; Chen, M.H.; Ju, F.Y. Bayesian P-splines quantile regression of partially linear varying coefficient spatial autoregressive models. Symmetry 2022, 14, 1175. [Google Scholar] [CrossRef]
- Kozumi, H.; Kobayashi, G. Gibbs sampling methods for bayesian quantile regression. J. Stat. Comput. Simul. 2011, 81, 1565–1578. [Google Scholar] [CrossRef]
- Dagnapur, J.S. An easily implemented generalised inverse Gaussian generator. Commun. Stat.-Simul. Comput. 1989, 18, 703–710. [Google Scholar]
- Hastings, W.K. Monte Carlo sampling methods using Markov chains and their applications. Biometrika 1970, 57, 97–109. [Google Scholar] [CrossRef]
- Metropolis, N.; Rosenbluth, A.W.; Rosenbluth, M.N.; Teller, A.H.; Teller, E. Equations of state calculations by fast computing machine. J. Chem. Phys. 1953, 21, 1087–1091. [Google Scholar] [CrossRef]
- Tanner, M.A. Tools for Statistical Inference: Methods for the Exploration of Posterior Distributions and Likelihood Functions, 2nd ed.; Springer: New York, NY, USA, 1993. [Google Scholar]
- Liu, J.S. The collapsed Gibbs sampler in Bayesian computations with applications to a gene regulation problem. J. Am. Stat. Assoc. 1994, 89, 958–966. [Google Scholar] [CrossRef]
- Poon, W.-Y.; Wang, H.-B. Bayesian analysis of generalized partially linear single-index models. Comput. Stat. Data Anal. 2013, 68, 251–261. [Google Scholar] [CrossRef]
- Chen, M.-H.; Schmeiser, B.W. General hit-and-run Monte Carlo sampling for evaluating multidimensional integrals. Oper. Res. Lett. 1996, 19, 161–169. [Google Scholar] [CrossRef]
- LeSage, P.J.; Pace, R.K. Introduction to Spatial Econometrics; CRC Press: Boca Raton, FL, USA; London, UK; New York, NY, USA, 2009. [Google Scholar]
- Gelman, A.; Rubin, D.B. Inference from iterative simulation using multiple sequences. Stat. Sci. 1992, 7, 457–511. [Google Scholar] [CrossRef]
- Biv, R.; Nowosad, J.; Lovelace, R. spData: Datasets for Spatial Analysis. R Package Version 2.3.4. 2025. Available online: https://CRAN.R-project.org/package=spData (accessed on 1 May 2025).
- Yan, Y.F.; Zheng, X.T.; Kottas, A. A New Family of Error Distributions for Bayesian Quantile Regression. Bayesian Anal. 2025, 1, 1–29. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).