4.1. Descriptive Statistics and Correlation Analysis
Table 2 reports the descriptive statistics for the main variables. The final sample contains 20,836 county-year observations from 2013 to 2024, indicating an unbalanced county-level panel. The mean value of
irg is 0.284, with a standard deviation of 0.117, suggesting substantial variation in inclusive regional growth across counties. The average value of
road is 3.824, and its standard deviation is 0.671, indicating clear cross-county differences in rural road network density. The mean value of
gap is 2.456, showing that the urban–rural income gap remains a salient feature of county-level development.
The descriptive statistics also show meaningful variation in the mechanism and control variables. The mean values of lab, mkt, and ind are 0.612, 9.351, and 10.452, respectively, suggesting that counties differ considerably in labor mobility, market access, and non-agricultural industrial development. The control variables also display sufficient variation across counties and years. Overall, the sample provides an appropriate empirical basis for estimating the effect of rural road network density on inclusive regional growth.
To provide a more intuitive description of the temporal evolution of the core variables,
Figure 1 plots the annual sample averages of
irg and
road from 2013 to 2024. To make the two series directly comparable despite differences in units and scales, both variables are normalized to 2013 = 100. The figure shows that both inclusive regional growth and rural road network density exhibit upward trends during the sample period, although inclusive regional growth increases more rapidly than rural road network density.
Table 3 reports the multicollinearity diagnosis for the explanatory variables used in the baseline specification. The maximum VIF value is 2.53 for
urb, followed by 2.21 for
agr. Prior transportation and safety studies have commonly used VIF-based diagnostics to assess multicollinearity and have treated values above conventional thresholds, such as 7.5 or 10, as potential signals of problematic collinearity [
28,
29]. In this study, all VIF values are far below these commonly used thresholds, and the mean VIF is 1.70. In particular, the VIF value of the core explanatory variable
road is only 1.15, indicating that rural road network density is not highly collinear with the control variables. These results suggest that multicollinearity is not a serious concern in the empirical analysis.
4.2. Baseline Effects of Rural Road Network Density on Inclusive Regional Growth
Table 4 reports the baseline effects of rural road network density on inclusive regional growth. Column (1) includes county fixed effects and year fixed effects only. The coefficient of
road is 0.094 and is significant at the 1% level, indicating a positive association between rural road network density and
irg. After adding the full set of control variables in Column (2), the coefficient decreases to 0.078 but remains statistically significant at the 1% level. Based on the standard deviation of
road reported in
Table 2, a one-standard-deviation increase in rural road network density is associated with an increase of approximately 0.052 in
irg, which is economically meaningful relative to the sample mean of 0.284.
Column (3) further controls for province-specific linear time trends. The coefficient of road remains positive and significant, suggesting that the baseline result is not driven by differential provincial development trajectories. Column (4) uses a balanced panel sample, and the estimated coefficient is 0.082, again significant at the 1% level. This confirms that the main result is not sensitive to the unbalanced structure of the sample. Overall, the baseline results provide strong evidence that rural road network density promotes inclusive regional growth at the county level.
The control variables are generally consistent with expectations. Urbanization, fixed asset investment, education expenditure, and financial development are positively associated with irg, while agricultural dependence is negatively associated with irg. These results suggest that inclusive regional growth is shaped not only by rural road network density but also by broader socioeconomic and structural development conditions.
4.4. Mechanism Analysis
This subsection examines whether labor reallocation toward non-agricultural employment, market access, and non-agricultural industrial development help explain the relationship between rural road network density and inclusive regional growth. The mechanism results are reported in the mechanism-analysis table. Columns (1)–(3) first test whether rural road network density affects the proposed mechanism variables, while Columns (4)–(6) add each mechanism variable to the inclusive-growth regression.
Columns (1)–(3) show that road is positively associated with all three mechanism variables. In Column (1), the coefficient of road is 0.034 and is significant at the 1% level, indicating that higher rural road network density increases lab. Because lab is measured as the share of employment in secondary and tertiary industries, this result should be interpreted as evidence that better rural road conditions are associated with labor reallocation toward non-agricultural employment, rather than as direct evidence of worker movement, commuting behavior, or migration. In Column (2), the coefficient of road is 0.061 and is significant at the 1% level, showing that higher rural road network density improves mkt. This result is consistent with the market-access mechanism, as rural roads reduce transport costs and strengthen connections between villages, townships, county seats, and consumer markets. In Column (3), road has a positive and significant coefficient of 0.042, indicating that higher rural road network density promotes ind, namely county-level non-agricultural industrial development. Columns (4)–(6) further show that the three mechanism variables are positively associated with irg. The coefficients of lab, mkt, and ind are 0.215, 0.142, and 0.118, respectively, and all are significant at the 1% level. After these mechanism variables are added, the coefficient of road remains positive and statistically significant, but its magnitude declines from 0.078 in the baseline model to 0.056, 0.048, and 0.051, respectively. This pattern suggests that labor reallocation toward non-agricultural employment, market access, and non-agricultural industrial development partially explain the inclusive-growth effect of rural road network density. The results support Hypotheses 3a–3c. Rural road network density promotes inclusive regional growth not only by improving physical connectivity, but also by supporting employment-structure adjustment toward non-agricultural sectors, strengthening market access, and expanding county-level non-agricultural economic activity.
Because the three mechanism variables may be correlated with each other, we further add a joint mechanism regression in Column (7) of
Table 6. In this specification,
lab,
mkt, and
ind are included simultaneously. The results show that all three mechanism variables remain positively associated with inclusive regional growth. Specifically, the coefficient of
lab is 0.124 and significant at the 1% level, the coefficient of
mkt is 0.083 and significant at the 1% level, and the coefficient of
ind is 0.055 and significant at the 5% level. The coefficient of
road remains positive and statistically significant, with a coefficient of 0.038. Compared with the separate mechanism regressions, the joint regression shows that the estimated coefficients of the three mechanism variables decline in magnitude but remain statistically significant, suggesting that each pathway provides distinct explanatory content after conditioning on the other two channels. These results indicate that labor reallocation toward non-agricultural employment, market access, and industrial development are all empirically relevant channels linking rural road network density to inclusive regional growth. Among the three variables,
lab has the largest conditional coefficient, followed by
mkt and
ind. However, because the mechanism variables are measured on different scales and may still be jointly determined with inclusive regional growth, this comparison should be interpreted as relative empirical relevance rather than as a definitive causal ranking. Therefore, the mechanism results should still be understood as channel-consistent evidence rather than as a strict causal decomposition.
To further assess the statistical significance of the proposed pathways, we conduct bootstrap indirect-effect tests, and the results are reported in
Appendix A Table A3. The estimated indirect effect through
lab is 0.0073, with a bootstrap 95% confidence interval of [0.0028, 0.0135]. The estimated indirect effect through
mkt is 0.0087, with a bootstrap 95% confidence interval of [0.0034, 0.0152]. The estimated indirect effect through
ind is 0.0050, with a bootstrap 95% confidence interval of [0.0019, 0.0094]. Since the confidence intervals do not include zero, the bootstrap results provide additional support for the labor reallocation, market access, and industrial development pathways. At the same time, we interpret these results cautiously. The bootstrap indirect-effect test improves the statistical assessment of the proposed pathways, but it does not fully eliminate potential endogeneity of the mechanism variables. Labor reallocation, market access, and industrial development may themselves be affected by inclusive regional growth. Therefore, the mechanism analysis should be understood as evidence consistent with the proposed channels rather than as a strict causal decomposition of the total effect.
4.5. Robustness Checks and Endogeneity Tests
Table 7 reports a set of robustness checks. Column (1) replaces the baseline road-density measure with
rper, which measures rural road length per 10,000 residents. The coefficient remains positive and significant at the 1% level, indicating that the main conclusion is not sensitive to the measurement of rural road network density. Column (2) uses
lrod, the one-year lag of rural road network density, and the coefficient is still positive and significant. This result suggests that the association between rural road network density and inclusive regional growth is not limited to a purely contemporaneous relationship. However, we do not interpret the lagged specification as a sufficient solution to reverse causality, because road construction may reflect long-term development strategies and anticipated future growth. Columns (3) and (4) further examine whether the baseline result is affected by extreme values or metropolitan spillovers. After applying 1% and 99% winsorization, the coefficient of the road variable remains positive and significant. After excluding counties adjacent to provincial capitals and municipalities directly under the central government, the coefficient also remains positive and significant. These results show that the baseline finding is robust to alternative road measures, lagged specifications, outlier treatment, and sample exclusion. Nevertheless, because economically stronger counties may have greater capacity to finance and expand rural roads, potential reverse causality remains an important econometric concern. For this reason, the instrumental-variable analysis is used as the main endogeneity test rather than relying on the lagged road specification alone.
We further examine whether the baseline conclusion depends on the inclusion of nighttime-light intensity in the inclusive regional growth index. Nighttime-light data are useful for supplementing official economic statistics, but their measurement properties may differ between urban and rural areas. Therefore, we reconstruct an alternative inclusive regional growth index,
irgn, using only real GDP per capita, rural income, and the reverse-coded urban–rural income gap. The results are reported in
Table 7, Column (5). The coefficient of
road remains positive and statistically significant, with a coefficient of 0.081, when
irgn is used as the dependent variable. This finding indicates that the positive relationship between rural road network density and inclusive regional growth is not driven by the nighttime-light component of the baseline index.
We further estimate spatial econometric models to examine whether the baseline conclusion is affected by spatial dependence. The results are reported in
Table 8. In the SAR specification, the coefficient of
road is 0.0692 and remains statistically significant at the 1% level. In the SEM specification, the coefficient of
road is 0.0743 and remains statistically significant at the 1% level. In the SDM specification, the coefficient of
road is 0.0654 and remains statistically significant at the 1% level. The spatial lag parameters in the SAR and SDM models are 0.1935 and 0.1741, respectively, and both are significant at the 1% level. The spatial error parameter in the SEM model is 0.2148 and significant at the 1% level. These results confirm the presence of spatial dependence in county-level inclusive regional growth.
The decomposed effects further support the robustness of the main conclusion. The direct effects of road are 0.0718 in the SAR model, 0.0743 in the SEM model, and 0.0682 in the SDM model, all significant at the 1% level. The indirect effect is 0.0184 in the SAR model and 0.0315 in the SDM model, both significant at the 5% level. In the SDM specification, the spatially lagged road variable is also positive and significant, with a coefficient of 0.0213. These findings suggest that rural road network density not only promotes inclusive regional growth within the focal county but may also generate positive spillover effects on neighboring counties through cross-county connectivity. Overall, the SAR, SEM, and SDM results support the main conclusion that rural road network density promotes inclusive regional growth, even after accounting for spatial spillovers.
We further conduct a dynamic panel system GMM robustness test to account for persistence in inclusive regional growth and potential dynamic endogeneity. The results are reported in
Appendix A Table A2. The coefficient of the lagged dependent variable is positive and statistically significant across all specifications, confirming the persistence of county-level inclusive regional growth, economic output, and the urban–rural income gap. More importantly, the coefficient of
road remains positive and statistically significant for
irg and
pgdp, with coefficients of 0.0438 and 0.0573, respectively. The coefficient of
road is −0.0492 when the dependent variable is
gap, indicating that rural road network density continues to reduce the urban–rural income gap after dynamic endogeneity is considered. All three coefficients are significant at the 1% level.
The diagnostic tests support the validity of the dynamic panel specification. The AR(1) tests are significant, as expected in first-differenced residuals, while the AR(2) tests do not reject the null hypothesis of no second-order serial correlation. The AR(2) p-values are 0.318, 0.247, and 0.403 for irg, pgdp, and gap, respectively. The Hansen test also does not reject the validity of the instrument set, with p-values of 0.294, 0.381, and 0.216. To reduce instrument proliferation, the instrument matrix is collapsed and the lag depth is restricted; the number of instruments is 68, which is much smaller than the number of counties. Overall, the system GMM results provide additional evidence that rural road network density promotes inclusive regional growth after accounting for dynamic endogeneity.
Table 9 reports the instrumental-variable estimation results. Column (1) presents the first-stage regression. The coefficient of
sltr is −0.018 and is significant at the 1% level. Since
sltr is defined as the interaction between county mean terrain slope and a linear time trend, a one-unit increase in
sltr represents a one-unit greater slope–time interaction. Economically, this means that steeper counties became increasingly exposed over time to terrain-related constraints on rural road expansion. The estimated first-stage coefficient of −0.018 indicates that, conditional on county fixed effects, year fixed effects, and time-varying controls, counties with stronger slope-related construction constraints experienced slower growth in rural road network density over time. This is consistent with the interpretation that terrain slope raises engineering difficulty and construction costs, thereby limiting the pace of rural road expansion.
To provide a more complete assessment of instrument relevance, we report not only the Kleibergen–Paap rk Wald F statistic, but also the first-stage , the partial of the excluded instrument, and Shea’s partial . The first-stage is 0.5937, indicating that the first-stage model explains a substantial share of the variation in rural road network density when fixed effects and controls are included. The partial of the excluded instrument is 0.0023, and Shea’s partial is also 0.0023, indicating that the excluded instrument provides modest but nonzero incremental explanatory power after accounting for county fixed effects, year fixed effects, and time-varying socioeconomic controls. The Kleibergen–Paap rk Wald F statistic is 34.62, which is above the conventional threshold of 10. Taken together, these diagnostics suggest that the instrument is statistically relevant, although the IV results should still be interpreted cautiously and as complementary evidence. Columns (2)–(4) report the second-stage results. The fitted value of rural road network density, , has a positive and statistically significant effect on irg, with a coefficient of 0.112. This estimate is larger than the baseline two-way fixed-effects estimate of 0.078. We interpret this difference cautiously. One possible explanation is that measurement error in county-level rural road statistics may attenuate the fixed-effects estimate, while the instrumental-variable approach partially corrects this attenuation by using terrain-induced variation in rural road expansion. Another possible explanation is that the IV estimate captures a local average treatment effect for counties whose rural road expansion is more strongly constrained by terrain conditions. These counties are likely to be relatively less connected and may experience larger marginal inclusive-growth gains from additional rural road density. Therefore, the IV estimate should not be interpreted simply as a uniformly larger average effect for all counties, but as complementary evidence based on exogenous terrain-related variation in road-density growth. The coefficient of is also positive and significant for pgdp, while it is negative and significant for gap. These results indicate that rural road network density promotes county-level economic growth and reduces the urban–rural income gap. Overall, the robustness checks and instrumental-variable estimates provide consistent evidence that the main findings are not driven by variable measurement, sample composition, extreme observations, or potential endogeneity.
Because the instrumental-variable model uses one excluded instrument for one endogenous variable, it is exactly identified. Therefore, a standard overidentification test is not applicable. We also do not use historical rural road networks as an additional instrument because consistent historical county-level rural road data harmonized to the 2013 county boundary are not available for the full national sample. For this reason, the revised manuscript focuses on strengthening the exclusion-restriction discussion and reporting additional first-stage diagnostics. The IV results should therefore be interpreted cautiously as complementary evidence based on terrain-related variation in rural road expansion.
4.6. Heterogeneity and Threshold Analysis
Table 10 reports the heterogeneous effects of rural road network density across different regional and development conditions. Columns (1) and (2) divide the sample into eastern counties and central-western counties. The coefficient of
road is positive in both subsamples, but the estimated coefficient is larger in the central-western sample. Specifically, the coefficient is 0.038 in eastern counties and 0.095 in central-western counties. This pattern suggests that rural road network density generates larger inclusive-growth gains in regions where development constraints and transport bottlenecks are more binding.
Columns (3) and (4) divide counties according to their initial rural road network density in 2013. The coefficient of road is 0.104 for counties with low initial road, compared with 0.045 for counties with high initial road. This indicates that the marginal effect of rural road network density is larger in counties with weaker initial road accessibility. In other words, increasing rural road network density appears to be more valuable when the existing rural road network is relatively underdeveloped.
Columns (5) and (6) further divide the sample according to the initial level of economic development. The coefficient of road is 0.098 for counties with low initial pgdp, while it is 0.041 for counties with high initial pgdp. This result suggests that less-developed counties benefit more from rural road network density in terms of inclusive regional growth. Overall, the heterogeneity results are consistent with Hypothesis 4. rural road network density has larger estimated effects in central-western counties, low-accessibility counties, and less-developed counties, indicating that improving rural road conditions can serve as an important policy instrument for reducing spatial development disadvantages and promoting balanced regional development.
To further interpret the heterogeneity results, we estimate a panel threshold model using rural road network density as the threshold variable. The results are reported in
Table 11. The estimated threshold value of
road is 3.8164, and the bootstrap
p-value for the threshold effect is 0.006, indicating statistically significant nonlinear threshold-type heterogeneity. When
road is below the estimated threshold, the coefficient of
road is 0.0962 and is statistically significant at the 1% level. When
road is above the threshold, the coefficient is 0.0437 and is also statistically significant at the 1% level. These results indicate that rural road network density promotes inclusive regional growth in both regimes, but the estimated effect is much larger in the low-road-density regime. Therefore, the stronger effects found in central-western counties, low-accessibility counties, and less-developed counties are more consistent with diminishing marginal returns than with a strict activation-threshold effect. In other words, rural roads do not appear to promote inclusive growth only after reaching a minimum density level; rather, the marginal benefits are larger where initial road access is weaker and development constraints are more binding. This interpretation is consistent with the idea that underdeveloped and less-connected counties have more room to benefit from additional rural road connectivity.