1. Introduction
Over the past two decades, China and other large emerging economies such as India and Brazil have experienced an unprecedented wave of digital transformation. Massive investments in broadband networks, data centers, cloud platforms, and mobile communication systems have reshaped industrial structures, spatial linkages, and modes of service delivery [
1,
2,
3]. Digital infrastructure is often expected to narrow information gaps and improve access to economic opportunities. Yet, growing empirical evidence suggests that digitalization may also amplify pre-existing inequalities by disproportionately benefiting urban, capital-intensive, and high-skilled sectors [
4,
5]. This ambiguity raises an important policy question for emerging economies: Does digital infrastructure function as an equalizer or a divider?
Despite rising scholarly interest, the existing literature remains inadequate in several respects. First, most studies rely on national or provincial data, which obscure the substantial city-level spatial heterogeneity through which digitalization affects income inequality [
6,
7]. Second, prior work typically focuses on a single channel, such as innovation upgrading or labor-market polarization, without integrating multiple mechanisms into a unified empirical framework [
8]. Third, although scholars increasingly acknowledge spatial spillovers, few examine whether the digitalization of neighboring cities can offset local inequality, a possibility highly relevant for regionally interconnected economies [
9]. Finally, conventional measures of digital infrastructure emphasize coverage or output indicators, but seldom capture the policy-driven, infrastructure-specific dynamics that shape the timing, intensity, and strategic priorities of digital deployment across cities [
10].
To motivate our empirical investigation,
Figure 1 illustrates the long-run evolution of China’s urban–rural income inequality and digital infrastructure development between 2007 and 2023. The Theil index steadily declined from 0.118 in 2007 to 0.045 in 2023, whereas the digital infrastructure index, constructed from more than 5000 official local government work reports, rose sharply, particularly following major national initiatives such as the Broadband China Strategy (2013) and the “Internet+” Action Plan (2015). This mirror-shaped pattern suggests that, despite an observable narrowing in headline income ratios, the rapid expansion of digital infrastructure may be reshaping more profound structural inequalities. Whether digitalization mitigates or reinforces these underlying disparities at the city level remains empirically unresolved.
To address these gaps, this study investigates the distributional consequences of digital infrastructure using a balanced panel of 285 prefecture-level cities from 2007 to 2023. We construct a novel text-based index of digital infrastructure from government work reports and employ fixed-effects models, nonlinear specifications, spatial dependence diagnostics, and Spatial Durbin Models (SDMs) to identify both direct and interregional effects. This empirical strategy enables a comprehensive examination of how digitalization affects inequality within cities and through spatial spillovers across cities. Based on these aims, the study addresses four core research questions:
RQ1: Does digital infrastructure significantly widen the Urban–rural income gap at the city level?
RQ2: Do agricultural-innovation crowding-out and wage-structure polarization jointly mediate this effect?
RQ3:Can neighboring cities’ digitalization generate spatial spillovers that offset local inequality?
RQ4: Under what institutional and market conditions does digitalization’s inequality effect weaken or reverse?
This study makes four key contributions. First, it provides the first long-horizon, city-level evidence on the inequality consequences of digital infrastructure in a large emerging economy. Second, it jointly assesses multiple mechanisms, innovation crowding-out and labor-structure polarization, within a unified identification framework. Third, it uncovers spatial offsetting effects. The paper proceeds as follows:
Section 2 reviews related literature and develops hypotheses;
Section 3 creates the theoretical framework and hypotheses;
Section 4 details data and variables;
Section 5 shows baseline, nonlinear, and mechanism results;
Section 6 tests mechanisms;
Section 7 covers spatial results;
Section 8 and
Section 9 conclude with policy implications for inclusive and regional digital development.
2. Literature Review
Urban–rural income inequality has long been a defining structural challenge in large emerging economies. Existing studies show that disparities in human capital, industrial structure, and access to public services jointly shape income divergence between cities and rural areas, and that these gaps tend to widen during periods of rapid transformation [
11,
12]. With the rapid expansion of digital technologies, scholars have increasingly examined whether digital infrastructure reduces or amplifies such persistent inequalities. A systematic review shows that the literature falls into three broad categories: (1) digitalization and distributional outcomes, (2) digitalization-driven mechanisms affecting inequality, and (3) spatial spillovers in digital development.
2.1. Digitalization and Distributional Outcomes
Digital infrastructure, including broadband networks, mobile communications, and data platforms, is broadly acknowledged as a catalyst for productivity, innovation, and equitable access to information [
13,
14]. However, growing empirical evidence indicates that the allocation of digital benefits is uneven across regions. Recent work highlights that technology adoption is shaped by organizational culture, institutional quality, and local economic development, implying that digitally advanced cities disproportionately capture the gains from digital upgrading [
15]. These findings are consistent with studies emphasizing that digitalization can reinforce pre-existing socioeconomic disparities, particularly the urban–rural divide [
16].
Theoretical research also offers contrasting insights. Endogenous growth theory posits that digital technologies enhance knowledge accumulation and innovation [
17], while new economic geography suggests that such technologies may intensify spatial disparities by concentrating economic activity and high-skilled labor in urban centers [
18,
19]. Classic perspectives, such as Kuznets’ inverted-U hypothesis [
20], imply nonlinearities in development processes [
20]. In the digital era, these nonlinearities may be amplified: regions with limited digital access may fall further behind, while digitally advanced regions may experience accelerated returns [
21,
22]. Overall, the literature indicates that without supportive institutions and inclusive digital governance, digital infrastructure may widen spatial inequality.
2.2. Mechanisms Linking Digitalization to Income Inequality
Beyond documenting its aggregate distributional effects, a second strand of literature examines how digitalization reshapes economic structures and labor markets, thereby influencing income inequality. Two prominent mechanisms are frequently discussed.
2.2.1. Innovation Concentration and Crowding-Out Effects
Schumpeterian growth theory highlights the cumulative nature of technological progress: digitally advanced cities attract skilled labor, R&D investment, and high-tech industries, strengthening their innovation ecosystems [
23]. At the same time, rural regions often lack the human capital and institutional capacity needed to integrate digital technologies effectively. Structural transformation theories suggest that digitalization accelerates the shift of economic resources from agriculture and traditional sectors toward urban and high-tech industries [
24]. Empirical studies confirm that expansion of digital infrastructure increases urban patenting activity, while innovation in less developed rural regions stagnates or declines [
25]. This spatial reallocation of innovation resources may weaken rural productivity growth, reinforcing the income gap.
2.2.2. Wage Structure Polarization
The second mechanism relates to labor-market dynamics driven by skill-biased technological change (SBTC). Digital technologies support high-skilled work but replace routine, low-skilled tasks [
26]. As digital infrastructure advances, the returns to digital literacy and cognitive skills increase, particularly in urban sectors such as ICT, finance, and advanced manufacturing [
27]. Low-skilled rural workers, by contrast, face displacement risks, limited digital training opportunities, and restricted access to urban labor markets due to institutional constraints [
28]. The resulting wage polarization widens income differentials between skill groups and between urban and rural regions.
Together, these studies demonstrate that digitalization can generate multi-layered structural pressures that reinforce pre-existing inequalities.
2.3. Spatial Spillovers in Digital Development
A third strand of literature emphasizes that digital development is intrinsically spatial: technological capabilities, labor mobility, and knowledge diffusion extend beyond administrative boundaries. Classical regional economics highlights the role of geographical proximity, industrial clustering, and shared supply chains in generating cross-regional externalities [
29]. In the digital era, these spillover channels expand through online service platforms, interoperable digital systems, and regional data networks [
30]. Empirical analyses increasingly confirm that digital infrastructure exhibits strong inter-city externalities, reducing regional disparities when neighboring areas undergo parallel digital upgrading [
31].
In the context of urban–rural inequality, such spillovers can either mitigate or reinforce the inequality-enhancing effects of local digitalization. As surrounding cities improve digital accessibility, rural regions may benefit from shared digital services, integrated labor markets, and the diffusion of technology. Conversely, fragmented regional development can limit these benefits. Understanding the spatial dimension of digital infrastructure is therefore crucial for accurately assessing its distributional consequences.
Despite substantial progress, three significant gaps remain in the existing literature:
- (1)
Limited city-level empirical evidence. Most studies rely on provincial or national data, leaving insufficient micro-spatial evidence on how digital infrastructure affects urban–rural inequality within cities.
- (2)
Lack of an integrated multi-mechanism framework. Prior studies examine isolated channels (e.g., digital divide or SBTC) but do not integrate innovation dynamics, labor-market polarization, and spatial spillovers into a unified analytical framework.
- (3)
Insufficient attention to spatial interdependencies. The literature increasingly acknowledges cross-city spillovers, yet few studies incorporate spatial econometric modeling to identify how digital development in neighboring regions influences local inequality.
3. Theoretical Analysis
Building on the gaps identified earlier, this section explores the theoretical mechanisms through which digital infrastructure influences the urban–rural income gap. While the literature offers fragmented insights, a unified theoretical framework remains lacking. Accordingly, this section synthesizes the core mechanisms into three pathways: direct productivity effects, indirect structural effects, and spatial spillovers, and derives the corresponding testable hypotheses.
3.1. Direct Impact: The Role of Digital Infrastructure in Urban–Rural Inequality
The direct impact of digital infrastructure on income inequality arises primarily from its differential effects on regional productivity. Digital infrastructure, including broadband internet, mobile networks, and data centers, enhances economic activities by facilitating access to information, markets, and services. However, its impact is far from uniform. Urban regions, which already possess better access to human capital, financial markets, and institutional capacity, tend to capture the majority of these productivity benefits.
Digital infrastructure acts as a complementary factor in urban economies where high-skilled labor and technological capabilities are abundant. This increases the return on digital investment, driving economic growth in urban centers [
32]. Conversely, rural areas, characterized by lower levels of digital access and fewer resources, are less able to capitalize on these advancements fully [
15]. Thus, the introduction of digital infrastructure may exacerbate spatial inequalities by disproportionately benefiting urban economies, leaving rural regions behind. Moreover, the productivity effects of digital infrastructure may be nonlinear. In the early stages, digital expansion may generate modest gains in both the urban and rural sectors. As digital development advances, however, its complementarities with skilled labor, innovation capacity, and institutional readiness grow stronger, indicating a potentially convex relationship between digitalization and inequality. This possibility justifies the nonlinear model tested in the empirical analysis.
Given the diverse economic structures across urban and rural regions, the direct influence of digital infrastructure is likely to widen the income gap between them. This mechanism is consistent with arguments in new economic geography, where technological advancements tend to concentrate economic activity in areas with stronger digital and human capital foundations, thereby amplifying pre-existing disparities. Accordingly, this research proposes:
H1. Digital infrastructure widens the urban–rural income gap by disproportionately benefiting urban areas, which have greater access to complementary resources, such as skilled labor and capital.
3.2. Indirect Mechanisms: Innovation Crowding-Out and Wage Polarization
In addition to the direct productivity effects, digital infrastructure influences the urban–rural income gap through indirect mechanisms that shape the allocation of innovation resources and the wage structure within labor markets.
3.2.1. Innovation Crowding-Out Effect
One important indirect mechanism is the crowding-out effect in innovation. Digital infrastructure enhances innovation by lowering the cost of information exchange, research collaboration, and technological development. However, these benefits are not equally distributed across regions. Urban areas, with better access to skilled labor, R&D facilities, and innovation networks, are better equipped to absorb and capitalize on these advancements [
33]. As a result, digitalization may concentrate innovation activities in urban centers, leaving rural areas with limited capacity to generate or absorb new technologies.
This dynamic aligns with Schumpeterian growth theory, which argues that innovation is path-dependent and cumulative. As cities accumulate digital infrastructure, they attract additional innovation capital, skilled labor, and high-tech industries [
34]. In contrast, rural areas, lacking these resources, fall behind, unable to fully integrate digital technologies into their local economies. This redistribution of innovation increases the urban–rural income gap by slowing technological progress in rural regions, particularly in agriculture and traditional industries. Furthermore, this research proposes the following:
H2. Digital infrastructure widens the urban–rural income gap by directing innovation efforts toward cities and pushing agricultural R&D and innovation out of rural areas.
3.2.2. Wage Structure Polarization Effect
Another significant indirect mechanism is wage polarization driven by skill-biased technological change (SBTC) [
35]. As digital infrastructure expands, demand for high-skilled labor increases, especially in industries such as ICT, finance, and advanced manufacturing. At the same time, routine and low-skilled jobs in agriculture and traditional sectors face increasing risks of automation or outsourcing. This technological shift places upward pressure on wages for high-skilled urban workers while simultaneously suppressing wages for low-skilled rural workers [
36].
The polarizing effect is particularly pronounced in urban areas, where high-skilled workers benefit from increased demand for their services. Rural areas, however, experience wage stagnation or decline as their economies remain heavily reliant on low-skill sectors with limited opportunities for digital adaptation. This structural transformation thus amplifies wage disparities between urban and rural regions. Furthermore, rural workers are constrained by institutional factors, including limited access to education, healthcare, and digital skills training. This worsens the wage gap because rural workers cannot acquire the digital skills necessary to meet the increasing demand for high-skilled labor in urban areas’ economies. So, this research proposes:
H3. Digital infrastructure widens the urban–rural income gap by increasing the wage premium for high-skilled labor in urban areas while depressing wages for low-skilled rural workers.
3.3. Spatial Spillover Effects: Regional Integration and Digital Externalities
A third critical mechanism through which digital infrastructure affects income inequality is spatial spillovers. Digital infrastructure, as an inherently networked technology, generates inter-regional externalities that can either mitigate or exacerbate the effects of local digital development [
37]. In this context, neighboring cities or regions that simultaneously undergo digital upgrading can benefit from positive spillovers in several ways. First, digital infrastructure enhances the diffusion of knowledge, enabling rural or less developed regions to access digital services, data, and platforms developed in more advanced urban areas. These spillovers can expand access to education, financial services, healthcare, and other digital services, thereby reducing inequality by improving economic opportunities for rural populations [
38]. Second, labor mobility is facilitated by digital connectivity, enabling rural workers to access job opportunities in urban centers [
39]. This can help bridge the wage gap by giving rural workers access to higher-paying digital jobs in urban areas.
However, the magnitude of these positive spillovers depends on several factors, including geographic proximity, industrial complementarity, and regional coordination. In regions with uneven or poorly integrated digital infrastructure, the potential for positive spillovers is limited, and the benefits may remain concentrated in urban areas [
40]. Thus, spatial spillovers can either attenuate or reinforce local inequalities depending on regional integration and policy coordination. When neighboring cities develop complementary digital ecosystems, the externalities generated can help reduce the income gap between urban and rural areas. Accordingly, this research proposes:
H4. Digital infrastructure development in neighboring cities generates spatial spillovers that reduce the income disparity by improving regional connectivity, labor mobility, and cross-regional knowledge diffusion.
The integrated framework in
Figure 2 outlines three mechanisms, M1 (skill-based technological change), M2 (urban innovation concentration), and M3 (spatial digital spillovers), that explain how digital infrastructure affects the income gap.
M1 (Skill-based technological change) leads to routine job substitution and wage structure polarization, increasing inequality by raising wages for high-skilled urban workers while displacing low-skilled rural workers. M2 (Urban innovation concentration) exacerbates inequality by attracting innovation to urban areas, thereby reducing rural agricultural R&D and widening the income gap. M3 (Spatial digital spillovers) presents cross-regional benefits that can mitigate these effects, reducing the income gap when neighboring regions share digital resources and labor mobility. The balance of these mechanisms determines whether digital infrastructure increases or reduces urban–rural inequality.
4. Research Design
4.1. Data Sources and Model Settings
This study uses a panel of 285 Chinese prefecture-level cities. Covering the years 2007–2023. All variables are harmonized under consistent administrative boundaries as defined by the National Bureau of Statistics of China (NBS), as illustrated in
Table 1. Cities with incomplete income data are linearly interpolated in accordance with NBS conventions. Four directly governed municipalities (Beijing, Shanghai, Tianjin, and Chongqing) are retained as standard prefecture-level units.
4.2. Variable Specification
- (1)
Dependent Variable (DV): Urban–rural Income Inequality
Urban–rural income inequality, measured by the Theil index (
), which allows for decomposition into within- and between-group inequality. This measure captures both the intensity and structure of local income distribution. We use inflation-adjusted disposable incomes of urban and rural households, weighted by population shares. A higher Theil index value indicates a broader income gap. The formula is as follows:
where j = 1, 2 represent urban and rural areas, respectively.
is the total disposable income of city
i in year
t, and
denotes the income for the urban (j = 1) and rural (j = 2) population within that city.
is the total population, and
corresponds to the sizes of the urban and rural populations.
- (2)
Independent Variable (IV): Digital Infrastructure
The key explanatory variable (
) represents the degree of digital infrastructure development. It is derived from local government work reports via text mining and is normalized as the logarithm of the keyword frequency per 10,000 words. The measure covers 51 keywords related to broadband networks, data centers, 5G deployment, and cloud computing (see
Appendix A). This index captures city-level policy attention and the intensity of digitalization implementation.
- (3)
Mediating Variables
To uncover the mechanisms linking digital infrastructure to income inequality, we incorporate two mediating pathways consistent with the theoretical framework:
Innovation Crowding-Out Channel: measured by the ratio of agricultural research and development expenditure to total R&D (AgriR&D). A decline in agrarian R&D intensity indicates that digitalization may divert innovation resources toward urban, high-tech sectors.
Wage Structure Polarization Channel: captured by the standard deviation of average wages across industries (WageGap). This variable reflects how digitalization amplifies skill-biased returns and wage dispersion between high- and low-skill sectors.
- (4)
Control Variables
To isolate the net impact of digital infrastructure, we include control variables covering economic development, industrial structure, fiscal capacity, openness, demographic density, and educational attainment. These variables are standard in regional inequality studies and ensure robustness against omitted-variable bias. All controls are lagged by 1 year to mitigate simultaneity further.
These variables operationalize the theoretical framework empirically, with detailed definitions, units, and signs in
Table 2, complementing the data source overview in
Table 1.
4.3. Model Specification
We estimate a series of econometric models to test the causal relationship between digital infrastructure and Urban–rural income inequality while accounting for spatial interdependence and endogeneity.
(a) Baseline Fixed-Effects Model
To investigate how digital infrastructure affects the urban–rural income gap, this study develops the following baseline model:
where
denotes the Urban–rural income disparity of city
in year
, measured by the Thei index (
is the level of digital infrastructure development.
represents a vector of city-level control variables.
and
are city and year fixed effects; and
is a random error term. If the regression results show that
and statistically significant, it indicates that greater digital infrastructure intensity widens the Urban–rural income gap (supporting H1).
(b) Mediation Models
To analyze how digital infrastructure influences urban–rural income inequality, we adopt a unified two-stage mediation framework in which all mediators are denoted by . In this study, represents either agricultural R&D intensity (the innovation-crowding-out channel) or wage polarization (the labor-market restructuring channel).
Step 1: From Digitalization to Mediators
Step 2: From Mediators to Income
A significant and provides evidence of the corresponding mediating pathway.
(c) Spatial Durbin Model (SDM)
To capture both local and inter-regional interactions, we extend the baseline specification to a Spatial Durbin Model (SDM) that includes spatial lags of both dependent and independent variables:
Here is the spatial-weight matrix (constructed using contiguity or inverse-distance criteria). The parameter measures the spatial autocorrelation of inequality, while captures the indirect spatial effect of neighboring cities’ digitalization. A negative and significant () indicates an offsetting spatial impact, consistent with H4. The inclusion of accounts for potential environmental spillovers associated with control variables.
5. Empirical Results
5.1. Descriptive Statistics and Multicollinearity Diagnostics
Table 3 presents the descriptive statistics for all variables used in the empirical analysis. The mean Theil index is 0.080, indicating moderate but ongoing income disparity. The average digital infrastructure index is 0.0065, reflecting substantial heterogeneity across prefecture-level cities. Considerable variation is also observed in economic and fiscal variables, particularly in Finance and Comm, highlighting significant differences in financial depth and digital service intensity across regions.
To visualize the relationships among variables,
Figure 3 plots the heatmap of pairwise correlations. Theil and Digit show a negative correlation (r = –0.42), suggesting that higher levels of digital infrastructure are generally associated with narrower income gaps. All other coefficients are below 0.7, indicating that multicollinearity is not a serious concern. For additional robustness, the variance inflation factor (VIF) value is reported in
Appendix Table A1, which confirms that the mean VIF is 1.48, well below the critical threshold of 10. To ensure that the panel variables do not suffer from spurious regression, we conduct Fisher-type ADF and PP panel unit-root tests. Results (reported in
Appendix Table A2) confirm that the dependent variable (Theil index) is stationary, while the raw digitalization index is non-stationary in levels. Accordingly, this research uses log-transformed data in all regressions to absorb non-stationary city-specific and time-specific trends.
5.2. Main Empirical Results
Table 4 presents the outcomes of the impact of digital infrastructure on urban–rural income inequality. Across all model specifications, the coefficient of Dgital remains positive and significant, indicating that cities with more intensive digital infrastructure tend to experience larger Theil indices. This result holds when only fixed effects are included (Column 1), when socioeconomic, fiscal, and demographic controls are sequentially added (Column 2), and when alternative clustering strategies at the province and city levels are applied (Columns 3–5). The magnitude of the estimated coefficients declines slightly after controls are incorporated. Still, the significance remains robust, suggesting that omitted variables or clustering choices do not drive the inequality-enhancing effect of digital development.
Regarding the control variables, lngdp has a negative, significant coefficient, indicating that more economically developed cities exhibit narrower income disparities, consistent with convergence theory. Science_exp is positive and significant, suggesting that R&D expenditure may initially benefit high-skill labor or urban industries, thereby widening the Urban–rural divide. Debt also shows a positive, significant association with income inequality, suggesting that fiscal expansion may disproportionately favor urban areas. Population density (ln_popdens) appears to mitigate inequality, while ln_innov is negative and marginally significant, implying that innovation modestly reduces income disparities. Other controls, such as industrial structure, education expenditure, and telecommunications development, show reasonable signs and reinforce the model’s stability.
Across all specifications, the R2 values range from 0.881 to 0.884, and adjusted R2 values remain above 0.87, highlighting the model’s strong explanatory capabilities. Together, these results provide strong, consistent evidence that the expansion of digital infrastructure, despite its economic benefits, tends to intensify Urban–rural income inequality in the absence of accompanying inclusive development mechanisms.
5.3. Robustness Tests
To further establish the credibility of the baseline findings and alleviate reviewers’ concerns about model sensitivity, six complementary robustness tests were conducted, as illustrated in
Table 5.
First, the independent variable was remeasured using an alternative digital infrastructure index (digital_sum). The estimated coefficient remains significantly positive and economically meaningful (0.0001 **, t = 2.18), indicating that the core conclusion is not driven by variable construction.
Second, lag all control variables by one period to mitigate reverse causality and contemporaneous feedback. Even under this stricter criterion, the Digital coefficient stays positive and significant at 5% level (0.6735 **, t = 2.46), supporting the view that digital expansion leads rather than merely correlates with the widening of the income gap.
Third, the sample period 2009–2019 was chosen to exclude major disruptions like the financial crisis and the COVID-19 pandemic. The coefficient remains robust and statistically significant (0.5213 ***, t = 3.06), suggesting that extreme macro-shocks do not drive the baseline effect.
Fourth, provincial capital cities were excluded to address concerns that highly developed, policy-favored cities might disproportionately drive the main results. The estimated effect remains significantly positive (1.0598 **, t = 2.58), indicating that the relationship between digital inequality and outcomes is not limited to large, economically dominant cities.
Fifth, regressions were re-estimated using population weights, acknowledging that digital adoption and income dynamics may differ substantially in more populous cities. The coefficient remains strongly significant (0.6260 **, t = 2.34), confirming that the relationship is robust to alternative weighting schemes and not dominated by small or large cities alone.
Finally, Driscoll-Kraay standard errors are corrected for serial correlation and cross-sectional dependence, common in long city-level panels. Even under this conservative variance estimator, the effect of digital infrastructure remains positive and highly significant (2.9182 ***, t = 8.01), offering strong evidence that statistical dependence across cities does not bias the baseline findings.
Across all six robustness checks, the magnitude of the coefficient on digital infrastructure remains stable (ranging from approximately 0.0001 to 2.91, depending on the measurement scale). These results show that digital infrastructure significantly increases urban–rural income inequality, not due to issues with variable definition, model, sample, or error covariance. Instead, the effect is systematic, persistent, and structurally embedded within China’s digitalization process for regional development.
5.4. Endogeneity Analysis
To address potential endogeneity concerns, such as reverse causality, simultaneity, and omitted variable bias, in the estimated relationship between digital economy development and regional income inequality, we adopt a multi-pronged identification strategy that includes lagged regressors and an instrumental variable (IV) approach.
First, we use the one-period lag of the digital economy index as a predetermined regressor. As illustrated in Column (1) of
Table 6, the coefficient for L.Digit stays positive and highly significant (0.752,
p < 0.01). This aligns with our baseline estimates and suggests that immediate feedback mechanisms are unlikely to be responsible for our results.
Second, we employ two external instruments motivated by the literature on historical infrastructure persistence and geographic technological constraints. Specifically, IV1 (telephone penetration per 100 persons in the early years) captures the initial telecommunications infrastructure endowment that shaped later digital development but is plausibly orthogonal to current inequality after controlling for city- and year-fixed effects. IV2 (terrain ruggedness) reflects exogenous variation in the cost of broadband deployment and influences digital infrastructure expansion but has no direct impact on income inequality, except through the digital economy. The first-stage estimates in Column (2) confirm that both instruments strongly predict digital development (IV1: coefficient = 0.000, t = 0.19; IV2: coefficient = −0.000, t = −4.00). The second-stage results in Column (3) indicate that the instrumented digital economy variable stays positive and is statistically significant (0.047, t = 2.66), providing robust evidence that the digital economy increases regional income disparity even after accounting for potential endogeneity.
Thirdly, to address endogeneity, I conduct a permutation-based placebo test, randomly reassigning the digitalization variable 200 times to break any systematic link to inequality.
Figure 4a presents the distribution of placebo coefficients: the histogram and kernel density curve are tightly centered around zero, while the actual estimated coefficient (β = 0.635) lies far to the right of the placebo distribution, indicating that no random reshuffling of the data can generate an effect of comparable magnitude. Complementing this,
Figure 4b displays the distribution of placebo
p-values, which closely approximates a uniform 0–1 distribution, consistent with pure statistical noise. In contrast, the actual model yields a
p-value well below 0.01 and falls in the extreme left tail. Together, these two figures provide compelling evidence that the estimated effect is not driven by random chance, model artifacts, or unobserved shocks, thereby reinforcing the causal interpretation of digitalization’s impact on income inequality.
5.5. Nonlinear Baseline Model (Digit2)
To explore if digital infrastructure’s impact is nonlinear, we extend the baseline with a quadratic digitalization term. Results are in
Table 7.
Across all specifications, the quadratic term Digit2 is positive and highly significant, while the linear term is statistically insignificant. This pattern indicates a clear convex relationship: at low levels of digital development, digital infrastructure does not significantly affect the Urban–rural income gap, whereas at medium to high levels, the inequality-widening effect accelerates sharply. The emerging convexity indicates that the distributional impacts of digital infrastructure are becoming more concentrated in urban areas with greater absorptive capacity, an advanced industrial structure, and higher human capital intensity. Combined with the baseline estimates in
Table 4, these results imply that digital infrastructure not only widens the income gap on average but does so at an increasing rate as digital development deepens. Most Chinese cities appear to be in a stage where digital infrastructure amplifies structural advantages rather than diffuses benefits evenly across urban and rural areas. This nonlinear pattern highlights the significance of targeted interventions and inclusive digital governance to prevent a deepening digital divide.
Figure 5 illustrates the marginal effect of digital infrastructure on the Theil index based on the nonlinear specification in
Table 7. The marginal effect is calculated as
, using coefficients from the whole model. The results reveal a convex pattern: at lower levels of digital development, the effect on inequality is weak; however, as digital infrastructure intensifies, the marginal impact increases sharply, indicating an accelerating widening of the Urban–rural income gap.
To formally verify whether this convexity reflects a genuine U-shaped relationship, we further applied the U-test method from Lind and Mehlum [
41]. The results reject the existence of a U-shape, as shown in
Appendix Table A3: the estimated turning point (0.0006) lies at the extreme left boundary of the data range, and the overall U-test is statistically insignificant (
p = 0.46). Hence, the evidence points to a monotonically increasing and convex relationship rather than a true U-shape. The predictive curve in
Figure 6, therefore, complements the formal statistical test by showing that digital infrastructure exerts increasingly stronger marginal effects on inequality as its level rises, despite the absence of a meaningful turning point within the economically relevant range.
5.6. Heterogeneous Effects Across City Types
To investigate whether the distributional effects of digital infrastructure vary across structural contexts, this study conducts a set of heterogeneity analyses along three theoretically grounded dimensions: regional economic development, geographic location, and population flow dynamics. These dimensions jointly capture persistent differences in resource endowment, digital absorptive capacity, institutional readiness, and labor mobility, factors widely recognized as shaping the extent to which digitalization influences Urban–rural inequality.
(1) Economic Development Levels.
We begin by classifying cities according to their long-term average GDP per capita from 2007 to 2023. This measure reflects stable cross-city differences in fiscal capacity, human capital, industrial composition, and the ability to internalize technological inputs. Columns (1)–(2) of
Table 8 show that digital infrastructure significantly increases the Urban–rural income gap in both high- and low-development groups. However, the magnitude in the low-development group is nearly twice that in the high-development group, and bootstrap-based coefficient difference tests confirm that the gap is statistically significant (
p < 0.05). These findings suggest that regions with weaker economic foundations, characterized by limited human capital, constrained public budgets, and less developed digital service systems, are less capable of converting digital investments into inclusive outcomes. Instead, digitalization may exacerbate structural segmentation through skill mismatches, uneven broadband accessibility, and persistent Urban–rural divides in digital capacity.
(2) Geographic Location: North–South Division.
Following previous Research, we further distinguish cities according to the natural geographic boundary at 35° North latitude, which separates China into two historically and structurally distinct macroregions. This division is particularly relevant for digital development because the North features a more dispersed spatial settlement pattern, harsher climate conditions, and higher maintenance costs for communication infrastructure. In contrast, the South exhibits denser economic networks and faster digital adoption.
Columns (3)–(4) of
Table 8 show that the inequality-enhancing effect of digitalization is substantially more substantial in northern cities, while the effect is weaker and less stable in the South. These results suggest that digital infrastructure tends to reinforce pre-existing structural disadvantages in northern regions where fragmented spatial structures, lower market integration, and slower digital diffusion prevent rural areas from benefiting proportionately from digital expansion. By contrast, southern cities, supported by agglomeration economies and more mature digital ecosystems, can better transmit digital gains to rural sectors, thereby mitigating the divergence effect.
(3) Population Flow Dynamics.
To capture the role of demographic mobility, we further separate cities into those with net population inflows and those with net population outflows. Columns (5)–(6) in
Table 8 illustrate that digitalization significantly increases the Urban–rural income gap in population-outflow cities but yields insignificant effects in inflow cities, a difference confirmed by bootstrap tests (
p < 0.01). The results indicate that digitalization amplifies inequality in regions experiencing demographic decline, where labor, capital, and productivity increasingly concentrate in core urban districts. In these contexts, digital technologies fail to generate sufficient spatial spillovers to compensate for resource drain. Conversely, inflow cities benefit from stronger agglomeration dynamics, denser digital infrastructure networks, and a growing supply of skilled workers, allowing digital expansion to operate more equitably.
Together, the heterogeneity results reveal that the inequality effects of digital infrastructure are structurally dependent rather than spatially uniform. Digitalization widens the Urban–rural income gap most sharply in regions with weaker economic foundations, northern geographic characteristics, and persistent population outflows. These findings underscore the importance of strengthening digital absorptive capacity, improving digital public service provision, and designing regionally tailored governance mechanisms to prevent the deepening of spatial inequality during China’s ongoing digital transformation.
7. Spatial Offsetting Effects
To examine whether digital infrastructure generates cross-city spillovers beyond its local impact, we first conduct spatial dependence tests and then estimate a Spatial Durbin Model (SDM).
Table 11 reports the diagnostics. Moran’s I statistics for both Theil and Digit are positive and highly significant, indicating pronounced spatial clustering of inequality and digitalization. LM-lag, LM-error, and their robust versions all reject the null of no spatial dependence, and LR tests favor SDM over SAR and SEM. These results confirm that both inequality and digital infrastructure exhibit spatial interdependence, making SDM the appropriate specification.
Table 12 presents the SDM estimates using the rook and queen contiguity matrices. Across models, Digit remains positive and highly significant, with coefficients near 0.90, similar to the baseline fixed-effects estimate (≈0.78). This result demonstrates that digital infrastructure consistently widens the Urban–rural income gap within cities, even after controlling for endogenous spatial processes. The spatial autoregressive coefficient ρ is also significantly positive, consistent with the spatial clustering indicated above. By contrast, the spatially lagged digitalization term (W·Digit) and the contiguity-based indicators (digit_rook and digit_queen) are negative but statistically insignificant, suggesting that any spillover-induced inequality mitigation is weak once complete spatial dependence is accounted for.
To quantify spillover magnitudes,
Table 13 reports the decomposition of SDM estimates into direct, indirect, and total effects following LeSage and Pace (2010) [
42]. The direct impact of digitalization is large and highly significant (0.817–0.824), confirming that the principal transmission channel operates within cities. The indirect effects (0.097–0.115), however, are small and statistically indistinguishable from zero, indicating that cross-city diffusion, although directionally consistent with a mitigating mechanism, does not materially alter local inequality patterns. Consequently, the total effects remain strongly positive under both spatial weight matrices.
Taken together, the evidence indicates that digital infrastructure reinforces local economic advantages far more strongly than it transmits benefits across city boundaries. While simpler spatial-lag specifications (see
Table 5) suggested significant negative spillovers consistent with an offsetting mechanism, the full SDM results reveal that such mitigating channels lose statistical significance once endogenous spatial interactions and spatially lagged covariates are jointly modeled. This pattern aligns with spatial econometric theory, in which the inclusion of full SDM components often reallocates cross-city variation and attenuates indirect effects. Overall, neighboring cities’ digitalization generates only modest diffusion benefits that are overshadowed by the dominant within-city inequality-enhancing effect. Strengthening regional coordination in digital infrastructure governance may therefore be essential for transforming spatial digital connectivity into meaningful reductions in inequality.
8. Discussion
8.1. Summary of Empirical Findings
The empirical analysis provides consistent evidence that digital infrastructure has a significant, inequality-enhancing effect on the urban–rural income gap. Across various fixed-effects specifications, a higher level of digital infrastructure is associated with a widening of the income divide, confirming the direction predicted by H1. The results also demonstrate meaningful indirect effects, particularly through the innovation mechanism: regions with stronger digital development attract a larger share of innovation resources, while agricultural and rural innovation capacity declines. This pattern offers empirical support for H2. In contrast, the wage polarization mechanism (H3) receives partial support. Although the coefficients align with the theoretical expectations, indicating rising wage premiums in skill-intensive urban sectors, the magnitudes are smaller than anticipated, suggesting that labor-market frictions and institutional constraints may dilute the full effect.
Spatial spillover effects (H4) emerge as another noteworthy finding. The spatial Durbin and SAR models show that digitalization in neighboring cities tends to mitigate local inequality by enhancing regional integration and enabling shared access to digital services and innovation networks. This indicates that negative local effects may be partially offset by positive regional externalities when jurisdictions coordinate. Beyond the baseline results, two additional insights arise. First, the nonlinear specification reveals a convex relationship between digitalization and inequality: the marginal inequality-enhancing effect becomes stronger as digital development deepens. Secondly, the heterogeneity analysis suggests that the impact of inequality is more pronounced in economically developed, highly urbanized regions. Whereas rural or resource-based regions benefit marginally at early stages. These findings imply that digital infrastructure does not operate uniformly across space but interacts with existing development foundations.
8.2. Theoretical Interpretation of Key Results
The empirical results align closely with the theoretical mechanisms developed in
Section 3. The substantial direct effect confirms the role of digital infrastructure as a productivity-enhancing yet unevenly distributed input. Consistent with the logic of skill-biased technological change and new economic geography, urban areas, given their superior human capital, institutional readiness, and absorptive capacity, capture disproportionately higher gains from digital investment. Rural economies, by contrast, lack complementary factors such as digital literacy, innovation systems, and capital depth, preventing them from fully converting digital access into productivity gains. This asymmetric productivity response is the primary pathway by which digitalization widens the urban–rural income gap.
The innovation crowding-out mechanism is also supported empirically. Urban regions experience a substantial boost in innovation outputs and digital-intensive activities as digital infrastructure expands. Meanwhile, agricultural R&D and rural innovation capacity stagnate or decline. These results reflect the cumulative and path-dependent nature of innovation: once cities attain a threshold level of digital development, they attract further R&D resources, skilled labor, and investment, reinforcing the concentration of technological advantages. The findings also validate the theoretical prediction that digitalization accelerates structural transformation away from agriculture, thereby weakening rural productivity growth.
The wage polarization mechanism receives weaker empirical support, although the signs are consistent with SBTC. The muted magnitude may be attributable to institutional realities in China, including labor mobility restrictions, segmented social benefits, and slower rural labor’s adaptation to digital skill requirements. These frictions hinder labor reallocation and delay wage adjustments, attenuating the observable impact in the short to medium term.
The spatial spillover effects confirm the importance of regional context. Neighboring cities’ digital development contributes positively to local equality by expanding shared digital services, enabling cross-boundary labor flows, and diffusing knowledge and innovation. This aligns with theories of spatial externalities and networked development, which posit that technological systems generate benefits beyond local boundaries. Thus, digital inequality should be understood not only in a local sense but also as a regional and interdependent phenomenon.
8.3. Additional Insights: Nonlinear and Heterogeneous Effects
The nonlinear analysis provides further nuance. The convex pattern indicates that inequality effects strengthen as digitalization matures, which is consistent with the theoretical expectation that complementarities with skilled labor, innovation capital, and institutional strength intensify at higher levels of digital development. Early digital expansion may generate broad-based benefits, but once urban ecosystems reach a critical mass, the returns become increasingly skewed. This finding reinforces the necessity of stage-specific policy design.
Heterogeneity across regions underscores the differentiated absorptive capacity of digital infrastructure. Developed, coastal, and highly urbanized areas experience more substantial inequality-enhancing effects due to their advanced digital ecosystems, while less developed regions show weaker or even neutral effects at lower levels of digital penetration. This pattern suggests that digitalization interacts with pre-existing economic structures, amplifying advantages where conditions are favorable and yielding only marginal improvements where complementary factors are limited.
9. Conclusions and Policy Recommendations
9.1. Conclusions
This study offers a detailed analysis of how digital infrastructure influences the urban–rural income disparity in China. Using balanced panel data and multiple robustness checks, including instrumental variables, nonlinear specifications, placebo tests, and spatial econometric models, the analysis shows that digital infrastructure significantly widens the income divide. The findings reveal that digitalization affects inequality through three interconnected pathways: direct productivity effects (M1), innovation concentration (M2), and spatial digital spillovers (M3). While M1 and M2 intensify inequality, M3 provides a partial counterbalance through the regional diffusion of digital benefits. The study also identifies nonlinear and heterogeneous effects, indicating that digitalization interacts strongly with regional development foundations. These results contribute to the literature by integrating direct, structural, and spatial mechanisms into a unified framework and by providing empirical evidence from a large, city-level dataset.
9.2. Policy Recommendations
First, policymakers should adopt a stage-specific approach to digital development. Given the nonlinear effects, early-stage digital expansion in rural areas can reduce inequality, whereas advanced-stage digitalization in urban centers tends to magnify disparities. Thus, priority should be placed on accelerating digital access, digital literacy, and basic infrastructure in underdeveloped rural regions.
Second, the government should strengthen rural innovation systems to counteract the urban innovation crowding-out effect. This includes increasing agricultural R&D investment, fostering digital-enabled agricultural technologies, establishing rural innovation labs, and enhancing connections between urban research institutions and rural industries.
Third, reducing labor-market segmentation is essential. Policies should expand rural access to digital skills training, vocational education, and mobility pathways. Strengthening human capital investment in rural areas can mitigate wage polarization and improve labor adaptability to digital transformation.
Fourth, regional coordination should be promoted to harness positive spatial spillovers. Joint digital infrastructure planning, cross-regional data-sharing platforms, integrated labor markets, and regional innovation networks can enhance externalities and reduce local disparities.
9.3. Limitations and Future Research
Several limitations should be acknowledged. Measuring digital infrastructure may not fully capture emerging technologies such as AI, cloud computing, and platform ecosystems. The analysis is based on city-level data, which limits the investigation of household or firm behavior at the micro level. Finally, causal pathways may differ across countries, suggesting that cross-national comparisons would be a fruitful direction for future research.