Digital–Intelligent Transformation and Urban Carbon Efficiency in the Yellow River Basin: A Hybrid Super-Efficiency DEA and Interpretable Machine-Learning Framework

Ru, Jiayu; Li, Jiahui; Gan, Lu; Yusufu, Gulinaer

doi:10.3390/land15010159

Open AccessArticle

Digital–Intelligent Transformation and Urban Carbon Efficiency in the Yellow River Basin: A Hybrid Super-Efficiency DEA and Interpretable Machine-Learning Framework

¹

School of Economics and Management, Xinjiang University, Urumqi 830049, China

²

School of Business, Urumqi Vocational University, Urumqi 830001, China

^*

Author to whom correspondence should be addressed.

Land 2026, 15(1), 159; https://doi.org/10.3390/land15010159

Submission received: 12 December 2025 / Revised: 9 January 2026 / Accepted: 10 January 2026 / Published: 13 January 2026

(This article belongs to the Special Issue Innovative Strategies for Sustainable Smart Cities and Territories)

Download

Browse Figures

Versions Notes

Abstract

The goal of this scientific study is to clarify whether and how digital–intelligent integration contributes to urban carbon efficiency and to identify the conditions under which this contribution becomes nonlinear and policy-relevant. Focusing on 39 prefecture-level cities in the middle reaches of the Yellow River Basin during 2011–2022, we adopt an integrated measurement–modelling approach that combines efficiency evaluation, machine-learning interpretation, and dynamic–spatial validation. Specifically, we construct two super-efficiency DEA indicators: an undesirable-output SBM incorporating CO₂ emissions and a conventional super-efficiency CCR index. We then estimate nonlinear city-level relationships using XGBoost and interpret the marginal effects with SHAP, while panel vector autoregression (PVAR) and spatial diagnostics are employed to validate the dynamic responses and spatial dependence. The results show that digital–intelligent integration is positively associated with both carbon-related and conventional efficiency, but its marginal contribution is strongly conditioned by human capital, urbanisation, and environmental regulation, exhibiting threshold-type behaviour and diminishing returns at higher digitalisation levels. Green efficiency reacts more strongly to short-run shocks, whereas conventional efficiency follows a steadier improvement trajectory. Heterogeneity across urban agglomerations and evidence of spatial clustering further suggest that uniform policy packages are unlikely to perform well. These findings highlight the importance of sequencing and policy complementarity: investments in digital infrastructure should be coordinated with institutional and structural measures such as green finance, environmental standards, and industrial upgrading and place-based pilots can help scale effective digital applications toward China’s dual-carbon objectives. The proposed framework is transferable to other regions where the digital–climate nexus is central to smart and sustainable urban development.

Keywords:

digital–intelligent integration; super-efficiency SBM; super-efficiency CCR; XGBoost–SHAP; sustainable smart cities

1. Introduction

The compounding pressures of climate change and rapid urbanisation have intensified the constraints on China’s efforts to reduce regional carbon emissions and deliver its “dual-carbon” commitments [1,2]. National policy has clarified the pathway to peaking through the Action Plan for Carbon Dioxide Peaking Before 2030 and has further consolidated the overall framework and progress in a recent white paper [2]. At the urban scale, emission trajectories are not a mechanical outcome of urban expansion; rather, they reflect the joint adjustment of industrial structure, energy use, and evolving development patterns. The Yellow River Basin constitutes a particularly informative case because it combines the functions of a national energy supply base with the mandate of ecological security and has been incorporated into a state-led strategy for ecological protection and high-quality development [3]. According to the most recent consolidated full-year statistics, the nine provincial-level units in the basin account for approximately 420 million residents—around one-third of China’s total population—and generate about CNY 31.64 trillion in GDP (often rounded to CNY 32 trillion), roughly one-quarter of the national total [4,5]. While existing studies have documented pronounced spatial clustering of emissions and heterogeneous drivers within the basin, robust and interpretable evidence remains limited on how these city-level disparities evolve over time and whether, and through what mechanisms, accelerating digital and intelligent transformation reshapes these dynamics. Moreover, decarbonisation is increasingly framed not only as an efficiency issue but also as a governance and rights-based imperative, given the growing recognition of the right to a healthy environment in international climate discourse [6].

From an international perspective, river-basin urban corridors (e.g., the Danube, Rhine, Mississippi, and Mekong) commonly face a structural tension characterised by linear agglomeration along waterways, cross-jurisdictional externalities, and binding ecological constraints, which makes basin economies a suitable unit for comparative analysis. A complexity-based perspective on environmental governance further suggests that coupled socio-ecological and institutional systems often exhibit nonlinear and threshold-type responses to policy and technological interventions [7]. In parallel, digital infrastructure and data-driven governance have been integrated into the SDGs agenda and the smart (sustainable) city framework [8], alongside standardised indicator systems and evaluation toolkits for assessing urban performance [9]. Relatedly, EU experience shows that green-transition pathways (e.g., bioenergy expansion) are explicitly bounded by sustainability standards and land-use safeguards to prevent unintended environmental externalities [10]. Against this backdrop, this study focuses on cities in the Yellow River Basin. We measure urban carbon efficiency under a unified accounting approach, construct indicators capturing digital–intelligent transformation, and test the associated pathways within an interpretable econometric framework that explicitly accounts for spatial dependence and heterogeneity. The analysis is intended to generate methodologically transferable evidence for cross-basin and cross-city comparison, while informing coordinated emission reduction and green governance at the basin scale.

In the urban agglomeration along the middle reaches of the Yellow River, provincial capital cities such as Zhengzhou, Taiyuan, Xi’an and Hohhot play a pivotal role in linking high-quality economic growth with low-carbon transition [11]. These cities are typically resource-based and energy-intensive; their industrial structures rely heavily on coal and heavy manufacturing, the carrying capacity of local environments is relatively limited, and environmental constraints are tightening. The literature reports substantial variation in emissions performance and in the degree of decoupling between economic growth and emissions across urban agglomerations in the Yellow River Basin and highlights the importance of industrial upgrading and cross-regional coordination. However, for these core cities there is still a lack of targeted analysis of how their emissions trajectories have responded to the recent wave of digital–intelligent (shuzhi) development and of the spatial–temporal characteristics and underlying mechanisms of this response [12,13].

In recent years, the digital economy—characterised by the expansion of digital infrastructure, data-driven forms of production and intelligent management systems—has become an important driver of China’s green transition. A growing body of empirical work suggests that the development of the digital economy tends to reduce urban carbon emissions or improve emissions efficiency, although the magnitude and channels of these effects vary markedly across regions and city types [14]. For the middle-reach urban agglomeration, which combines high emissions intensity, strong policy attention and a complex industrial base, a central question is whether, and through what mechanisms, digital–intelligent transformation reshapes urban emissions pathways and mitigation efficiency. Most existing studies are based on national or provincial panels and treat the urban agglomerations of the Yellow River Basin as subsamples rather than primary objects of analysis, so systematic evidence on the mitigation effects and transmission mechanisms of digital–intelligent transformation in key middle-reach cities remains scarce.

At the same time, digital–intelligent governance has been progressively incorporated into the national strategy for ecological protection and high-quality development in the Yellow River Basin. The Digital Twin Yellow River Construction Plan (2022–2025) calls for an integrated basin-wide digital platform to provide data and decision support for river-basin governance [15]. The 2024 policy document Opinions on Promoting the Construction of New Urban Infrastructure and Building Resilient Cities sets the goal of establishing a group of liveable, resilient and smart cities by 2027, signalling a pathway for using digital–intelligent transformation to support pollution and emissions reduction, ecological restoration, industrial upgrading and infrastructure renewal. The 2024 Opinions on Strengthening Ecological and Environmental Zoning Control further emphasise the need to embed new-generation information technologies and artificial intelligence into ecological and environmental zoning [16,17]. Spanning eastern, central and western China and encompassing diverse landforms such as plateaus, the Loess Plateau and alluvial plains, the Yellow River Basin exhibits marked heterogeneity in environmental conditions and socio-economic foundations, combined with a high share of resource-based cities and traditional industrial enterprises and increasing exposure to extreme climate events. Under these conditions, zoned and differentiated governance becomes essential. Comparative evidence also indicates that the effectiveness of environmental rules depends critically on enforcement capacity and civic oversight, as environmental activism and administrative-law mechanisms can shape real-world compliance [18].

Against this policy and practical backdrop, this study focuses on the core cities of the urban agglomeration in the middle reaches of the Yellow River and asks two related questions. First, how has digital–intelligent transformation affected their carbon-emissions pathways and mitigation efficiency over time? Second, how can an ecological-resilience-oriented zoning governance framework be designed that is compatible with the progress of digital–intelligent transformation and that supports low-carbon and climate-resilient development? Addressing these questions requires, on the one hand, a clear conceptual understanding of the mechanisms through which digitalisation and intelligentisation influence urban emissions and, on the other hand, methodological tools that can capture high-dimensional nonlinear responses while remaining interpretable at the city level.

Against this background, this study focuses on the core cities of the urban agglomeration in the middle reaches of the Yellow River and addresses two related questions. First, how does digital–intelligent development, together with changes in industrial structure, fiscal capacity and environmental governance, shape the evolution of urban carbon-emissions efficiency and broader green development over time? Second, how do these effects differ across city types, and what do they imply for designing differentiated, resilience-oriented governance in a river-basin context? To answer these questions, we construct a city-level panel for 2011–2022 that combines indicators with socio-economic and institutional variables, measures green and conventional production efficiency using super-efficiency DEA, and relates these indicators to digital–intelligent integration and other drivers through a flexible, interpretable non-linear modelling framework, complemented by dynamic and spatial diagnostics. The remainder of the study is organised as follows: Section 2 reviews the relevant literature and develops the analytical framework; Section 3 introduces the study area, data and variable design; Section 4 describes the empirical methods; Section 5 presents the main results; Section 6 discusses the findings in light of existing study and basin-wide policy initiatives; and Section 7 concludes with key policy implications and directions for further research.

2. Literature Review and Analytical Framework

This section situates the study within two closely related strands of research and, on that basis, develops an integrated analytical framework. One strand investigates how digitalisation and intelligentisation shape urban carbon mitigation and green development through shifts in industrial structure, energy use, innovation dynamics, and environmental governance. The other strand focuses on methodological advances—particularly interpretable machine-learning approaches—for identifying high-dimensional, nonlinear drivers of city-level green performance. Figure 1 summarises this structure by linking the four transmission channels identified in the literature with the hybrid measurement–modelling approach used in the analysis.

Within the first strand, existing studies point to several pathways through which digital–intelligent transformation can influence urban carbon outcomes and broader green performance. In line with the focus of this paper, four types of effects are particularly relevant: a structural effect, whereby digitalisation reshapes industrial and energy structures; an efficiency effect, through improvements in energy use, production organisation and regulatory performance; an innovation effect, via green technological change and the diffusion of cleaner technologies; and a governance effect, through enhanced monitoring, coordination and differentiated regulation. The following four paragraphs review evidence on each of these channels in turn, before returning to the methodological literature and the implications for the analytical framework.

From a structural perspective, A substantial body of evidence indicates that digital–intelligent integration can accelerate the orderly exit of highly polluting and energy-intensive activities, expand high-end manufacturing and producer services, and facilitate greener manufacturing transformation, sometimes with spillovers to nearby cities [19,20,21]. At the same time, the reported functional form is often nonlinear: U-shaped or inverted-U relationships suggest that carbon outcomes can deteriorate at early stages—when infrastructure build-out, energy demand, and output expansion dominate—before improving once structural upgrading and efficiency gains become sufficiently strong, or conversely improve early and weaken later as marginal returns diminish and rebound pressures rise [22,23]. This nonlinear evidence resonates with the broader “decoupling” discussion, where results depend materially on development stage and accounting boundary. Recent studies show that decoupling assessments can diverge across production-based, consumption-based, and income-based perspectives, and that apparent decoupling may be unstable across sectors and periods [24,25]. Cross-country historical evidence likewise cautions that “green growth” episodes can be reversible rather than monotonic [26]. For China, multi-regional input–output analyses highlight sizeable embodied-carbon transfers through interregional trade, implying that local improvements in measured performance may partly reflect geographical reallocation of emissions along supply chains rather than net reductions [27,28]. These boundary and transfer issues motivate explicit attention to spatial dependence and basin-scale coordination when interpreting city-level carbon-efficiency outcomes.

Second, from the efficiency perspective, empirical study at both city and firm levels generally finds that intelligent technologies can improve energy efficiency and carbon-emissions efficiency by optimising production processes, reducing input–output distortions, and strengthening process control, thereby allowing higher output under given emissions constraints [25,26,27,28,29,30,31,32,33,34]. Evidence from new digital infrastructure and big-data pilot initiatives further suggests that digital investments can enhance the joint efficiency of pollution and carbon reduction, although effect sizes vary by region and stage of development [35,36]. In this channel, the central issue is not whether optimisation is feasible, but whether digital tools are sufficiently embedded in production organisation and infrastructure operation to deliver persistent efficiency gains rather than one-off improvements.

Regarding the innovation channel, A third line of study treats digital–intelligent transformation as a catalyst for green technological change and the diffusion of cleaner technologies [37,38,39,40]. From the perspective of innovation-chain governance, digital integration can improve matching between innovation supply and demand, strengthen firms’ absorptive capacity, and reduce information and transaction frictions, thereby raising technology-transfer and commercialisation efficiency [41,42]. More broadly, intelligent technologies are often framed as general-purpose technologies that support cross-sector green innovation and deployment of low-carbon solutions through prediction, high-dimensional optimisation, and real-time monitoring [43]. Compared with short-run efficiency gains, innovation-driven effects are commonly argued to be more durable, provided that cities possess the requisite human capital and innovation inputs [44].

Fourth, A rapidly expanding literature focuses on governance effects, especially in contexts that require differentiated regulation and resilience-oriented environmental management [45]. Studies of major river basins and urban agglomerations report that digital–intelligent integration can strengthen coordination of pollution and carbon reduction, improve the timeliness and precision of environmental regulation, and enhance monitoring, early warning, and emergency response capacities [46,47]. Quasi-natural experiments based on broadband expansion, smart-city pilots, and big-data comprehensive pilot zones are frequently used to identify these governance effects [48]. The evidence suggests that such interventions can reduce emissions intensity and improve environmental-welfare performance, often with spatial spillovers that cross administrative borders [49,50]. At the same time, governance benefits tend to be conditional: where regulatory credibility, fiscal capacity, or enforcement institutions are weak, digital monitoring alone is less likely to translate into sustained compliance improvements [51].

The four channels above describe a coupled system rather than separable mechanisms. In practice, digital–intelligent integration can simultaneously reshape industrial structure, improve operational efficiency, enable innovation diffusion, and strengthen governance, with combined impacts that vary sharply across city types and stages of development. Yet much of the existing empirical study relies on national or provincial samples and predominantly linear specifications that examine one or two channels at a time [52]. As a result, nonlinear responses, threshold effects, and higher-order interactions—especially those tied to local endowments and institutional capacity—remain insufficiently characterised. Moreover, boundary issues such as embodied-emissions transfers imply that city-level performance cannot be fully interpreted without considering interregional linkages and spatial spillovers [24,25,26,27,28].

These limitations connect directly to the second strand of research on methods for analysing high-dimensional, nonlinear drivers of green performance. Traditional panel and spatial econometric approaches are well suited to testing targeted hypotheses, but they often struggle to approximate complex response surfaces and interaction structures without strong functional-form assumptions. Machine-learning models can improve predictive performance, yet many are difficult to interpret and therefore provide limited guidance on “why” marginal effects differ across cities and periods. Recent advances in interpretable machine learning help to narrow this gap: gradient-boosted decision trees can flexibly capture nonlinearities in structured data, while SHAP decomposes predictions into feature-level contributions at both global and local levels, enabling city-specific “portraits” and interaction diagnostics.

Figure 1. Overview of the research workflow: Super-efficiency DEA → XGBoost coupled with SHAP for explanation → PVAR estimation with spatial diagnostic procedures [48].

Against this background, the present study develops an integrated framework tailored to the urban agglomerations in the middle reaches of the Yellow River Basin. The design aligns measurement, nonlinear explanation, dynamics, and spatial diagnostics so that cross-sectional interpretability can be connected to time profiles and spatial dependence rather than treated as a purely predictive exercise.

Within this overarching structure, the study contributes in three respects. Substantively, it focuses on the middle reaches of the Yellow River Basin and jointly considers digital–intelligent integration, structural upgrading, and governance capacity in explaining city-level carbon-efficiency performance. Methodologically, it connects DEA-based efficiency measurement (with undesirable outputs) to an interpretable nonlinear model, XGBoost–SHAP, and further embeds dynamic and spatial checks to address time propagation and cross-city dependence. From a policy perspective, city-level profiles and clustered spatial patterns support more differentiated, place-based strategies for digital–intelligent transition and low-carbon governance in river-basin urban systems.

3. Study Area and Data

3.1. Study Area

As one of China’s most important river systems, the Yellow River runs across northern China and spans the country’s eastern, central and western regions [53]. In recent years, the strategy of “ecological protection and high-quality development of the Yellow River Basin” has been proposed and elevated to a major national agenda [53,54]. Within this strategic framework, the Yellow River Basin is positioned as a benchmark for large-river governance, a critical ecological security barrier, and a pilot zone for high-quality development [54].

Despite this strategic importance, the basin continues to face severe challenges, including overexploitation of water resources, environmental pollution, and serious soil erosion [52]. These pressures make ecological protection a core task for current and future development. Under such constraints, exploring differentiated, place-based pathways to high-quality development—paths consistent with local resource endowments and development conditions—has become essential for promoting sustainable regional transformation [55,56].

Following the delimitation scheme proposed by [57], we define the urban agglomerations in the middle reaches of the Yellow River by jointly considering natural geographical units, administrative coherence, and inter-city economic linkages. After excluding cities with incomplete data coverage (including Yangquan), the final study sample comprises 39 prefecture-level cities across five provincial-level jurisdictions: Gansu, Shaanxi, Shanxi, the Inner Mongolia Autonomous Region, and Henan. The study area covers approximately 3.95 × 10⁵ km² and includes four major urban agglomerations: the Hohhot–Baotou–Ordos–Yulin urban agglomeration, the Central Shanxi urban agglomeration, the Guanzhong Plain urban agglomeration, and the Central Plains urban agglomeration [57,58,59]. The composition of these agglomerations is summarised in Table 1.

The empirical analysis focuses on these middle-reach urban agglomerations because they constitute a key platform for coordinating ecological protection and high-quality development in the basin [60]. The sample includes provincial capital cities such as Zhengzhou, Taiyuan, Xi’an, and Hohhot, together with surrounding prefecture-level cities, forming an integrated economic–ecological urban system along the midstream of the Yellow River. On the one hand, these cities have historically exhibited a high concentration of coal, metallurgy, chemicals, and other energy-intensive industries and thus serve as an important national base for energy and heavy industry [55]. On the other hand, they are priority areas for implementing the basin strategy and the “dual-carbon” goals and therefore operate under relatively stringent ecological and environmental constraints [61,62]. These combined features make the middle-reach urban agglomerations an appropriate case for examining how digital–intelligent development, structural upgrading, and governance capacity jointly shape urban green performance and carbon-mitigation-oriented efficiency under multiple development and environmental constraints [63,64].

Figure 2 depicts the spatial pattern of the study area, which stretches across central and northwestern China and forms a relatively continuous urban belt along the middle reaches of the Yellow River. This spatial configuration provides the geographical basis for the city-level panel dataset and empirical analysis described below.

3.2. Data and Variables

Building on the study-area delineation above, this study constructs an annual city-level panel dataset for the middle-reach urban agglomerations of the Yellow River over 2011–2022. The sample covers 39 prefecture-level cities that satisfy data-availability requirements. Data on economic and social development, industrial structure, fiscal variables, and indicators related to digital–intelligent development are obtained from city statistical yearbooks and provincial/local statistical yearbooks, with harmonised statistical calibres across cities and years. Unless otherwise noted, all monetary variables are converted to constant prices using official deflators with a common base year to ensure intertemporal and interregional comparability. In addition, ratio- and intensity-type indicators (e.g., per capita values, shares, and intensity measures) are constructed consistently across cities and years to mitigate scale effects and maintain standardisation in the empirical analysis.

In line with the empirical design, variables are organised into dependent variables, a core explanatory variable, and control variables (Table 2). The dependent variables are two super-efficiency indices derived from data envelopment analysis (DEA). The super-efficiency SBM index (y₁) evaluates city-level green production efficiency by explicitly incorporating undesirable outputs, while the super-efficiency CCR index (y₂) measures conventional production efficiency under constant returns to scale without explicitly modelling undesirable outputs. Together, these two indicators provide complementary views of city-level performance under economic and environmental constraints.

The core explanatory variable is a digital–intelligent integration index (x). This composite indicator is updated from our prior study on digital–intelligent integration in the middle-reach urban agglomerations of the Yellow River [48]. The updated index captures overall city performance in digital transformation and intelligent development, covering key dimensions such as digital infrastructure, industrial digitalisation, and intelligent services.

The control variables capture major aspects of city development that are likely to correlate with efficiency outcomes. They include human capital (c₁), internet penetration (c₂), environmental regulation intensity (c₃), urbanisation rate (c₄), science and technology expenditure (c₅), fiscal investment intensity (c₆), openness to external markets (c₇), and financial development level (c₈). All control variables are compiled from statistical yearbooks with consistent definitions and coverage over the sample period. This variable system provides the empirical basis for the interpretable nonlinear modelling framework and the dynamic and spatial analyses introduced in subsequent sections.

Unlike conventional green total factor productivity, carbon-emission efficiency requires that efficiency measurement jointly account for economic output and carbon emissions. Following the environmental production technology and the directional distance function proposed by Färe et al. [65], we incorporate energy consumption into the input set and treat city-level carbon emissions as an undesirable output. Specifically, the DEA input vector includes labour, capital, and energy; the desirable output is city-level GDP, and the undesirable output is city-level carbon emissions. In terms of model choice, the standard SBM model may yield multiple cities with efficiency scores equal to one, which reduces discriminatory power and hampers subsequent regression analysis. Therefore, building on the super-efficiency approach proposed by Andersen [66], we employ a super-efficiency SBM model with undesirable outputs to measure carbon-emission-oriented efficiency, allowing a finer comparison among efficient decision-making units. The resulting super-efficiency SBM score (with undesirable outputs) is used as y₁, while the super-efficiency CCR score (without undesirable outputs) is used as y₂.

Section 4 builds on this dataset by detailing the efficiency measurement procedures and the empirical modelling strategy used to characterise nonlinearities, interaction patterns, and their dynamic and spatial implications.

4. Methodology

4.1. Measuring Urban Performance Under Carbon Constraints

To empirically assess how digital–intelligent integration relates to urban development outcomes, we first construct internally consistent measures of city-level production performance. Following Section 2, urban production performance is evaluated from two complementary perspectives: green (carbon-emission-oriented) production efficiency and conventional production efficiency [67,68]. Each prefecture-level city is treated as a decision-making unit (DMU), and the resulting efficiency indicators are used as dependent variables in the subsequent empirical analysis.

Green production efficiency, denoted by y_1it, is measured using a super-efficiency slacks-based measure (SBM) model with undesirable outputs within a non-radial, non-oriented DEA framework [69,70]. Consistent with the input–output specification in Section 3.2, each city is characterised by an input vector consisting of labour, capital, and energy, a desirable output (real GDP), and an undesirable output (city-level CO₂ emissions). Slack variables are introduced to capture input excesses, shortfalls in desirable output, and excesses in undesirable output. Under the super-efficiency setting, the evaluated city is excluded from the reference set when constructing the frontier, allowing efficiency scores greater than one. This feature is crucial for distinguishing among frontier cities and provides a more sensitive measure of performance when carbon constraints are binding.

Conventional production efficiency, denoted by y_2it is obtained from a super-efficiency CCR model under constant returns to scale [66,71]. This indicator follows the classical DEA setting with a radial, CRS-based efficiency measure and does not explicitly incorporate undesirable outputs, thereby providing a benchmark measure of conventional technical efficiency that is conceptually distinct from, yet comparable to y_1it. As with the SBM specification, the super-efficiency extension permits scores above unity and thus yields a finer ranking among technically efficient cities along the conventional production dimension. Before entering the subsequent models, both y_1it and y_2it are pre-processed to improve numerical stability and cross-city comparability. Specifically, we mitigate the influence of extreme values using winsorisation, apply a natural logarithmic transformation, and then standardise the transformed series (z-score scaling). This preprocessing reduces undue leverage from outliers and facilitates comparability of the efficiency indicators across cities and years, particularly for the panel-based dynamic analysis implemented in later sections.

4.2. Modelling the Digital–Intelligent Integration–Efficiency Relationship

Given the efficiency measures defined in Section 4.1, the next step is to examine how they are associated with digital–intelligent integration and structural characteristics of cities. Let denote the digital–intelligent integration index introduced in Section 3.2, and let c_it = (c₁, …, c₈) collect, respectively, human capital, informatisation level, environmental regulation, urbanisation, science and technology expenditure, fiscal investment intensity, openness to external markets and financial development.

For each efficiency indicator y_k,it(k = 1, 2), we approximate its conditional expectation as

E (y_{k, i t} |x_{i t}, c_{i t}, μ_{i}, τ_{t}) = f_{k} (x_{i t}, c_{i t}, μ_{i}, τ_{t}),

where

f_{k} (\cdot)

is an unknown, potentially non-linear response function,

μ_{i}

and

τ_{t}

denote city and year effects, and

ε_{k, i t}

is an idiosyncratic error term such that

y_{k, i t} = f_{k} (x_{i t}, c_{i t}, μ_{i}, τ_{t}) + ε_{k, i t}

. Here, the objective is associational explanation and prediction rather than causal identification.

City and year effects are implemented as sets of dummy variables and are included directly as features, allowing the model to learn region-specific and period-specific shifts in the efficiency distribution rather than imposing a homogeneous intercept across cities and years.

The theoretical channels discussed in Section 2 indicate that nonlinearities, threshold effects and interactions among explanatory variables are likely to be important. To accommodate such complexity without committing to a restrictive parametric form,

f_{k} (\cdot)

is approximated by a tree-based ensemble model of the gradient-boosting type. Specifically, we employ Extreme Gradient Boosting (XGBoost), which is well suited to high-dimensional, mixed-scale tabular data and can flexibly approximate complex response surfaces through an additive ensemble of decision trees [72]. The recursive nature of boosting enables the model to successively capture residual patterns that would be difficult to accommodate in linear or purely additive frameworks [73].

We adopt XGBoost as the baseline learner because its inductive bias is well matched to structured city-level panel/tabular data and to the mechanism we aim to uncover. In particular, it can accommodate nonlinear responses, high-order interactions, and threshold-like patterns without imposing a priori functional forms, while its built-in regularisation learning-rate shrinkage, row subsampling, and tree-structure constraints helps limit overfitting. Relative to linear fixed-effects or additive specifications, this flexibility is important for capturing piecewise or capacity-contingent relationships that are plausible in urban transition processes.

Building on this modelling framework, we employ SHAP to render the fitted XGBoost model interpretable and to translate the learned nonlinear structure into evidence that can be discussed in mechanism terms. SHAP attribution is implemented using TreeExplainer for tree ensembles. To maintain reproducibility under the installed SHAP environment, summary and dependence graphics are generated through the modern interface when available, with fallbacks to legacy APIs or manual plotting routines if compatibility issues arise. Because region and year encodings are included in estimation to absorb city-invariant heterogeneity and common time shocks, these identifier-type controls are excluded from the reported explanation set; all attributions are therefore presented for substantive covariates only. Global importance is summarised by the mean absolute SHAP value across observations, and observation-level (city–year) attributions are archived to support replication and post hoc analyses. Where supported, we compute SHAP interaction values and summarise complementarities using the mean absolute interaction matrix, visualised as a heatmap with a fixed colour range to facilitate comparisons across runs. For local illustration, we produce force plots for selected cities using the SHAP expected value as the baseline. The representative city–year is chosen from the most recent year as the record whose core index is closest to the city-specific mean, so that the visualisation reflects a typical rather than an extreme profile. The force plot displays contributions for core covariates only (x and c₁–c₈) for readability, while the prediction shown in the title is computed using the full SHAP decomposition to preserve additivity.

Model hyperparameters are selected using cross-validation constructed to respect the time structure on a training subset. Predictive adequacy is evaluated on a held-out test subset using R², mean absolute error (MAE), and mean absolute percentage error (MAPE). To assess temporal robustness under evolving structural conditions, we further implement a rolling-origin validation scheme: the model is recursively estimated using data up to year t, and predictions are generated for year t + 1. Averaging MAE and MAPE across all rolling windows yields a conservative measure of out-of-sample performance over time and guards against overfitting to specific subperiods.

4.3. Interpreting Nonlinearities and Complementarities

While the gradient-boosting model provides flexible non-linear approximations of

f_{k} (\cdot)

, it is still necessary to relate these estimates back to the conceptual mechanisms outlined in Section 2. We therefore complement the predictive exercise with both global and local measures of feature importance in the spirit of recent study on interpretable machine learning [74,75]. These measures jointly allow us to identify, first, which variables matter most on average and, second, how they contribute in specific city–year contexts.

4.3.1. Leave-One-Covariate-Out (LOCO) Importance

At the global level, we assess the contribution of each covariate to predictive accuracy using a leave-one-covariate-out (LOCO) analysis. Starting from the full model that includes x_it, c_1,it, …, c_8,it and fixed effects, we sequentially remove one covariate z, re-estimate the model and recompute the out-of-sample MAE. The change in prediction error,

Δ M A E (z) = M A E_{(- z)} - M A E_{f u l l}

quantifies the marginal contribution of that covariate to overall prediction quality, in line with LOCO-type importance measures based on predictive performance [76]. Ranking covariates by MAE(z) yields a global ordering of variables in terms of their empirical relevance for green and conventional efficiency, which can be compared with prior expectations concerning the structural, efficiency, innovation and governance channels.

4.3.2. Local Attribution and City-Level Diagnostics

Global importance measures, however, do not reveal how factors combine at the level of individual cities or how their marginal effects vary across the distribution of covariates. To obtain city-specific decompositions of predicted efficiency, we therefore use the SHapley Additive exPlanations (SHAP) framework associated with the XGBoost model [77,78]. For each city–year observation, the model prediction can be written as

{\overset{⌢}{y}}_{k, i t} = ϕ_{0, k} + \sum_{j = 1}^{J} ϕ_{j, k, i t}

where

ϕ_{0, k}

is a baseline prediction (the expected model output for a reference input distribution) and

ϕ_{j, k, i t}

denotes the contribution of feature j to the prediction for city i in year t. Positive values of

ϕ_{j, k, i t}

indicate that the corresponding feature pushes the prediction above the baseline, whereas negative values indicate a downward contribution.

These local contributions are visualised using force plots for representative cities, which display how digital–intelligent integration, structural controls and fixed effects jointly move the predicted efficiency away from the baseline. Combined with radar charts of standardised x_it and c_1,it, …, c_8,it, this approach provides a concise yet informative description of each city’s endowment profile and how it translates into higher or lower efficiency within the nonlinear model. Similar combinations of gradient-boosting models and SHAP-based explanations have been shown to be effective in remote-sensing applications, for example, in rapid landslide mapping from high-resolution imagery [73]. In this way, global and local importance measures together link the empirical model back to city-level mechanisms highlighted in the conceptual framework.

4.4. Dynamic Propagation: PVAR and Impulse Responses

Let z_it denote an L-dimensional vector that includes green and conventional efficiency, digital–intelligent integration and selected structural controls (such as urbanisation and fiscal investment):

Z_{i t} = {(y_{1, i t}, y_{2, i t}, x_{i t}, u_{i t}, f_{i t}, \dots)}^{T},

where u_it and f_it represent, for example, urbanisation and fiscal investment intensity for city i in year t. A PVAR(P) model for zit is specified as

z_{i t} = A_{1} z_{i, t - 1} + \dots + A_{p} z_{i, t - p} + μ_{i} + τ_{t} + u_{i t}

where A_p(p = 1, …, P) are L × L coefficient matrices, μ_i and τ_t denote city and time effects, respectively, and u_it is a vector of idiosyncratic disturbances. This specification allows for dynamic feedbacks among efficiency, digital–intelligent integration and structural controls while controlling for unobserved heterogeneity across cities and common shocks over time [79]. The lag order P is selected using standard information criteria and stability diagnostics to ensure that the resulting system is dynamically stable [80].

Following the panel VAR literature, the system is estimated using a system GMM estimator [81,82,83]. This approach treats lagged endogenous variables as instruments for their own current values in the quasi-differenced equations, combining moment conditions in first differences and in levels to improve efficiency when the time dimension is relatively short [82,83]. In particular, suitably lagged levels and differences in z_it are used as instruments to address the endogeneity of lagged dependent variables and to control for the presence of city-specific effects μ_i. Hansen or Sargan-type over-identification tests and Arellano–Bond tests for serial correlation in the residuals are employed to assess the validity of the instruments and the dynamic specification [82,83].

On the basis of the estimated PVAR coefficients, we compute orthogonalised impulse–response functions (IRFs) to trace the dynamic effects of shocks. Using a Cholesky decomposition of the residual covariance matrix under a recursive ordering consistent with the theoretical channels, we derive IRFs for one-standard-deviation innovations to x and to each of the selected structural controls [84]. The resulting IRFs describe the responses of y_1,it and y_2,it over subsequent periods to shocks in digital–intelligent integration and structural variables. To account for sampling uncertainty, we construct bootstrap confidence bands around the IRFs using standard resampling procedures for VAR-type models.

In combination with the static non-linear model in Section 4.2, these dynamic responses allow us to assess whether digital–intelligent integration primarily affects efficiency contemporaneously or also exerts persistent effects over time. Consistent and statistically significant responses in the PVAR framework provide complementary evidence on the temporal propagation of digital–intelligent shocks through the structural, efficiency, innovation and governance channels identified in Section 2.

4.5. Spatial Dependence Diagnostics: Moran’s I and LISA

The cities in the study area are embedded in a spatially connected system, and spatial dependence is particularly relevant in a remote sensing context, where many variables are derived from spatially contiguous observations and exhibit strong spatial autocorrelation [85]. Even after conditioning on a rich set of determinants, residual spatial structure may remain. To evaluate this, we implement two complementary diagnostics: global Moran’s I statistics for model residuals and local LISA statistics for the efficiency indicators themselves, following standard practice in spatial statistics [85,86].

First, using a queen-contiguity spatial weights matrix W = (w_ij)—where two cities are treated as neighbours if they share either a common boundary or a common vertex—we compute global Moran’s I for the residuals of the y_1,it and y_2,it models in each year from 2011 to 2022. For a generic residual vector

e_{t} = {(e_{1 t}, \dots e_{N t})}^{T}

, Moran’s I is defined as [84,86]:

I = \frac{N}{S} \frac{\sum_{i} \sum_{j} w_{i j} (e_{i t} - {\bar{e}}_{t}) (e_{j t} - {\bar{e}}_{t})}{\sum_{i} {(e_{i t} - {\bar{e}}_{t})}^{2}}

where N is the number of cities,

{\bar{e}}_{t}

is the mean residual in year t, and

S_{0} = \sum_{i} \sum_{j} w_{i j}

. The magnitude and statistical significance of Moran’s I—assessed using permutation-based reference distributions or asymptotic normality—indicate whether substantial spatial autocorrelation remains after controlling for digital–intelligent integration, structural covariates and fixed effects [87]. This provides a diagnostic check on whether the non-linear model has captured the main spatial patterns in the data or whether important spatial processes are left unexplained.

Second, we compute Local Indicators of Spatial Association (LISA) for y_1,it and y_2,it in selected benchmark years (2011, 2017, 2022), following the local Moran’s I framework of Anselin [85]. For each city, the local indicator decomposes the global Moran’s I into location-specific contributions and allows the identification of local “hot spots” and “cold spots” of efficiency. Cities are classified into High–High, Low–Low, High–Low and Low–High clusters based on the sign of the local statistic and the deviation from the global mean. Monte Carlo permutation tests are used to determine statistical significance at conventional levels, thereby filtering out spurious clusters due to random variation [85,88]. The resulting LISA cluster maps, presented in Section 5, provide a spatially explicit view of how green and conventional efficiency cluster and evolve over time within the study area [87].

4.6. Robustness and Cross-Agglomeration Validation

The empirical modelling framework is developed for the urban agglomerations in the middle reaches of the Yellow River as a whole. Within this area, however, the Hohhot–Baotou–Ordos–Yulin (Hubao’e–Yulin) urban agglomeration, the Central Shanxi urban agglomeration, the Guanzhong Plain urban agglomeration and the Central Plains urban agglomeration differ markedly in their industrial structures, policy environments and development trajectories. To examine the external validity and portability of the empirical approach in this heterogeneous setting, we conduct cross-regional robustness checks at the urban-agglomeration scale.

For each of the major urban agglomerations identified in Section 3.1, we re-estimate the gradient-boosting model using only the corresponding sub-sample of cities. The model specification, hyperparameter selection procedure and evaluation metrics are kept identical to those in Section 4.2. For each sub-sample, we compute R², MAE and MAPE for both y_1,it and y_2,it on out-of-sample observations. Comparing these metrics across urban agglomerations allows us to assess whether the estimated relationships between efficiency, digital–intelligent integration and the control variables are stable across different regional contexts.

Differences in error measures are interpreted jointly with information on the internal structure and development patterns of each urban agglomeration, rather than as evidence against the modelling strategy per se. In particular, moderate variations in performance are expected in regions with pronounced structural specificities. Overall, this set of cross-regional exercises evaluates the suitability of the proposed empirical framework for comparative analysis across urban agglomerations and provides a check on its robustness within the broader Yellow River Basin.

4.7. Summary

This section develops a multi-layered methodological framework that connects the conceptual channels in Section 2 and the data structure in Section 3 with the empirical analysis in Section 5. Super-efficiency SBM models with undesirable outputs and super-efficiency CCR models provide internally consistent measures of green and conventional production efficiency at the city level. These efficiency indicators are then related to digital–intelligent integration and the structural covariates defined in Section 3.2 through a tree-based gradient-boosting ensemble, whose performance is evaluated using conventional train–test splits, rolling-origin validation and cross-regional robustness checks.

Building on this predictive framework, global and local feature-importance measures—implemented via leave-one-covariate-out analyses and SHAP-based decompositions—together with city-level visualisations such as force plots and radar charts, permit a detailed examination of how digital–intelligent integration and structural characteristics jointly shape efficiency outcomes. A PVAR–IRF set-up is used to characterise the temporal propagation of shocks, while global Moran’s I and local LISA statistics document residual spatial dependence and clustering patterns. Taken together, these components constitute the methodological basis for the empirical results and policy implications presented in Section 5.

5. Empirical Results

5.1. Overall Predictive Performance of the Hybrid Model

A necessary condition for interpreting the model-based decompositions in later subsections is that the underlying specification reproduces the main spatial and temporal patterns observed in the data. We therefore begin by evaluating the in-sample and out-of-sample performance of the hybrid DEA–gradient-boosting framework for the two efficiency indicators, y₁ (green production efficiency)and y₂ (conventional production efficiency). Table 3 summarises the key diagnostics.

Using the full panel of 39 prefecture-level cities, the gradient-boosting model attains coefficients of determination of 0.9980 and 0.9971 for y₁ and y₂, respectively. The corresponding in-sample MAE values are 0.0051 and 0.0061, and MAPE values are 1.74% and 1.30%. These figures should be interpreted in the context of the efficiency indices, which are dimensionless. The raw efficiency indices are dimensionless and lie in a relatively narrow band around unity; after the logarithmic transformation and z-score standardisation (Section 4.1), the model is estimated on centred and scale-normalised outcomes.

A necessary condition for interpreting the model-based decompositions in later subsections is that the underlying specification reproduces the main spatial and temporal patterns observed in the data. We therefore begin by evaluating the in-sample and out-of-sample performance of the hybrid DEA–gradient-boosting framework for the two efficiency indicators, y₁ (green production efficiency)and y₂ (conventional production efficiency). Table 3 summarises the key diagnostics. Using the full panel of 39 prefecture-level cities, the gradient-boosting model attains very high coefficients of determination for both y₁ and y₂. We interpret these values cautiously and provide several diagnostics. First, the dependent variables are DEA-based efficiency indices constructed from consistently measured inputs/outputs, which reduces measurement noise compared with raw emissions or output series. Second, the sample is a balanced panel with strong temporal persistence in city characteristics so a sizeable share of variation is systematic rather than idiosyncratic. Third, we train the model with time-respecting cross-validation and rolling-origin validation, and we report MAE/MAPE alongside R² to avoid over-reliance on a single fit statistic. Moreover, boosted-tree models are less sensitive to classical multicollinearity than linear regressions, but we still limit model complexity through regularisation and show that feature-importance patterns remain stable under alternative hyperparameters and subsamples. Finally, we emphasise that the purpose of the machine-learning stage is to recover flexible associations and conditional patterns, not to claim causal identification.

Against this scale, the in-sample MAE corresponds to less than 2% of the observed range of each index, indicating that the model captures the dominant cross-sectional differences across cities and the inter-annual variation in efficiency levels. This is particularly important given that the explanatory variables combine conventional yearbook statistics with governance- and digitalisation-related indicators that exhibit strong cross-sectional heterogeneity but only moderate inter-annual variation.

To assess temporal robustness and guard against overfitting to specific years, we implement a rolling-origin validation scheme in which the model is repeatedly re-estimated on an expanding window of data and then used to predict the subsequent year (Section 4.2). As expected, the average MAE and MAPE from this exercise are higher than the in-sample values: the rolling-origin MAE is 0.1624 for y₁ and 0.1643 for y₂, while the corresponding MAPE values are 52.58% and 34.76%. The relatively large MAPE figures mainly reflect the sensitivity of percentage errors when actual efficiency scores are close to one; small absolute deviations in such cases translate into large relative percentages even though the predicted paths remain close to the observed trajectories. The MAE values, which are not affected by this denominator issue, are of a magnitude that is still compatible with regional planning and comparative analysis at the urban-agglomeration scale. Because percentage-based errors can be unstable when the dependent variables are transformed and standardised, we interpret MAPE with caution and primarily rely on MAE (and, where reported, RMSE) for out-of-sample comparisons.

Inspection of the time profile of rolling errors shows that years associated with major macroeconomic or policy shocks exhibit somewhat larger forecast errors. In particular, 2018 when ecological protection and high-quality development in the Yellow River Basin were elevated to a national strategy and 2020 when China’s “dual-carbon” targets were formally announced stand out as periods in which abrupt regime shifts are not fully captured by the smoothly evolving components of the model. This pattern is consistent with the expectation that structural breaks induced by policy announcements or external shocks are difficult to forecast solely from lagged covariates and trend dynamics, and it motivates the complementary dynamic and spatial analyses reported in later subsections.

Residual diagnostics further support the adequacy of the specification. For both y₁ and y₂, the residuals are tightly centred around zero, with mean values effectively equal to zero and standard deviations of 0.0069 and 0.0086, respectively (Table 3). The dispersion of residuals is small relative to the scale and cross-sectional variation in the dependent variables, and extreme positive or negative residuals are rare. The few observations with relatively large residuals correspond to rapidly transforming cities—typically those undergoing pronounced industrial restructuring or experiencing strong digital–intelligent policy interventions—which are discussed in more detail in Section 5.4. Overall, the combination of high in-sample fit, acceptable out-of-sample performance under rolling validation, and well-behaved residuals indicates that the hybrid model provides a sufficiently reliable basis for the subsequent analysis of feature contributions, spatial clustering and dynamic responses.

5.2. Cross-Regional Performance Across Urban Agglomerations

The study area comprises four major urban agglomerations that differ markedly in industrial base, policy environment and ecological conditions. Before interpreting model-based results at the basin scale, it is therefore useful to verify whether the estimated relationships between efficiency, digital–intelligent integration and control variables remain stable across these regional contexts. This subsection reports goodness-of-fit indicators for the four agglomerations, estimated separately for each region in line with the cross-regional robustness design in Section 4.6.

Table 4 and Table 5, summarise the performance of the hybrid model for green production efficiency (y₁) and conventional production efficiency (y₂) when the sample is restricted to one urban agglomeration at a time. For y₁, the in-sample R² values are extremely high and effectively equal to one in all four agglomerations (Table 4). Given that the dependent variable is an efficiency index bounded around unity and pre-processed through log transformation and standardisation, such high R² values are not unexpected and should be interpreted primarily as indicating that the model captures the smooth trend and cross-sectional structure of the data within each region. The more informative statistics for cross-regional comparison are the MAE and MAPE, which remain small in absolute terms but display systematic differences across agglomerations.

For y₁, the Hubao–Eyu urban agglomeration (Hohhot–Baotou–Ordos–Yulin), characterised by a relatively stable, energy-oriented industrial base and a comparatively consistent policy environment, shows the smallest absolute and relative errors (MAE = 0.0004; MAPE = 0.11%). The Jinzhong (Central Shanxi) and Guanzhong Plain agglomerations also exhibit low MAE and MAPE, indicating that green production efficiency is well captured in regions where industrial structures and policy settings evolve in a gradual manner. By contrast, the Central Plains agglomeration records somewhat larger MAE and MAPE (0.0013 and 0.44%, respectively), despite an R² that is essentially identical to the other regions. This pattern is consistent with the fact that the Central Plains has experienced more rapid structural adjustment and frequent policy innovations over the sample period, generating short-term fluctuations in green efficiency that are more difficult to track with a model calibrated on annual data.

The pattern for y₂ is related but not identical (Table 5). The hybrid model again attains near-perfect R² in all agglomerations, confirming that the relationship between the explanatory variables and conventional production efficiency can be approximated accurately in each regional subsample. However, the ranking of error measures changes. Here, the Hubao–Eyu and Central Plains agglomerations display the smallest MAPE (0.07% and 0.10%, respectively), whereas the Guanzhong Plain exhibits the highest MAE and MAPE (0.0020 and 0.40%). These differences are small in absolute magnitude but suggest that conventional production efficiency is particularly predictable in regions with relatively clear and stable production regimes, while areas undergoing more active restructuring or stronger ecological-protection mandates may follow more volatile efficiency trajectories even if the overall fit remains high.

Taken together, the cross-regional results indicate that the proposed modelling framework is highly transferable across urban agglomerations in the middle reaches of the Yellow River: the basic functional relationships estimated at the basin scale remain valid when the model is re-estimated separately for each region. At the same time, the modest but systematic differences in MAE and MAPE across agglomerations point to meaningful variation in predictability that reflects institutional, structural and ecological contrasts. These contrasts provide important context for the spatial clustering results discussed in Section 5.6 and the differentiated policy implications in Section 5.7.

5.3. Global Feature Importance

As is shown in Table 6,for the green-efficiency model y₁, the LOCO results indicate that urbanisation (c₄) and fiscal investment intensity (c₆) produce the largest increases in MAE when removed, suggesting that agglomeration dynamics and public-investment intensity are central to explaining green production efficiency. The digital–intelligent integration index x also yields a sizeable positive ΔMAE, confirming its role as a key correlate of green efficiency alongside conventional development factors. Environmental regulation (c₃), openness (c₇), and financial development (c₈) contribute positively but with smaller marginal effects on prediction accuracy. Notably, several covariates display negative ΔMAE values, implying that removing them slightly improves out-of-sample accuracy. This pattern is consistent with redundancy or noise in the presence of correlated predictors and fixed effects; we therefore interpret LOCO rankings jointly with SHAP-based importance rather than relying on LOCO alone.

For the conventional efficiency model y₂, the importance ranking shifts in a manner consistent with its more production-oriented nature. Urbanisation (c₄), openness to external markets (c₇), science and technology expenditure (c₅), and human capital (c₁) exhibit the largest LOCO effects, indicating that conventional production efficiency is particularly sensitive to agglomeration forces, external linkages, innovation inputs, and labour-force quality. The digital–intelligent index x remains non-negligible, but its marginal contribution is smaller than in the y₁ model, suggesting that digital–intelligent development aligns more strongly with the green-efficiency dimension than with conventional technical efficiency alone.

The SHAP-based global importance measures (Figure 3) reinforce these patterns. Across both y₁ and y₂, a compact set of drivers—urbanisation, public-investment intensity, human capital, science and technology expenditure, openness, financial development, and digital–intelligent integration—accounts for most of the variation in predicted efficiency. The stability of this determinant set across indicators and importance metrics supports the joint analysis of green and conventional efficiency and provides an empirical link between the literature on urban development fundamentals and the emerging evidence on digital–intelligent transformation.

5.4. City-Level Heterogeneity and Representative Cases

Global importance measures summarise average relationships across all cities and years. To examine how different combinations of drivers translate into heterogeneous outcomes at the city scale, we conduct a case-based analysis for four representative provincial capitals Hohhot, Taiyuan, Xi’an, and Zhengzhou which occupy distinct positions within the middle-reach urban agglomerations.

The radar charts in Figure 4 plot the standardised values of the digital–intelligent index x and the covariates c₁–c₈ for these cities, revealing clearly differentiated development profiles. Hohhot, embedded in a resource-oriented northern agglomeration, shows relatively strong levels of urbanisation (c₄) and fiscal investment intensity (c₆) but comparatively weaker digital–intelligent integration (x) and science and technology expenditure (c₅). Taiyuan exhibits comparatively strong fiscal investment intensity (c₆) and relatively stringent environmental regulation (c₃), alongside a medium level of internet penetration (c₂). Xi’an combines high human capital (c₁) with strong digital–intelligent integration (x) and science and technology expenditure (c₅), while several fiscal- and finance-related indicators are closer to sample averages. Zhengzhou records high values for digitalisation (x), human capital (c₁), and internet penetration (c₂), whereas environmental regulation (c₃) and financial development (c₈) are more moderate.

The SHAP force plots in Figure 5 translate these profiles into local contributions to predicted efficiency. For cities such as Hohhot and Taiyuan, higher fiscal investment intensity and urbanisation-related factors contribute positively in certain periods, while comparatively weaker digital–intelligent integration or innovation inputs contribute less to efficiency gains. In Xi’an, strong human capital, digital integration, and science and technology expenditure generate positive contributions to both green and conventional efficiency. Zhengzhou exhibits a mixed configuration: strengths in digitalisation and human capital raise predicted efficiency, while other constraints limit the extent of gains, particularly for y₁. Overall, the city-level evidence indicates that similar observed efficiency levels can arise from distinct configurations of drivers, and that the effectiveness of a given policy lever depends critically on local development conditions. Recognising this heterogeneity is essential for designing place-based policy measures in the middle reaches of the Yellow River.

5.5. Dynamic Responses of Green Development to Key Drivers

The static relationships documented above do not reveal how shocks propagate over time or whether the influence of individual drivers remains stable across different phases. To investigate the temporal dimension, we estimate panel VAR systems including y₁, y₂, the digital–intelligent index x, and the full set of controls c₁–c₈, and derive orthogonalised impulse–response functions (IRFs) with respect to one-standard-deviation shocks in each driver (Figure 6 and Figure 7).

For green production efficiency y₁, the IRFs suggest a relatively front-loaded role for development- and investment-related factors. Shocks to urbanisation (c₄) and fiscal investment intensity (c₆) are associated with positive responses early in the sample period, with effects that attenuate over time. This pattern is consistent with the view that early-stage agglomeration and infrastructure-driven expansion can generate readily attainable efficiency gains, while later improvements depend more on complementary factors. Science and technology expenditure (c₅) often displays hump-shaped responses, with positive effects peaking after several periods and then weakening, consistent with adjustment lags and diminishing marginal returns.

Regulation, finance, and openness exhibit different dynamics. Environmental regulation (c₃) shows episodic positive responses during periods of strengthened regulatory effort, indicating that stricter policies can support green-efficiency improvements by accelerating cleaner production and reducing undesirable outputs. The contribution of financial development (c₈) tends to strengthen over time, suggesting a gradually increasing role of finance-related support—potentially including the mobilisation of green-oriented financial resources—in facilitating efficiency gains. The digital–intelligent index x exerts a moderate positive effect on y₁ in many periods and shows strengthening around 2020, consistent with a transition from extensive digital expansion to a more intensive phase where further gains increasingly depend on organisational, institutional, and complementary-factor adjustments.

For conventional production efficiency y₂, the dynamic structure differs in magnitude and timing. Human capital (c₁), openness (c₇), and science and technology expenditure (c₅) tend to play more prominent roles, consistent with channels of technology diffusion, competitive pressure, and productivity-enhancing innovation. Digital–intelligent integration x is predominantly positive in many years and peaks around 2020, coinciding with accelerated deployment of digital monitoring and management platforms (e.g., smart-city data platforms, online reporting systems, and administrative-data-enabled supervision). Taken together, the dynamic results indicate that early improvements in green efficiency are more closely associated with agglomeration and investment dynamics, whereas later-phase gains increasingly rely on regulation, finance, openness, and digital–intelligent tools in combination.

5.6. Spatial Clustering, Residual Moran’s I and Model Adequacy

Because cities are embedded in a spatially connected system, it is important to assess whether the model adequately captures spatial processes. To this end, we analyse Local Indicators of Spatial Association (LISA) for y₁ and y₂ and compute global Moran’s I statistics for the corresponding model residuals (Figure 8 and Figure 9, Table 7).

For y₁, the LISA maps reveal strong and evolving spatial clustering. In 2011, high–high clusters are mainly concentrated in the southern and south-eastern parts of the study area, whereas a large low–low block covers much of the northern Loess Plateau and core energy base. By 2017, high–high clusters extend northwards and several southern cities shift into the low–low group, suggesting diffusion of efficiency improvements along major corridors alongside emerging within-region disparities. In 2022, the configuration becomes more fragmented, with high–high cities concentrated in parts of the north and reappearing in some eastern prefectures, while low–low belts persist in several southern and south-western areas.

For y₂, spatial evolution is related but not identical. High–high clusters expand in some northern areas over time, while low–low clusters become more concentrated in parts of the south-west and central region in later years. This pattern is broadly consistent with the spatial structure of ecological-function zones, river-valley corridors, transport linkages, and inter-city industrial spillovers that jointly shape efficiency outcomes.

Global Moran’s I statistics for residuals (Table 7) provide a complementary diagnostic. Residual Moran’s I values are consistently negative and moderate in magnitude, and they are frequently statistically significant across years, indicating a mild dispersion-type spatial pattern in the residuals rather than residual clustering. This suggests that the nonlinear hybrid model captures the dominant high–high/low–low structure observed in the raw efficiency indicators, while some systematic cross-border contrasts remain. Importantly, the remaining spatial dependence does not manifest as coherent residual hot spots or cold spots, implying limited scope for residual-driven spatial clustering beyond what is explained by the observed covariates and fixed effects.

5.7. Interaction Structure Among Drivers and Policy Implications

The analyses above have focused on marginal (ceteris paribus) effects. Yet the contribution of each driver to efficiency may depend on the level of other variables, and policy measures in practice often study through such complementarities. To characterise these patterns, we compute pairwise SHAP interaction values for both efficiency models and summarise them in interaction heatmaps (Figure 10). Each cell reports the mean absolute interaction between two features, i.e., the extent to which their joint contribution to predicted efficiency deviates from the sum of their individual effects.

Several robust interaction structures emerge. First, the digital–intelligent integration index x exhibits strong interactions with human capital (c₁) and urbanisation (c₄). This implies that the efficiency gains associated with digital–intelligent development are amplified in cities that already possess a skilled labour force and stronger agglomeration economies. Conversely, where human capital and agglomeration are weaker, the marginal return to additional digital investment is more limited.

Second, the heatmaps highlight interactions between digital–intelligent development and regulation-, investment-, and finance-related variables. In particular, combinations such as x,c3 (environmental regulation), x,c6 (fiscal investment intensity), and x,c8 (financial development) stand out, suggesting that digital–intelligent tools are most effective when complementary regulatory frameworks, public-investment capacity, and financial-market development provide support for implementation and scaling.

Third, there is evidence of complementarity among conventional development drivers. For example, interactions between science and technology expenditure (c₅) and human capital (c₁), as well as between openness (c₇) and innovation-related variables, indicate that innovation inputs and external linkages can reinforce each other in shaping both green and conventional efficiency. These interaction structures argue against single-instrument interventions. Promoting digital transformation in isolation—without parallel progress in human-capital formation, innovation inputs, regulation, and enabling fiscal/financial conditions—is unlikely to unlock the full potential efficiency gains.

From a policy perspective, these results support complementarity-based packages: prioritising digital–intelligent investment where human capital and urbanisation-related scale effects can absorb and diffuse new technologies; coupling digital platforms with environmental regulation to improve implementation capacity and transparency; aligning fiscal investment with innovation-oriented projects that raise efficiency under carbon constraints; and strengthening financial development (including the capacity to mobilise green-oriented financial resources where applicable) to support city-level efficiency improvements. For the middle reaches of the Yellow River—where resource dependence, ecological vulnerability, and digital–intelligent transformation are intertwined—such coordinated strategies are important for achieving simultaneous progress in high-quality growth and carbon-mitigation-oriented efficiency.

6. Discussion

6.1. Value of the Hybrid Modelling Approach

The first implication concerns the modelling strategy. The super-efficiency SBM indicator with undesirable outputs (y₁) and the super-efficiency CCR indicator (y₂) are consistent with a growing body of study that uses DEA variants to evaluate urban eco-efficiency or green development efficiency and then links these scores to socio-economic drivers [76,77,78]. In many existing applications, DEA scores are analysed using linear panel or spatial econometric models; once a broader set of covariates is introduced, explanatory power often declines due to nonlinearities and interaction effects that are difficult to capture parsimoniously [70,88,89,90]. By integrating the DEA-based measures y₁ and y₂ with the digital–intelligent integration index x and a comprehensive set of city characteristics (human capital, internet penetration, environmental regulation, urbanisation, science and technology expenditure, fiscal investment intensity, openness, and financial development) in a nonlinear ensemble framework, the present study achieves very strong goodness-of-fit within each urban agglomeration and stable predictive performance under rolling-origin validation.

From a methodological perspective, these results suggest that the DEA-derived efficiency indices contain systematic information that can be recovered more fully when multiple development, institutional and governance-related dimensions are modelled jointly rather than examined one channel at a time. Importantly, the objective here is not causal identification but high-quality associational explanation with transparent decomposition through global and local importance measures.

Residual diagnostics reinforce this interpretation. Although the raw DEA-based efficiency measures display clear high–high and low–low spatial clusters, the residual Moran’s I statistics are consistently negative and moderate in magnitude, indicating a mild dispersion-type spatial pattern rather than residual clustering. This implies that the dominant spatial structure in y₁ and y₂ is largely absorbed by observed cross-city differences (including fixed effects), while remaining spatial dependence mainly reflects cross-border contrasts rather than omitted “hot spots” [90,91]. Overall, combining DEA-based efficiency indices with flexible yet interpretable nonlinear specifications—augmented by dynamic and spatial diagnostics—appears to be a promising direction for research on sustainable smart-city transitions and territorial governance in resource- and policy-constrained regions.

6.2. Digital–Intelligent Integration in a Constrained System

The role of digital–intelligent integration in this study is more nuanced than a simple “more technology is always better” narrative. In global importance rankings, the composite digital–intelligent index x consistently contributes to explaining both y₁ (green efficiency with undesirable outputs) and y₂ (conventional efficiency without explicitly modelling undesirable outputs). At the same time, SHAP results indicate that the contribution of x is strongly context-dependent and varies systematically with cities’ factor endowments, development stage, and institutional settings. This is consistent with evidence that the effects of the digital economy on green performance and productivity are heterogeneous across regions and stages of development [92].

Two interaction patterns are particularly salient. The first concerns factor endowments and agglomeration conditions. Cities with stronger human capital are associated with substantially larger positive contributions of x, and the interaction pair x,c₁ (digital–intelligent integration and human capital) ranks among the strongest for both y₁ and y₂. A similarly prominent interaction emerges between x and urbanisation x,c₄, suggesting that digital–intelligent development is more effective in environments with stronger agglomeration economies where technologies, skills and organisational practices can diffuse more rapidly. In contrast, where the labour-force skill base is weaker and agglomeration effects are limited, marginal increases in x tend to yield smaller improvements. Related studies likewise suggest that digitalisation improves green performance largely through complementary conditions such as human-capital accumulation and broader development fundamentals [93,94].

The second interaction pattern is institutional and enabling-capacity related. The SHAP interaction profiles indicate that x interacts strongly with environmental regulation (c₃), fiscal investment intensity (c₆), and financial development (c₈), and also shows meaningful complementarities with openness (c₇) in some settings. These patterns imply that a given level of digital–intelligent integration is associated with larger efficiency gains where regulatory standards and enforcement capacity are stronger, where public investment can support complementary infrastructure and implementation, and where finance can facilitate technology upgrading and resource reallocation (including, where relevant, mobilisation of green-oriented financial resources) [95].

Taken together, these findings point to digital–intelligent development primarily as an amplifier of broader development and institutional conditions rather than an isolated lever. For policy design, this has two implications. First, prioritising digital–intelligent projects in cities where human capital and urbanisation-related scale effects are already strong is likely to generate larger improvements in both green and conventional efficiency. Second, bundling digital–intelligent investments with stronger environmental regulation, well-designed public investment, and more developed financial systems can help exploit complementarities that single-instrument interventions would miss—an especially important point for smart-city strategies in resource-dependent and environmentally constrained territories.

6.3. Different Time Profiles of Economic and Ecological Gains

The dynamic analysis indicates that the two efficiency measures do not evolve in the same way over time. Recall that y₁ is defined as a super-efficiency SBM indicator reflecting production efficiency while explicitly incorporating undesirable outputs (green efficiency), whereas y₂ is a super-efficiency CCR indicator that provides a conventional efficiency benchmark without explicitly modelling undesirable outputs.

For y₁, shocks to urbanisation (c₄), science and technology expenditure (c₅) and fiscal investment intensity (c₆) tend to generate stronger positive responses earlier in the sample period, with effects that attenuate in later years. This profile is consistent with the view that early rounds of agglomeration-driven expansion, investment and innovation spending can capture relatively “easy” improvements, while subsequent phases face diminishing marginal returns and higher risks of redundancy or misallocation if complementary institutional reforms lag behind [96,97]. In other words, mechanisms that initially raise super-efficiency SBM scores may lose traction once the most straightforward adjustments have been realised, and further progress requires higher-quality innovation, organisational upgrading and more effective regulation.

By contrast, the dynamics for y₂ place relatively greater weight on institutional and enabling factors. Variables such as environmental regulation (c₃), financial development (c₈) and openness (c₇) show more persistent roles in several periods, consistent with channels of competitive pressure, technology diffusion, and resource reallocation that affect conventional production efficiency under ongoing constraints [98,99]. Digital–intelligent integration contributes modestly but consistently to y₂, with stronger responses around 2020—consistent with accelerated deployment of digital monitoring, management and platform-based coordination tools in cities (e.g., smart-city data platforms and administrative-data-enabled supervision), which can improve operational efficiency and compliance capacity [100,101,102].

This asymmetry has direct policy implications. Measures that maximise short-term gains in green efficiency y₁ are not necessarily those that deliver the most persistent improvements in conventional efficiency y₂. Early-stage gains may be driven more by agglomeration dynamics, investment intensity and innovation inputs; later-stage improvements depend increasingly on regulatory quality, financial-system support and openness-linked diffusion mechanisms. The broader literature on decoupling between growth and environmental pressure is consistent with such sequencing effects [103,104], and the results here provide basin-specific evidence by distinguishing the temporal behaviour of y₁ and y₂ within a unified framework.

6.4. Spatial Heterogeneity and Multiple Paths

The spatial analyses underline that the urban agglomerations in the middle reaches of the Yellow River are far from homogeneous. High–high clusters of y₁ and y₂ appear in different parts of the study area and follow distinct trajectories over time. Some cities combine strong conventional efficiency with only moderate green efficiency, often reflecting production strength alongside ongoing pressure from undesirable outputs. Other cities achieve comparatively higher green efficiency under tighter constraints, potentially associated with stronger regulation intensity or different development profiles.

City-level profiles make this heterogeneity more concrete. The radar charts reveal markedly different configurations of human capital, internet penetration, environmental regulation, urbanisation, science and technology expenditure, fiscal investment intensity, openness and financial development across representative provincial capitals such as Hohhot, Taiyuan, Xi’an and Zhengzhou. The SHAP force plots translate these configurations into contributions to y₁ and y₂: similar overall efficiency levels can arise from different mixes of drivers. This pattern is compatible with the notion of “functional equivalence” in regional systems, whereby cities with different internal structures can play comparable roles within a wider network [105,106].

For policy design, the implication is straightforward: there is no single “model city” that can be replicated across the basin. Instead, interventions should be tailored to each city’s development profile and ecological role. For resource- and heavy-industry-oriented cities, policy emphasis should include strengthening environmental regulation, improving the effectiveness of fiscal investment, and targeting digital–intelligent tools toward efficiency gains and compliance capacity rather than scale expansion. For innovation- and service-oriented cores, the priority is to deepen digital–intelligent integration with human-capital upgrading and innovation expenditure while maintaining regulatory effectiveness. For gateway and logistics cities, the focus is on managing transport- and commerce-related pressures while using openness and digital connectivity to enhance efficiency. For ecologically sensitive zones, stricter land-use and pollution constraints combined with selective digital applications for monitoring and governance can support sustainable territorial management [107]. The present study extends related arguments on differentiated pathways to green development by grounding them in city-level efficiency outcomes derived from super-efficiency SBM and CCR measures.

7. Conclusions and Policy Implications

7.1. Main Conclusions

Using a framework that combines super-efficiency DEA, a nonlinear ensemble model, panel dynamics and spatial statistics, this study examines city-level performance in the middle reaches of the Yellow River Basin from two complementary dimensions. Green production efficiency is measured by y₁, a super-efficiency SBM indicator that evaluates city-level production efficiency while explicitly incorporating undesirable outputs. Conventional production efficiency is measured by y₂, a super-efficiency CCR indicator that provides a benchmark efficiency measure without explicitly modelling undesirable outputs. The main conclusions are as follows.

(1): The modelling framework explains most observed variation in y₁ and y₂.

Across the full sample and within each urban agglomeration, the hybrid model reproduces the level and evolution of both efficiency measures with very high goodness-of-fit and small absolute errors. Residual Moran’s I is consistently negative and moderate in magnitude, indicating a mild dispersion-type spatial pattern rather than residual clustering. Thus, the dominant spatial patterns in green and conventional efficiency can largely be attributed to observable differences in development conditions, regulation intensity, fiscal investment, openness, financial development and digital–intelligent integration, rather than being left as unexplained residual hot spots.

(2): Digital–intelligent integration matters but operates through complementarities.

The composite index of digital–intelligent integration x is positively associated with both y₁ and y₂ and ranks among the key predictors. However, it does not replace conventional drivers. Human capital (c₁), urbanisation (c₄), science and technology expenditure (c₅), fiscal investment intensity (c₆), environmental regulation (c₃), openness (c₇) and financial development (c₈) remain central determinants. Interaction results indicate that the contribution of x is largest where human capital and agglomeration conditions are stronger and where enabling institutions (regulation, fiscal investment capacity and finance) support implementation and scaling. This pattern suggests that several factors function primarily as moderators: innovation and skills proxy absorptive capacity for adopting general-purpose digital technologies; agglomeration conditions reflect urban-economy mechanisms (matching and learning) that raise returns to data-enabled coordination; and regulation, fiscal capacity and finance constitute institutional complements that allow digital systems to operate as enforceable MRV and investable upgrading projects. In this sense, digital–intelligent integration acts mainly as an enabling input whose effectiveness depends on local capability, sectoral structure and governance capacity.

(3): The time profiles of y₁ and y₂ differ.

Green efficiency y₁ responds more strongly to shocks in urbanisation, innovation expenditure and fiscal investment intensity earlier in the period, with marginal effects weakening later, consistent with diminishing returns and potential redundancy if investment is not matched by higher-quality adjustment [107]. Conventional efficiency y₂ shows more persistent roles for regulation, openness, finance and digital–intelligent integration, with timing consistent with diffusion and institutional channels that build gradually and then stabilise [97,108]. Digital–intelligent development supports both measures, but the timing and strength of impacts depend on the phase of rollout and the degree of integration into organisational and regulatory practice.

(4): Spatial heterogeneity implies multiple pathways.

High values of y₁ and y₂ do not fully overlap in space, and cities with similar efficiency levels can reach those outcomes through different combinations of endowments and institutions. The city profiles of Hohhot, Taiyuan, Xi’an and Zhengzhou illustrate how local conditions shape the configuration of drivers behind y₁ and y₂ even within a single basin. Multiple viable pathways to green development therefore exist, rather than a single “best practice” model. The spatial non-overlap also accords with differentiated functional roles and constraints within the basin (e.g., resource-dependent energy-base cities versus ecologically constrained areas), implying that evaluation and policy design should be organised at the urban-agglomeration and basin scales rather than inferred from a subset of leading cities.

7.2. Policy Implications

The results yield several policy implications for the middle reaches of the Yellow River Basin. These implications align with China’s dual-carbon roadmap (peaking emissions before 2030 and achieving carbon neutrality before 2060) and, more specifically, with the current configuration of industrial decarbonisation policies that is increasingly organised as a portfolio of pathway-specific instruments. A recent synthesis study characterises China’s industrial decarbonisation framework as exhibiting a “differentiated governance logic”: policy consistency is strongest for direct emission abatement, while linkages to circular-economy integration and socio-economic risk mitigation remain comparatively weaker. This implies that effective implementation depends not only on instrument intensity but also on place-based coordination across pathways and governance capacities [109]. In this context, digital–intelligent tools should be treated as complements to industrial decarbonisation—strengthening monitoring, reporting and verification (MRV), improving evaluation feedback, and lowering coordination costs for spatially targeted interventions—rather than as stand-alone substitutes for structural reform. Given the basin’s urban-agglomeration structure, policy experiments such as low-carbon city pilots and related green-governance initiatives can serve as practical testbeds for scaling localised digital solutions (e.g., smart energy management, real-time emissions monitoring, and data-enabled industrial upgrading) into basin-wide strategies. Our conditional-effect results further indicate that digital solutions deliver larger gains when they are aligned with local administrative capacity and sectoral structure—consistent with the pathway-based, differentiated governance perspective [100].

(1): Plan digital–intelligent development jointly with structural and institutional reform.

The largest improvements in both y₁ and y₂ are observed in cities that combine digital–intelligent investment with ongoing industrial upgrading, clearer environmental standards and active use of green-finance instruments. In these settings, digital systems improve monitoring, enforcement and project evaluation. Where such complementary factors are weak, additional digital infrastructure has a more limited effect on green performance. Digital–intelligent initiatives should therefore be designed as part of integrated packages that also address industrial structure, regulatory quality and green finance.

(2): Match policy mixes to the position of each city in the y₁–y₂ space.

Cities with high green production efficiency y₁ but weaker ecology-oriented efficiency y₂ are often industrial and logistics hubs with strong fiscal revenues and a heavy legacy of historical emissions. For these cities, policy should focus on tightening environmental enforcement, accelerating the phase-out of highly emission-intensive activities and directing finance towards cleaner production and remediation so that structural upgrading does not lock in a new round of high-carbon growth. Cities with comparatively high y₂ but modest improvements in y₁, typically located in areas with important ecological functions, can place more emphasis on industries and services that are compatible with ecological constraints, using digital tools to generate income from ecological assets while maintaining ecological quality.

(3): Consider sequencing: combine early investment with institution-building.

Short-term public investment and fiscal expansion can quickly raise production-related efficiency, especially y₁, but the dynamic analysis suggests that these gains are not sufficient on their own and may fade if they are not accompanied by structural adjustment and stronger institutions. Environmental governance and green-finance development take longer to establish but are associated with more stable improvements in y₂. Policy portfolios should therefore combine early investment with measures that strengthen regulatory and financial frameworks so that y₁ and y₂ move in the same direction over time [110].

(4): Strengthen coordination at urban-agglomeration and basin scales.

The persistence of low–low clusters and the differences between urban agglomerations indicate that governance at the level of individual cities is not sufficient. Industrial links, labour flows and environmental externalities, as well as the indicators used to monitor them, do not follow administrative boundaries. Interpreting the identified clusters alongside existing regional development arrangements (urban-agglomeration strategies, energy-base transition tasks, ecological-function zoning, and policy pilots such as low-carbon city programme and green-finance initiatives) points to clear alignment and misalignment. Persistent low–low clusters are more likely to occur where carbon lock-in is strong but governance and financing complements are weaker, suggesting that basin-scale coordination should link digital rollout with shared MRV standards and data interoperability, targeted fiscal–financial support, and coordinated industrial restructuring to internalise cross-border spillovers. Joint planning of industrial layouts, digital infrastructure and environmental standards at the scale of urban agglomerations and the basin as a whole can help to narrow gaps between core and peripheral cities and align local actions with the objective of high-quality, low-carbon and resilient development in the Yellow River Basin.

7.3. Limitations and Future Research

Several limitations should be noted.

(1): Identification and causal inference.

The analysis is based on observational panel data and is designed to describe conditional relationships. The results are robust to alternative specifications and consistent with institutional knowledge of the region, but they do not identify causal effects. Stronger conclusions about the impact of digital–intelligent integration, environmental regulation or green finance on y₁ and y₂ would require quasi-experimental designs that exploit the timing and spatial distribution of specific policies or infrastructure projects.

(2): Spatial and temporal aggregation.

To match the availability of socio-economic and governance indicators, the analysis is conducted at the city–year level, and spatial information is aggregated to the same scale. This aggregation obscures intra-urban differences in land use, digital access and environmental quality. Extending the framework to finer spatial units—such as grid cells or neighbourhoods—would allow high-resolution remote-sensing products to be combined with more detailed statistical or administrative information and would make it possible to examine within-city inequalities in green development.

(3): Limited representation of interaction and network effects.

The interaction analysis focuses on pairwise effects between drivers and does not explicitly represent higher-order interactions or network relationships among cities. In practice, competitive and cooperative linkages along supply chains, transport corridors and river systems are likely to influence the diffusion of green technologies and practices. Incorporating measures of connectivity and inter-city dependence into the modelling framework would provide a more complete picture of regional transition processes.

(4): Scope of application and period of observation.

The empirical study covers one group of urban agglomerations in the middle reaches of the Yellow River over 2011–2022. Applying and extending the framework to other basins and regions, and to later years as new data become available, would make it possible to test which patterns are specific to the middle reaches of the Yellow River and which generalise to other settings. Comparative analysis across river basins and urban agglomerations with different policy regimes, industrial structures and ecological conditions would also help clarify how these factors mediate the relationship between digital–intelligent integration and the two super-efficiency indicators at the city level.

Author Contributions

Conceptualization, J.R.; Methodology, J.R.; Software, J.R.; Formal analysis, J.R.; Investigation, J.R.; Resources, J.R.; Data curation, J.R. and L.G.; Writing—original draft, J.R.; Writing—review & editing, J.R.; Visualization, L.G.; Supervision, J.L., L.G. and G.Y.; Funding acquisition, J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the “Tianchi Talent” Introduction Programme of Xinjiang Uygur Autonomous Region Department of Human Resources and Social Security (No. 25LJH01382553). The APC was funded by: Xinjiang Uygur Autonomous Region.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

The State Council Information Office of the People’s Republic of China. Working Guidance for Carbon Dioxide Peaking and Carbon Neutrality in Full and Faithful Implementation of the New Development Philosophy; Xinhua News Agency: Beijing, China, 2021.
The State Council Information Office of the People’s Republic of China. Carbon Peaking and Carbon Neutrality: China’s Plans and Solutions; White Paper; State Council Information Office: Beijing, China, 2025.
The Communist Party of China Central Committee and the State Council. Outline of the Plan for the Yellow River Basin Ecological Protection and High-Quality Development; Xinhua News Agency: Beijing, China, 2021.
The 5th Interprovincial Joint Conference on Ecological Protection and High-Quality Development of the Yellow River Basin Held in Chengdu. 2024. Available online: https://www.sc.gov.cn/10462/10464/10797/2024/7/6/19234d802e3d40c580ed037f79d640c2.shtml (accessed on 5 January 2026).
The 2024 China Yellow River Cities Tourism Development Report Released in Gansu. 2024. Available online: https://whhlyt.nx.gov.cn/xxfb/wlyw/202409/t20240927_4677036.html (accessed on 5 January 2026).
Mladenov, M. COP28 in the Light of the Right to a Healthy Environment: Destructive Creation or Creative Destruction? Jurid. Trib.—Rev. Comp. Int. Law 2024, 14, 481–493. [Google Scholar] [CrossRef]
Pessina, S. The Link Between Environmental Rights and the Rights of Nature: The Virtues of a Complexity-Based Approach. Jurid. Trib.—Rev. Comp. Int. Law 2025, 15, 406–422. [Google Scholar]
United Nations General Assembly. Transforming our World: The 2030 Agenda for Sustainable Development; Resolution A/RES/70/1; United Nations General Assembly: New York, NY, USA, 2015; Available online: https://www.un.org/en/development/desa/population/migration/generalassembly/docs/globalcompact/A_RES_70_1_E.pdf (accessed on 5 January 2026).
International Telecommunication Union—Telecommunication Standardization Sector (ITU-T). Recommendation ITU-T Y.4903/L.1603 (03/2022): Key Performance Indicators for Smart Sustainable Cities to Assess the Achievement of Sustainable Development Goals; ITU: Geneva, Switzerland, 2022; Available online: https://www.itu.int/rec/T-REC-Y.4903/en (accessed on 5 January 2026).
Vačková, I.; Lazíková, J.; Prčík, M.; Rumanovská, Ľ. Energy Biomass and Land Use Change in EU Law. Jurid. Trib.—Rev. Comp. Int. Law 2025, 15, 248–280. [Google Scholar]
Zhang, G.; Huang, J.; Liu, X.; Guan, X.; Wei, Y.; Ding, L.; Han, D. The Evolution of Ecological Security and Its Drivers in the Yellow River Basin. Environ. Sci. Pollut. Res. 2023, 30, 47501–47515. [Google Scholar] [CrossRef]
Yan, X.; Deng, Y.; Peng, L.; Jiang, Z. Study on the Impact of Digital Economy Development on Carbon Emission Intensity of Urban Agglomerations and Its Mechanism. Environ. Sci. Pollut. Res. 2023, 30, 33142–33159. [Google Scholar] [CrossRef]
Zhang, Z.; Wang, W.; Chen, J.; Han, C.; Zhang, L.; Lv, X.; Yang, L.; Cui, G. Spatial Association and Driving Factors of the Carbon Emission Decoupling Effect in Urban Agglomerations of the Yellow River Basin. Land 2025, 14, 1838. [Google Scholar] [CrossRef]
Zhang, W.; Liu, X.; Wang, D.; Zhou, J. Digital Economy and Carbon Emission Performance: Evidence at China’s City Level. Energy Policy 2022, 165, 112927. [Google Scholar] [CrossRef]
Yellow River Conservancy Commission of the Ministry of Water Resources. Plan for the Construction of the Digital Twin Yellow River (2022–2025); Yellow River Conservancy Commission: Zhengzhou, China, 2022. [Google Scholar]
General Office of the CPC Central Committee. Opinions on Promoting the Construction of New Urban Infrastructure and Building Resilient Cities; Xinhua News Agency: Beijing, China, 2024.
Xinhua News Agency. Opinions on Strengthening Ecological and Environmental Zoning Control; Xinhua News Agency: Beijing, China, 2024. [Google Scholar]
Todorović, B. A Comparative Analysis of Environmental Activism in Albania, Romania and Serbia: Lessons in Civil Society Enforcement of Administrative Law. Jurid. Trib.—Rev. Comp. Int. Law 2025, 15, 108–140. [Google Scholar]
Kuang, C.; Huang, J.; Di, Y. How Does Digital–Intelligence Integration Affect “Carbon Unlocking”? Audit. Econ. Res. 2025, 40, 118–127. (In Chinese) [Google Scholar]
Huang, J.; Fu, A.; Di, Y. Impact and spatial effects of digital–intelligence integration in the urban agglomeration in the middle reaches of the Yangtze River on green transformation of manufacturing. Econ. Geogr. 2025, 45, 122–131. (In Chinese) [Google Scholar] [CrossRef]
Lan, H.; Sun, G.; Feng, J.; Wang, L. Realization paths for developing new quality productive forces in the coal-fired power industry through the synergistic promotion of digitalization, intelligence, and greening. Sci. Technol. Manag. Res. 2025, 45, 196–205. (In Chinese) [Google Scholar]
Ma, Z.; Xiao, H.; Li, J.; Chen, H.; Chen, W. Study on how the digital economy affects urban carbon emissions. Renew. Sustain. Energy Rev. 2025, 207, 114910. [Google Scholar] [CrossRef]
Wu, Y.; Liu, Y. How does the digital economy affect urban CO₂ emissions? Mechanism discussion and empirical test. Environ. Dev. Sustain. 2025, 27, 14097–14122. [Google Scholar] [CrossRef]
Hubacek, K.; Chen, X.; Feng, K.; Wiedmann, T.; Shan, Y. Evidence of decoupling consumption-based CO₂ emissions from economic growth. Adv. Appl. Energy 2021, 4, 100074. [Google Scholar] [CrossRef]
Zhang, Z.; Yu, Y.; Kharrazi, A. Unstable decoupling of CO₂ emissions from sectoral economic growth calls for decarbonization policies based on multi-perspective accounting: A case study of Zhejiang, China. Environ. Sci. Pollut. Res. 2023, 30, 57503–57517. [Google Scholar] [CrossRef]
Infante-Amate, J.; Travieso, E.; Aguilera, E. Unsustainable prosperity? Decoupling wellbeing, economic growth, and greenhouse gas emissions over the past 150 years. World Dev. 2024, 184, 106754. [Google Scholar] [CrossRef]
Liang, Y.; Li, Y.; Chen, S.; Liang, S. Decoupling carbon dioxide emissions from economic growth across multiregional supply chains within China. J. Clean. Prod. 2024, 482, 144200. [Google Scholar] [CrossRef]
Du, W.; Fan, Y.; Yuan, N.; Li, M. Can digitalization promote cities’ low-carbon development: Insights from local and neighboring regions. Energy Strategy Rev. 2025, 58, 101680. [Google Scholar] [CrossRef]
Chang, K.; Yang, M.; Li, B.; Wang, Y. The impact of the digital economy on urban carbon emissions reduction: Evidence from cities in the Yangtze River Delta. J. Knowl. Econ. 2025. [Google Scholar] [CrossRef]
Guo, X.; Xu, J. Carbon reduction in the AI era: How does urban digital intelligence transformation facilitate low-carbon development? Urban Clim. 2025, 61, 102449. [Google Scholar] [CrossRef]
Tian, Q.; Zang, J.; Dai, H.; Xu, Z. Carbon emission reduction in the digital age: The impact and applications of artificial intelligence. Chin. Manag. Stud. 2025. [Google Scholar] [CrossRef]
Li, J.; Wang, Y.; Zhang, S.; Sheng, X. Digital economy, fiscal decentralization, and carbon emission intensity: Evidence from China. Sustain. Futures 2025, 9, 100522. [Google Scholar] [CrossRef]
Wen, J.; Khalid, S.; Mahmood, H.; Alam, K.; Zakaria, M. Examining the influence of ICT on carbon emissions in emerging economies. Struct. Change Econ. Dyn. 2025, 74, 353–360. [Google Scholar] [CrossRef]
Xia, W.; Ruan, Z.; Ma, S.; Zhao, J.; Yan, J. Can the digital economy enhance carbon emission efficiency? Evidence from 269 cities in China. Int. Rev. Econ. Financ. 2025, 97, 103815. [Google Scholar] [CrossRef]
Liu, N.; Wang, S.; Peng, Y. The carbon reduction effect of AI policy: Quasi-experimental evidence from China’s National AI Innovation Pilot Zones. Prepr. Res. Sq. 2025, 1–27. [Google Scholar] [CrossRef]
Hai, M.; Zhang, Z. Carbon emission reduction effects of urban digital–intelligent transformation: Synergistic effects of the “Broadband China” and “Smart City” pilot policies. J. Hunan Agric. Univ. (Soc. Sci.) 2025, 26, 94–103. (In Chinese) [Google Scholar]
Li, H. Carbon emission optimization and value creation strategies in the intelligent transformation of e-commerce. Sci. Technol. Product. 2025, 46, 54–57. (In Chinese) [Google Scholar]
Gao, C.; Huang, P. Pathways for artificial intelligence–driven reduction of embodied export carbon emissions in Jiangsu Province. Mark. Outlook 2025, 15, 46–48. (In Chinese) [Google Scholar]
Zou, X.; Zhong, M. Intelligent manufacturing assisting enterprise carbon emission reduction: Internal mechanisms and micro-level evidence. China Popul. Resour. Environ. 2025, 35, 52–64. (In Chinese) [Google Scholar]
Wang, T.; Shen, Y. Unlocking the potential of supply chain digitalization for enhancing enterprise green transformation performance: Evidence from China. Humanit. Soc. Sci. Commun. 2025, 12, 1339. [Google Scholar] [CrossRef]
Yang, Y. Policy-driven sustainable property development: Strategic pathways and systemic challenges in China’s dual carbon transition. World J. Econ. Bus. Res. 2025, 3, 40–52. [Google Scholar]
Wu, H. Industrial robots, labor intensification, and regional carbon emission efficiency. Jiangsu Commer. Forum 2025, 131–136. (In Chinese) [Google Scholar] [CrossRef]
Xu, W.; Wan, W. Research on the carbon emissions reduction effects of China’s digital economy: Moderating role of the national big data comprehensive pilot zone policy. Front. Environ. Sci. 2025, 13, 1523560. [Google Scholar] [CrossRef]
Jiang, H.; Lu, J.; Zhang, R.; Xiao, X. Investigation of diverse urban carbon emission reduction pathways in China: Based on the technology–organization–environment framework for promoting socio-environmental sustainability. Land 2025, 14, 260. [Google Scholar] [CrossRef]
Sha, N.; Ghardallou, W.; Murshed, M.; Makhmudov, S. The role of internet services in carbon productivity enhancement and environmentally sustainable economic growth. J. Environ. Manag. 2025, 395, 127734. [Google Scholar] [CrossRef]
Zhang, L.; Chen, B.; Jiao, Y.; Dong, Z. Can artificial intelligence pave a greener path for China? Exploring the synergistic effects of intelligentization and industrialization on carbon emission efficiency. Emerg. Mark. Financ. Trade 2025. [Google Scholar] [CrossRef]
Pan, X.; Lin, S. Can intelligent manufacturing reduce corporate carbon emissions? Empirical evidence from China’s listed manufacturing firms. Emerg. Mark. Financ. Trade 2025, 61, 3296–3307. [Google Scholar] [CrossRef]
Ru, J.; Li, J.; Gan, L.; Sun, J.; Wang, S. Urbanization, digital–intelligent integration, and carbon productivity: Spatiotemporal dynamics in the middle reaches urban agglomeration of the Yellow River. Land 2025, 14, 2087. [Google Scholar] [CrossRef]
Sun, S.; Li, Y. The impact of industrial digitalization development on carbon emissions. Hebei Enterp. 2025, 10, 9–14. (In Chinese) [Google Scholar]
Su, C.; Wang, F. Digital new infrastructure and carbon emission intensity: Internal mechanisms and empirical evidence. Areal Res. Dev. 2025, 44, 36–43. (In Chinese) [Google Scholar]
Zheng, Y.; Sheng, P. The impact of technological innovation on environmental governance in the logistics industry. Asia Pac. Econ. Manag. Rev. 2025, 2, 5. [Google Scholar] [CrossRef]
Wang, Y.; Li, N.; Zhao, Z. Mechanisms of digital economy development driving CO₂ emission evolution: Empirical evidence from China. Clean Energy 2025, 9, 35–45. [Google Scholar] [CrossRef]
Protection of Yellow River a Major National Strategy: Xi. State Council of the People’s Republic of China. 2019. Available online: https://english.www.gov.cn/news/topnews/201909/19/content_WS5d8385dcc6d0bcf8c4c13b2b.html (accessed on 11 September 2025).
Zhou, Z.; Sun, X.; Zhang, X.; Wang, Y. Inter-Regional Ecological Compensation in the Yellow River Basin Based on the Value of Ecosystem Services. J. Environ. Manag. 2022, 322, 116073. [Google Scholar] [CrossRef] [PubMed]
Xu, Y.; Wang, C. Ecological Protection and High-Quality Development in the Yellow River Basin: Framework, Path, and Countermeasure. Bull. Chin. Acad. Sci. 2020, 35, 875–883. [Google Scholar] [CrossRef]
Yin, D.; Yu, H.; Shi, Y.; Zhao, M.; Zhang, J.; Li, X. Matching supply and demand for ecosystem services in the Yellow River Basin, China: A perspective of the water-energy-food nexus. J. Clean. Prod. 2023, 384, 135469. [Google Scholar]
Zhang, K.; Zhang, Y. Evolution of Regional Economic Disparities in the Yellow River Basin at Different Spatial Scales. Econ. Geogr. 2020, 40, 1–11. [Google Scholar] [CrossRef]
Chen, Y.; Miao, Q.; Zhou, Q. Spatiotemporal Differentiation and Driving Force Analysis of the High-Quality Development of Urban Agglomerations along the Yellow River Basin. Int. J. Environ. Res. Public Health 2022, 19, 2484. [Google Scholar] [CrossRef]
Li, Y.; Xu, B.; Wan, Y.; Li, Y.; Li, H. Digital Economy, Spatial Imbalance, and Coordinated Growth: Evidence from Urban Agglomerations in the Middle and Lower Reaches of the Yellow River Basin. Sustainability 2024, 17, 9743. [Google Scholar] [CrossRef]
Jin, Z.; Teng, C.; Jiao, X.; Miao, Y.; Wang, C. The Integrated Development and Regional Disparities of Urban Agglomerations in the Yellow River Basin, China. Sustainability 2024, 16, 10353. [Google Scholar] [CrossRef]
Xia, Y.; Shi, J.; Zhao, Z.; Liu, Z.; Liu, X.; Zhang, C. Mapping the Coal–Water Nexus in the Yellow River Basin. J. Clean. Prod. 2023, 411, 137249. [Google Scholar] [CrossRef]
Song, C.; Yin, G.; Lu, Z.; Chen, Y. Industrial Ecological Efficiency of Cities in the Yellow River Basin in the Background of China’s Economic Transformation: Spatial-Temporal Characteristics and Influencing Factors. Environ. Sci. Pollut. Res. 2022, 29, 4334–4349. [Google Scholar] [CrossRef] [PubMed]
Zhao, D.; Liang, Y.; Li, L.; Ma, Y.; Xiao, G. Spatio-Temporal Differentiation and Enhancement Path of Tourism Eco-Efficiency in the Yellow River Basin under the “Dual Carbon” Goals. Sustainability 2025, 17, 7827. [Google Scholar] [CrossRef]
Xi, J. Xi Stresses Ecological Protection, High-Quality Development in Yellow River Basin; State Council Information Office: Beijing, China, 2024. Available online: https://english.www.gov.cn/news/202409/14/content_WS66e59530c6d0868f4e8eaff9.html (accessed on 11 October 2025).
Färe, R.; Grosskopf, S.; Lovell, C.A.K.; Pasurka, C. Multilateral Productivity Comparisons When Some Outputs Are Undesirable: A Nonparametric Approach. Rev. Econ. Stat. 1989, 71, 90–98. [Google Scholar] [CrossRef]
Andersen, P.; Petersen, N.C. A Procedure for Ranking Efficient Units in Data Envelopment Analysis. Manag. Sci. 1993, 39, 1261–1264. [Google Scholar] [CrossRef]
Tone, K. A Slacks-Based Measure of Efficiency in Data Envelopment Analysis. Eur. J. Oper. Res. 2001, 130, 498–509. [Google Scholar] [CrossRef]
Tone, K. A Slacks-Based Measure of Super-Efficiency in Data Envelopment Analysis. Eur. J. Oper. Res. 2002, 143, 32–41. [Google Scholar] [CrossRef]
Li, H.; Fang, K.; Yang, W.; Wang, D.; Hong, X. Regional Environmental Efficiency Evaluation in China: Analysis Based on the Super-SBM Model with Undesirable Outputs. Math. Comput. Model. 2013, 58, 1018–1031. [Google Scholar] [CrossRef]
Long, L. Eco-Efficiency and Effectiveness Evaluation toward Sustainable Urban Development in China: A Super-Efficiency SBM–DEA with Undesirable Outputs. Environ. Dev. Sustain. 2021, 23, 14982–14997. [Google Scholar] [CrossRef]
Bolos, V.J.; Benitez, R.; Coll-Serrano, V. Continuous Models Combining Slacks-Based Measures of Efficiency and Super-Efficiency. Cent. Eur. J. Oper. Res. 2022, 30, 123–150. [Google Scholar] [CrossRef]
Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
Lin, N.; Zhang, D.; Feng, S.; Ding, K.; Tan, L.; Wang, B.; Chen, T.; Li, W.; Dai, X.; Pan, J.; et al. Rapid Landslide Extraction from High-Resolution Remote Sensing Images Using SHAP-OPT-XGBoost. Remote Sens. 2023, 15, 3901. [Google Scholar] [CrossRef]
Molnar, C. Interpretable Machine Learning: A Guide for Making Black Box Models Explainable; Lulu Press: Morrisville, NC, USA, 2020. [Google Scholar]
Murdoch, W.J.; Singh, C.; Kumbier, K.; Abbasi-Asl, R.; Yu, B. Interpretable Machine Learning: Definitions, Methods, and Applications. Proc. Natl. Acad. Sci. USA 2019, 116, 22071–22080. [Google Scholar] [CrossRef]
Lei, J.; G’Sell, M.; Rinaldo, A.; Tibshirani, R.J.; Wasserman, L. Distribution-Free Predictive Inference for Regression. J. Am. Stat. Assoc. 2018, 113, 1094–1111. [Google Scholar] [CrossRef]
Lundberg, S.M.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions. In Advances in Neural Information Processing Systems 30 (NIPS 2017); Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran Associates Inc.: Red Hook, NY, USA, 2017; pp. 4765–4774. [Google Scholar]
Lundberg, S.M.; Erion, G.; Chen, H.; DeGrave, A.; Prutkin, J.M.; Nair, B.; Katz, R.; Himmelfarb, J.; Bansal, N.; Lee, S.-I. From Local Explanations to Global Understanding with Explainable AI for Trees. Nat. Mach. Intell. 2020, 2, 56–67. [Google Scholar] [CrossRef] [PubMed]
Holtz-Eakin, D.; Newey, W.; Rosen, H.S. Estimating Vector Autoregressions with Panel Data. Econometrica 1988, 56, 1371–1395. [Google Scholar] [CrossRef]
Lütkepohl, H. New Introduction to Multiple Time Series Analysis; Springer: Berlin/Heidelberg, Germany, 2005. [Google Scholar]
Love, I.; Zicchino, L. Financial Development and Dynamic Investment Behavior: Evidence from Panel Vector Autoregressions. Q. Rev. Econ. Financ. 2006, 46, 190–210. [Google Scholar] [CrossRef]
Arellano, M.; Bover, O. Another Look at the Instrumental Variable Estimation of Error-Components Models. J. Econom. 1995, 68, 29–51. [Google Scholar] [CrossRef]
Blundell, R.; Bond, S. Initial Conditions and Moment Restrictions in Dynamic Panel Data Models. J. Econom. 1998, 87, 115–143. [Google Scholar] [CrossRef]
Zhang, C.; Lv, W.; Liu, G.; Wang, Y. Moran’s I for Multivariate Spatial Data. Mathematics 2024, 12, 2746. [Google Scholar] [CrossRef]
Anselin, L. Local Indicators of Spatial Association—LISA. Geogr. Anal. 1995, 27, 93–115. [Google Scholar] [CrossRef]
Moran, P.A.P. Notes on Continuous Stochastic Phenomena. Biometrika 1950, 37, 17–23. [Google Scholar] [CrossRef]
Anselin, L.; Bera, A.K.; Florax, R.; Yoon, M.J. Simple Diagnostic Tests for Spatial Dependence. Reg. Sci. Urban Econ. 1996, 26, 77–104. [Google Scholar] [CrossRef]
Adhikari, H.; Hansen, A.J.; Dincă, M. Exploring the Spatial Patterns of Vegetation Fragmentation Using Local Spatial Autocorrelation Statistics. J. Appl. Remote Sens. 2019, 13, 024523. [Google Scholar]
Yang, Q.; Wan, X.; Ma, H. Assessing Green Development Efficiency of Municipalities and Provinces in China Integrating Models of Super-Efficiency DEA and Malmquist Index. J. Sustain. 2015, 7, 4492–4510. [Google Scholar] [CrossRef]
Ye, T.; Xiang, X.; Ge, X.; Yang, K. Research on Green Finance and Green Development Based Eco-Efficiency and Spatial Econometric Analysis. J. Sustain. 2022, 14, 2825. [Google Scholar] [CrossRef]
Liu, X.; Zhang, X.; Yuan, M.; Liu, J.; Zhou, G. Spatial-Temporal Differentiation of Urban Eco-Efficiency and Its Driving Factors: A Comparison of Five Major Urban Agglomerations in China. PLoS ONE 2024, 19, e0300419. [Google Scholar] [CrossRef]
Sheng, Z.; Zhu, C.; Chen, M. Exploring the Impact of the Digital Economy on Green Total Factor Productivity—Evidence from Chinese Cities. Sustainability 2024, 16, 2734. [Google Scholar] [CrossRef]
Yang, Y.; Liang, Q. Digital Economy, Environmental Regulation and Green Eco-Efficiency—Empirical Evidence from 285 Cities in China. Front. Environ. Sci. 2023, 11, 1113293. [Google Scholar] [CrossRef]
Li, Z.; Wang, J.; Zhang, X. Digital Economy, Industrial Structure and Carbon Emissions: An Empirical Study of China. J. Clean. Prod. 2022, 362, 132321. [Google Scholar]
Liu, Y.; Sun, J.; He, L. Digital Finance, Environmental Regulation, and Green Development Efficiency of China. Front. Environ. Sci. 2023, 11, 1131058. [Google Scholar] [CrossRef]
Gong, G.; Liu, Z.; Chen, W. Infrastructure Investment and Growth in China: A Quantitative Assessment. J. Econ. Dyn. Control 2022, 141, 104420. [Google Scholar]
Ai, K.; Yan, X. Can Green Infrastructure Investment Reduce Urban Carbon Emissions: Empirical Evidence from China. Land 2024, 13, 226. [Google Scholar] [CrossRef]
Yu, H.; Luo, X.; Shao, J.; Dai, Y.; Khan, H. Assessing the Impact of Green Finance Initiatives on Carbon Emission Reductions in China: A Provincial-Level Analysis. Environ. Dev. Sustain. 2025; Epub ahead of printing. [Google Scholar]
Zhang, P.; Liu, Y.; Li, F. Exploring the Effect of Green Finance on Green Development of China’s Equipment Manufacturing Industry. Resour. Policy 2024, 89, 104196. [Google Scholar]
Xu, Y.; Ding, Z. Sustainable Growth Unveiled: Exploring the Nexus of Green Finance and High-Quality Economic Development in China. Front. Environ. Sci. 2024, 12, 1414365. [Google Scholar] [CrossRef]
Li, J.; Wang, L.; Chen, Y. Mapping Spatiotemporal Dynamic Changes in Urban CO₂ Emissions in China Using Multi-Source Data. Remote Sens. 2024, 17, 611. [Google Scholar]
OECD. Indicators to Measure Decoupling of Environmental Pressure from Economic Growth; OECD: Paris, France, 2002. [Google Scholar]
Jia, Z.; Wang, C.; Li, X. Decoupling Monetary Resource–Environmental Pressure from Economic Growth in China. Environ. Dev. Sustain. 2023, 26, 10115–10137. [Google Scholar]
Zhang, M.; Da, Y. Decoupling Analysis between Economic Growth and Resource–Environment Pressure in China. Sci. Total Environ. 2020, 714, 136655. [Google Scholar]
Li, W.; Chen, R.; Zhou, D. Functional Division of Urban Agglomerations and Regional Coordinated Development in China. Ann. Reg. Sci. 2025; Epub ahead of printing. [Google Scholar]
Wang, J.; Yang, H.; Zhao, Q. Green Development Effect Assessment and Driving-Force Analysis of the Yellow River Basin in China. Environ. Dev. Sustain. 2024; Epub ahead of printing. [Google Scholar]
Sang, J.; Pan, L. Impact of Green Infrastructure Investment on Urban Carbon Emissions in China. Sustainability 2024, 16, 2668. [Google Scholar] [CrossRef]
Zhou, X.; Tang, X.; Zhang, R. Impact of Green Finance on Economic Development and Environmental Quality: A Study Based on Provincial Panel Data from China. Environ. Sci. Pollut. Res. 2020, 27, 19915–19932. [Google Scholar] [CrossRef]
Zhou, C.; Zhang, W.; Richardson-Barlow, C.; Zhang, Z. Navigating Carbon Neutrality: Policy Pathways and Consistency on Industrial Decarbonization in China. Carbon Balance Manag. 2025, 20, 66. [Google Scholar] [CrossRef]
Li, C.; Chen, Z.; Wu, Y.; Zuo, X.; Jin, H.; Xu, Y.; Zeng, B.; Zhao, G.; Wan, Y. Impact of Green Finance on China’s High-Quality Economic Development, Environmental Pollution, and Energy Consumption. Front. Environ. Sci. 2022, 10, 1032586. [Google Scholar] [CrossRef]

Figure 2. Geographic Scope of the Urban Agglomeration in the Middle Reaches of the Yellow River.

Figure 3. Global SHAP importance of explanatory variables for the two efficiency indicators. Note: (y₁) denotes the super-efficiency SBM index incorporating undesirable outputs, and (y₂) denotes the super-efficiency CCR index as a conventional production-efficiency measure.

Figure 4. City-level radar charts of the digital–intelligent profile (x) and explanatory variables (c₁–c₈) for four representative provincial capital cities in the middle reaches of the Yellow River. Panels (a–d) correspond to Hohhot, Taiyuan, Xi’an, and Zhengzhou, respectively; the same set of radar charts is used for both y₁ and y₂. (Panels (a–d) show the results for indicator y₁ in Hohhot, Taiyuan, Xi’an, and Zhengzhou, respectively; panels (e–h) show the corresponding results for indicator y₂.).

Figure 5. SHAP force plots illustrating the local contributions of x and c₁–c₈ to the predictions of y₁ and y₂ for four representative provincial capital cities (Hohhot, Taiyuan, Xi’an, and Zhengzhou). Positive (red) and negative (blue) segments indicate factors pushing the prediction above or below the benchmark value. (Panels (a–h) correspond to Hohhot, Taiyuan, Xi’an, and Zhengzhou, respectively, with the left column showing y₁ and the right column showing y₂.).

Figure 6. Impulse responses of y₁ to x and c₁–c₈ (2011–2022). (Panels (a–i) correspond to shocks in x, c₁, …, c₈, respectively).

Figure 7. Presents the corresponding impulse responses for y₂. (Panels (a–i) correspond to shocks in x, c₁, …, c₈, respectively).

Figure 8. LISA cluster maps for y₁ (2011, 2017, 2022). (Panels (a–c) show the LISA cluster maps for y₁ in 2011, 2017, and 2022, respectively.).

Figure 9. LISA. cluster maps for y₂ (2011, 2017, 2022). (Panels (a–c) show the LISA cluster maps for y₂ in 2011, 2017, and 2022, respectively.).

Figure 10. SHAP interaction heatmaps for the models of y₁ (a) and y₂ (b). The cells show pairwise SHAP interaction values between the digital–intelligent profile (x) and the explanatory variables (c₁–c₈), with warmer (cooler) colours indicating stronger positive (negative) interactions.

Table 1. Prefecture-level cities located within the middle Yellow River urban agglomeration (MYRUA).

Name of Urban Cluster	Cities
Central Plains Urban Cluster	Zhengzhou, Kaifeng, Luoyang, Nanyang, Anyang, Shangqiu, Xinxiang, Pingdingshan, Xuchang, Jiaozuo, Xinyang, Hebi, Puyang, Luohe, Sanmenxia, Zhoukou, Zhumadian, Changzhi, Jincheng
Central Shanxi Urban Cluster	Taiyuan, Jinzhong, Xinzhou, Lvliang
Guanzhong Plain Urban Cluster	Xi’an, Baoji, Tongchuan, Weinan, Xianyang, Yan’an, Shangluo, Tianshui, Pingliang, Qingyang, Yuncheng, Linfen
Hohhot-Baotou-Ordos-Yulin Urban Cluster	Hohhot, Baotou, Ordos, Yulin

Source: Authors’ compilation from official planning documents and national statistical yearbooks. The table was independently compiled and reordered by the authors and verified up to April 2025, based on the following sources: National Development and Reform Commission, Outline of the Plan for Ecological Protection and High-Quality Development of the Yellow River Basin; National Bureau of Statistics, China Statistical Yearbook 2023 (China Statistics Press, Beijing, 2023); and Ministry of Ecology and Environment, 2024 air-quality rankings of prefecture-level and above cities.

Table 2. Variable definitions and measurement.

Category	Variable	Conceptual Meaning	Operational Definition (Baseline Measure)	Unit
Dependent variable	y₁	Slack-adjusted “green” production performance accounting for undesirable output	Super-efficiency SBM score with CO₂ emissions as undesirable output (city-year)	Unit-free index; values > 1 indicate super-efficient DMUs
Dependent variable	y₂	Benchmark production efficiency under constant returns to scale	Super-efficiency CCR score under CRS (city-year)	Unit-free index; values > 1 indicate super-efficient DMUs
Core variable	x	Composite level of digital-intelligent transformation reflecting multi-dimensional integration	Composite index covering digital infrastructure, digital industrialisation, industrial digitalisation, and intelligent services (updated from Ru et al., 2025 [48])	Unit-free index; higher values indicate stronger integration
Control variable	c₁	Local stock/flow of skilled labour supporting technology adoption and green upgrading	Higher-education intensity from city statistical yearbooks	Typically per 10,000 residents or % (report the exact denominator used)
Control variable	c₂	Connectivity foundation enabling digital applications and data-driven management	Internet or broadband access intensity	Per 100 residents (report baseline choice: users vs. broadband)
Control variable	c₃	Stringency and implementation effort of local environmental governance affecting compliance and abatement	Environmental regulation intensity proxied by yearbook indicators; recommended baseline intensity formula: pollution-control effort = (industrial pollution-control expenditure/GDP) × 100	%
Control variable	c₄	Agglomeration and urban-scale conditions shaping energy demand, infrastructure efficiency, and governance capacity	Urban population/total population × 100 (city-year)	%
Control variable	c₅	Fiscal innovation input supporting green innovation and diffusion	S&T expenditure intensity in local fiscal accounts (S&T spending/total public budget expenditure × 100)	%
Control variable	c₆	Public investment and implementation capacity affecting infrastructure, upgrading, and governance	Public budget expenditure intensity (general public budget expenditure/GDP × 100)	%
Control variable	c₇	External-market exposure affecting technology inflows, industrial restructuring, and factor allocation	Openness intensity based on available yearbook data; baseline recommended: trade openness = (imports + exports)/GDP × 100	%
Control variable	c₈	Depth of local financial intermediation supporting investment and green transition	Financial depth proxy (deposit balance + loan balance)/(GDP × 100)	%

Source: Authors’ compilation and calculation based on hand-collected statistical yearbook data; digital–intelligent integration index updated from Ru et al. (2025) [48].

Table 3. Comparison of overall performance and residual diagnostics for y₁ and y₂.

Category	Indicator	y₁ (Value)	y₂ (Value)
Overall performance	R²	0.9980	0.9971
	MAE	0.0051	0.0061
	MAPE (%)	1.74	1.30
Rolling validation	Average MAE	0.1624	0.1643
	Average MAPE (%)	52.58	34.76
Residual diagnostics	Mean residual	0.0000	−0.0000
	Std. dev. of residuals	0.0069	0.0086
	Max residual	0.0305	0.0377
	Min residual	−0.0232	−0.0349

Note: y₁ and y₂ denote the models for the two dependent variables. “Overall performance” refers to in-sample goodness of fit, while “Rolling validation” reports the average prediction errors from rolling-origin cross-validation. MAE and MAPE denote mean absolute error and mean absolute percentage error, respectively. Residual diagnostics are based on the model residuals over the full sample period.

Table 4. Cross-regional comparison of R², MAE and MAPE for the y₁ model by urban agglomeration 2011–2022.

Urban Agglomeration	R²	MAE	MAPE (%)
Hubao Eyu Urban Agglomeration	1.0000	0.0004	0.11
Central Plain Urban Agglomeration	0.9999	0.0013	0.44
Jinzhong Urban Agglomeration	0.9999	0.0005	0.20
Guanzhong Plain Urban Agglomeration	0.9999	0.0007	0.26

Note: The table reports cross-regional goodness-of-fit indicators for the y₁ model (super-efficiency SBM-based efficiency), estimated separately for each urban agglomeration. MAE and MAPE denote mean absolute error and mean absolute percentage error, respectively.

Table 5. Cross-regional comparison of R², MAE and MAPE for the y₂ model by urban agglomeration 2011–2022.

Urban Agglomeration	R²	MAE	MAPE (%)
Hubao Eyu Urban Agglomeration	1.0000	0.0004	0.07
Central Plain Urban Agglomeration	1.0000	0.0005	0.10
Jinzhong Urban Agglomeration	0.9999	0.0007	0.15
Guanzhong Plain Urban Agglomeration	0.9998	0.0020	0.40

Note: The table reports cross-regional goodness-of-fit indicators for the y₂ model (super-efficiency CCR-based efficiency), estimated separately for each urban agglomeration. MAE and MAPE denote mean absolute error and mean absolute percentage error, respectively.

Table 6. Comparison of LOCO-based feature importance for y₁ and y₂ (including interaction features).

Feature	ΔMAE (y₁)	ΔMAE (%) (y₁)	ΔMAE (y₂)	ΔMAE (%) (y₂)
c1	−0.0000	−0.37	0.0013	21.27
c2	−0.0006	−10.99	0.0003	4.27
c3	0.0001	1.84	0.0008	13.22
c4	0.0009	16.97	0.0020	32.50
c5	−0.0003	−6.13	0.0018	29.25
c6	0.0007	13.69	0.0006	10.47
c7	0.0001	0.98	0.0020	32.44
c8	−0.0002	−3.97	0.0008	13.83
x	0.0005	9.33	0.0009	14.43

Note: The table reports LOCO (leave-one-covariate-out) feature importance based on changes in MAE. ΔMAE denotes the increase in mean absolute error when the corresponding feature is removed from the model; a larger positive value indicates a more important feature in terms of prediction accuracy. Regional and time dummy variables are excluded from the LOCO analysis.

Table 7. Global Moran’s I Statistics for Model Residuals of y₁ and y₂.

Year	y₁		y₂
Year	Moran’I	p-Value	Moran’I	p-Value
2011	−0.1184	0.070	−0.0127	0.310
2012	−0.0340	0.430	−0.0790	0.120
2013	−0.1913	0.020	−0.1611	0.060
2014	−0.1526	0.036	−0.1297	0.090
2015	−0.1574	0.035	−0.1310	0.085
2016	−0.1243	0.050	−0.1825	0.030
2017	−0.1598	0.035	−0.1793	0.035
2018	−0.1711	0.028	−0.1582	0.042
2019	−0.1680	0.030	−0.1223	0.095
2020	−0.2212	0.010	−0.1882	0.025
2021	−0.1301	0.046	−0.1009	0.098
2022	−0.1178	0.070	−0.10208	0.096

Note: Global Moran’s I statistics are calculated for the residuals of the models for y₁ and y₂ for each year from 2011 to 2022. Negative Moran’s I values indicate negative spatial autocorrelation (spatial dispersion) of the residuals, whereas values close to zero indicate spatial randomness. The p-values are obtained from permutation tests.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ru, J.; Li, J.; Gan, L.; Yusufu, G. Digital–Intelligent Transformation and Urban Carbon Efficiency in the Yellow River Basin: A Hybrid Super-Efficiency DEA and Interpretable Machine-Learning Framework. Land 2026, 15, 159. https://doi.org/10.3390/land15010159

AMA Style

Ru J, Li J, Gan L, Yusufu G. Digital–Intelligent Transformation and Urban Carbon Efficiency in the Yellow River Basin: A Hybrid Super-Efficiency DEA and Interpretable Machine-Learning Framework. Land. 2026; 15(1):159. https://doi.org/10.3390/land15010159

Chicago/Turabian Style

Ru, Jiayu, Jiahui Li, Lu Gan, and Gulinaer Yusufu. 2026. "Digital–Intelligent Transformation and Urban Carbon Efficiency in the Yellow River Basin: A Hybrid Super-Efficiency DEA and Interpretable Machine-Learning Framework" Land 15, no. 1: 159. https://doi.org/10.3390/land15010159

APA Style

Ru, J., Li, J., Gan, L., & Yusufu, G. (2026). Digital–Intelligent Transformation and Urban Carbon Efficiency in the Yellow River Basin: A Hybrid Super-Efficiency DEA and Interpretable Machine-Learning Framework. Land, 15(1), 159. https://doi.org/10.3390/land15010159

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Digital–Intelligent Transformation and Urban Carbon Efficiency in the Yellow River Basin: A Hybrid Super-Efficiency DEA and Interpretable Machine-Learning Framework

Abstract

1. Introduction

2. Literature Review and Analytical Framework

3. Study Area and Data

3.1. Study Area

3.2. Data and Variables

4. Methodology

4.1. Measuring Urban Performance Under Carbon Constraints

4.2. Modelling the Digital–Intelligent Integration–Efficiency Relationship

4.3. Interpreting Nonlinearities and Complementarities

4.3.1. Leave-One-Covariate-Out (LOCO) Importance

4.3.2. Local Attribution and City-Level Diagnostics

4.4. Dynamic Propagation: PVAR and Impulse Responses

4.5. Spatial Dependence Diagnostics: Moran’s I and LISA

4.6. Robustness and Cross-Agglomeration Validation

4.7. Summary

5. Empirical Results

5.1. Overall Predictive Performance of the Hybrid Model

5.2. Cross-Regional Performance Across Urban Agglomerations

5.3. Global Feature Importance

5.4. City-Level Heterogeneity and Representative Cases

5.5. Dynamic Responses of Green Development to Key Drivers

5.6. Spatial Clustering, Residual Moran’s I and Model Adequacy

5.7. Interaction Structure Among Drivers and Policy Implications

6. Discussion

6.1. Value of the Hybrid Modelling Approach

6.2. Digital–Intelligent Integration in a Constrained System

6.3. Different Time Profiles of Economic and Ecological Gains

6.4. Spatial Heterogeneity and Multiple Paths

7. Conclusions and Policy Implications

7.1. Main Conclusions

7.2. Policy Implications

7.3. Limitations and Future Research

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI